Notice: this Wiki will be going read only early in 2024 and edits will no longer be possible. Please see: https://gitlab.eclipse.org/eclipsefdn/helpdesk/-/wikis/Wiki-shutdown-plan for the plan.
Difference between revisions of "SMILA/Documentation"
< SMILA
m (→Basics) |
|||
(4 intermediate revisions by 2 users not shown) | |||
Line 5: | Line 5: | ||
* [[SMILA/Documentation/Default_configuration_workflow_overview|Overview of Default Configuration]] | * [[SMILA/Documentation/Default_configuration_workflow_overview|Overview of Default Configuration]] | ||
* [[SMILA/Documentation/Data_Model_and_Serialization_Formats|Data Model, XML, JSON, BON]] | * [[SMILA/Documentation/Data_Model_and_Serialization_Formats|Data Model, XML, JSON, BON]] | ||
− | * [[SMILA/Documentation/ | + | * [[SMILA/Documentation/Using_The_ReST_API|Using the REST API, REST Client]] |
* [[SMILA/Documentation/REST_API_Reference|REST API Reference]] | * [[SMILA/Documentation/REST_API_Reference|REST API Reference]] | ||
* [[SMILA/Documentation/Enable Remote Access|Enabling Remote Access to SMILA]] | * [[SMILA/Documentation/Enable Remote Access|Enabling Remote Access to SMILA]] | ||
Line 30: | Line 30: | ||
** [[SMILA/Documentation/Processing/JSON_REST_API_for_BPEL_pipelines|Creating, Editing, and Executing Pipelines]] | ** [[SMILA/Documentation/Processing/JSON_REST_API_for_BPEL_pipelines|Creating, Editing, and Executing Pipelines]] | ||
* Basic Pipelets | * Basic Pipelets | ||
− | ** [[SMILA/Documentation/Bundle org.eclipse.smila.processing.pipelets|Common Pipelets | + | ** [[SMILA/Documentation/Bundle org.eclipse.smila.processing.pipelets|Common Pipelets]] |
− | ** [[SMILA/Documentation/Bundle org.eclipse.smila.processing.pipelets.xmlprocessing|XML Processing Pipelets | + | ** [[SMILA/Documentation/Bundle org.eclipse.smila.processing.pipelets.xmlprocessing|XML Processing Pipelets]] |
− | ** [[SMILA/Documentation/Bundle org.eclipse.smila.processing.pipelets.boilerpipe|Boilerpipe | + | ** [[SMILA/Documentation/Bundle org.eclipse.smila.processing.pipelets.boilerpipe|Boilerpipe Pipelet (extract text from HTML content)]] |
+ | ** [[SMILA/Documentation/TikaPipelet|TikaPipelet (extract text from binary content)]] | ||
+ | ** [[SMILA/Documentation/JdbcLoggingPipelet|JdbcLoggingPipelet (log to a database)]] | ||
** More special pipelets are provided by the components described below. | ** More special pipelets are provided by the components described below. | ||
* Developing new Pipelets | * Developing new Pipelets | ||
Line 79: | Line 81: | ||
** [[SMILA/Documentation/HowTo/How to add a new Data Source to the importing framework|Adding a Data Source to the SMILA Import Framework]] | ** [[SMILA/Documentation/HowTo/How to add a new Data Source to the importing framework|Adding a Data Source to the SMILA Import Framework]] | ||
* Additionally | * Additionally | ||
+ | ** [[SMILA/Documentation/Importing/RemoteCrawling | Remote Crawling]] | ||
** [[SMILA/Documentation/Importing/CrawlingMultipleStartURLs | Crawling multiple start URL in one job run]] | ** [[SMILA/Documentation/Importing/CrawlingMultipleStartURLs | Crawling multiple start URL in one job run]] | ||
Revision as of 06:52, 16 April 2013
Contents
Basics
- Installing and Running
- HowTos
- Architecture
- Overview of Default Configuration
- Data Model, XML, JSON, BON
- Using the REST API, REST Client
- REST API Reference
- Enabling Remote Access to SMILA
Development Environment
- Setting up your Eclipse IDE for SMILA
- Building SMILA
- Creating new Components
- Testing new Components
- Adding Third Party Libraries to SMILA
- Using OSGi Declarative Services
- Extending the build process:
Pipelines and Pipelets: Synchronous Workflows
- What are Pipelines? What are Pipelets?
- Configuring and Creating BPEL Pipelines
- Using the SMILA BPEL Designer
- ReST APIs
- Basic Pipelets
- Common Pipelets
- XML Processing Pipelets
- Boilerpipe Pipelet (extract text from HTML content)
- TikaPipelet (extract text from binary content)
- JdbcLoggingPipelet (log to a database)
- More special pipelets are provided by the components described below.
- Developing new Pipelets
Searching
JobManager: Asynchronous Workflows
- What are Jobs and Tasks?
- Creating Workflows and Jobs
- Running and Monitoring Jobs
- Configuring the Job Manager
- TaskManager: Asynchronous Scheduling of Tasks
- Worker Reference
- Bulkbuilder worker
- PipelineProcesor Worker
- PipeletProcessor Worker
- See Importing below for more workers
- Developing new Workers
Importing
- Concepts, Workflow and Components
- Reference of Import Workers
- Developing new Import Workers
- Additionally
Embedded HTTP Server
Common Services
- Configuration Helper
- Workspace Helper
- ClusterConfig Service
- Zookeeper Service
- ObjectStore
- BinaryStorage
- Processing Security Information
- Ontology Processing with Sesame: Configuration and Pipelets
- MimeTypeIdentifier
- Description of Worker and Pipelet Parameters
- Publishing Web Services
- General JPA Configuration in SMILA
- SMILA Version Information