Difference between revisions of "SMILA/Documentation"

From Eclipsepedia

Jump to: navigation, search
m (Pipelines and Pipelets: Synchronous Workflows)
m (Pipelines and Pipelets: Synchronous Workflows)
 
(6 intermediate revisions by 2 users not shown)
Line 30: Line 30:
 
** [[SMILA/Documentation/Processing/JSON_REST_API_for_BPEL_pipelines|Creating, Editing, and Executing Pipelines]]
 
** [[SMILA/Documentation/Processing/JSON_REST_API_for_BPEL_pipelines|Creating, Editing, and Executing Pipelines]]
 
* Basic Pipelets
 
* Basic Pipelets
** [[SMILA/Documentation/Bundle org.eclipse.smila.processing.pipelets|Common Pipelets in Bundle org.eclipse.smila.processing.pipelets]]  
+
** [[SMILA/Documentation/Bundle org.eclipse.smila.processing.pipelets|Common Pipelets]]
** [[SMILA/Documentation/Bundle org.eclipse.smila.processing.pipelets.xmlprocessing|XML Processing Pipelets in Bundle org.eclipse.smila.processing.pipelets.xmlprocessing]]
+
** [[SMILA/Documentation/Bundle org.eclipse.smila.processing.pipelets.xmlprocessing|XML Processing Pipelets]]
** [[SMILA/Documentation/Bundle org.eclipse.smila.processing.pipelets.boilerpipe|Boilerpipe Pipelets in Bundle org.eclipse.smila.processing.pipelets.boilerpipe]]
+
** [[SMILA/Documentation/Bundle org.eclipse.smila.processing.pipelets.boilerpipe|Boilerpipe Pipelet (extract text from HTML content)]]
** [[SMILA/Documentation/TikaPipelet|TikaPipelet in bundle org.eclipse.smila.tika]]
+
** [[SMILA/Documentation/TikaPipelet|TikaPipelet (extract text from binary content)]]
** [[SMILA/Documentation/JdbcLoggingPipelet|JdbcLoggingPipelet in bundle org.eclipse.smila.jdbc]]
+
** [[SMILA/Documentation/JdbcLoggingPipelet|JdbcLoggingPipelet, JdbcFetcherPipelet, JdbcSelectPipelet (write/read to/from a database)]]
 
** More special pipelets are provided by the components described below.
 
** More special pipelets are provided by the components described below.
 
* Developing new Pipelets  
 
* Developing new Pipelets  
Line 49: Line 49:
 
** [[SMILA/Documentation/JobManagerFirstExample|JobManager Walk-Through]]
 
** [[SMILA/Documentation/JobManagerFirstExample|JobManager Walk-Through]]
 
* Creating Workflows and Jobs
 
* Creating Workflows and Jobs
** [[SMILA/Documentation/DataObjectTypesAndBuckets|Defining Buckets]]
+
** [[SMILA/Documentation/DataObjectTypesAndBuckets|Data Object Types and Buckets]]
** [[SMILA/Documentation/WorkerAndWorkflows|Modeling Workflows]]
+
** [[SMILA/Documentation/WorkerAndWorkflows|Workers and Workflows]]
** [[SMILA/Documentation/JobDefinitions|Creating Jobs]]
+
** [[SMILA/Documentation/JobDefinitions|Jobs]]
** [[SMILA/Documentation/JobParameters|Evaluating Job Parameters]]
+
** [[SMILA/Documentation/JobParameters|Job Parameters]]
 
* [[SMILA/Documentation/JobRuns|Running and Monitoring Jobs]]
 
* [[SMILA/Documentation/JobRuns|Running and Monitoring Jobs]]
 
* [[SMILA/Documentation/JobManagerConfiguration|Configuring the Job Manager]]
 
* [[SMILA/Documentation/JobManagerConfiguration|Configuring the Job Manager]]
Line 58: Line 58:
 
* Worker Reference
 
* Worker Reference
 
** [[SMILA/Documentation/Bulkbuilder|Bulkbuilder worker]]
 
** [[SMILA/Documentation/Bulkbuilder|Bulkbuilder worker]]
** [[SMILA/Documentation/Worker/PipelineProcessorWorker|PipelineProcesor Worker]]
+
** [[SMILA/Documentation/Worker/PipelineProcessorWorker|PipelineProcessor Worker]]
 
** [[SMILA/Documentation/Worker/PipeletProcessorWorker|PipeletProcessor Worker]]
 
** [[SMILA/Documentation/Worker/PipeletProcessorWorker|PipeletProcessor Worker]]
** See [[SMILA/Manual#Importing|Importing]] below for more workers
+
** (see [[SMILA/Manual#Importing|Importing]] section for more workers)
 
* Developing new Workers
 
* Developing new Workers
 
** [[SMILA/Documentation/WorkerManager|WorkerManager: Workers Made Easily]]
 
** [[SMILA/Documentation/WorkerManager|WorkerManager: Workers Made Easily]]
Line 72: Line 72:
 
**[[SMILA/Documentation/Importing/Crawler/File | FileCrawler and FileFetcher Worker]]
 
**[[SMILA/Documentation/Importing/Crawler/File | FileCrawler and FileFetcher Worker]]
 
**[[SMILA/Documentation/Importing/Crawler/Web | WebCrawler and WebFetcher Worker]]
 
**[[SMILA/Documentation/Importing/Crawler/Web | WebCrawler and WebFetcher Worker]]
 +
*** [[SMILA/Documentation/Importing/CrawlingMultipleStartURLs | Crawling multiple start URLs in one job run]]
 
**[[SMILA/Documentation/Importing/Crawler/JDBC | JdbcCrawler and JdbcFetcher Worker]]
 
**[[SMILA/Documentation/Importing/Crawler/JDBC | JdbcCrawler and JdbcFetcher Worker]]
 
**[[SMILA/Documentation/Importing/Crawler/Feed | FeedCrawler Worker]]
 
**[[SMILA/Documentation/Importing/Crawler/Feed | FeedCrawler Worker]]
Line 81: Line 82:
 
** [[SMILA/Documentation/HowTo/How to add a new Data Source to the importing framework|Adding a Data Source to the SMILA Import Framework]]
 
** [[SMILA/Documentation/HowTo/How to add a new Data Source to the importing framework|Adding a Data Source to the SMILA Import Framework]]
 
* Additionally
 
* Additionally
** [[SMILA/Documentation/Importing/CrawlingMultipleStartURLs | Crawling multiple start URL in one job run]]
+
** [[SMILA/Documentation/Importing/RemoteCrawling | Remote Crawling]]
  
 
== Embedded HTTP Server ==
 
== Embedded HTTP Server ==
Line 105: Line 106:
 
* [[SMILA/Documentation/General JPA Configuration in SMILA|General JPA Configuration in SMILA]]
 
* [[SMILA/Documentation/General JPA Configuration in SMILA|General JPA Configuration in SMILA]]
 
* [[SMILA/Documentation/SMILA_Versioning|SMILA Version Information]]
 
* [[SMILA/Documentation/SMILA_Versioning|SMILA Version Information]]
 +
* [[SMILA/Documentation/Adding_JDBC_Drivers|Adding JDBC Drivers]]
  
 
== Deprecated Components ==
 
== Deprecated Components ==

Latest revision as of 09:52, 12 August 2014

Contents

[edit] Basics

[edit] Development Environment

[edit] Pipelines and Pipelets: Synchronous Workflows

[edit] Searching

[edit] JobManager: Asynchronous Workflows

[edit] Importing

[edit] Embedded HTTP Server

[edit] Common Services

[edit] Deprecated Components