Jump to: navigation, search

Difference between revisions of "SMILA/Documentation"

(Common Services)
(9 intermediate revisions by 2 users not shown)
Line 1: Line 1:
 
== Basics ==
 
== Basics ==
 
* [[SMILA/Documentation_for_5_Minutes_to_Success|Installing and Running]]
 
* [[SMILA/Documentation_for_5_Minutes_to_Success|Installing and Running]]
 +
* [[SMILA/Documentation/HowTo|HowTos]]
 
* [[SMILA/Documentation/Architecture_Overview|Architecture]]
 
* [[SMILA/Documentation/Architecture_Overview|Architecture]]
 
* [[SMILA/Documentation/Default_configuration_workflow_overview|Overview of Default Configuration]]
 
* [[SMILA/Documentation/Default_configuration_workflow_overview|Overview of Default Configuration]]
 
* [[SMILA/Documentation/Data_Model_and_Serialization_Formats|Data Model, XML, JSON, BON]]
 
* [[SMILA/Documentation/Data_Model_and_Serialization_Formats|Data Model, XML, JSON, BON]]
* [[SMILA/Documentation/Using_The_ReST_API|Using the ReST API]]
+
* [[SMILA/Documentation/Using_The_ReST_API|Using the REST API, REST Client]]
 
* [[SMILA/Documentation/REST_API_Reference|REST API Reference]]
 
* [[SMILA/Documentation/REST_API_Reference|REST API Reference]]
 
* [[SMILA/Documentation/Enable Remote Access|Enabling Remote Access to SMILA]]
 
* [[SMILA/Documentation/Enable Remote Access|Enabling Remote Access to SMILA]]
Line 29: Line 30:
 
** [[SMILA/Documentation/Processing/JSON_REST_API_for_BPEL_pipelines|Creating, Editing, and Executing Pipelines]]
 
** [[SMILA/Documentation/Processing/JSON_REST_API_for_BPEL_pipelines|Creating, Editing, and Executing Pipelines]]
 
* Basic Pipelets
 
* Basic Pipelets
** [[SMILA/Documentation/Bundle org.eclipse.smila.processing.pipelets|Common Pipelets in Bundle org.eclipse.smila.processing.pipelets]]  
+
** [[SMILA/Documentation/Bundle org.eclipse.smila.processing.pipelets|Common Pipelets]]
** [[SMILA/Documentation/Bundle org.eclipse.smila.processing.pipelets.xmlprocessing|XML Processing Pipelets in Bundle org.eclipse.smila.processing.pipelets.xmlprocessing]]
+
** [[SMILA/Documentation/Bundle org.eclipse.smila.processing.pipelets.xmlprocessing|XML Processing Pipelets]]
** [[SMILA/Documentation/Bundle org.eclipse.smila.processing.pipelets.boilerpipe|Boilerpipe Pipelets in Bundle org.eclipse.smila.processing.pipelets.boilerpipe]]
+
** [[SMILA/Documentation/Bundle org.eclipse.smila.processing.pipelets.boilerpipe|Boilerpipe Pipelet (extract text from HTML content)]]
 +
** [[SMILA/Documentation/TikaPipelet|TikaPipelet (extract text from binary content)]]
 +
** [[SMILA/Documentation/JdbcLoggingPipelet|JdbcLoggingPipelet (log to a database)]]
 
** More special pipelets are provided by the components described below.
 
** More special pipelets are provided by the components described below.
 
* Developing new Pipelets  
 
* Developing new Pipelets  
Line 40: Line 43:
  
 
* [[SMILA/Documentation/Search|Search Processing and APIs]]
 
* [[SMILA/Documentation/Search|Search Processing and APIs]]
* [[SMILA/Documentation/Solr|Solr Integration: Configuration and Pipelets]]
+
* [[SMILA/Documentation/Solr 3.5|Solr Integration: Configuration and Pipelets]]
  
 
== JobManager: Asynchronous Workflows ==
 
== JobManager: Asynchronous Workflows ==
Line 77: Line 80:
 
** [[SMILA/Documentation/Importing/Crawler/Web#Internal_structure|Extending the WebCrawler worker]]
 
** [[SMILA/Documentation/Importing/Crawler/Web#Internal_structure|Extending the WebCrawler worker]]
 
** [[SMILA/Documentation/HowTo/How to add a new Data Source to the importing framework|Adding a Data Source to the SMILA Import Framework]]
 
** [[SMILA/Documentation/HowTo/How to add a new Data Source to the importing framework|Adding a Data Source to the SMILA Import Framework]]
 +
* Additionally
 +
** [[SMILA/Documentation/Importing/RemoteCrawling | Remote Crawling]]
 +
** [[SMILA/Documentation/Importing/CrawlingMultipleStartURLs | Crawling multiple start URL in one job run]]
  
 
== Embedded HTTP Server ==
 
== Embedded HTTP Server ==

Revision as of 05:52, 16 April 2013

Basics

Development Environment

Pipelines and Pipelets: Synchronous Workflows

Searching

JobManager: Asynchronous Workflows

Importing

Embedded HTTP Server

Common Services

Deprecated Components