Difference between revisions of "SMILA/Documentation"
(→The SMILA Development Environment) |
m (→Pipelines and Pipelets: Synchronous Workflows) |
(25 intermediate revisions by 6 users not shown) | |||
Line 1: | Line 1: | ||
== Basics == | == Basics == | ||
* [[SMILA/Documentation_for_5_Minutes_to_Success|Installing and Running]] | * [[SMILA/Documentation_for_5_Minutes_to_Success|Installing and Running]] | ||
+ | * [[SMILA/Documentation/HowTo|HowTos]] | ||
* [[SMILA/Documentation/Architecture_Overview|Architecture]] | * [[SMILA/Documentation/Architecture_Overview|Architecture]] | ||
* [[SMILA/Documentation/Default_configuration_workflow_overview|Overview of Default Configuration]] | * [[SMILA/Documentation/Default_configuration_workflow_overview|Overview of Default Configuration]] | ||
* [[SMILA/Documentation/Data_Model_and_Serialization_Formats|Data Model, XML, JSON, BON]] | * [[SMILA/Documentation/Data_Model_and_Serialization_Formats|Data Model, XML, JSON, BON]] | ||
− | * [[SMILA/Documentation/Using_The_ReST_API|Using the | + | * [[SMILA/Documentation/Using_The_ReST_API|Using the REST API, REST Client]] |
* [[SMILA/Documentation/REST_API_Reference|REST API Reference]] | * [[SMILA/Documentation/REST_API_Reference|REST API Reference]] | ||
* [[SMILA/Documentation/Enable Remote Access|Enabling Remote Access to SMILA]] | * [[SMILA/Documentation/Enable Remote Access|Enabling Remote Access to SMILA]] | ||
− | == | + | == Development Environment == |
* [[SMILA/Documentation/HowTo/Howto_set_up_dev_environment|Setting up your Eclipse IDE for SMILA]] | * [[SMILA/Documentation/HowTo/Howto_set_up_dev_environment|Setting up your Eclipse IDE for SMILA]] | ||
* [[SMILA/Documentation/HowTo/Howto_build_a_SMILA-Distribution|Building SMILA]] | * [[SMILA/Documentation/HowTo/Howto_build_a_SMILA-Distribution|Building SMILA]] | ||
Line 19: | Line 20: | ||
** [[SMILA/Documentation/HowTo/How_to_integrate_test_bundle_into_build_process|Adding a new Test Bundle to the Build]] | ** [[SMILA/Documentation/HowTo/How_to_integrate_test_bundle_into_build_process|Adding a new Test Bundle to the Build]] | ||
− | == Pipelines and Pipelets: Synchronous Workflows | + | == Pipelines and Pipelets: Synchronous Workflows == |
* [[SMILA/Documentation/Pipelets|What are Pipelines? What are Pipelets?]] | * [[SMILA/Documentation/Pipelets|What are Pipelines? What are Pipelets?]] | ||
* [[SMILA/Documentation/BPEL_Workflow_Processor|Configuring and Creating BPEL Pipelines]] | * [[SMILA/Documentation/BPEL_Workflow_Processor|Configuring and Creating BPEL Pipelines]] | ||
Line 31: | Line 32: | ||
** [[SMILA/Documentation/Bundle org.eclipse.smila.processing.pipelets|Common Pipelets in Bundle org.eclipse.smila.processing.pipelets]] | ** [[SMILA/Documentation/Bundle org.eclipse.smila.processing.pipelets|Common Pipelets in Bundle org.eclipse.smila.processing.pipelets]] | ||
** [[SMILA/Documentation/Bundle org.eclipse.smila.processing.pipelets.xmlprocessing|XML Processing Pipelets in Bundle org.eclipse.smila.processing.pipelets.xmlprocessing]] | ** [[SMILA/Documentation/Bundle org.eclipse.smila.processing.pipelets.xmlprocessing|XML Processing Pipelets in Bundle org.eclipse.smila.processing.pipelets.xmlprocessing]] | ||
+ | ** [[SMILA/Documentation/Bundle org.eclipse.smila.processing.pipelets.boilerpipe|Boilerpipe Pipelets in Bundle org.eclipse.smila.processing.pipelets.boilerpipe]] | ||
+ | ** [[SMILA/Documentation/TikaPipelet|TikaPipelet in bundle org.eclipse.smila.tika]] | ||
+ | ** [[SMILA/Documentation/JdbcLoggingPipelet|JdbcLoggingPipelet in bundle org.eclipse.smila.jdbc]] | ||
** More special pipelets are provided by the components described below. | ** More special pipelets are provided by the components described below. | ||
* Developing new Pipelets | * Developing new Pipelets | ||
Line 36: | Line 40: | ||
** [[SMILA/Documentation/Usage_of_Blackboard_Service|Using the Blackboard Service]] | ** [[SMILA/Documentation/Usage_of_Blackboard_Service|Using the Blackboard Service]] | ||
− | == | + | == Searching == |
* [[SMILA/Documentation/Search|Search Processing and APIs]] | * [[SMILA/Documentation/Search|Search Processing and APIs]] | ||
− | * [[SMILA/Documentation/Solr|Solr Integration: Configuration and Pipelets]] | + | * [[SMILA/Documentation/Solr 3.5|Solr Integration: Configuration and Pipelets]] |
== JobManager: Asynchronous Workflows == | == JobManager: Asynchronous Workflows == | ||
Line 63: | Line 67: | ||
== Importing == | == Importing == | ||
− | * [[SMILA/Documentation/Importing/Concept| | + | * [[SMILA/Documentation/Importing/Concept|Concepts, Workflow and Components]] |
+ | ** [[SMILA/Documentation/Importing/CompoundExtractorService|Compound Extractor Service]] | ||
* Reference of Import Workers | * Reference of Import Workers | ||
**[[SMILA/Documentation/Importing/Crawler/File | FileCrawler and FileFetcher Worker]] | **[[SMILA/Documentation/Importing/Crawler/File | FileCrawler and FileFetcher Worker]] | ||
**[[SMILA/Documentation/Importing/Crawler/Web | WebCrawler and WebFetcher Worker]] | **[[SMILA/Documentation/Importing/Crawler/Web | WebCrawler and WebFetcher Worker]] | ||
+ | **[[SMILA/Documentation/Importing/Crawler/JDBC | JdbcCrawler and JdbcFetcher Worker]] | ||
+ | **[[SMILA/Documentation/Importing/Crawler/Feed | FeedCrawler Worker]] | ||
**[[SMILA/Documentation/Importing/DeltaCheck | DeltaChecker Worker]] | **[[SMILA/Documentation/Importing/DeltaCheck | DeltaChecker Worker]] | ||
**[[SMILA/Documentation/Importing/UpdatePusher | UpdatePusher Worker]] | **[[SMILA/Documentation/Importing/UpdatePusher | UpdatePusher Worker]] | ||
* Developing new Import Workers | * Developing new Import Workers | ||
− | ** [[SMILA/Documentation/Importing/VisitedLinks | VisitedLinks service]] | + | ** [[SMILA/Documentation/Importing/VisitedLinks | Using the VisitedLinks service]] |
** [[SMILA/Documentation/Importing/Crawler/Web#Internal_structure|Extending the WebCrawler worker]] | ** [[SMILA/Documentation/Importing/Crawler/Web#Internal_structure|Extending the WebCrawler worker]] | ||
** [[SMILA/Documentation/HowTo/How to add a new Data Source to the importing framework|Adding a Data Source to the SMILA Import Framework]] | ** [[SMILA/Documentation/HowTo/How to add a new Data Source to the importing framework|Adding a Data Source to the SMILA Import Framework]] | ||
+ | * Additionally | ||
+ | ** [[SMILA/Documentation/Importing/CrawlingMultipleStartURLs | Crawling multiple start URLs in one job run]] | ||
− | == | + | == Embedded HTTP Server == |
* [[SMILA/Documentation/JettyHttpServer|Configuring Jetty]] | * [[SMILA/Documentation/JettyHttpServer|Configuring Jetty]] | ||
* [[SMILA/Documentation/JettyHttpServer#JSON_Handlers|Developing JSON ReST Handlers for SMILA]] | * [[SMILA/Documentation/JettyHttpServer#JSON_Handlers|Developing JSON ReST Handlers for SMILA]] | ||
Line 83: | Line 92: | ||
* [[SMILA/Documentation/Bundle_org.eclipse.smila.clusterconfig|ClusterConfig Service]] | * [[SMILA/Documentation/Bundle_org.eclipse.smila.clusterconfig|ClusterConfig Service]] | ||
** [[SMILA/Documentation/Bundle_org.eclipse.smila.clusterconfig.simple|Simple Implementation]] | ** [[SMILA/Documentation/Bundle_org.eclipse.smila.clusterconfig.simple|Simple Implementation]] | ||
+ | * [[SMILA/Documentation/Bundle_org.eclipse.smila.zookeeper|Zookeeper Service]] | ||
* [[SMILA/Documentation/ObjectStore/Bundle_org.eclipse.smila.objectstore|ObjectStore]] | * [[SMILA/Documentation/ObjectStore/Bundle_org.eclipse.smila.objectstore|ObjectStore]] | ||
** [[SMILA/Documentation/ObjectStore/Bundle_org.eclipse.smila.objectstore.filesystem|Filesystem Objectstore Implementation]] | ** [[SMILA/Documentation/ObjectStore/Bundle_org.eclipse.smila.objectstore.filesystem|Filesystem Objectstore Implementation]] | ||
Line 91: | Line 101: | ||
* [[SMILA/Documentation/SesameOntologyManager|Ontology Processing with Sesame: Configuration and Pipelets]] | * [[SMILA/Documentation/SesameOntologyManager|Ontology Processing with Sesame: Configuration and Pipelets]] | ||
* [[SMILA/Documentation/MimeTypeIdentifier|MimeTypeIdentifier]] | * [[SMILA/Documentation/MimeTypeIdentifier|MimeTypeIdentifier]] | ||
+ | * [[SMILA/Documentation/ParameterDefinition|Description of Worker and Pipelet Parameters]] | ||
* [[SMILA/Documentation/PublishingJAXWSWebservices|Publishing Web Services]] | * [[SMILA/Documentation/PublishingJAXWSWebservices|Publishing Web Services]] | ||
* [[SMILA/Documentation/General JPA Configuration in SMILA|General JPA Configuration in SMILA]] | * [[SMILA/Documentation/General JPA Configuration in SMILA|General JPA Configuration in SMILA]] | ||
+ | * [[SMILA/Documentation/SMILA_Versioning|SMILA Version Information]] | ||
== Deprecated Components == | == Deprecated Components == | ||
** [[SMILA/Documentation/Management|JMX Management]] | ** [[SMILA/Documentation/Management|JMX Management]] | ||
*** [[SMILA/Documentation/Management#JMX_Client|JMX Clients]] | *** [[SMILA/Documentation/Management#JMX_Client|JMX Clients]] | ||
** [[SMILA/Documentation/Record_Storage|RecordStorage]] | ** [[SMILA/Documentation/Record_Storage|RecordStorage]] | ||
+ | |||
[[Category:SMILA]] | [[Category:SMILA]] |
Revision as of 12:19, 21 February 2013
Contents
Basics
- Installing and Running
- HowTos
- Architecture
- Overview of Default Configuration
- Data Model, XML, JSON, BON
- Using the REST API, REST Client
- REST API Reference
- Enabling Remote Access to SMILA
Development Environment
- Setting up your Eclipse IDE for SMILA
- Building SMILA
- Creating new Components
- Testing new Components
- Adding Third Party Libraries to SMILA
- Using OSGi Declarative Services
- Extending the build process:
Pipelines and Pipelets: Synchronous Workflows
- What are Pipelines? What are Pipelets?
- Configuring and Creating BPEL Pipelines
- Using the SMILA BPEL Designer
- ReST APIs
- Basic Pipelets
- Common Pipelets in Bundle org.eclipse.smila.processing.pipelets
- XML Processing Pipelets in Bundle org.eclipse.smila.processing.pipelets.xmlprocessing
- Boilerpipe Pipelets in Bundle org.eclipse.smila.processing.pipelets.boilerpipe
- TikaPipelet in bundle org.eclipse.smila.tika
- JdbcLoggingPipelet in bundle org.eclipse.smila.jdbc
- More special pipelets are provided by the components described below.
- Developing new Pipelets
Searching
- Search Processing and APIs
- Solr Integration: Configuration and Pipelets
JobManager: Asynchronous Workflows
- What are Jobs and Tasks?
- Creating Workflows and Jobs
- Running and Monitoring Jobs
- Configuring the Job Manager
- TaskManager: Asynchronous Scheduling of Tasks
- Worker Reference
- Bulkbuilder worker
- PipelineProcessor Worker
- PipeletProcessor Worker
- See Importing below for more workers
- Developing new Workers
Importing
- Concepts, Workflow and Components
- Reference of Import Workers
- Developing new Import Workers
- Additionally
Embedded HTTP Server
- Configuring Jetty
- Developing JSON ReST Handlers for SMILA
Common Services
- Configuration Helper
- Workspace Helper
- ClusterConfig Service
- Zookeeper Service
- ObjectStore
- BinaryStorage
- Processing Security Information
- Ontology Processing with Sesame: Configuration and Pipelets
- MimeTypeIdentifier
- Description of Worker and Pipelet Parameters
- Publishing Web Services
- General JPA Configuration in SMILA
- SMILA Version Information