SMILA/Documentation/Importing/Crawler/Web
Currently, the web crawler workers have a very simple implementation so that we can test the importing framework. A more sophisticated implementation will follow soon (hopefully).
Web Crawler
- Worker name: webCrawler
- Parameters:
  - dataSource
  - startUrl
- Task generator: runOnceTrigger
- Input slots:
  - linksToCrawl
- Output slots:
  - linksToCrawl
  - crawledLinks
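
The two worker parameters listed above could be supplied through a job definition. The following is a minimal sketch only: it assumes a job is described by a JSON object with `name`, `workflow`, and `parameters` fields, and the job name, workflow name, data source value, and start URL are all hypothetical placeholders. Only the parameter names `dataSource` and `startUrl` come from the worker description.

```python
import json

# Parameter names taken from the webCrawler worker description.
REQUIRED_PARAMETERS = ("dataSource", "startUrl")


def build_crawl_job(name, workflow, data_source, start_url):
    """Build a job definition dict for a workflow that uses the webCrawler worker.

    The name/workflow/parameters layout is an assumption made for this sketch.
    """
    job = {
        "name": name,
        "workflow": workflow,
        "parameters": {
            "dataSource": data_source,
            "startUrl": start_url,
        },
    }
    # Reject empty or missing values for the parameters the worker expects.
    missing = [p for p in REQUIRED_PARAMETERS if not job["parameters"].get(p)]
    if missing:
        raise ValueError("missing webCrawler parameters: " + ", ".join(missing))
    return job


job = build_crawl_job(
    name="crawlWebExample",         # hypothetical job name
    workflow="webCrawlingExample",  # hypothetical workflow name
    data_source="web",              # hypothetical data source id
    start_url="http://example.org/",
)
print(json.dumps(job, indent=2))
```

Because the task generator is runOnceTrigger, a job built this way would presumably be started once per crawl run rather than left running; check the job management documentation for the actual submission steps.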