SMILA/Documentation/SimpleMimeTypeIdentifier

Bundle: org.eclipse.smila.processing.pipelets.mimetype.SimpleMimeTypeIdentifier

Description

This ProcessingService identifies the MIME type of a document. It detects the MIME type either from the file extension or from an attribute of the record that contains metadata. The identified MIME type is stored in an attribute of the record.
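The two detection strategies described above can be sketched roughly as follows. This is an illustrative sketch only, not the SMILA implementation: the class MimeTypeSketch, its methods, and the small extension table are hypothetical. It also assumes that the extension lookup is tried first, and that the metadata consists of HTTP-style header lines such as "Content-Type: text/html; charset=UTF-8". Neither assumption is confirmed by this page.

```java
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.Optional;

// Hypothetical sketch; names and lookup order are assumptions, not SMILA's API.
final class MimeTypeSketch {

    // Illustrative extension table; the real service uses its own mapping.
    private static final Map<String, String> BY_EXTENSION = Map.of(
        "txt", "text/plain",
        "html", "text/html",
        "htm", "text/html",
        "xml", "text/xml",
        "pdf", "application/pdf");

    // Looks up a MIME type for an extension such as "pdf", ".PDF" or " Pdf ".
    static Optional<String> fromExtension(String extension) {
        if (extension == null) {
            return Optional.empty();
        }
        String key = extension.trim().toLowerCase(Locale.ROOT);
        if (key.startsWith(".")) {
            key = key.substring(1);
        }
        return Optional.ofNullable(BY_EXTENSION.get(key));
    }

    // Extracts the MIME type from the first "Content-Type" header line, if any.
    // The header-line format is an assumption about what the metadata holds.
    static Optional<String> fromMetadata(Iterable<String> headers) {
        if (headers == null) {
            return Optional.empty();
        }
        for (String header : headers) {
            if (header == null) {
                continue;
            }
            int colon = header.indexOf(':');
            if (colon < 0) {
                continue;
            }
            String name = header.substring(0, colon).trim();
            if (!name.equalsIgnoreCase("Content-Type")) {
                continue;
            }
            // Drop parameters such as "; charset=UTF-8".
            String value = header.substring(colon + 1);
            int semicolon = value.indexOf(';');
            if (semicolon >= 0) {
                value = value.substring(0, semicolon);
            }
            value = value.trim().toLowerCase(Locale.ROOT);
            if (!value.isEmpty()) {
                return Optional.of(value);
            }
        }
        return Optional.empty();
    }

    // Assumed order: extension first, metadata as fallback.
    static Optional<String> identify(String extension, Iterable<String> headers) {
        Optional<String> byExtension = fromExtension(extension);
        return byExtension.isPresent() ? byExtension : fromMetadata(headers);
    }

    public static void main(String[] args) {
        System.out.println(identify("PDF", null).orElse("unknown"));
        System.out.println(identify(null,
            List.of("Content-Type: text/html; charset=UTF-8")).orElse("unknown"));
    }
}
```

In this sketch an empty result means that neither source yielded a MIME type. What the actual service does in that case is not described here.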

Useful Information

Note that this ProcessingService is also a DeclarativeService that implements the interface org.eclipse.smila.processing.pipelets.mimetype.MimeTypeIdentifier, so it can be used outside a workflow as well.

Configuration

  • configuration/org.eclipse.smila.processing.pipelets.mimetype/MimeTypeConfig.xml
Property | Type | Usage | Description
FileExtensionAttribute | String | optional | Name of the attribute containing the file extension
MetaDataAttribute | String | optional | Name of the attribute containing metadata information, e.g. a WebCrawler returns a response header containing MIME type information
MimeTypeAttribute | String | required | Name of the attribute in which to store the identified MIME type

Example

The following example was used in the SMILA example application to identify the MIME types of documents delivered by the Filesystem and Web crawlers.

MimeTypeConfig.xml

<PipeletConfiguration xmlns="http://www.eclipse.org/smila/processor">
  <Property name="FileExtensionAttribute">
    <Value>FileExtension</Value>
  </Property>  
  <Property name="MetaDataAttribute">
    <Value>MetaData</Value>
  </Property>
  <Property name="MimeTypeAttribute">
    <Value>MimeType</Value>
  </Property>    
</PipeletConfiguration>
