Skip to main content

Notice: this Wiki will be going read only early in 2024 and edits will no longer be possible. Please see: https://gitlab.eclipse.org/eclipsefdn/helpdesk/-/wikis/Wiki-shutdown-plan for the plan.

Jump to: navigation, search

Difference between revisions of "SMILA/Documentation/JobManager"

(Job Manager)
Line 3: Line 3:
 
= Job Manager =
 
= Job Manager =
  
The Job Manager controls the processing logic of [[Glossary#W|asynchronous workflows]] in SMILA by regulating the Task Manager, which in turn generates tasks and decides which task should be processed by which [[Glossary#W|worker]] and when.  
+
The Job Manager controls the processing logic of [[SMILA/Glossary#W|asynchronous workflows]] in SMILA by regulating the Task Manager, which in turn generates tasks and decides which task should be processed by which [[SMILA/Glossary#W|worker]] and when.  
  
A workflow definition consists of one or multiple steps, called [[Glossary#A|actions]]. Each action specifies the worker that does the actual processing and connects the [[Glossary#S|slots]] of the worker with actual [[Glossary#B|buckets]]. A slot is a description of the worker's input and output behavior. A bucket is simply a logical data container grouping data objects of the same type. There are ''input slots'' that describe the type of data objects that the worker is able to process and also ''output slots'' that define the type of data objects that the worker will produce. Hence, to be able to use a worker in a workflow, you will have to assign buckets of the correct type. In addition to that, parameters may be set at different levels in a workflow to configure its behavior. Global parameters are valid for all actions in a workflow; specific parameters at the action level are only valid for the respective worker.
+
A workflow definition consists of one or multiple steps, called [[SMILA/Glossary#A|actions]]. Each action specifies the worker that does the actual processing and connects the [[SMILA/Glossary#S|slots]] of the worker with actual [[SMILA/Glossary#B|buckets]]. A slot is a description of the worker's input and output behavior. A bucket is simply a logical data container grouping data objects of the same type. There are ''input slots'' that describe the type of data objects that the worker is able to process and also ''output slots'' that define the type of data objects that the worker will produce. Hence, to be able to use a worker in a workflow, you will have to assign buckets of the correct type. In addition to that, parameters may be set at different levels in a workflow to configure its behavior. Global parameters are valid for all actions in a workflow; specific parameters at the action level are only valid for the respective worker.
  
To run a certain workflow in SMILA, you will have to create a [[Glossary#J|job]] definition that references the desired workflow. Like in workflows, it is possible to set further parameters, e.g. to adapt the used  workflow for a certain application. For example, when using the same workflow in two jobs you could set different parameters of the <tt>store</tt> parameter in each to make sure that data is written to different places.
+
To run a certain workflow in SMILA, you will have to create a [[SMILA/Glossary#J|job]] definition that references the desired workflow. Like in workflows, it is possible to set further parameters, e.g. to adapt the used  workflow for a certain application. For example, when using the same workflow in two jobs you could set different parameters of the <tt>store</tt> parameter in each to make sure that data is written to different places.
  
With a job definition, the system does not actually do anything. First, the job must be started to get a so called [[Glossary#J|job run]]. Job runs in turn consist of one or multiple [[Glossary#W|workflow runs]]. A workflow run refers to one traversal of the respective workflow, e.g. one traversal for each object in an input bucket.
+
With a job definition, the system does not actually do anything. First, the job must be started to get a so called [[SMILA/Glossary#J|job run]]. Job runs in turn consist of one or multiple [[SMILA/Glossary#W|workflow runs]]. A workflow run refers to one traversal of the respective workflow, e.g. one traversal for each object in an input bucket.
  
 
Job runs provide so called ''job run data'' which can be used to monitor the current job processing. Also, job runs can be canceled in case of a problem. Except for the so-called "runOnce" jobs, which are finished automatically, job runs must be stopped manually when they should no longer react on changes.
 
Job runs provide so called ''job run data'' which can be used to monitor the current job processing. Also, job runs can be canceled in case of a problem. Except for the so-called "runOnce" jobs, which are finished automatically, job runs must be stopped manually when they should no longer react on changes.

Revision as of 07:12, 12 July 2011

Note.png
Available since SMILA 0.9!


Job Manager

The Job Manager controls the processing logic of asynchronous workflows in SMILA by regulating the Task Manager, which in turn generates tasks and decides which task should be processed by which worker and when.

A workflow definition consists of one or multiple steps, called actions. Each action specifies the worker that does the actual processing and connects the slots of the worker with actual buckets. A slot is a description of the worker's input and output behavior. A bucket is simply a logical data container grouping data objects of the same type. There are input slots that describe the type of data objects that the worker is able to process and also output slots that define the type of data objects that the worker will produce. Hence, to be able to use a worker in a workflow, you will have to assign buckets of the correct type. In addition to that, parameters may be set at different levels in a workflow to configure its behavior. Global parameters are valid for all actions in a workflow; specific parameters at the action level are only valid for the respective worker.

To run a certain workflow in SMILA, you will have to create a job definition that references the desired workflow. Like in workflows, it is possible to set further parameters, e.g. to adapt the used workflow for a certain application. For example, when using the same workflow in two jobs you could set different parameters of the store parameter in each to make sure that data is written to different places.

With a job definition, the system does not actually do anything. First, the job must be started to get a so called job run. Job runs in turn consist of one or multiple workflow runs. A workflow run refers to one traversal of the respective workflow, e.g. one traversal for each object in an input bucket.

Job runs provide so called job run data which can be used to monitor the current job processing. Also, job runs can be canceled in case of a problem. Except for the so-called "runOnce" jobs, which are finished automatically, job runs must be stopped manually when they should no longer react on changes.

Using the Job Manager

This page and all referenced pages are work in progress.


Back to the top