Skip to main content

Notice: This Wiki is now read only and edits are no longer possible. Please see: https://gitlab.eclipse.org/eclipsefdn/helpdesk/-/wikis/Wiki-shutdown-plan for the plan.

Jump to: navigation, search

Difference between revisions of "SMILA/Documentation/JobManagerConfiguration"

(Configuring the Job Manager)
(Configuring the Job Manager)
Line 1: Line 1:
 
{{note| Available since SMILA 0.9!}}  
 
{{note| Available since SMILA 0.9!}}  
  
== Configuring the Job Manager ==
+
== Configuring the JobManager ==
  
The Job Manager comes with a simple configuration file located in <tt>SMILA/configuration/org.eclipse.smila.jobmanager/jobmanager.properties</tt>. By default it looks like this:
+
The JobManager is configured via the [[SMILA/Documentation/Bundle_org.eclipse.smila.clusterconfig.simple|ClusterConfig service]]. With the "simple" ClusterConfig service, it uses one properties in the "taskmanager" section of <code>clusterconfig.json</code>:
  
<pre>
+
<source lang="javascript">
jobmanager.task.max.retries.recoverable.error=10
+
{
</pre>
+
  ...
 +
  "taskmanager": {
 +
    ...
 +
    "maxRetries": 10,
 +
    ...
 +
  },
 +
  ...
 +
}
 +
</source>
  
;jobmanager.task.max.retries.recoverable.error: Defines the maximum number of retries for tasks finished with a RECOVERABLE_ERROR result, either explicity sent by the worker or by the task monitoring of the Task Manager due to the worker not having sent keepAlive signals anymore for a configured period of time. As long as this retry limit is not reached, the Job Manager will recreate the task with a new task ID but the same settings, so that it can be processed by another worker. If the retry count is reached, however, the RECOVERABLE_ERROR will be handled as a FATAL_ERROR. Default: 10.
+
* maxRetries: Used to decide how often a task should be retried that has failed with an RECOVERABLE_ERROR, either because the "timeToLive" was exceeded or the worker itself reported such an error. If the retry limit is reached, the task will finally fail with a FATAL_ERROR.

Revision as of 07:55, 4 October 2011

Note.png
Available since SMILA 0.9!


Configuring the JobManager

The JobManager is configured via the ClusterConfig service. With the "simple" ClusterConfig service, it uses one properties in the "taskmanager" section of clusterconfig.json:

{
  ...
  "taskmanager": {
    ...
    "maxRetries": 10,
    ...
  },
  ...
}
  • maxRetries: Used to decide how often a task should be retried that has failed with an RECOVERABLE_ERROR, either because the "timeToLive" was exceeded or the worker itself reported such an error. If the retry limit is reached, the task will finally fail with a FATAL_ERROR.

Back to the top