PTP/designs/Resource Manager Configuration
Contents
- 1 PTP 6.0.0 Changes
- 2 PTP 5.0.1 Changes
- 3 Introduction
- 4 Configuring/Customizing the Resource Manager
- 5 The JAXB Configurable Resource Manager XML Schema
- 5.1 The resource-manager-data Element
- 5.2 The control-data Element
- 5.2.1 Resource Manager Environment
- 5.2.2 The property and attribute Elements
- 5.2.3 The managed-file Element
- 5.2.4 The command Element
- 5.2.5 The launch-tab Element
- 5.2.5.1 The tab-controller Element
- 5.2.5.2 The tab-folders and tab-items Elements
- 5.2.5.3 The layout Element
- 5.2.5.4 The layout-data Element
- 5.2.5.5 The composite Element
- 5.2.5.6 The control-state Element
- 5.2.5.7 The widget Element
- 5.2.5.8 The button-group Element
- 5.2.5.9 The browse Element
- 5.2.5.10 The action-button Element
- 5.2.5.11 The attribute-viewer Element
- 5.3 The monitor-data Element
PTP 6.0.0 Changes
A number of new features and changes have been included in the 6.0.0 release.
- The
buttonId
attribute on widgets is now global. In the previous version button IDs were local to each tab. This allows a button on one tab to change the state of a control on a different tab; - The
control-state
element now supports obtaining state by comparing an attribute value to a string; - The
saveValueTo
attribute on widgets has been renamed to simplyattribute
. This is to reflect the fact that an widget can read from and save to an attribute.
PTP 5.0.1 Changes
Aside from some significant bug fixes, there are a number of new features with the 5.0.1 release.
- The
widget
component has been split up into three components:widget
,button-group
, andbrowse
; action
push-button functionality has been added;control-state
elements have been added to all UI control descriptors;- The launch tab
import
controller type has been modified (largely for maintaining unformity of implementation) to subclass thedynamic
controller; - Ability to inspect invisible discovered (excluded) properties and attributes via a "View" button (see note) has been added;
- Preferences have been added to help with the debugging of stream tokenizers;
- Resource Manager "environment" handling has been totally rewritten in order to support the exclusion of properties associated with invisible or disabled widgets, and to allow for the restoration of their values when re-enabled.
In addition, some tweaking of the SWT "knobs" was necessary to get the XML to reflect more closely the behavior of the Java classes (especially in terms of defaults).
Introduction
The JAXB Resource Manager plug-ins allow you to launch and monitor
applications on local or remote resources using resource managers which
are configured from an XML file via JAXB (
javax.xml.bind
) technology.
There are two main motivations for providing this class of resource managers:
- To allow for maximum adaptability. Often job schedulers (PBS, LSF, LoadLeveler, etc.) or interactive runtime systems (OpenMPI, PE, SLURM, etc.) are set up by system administrators in special or non-standard ways which make it difficult to use a generic tool. The configuration file allows a user or community of users to fit the resource manager to a class of systems, to a single host, or even to special application usage.
- Building the resource manager and its UI presentation from an XML configuration means that in most cases no special Java coding is necessary. Users should be able to accommodate new systems, at least on the client-end, without writing and loading additional Eclipse plugins. (The only qualification here is that the monitoring component also support that type of scheduler or runtime; see the following paragraphs.)
Additional considerations in designing a generically configurable resource manager were to partition the client functionality so as to eliminate the need for special server-side proxies and to scale more successfully in the updating of job and resource information.
To this end, JAXB resource managers now consist of two components, a "control", which governs the configuration, launch and cancellation of individual jobs entirely from the client end, and a "monitor", which displays job status as well as global information about the HPC resource. In most cases, the monitor will be a pre-built type provided by the PTP distribution, implemented using LLview. Since LLview already supports a good number of the standard scheduler types, adding a new resource manager type will normally entail only the specific configuration of its control part. The default mode of usage is that the client stages the necessary LLview components (mostly Perl scripts) automatically, but in a future release, the monitor will also be capable of connecting directly to a system-wide (web-based) deployment.
The following is a guide to the resource manager XML definition. Those interested only in using the JAXB resource managers already provided with the PTP distribution should consult the User pages under the relevant scheduler (currently only the PBS resource managers are JAXB-configurable; please see the PBS Res Mgr help).
Configuring/Customizing the Resource Manager
A simple description of creating a custom configuration for a resource manager based on an existing configuration is described on the PTP FAQ page item for customizing a resource manager
We provide here an introductory tutorial slide set demonstrating some simple additions to a pre-existing resource manager definition file.
Tutorial Slides - JAXBDemo.pdf
Example .xml definitions, including the demo-example.xml
used in this tutorial, are available here: Demo XML (PTP 5.0.0); further examples which work with the upcoming 5.0.1 release: Demo XML (PTP 5.0.1).
Preferences
The preference options are aids for debugging configuration changes. The first has to do with altering the default behavior of the resource manager, which is to cache its XML definition file; this can be overridden to reread the file every time you start the Resource Manager. This way, if you make changes, they will automatically be picked up without having to reboot the application or recreate the Resource Manager.
The other options are useful if you are writing a new stream tokenizer for a command; checking all four options usually produces quite a bit of output, so in that case it may be advisable to log the results to a file rather than stdout (at any rate the latter would be available only if PTP were running as an application within the Eclipse IDE platform, which is not the case for those modifying the XML simply from within the PTP package installation workspace).
The JAXB Configurable Resource Manager XML Schema
The JAXB Resource Manager is model-driven ; this means that its functioning and appearance are determined by a set of definitions provided via an XML file. What follows is a detailed explanation of the XML schema (XSD) governing the resource manager XML definition.
The resource-manager-data
Element
The top-level of the definition tree is contained within the resource-manager-data element. It consists of three elements: site-data, control-data and monitor-data. In addition, a resource manager should be given a name using the name attribute, which sufficiently distinguishes it from others of a similar type; e.g., pbs-torque-v_2.3.7_abe is specific to an installation on the host abe, ll-v_4.0 suits all installations of LoadLeveler version 4, etc.
The site-data element provides an optional place to set fixed or
default remote site information. The connection strings are URIs which
are specific to the PTP RemoteServices definitions. The scheme for
these URIs will usually name the specific remote service (e.g,
rse:
or
remotetools:
; local is simply
file:
). The host name and port given here will appear as defaults in the
resource manager selection wizard when you create a new connection.
The control-data element is used to define the control actions of the resource manager. The top-level control elements include properties and attributes, files to be staged, job script to be generated (if any), commands specific to the resource manager, and the layout of the Launch Tab.
The monitor-data element is used to define the monitoring actions of the resource manager. The top-level monitor elements include properties, drivers for specifying paths to remote commands, and as element for specifying new monitoring workflows.
The control-data
Element
The majority of the XML definition is given over to the set-up of the resource manager control. One can think of this section as having four subdivisions:
- Configuration Variable Definitions (the Environment)
- Files and Scripts
- External Commands and their Stream Parsers
- UI Configuration (Launch Tab)
We will look at these each in turn.
Resource Manager Environment
The resource manager implementation constructs a variable map from the
defined properties and attributes which serves as the resource manager
"environment". Some elements directly reference these by name; a
text/string representation of property or attribute fields can be
obtained via the Eclipse variable resolver syntax, with the namespace
prefix set to ptp_rm:
${ptp_rm:name#fieldName}
, e.g.,
${ptp_rm:queues#value}
(see further below on the specific fields
for properties and attributes).
All properties and attributes defined in the configuration are included in the map. The following hard-coded properties are also added at runtime:
- os.name
- Operating system name of the (remote) connection.
For example, given results from the "uname" command:
- Linux
- AIX
- Mac OS X - if results equal "Darwin" then results from "sw_vers -productName"
- everything else - results from "uname" command
- os.version
- Operating system version of the (remote) connection.
For example:
- For Linux - results from "uname -r" such as "2.6.32-279.2.1.el6.x86_64"
- For AIX - results from "oslevel" such as "7.1.0.0"
- For Mac OS X - results from "sw_vers -productVersion" such as "10.8.3"
- For everything else - "unknown"
- os.arch
- Machine architecture of the (remote) connection.
For example:
- For Linux - results from "uname -m" such as "x86_64"
- For AIX - if results from "uname -p" equals "powerpc"
- then if "prtconf -k" contains "64-bit" then "ppc64" else "ppc"
- else the result from "uname -p"
- For Mac OS X - if results from "uname -m" equals "i386"
- then if results from "sysctl -n hw.optional.x86_64" equals "1" then "x86_64" else the results from "uname -m"
- else the results from "uname -m"
- For everything else - "unknown"
- file.separator
- File separator character of the (remote) connection. Hardcoded "/" (forward slash).
- path.separator
- Path separator character of the (remote) connection. Hardcoded ":" (colon).
- line.separator
- Line separator character of the (remote) connection. Hardcoded "\n" (new-line).
- control.user.name
- User name of the (remote) connection used by the resource manager.
- control.address
- Address (hostname) of the resource manager connection.
- control.working.dir
- Working directory of the resource manager connection. Usually the user's home directory.
- executablePath
- The path of the executable to be submitted or launched by the resource manager. This path is only valid on the target system.
- executableDir
- The directory containing the executable on the target system. Only valid on the target system.
- progArgs
- The arguments to be passed to the executable prior to launch.
- directory
- The working directory specified in the launch configuration Arguments tab.
In addition, there are a number of special properties that are required so that stdout
and stderr
from a scheduled job can to be delivered to the client. These properties
should be included in the resource manager property set.
- control.queue.name
- Should be linked to an attribute or property containing the destination queue.
- stdout_remote_path
- Can be linked to an attribute or property containing a path, otherwise a default path will be used.
- stderr_remote_path
- Can be linked to an attribute or property containing a path, otherwise a default path will be used.
These properties may be linked (see below) to other attributes to furnish the resource manager with an implementation-independent variable for the queue and for any remote output files produced in connection with the job submission.
The property
and attribute
Elements
A property is any variable necessary for the functioning of the resource manager. Properties often (but not necessarily) are not visible. The value for properties can be any primitive type, or lists or maps of strings.
The distinction between attribute and property is not hard and fast, but the intention is to provide a way of differentiating between external, system-determined variables (attributes) and variables only required by the PTP implementation itself. There are several other elements and fields on the attribute, e.g., for type-specific defaults (min/max, choice). Generally, if the variable is exposed to the user, who should also be provided with a description or instructions for use, then it should be defined as an attribute.
The visible attribute is a way of indicating to the resource
manager that the user will not be directly changing the value of the
property or attribute via the user interface. Certain widgets (such as
the
attribute viewer
) check this to see if the property or attribute should be included
automatically in its list.
BEST PRACTICE: VISIBLE vs INVISIBLE
Always mark properties or attributes which are the targets of attribute (see below) as visible="true"; if this value is false, this variable will always be passed to the configuration/environment of the resource manager; selection/exclusion of sets of properties or attributes pertains only to those marked visible.
status is an optional field for capturing the validity of the attribute at runtime or for a particular installation, if such information is made available by the system. The readOnly property can be defined here, but more often will be enforced directly on the widget which is connected to the property or attribute. min and max give the boundaries for integer values, usually represented by spinner widgets. translateBooleanAs applies to boolean values which should be represented as strings other than "true" and "false"; use a comma-delimited pair to indicate the string equivalent for the boolean; e.g., "YES,NO" would mean true yields the first string and false the second.
description
is meant to be a brief one-sentence definition; most of the time
detailed information will go into the
tooltip
element.
choice
is a predetermined, fixed set of values to choose from; these are
comma-delimited and will appear as the items of a combo-box.
items-from
instead links the combo items (choice) to another property or attribute
value (whose type must be
java.util.Collection
).
Property and Attribute Values
The untyped
value
element on properties and attributes is for internal use only; to give
a predefined (primitive) value, use the
default
element along with the type attribute.
link-value-to
works as follows: the property or attribute gets its value from the
property or attribute linked to, unless that property or attribute has
an undefined (empty) value; in the latter case, any default defined on
the present property or attribute will be used. This feature is
primarily intended for defining dependent properties or attributes
whose values are not directly modified by the user.
Attributes also carry optional validators which allow you to enforce syntactic rules, numerical ranges (each with one or both boundaries; these are OR'd) or file-system semantics on the value when the user enters text.
The regex
and file-match
Elements
The validator makes use of the
regex
type (also used in the parser or tokenizer; see below),
along with a special
file-match
type. A regex can be used to match by setting the expression
field; to split the segment into tokens on the basis of the expression,
set the split field to true (this applies only to the
match
type discussed below). The expression should follow the
grammar in
java.util.regex
; the flags are those made available in the
java.util.regex.Pattern
class (consult the Javadoc for explanations):
- CASE_INSENSITIVE
- MULTILINE
- DOTALL
- UNICODE_CASE
- CANON_EQ
- LITERAL
- COMMENTS
- UNIX_LINES
These can be OR'd in the usual manner. NOTE: when using the
regex on the match
type, the expression can contain variable references to be resolved
in the environment.
file-match
exports the attributes pertaining to
org.eclipse.core.filesystem.IFileInfo
; efsAttributes is an OR'd string of the EFS constants:
- ATTRIBUTE_READ_ONLY
- ATTRIBUTE_IMMUTABLE
- ATTRIBUTE_OWNER_READ
- ATTRIBUTE_OWNER_WRITE
- ATTRIBUTE_OWNER_EXECUTE
- ATTRIBUTE_GROUP_READ
- ATTRIBUTE_GROUP_WRITE
- ATTRIBUTE_GROUP_EXECUTE
- ATTRIBUTE_OTHER_READ
- ATTRIBUTE_OTHER_WRITE
- ATTRIBUTE_OTHER_EXECUTE
- ATTRIBUTE_EXECUTABLE
- ATTRIBUTE_ARCHIVE
- ATTRIBUTE_HIDDEN
- ATTRIBUTE_SYMLINK
- ATTRIBUTE_LINK_TARGET
The lastModified... fields require the format
yyyy/MM/dd HH:mm:ss
.
The managed-file
Element
By "managed file" is meant a local file required by the executable but
which may not be present on the host on which the job
will run; hence, these files may either be external or may actually be
generated from the resource manager environment in conjunction with the
job submission, but in any case need to be copied to that host just
prior to it. The
script
file used in connection with scheduler (batch) jobs is a special
category of the
managed-file
type, and will be discussed in the next sub-section.
Managed files are added to the definition in groups determined by
their shared staging location, which is a path relative to the working
directory of the connection. (The batch script is by default staged to
.eclipsesettings in the user home directory, since it is by
default temporary and deleted after the submit call returns; however,
one can use the
file-staging-location
on the
script
element to set the path explicitly.) The file itself can be set to be
deleted after submission (deleteTargetAfterUse; this is
currently not implemented in the general case) and if the file is
actually generated by the resource manager, the local temporary copy
can be deleted or retained (deleteSourceAfterUse). In the
latter case, one can also distinguish multiple copies of the generated
file by setting uniqueIdPrefix to true.
The managed file definition allows one of three possibilities as to specifying content.
- If it is an external file or one edited perhaps through the
workspace, one can use the
path
element to indicate location; this can be either a hard-coded path or a reference to a property or attribute value. - The
contents
element is a string which literally represents the file contents. The resolveContents attribute in this case is used to indicate whether to pass this string through the variable resolver, substituting any references to property or attribute values it may contain. However, caution should be exercised here in that this string should not contain "${...}" sequences which do not actually refer to Eclipse variables (such as batch script variables), or the resolution will fail. If resolveContents is set to false, you can still provide this text element with a single reference to a property or attribute, in which case the value of that property or attribute will be used as is, without further dereferencing. - The preferred, and most flexible way to represent the contents
of a file to be generated, however, is to use the
line
type which is discussed below in connection with thescript
type.
When the submit call is executed, managed files are written out, if so indicated, and then their target paths are determined as follows:
- If the
path
element is used, the target path becomes staging directory / name of pre-existing file; - If the
content
orline
elements are used, the target path becomes staging directory/ [uniqueIdPrefix]managed-file-name); - A property is placed in the environment whose name is the managed-file name and whose value is this target path.
The script
Element
A resource manager definition for a scheduler system such as PBS or LoadLeveler can be instrumented to work without a batch script (by setting command-line flags or passing all the job control information as environment variables, for instance), but most frequently the use of a script allows more flexibility in configuring the job (interactive managers of course have no need of a script).
If the script is specified in the XML definition, its path is
automatically added to the list of managed files to be staged to the
appropriate directory (by default .eclipsesettings, or
whereever indicated by the
file-staging-location
element), and so there is no need to include a script entry explicitly
under the
managed-files
element. As with the
managed-file
, deleteAfterSubmit indicates that the script target should not
be retained (this is the default behavior); unlike the
managed-file
, however, the local copy of the generated script is always deleted. A
reserved property,
managed_file_for_script
, should be used to reference the script's path on the target
resource; for instance, in the PBS submit command:
<arg>qsub</arg> <arg>${ptp_rm:managed_file_for_script#value}</arg>
Note: If the
import
tab (see below) is used to provide an external or
workspace edited batch script to the run, nothing extra need be done in
the definition XML, as the presence of any external path for the script
is handled under the covers.
insertEnvironmentAfter is a line number indicating where in the script to add any extra environment variables set through the Environment tab provided as part of the Run Configuration wizard. This way the user has control over whether these should overwrite previously defined environment variable values.
The line
Element
The
line
element was mentioned above in connection with managed file
content; it is essentially a wrapper around a series of whitespace separated
arg
elements, all of which are placed on a single line (that is, the group
is terminated by a line separator).
The arg
Element
The
arg
element is used for script and managed file content as well as in the
definition of commands. Its text element can contain variable
references to be resolved against the environment, but also allows for
the entire string to be treated as a literal by setting resolve
to false (default is true). This is useful inasmuch as it allows for the presence of batch-type
variables (e.g., ${HOME}) which should be resolved by the remote shell
and not inside the Eclipse client.
The default behavior of the argument resolver is not to write out or
include arguments whose value is undefined (either
null
or zero-length).
The isUndefinedIfMatches attribute affords more nuanced control over whether an argument should be so eliminated. If the argument references property or attribute values, but also has text-literal segments, a regex can be provided to define what an "empty" argument would be in this case. For instance, if an argument flag should not appear when the value it precedes is an empty string, one could write:
<arg isUndefinedIfMatches="-f">-f ${ptp_rm:flag#value}</arg>
For the purposes of matching, trailing whitespace is trimmed from the resolved argument, so there is no need to specify this as part of the regex used to match.
"Just-in-time" resolution of @jobId and managed-file
paths
@jobId is a special property name designating the runtime id for a job instance. In the lifecycle of the run/launch (submit) call, this value begins as an internally generated unique id which then is swapped for the id returned by the scheduler or runtime system.
The @jobId, along with the target paths for
managed-file
elements, are not known at configuration time (i.e., before the user
hits "Run"). While the former is made visible to the parsers and the
returned status object of the submit command, neither is in the scope of (available for reference in) other managed files or the
script
element, because these latter elements are generated just prior to the actual
submission.
If the
script
needs to refer to the @jobId, it must do so via the variable
made available by the particular scheduler it is written for. An
example of how to reference the target path of a
managed-file
inside the
script
is included in the tutorial slides ; this essentially
involves defining a variable in the submission command's environment,
with a reference to the
managed-file
path property as its value, and then using this environment variable
inside the
script
.
The command
Element
Commands are system calls, either to a local or remote OS,
depending on the connection defined for the resource manager and
denotes a [UNIX-type] system call made on the resource manager's remote
(target) connection (PTP does not generally support execution on
Windows systems). The command is always configured and executed through
a Java API (the "process builder") which bottoms out in a
"sh -c
"
command.
The start-up-command and shut-down-command are arbitrary commands to be run (serially in order) when the resource manager is started or stopped.
The submit commands are those used to launch jobs. Currently a configuration may have only either batch or an interactive commands. Normal (run) mode is specified using the submit-batch or submit-interactive commands. Debug mode is specified using the -debug versions of these commands. In the future we may allow all batch and interactive commands to coexist in a single configuration.
The get-job-status is a user-initiated (on-demand)
request to refresh the status information for a submission. Normal
(polled) updates, on the other hand, are the responsibility of the
monitor
component. The status command nevertheless needs to be implemented in
most cases, as it will be called internally just after submission.
The terminate-job command is used to terminate a previously launched batch or interactive job. Note: if the submission type is interactive, the terminate-job command usually does not need to be implemented, as the process termination will be handled internally. However, in some cases (such as PBS -I) which require the interactive job to run as a pseudo-terminal, one may need this command in order to force its termination externally.
The remaining -job commands are operations which can be executed on jobs submitted to job schedulers (batch-systems) and do not apply to resource managers which connect to interactive runtime-systems such as OpenMPI or PE.
The button-action is an arbitrary
command associated with a button exposed through the
Launch Tab
(see further below).
There are a number of attributes that can be specified on the command-type
element:
- directory (defaults to the "home" or working directory of the remote control connection, i.e. the
control.working.dir
property) - Specifies the location on the remote systemto where the command will be executed
- redirectStderr (default false)
- Merge both the output and error streams onto
stdout
- streamBufferLimit (not currently used)
- Used to specify the buffer sizes on the stream readers
- replaceEnvironment (default false)
- Specifies if the environment set on the command should entirely replace the shell environment (the default behavior is to append the command environment)
- ignoreExitStatus (default false)
- Specifies that no error should be thrown in the case of non-zero exit of the command
- flags
- An OR'd string of three possible values:
- NONE (default)
- ALLOCATE_PTY (allocates a pseudo-terminal)
- FORWARD_X11
- waitForId (default false)
- Indicates to the resource manager that the output stream for the command is being parsed for an id which will appear as a property in the environment during the command execution, and that the execution should not return until it sees this id. Very often the submit commands will have these semantics.
- keepOpen (default false)
- Indicates that the command should be held open for potentially repeated redirection of input (see next paragraph). There can only be one such process open inside the resource manager at a time (these semantics will usually pertain to interactive managers using a pseudo-terminal mode to control job submission). NOTE: if keepOpen is false, then the internal command will not complete until the remote process has finished.
BEST PRACTICE: WAIT-FOR-ID
As previously described, there are two basic types of commands, (uu)id commands which are related to a specific job, and generic (non-uuid) commands.
submit-
and-job-
commands fall into the former category; those in the latter category are usually simple interactive commands, and their output is not redirected to the console, but uuid-commands have several subclasses:
COMMAND TYPE WAIT FOR ID TOKENIZER STATES batch yes SUBMITTED interactive no interactive yes RUNNING interactive, open no interactive, open yes (SUBMITTED,) RUNNING interactive, open, pseudo-terminal no interactive, open, pseudo-terminal yes (SUBMITTED,) RUNNING
This table shows the various combinations that are feasible between command type and waiting states. Simple interactive commands, whether held open for further input or not, can wait or not wait, but in the case of waiting, they should be provided with a stream tokenizer which recognizes and sets the RUNNING state. Batch jobs, on the other hand, will usually have a tokenizer which recognizes and sets SUBMITTED. In the case of some open-process jobs (e.g.,
qsub -I
for PBS, which also requires a pseudo-terminal to connect remotely), there will usually be a pause before the terminal job is actually scheduled; in order to allow the user to see that the job has been accepted and is pending, the tokenizer needs to set both SUBMITTED and RUNNING states, the latter when the job has actually started.
The arg
Element
The first argument of the command must be the name/path
of the executable only. Its own arguments can then be subsequently
specified by an arbitrary number of
arg
elements. See above for more details.
More than one argument can be specified using an arg
type.
The input
Element
The
input
element is similar to the arg
element
but is directed to the input stream of the command. If the keepOpen
attribute is true, a check will be made to see if an open process already exists
(which is also alive), which will then be used; otherwise, the
arguments are executed, then the input arguments are given to the
process. With an open command/process, the input arguments can be fed
repeatedly to the same process. This allows, for instance, for
continuous testing of an interactive job in the same interactive
session.
The environment
Type
The environment
element can be used to pass strings containing properties or attributes to the target system as environment variables. The name attribute is used to specify the name of the environment variable that will be set on the target system. The value attribute specifies the string value of the environment variable. This string is resolved against the variable map, so can contain property or attribute references.
This element can be used in conjunction with the replaceEnvironment attribute on the command.
Stream Parsers
One can attach parsers (which we also refer to as tokenizers) to the
stdout and stderr streams of any
command
element using the stdout-parser
and stderr-parser
elements respectively. The redirect-parser
element can be used to parse a combined stdout and stderr stream.
This allows output to be captured and used to side-effect existing
properties or attributes, or to generate new ones on the fly. While the
parser is not completely general, it is capable of a wide range of
tasks which would typically be required in the handling of output from
batch and runtime systems. We here explain the main elements (
target, match, test
) used by the tokenizer; one should further consult the tokenizer examples demonstrating various usage scenarios.
The built-in tokenizer can read the stream in two different ways. If delim is provided, the stream is split using the indicated value. The string should only be one character in length (escaped or non-escaped). Provision is made for the '\r\n' (Windows) two-character delimiter internally; in this case the delimiter should be set to "\r" (however, as already mentioned, PTP does not generally guarantee that system calls will work on Windows). Setting includeDelim means that the delimiter will appear as the last char on the returned stream segment.
The second way to read from the stream is to provide a maxMatchLen size; what this indicates is that whatever substring needs to be found on the stream will not exceed this length. The stream is then read in swatches of maxMatchLen, with the internal buffer set to twice this size, so that each successive read shifts the buffer to the "left" by one length. This guarantees that all such substrings will eventually be matched.
Sometimes a sort of "look-ahead" paradigm is necessary. For instance, one may need to match a segment or segments whose position is defined from the end of the output, but you do not know in advance the actual stream length. In this case, one can opt to read until the end of the stream (all="true"), retaining only the last N buffer-lengths or delimited segments, as indicated by the save field. When the parser reaches the end of the stream, it will then apply the various targets to each saved segment in order.
The applyToAll attribute is discussed further under
target
. The
exit-on
element indicates that the tokenizer should quit immediately when it
encounters this pattern;
exit-after
indicates that the tokenizer should quit when it encounters this
pattern, but should first apply the current segment to its targets.
The type
Element
The
type
element will most commonly not be set, meaning the built-in parser
will be used; however, it is possible to implement a custom parser as
a contribution to the
org.eclipse.ptp.rm.jaxb.core.streamParserTokenizer
extension point, in which case this element should be set to its
extension id. Note that the extension requires the class to implement
org.eclipse.ptp.rm.jaxb.control.internal.IStreamParserTokenizer
, which is a Runnable interface with an initialize method which passes in
any job id plus the current environment map; the details of such a
parser's implementation are not, however, configured from the XML
document.
The target
Element
A tokenizer may be given any number of
target
elements. The target denotes a particular value (object) currently in,
or to be written to, the environment, which will be side-effected on
the basis of the result of the tokenization. A target in turn contains
match
elements and
test
elements; the former are run as part of the stream processing; the
latter are run after the stream processing has been completed. The
optional
else
element is applied only if there are no other tests defined or if none
of the defined tests succeed).
The target object is either to be constructed at match time, or it
pre-exists in the environment. If constructed, the type field is
used to indicate whether the object is a
property
or an
attribute
type; otherwise, ref points to the name of the
property
or
attribute
in the environment (recall that for the runtime job identifier, @jobId
is used).
Note: when new targets are constructed, there is a merge
operation at the end of tokenization which attempts to combine objects
into a single instance identified by their name attribute. This
assumes that such names will be unique and that any other values to be
set on the object which are not explicitly bound in some way to that
name via the match pattern will appear on the stream before a new name
does (see ex. 5 in tokenizer examples).
The default behavior of this merge is that it will fail
if two objects with the same name but differing values are generated
by the parsing. (This excludes add
and put
operations which create a list or map; in these cases, the two
collections or maps will be combined into one. This does NOT
work, however, for append
actions.) To allow duplicates,
set the allowOverwrites to true; in this case, successive duplicates
simply replace the preceding object.
The default behavior of the tokenizer read-match sequence is as follows:
- read from the stream either a set number of chars or until the delimiter is found;
- for each target:
- for each match:
- if the match is positive, process each of the actions, and go to (1).
- for each match:
Only one qualifying target is processed for any given segment read, and for the given target, the first pattern matched is the one processed for its actions. This is basically the "OR" semantics of normal logic programming; hence the implementer must be careful to arrange the matches inside a target in such a way that the more specific match patterns precede the more general.
Three boolean fields allow you to modify this behavior.
- The applyToAll field on the
tokenizer-type
element means take the unmatched part of the read stream and pass it to the next target, even if there was a previous match; this allows you to capture more than one regex pattern per stream segment (see ex. 6 in tokenizer examples). - The matchAll field on the
target-type
element means do not try to match an already matched expression until all the others are matched (i.e., a logical AND instead of OR governs the set of matches at successive calls to the target match operation); this allows one to use, for instance, .* repeatedly but set different fields of the object with the resulting match (see ex. 5 in tokenizer examples). - The moveToTop field on the
match-type
element indicates to the tokenizer that the matched target be promoted to first position in the list of targets. This is useful when there is an ordering which expects types of attributes or properties to be grouped in sequence on the stream (see ex. 4 in tokenizer examples).
When a match is found, the set of action types it contains are all applied.
The match
Element
Each of these types corresponds to an action to be taken on the indicated field of the target object.
set |
sets the value of that field |
append |
adds to a string buffer, whose string value will be set on the field |
add |
adds to a list to which the value of that field will be set |
put |
places a key-value pair in a map to which the value of that field will be set |
throw |
throws an exception and (optionally) also sets the value of the field |
The actions listed here all have
entry-type
children, either single (
set, throw
) or potentially multiple. All of these except
throw
also allow you to force the creation of a new object (forceNewObject)
each time it is applied; the new object then replaces the current one
for successive actions in the match.
The entry
Element
This value-abstraction allows one to set key (for maps) and value as literals or references to other properties or attributes to be resolved in the current environment; to reference the matched segment parts one sets keyIndex and valueIndex if the regex was used to split the segment; otherwise, keyGroup and valueGroup refer to the capture group of the regex pattern, with group 0 referring to the entire match.
The test
Element
As mentioned above, the
test-type
elements are all run after the tokenization has reached the end of the
stream. This class of actions is useful for setting values based on
other values produced during tokenization. A test is one or more
comparison operations plus a set of actions to apply to the target
fields in the case of either success or failure (the "else" element);
see ex. 3 or the "get-job-status" example in tokenizer examples.
The op attribute can be one of the following comparisons:
EQ | : | equals |
LT | : | less than |
LE | : | less than or equal to |
GT | : | greater than |
GE | : | greater than or equal to |
When the operation is set to one of these, it is expected that the two
value
elements will be used. As usual, these elements can be literals or can
contain variables to be resolved into a string type;
#FIELD
refers to the value of the given field on the current target; the
strings will be converted in conformity with the inferred (primitive)
type of the comparison. The
else
element also pertains to comparison tests; the actions listed there
will be taken upon failure of the comparison.
The op attribute can also be a logical operator [AND, OR, NOT],
in which case the embedded
test
object should be used; these can be nested to an arbitrary depth, but
of course must bottom out in a comparison operation.
Tokenizer Examples
See: Tokenizer Example XML.
Example | Description |
---|---|
1 | output is a list of line-separated queue names to be assigned to the known property "available-queues" |
2 | output is to be searched for its final line which should contain a job id of the form "[digits].[chars]" |
3 | indeterminate number and order of lines containing parts of attribute definitions, but each line bearing a distinct id (e.g., openMPI attribute discovery) |
4 | indeterminate number of definitions, but grouped by caption; use of moveToTop to promote the target to the top of the list when the caption appears |
5 | similar to 4, but without delimiter (implicit ordering) |
6 | similar to 4, but with indeterminate type order and using buffer + DOTALL | UNIX_LINES |
7 | indeterminate number of property definitions, but on single line |
8 | looking for values interspersed in the stream but which will not exceed 32 chars |
9 | successive names/values, in order, but staggered |
10 | forced merge |
11 | exit-on |
12 | exit-after |
13 | get-job-status (use of tests) |
The pre-launch-cmd
and post-launch-cmd
Elements
It is sometimes necessary to execute commands just prior to and just after submitting the job. This is particularly the case for -debug
commands, where the debugger needs cleanup and/or other types of actions to launch correctly.
The pre-launch-cmd
and pre-launch-cmd
elements only allow simple command execution. Stream parsers and other command
elements are not available.
Available attributes include:
- exec
- Specifies the command to be launched along with any arguments. Property and attribute substitution is performed on this string.
- directory
- Specifies the directory in which the command will be executed. If not supplied, the main command directory will be used.
- ignoreExitStatus
- Allows the exit status of the command to be ignored. If not set, and the command exits with a non-zero exit status, the entire submission will be aborted.
- wait
- Specifies that the submission should not proceed until the command has completed. Should only be used if the command completes in a reasonable time.
The launch-tab
Element
The UI component which needs to be configured in conjunction with the
resource manager control is the Launch Tab, which is actually
the main area of the Resources Tab, one of the six tabs
comprising the Run Configuration wizard or the seven tabs
comprising the Debug Configuration wizard. The purpose of this
component is to allow the user to set up the job by providing values
relating to resource requirements or to the properties of the job in
general. In other words, it is in this component that the user can set
the values on the
attribute
elements making up the configurable environment as defined in the XML.
Specifying the composition and layout of this component can be tricky
and tedious because it deals with widgets defined by the Eclipse
JFace
library and with their underlying implementation in
SWT
. We intend in the near future to provide some standard "templates" for
typical widget groups that can be cut and pasted into the XML
definition. Some of this can already be achieved by importing into the
workspace a provided definition (such as for the PBS resource manager)
and modifying it or borrowing from it. This is, in fact, the approach
demostrated in the tutorial.
Throughout the following, it may generally be assumed that
fields such as style, foreground, background,
etc., take string equivalents of the corresponding SWT
constants (e.g., "SWT.LEFT", "SWT.VERTICAL", "SWT.NONE") which can be
OR'd (using "|") wherever they would be in Java code. We will not
exhaustively specify here which constants pertain to which widgets,
because in most cases we have tried to reflect very closely the names
of the SWT
object types and fields, and this information
can be found by consulting the Javadocs relative to the widgets in
question.
As seen from this top-level specification, the Launch Tab can have any
number of
dynamic
tab-controllers, "dynamic" meaning configurable from the XML, which
will discuss in more detail in the following paragraphs.
There is also a fixed
import
controller type (implemented as an extension/subclass of the dynamic
controller, in order to provide uniform handling of the environment);
this component allows you to browse the workspace to set a custom
script as the one to run. Its main text area widget is read-only; any
editing of the script must be done through the workspace editor. The
browse is local only, so remote scripts should first be imported into
a local project in the workspace (using the remote systems import
wizard), and then captured by this controller's browse button.
The configurable part of the import tab is an optional attribute viewer for selecting the variables from the environment which you wish to pass to the launch command along with the script. The values set here should correspond to whatever the script contains; they are set for the benefit of the internal consistency of the PTP client, and do not override those hard-coded in script. For an example, see the Import PBS Script tab.
The tab-controller
Element
The
dynamic
controllers belong to the
tab-controller-type
, which contains an arbitrary number of
tab-folder,
composite,
widget,
browse,
button-group,
action,
or
viewer
elements; as will be seen shortly, the first two, which are
"containers", are potentially recursive elements, so that one can nest
widgets in the usual manner. For all intents and purposes, the
tab-controller-type
is the same as a
composite
, but without the group attribute and with the extra field for
controlling shared widgets.
The includeWidgetValuesFrom field is important. By default, it
is set to empty, meaning that the subset of the total resource manager
environment represented by the controller's widgets is all that will
be passed to the actual job configuration when the job is launched
from this tab. This behavior can be changed by giving a
comma-delimited list of the
title
values for other controllers. In this case, the configuration for the
job, when the "Run" button is pressed with this controller being the
visible one, will also include all the properties or attributes
valid for the listed but currently invisible controllers.
Property/Attribute Names vs Values
While this field controls the inclusion or exclusion of attribute and property names in the configuration, their values are always shared across the controllers (there is only one environment); thus if a variable value is changed on one controller, and this variable is also exposed on another, the value for that variable will also change on the other controller when it becomes visible; similarly, if the user switches off a controller to another having only a subset of the first controller's variables, the variables not represented on the second controller will become undefined or return to their defaults.
Note that the union of valid names is not transitively closed; that is, if Tab1 has includeWidgetValuesFrom="Tab2", and Tab2 similarly has includeWidgetValuesFrom="Tab3", Tab1 will get only the valid values local to Tab2. Note also, that if there are names which intersect with strictly local widget targets, the local settings (visibility, selection, enablement) still determine whether they belong in the set of valid property/attribute names.
The showViewConfig and showViewExcluded fields allow
control over whether the "View Configuration" and "View Excluded"
buttons appear at the bottom of the tab (the "View Script" button appears if
the configuration contains a
script
element). showViewConfig is true by default; this button brings
up a scrolling dialog which allows you to inspect the current
configuration (that is, the one relevant to the visible controller). showViewExcluded
is by default false, and should only be set if the resource manager
processes attributes or properties discovered by a command at run
time; this dialog allows you to see which ones have been marked
invisible; unlike invisible predefined properties or attributes,
which are always included, invisible discovered properties or
attributes are always excluded from the configuration and thus
do not persist across resource manager sessions.
The tab-folders
and tab-items
Elements
The
tab-folder
is a container which looks like a file-folder, with tabs running across
the top or bottom (style includes SWT.TOP, SWT.BOTTOM). Each tab label corresponds
to a
tab-item
which provides the actual visible content. The
tooltip
pertains to the entire folder.
The
tab-item
is another all-purpose "container" in which all widget types can be
embedded, but distinguished from the composite
in that it can only be the child of a
tab-folder
and that it can have a tooltip (which appears on the tab).
As with
composite
(see below), the tab folder and tab item take
layout
and
layout-data
elements; these closely reflect their
SWT
objects and types; for further explanation we refer the reader once
again to the relevant Javadocs.
The layout
Element
The layout-data
Element
The composite
Element
The main container, of course, is the
composite
; its sub-type, the group (group="true"), allows for decoration
and a title, while the simple composite acts as a widget holder and has
no visible characteristics, though it does have a layout. As is
evident, composites can be contained in composites and tab-items, and
tab-folders can be contained in composites.
The control-state
Element
It should be evident that all the XML elements corresponding to UI SWT
control
types have been given
control-state
children.
The purpose of these elements is to provide an optional way of setting
the enabled and/or visible states of the control on the
basis of the state of button (checkbox/radio) selection. One can
define two such elements per control, using one of the show/hide or
enable/disable pairs. Each of these is associated with a
control-state-rule:
The basic rule is defined by its attributes:
- the id of the button control to check;
- whether its selected state should be true or false.
Rules can be nested to an arbitrary depth to form clauses governed by the logical operators.
A
SelectionListener
is created for this target which subscribes to all the controls in the
rule; when it receives an event, it evaluates the rule, and if the
result is true, takes the associated action on this target.
When one wishes to wire two or more such controls in this way, the
"trigger" button control must have an id unique to the resource
manager configuration. This will be assigned via the buttonId
element on the
widget
corresponding to the button (true also for the buttons nested in a
button-group
).
Here is an example which sets the enabled and visible properties of both a text field and a related checkbox:
<widget type="checkbox" buttonId="002" title="Host list:" attribute="checked002"> <layout-data> <grid-data horizontalAlign="SWT.LEFT" verticalAlign="SWT.TOP"/> </layout-data> <control-state> <show-if button="001" selected="true"/> </control-state> </widget> <widget type="text" style="SWT.V_SCROLL | SWT.BORDER" attribute="host_list"> <layout-data> <grid-data horizontalAlign="SWT.FILL" verticalAlign="SWT.FILL" horizontalSpan="2" grabExcessHorizontal="true" grabExcessVertical="true"/> </layout-data> <control-state> <show-if button="001" selected="true"/> <enable-if> <and> <rule button="001" selected="true"/> <rule button="002" selected="true"/> </and> </enable-if> </control-state> </widget>
Note also that this implicit dependency graph is checked for cycles; the building of the UI tab will be aborted if such a cycle is discovered.
BEST PRACTICE: CONTROL STATE
An invisible property should be assigned to each checkbox which controls the state of other widgets (e.g., checked002 in the example above); this will allow you to restore the state of the tab correctly when it is rebuilt or re-initialized.
The widget
Element
We now come to the five non-container widget elements with functional
characteristics. The first of these is the
widget
, which comprises a number of simple widget types.
The type of the widget can be one of the following:
label |
fixed text for display |
text |
area for entering/editing text |
checkbox |
corresponds to a boolean choice |
radiobutton |
usually used in connection with button-group |
spinner |
provides a continuous range of integer values |
combo |
select from a list of items |
Not all attributes and elements of the
widget
descriptor are applicable to all types.
title | used with checkbox and radiobutton widgets for the button label. |
style | carries the SWT style constants appropriate to the widget type. |
readOnly | applies to text , spinner and combo widgets; the text area in this case is not editable. |
attribute | applies to all widgets except label ; the name of the property or attribute whose value should be set on the basis of the widget's "selected" value or textNote: use of this attribute is mutually exclusive with the fixed- and dynamic-text elements. |
items-from
|
applies to the combo widget; indicates a property or attribute value (of type java.util.Collection ) to use to populate the items.Note: if a combo widget saves its value to an attribute and the present element on the widget descriptor is not set, the attribute will be checked for non-empty choice or items-from values, which then will be used to populate the combo items; the items-from field on the widget is for added flexibility (for instance, linking up a target property with another property providing the list of choices). |
fixed-text
|
applies to the label and (read-only) text widgets; text is resolved once and remains constant thereafter. |
dynamic-text
|
applies only to (read-only) text widgets; text is resolved each time there is an update of a value from the Launch Tab. |
Widget foreground, background (color) and font are in principle configurable, though the font setting may not always work. The tooltip will appear on the widget whereever it usually does.
The button-group
Element
The button group provides an explicit selection of a single value based
on a mutually exclusive set of button labels. Just as with the
widget
, one uses attribute to indicate what property or attribute
value to set. The
button
elements, which can be arranged horizontally or vertically using the style
attribute (SWT.HORIZONTAL, SWT.VERTICAL), will be
widget
elements whose type is either checkbox or radiobutton (if
a different widget type is given, an error will occur); the attribute
attribute on the individual button will be ignored. Radio buttons are
"sticky" in the sense that once they are selected, they cannot be
"unchecked" until a different button in the group is selected. It is
thus advisable always to provide a default value for the property or
attribute associated with the group value if radio buttons are used. If
check boxes are used, they can be individually checked on and off (one
at a time), thus allowing for the value of the group to be potentially
empty.
Below is an example of a button group using radio buttons, disposed vertically.
The browse
Element
A special widget-pair -- a text widget followed by a push button --
allowing the user to browse directories or files from either the
local or remote file system. Aside from the fields shared with the
widget
type, we note the following:
directory | limit the browse widget to directories. |
uri | have the browse widget set a URI as the text
(default is false, in which case the text is an absolute path). |
localOnly | limit the browse widget to the local file
system (otherwise, the choice of local or remote is enabled). |
layout
and
control-state
are specified as belonging to either the text or the button. Style applies only to the text widget.
The action-button
Element
A push button which is associated with a
button-action
command. The
button-action
specifies the name of a command (
action
) defined in the XML; this can be one of the
start-up
or
shut-down
commands, or a special
button-action
command (note that it cannot, however, be a (uu)id
command, as the Launch Tab is not coupled to any submitted jobs); the
button-action-type
also indicates whether the variable affected by the command should be
cleared prior to the command execution, and whether the launch tab
should be refreshed (default is true) afterward.
The attribute-viewer
Element
The last of the widget types, the
attribute-viewer
, displays visible properties and attributes in a table or tree
viewer, with the value columns as editable widgets. The type of cell
editor is determined implicitly here by the type of the property or
attribute; the presence of a combo editor is determined by an
attribute whose
choice
or
items-from
element is defined. (This is the only viable possibility for a combo
in the table, as there would also need to be a target to save the
selected item to, viz., the value of this same attribute.)
Each entry in the viewer is provided with a checkbox to indicate which attributes have been chosen (and thus marked valid). A checkbox is also placed above the viewer to allow for filtering the unchecked items. The viewer outputs its values to the environment. The name of the viewer is also stored against a string of all selected values, constructed from a template (further discussion at the end of this section).
The viewer type can be either "table" or "tree"; in the case of the tree, the type, default, status and description fields of the attribute or property are displayed as collapsible child rows hanging off the main name entry. Either table or tree can have up to six columns whose names/types are based on the following property and attribute fields:
Name |
Value |
Type |
Status |
Default |
Description |
These can appear in any order. There is a slight difference in how these columns are treated, depending upon whether tree or table is chosen as type. For table type, the value of the field simply appears in the corresponding column:
In the tree type, the
name
field displays all fields except
value
as children;
value
always appears as a separate column on the same row as the name (it
contains the cell editor). If the viewer also has the column
corresponding to the child field, that field value appears in that
column;
Otherwise, the child fields display their value in the value column
along with the actual
value
field:
Tooltips for the column appear when hovering over the column header. (A caveat on the foreground and background settings for this type: they are exposed here but may not work as expected: these are currently applied to the column through the cell label provider; foreground does not change the font color but applies the color to the entire cell.)
sort is done on the name field/column. This is a toggle sort, such that alternate clicks reverse the sort direction. headerVisible and linesVisible control the look of the table. tooltipEnabled enables column-viewer label provider tooltip support. The tooltip defined for an attribute will appear on the row of the table or the parent node of the tree. The attribute description will also appear as a tooltip on the description columns if they exist.
Control over the actual contents of the viewer is achieved through the
viewer-items-type
; one can here explicitly list properties or attributes for inclusion
using the
include
element, or one can set one or both of the all attributes; in
the latter case, attributes and properties can be excluded from the
set using
exclude
. The initialCheckAll field, which defaults to true, indicates
that if the attribute or property is new or undefined from a previous
session, it will appear as checked; turn this off if you want new
attributes or properties to appear at first as unchecked.
A property bearing the name given to this widget is written to the
environment at each update/refresh; the value of the property is a
string formed from the selected (checked) rows of the table using the
template-type
(the
value
element). The pattern can be any literal string embedding @name
and @value tags as placeholders for the row's name and value;
for instance, '@name="@value"' would generate a string of name-value
assignments. The default separator between patterned segments
is a single space.
Below is a fuller example of the tree version of the attribute viewer. The text area above it displays the current value of its templated pattern string. The child rows reporting status, default, etc., cannot be selected or unselected (only the parent row can). Hovering over the name (here walltime) displays the tooltip.
The monitor-data
Element
At present, the monitor component has less elements to define; this may change in future releases, when certain hard-coded features of the driver may be handed over to the client for configuration.
This component can be furnished with its own set of properties
(currently unused). What is necessary to set at present is only the schedulerType
attribute. It specifies the target system's job scheduler. Currently supported batch systems are
COBALT_BG, GRIDENGINE, LL, LL_BG, LSF, OPENMPI, PBS, PE, SLURM, SLURM_ALPS, TORQUE and TORQUE_ALPS.
These names can be used as attribute values for schedulerType.
refreshFrequencyInSeconds defaults to
60, but this can be changed according to the needs of the user. Be
aware that too low a setting will probably not work, as the command on
an average sized system will take upwards of five seconds to complete
(XML is being streamed to the driver and a
diff
is sent back to the client ).
If the
driver
element is configured, then the default behavior, which is to stage
over the necessary driver scripts to the .eclipsesettings
directory in the user's home, is overridden by connecting to a
pre-existent installation. This can be specified either using the
url
, or a combination of the
name
,
path
and
args
elements.