schrodinger.job.launchapi module

Provide an API that allows a definition of a job.

A job is composed of two pieces:

  1. A task specification that defines a generic command-line invocation and defines what input files are needed and what output files are produced.
  2. Runtime parameters that include specific input and output filenames.

To launch a job, you must also provide a set of launch parameters, which specify things like the job host, number of threads per process, etc.

To construct a job specification for a script that you want to run under job control, use the JobSpecificationArgsBuilder. This creates a JobSpecification from a list of command-line arguments. You must indicate the specific input and output files in the command-line arguments so they can be transferred and retrieved.

exception schrodinger.job.launchapi.TaskSpecificationError

Bases: RuntimeError

__init__

Initialize self. See help(type(self)) for accurate signature.

args
with_traceback()

Exception.with_traceback(tb) – set self.__traceback__ to tb and return self.

exception schrodinger.job.launchapi.RuntimeParametersError

Bases: RuntimeError

__init__

Initialize self. See help(type(self)) for accurate signature.

args
with_traceback()

Exception.with_traceback(tb) – set self.__traceback__ to tb and return self.

exception schrodinger.job.launchapi.SpecificationKeyError(key, context=None)

Bases: KeyError

An error thrown when a key is missing from a serialized specification.

spec_description = 'specification'
__init__(key, context=None)

Initialize self. See help(type(self)) for accurate signature.

args
with_traceback()

Exception.with_traceback(tb) – set self.__traceback__ to tb and return self.

exception schrodinger.job.launchapi.TaskSpecificationKeyError(key, context=None)

Bases: schrodinger.job.launchapi.SpecificationKeyError

spec_description = 'task specification'
__init__(key, context=None)

Initialize self. See help(type(self)) for accurate signature.

args
with_traceback()

Exception.with_traceback(tb) – set self.__traceback__ to tb and return self.

exception schrodinger.job.launchapi.RuntimeParametersKeyError(key, context=None)

Bases: schrodinger.job.launchapi.SpecificationKeyError

spec_description = 'runtime parameters'
__init__(key, context=None)

Initialize self. See help(type(self)) for accurate signature.

args
with_traceback()

Exception.with_traceback(tb) – set self.__traceback__ to tb and return self.

exception schrodinger.job.launchapi.JobSpecificationKeyError(key, context=None)

Bases: schrodinger.job.launchapi.SpecificationKeyError

spec_description = 'job specification'
__init__(key, context=None)

Initialize self. See help(type(self)) for accurate signature.

args
with_traceback()

Exception.with_traceback(tb) – set self.__traceback__ to tb and return self.

schrodinger.job.launchapi.check_valid_key(key)

Make sure the provided key is valid. Raise a RuntimeError if it’s not.

class schrodinger.job.launchapi.TaskSpecification

Bases: object

A class that holds the information necessary to run a task on a compute resource.

It holds a templated shell command along with info about required input, output created, and information about how to generate a jobname. It doesn’t hold any information specific to a job run - e.g. hostname, input filenames, output filenames.

This class has no methods that modify instance state. Use a builder class (TaskSpecificationBuilder or JobSpecificationBuilder) to set up a TaskSpecification.

__init__()

Initialize self. See help(type(self)) for accurate signature.

validate()

Make sure that the variables used in the command-line template are defined with input or output specifications. Raise a TaskSpecificationError if not.

static fromJSON(json_spec)
asJSON(**kwargs)
getCommandTemplate()

Return the command-line as a list of arguments with keys in the form of <KEY> that will be replaced with runtime parameters.

needsExplicitJobname()

Return True if a jobname must be specified as part of the runtime parameters.

getInputKeys()

Return a list of keys that define inputs.

getJobnameSource()

Return the input key that should be used to derive the jobname.

getProgramName()
isStoppable()

If True, a job is stoppable by sending ‘halt’ message to the backend. And transfer output files if any. Otherwise, job supervisor kills the backend and transfer output files.

getLicenses()

Returns licenses as a list of tuples of ( license_name, num_tokens)

driverReservesCores()

If True, the driver will allocate N slots matching -HOST hostname: N processes. This is useful when the driver allocates parallel slots on one node so it can resubmit jobs to localhost. If no N is specified, assume to be 1.

This is an optimization to avoid resubmission back to the queue if we know we are running a set of parallel jobs with the same CPU requirements.

jobUsesTPP()

If True, this job will allocate N slots on one node matching -TPP N.

If False, this job only passes TPP on as an option for subjobs.

TPP stands for Threads Per Process.

getOutputKeys()

Return a list of keys that define outputs.

useSchrodingerRun()

Return whether to run the command under $SCHRODINGER/run.

useSchrodingerProduct()

Return whether to run the command under $SCHRODINGER/run.

schrodinger.job.launchapi.var(string)

Format the provided string to be used as a variable in the TaskSpecification command-line template.

class schrodinger.job.launchapi.TaskSpecificationBuilder(data=None)

Bases: object

A class for building up a TaskSpecification from a specific list of command-line arguments.

__init__(data=None)

Initialize self. See help(type(self)) for accurate signature.

getTaskSpec()

Return a TaskSpecification from this builder.

setCommandLine(args, use_schrodinger_run=True, schrodinger_product=None)

Set the command line as provided. Add a SCHRODINGER/run if requested.

Parameters:
  • args (list of str) – The command-line script and arguments.
  • use_schrodinger_run – If True, run the command under $SCHRODINGER/run.
  • schrodinger_product (str) – A product directory to search for the script/executable. This should be the name of a directory under SCHRODINGER without the trailing version (i.e. the “-v*” part).
setInputFile(key, jobname_source=False)

Set an input file for the task specification.

addLicense(license_name, license_tokens)
Parameters:
  • license_name (schrodinger.infra.licensing.MMLIC3*) – Name of a license token that is checked out.
  • license_tokens (str) – The number of tokens required by this job type.
setOutputFile(key, runtime_path, stream=False, incorporate=False)
Parameters:
  • runtime_path (str) – The path of an output file that will be generated. May contain input keys in the form <KEY> or <JOBNAME> (if a jobname source has been specified).
  • key (str) – A key to be used to reference this output file.
  • stream (bool) – If True and if possible, the output file will be streamed back while the job is running. Defaults to False. Optional.
setStderr(runtime_path, stream=False)
Parameters:
  • runtime_path (str) – A runtime_path template with input keys in the form <KEY>.
  • stream (bool) – If True and if possible, the file will be streamed back while the job is running. Defaults to False. Optional.
setStdout(runtime_path, stream=False)
Parameters:
  • runtime_path (str) – A runtime_path template with input keys in the form <KEY>.
  • stream (bool) – If True and if possible, the file will be streamed back while the job is running. Defaults to False. Optional.
setStoppable(stoppable)

If passed True, the job supervisor sends the ‘halt’ message to the backend to stop. And transfer output files if any. Otherwise, the supervisor kills the backend and transfer output files.

setProgramName(program_name)
setDriverReservesCores(reserved)

If passed True, the driver will allocate N slots matching -HOST hostname: N processes. This is useful when the driver allocates parallel slots on one node so it can resubmit jobs to localhost. If no N is specified, assume to be 1.

This is an optimization to avoid resubmission back to the queue if we know we are running a set of parallel jobs with the same CPU requirements.

setJobUsesTPP(uses_tpp)

If passed True, this job will reserve the number of cores specified from -TPP on the cmdline. Unfortunately, we use -TPP N to mean two things

  1. this job will reserve N processors on one node directly
  2. this job takes -TPP as a cmdline argument, but only to pass along to subjobs

In case 1, the uses_tpp should be set to True.

schrodinger.job.launchapi.get_file_url(path)

Get a file:// URL for a file path.

class schrodinger.job.launchapi.JobSpecification(task_spec, runtime_params)

Bases: object

This class provides a serializable job definiton and consists of a task specification along with runtime parameters.

Currently unsupported features:
  • Runtime requirements (e.g. GPU, memory requirements).
Features that will probably never be supported:
  • The equivalent of job control’s ‘-LOCAL’ option.
  • Full shell commands. (Arguments must be provided as a list of strings. Chaining commands together is not supported. Redirection is provided only through APIs.)
__init__(task_spec, runtime_params)

Initialize self. See help(type(self)) for accurate signature.

asJSON(**kwargs)
static fromJSON(json_job_spec)

Return a JobSpecification instance created from the provided JSON.

validate()

A validation method that makes sure the JobSpecification has all the data it needs and is self-consistent.

getCommand()

Return the shell command that will run the job.

Returns:list of str
getOutputFile(key, stream=None, incorporate=None)

Get the output file corresponding to the given key from the job specification.

Parameters:stream – If stream=True, the file is returned only if it is a log file. If stream=False, the file is returned only if it is an output file. If stream=None, the file is returned.
getOutputFiles(stream=None, incorporate=None)

Get all the output files from the job specification.

Parameters:
  • stream (True, False or None(Default)) – If stream=None, both output and log files are returned. If stream=True, only log files are returned. If stream=False, only output files are returned. Defaults to None. Optional.
  • incorporate – If incorporate=None, incorporatable and other files are returned. If incorporate=True, only files to be incorporated are returned. If incoporate=False, only output files not incorporated are returned.
getStderr()
getStdout()
getDefaultStderr()
getDefaultStdout()
useSchrodingerRun()
getInputFiles()
getHost()
debugEnabled()
getJobname()
setJobname(jobname)
getProgramName()
isStoppable()

If True, a job is stoppable by sending ‘halt’ message to the backend. And transfer output files if any. Otherwise, job supervisor kills the backend and transfer output files.

driverReservesCores()

If True, the driver will allocate N slots matching -HOST hostname: N processes. This is useful when the driver allocates parallel slots on one node so it can resubmit jobs to localhost. If no N is specified, assume to be 1.

This is an optimization to avoid resubmission back to the queue if we know we are running a set of parallel jobs with the same CPU requirements.

jobUsesTPP()

If True, this job will allocate N slots on one node matching -TPP N.

If False, this job uses TPP as an option to pass along to subjobs

getLicenses()

Returns licenses as a list of tuples of ( license_name, num_tokens)

class schrodinger.job.launchapi.JobSpecificationBuilder(task_spec)

Bases: object

A helper class to create a JobSpecification from an existing TaskSpecification.

__init__(task_spec)

Initialize self. See help(type(self)) for accurate signature.

getJobSpec()
setJobname(jobname)
setInputFiles(**input_files)

Indicate the filenames to be used for the input keys.

Parameters:input_files – Keyword arguments with the key as INPUT_KEY and the value as the associated filename.
setOutputFiles(**output_files)

Indicate the filenames to be used for the output keys.

Parameters:output_files – Keyword arguments with the key as OUTPUT_KEY and the value as the associated filename.
class schrodinger.job.launchapi.JobSpecificationArgsBuilder(args, use_schrodinger_run=True, schrodinger_product=None, program_name=None, default_jobname=None, use_jobname_log=False)

Bases: object

A helper class to create a JobSpecification from a specific (i.e. non-generic) set of command-line arguments.

__init__(args, use_schrodinger_run=True, schrodinger_product=None, program_name=None, default_jobname=None, use_jobname_log=False)

See TaskSpecificationBuilder.setCommandLine for argument descriptions.

Parameters:use_jobname_log – If True, set the STDOUT, STDERR as <JOBNAME>.log and stream it. Default is False.
getJobSpec()
setJobname(jobname)
getJobname()

Get the job name set from runtime parameters if there is one

This will return any jobname set via command line or the setJobname function. It will not return any jobname set using jobname_source=True when setting an input/output file.

Return type:str or None
Returns:The current set job name, or None if no name is set
setProgramName(program_name)
setInputDirectory(path, runtime_path=None)

Register a directory as input for the job. All files found under it recursively are registered as input.

Please avoid using this method unless you know you want all the files in the directory.

If the path given is absolute or a relative path not in a subdirectory, the default runtime path will be in the job directory. If the path is relative and in a subdirectory, it will be used as is.

setInputFile(path, key=None, runtime_path=None)

Register a file as input for the job.

Parameters:runtime_path (str) – The path that will be used for this input file at runtime. If not specified, this will be the relative path of the file if it is in a subdirectory, or the filename only if not.
setStderr(runtime_path, stream=False)
setStdout(runtime_path, stream=False)
setOutputFile(runtime_path, key=None, stream=False, incorporate=False)
Parameters:
  • runtime_path (str) – The path of an output file that will be created by the job.
  • stream (bool) – If True and if possible, the output file will be streamed back while the job is running. Defaults to False. Optional.
  • incoporate – If True, mark this file for incorporation in maestro. NOTE: Only one file can be declared incorporatable. Defaults to False. Optional.
addLicense(license_string, license_tokens)
setDriverReservesCores(reserved)

If passed True, the driver will allocate N slots matching -HOST hostname: N processes. This is useful when the driver allocates parallel slots on one node so it can resubmit jobs to localhost. If no N is specified, assume to be 1.

This is an optimization to avoid resubmission back to the queue if we know we are running a set of parallel jobs with the same CPU requirements.

setJobUsesTPP(uses_tpp)

If uses_tpp is True, this job will reserve the number of cores specified from -TPP (Threads Per Process) on the command line.

Currently we use -TPP N to mean two different things: 1) Reserve N processors for the job being launched. 2) Reserve N processors for each subjob of the job being launched.

Case 1 is the case that needs uses_tpp=True.