A controller responsible for running the stages in the correct
order.
Pipeline parses the input file, creates instances of all IO objects,
stage objects, and stage job objects, submits the stages in the
appropriate directories, and waits for them to finish. Once a stage
finishes, it starts any stages that depend on its output. When all stages
are complete, it presents the user with the USER OUTPUT objects - IO
output objects that are to be returned by the pipeline.
|
__init__(self,
jobname=' pipeline ' ,
prog=None,
logfh=None,
restart_file=None) |
|
|
|
setOptions(self,
subjob_hosts=None,
subjob_local=None,
njobs=None,
adjust=None,
force=None,
cleanup=None,
max_retries=None)
Set the options of the pipeline. |
|
|
|
readNewFormatVariable(self,
varname,
keywords) |
|
|
|
readNewFormatStage(self,
stagename,
keywords) |
|
|
|
|
|
_setStructOut(self,
value) |
|
|
|
readNewFormat(self,
command_file) |
|
|
|
readFile(self,
command_file)
Read a Pipeline input file. |
|
|
|
checkUserOutputs(self)
Make sure that all specified user outputs are variable names that are
returned by a stage. |
|
|
|
createStage(self,
stagename,
inIOnames,
outIOnames,
stage_class,
keywords)
Create a stage object and add it to the pipeline. |
|
|
|
_addVariable(self,
var_obj,
varname)
Add an initiation variable to the pipeline. |
|
|
|
reportParameters(self)
Print the parameters of each stage. |
|
|
|
inputsForStagesDefined(self)
Check if the inputs for all stages are either specified in the input
variables or are outputs from other stages. |
|
|
|
|
|
setStageOptions(self,
stageobj)
Propagate the pipeline options (hosts, ncpus, etc) to the specified
stage. |
|
|
|
|
|
updateStagesCpus(self)
Send messages to stages (if necessary) telling them how many
processors to use from each host. |
|
|
|
_submitStage(self,
stagejob)
Start the specified stage. |
|
|
|
|
|
|
|
|
|
dump(self)
Dumps the Pipeline instance to a restart file |
|
|
|
_getMaxRetries(self)
Unimplemented. |
|
|
|
getStageByJobid(self,
jobid) |
|
|
|
handleStageMessages(self) |
|
|
|
distributeCpus(self)
Called when extra CPUs become available (as given back by the stages
using them). |
|
|
|
run(self,
idle_function=None)
Run the Pipeline. |
|
|
|
getUserOutputs(self)
Return a list of pipeio objects that are to be presented to the user
at the end of the Pipeline run. |
|
|
|
|
|
getUserOutputFiles(self)
Return a list of files for all user (final) outputs of Pipeline. |
|
|
|
printAction(self,
stagename,
stagejobid,
action)
Print an action for stage stagename . |
|
|
|
|
|
getUsedHosts(self,
host_pool)
Return a dictionary of hosts that are CURRENTLY checked out (used) by
all stages combined (within specified host_pool). |
|
|