nf-core/drugresponseeval
Pipeline for testing drug response prediction models in a statistically and biologically sound way.
Define the models and baselines to be tested.
Model to be tested.
string
Model to be tested.
Baselines to be tested.
string
Baselines to be tested.
Define where the pipeline should find input data and save output data.
Run ID for the pipeline.
string
You will need to set a run identifier for the pipeline. This is used to create a unique output directory for each run.
Name of the dataset.
string
Name of the dataset used for the pipeline. Allowed values are GDSC1, GDSC2, and Custom.
The output directory where the results will be saved. Default is results/
string
Email address for completion summary.
string
^([a-zA-Z0-9_\-\.]+)@([a-zA-Z0-9_\-\.]+)\.([a-zA-Z]{2,5})$
Set this parameter to your e-mail address to get a summary e-mail with details of the run sent to you when the workflow exits. If set in your user config file (~/.nextflow/config) then you don't need to specify this on the command line for every run.
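To see which addresses the pattern above accepts, here is a minimal Python sketch. The regular expression is copied verbatim from the parameter definition; the helper name `is_valid_email` is hypothetical and not part of the pipeline.

```python
import re

# Pattern copied verbatim from the email parameter definition above.
EMAIL_PATTERN = re.compile(
    r"^([a-zA-Z0-9_\-\.]+)@([a-zA-Z0-9_\-\.]+)\.([a-zA-Z]{2,5})$"
)


def is_valid_email(address: str) -> bool:
    """Return True if the address matches the schema pattern (hypothetical helper)."""
    return EMAIL_PATTERN.match(address) is not None


if __name__ == "__main__":
    for candidate in ["user@example.org", "user@example", "user@example.museum"]:
        print(candidate, is_valid_email(candidate))
```

Note that the pattern limits the final domain label to 2-5 letters, so an address ending in a longer top-level domain such as `.museum` would be rejected.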
Define the mode in which the pipeline will be run.
Run the pipeline in test mode LPO (Leave-random-Pairs-Out), LCO (Leave-Cell-line-Out), or LDO (Leave-Drug-Out).
string
LPO
Which tests to run (LPO=Leave-random-Pairs-Out, LCO=Leave-Cell-line-Out, LDO=Leave-Drug-Out). Can be a list of test runs e.g. 'LPO LCO LDO' to run all tests. Default is LPO.
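The difference between the three test modes can be illustrated with a small Python sketch. It is not the pipeline's implementation: the function names, the `test_fraction` argument, and the single train/test split are assumptions made for illustration only. It only shows what each mode holds out: random pairs (LPO), whole cell lines (LCO), or whole drugs (LDO).

```python
import random

ALLOWED_MODES = ("LPO", "LCO", "LDO")


def parse_test_modes(value):
    """Split a space-separated list such as 'LPO LCO LDO' and validate it."""
    modes = value.split()
    unknown = [m for m in modes if m not in ALLOWED_MODES]
    if unknown:
        raise ValueError(f"Unknown test mode(s): {unknown}")
    return modes


def split_pairs(pairs, mode, test_fraction=0.2, seed=0):
    """Split (cell_line, drug) pairs into (train, test) for one mode.

    Hypothetical helper: a single split, whereas the pipeline uses
    cross-validation.
    """
    if mode not in ALLOWED_MODES:
        raise ValueError(f"Unknown test mode: {mode}")
    rng = random.Random(seed)
    if mode == "LPO":
        # Hold out randomly chosen pairs.
        shuffled = list(pairs)
        rng.shuffle(shuffled)
        n_test = max(1, int(len(shuffled) * test_fraction))
        return shuffled[n_test:], shuffled[:n_test]
    # LCO groups by cell line (index 0), LDO groups by drug (index 1).
    index = 0 if mode == "LCO" else 1
    groups = sorted({pair[index] for pair in pairs})
    rng.shuffle(groups)
    n_test = max(1, int(len(groups) * test_fraction))
    held_out = set(groups[:n_test])
    train = [p for p in pairs if p[index] not in held_out]
    test = [p for p in pairs if p[index] in held_out]
    return train, test
```

With LCO, no cell line in the test set appears in the training set; with LDO, the same holds for drugs. LPO gives no such guarantee, which is why it is usually the easiest setting.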
Options for randomization.
Randomization mode for the pipeline.
string
Which randomization tests to run in addition to the normal run. Default is None, which means no randomization tests are run. Modes: SVCC, SVRC, SVCD, SVRD. Can be a list of randomization tests, e.g. 'SVCC SVCD' to run two tests. SVCC (Single View Constant for Cell Lines): in this mode, one experiment is done for every cell line view the model uses (e.g. gene expression, mutation, ...). For each experiment, one cell line view is held constant while the others are randomized. SVRC (Single View Random for Cell Lines): in this mode, one experiment is done for every cell line view the model uses (e.g. gene expression, mutation, ...).
Randomization type for the pipeline.
string
permutation
Type of randomization to use. Choose from "permutation" or "invariant". Default is "permutation".
Options for robustness.
Number of trials to run for the robustness test
integer
Number of trials to run for the robustness test. Default is 0, which means no robustness test is run. The robustness test is a test where the model is trained with varying seeds. This is done multiple times to see how stable the model is.
Options for data input.
Path to the data directory.
string
Path to the data directory.
Datasets for cross-study prediction.
string
List of datasets to use to evaluate predictions across studies. Default is empty string which means no cross-study datasets are used.
Additional options for the pipeline.
Run the curve curator.
boolean
Whether to run CurveCurator to sort out non-reactive curves.
If this is true, intermediate datasets are saved.
boolean
This includes the pickled response/cross validation datasets, the pickled CV splits, the hyperparameter split yamls, the pickled predictions of each trained model, the best hyperparameters, and the randomization test view yamls. Consider this more of a debug flag.
Optimization metric for the pipeline.
string
RMSE
Optimization metric for the pipeline. Default is RMSE.
Number of cross-validation splits.
integer
10
Number of cross-validation splits. Default is 10.
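As a reminder of what the number of splits controls, the following Python sketch partitions sample indices into k contiguous folds. It is a generic k-fold illustration under simple assumptions (no shuffling, no grouping by cell line or drug) and does not reproduce how the pipeline builds its splits; the function name `k_fold_indices` is hypothetical.

```python
def k_fold_indices(n_samples, n_splits):
    """Return a list of (train_indices, test_indices) tuples, one per fold."""
    if not 2 <= n_splits <= n_samples:
        raise ValueError("n_splits must be between 2 and n_samples")
    # Distribute the remainder over the first folds so sizes differ by at most 1.
    base, remainder = divmod(n_samples, n_splits)
    folds = []
    start = 0
    for fold in range(n_splits):
        size = base + (1 if fold < remainder else 0)
        test = list(range(start, start + size))
        train = list(range(0, start)) + list(range(start + size, n_samples))
        folds.append((train, test))
        start += size
    return folds


if __name__ == "__main__":
    for train, test in k_fold_indices(n_samples=10, n_splits=5):
        print("test:", test, "train size:", len(train))
```

Each sample lands in exactly one test fold, so a larger number of splits means more training runs, each with a smaller test set.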
Response transformation
string
Transformation to apply to the response variable. Possible values: standard, minmax, robust.
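The parameter description does not define the three transformations. The Python sketch below assumes they follow the usual conventions of the same names (as in scikit-learn's StandardScaler, MinMaxScaler, and RobustScaler): centering by the mean and scaling by the standard deviation, rescaling to [0, 1], and centering by the median and scaling by the interquartile range. The function `transform_response` is hypothetical, and the pipeline's actual behaviour may differ.

```python
import statistics


def transform_response(values, method):
    """Apply one of the assumed transformations to a list of response values.

    Assumes the values are not all identical (otherwise the scale is zero).
    """
    if method == "standard":
        mean = statistics.fmean(values)
        std = statistics.pstdev(values)  # population standard deviation
        return [(v - mean) / std for v in values]
    if method == "minmax":
        low, high = min(values), max(values)
        return [(v - low) / (high - low) for v in values]
    if method == "robust":
        median = statistics.median(values)
        q1, _, q3 = statistics.quantiles(values, n=4, method="inclusive")
        return [(v - median) / (q3 - q1) for v in values]
    raise ValueError(f"Unknown transformation: {method}")


if __name__ == "__main__":
    response = [1.0, 2.0, 3.0, 4.0, 5.0]
    for method in ("standard", "minmax", "robust"):
        print(method, transform_response(response, method))
```

The robust variant is less sensitive to outlying response values because the median and interquartile range are barely affected by a few extreme measurements.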
Parameters used to describe centralised config profiles. These should not be edited.
Git commit id for Institutional configs.
string
master
Base directory for Institutional configs.
string
https://raw.githubusercontent.com/nf-core/configs/master
If you're running offline, Nextflow will not be able to fetch the institutional config files from the internet. If you don't need them, then this is not a problem. If you do need them, you should download the files from the repo and tell Nextflow where to find them with this parameter.
Institutional config name.
string
Institutional config description.
string
Institutional config contact information.
string
Institutional config URL link.
string
Less common options for the pipeline, typically set in a config file.
Display version and exit.
boolean
Method used to save pipeline results to output directory.
string
The Nextflow publishDir option specifies which intermediate files should be saved to the output directory. This option tells the pipeline what method should be used to move these files. See Nextflow docs for details.
Email address for completion summary, only when pipeline fails.
string
^([a-zA-Z0-9_\-\.]+)@([a-zA-Z0-9_\-\.]+)\.([a-zA-Z]{2,5})$
An email address to send a summary email to when the pipeline is completed - ONLY sent if the pipeline does not exit successfully.
Send plain-text email instead of HTML.
boolean
File size limit when attaching MultiQC reports to summary emails.
string
25.MB
^\d+(\.\d+)?\.?\s*(K|M|G|T)?B$
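The size pattern above can be checked against candidate values with a short Python sketch. The regular expression is copied verbatim from the parameter definition; the helper name `is_valid_size` is hypothetical.

```python
import re

# Pattern copied verbatim from the file size limit parameter above.
SIZE_PATTERN = re.compile(r"^\d+(\.\d+)?\.?\s*(K|M|G|T)?B$")


def is_valid_size(value: str) -> bool:
    """Return True if the value matches the schema pattern (hypothetical helper)."""
    return SIZE_PATTERN.match(value) is not None


if __name__ == "__main__":
    for candidate in ["25.MB", "10 GB", "1.5GB", "512B", "25", "25MiB"]:
        print(candidate, is_valid_size(candidate))
```

The pattern requires a number followed by an optional dot, optional whitespace, an optional K, M, G, or T prefix, and a mandatory trailing `B`, so the default `25.MB` is accepted while a bare `25` is not.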
Do not use coloured log outputs.
boolean
Incoming hook URL for messaging service
string
Incoming hook URL for messaging service. Currently, MS Teams and Slack are supported.
Whether to validate parameters against the schema at runtime.
boolean
true
Base URL or local path to location of pipeline test dataset files
string
https://raw.githubusercontent.com/nf-core/test-datasets/