nf-core/spatialxe
Define where the pipeline should find input data and save output data.
Path to comma-separated file containing information about the Xenium experiment. (eg; meta,path-to-xenium-bundle,path-to-morphology.ome.tif))
string^\S+\.csv$The output directory where the results will be saved. You have to use absolute paths to storage on Cloud infrastructure.
stringMode in which the pipeline is to be run. Either image-based segmentation, coordinate-based segmentation, segmentation-free analysis or data preview.
stringSegmentation method to run.
stringPath to gene panel JSON file to use for relabeling transcripts with the correct gene.
stringPath to qupath segmentation file in GeoJSON format.
stringImage alignment file containing similarity transform matrix. (e.g., the _imagealignment.csv file exported from Xenium Explorer)
stringModel to use for running or starting training.
stringStarDist pretrained model for cell segmentation (e.g., ‘2D_versatile_fluo’, ‘2D_versatile_he’).
string2D_versatile_fluoStarDist pretrained model for nuclei segmentation.
string2D_versatile_fluoStarDist object probability threshold. Lower values detect more objects.
numberStarDist non-maximum suppression threshold. Lower values reduce overlapping detections.
numberStarDist tiling for large images (e.g., ‘4 4’). Reduces memory usage.
string8 8Prior segmentation mask from other segmentation methods.
stringFasta file for the probe sequences used in the xenium experiment.
stringPath to the directory containing genomic features (.gff) and fasta (.fa) files used as reference annotations.
stringGene synonyms that may have been counted as off-targets but simply differ in name.
stringEmail address for completion summary.
string^([a-zA-Z0-9_\-\.]+)@([a-zA-Z0-9_\-\.]+)\.([a-zA-Z]{2,5})$MultiQC report title. Printed as page header, used for filename if not otherwise specified.
stringOptions for the segmentation layer of the spatialxe pipeline
Whether to run the qc layer in the pipeline.
booleantrueWhether to run the off-target probe tracking.
booleanWhether to run refinement on the image-based segmentation methods. Runs coordinate-based methods after the initial image-based segmentation run.
booleanWhether to relabel genes with gene_panel.json file. True when gene_panel is provided.
booleanWhether to run vanilla xeniumranger workflow.
booleanWhether to only run nucleus segmentation.
booleanWhether to only run nucleus segmentation.
booleanNuclei boundary expansion distance in µm. Default: 5 (Min: 0, Max: 15 if either boundary-stain or interior-stain are enabled and 100 if nucleus-expansion only)
integer5Minimum intensity in photoelectrons (pe) to filter nuclei. Default: 100. (appropriate range of values is 0 to 99th percentile of image stack or 1000, whichever is larger)
integer100Specify the name of the interior stain to use or disable. Supported for cell segmentation staining workflow output bundles. Possible options are: “18S” (default) or “disable”
booleantrueSpecify the name of the boundary stain to use or disable. Supported for cell segmentation staining workflow output bundles. Possible options are: “ATP1A1/CD45/E-Cadherin” (default) or “disable”
booleantrueEnable GPU acceleration (set automatically by the gpu profile).
booleanAWS Batch queue for GPU tasks (e.g., Segger, ProSeg).
stringAWS Batch queue for Cellpose (single large GPU).
stringPre-downscale morphology image to avoid Cellpose OOM on large images.
booleanWhether to enhance the morphology.ome.tif file.
booleanDevice used for training. (e.g., cuda for GPU or cpu)
stringMethod for KNN computation. (e.g., cuda for GPU-based computation)
stringNumber of data-loader workers for Segger.
integer4Path to a pre-trained Segger model checkpoint.
stringPreset value for the proseg segmentation method.
stringxeniumList of image-based segmentation methods.
arrayList of transcript-based segmentation methods.
arrayList of segmentation-free methods.
arrayRegex used to identify or match negative control samples in a dataset.
stringList of features to be passed to the ficture method. (eg: TP53,OCIAD1,BCAS3,SOX)
stringWhether to filter the transcripts.parquet file before running Baysor segmentation.
booleanBaysor —scale parameter for non-tiled runs.
integer30Path to Baysor config TOML file (optional).
stringEnable tiled Baysor segmentation (divide transcripts into patches, run Baysor per patch, stitch results).
booleantrueTile width in microns for Baysor tiling.
integer1200Overlap between Baysor patches in microns.
integer200Balance transcripts across tiles by merging sparse tiles.
booleantrueBaysor —scale for tiled runs (larger to compensate for EM on smaller tiles).
integer39Minimum molecules per cell (—min-molecules-per-cell) for tiled Baysor.
integer120Post-stitch cell filtering threshold: minimum transcripts per cell.
integer50Prior segmentation type for Baysor. ‘cells’ uses Xenium bundle cell_id column; ‘cellpose’ uses Cellpose mask as image prior.
stringBaysor prior-segmentation-confidence (0-1).
number0.2Minimum Q-Score to pass filtering.
number20only keep transcripts whose x-coordinate is greater than specified limit, if no limit is specified, the default minimum value will be 0.0
numberonly keep transcripts whose x-coordinate is less than specified limit, if no limit is specified, the default value will retain all transcripts since Xenium slide is <24000 microns in x and y (default: 24000.0)
numberonly keep transcripts whose y-coordinate is greater than specified limit, if no limit is specified, the default minimum value will be 0.0
numberonly keep transcripts whose y-coordinate is less than specified limit, if no limit is specified, the default value will retain all transcripts since Xenium slide is <24000 microns in x and y (default: 24000.0)
numberEnable tiled segmentation for large datasets. Divides transcripts into overlapping patches, runs segmentation in parallel per patch, then stitches results.
booleanGrid layout for tiling (rows x cols), e.g. ‘3x3’, ‘4x4’.
string3x3Overlap between adjacent patches in microns.
integer50Post-stitch cell size filtering method. Options: ‘empirical’ (IQR-based), ‘distribution’ (z-score), ‘both’, or null to disable.
stringIQR multiplier for empirical cell size filtering during stitching.
number3Z-score threshold for distribution-based cell size filtering during stitching.
number4Number of tiles along the x axis for cell-type separability.
integer2Number of tiles along the y axis for cell-type separability.
integer2Width of the tiles in pixels
integer120Height of the tiles in pixels
integer120Number of samples to process per training batch
integer4Number of devices (GPUs) to use during training
integer4Number of training epochs
integer200Number of samples to process per batch during prediction
integer1Whether to use connected components for grouping transcripts without direct nucleus association
booleanProcess only one sample at a time from a multi-sample samplesheet.
booleanNumber of sample(s) to process at a time from a multi-sample samplesheet. Works if buffered_samples is true.
integer1Restrict parallelizing a process. Eg. restrict running cellpose cell and nuclei segmentation together if the resources are limited.
booleanParameters used to describe centralised config profiles. These should not be edited.
Git commit id for Institutional configs.
stringmasterBase directory for Institutional configs.
stringhttps://raw.githubusercontent.com/nf-core/configs/masterInstitutional config name.
stringInstitutional config description.
stringInstitutional config contact information.
stringInstitutional config URL link.
stringBase path / URL for data used in the test profiles.
stringhttps://raw.githubusercontent.com/nf-core/test-datasets/spatialxeLess common options for the pipeline, typically set in a config file.
Display help text.
booleanDisplay version and exit.
booleanMethod used to save pipeline results to output directory.
stringEmail address for completion summary, only when pipeline fails.
string^([a-zA-Z0-9_\-\.]+)@([a-zA-Z0-9_\-\.]+)\.([a-zA-Z]{2,5})$Send plain-text email instead of HTML.
booleanFile size limit when attaching MultiQC reports to summary emails.
string25.MB^\d+(\.\d+)?\.?\s*(K|M|G|T)?B$Do not use coloured log outputs.
booleanIncoming hook URL for messaging service
stringCustom config file to supply to MultiQC.
stringCustom logo file to supply to MultiQC. File name must also be set in the MultiQC config file
stringCustom MultiQC yaml file containing HTML including a methods description.
stringBoolean whether to validate parameters against the schema at runtime
booleantrueDo not use coloured log outputs
booleanBase URL or local path to location of pipeline test dataset files
stringhttps://raw.githubusercontent.com/nf-core/test-datasets/Suffix to add to the trace report filename. Default is the date and time in the format yyyy-MM-dd_HH-mm-ss
stringDisplay the full detailed help message.
booleanDisplay hidden parameters in the help message (only works when —help or —help_full are provided).
boolean