Pipeline & Run¶

Top-level orchestration: process a single image or a whole dataset, and shared pipeline helpers.

spineps.seg_run¶

spineps.seg_run ¶

Top-level SPINEPS pipeline orchestration for running spine segmentation over datasets and single niftys.

process_dataset ¶

process_dataset(
    dataset_path: Path,
    model_instance: Segmentation_Model,
    model_semantic: list[Segmentation_Model]
    | Segmentation_Model
    | None = None,
    model_labeling: VertLabelingClassifier | None = None,
    rawdata_name: str = "rawdata",
    derivative_name: str = "derivatives_seg",
    modalities: list[Modality_Pair] | Modality_Pair = [
        (Modality.T2w, Acquisition.sag)
    ],
    save_debug_data: bool = False,
    save_modelres_mask: bool = False,
    save_softmax_logits: bool = False,
    save_log_data: bool = True,
    override_semantic: bool = False,
    override_instance: bool = False,
    override_postpair: bool = False,
    override_ctd: bool = False,
    snapshot_copy_folder: Path | None | bool = None,
    pad_size: int = 4,
    proc_sem_crop_input: bool = True,
    proc_sem_n4_bias_correction: bool = True,
    proc_sem_remove_inferior_beyond_canal: bool = False,
    proc_sem_clean_beyond_largest_bounding_box: bool = True,
    proc_sem_clean_small_cc_artifacts: bool = True,
    proc_inst_corpus_clean: bool = True,
    proc_inst_clean_small_cc_artifacts: bool = True,
    proc_inst_largest_k_cc: int = 0,
    proc_inst_detect_and_solve_merged_corpi: bool = True,
    proc_lab_force_no_tl_anomaly: bool = False,
    proc_fill_3d_holes: bool = True,
    proc_assign_missing_cc: bool = True,
    proc_clean_inst_by_sem: bool = True,
    proc_vertebra_inconsistency: bool = True,
    ignore_model_compatibility: bool = False,
    ignore_inference_compatibility: bool = False,
    ignore_bids_filter: bool = False,
    log_inference_time: bool = True,
    verbose: bool = False,
)

Runs the SPINEPS framework over a whole BIDS-conform dataset.

Iterates over every subject in the BIDS dataset, queries the matching scans for each requested modality pair and runs process_img_nii on each, producing semantic (subregion), vertebra (instance) and centroid outputs plus a snapshot.

Parameters:

Name	Type	Description	Default
`dataset_path`	`Path`	Path to the BIDS dataset.	required
`model_instance`	`Segmentation_Model`	Model for the vertebra (instance) segmentation.	required
`model_semantic`	`list[Segmentation_Model] \| Segmentation_Model \| None`	Models for the subregion (semantic) segmentation, one per modality pair. If None, attempts to find a matching model for each modality. Defaults to None.	`None`
`model_labeling`	`VertLabelingClassifier \| None`	Classifier used to label the vertebra instances. Defaults to None.	`None`
`rawdata_name`	`str`	Name of the rawdata folder. Defaults to "rawdata".	`'rawdata'`
`derivative_name`	`str`	Name of the derivatives output folder. Defaults to "derivatives_seg".	`'derivatives_seg'`
`modalities`	`list[Modality_Pair] \| Modality_Pair`	Modality/acquisition pairs to segment in the dataset. Defaults to [(Modality.T2w, Acquisition.sag)].	`[(T2w, sag)]`
`save_debug_data`	`bool`	If true, saves intermediate debug data. Increases space usage. Defaults to False.	`False`
`save_modelres_mask`	`bool`	If true, additionally saves the semantic mask in the resolution of the model. Defaults to False.	`False`
`save_softmax_logits`	`bool`	If true, additionally saves the softmax logits (averaged over folds) as an npz. Defaults to False.	`False`
`save_log_data`	`bool`	If true, writes the log to a file in the dataset folder. Defaults to True.	`True`
`override_semantic`	`bool`	If true, redoes existing semantic segmentations. Defaults to False.	`False`
`override_instance`	`bool`	If true, redoes existing instance segmentations. Defaults to False.	`False`
`override_postpair`	`bool`	If true, redoes the combined post-processing step. Defaults to False.	`False`
`override_ctd`	`bool`	If true, redoes existing centroid files. Defaults to False.	`False`
`snapshot_copy_folder`	`Path \| None \| bool`	If a path, copies all created snapshots there; if True, uses a "snaps_seg" subfolder of the dataset; if None/False, no copy is made. Defaults to None.	`None`
`pad_size`	`int`	Padding added in each dimension before inference. Defaults to 4.	`4`
`proc_sem_crop_input`	`bool`	If true, crops the input to the foreground before semantic segmentation. Defaults to True.	`True`
`proc_sem_n4_bias_correction`	`bool`	If true, applies N4 bias field correction before semantic segmentation (MRI only). Defaults to True.	`True`
`proc_sem_remove_inferior_beyond_canal`	`bool`	If true, removes semantic structures inferior to and beyond the spinal canal. Defaults to False.	`False`
`proc_sem_clean_beyond_largest_bounding_box`	`bool`	If true, removes semantic voxels outside the largest bounding box. Defaults to True.	`True`
`proc_sem_clean_small_cc_artifacts`	`bool`	If true, removes small connected-component artifacts from the semantic mask. Defaults to True.	`True`
`proc_inst_corpus_clean`	`bool`	If true, cleans the vertebra corpus during instance processing. Defaults to True.	`True`
`proc_inst_clean_small_cc_artifacts`	`bool`	If true, removes small connected-component artifacts from the instance mask. Defaults to True.	`True`
`proc_inst_largest_k_cc`	`int`	If greater than 0, keeps only the largest k connected components of the instance mask. Defaults to 0.	`0`
`proc_inst_detect_and_solve_merged_corpi`	`bool`	If true, detects and splits merged vertebra corpi. Defaults to True.	`True`
`proc_lab_force_no_tl_anomaly`	`bool`	If true, forces the labeling to assume no thoracolumbar transition anomaly. Defaults to False.	`False`
`proc_fill_3d_holes`	`bool`	If true, fills 3D holes during post-processing. Defaults to True.	`True`
`proc_assign_missing_cc`	`bool`	If true, assigns unlabeled connected components to the nearest instance. Defaults to True.	`True`
`proc_clean_inst_by_sem`	`bool`	If true, cleans the instance mask using the semantic mask. Defaults to True.	`True`
`proc_vertebra_inconsistency`	`bool`	If true, detects and resolves vertebra labeling inconsistencies. Defaults to True.	`True`
`ignore_model_compatibility`	`bool`	If true, ignores model/modality initialization compatibility issues. Defaults to False.	`False`
`ignore_inference_compatibility`	`bool`	If true, ignores compatibility issues between models and individual inputs. Defaults to False.	`False`
`ignore_bids_filter`	`bool`	If true, disables the BIDS query filters and processes all niftys found. Defaults to False.	`False`
`log_inference_time`	`bool`	If true, logs the inference time of each step. Defaults to True.	`True`
`verbose`	`bool`	If true, prints verbose information. Defaults to False.	`False`

Source code in spineps/seg_run.py

@citation_reminder
def process_dataset(
    dataset_path: Path,
    model_instance: Segmentation_Model,
    model_semantic: list[Segmentation_Model] | Segmentation_Model | None = None,
    model_labeling: VertLabelingClassifier | None = None,
    #
    rawdata_name: str = "rawdata",
    derivative_name: str = "derivatives_seg",
    modalities: list[Modality_Pair] | Modality_Pair = [(Modality.T2w, Acquisition.sag)],  # noqa: B006
    save_debug_data: bool = False,
    # save_uncertainty_image: bool = False,
    save_modelres_mask: bool = False,
    save_softmax_logits: bool = False,
    save_log_data: bool = True,
    override_semantic: bool = False,
    override_instance: bool = False,
    override_postpair: bool = False,
    override_ctd: bool = False,
    snapshot_copy_folder: Path | None | bool = None,
    pad_size: int = 4,
    # Processings
    # Semantic
    proc_sem_crop_input: bool = True,
    proc_sem_n4_bias_correction: bool = True,
    proc_sem_remove_inferior_beyond_canal: bool = False,
    proc_sem_clean_beyond_largest_bounding_box: bool = True,
    proc_sem_clean_small_cc_artifacts: bool = True,
    # Instance
    proc_inst_corpus_clean: bool = True,
    proc_inst_clean_small_cc_artifacts: bool = True,
    proc_inst_largest_k_cc: int = 0,
    proc_inst_detect_and_solve_merged_corpi: bool = True,
    # Labeling
    proc_lab_force_no_tl_anomaly: bool = False,
    # Both
    proc_fill_3d_holes: bool = True,
    proc_assign_missing_cc: bool = True,
    proc_clean_inst_by_sem: bool = True,
    proc_vertebra_inconsistency: bool = True,
    # Misc
    ignore_model_compatibility: bool = False,
    ignore_inference_compatibility: bool = False,
    ignore_bids_filter: bool = False,
    log_inference_time: bool = True,
    verbose: bool = False,
):
    """Runs the SPINEPS framework over a whole BIDS-conform dataset.

    Iterates over every subject in the BIDS dataset, queries the matching scans for each requested modality pair and runs
    process_img_nii on each, producing semantic (subregion), vertebra (instance) and centroid outputs plus a snapshot.

    Args:
        dataset_path (Path): Path to the BIDS dataset.
        model_instance (Segmentation_Model): Model for the vertebra (instance) segmentation.
        model_semantic (list[Segmentation_Model] | Segmentation_Model | None, optional): Models for the subregion (semantic)
            segmentation, one per modality pair. If None, attempts to find a matching model for each modality. Defaults to None.
        model_labeling (VertLabelingClassifier | None, optional): Classifier used to label the vertebra instances. Defaults to None.
        rawdata_name (str, optional): Name of the rawdata folder. Defaults to "rawdata".
        derivative_name (str, optional): Name of the derivatives output folder. Defaults to "derivatives_seg".
        modalities (list[Modality_Pair] | Modality_Pair, optional): Modality/acquisition pairs to segment in the dataset.
            Defaults to [(Modality.T2w, Acquisition.sag)].
        save_debug_data (bool, optional): If true, saves intermediate debug data. Increases space usage. Defaults to False.
        save_modelres_mask (bool, optional): If true, additionally saves the semantic mask in the resolution of the model.
            Defaults to False.
        save_softmax_logits (bool, optional): If true, additionally saves the softmax logits (averaged over folds) as an npz.
            Defaults to False.
        save_log_data (bool, optional): If true, writes the log to a file in the dataset folder. Defaults to True.
        override_semantic (bool, optional): If true, redoes existing semantic segmentations. Defaults to False.
        override_instance (bool, optional): If true, redoes existing instance segmentations. Defaults to False.
        override_postpair (bool, optional): If true, redoes the combined post-processing step. Defaults to False.
        override_ctd (bool, optional): If true, redoes existing centroid files. Defaults to False.
        snapshot_copy_folder (Path | None | bool, optional): If a path, copies all created snapshots there; if True, uses a
            "snaps_seg" subfolder of the dataset; if None/False, no copy is made. Defaults to None.
        pad_size (int, optional): Padding added in each dimension before inference. Defaults to 4.
        proc_sem_crop_input (bool, optional): If true, crops the input to the foreground before semantic segmentation. Defaults to True.
        proc_sem_n4_bias_correction (bool, optional): If true, applies N4 bias field correction before semantic segmentation
            (MRI only). Defaults to True.
        proc_sem_remove_inferior_beyond_canal (bool, optional): If true, removes semantic structures inferior to and beyond the
            spinal canal. Defaults to False.
        proc_sem_clean_beyond_largest_bounding_box (bool, optional): If true, removes semantic voxels outside the largest
            bounding box. Defaults to True.
        proc_sem_clean_small_cc_artifacts (bool, optional): If true, removes small connected-component artifacts from the
            semantic mask. Defaults to True.
        proc_inst_corpus_clean (bool, optional): If true, cleans the vertebra corpus during instance processing. Defaults to True.
        proc_inst_clean_small_cc_artifacts (bool, optional): If true, removes small connected-component artifacts from the
            instance mask. Defaults to True.
        proc_inst_largest_k_cc (int, optional): If greater than 0, keeps only the largest k connected components of the instance
            mask. Defaults to 0.
        proc_inst_detect_and_solve_merged_corpi (bool, optional): If true, detects and splits merged vertebra corpi. Defaults to True.
        proc_lab_force_no_tl_anomaly (bool, optional): If true, forces the labeling to assume no thoracolumbar transition anomaly.
            Defaults to False.
        proc_fill_3d_holes (bool, optional): If true, fills 3D holes during post-processing. Defaults to True.
        proc_assign_missing_cc (bool, optional): If true, assigns unlabeled connected components to the nearest instance. Defaults to True.
        proc_clean_inst_by_sem (bool, optional): If true, cleans the instance mask using the semantic mask. Defaults to True.
        proc_vertebra_inconsistency (bool, optional): If true, detects and resolves vertebra labeling inconsistencies. Defaults to True.
        ignore_model_compatibility (bool, optional): If true, ignores model/modality initialization compatibility issues. Defaults to False.
        ignore_inference_compatibility (bool, optional): If true, ignores compatibility issues between models and individual inputs.
            Defaults to False.
        ignore_bids_filter (bool, optional): If true, disables the BIDS query filters and processes all niftys found. Defaults to False.
        log_inference_time (bool, optional): If true, logs the inference time of each step. Defaults to True.
        verbose (bool, optional): If true, prints verbose information. Defaults to False.
    """
    global logger  # noqa: PLW0603
    logger.print(f"Initialize setup for dataset in {dataset_path}", Log_Type.BOLD)
    # INITIALIZATION
    if not isinstance(modalities, list):
        modalities = [modalities]
    assert len(modalities) > 0, "you must specifiy the modalities to be segmented!"

    if snapshot_copy_folder is True:
        snapshot_copy_folder = dataset_path.joinpath("snaps_seg")
    elif snapshot_copy_folder is False:
        snapshot_copy_folder = None

    if model_semantic is None:
        model_semantic = [find_best_matching_model(m, expected_resolution=None) for m in modalities]
        logger.print("Found matching models:")
        for idx, m in enumerate(model_semantic):
            logger.print("-", str(modalities[idx]), ":", str(m.modelid()))
        del idx, m
    if not isinstance(model_semantic, list):
        model_semantic = [model_semantic]

    # check models and mod, acq tuples
    compatible = True
    for idx, mp in enumerate(modalities):
        compatible = False if not check_model_modality_acquisition(model_semantic[idx], mp) else compatible
        compatible = False if model_labeling is not None and not check_model_modality_acquisition(model_labeling, mp) else compatible
    del idx, mp

    if not compatible and not ignore_model_compatibility:
        logger.print("Compatibility issues (see above), stop program", Log_Type.FAIL)

    # Activate logger
    args = locals()
    if save_log_data:
        logger = Logger(dataset_path, log_filename="segmentation_pipeline", default_verbose=True, log_arguments=args, prefix="SegPipeline")
    logger.print(f"Processing dataset in {dataset_path}", Log_Type.BOLD)

    # RUN
    bids_ds = BIDS_Global_info(datasets=[dataset_path], parents=[rawdata_name, derivative_name], verbose=False)
    n_subjects = len(bids_ds)
    logger.print(f"Found {n_subjects} Subjects in {dataset_path}, parents={bids_ds.parents}")

    processed_seen_counter = 0
    processed_alldone_counter = 0
    processed_counter = 0
    not_properly_processed: list[str] = []

    for s_idx, (name, subject) in enumerate(bids_ds.enumerate_subjects(sort=True)):
        logger.print()
        logger.print(f"Processing {s_idx + 1} / {n_subjects} subject: {name}", Log_Type.ITALICS)
        subject_scan_processed = 0
        if name == "unsorted" and not ignore_bids_filter:
            logger.print("Unsorted, will skip")
            continue
        for idx, mod_pair in enumerate(modalities):
            model = model_semantic[idx]
            allowed_format = Modality.format_keys(mod_pair[0])
            allowed_acq = Acquisition.format_keys(mod_pair[1])
            q = subject.new_query(flatten=True)
            # optional give subject list
            q.filter_filetype("nii.gz")
            q.filter_non_existence("seg", required=True)
            if not ignore_bids_filter:
                q.filter_format(allowed_format)
                q.filter_dixon_only_inphase()
                q.filter_non_existence("lesions", required=True)
                q.filter_non_existence("label", required=True)
                q.filter("acq", lambda x: x in allowed_acq, required=False)  # noqa: B023
            scans = q.loop_list(sort=True)  # TODO make it family to allow for multi-inputs
            for s in scans:
                output_paths, errcode = process_img_nii(
                    img_ref=s,
                    model_semantic=model,
                    model_instance=model_instance,
                    model_labeling=model_labeling,
                    #
                    derivative_name=derivative_name,
                    #
                    # save_uncertainty_image=save_uncertainty_image,
                    save_modelres_mask=save_modelres_mask,
                    save_softmax_logits=save_softmax_logits,
                    save_debug_data=save_debug_data,
                    override_semantic=override_semantic,
                    override_instance=override_instance,
                    override_postpair=override_postpair,
                    override_ctd=override_ctd,
                    proc_pad_size=pad_size,
                    proc_sem_crop_input=proc_sem_crop_input,
                    proc_sem_n4_bias_correction=proc_sem_n4_bias_correction,
                    proc_fill_3d_holes=proc_fill_3d_holes,
                    proc_sem_remove_inferior_beyond_canal=proc_sem_remove_inferior_beyond_canal,
                    proc_sem_clean_beyond_largest_bounding_box=proc_sem_clean_beyond_largest_bounding_box,
                    proc_sem_clean_small_cc_artifacts=proc_sem_clean_small_cc_artifacts,
                    proc_inst_detect_and_solve_merged_corpi=proc_inst_detect_and_solve_merged_corpi,
                    proc_inst_corpus_clean=proc_inst_corpus_clean,
                    proc_inst_clean_small_cc_artifacts=proc_inst_clean_small_cc_artifacts,
                    proc_assign_missing_cc=proc_assign_missing_cc,
                    proc_inst_largest_k_cc=proc_inst_largest_k_cc,
                    proc_clean_inst_by_sem=proc_clean_inst_by_sem,
                    proc_lab_force_no_tl_anomaly=proc_lab_force_no_tl_anomaly,
                    proc_vertebra_inconsistency=proc_vertebra_inconsistency,
                    snapshot_copy_folder=snapshot_copy_folder,
                    ignore_bids_filter=ignore_bids_filter,
                    return_output_instead_of_save=False,
                    ignore_compatibility_issues=ignore_inference_compatibility,
                    log_inference_time=log_inference_time,
                    verbose=verbose,
                )
                subject_scan_processed += 1
                processed_seen_counter += 1
                if errcode == ErrCode.OK:
                    processed_counter += 1
                elif errcode == ErrCode.ALL_DONE:
                    processed_alldone_counter += 1
                else:
                    not_properly_processed.append((errcode, str(s.file["nii.gz"])))
        if subject_scan_processed == 0:
            logger.print(f"Subject {s_idx + 1}: {name} had no scans to be processed")

    logger.print()
    logger.print(f"Processed {processed_seen_counter} scans with {modalities}", Log_Type.BOLD)
    (
        logger.print(f"Scans that were skipped because all derivatives were present: {processed_alldone_counter}")
        if processed_alldone_counter > 0
        else None
    )
    not_processed_ok = processed_seen_counter - processed_alldone_counter - processed_counter
    if not_processed_ok > 0:
        logger.print(f"Scans that were not properly processed: {not_processed_ok}")
        (
            logger.print("Consult the log file for more info!")
            if save_log_data
            else logger.print("Set save_log_data=True to get a detailed log. Here are the scans in question:")
        )
        logger.print(not_properly_processed)

process_img_nii ¶

process_img_nii(
    img_ref: BIDS_FILE,
    model_semantic: Segmentation_Model,
    model_instance: Segmentation_Model,
    model_labeling: VertLabelingClassifier | None = None,
    derivative_name: str = "derivatives_seg",
    save_modelres_mask: bool = False,
    save_softmax_logits: bool = False,
    save_debug_data: bool = False,
    save_raw: bool = True,
    override_semantic: bool = False,
    override_instance: bool = False,
    override_postpair: bool = False,
    override_ctd: bool = False,
    proc_pad_size: int = 4,
    proc_normalize_input: bool = True,
    crop: tuple[slice, slice, slice] | None = None,
    auto_crop_to_spine: bool | Literal["auto"] = "auto",
    auto_crop_when_max_res_leq: float = 1.2,
    auto_crop_req_crop_min_dim: int = 200,
    proc_sem_crop_input: bool = True,
    proc_sem_n4_bias_correction: bool = True,
    proc_sem_remove_inferior_beyond_canal: bool = False,
    proc_sem_clean_beyond_largest_bounding_box: bool = True,
    proc_sem_clean_small_cc_artifacts: bool = True,
    proc_inst_corpus_clean: bool = True,
    proc_inst_clean_small_cc_artifacts: bool = True,
    proc_inst_largest_k_cc: int = 0,
    proc_inst_detect_and_solve_merged_corpi: bool = True,
    vertebra_instance_labeling_offset=2,
    proc_lab_force_no_tl_anomaly: bool = False,
    proc_fill_3d_holes: bool = True,
    proc_assign_missing_cc: bool = True,
    proc_assign_missing_cc_fast: bool = False,
    proc_clean_inst_by_sem: bool = True,
    proc_vertebra_inconsistency: bool = True,
    lambda_semantic: Callable[[NII], NII] | None = None,
    snapshot_copy_folder: Path | None = None,
    ignore_bids_filter: bool = False,
    ignore_compatibility_issues: bool = False,
    log_inference_time: bool = True,
    return_output_instead_of_save: bool = False,
    timing=False,
    verbose: bool = False,
) -> tuple[dict[str, Path], ErrCode]

Runs the SPINEPS framework over one nifty.

Runs the full pipeline on a single input image: semantic (subregion) segmentation, vertebra (instance) segmentation, combined post-processing/labeling, centroid computation and a snapshot. Existing outputs are reused unless overridden.

Parameters:

Name	Type	Description	Default
`img_ref`	`BIDS_FILE`	Input BIDS_FILE referencing the image to segment.	required
`model_semantic`	`Segmentation_Model`	Model for the subregion (semantic) segmentation.	required
`model_instance`	`Segmentation_Model`	Model for the vertebra (instance) segmentation.	required
`model_labeling`	`VertLabelingClassifier \| None`	Classifier used to label the vertebra instances. Defaults to None.	`None`
`derivative_name`	`str`	Name of the derivatives output folder. Defaults to "derivatives_seg".	`'derivatives_seg'`
`save_modelres_mask`	`bool`	If true, additionally saves the semantic mask in the resolution of the model. Defaults to False.	`False`
`save_softmax_logits`	`bool`	If true, additionally saves the softmax logits (averaged over folds) as an npz. Defaults to False.	`False`
`save_debug_data`	`bool`	If true, saves intermediate debug data. Increases space usage. Defaults to False.	`False`
`save_raw`	`bool`	If true, saves the raw (pre-cleanup) semantic and vertebra masks. Defaults to True.	`True`
`override_semantic`	`bool`	If true, redoes an existing semantic segmentation. Defaults to False.	`False`
`override_instance`	`bool`	If true, redoes an existing instance segmentation. Defaults to False.	`False`
`override_postpair`	`bool`	If true, redoes the combined post-processing step. Defaults to False.	`False`
`override_ctd`	`bool`	If true, redoes an existing centroid file. Defaults to False.	`False`
`proc_pad_size`	`int`	Padding added in each dimension before inference. Defaults to 4.	`4`
`proc_normalize_input`	`bool`	If true, normalizes the input intensities (disabled automatically for CT). Defaults to True.	`True`
`crop`	`tuple[slice, slice, slice] \| None`	If provided, segment only within the specified crop.	`None`
`auto_crop_to_spine`	`bool \| 'auto'`	Speeds up high-resolution models by first predicting the spine with VIBESeg (https://link.springer.com/article/10.1007/s00330-025-12035-9) and cropping to the spine region (works for any MR or CT image).	`'auto'`
`auto_crop_when_max_res_leq`	`float`	Enables automatic spine cropping when auto_crop_to_spine="auto" and the largest spacing value of the semantic model is less than or equal to this threshold.	`1.2`
`auto_crop_req_crop_min_dim`	`int`	When auto_crop_to_spine="auto", compute the crop only if the image size exceeds this value cubed.	`200`
`proc_sem_crop_input`	`bool`	If true, crops the input to the foreground before semantic segmentation. Defaults to True.	`True`
`proc_sem_n4_bias_correction`	`bool`	If true, applies N4 bias field correction before semantic segmentation (MRI only). Defaults to True.	`True`
`proc_sem_remove_inferior_beyond_canal`	`bool`	If true, removes semantic structures inferior to and beyond the spinal canal. Defaults to False.	`False`
`proc_sem_clean_beyond_largest_bounding_box`	`bool`	If true, removes semantic voxels outside the largest bounding box. Defaults to True.	`True`
`proc_sem_clean_small_cc_artifacts`	`bool`	If true, removes small connected-component artifacts from the semantic mask. Defaults to True.	`True`
`proc_inst_corpus_clean`	`bool`	If true, cleans the vertebra corpus during instance processing. Defaults to True.	`True`
`proc_inst_clean_small_cc_artifacts`	`bool`	If true, removes small connected-component artifacts from the instance mask. Defaults to True.	`True`
`proc_inst_largest_k_cc`	`int`	If greater than 0, keeps only the largest k connected components of the instance mask. Defaults to 0.	`0`
`proc_inst_detect_and_solve_merged_corpi`	`bool`	If true, detects and splits merged vertebra corpi. Defaults to True.	`True`
`vertebra_instance_labeling_offset`	`int`	Offset applied when mapping instance ids to vertebra labels (set to 1 for CT models that include C1). Defaults to 2.	`2`
`proc_lab_force_no_tl_anomaly`	`bool`	If true, forces the labeling to assume no thoracolumbar transition anomaly. Defaults to False.	`False`
`proc_fill_3d_holes`	`bool`	If true, fills 3D holes during post-processing. Defaults to True.	`True`
`proc_assign_missing_cc`	`bool`	If true, assigns unlabeled connected components to the nearest instance. Defaults to True.	`True`
`proc_assign_missing_cc_fast`	`bool`	If true, uses the faster variant of the missing-cc assignment. Defaults to False.	`False`
`proc_clean_inst_by_sem`	`bool`	If true, cleans the instance mask using the semantic mask. Defaults to True.	`True`
`proc_vertebra_inconsistency`	`bool`	If true, detects and resolves vertebra labeling inconsistencies. Defaults to True.	`True`
`lambda_semantic`	`Callable[[NII], NII] \| None`	Optional function applied to the semantic mask before saving. Defaults to None.	`None`
`snapshot_copy_folder`	`Path \| None`	If given, copies the created snapshot there. Defaults to None.	`None`
`ignore_bids_filter`	`bool`	If true, builds output paths in non-strict mode. Defaults to False.	`False`
`ignore_compatibility_issues`	`bool`	If true, continues despite input/model incompatibilities. Defaults to False.	`False`
`log_inference_time`	`bool`	If true, logs the inference time of each step. Defaults to True.	`True`
`return_output_instead_of_save`	`bool`	If true, returns the result NIIs/centroids instead of saving them. Defaults to False.	`False`
`timing`	`bool`	If true, logs the timing of each pipeline step. Defaults to False.	`False`
`verbose`	`bool`	If true, prints verbose information. Defaults to False.	`False`

Returns:

Type	Description
`tuple[dict[str, Path], ErrCode]`	tuple[dict[str, Path], ErrCode]: Mapping of output names to their file paths and an error code indicating success. If return_output_instead_of_save is True, instead returns (seg_nii, vert_nii, centroids, ErrCode).

Source code in spineps/seg_run.py

@citation_reminder
def process_img_nii(  # noqa: C901
    img_ref: BIDS_FILE,
    model_semantic: Segmentation_Model,
    model_instance: Segmentation_Model,
    model_labeling: VertLabelingClassifier | None = None,
    derivative_name: str = "derivatives_seg",
    #
    # save_uncertainty_image: bool = False,
    save_modelres_mask: bool = False,
    save_softmax_logits: bool = False,
    save_debug_data: bool = False,
    save_raw: bool = True,
    override_semantic: bool = False,
    override_instance: bool = False,
    override_postpair: bool = False,
    override_ctd: bool = False,
    proc_pad_size: int = 4,
    proc_normalize_input: bool = True,
    # Processings
    # Pre-processing crop
    crop: tuple[slice, slice, slice] | None = None,
    auto_crop_to_spine: bool | Literal["auto"] = "auto",
    auto_crop_when_max_res_leq: float = 1.2,
    auto_crop_req_crop_min_dim: int = 200,
    # Semantic
    proc_sem_crop_input: bool = True,
    proc_sem_n4_bias_correction: bool = True,
    proc_sem_remove_inferior_beyond_canal: bool = False,
    proc_sem_clean_beyond_largest_bounding_box: bool = True,
    proc_sem_clean_small_cc_artifacts: bool = True,
    # Instance
    proc_inst_corpus_clean: bool = True,
    proc_inst_clean_small_cc_artifacts: bool = True,
    proc_inst_largest_k_cc: int = 0,
    proc_inst_detect_and_solve_merged_corpi: bool = True,
    vertebra_instance_labeling_offset=2,
    # Labeling
    proc_lab_force_no_tl_anomaly: bool = False,
    # Both
    proc_fill_3d_holes: bool = True,
    proc_assign_missing_cc: bool = True,
    proc_assign_missing_cc_fast: bool = False,
    proc_clean_inst_by_sem: bool = True,
    proc_vertebra_inconsistency: bool = True,
    # Misc
    lambda_semantic: Callable[[NII], NII] | None = None,
    snapshot_copy_folder: Path | None = None,
    ignore_bids_filter: bool = False,
    ignore_compatibility_issues: bool = False,
    log_inference_time: bool = True,
    return_output_instead_of_save: bool = False,
    timing=False,
    verbose: bool = False,
) -> tuple[dict[str, Path], ErrCode]:
    """Runs the SPINEPS framework over one nifty.

    Runs the full pipeline on a single input image: semantic (subregion) segmentation, vertebra (instance) segmentation,
    combined post-processing/labeling, centroid computation and a snapshot. Existing outputs are reused unless overridden.

    Args:
        img_ref (BIDS_FILE): Input BIDS_FILE referencing the image to segment.
        model_semantic (Segmentation_Model): Model for the subregion (semantic) segmentation.
        model_instance (Segmentation_Model): Model for the vertebra (instance) segmentation.
        model_labeling (VertLabelingClassifier | None, optional): Classifier used to label the vertebra instances. Defaults to None.
        derivative_name (str, optional): Name of the derivatives output folder. Defaults to "derivatives_seg".
        save_modelres_mask (bool, optional): If true, additionally saves the semantic mask in the resolution of the model.
            Defaults to False.
        save_softmax_logits (bool, optional): If true, additionally saves the softmax logits (averaged over folds) as an npz.
            Defaults to False.
        save_debug_data (bool, optional): If true, saves intermediate debug data. Increases space usage. Defaults to False.
        save_raw (bool, optional): If true, saves the raw (pre-cleanup) semantic and vertebra masks. Defaults to True.
        override_semantic (bool, optional): If true, redoes an existing semantic segmentation. Defaults to False.
        override_instance (bool, optional): If true, redoes an existing instance segmentation. Defaults to False.
        override_postpair (bool, optional): If true, redoes the combined post-processing step. Defaults to False.
        override_ctd (bool, optional): If true, redoes an existing centroid file. Defaults to False.
        proc_pad_size (int, optional): Padding added in each dimension before inference. Defaults to 4.
        proc_normalize_input (bool, optional): If true, normalizes the input intensities (disabled automatically for CT). Defaults to True.
        crop: If provided, segment only within the specified crop.
        auto_crop_to_spine (bool | "auto"): Speeds up high-resolution models by first predicting the spine with VIBESeg
            (https://link.springer.com/article/10.1007/s00330-025-12035-9) and cropping to the spine region (works for any MR or
            CT image).
        auto_crop_when_max_res_leq: Enables automatic spine cropping when auto_crop_to_spine="auto" and the largest spacing value
            of the semantic model is less than or equal to this threshold.
        auto_crop_req_crop_min_dim: When auto_crop_to_spine="auto", compute the crop only if the image size exceeds this value cubed.
        proc_sem_crop_input (bool, optional): If true, crops the input to the foreground before semantic segmentation. Defaults to True.
        proc_sem_n4_bias_correction (bool, optional): If true, applies N4 bias field correction before semantic segmentation
            (MRI only). Defaults to True.
        proc_sem_remove_inferior_beyond_canal (bool, optional): If true, removes semantic structures inferior to and beyond the
            spinal canal. Defaults to False.
        proc_sem_clean_beyond_largest_bounding_box (bool, optional): If true, removes semantic voxels outside the largest
            bounding box. Defaults to True.
        proc_sem_clean_small_cc_artifacts (bool, optional): If true, removes small connected-component artifacts from the
            semantic mask. Defaults to True.
        proc_inst_corpus_clean (bool, optional): If true, cleans the vertebra corpus during instance processing. Defaults to True.
        proc_inst_clean_small_cc_artifacts (bool, optional): If true, removes small connected-component artifacts from the
            instance mask. Defaults to True.
        proc_inst_largest_k_cc (int, optional): If greater than 0, keeps only the largest k connected components of the instance
            mask. Defaults to 0.
        proc_inst_detect_and_solve_merged_corpi (bool, optional): If true, detects and splits merged vertebra corpi. Defaults to True.
        vertebra_instance_labeling_offset (int, optional): Offset applied when mapping instance ids to vertebra labels (set to 1
            for CT models that include C1). Defaults to 2.
        proc_lab_force_no_tl_anomaly (bool, optional): If true, forces the labeling to assume no thoracolumbar transition anomaly.
            Defaults to False.
        proc_fill_3d_holes (bool, optional): If true, fills 3D holes during post-processing. Defaults to True.
        proc_assign_missing_cc (bool, optional): If true, assigns unlabeled connected components to the nearest instance. Defaults to True.
        proc_assign_missing_cc_fast (bool, optional): If true, uses the faster variant of the missing-cc assignment. Defaults to False.
        proc_clean_inst_by_sem (bool, optional): If true, cleans the instance mask using the semantic mask. Defaults to True.
        proc_vertebra_inconsistency (bool, optional): If true, detects and resolves vertebra labeling inconsistencies. Defaults to True.
        lambda_semantic (Callable[[NII], NII] | None, optional): Optional function applied to the semantic mask before saving.
            Defaults to None.
        snapshot_copy_folder (Path | None, optional): If given, copies the created snapshot there. Defaults to None.
        ignore_bids_filter (bool, optional): If true, builds output paths in non-strict mode. Defaults to False.
        ignore_compatibility_issues (bool, optional): If true, continues despite input/model incompatibilities. Defaults to False.
        log_inference_time (bool, optional): If true, logs the inference time of each step. Defaults to True.
        return_output_instead_of_save (bool, optional): If true, returns the result NIIs/centroids instead of saving them.
            Defaults to False.
        timing (bool, optional): If true, logs the timing of each pipeline step. Defaults to False.
        verbose (bool, optional): If true, prints verbose information. Defaults to False.

    Returns:
        tuple[dict[str, Path], ErrCode]: Mapping of output names to their file paths and an error code indicating success.
            If return_output_instead_of_save is True, instead returns (seg_nii, vert_nii, centroids, ErrCode).
    """
    arguments = locals()
    input_format = img_ref.format

    output_paths = output_paths_from_input(
        img_ref, derivative_name, snapshot_copy_folder, input_format=input_format, non_strict_mode=ignore_bids_filter
    )
    out_spine = output_paths["out_spine"]
    out_spine_raw = output_paths["out_spine_raw"]
    out_vert = output_paths["out_vert"]
    out_vert_raw = output_paths["out_vert_raw"]
    out_logits = output_paths["out_logits"]
    out_snap = output_paths["out_snap"]
    out_ctd = output_paths["out_ctd"]
    out_snap2 = output_paths["out_snap2"]
    out_raw = output_paths["out_raw"]
    out_debug = output_paths["out_debug"]
    if isinstance(snapshot_copy_folder, Path):
        snapshot_copy_folder.mkdir(parents=True, exist_ok=True)

    if (
        out_spine.exists()
        and out_vert.exists()
        and out_snap.exists()
        and out_ctd.exists()
        and not override_semantic
        and not override_instance
        and not override_postpair
        and not override_ctd
        and (snapshot_copy_folder is None or out_snap2.exists())
    ):
        logger.print(f"{out_spine.name}: Outputs are all already created and no override set, will skip")
        return output_paths, ErrCode.ALL_DONE

    done_something = False
    debug_data_run: dict[str, NII] = {}

    if Modality.CT in model_semantic.modalities():
        proc_normalize_input = False  # Never normalize input if it is an CT
        proc_sem_n4_bias_correction = False  # n4_bias_correction is a MRI thing
        # proc_assign_missing_cc_fast = True  # TODO remove
        if model_semantic.inference_config.has_c1:
            vertebra_instance_labeling_offset = 1

    compatible = check_input_model_compatibility(img_ref, model=model_semantic)
    compatible_labeling = check_input_model_compatibility(img_ref, model=model_labeling) if model_labeling is not None else True
    if not (compatible and compatible_labeling):
        if not ignore_compatibility_issues:
            return output_paths, ErrCode.COMPATIBILITY
        else:
            logger.print("Issues are ignored, might not have expected outcome", Log_Type.WARNING)

    start_time = start_time2 = perf_counter()
    file_dir = img_ref.file["nii.gz"]

    logger.print("Processing", file_dir.name)
    with logger:
        if verbose:
            model_semantic.logger.default_verbose = True
        input_nii = img_ref.open_nii()
        input_nii.seg = False
        input_nii_ = input_nii.copy()
        if timing:
            logger.print(f"Loading files took: {perf_counter() - start_time2:.2f} seconds", Log_Type.OK, verbose=log_inference_time)
            start_time2 = perf_counter()
        # First stage
        if not out_spine_raw.exists() or override_semantic:
            resolution_range = model_semantic.inference_config.resolution_range

            max_resolution: float = max(resolution_range[1]) if isinstance(resolution_range[0], tuple) else max(resolution_range)  # type: ignore
            num_voxels = math.prod(input_nii.shape)
            if (
                auto_crop_to_spine is True
                or (
                    auto_crop_to_spine == "auto"
                    and (max_resolution) <= auto_crop_when_max_res_leq
                    and num_voxels > auto_crop_req_crop_min_dim**3
                )
                or model_semantic.inference_config.needs_corp
            ):
                logger.print(
                    "Compute spine crop with VIBESegmentator https://link.springer.com/article/10.1007/s00330-025-12035-9", Log_Type.OK
                )
                out_vibeseg = output_paths["out_vibeseg"]
                crop = compute_crop(input_nii, out_vibeseg, ddevice="cpu" if model_semantic.use_cpu else "cuda", logger=logger)
                if timing:
                    logger.print(
                        f"Compute cropping took: {perf_counter() - start_time2:.2f} seconds", Log_Type.OK, verbose=log_inference_time
                    )
                    start_time2 = perf_counter()

            if crop is not None:
                logger.print(f"Change {crop=} from shape={input_nii.shape}")
                try:
                    input_nii = input_nii.apply_crop(crop)
                except Exception:
                    logger.print_error()
            logger.print("Input image", input_nii.zoom, input_nii.orientation, input_nii.shape)

            input_preprocessed, errcode = preprocess_input(
                input_nii,
                pad_size=proc_pad_size,
                debug_data=debug_data_run,
                proc_crop_input=proc_sem_crop_input,
                proc_normalize_input=proc_normalize_input,
                proc_do_n4_bias_correction=proc_sem_n4_bias_correction,
                verbose=verbose,
            )
            if timing:
                logger.print(f"Preprocess input took: {perf_counter() - start_time2:.2f} seconds", Log_Type.OK, verbose=log_inference_time)
                start_time2 = perf_counter()

            if errcode != ErrCode.OK:
                logger.print("Got Error from preprocessing", Log_Type.FAIL)
                return output_paths, errcode
            # make subreg mask
            assert input_preprocessed is not None
            seg_nii_modelres, softmax_logits, errcode = predict_semantic_mask(
                input_preprocessed,
                model_semantic,
                debug_data=debug_data_run,
                verbose=verbose,
                proc_fill_3d_holes=proc_fill_3d_holes,
                proc_clean_small_cc_artifacts=proc_sem_clean_small_cc_artifacts,
                proc_clean_beyond_largest_bounding_box=proc_sem_clean_beyond_largest_bounding_box,
                proc_remove_inferior_beyond_canal=proc_sem_remove_inferior_beyond_canal,
            )
            if errcode != ErrCode.OK:
                return output_paths, errcode

            assert isinstance(seg_nii_modelres, NII), "subregion segmentation is not a NII!"
            logger.print("seg_nii out", seg_nii_modelres.zoom, seg_nii_modelres.orientation, seg_nii_modelres.shape, verbose=verbose)
            if seg_nii_modelres.is_empty:
                logger.print("Subregion mask is empty, skip this", Log_Type.FAIL)
                return output_paths, ErrCode.EMPTY
            logger.print("Output seg_nii", seg_nii_modelres.zoom, seg_nii_modelres.orientation, seg_nii_modelres.shape, verbose=verbose)

            # Lambda Injection
            if lambda_semantic is not None:
                seg_nii_modelres = lambda_semantic(seg_nii_modelres)
            if not return_output_instead_of_save:
                if save_raw:
                    seg_nii_modelres.save(out_spine_raw, verbose=logger)
                if save_softmax_logits and isinstance(softmax_logits, np.ndarray):
                    save_nparray(softmax_logits, out_logits)
            done_something = True
            if timing:
                logger.print(f"Predict semantic took: {perf_counter() - start_time2:.2f} seconds", Log_Type.OK, verbose=log_inference_time)
                start_time2 = perf_counter()
        else:
            logger.print("Subreg Mask already exists. Set -override_subreg to create it anew")
            seg_nii_modelres = NII.load(out_spine_raw, seg=True)
            logger.print("seg_nii", seg_nii_modelres.zoom, seg_nii_modelres.orientation, seg_nii_modelres.shape)
        # Second stage
        if not out_vert_raw.exists() or override_instance:
            whole_vert_nii, errcode = predict_instance_mask(
                seg_nii_modelres.copy(),
                model_instance,
                debug_data=debug_data_run,
                verbose=verbose,
                proc_inst_fill_3d_holes=proc_fill_3d_holes,
                proc_detect_and_solve_merged_corpi=proc_inst_detect_and_solve_merged_corpi,
                proc_corpus_clean=proc_inst_corpus_clean,
                proc_inst_clean_small_cc_artifacts=proc_inst_clean_small_cc_artifacts,
                proc_inst_largest_k_cc=proc_inst_largest_k_cc,
            )
            if errcode != ErrCode.OK:
                logger.print(f"Vert Mask creation failed with errcode {errcode}", Log_Type.FAIL)
                return output_paths, errcode
            assert whole_vert_nii is not None, "whole_vert_nii is None"
            whole_vert_nii = whole_vert_nii.copy()  # .reorient(orientation, verbose=True).rescale(zms, verbose=True)
            logger.print("vert_out", whole_vert_nii.zoom, whole_vert_nii.orientation, whole_vert_nii.shape, verbose=verbose)
            if save_raw and not return_output_instead_of_save:
                whole_vert_nii.save(out_vert_raw, verbose=logger)
            done_something = True
            if timing:
                logger.print(f"Predict instance took: {perf_counter() - start_time2:.2f} seconds", Log_Type.OK, verbose=log_inference_time)
                start_time2 = perf_counter()
        else:
            logger.print("Vert Mask already exists. Set -override_vert to create it anew")
            whole_vert_nii = NII.load(out_vert_raw, seg=True)

        # Cleanup Step
        if not out_spine.exists() or not out_vert.exists() or done_something or override_postpair:
            # back to input space
            #
            seg_nii_modelres[seg_nii_modelres == Location.Vertebra_Corpus.value] = Location.Vertebra_Corpus_border.value
            if not save_modelres_mask:
                seg_nii_back = seg_nii_modelres.resample_from_to(input_nii_)
                whole_vert_nii = whole_vert_nii.resample_from_to(input_nii_)
            else:
                seg_nii_back = seg_nii_modelres
            seg_nii_back.assert_affine(other=input_nii_)
            # use both seg_raw and vert_raw to clean each other, add ivd_ep ...
            has_c1 = model_semantic.inference_config.has_c1
            sacrum_ids = model_semantic.inference_config.sacrum_ids
            seg_nii_clean, vert_nii_clean = phase_postprocess_combined(
                img_nii=input_nii_,
                seg_nii=seg_nii_back,
                vert_nii=whole_vert_nii,
                model_labeling=model_labeling,
                debug_data=debug_data_run,
                proc_lab_force_no_tl_anomaly=proc_lab_force_no_tl_anomaly,
                labeling_offset=vertebra_instance_labeling_offset - 1,
                proc_clean_inst_by_sem=proc_clean_inst_by_sem,
                proc_assign_missing_cc=proc_assign_missing_cc,
                proc_assign_missing_cc_fast=proc_assign_missing_cc_fast,
                proc_vertebra_inconsistency=proc_vertebra_inconsistency,
                verbose=verbose,
                disable_c1=not has_c1,
                sacrum_ids=sacrum_ids,
            )
            seg_nii_clean.assert_affine(shape=vert_nii_clean.shape, zoom=vert_nii_clean.zoom, orientation=vert_nii_clean.orientation)
            vert_nii_clean.assert_affine(other=input_nii_)
            # input_package.make_nii_from_this(seg_nii_clean)
            # input_package.make_nii_from_this(vert_nii_clean)
            if not return_output_instead_of_save:
                seg_nii_clean.save(out_spine, verbose=logger)
                vert_nii_clean.save(out_vert, verbose=logger)
            done_something = True
            if timing:
                logger.print(f"Post Postprocess took: {perf_counter() - start_time2:.2f} seconds", Log_Type.OK, verbose=log_inference_time)
                start_time2 = perf_counter()
        else:
            seg_nii_clean = NII.load(out_spine, seg=True)
            vert_nii_clean = NII.load(out_vert, seg=True)

        # Centroid
        if not out_ctd.exists() or done_something or override_ctd:
            ctd = predict_centroids_from_both(
                vert_nii_clean,
                seg_nii_clean,
                models=[model_semantic, model_instance, model_labeling],  # TODO add labeling info and parameters
                parameter={l: v for l, v in arguments.items() if "proc_" in l},
            )
            ctd.resample_from_to(input_nii_).save(out_ctd, verbose=logger)
            done_something = True
            if timing:
                logger.print(f"Centroids took: {perf_counter() - start_time2:.2f} seconds", Log_Type.OK, verbose=log_inference_time)
                start_time2 = perf_counter()
        else:
            logger.print("Centroids already exists, will load instead. Set -override_ctd = True to create it anew")
            ctd = POI.load(out_ctd)

        # return_output_instead_of_save:
        if return_output_instead_of_save:
            return seg_nii_clean, vert_nii_clean, ctd, ErrCode.OK  # type: ignore

        # save debug
        if save_debug_data:
            if debug_data_run is None:
                logger.print("Save_debug_data: no debug data found", Log_Type.WARNING)
            else:
                out_debug.parent.mkdir(parents=True, exist_ok=True)
                for k, v in debug_data_run.items():
                    v.reorient_(input_nii_.orientation).save(
                        out_debug.joinpath(k + f"_{input_format}.nii.gz"), make_parents=True, verbose=False
                    )
                logger.print(f"Saved debug data into {out_debug}/*", Log_Type.OK)
                if timing:
                    logger.print(
                        f"Save debug data took: {perf_counter() - start_time2:.2f} seconds", Log_Type.OK, verbose=log_inference_time
                    )
                    start_time2 = perf_counter()

        # Snapshot
        if not out_snap.exists() or done_something:
            # make only snapshot
            if snapshot_copy_folder is not None:
                out_snap = [out_snap, out_snap2]
            ctd = ctd.extract_subregion(Location.Vertebra_Corpus)
            try:
                mri_snapshot(  # TODO update snapshot
                    img_ref,
                    vert_nii_clean,
                    ctd,
                    subreg_msk=seg_nii_clean,
                    out_path=out_snap,
                    mode="MRI" if img_ref.bids_format.lower() != "ct" else "CT",
                )
            except Exception:
                # Fall back for older TPTBox versions TODO remove later
                mri_snapshot(img_ref, vert_nii_clean, ctd, subreg_msk=seg_nii_clean, out_path=out_snap)
            logger.print(f"Snapshot saved into {out_snap}", Log_Type.SAVE)
            if timing:
                logger.print(f"Snapshot took: {perf_counter() - start_time2:.2f} seconds", Log_Type.OK, verbose=log_inference_time)
                start_time2 = perf_counter()
        elif not out_snap2.exists():
            logger.print(f"Copying snapshot into {snapshot_copy_folder!s}")
            out_snap2.parent.mkdir(exist_ok=True)
            shutil.copy(out_snap, out_snap2)

    logger.print(f"Pipeline took: {perf_counter() - start_time:.2f} seconds", Log_Type.OK, verbose=log_inference_time)
    return output_paths, ErrCode.OK

output_paths_from_input ¶

output_paths_from_input(
    img_ref: BIDS_FILE,
    derivative_name: str,
    snapshot_copy_folder: Path | str | None,
    input_format: str,
    non_strict_mode: bool = False,
) -> dict[str, Path]

Derives all pipeline output file paths for a given input image.

Builds the BIDS-conform output paths (semantic/vertebra masks, raw masks, centroids, snapshots, logits, debug and VIBESeg crop) used throughout the pipeline, keyed by a descriptive name.

Parameters:

Name	Type	Description	Default
`img_ref`	`BIDS_FILE`	Input BIDS_FILE the outputs are derived from.	required
`derivative_name`	`str`	Name of the derivatives output folder.	required
`snapshot_copy_folder`	`Path \| str \| None`	If given, location to which the snapshot is additionally copied (used to build out_snap2).	required
`input_format`	`str`	Format string of the input, used to name the debug and raw output subfolders.	required
`non_strict_mode`	`bool`	If true, builds the paths in non-strict BIDS mode. Defaults to False.	`False`

Returns:

Type	Description
`dict[str, Path]`	dict[str, Path]: Mapping of output names (e.g. "out_spine", "out_vert", "out_ctd", "out_snap") to their file paths.

Source code in spineps/seg_run.py

def output_paths_from_input(
    img_ref: BIDS_FILE,
    derivative_name: str,
    snapshot_copy_folder: Path | str | None,
    input_format: str,
    non_strict_mode: bool = False,
) -> dict[str, Path]:
    """Derives all pipeline output file paths for a given input image.

    Builds the BIDS-conform output paths (semantic/vertebra masks, raw masks, centroids, snapshots, logits, debug and
    VIBESeg crop) used throughout the pipeline, keyed by a descriptive name.

    Args:
        img_ref (BIDS_FILE): Input BIDS_FILE the outputs are derived from.
        derivative_name (str): Name of the derivatives output folder.
        snapshot_copy_folder (Path | str | None): If given, location to which the snapshot is additionally copied
            (used to build out_snap2).
        input_format (str): Format string of the input, used to name the debug and raw output subfolders.
        non_strict_mode (bool, optional): If true, builds the paths in non-strict BIDS mode. Defaults to False.

    Returns:
        dict[str, Path]: Mapping of output names (e.g. "out_spine", "out_vert", "out_ctd", "out_snap") to their file paths.
    """
    out_spine = img_ref.get_changed_path(
        bids_format="msk",
        parent=derivative_name,
        info={"seg": "spine", "mod": img_ref.format},
        non_strict_mode=non_strict_mode,
        make_parent=False,
    )
    out_vert = img_ref.get_changed_path(
        bids_format="msk",
        parent=derivative_name,
        info={"seg": "vert", "mod": img_ref.format},
        non_strict_mode=non_strict_mode,
        make_parent=False,
    )
    out_snap = img_ref.get_changed_path(
        bids_format="snp",
        file_type="png",
        parent=derivative_name,
        info={"seg": "spine", "mod": img_ref.format},
        non_strict_mode=non_strict_mode,
        make_parent=False,
    )
    out_ctd = img_ref.get_changed_path(
        bids_format="ctd",
        file_type="json",
        parent=derivative_name,
        info={"seg": "spine", "mod": img_ref.format},
        non_strict_mode=non_strict_mode,
        make_parent=False,
    )
    out_snap2 = Path(snapshot_copy_folder).joinpath(out_snap.name) if snapshot_copy_folder is not None else out_snap
    out_debug = out_vert.parent.joinpath(f"debug_{input_format}")
    out_raw = out_vert.parent.joinpath(f"output_raw_{input_format}")
    out_spine_raw = img_ref.get_changed_path(
        bids_format="msk",
        parent=derivative_name,
        info={"seg": "spine-raw", "mod": img_ref.format},
        non_strict_mode=non_strict_mode,
        make_parent=False,
    )
    out_spine_raw = out_raw.joinpath(out_spine_raw.name)
    out_vert_raw = img_ref.get_changed_path(
        bids_format="msk",
        parent=derivative_name,
        info={"seg": "vert-raw", "mod": img_ref.format},
        non_strict_mode=non_strict_mode,
        make_parent=False,
    )
    out_vert_raw = out_raw.joinpath(out_vert_raw.name)
    out_unc = img_ref.get_changed_path(
        bids_format="uncertainty",
        parent=derivative_name,
        info={"seg": "spine", "mod": img_ref.format},
        non_strict_mode=non_strict_mode,
        make_parent=False,
    )
    out_unc = out_raw.joinpath(out_unc.name)
    out_logits = img_ref.get_changed_path(
        file_type="npz",
        bids_format="logit",
        parent=derivative_name,
        info={"seg": "spine", "mod": img_ref.format},
        non_strict_mode=non_strict_mode,
        make_parent=False,
    )
    out_vibeseg = img_ref.get_changed_path(
        bids_format="msk",
        parent=derivative_name,
        info={"seg": "VIBESeg-100", "mod": img_ref.format},
        non_strict_mode=non_strict_mode,
        make_parent=False,
    )
    out_logits = out_raw.joinpath(out_logits.name)
    return {
        "out_spine": out_spine,
        "out_spine_raw": out_spine_raw,
        "out_vert": out_vert,
        "out_vert_raw": out_vert_raw,
        "out_unc": out_unc,
        "out_logits": out_logits,
        "out_snap": out_snap,
        "out_ctd": out_ctd,
        "out_snap2": out_snap2,
        "out_debug": out_debug,
        "out_raw": out_raw,
        "out_vibeseg": out_vibeseg,
    }

save_nparray ¶

save_nparray(arr: ndarray, out_path: Path)

Saves an numpy array to the disk

Parameters:

Name	Type	Description	Default
`arr`	`ndarray`	numpy array to be saved	required
`out_path`	`Path`	output path	required

Source code in spineps/seg_run.py

def save_nparray(arr: np.ndarray, out_path: Path):
    """Saves an numpy array to the disk

    Args:
        arr (np.ndarray): numpy array to be saved
        out_path (Path): output path
    """
    np.savez_compressed(out_path, arr)
    logger.print(f"Array of shape {arr.shape} saved: {out_path}", Log_Type.SAVE)

spineps.seg_pipeline¶

spineps.seg_pipeline ¶

Segmentation-pipeline helpers: shared logger, subregion label sets, centroid computation, and pipeline version reporting.

predict_centroids_from_both ¶

predict_centroids_from_both(
    vert_nii_cleaned: NII,
    seg_nii: NII,
    models: list[Segmentation_Model | None],
    parameter: dict[str, Any],
) -> poi.POI

Calculate the centroids of each vertebra corpus using both the semantic and instance masks.

Strips the IVD and endplate derived instance labels from the instance mask, computes the per-vertebra centroids from the instance and semantic masks, adds an S1 corpus centroid when sacrum is present, and records pipeline metadata (model descriptions, version, revision, timestamp, and the given parameters) on the result.

Parameters:

Name	Type	Description	Default
`vert_nii_cleaned`	`NII`	Cleaned vertebra instance segmentation mask.	required
`seg_nii`	`NII`	Subregion semantic segmentation mask.	required
`models`	`list[Segmentation_Model \| None]`	Models used in the pipeline, recorded in the centroid metadata.	required
`parameter`	`dict[str, Any]`	Pipeline parameters to record on the centroid metadata.	required

Returns:

Name	Type	Description
`POI`	`POI`	The computed point-of-interest / centroid object with pipeline metadata attached.

Source code in spineps/seg_pipeline.py

def predict_centroids_from_both(
    vert_nii_cleaned: NII,
    seg_nii: NII,
    models: list[Segmentation_Model | None],
    parameter: dict[str, Any],
) -> poi.POI:
    """Calculate the centroids of each vertebra corpus using both the semantic and instance masks.

    Strips the IVD and endplate derived instance labels from the instance mask, computes the per-vertebra centroids from the
    instance and semantic masks, adds an S1 corpus centroid when sacrum is present, and records pipeline metadata (model
    descriptions, version, revision, timestamp, and the given parameters) on the result.

    Args:
        vert_nii_cleaned (NII): Cleaned vertebra instance segmentation mask.
        seg_nii (NII): Subregion semantic segmentation mask.
        models (list[Segmentation_Model | None]): Models used in the pipeline, recorded in the centroid metadata.
        parameter (dict[str, Any]): Pipeline parameters to record on the centroid metadata.

    Returns:
        POI: The computed point-of-interest / centroid object with pipeline metadata attached.
    """
    vert_nii_4_centroids = vert_nii_cleaned.copy()
    labelmap = dict.fromkeys([*IVD_LABEL_RANGE, *ENDPLATE_LABEL_RANGE], 0)
    vert_nii_4_centroids.map_labels_(labelmap, verbose=False)

    ctd = poi.calc_poi_from_subreg_vert(vert_nii_4_centroids, seg_nii, verbose=logger)

    if v_name2idx["S1"] in vert_nii_cleaned.unique():
        s1_nii = vert_nii_cleaned.extract_label(v_name2idx["S1"], inplace=False)
        ctd[v_name2idx["S1"], 50] = center_of_mass(s1_nii.get_seg_array())

    models_repr = {}
    for idx, m in enumerate(models):
        if m is not None:
            models_repr[idx] = m.dict_representation()
        else:
            models_repr[idx] = {"name": "No Model"}
    ctd.info["source"] = "MRI Segmentation Pipeline"
    ctd.info["version"] = pipeline_version()
    ctd.info["models"] = models_repr
    ctd.info["revision"] = pipeline_revision()
    ctd.info["timestamp"] = format_time_short(get_time())
    for pname, pvalue in parameter.items():
        ctd.info[pname] = str(pvalue)
    return ctd

pipeline_version ¶

pipeline_version() -> str

Return the pipeline version string derived from the git commit count on main.

Returns:

Name	Type	Description
`str`	`str`	A version like `"v1.<commit-count>"`, or `"Version not found"` if git is unavailable.

Source code in spineps/seg_pipeline.py

def pipeline_version() -> str:
    """Return the pipeline version string derived from the git commit count on ``main``.

    Returns:
        str: A version like ``"v1.<commit-count>"``, or ``"Version not found"`` if git is unavailable.
    """
    try:
        label = subprocess.check_output(["git", "rev-list", "--count", "main"]).strip()
        label = str(label).replace("'", "")
        while not label[0].isdigit():
            label = label[1:]
    except Exception:
        return "Version not found"
    return "v1." + str(label)

pipeline_revision ¶

pipeline_revision() -> str

Return the current git revision string for the pipeline.

Returns:

Name	Type	Description
`str`	`str`	`"<git-describe>::<full-commit-hash>"`; either part is empty if the corresponding git call fails.

Source code in spineps/seg_pipeline.py

def pipeline_revision() -> str:
    """Return the current git revision string for the pipeline.

    Returns:
        str: ``"<git-describe>::<full-commit-hash>"``; either part is empty if the corresponding git call fails.
    """
    label = ""
    rev = ""
    try:
        label = subprocess.check_output(["git", "describe", "--always"]).strip()
    except Exception:
        pass
    try:
        rev = subprocess.check_output(["git", "rev-parse", "HEAD"]).decode("ascii").strip()
    except Exception:
        pass
    return str(label) + "::" + str(rev)

spineps.seg_utils¶

spineps.seg_utils ¶

Utilities for matching segmentation models to inputs by modality, acquisition, and resolution compatibility.

find_best_matching_model ¶

find_best_matching_model(
    modality_pair: Modality_Pair,
    expected_resolution: ZOOMS | None,
) -> Segmentation_Model

Select the segmentation model best matching a modality/acquisition pair and resolution.

Not yet implemented: intended to iterate over model configs and pick the one best matching the requested resolution.

Parameters:

Name	Type	Description	Default
`modality_pair`	`Modality_Pair`	The desired `(modality(ies), acquisition)` pair.	required
`expected_resolution`	`ZOOMS \| None`	The desired voxel resolution, or None.	required

Returns:

Name	Type	Description
`Segmentation_Model`	`Segmentation_Model`	The best-matching model (once implemented).

Raises:

Type	Description
`NotImplementedError`	Always, as this function is not yet implemented; also for an unmapped modality pair.

Source code in spineps/seg_utils.py

def find_best_matching_model(
    modality_pair: Modality_Pair,
    expected_resolution: ZOOMS | None,  # actual resolution here?
) -> Segmentation_Model:
    """Select the segmentation model best matching a modality/acquisition pair and resolution.

    Not yet implemented: intended to iterate over model configs and pick the one best matching the requested resolution.

    Args:
        modality_pair (Modality_Pair): The desired ``(modality(ies), acquisition)`` pair.
        expected_resolution (ZOOMS | None): The desired voxel resolution, or None.

    Returns:
        Segmentation_Model: The best-matching model (once implemented).

    Raises:
        NotImplementedError: Always, as this function is not yet implemented; also for an unmapped modality pair.
    """
    raise NotImplementedError("find_best_matching_model()")
    logger.print(expected_resolution)
    # TODO replace with automatic going through model configs to find best matching the resolution
    mapping: dict = {
        # (Modality.CT, Acquisition.sag): MODELS.CT_SEGMENTOR,
        # (Modality.T2w, Acquisition.sag): MODELS.T2w_NAKOSPIDER_HIGHRES,
        # (Modality.T1w, Acquisition.sag): MODELS.T1w_SEGMENTOR,
        # (Modality.Vibe, Acquisition.ax): MODELS.VIBE_SEGMENTOR,
        # (Modality.SEG, Acquisition.sag): MODELS.VERT_HIGHRES,
    }
    if isinstance(modality_pair[0], list) and len(modality_pair[0]) == 1:
        modality_pair = (modality_pair[0][0], modality_pair[1])
    if modality_pair not in mapping:
        raise NotImplementedError(str(modality_pair[0]), str(modality_pair[1]))
    else:
        return mapping[modality_pair]

check_model_modality_acquisition ¶

check_model_modality_acquisition(
    model: Segmentation_Model,
    mod_pair: Modality_Pair,
    verbose: bool = True,
) -> bool

Check whether a model supports a given modality/acquisition pair.

Compares the model's supported modalities and acquisition against the requested pair and logs a warning describing any mismatch when verbose is True.

Parameters:

Name	Type	Description	Default
`model`	`Segmentation_Model`	The model to check.	required
`mod_pair`	`Modality_Pair`	The required `(modality(ies), acquisition)` pair.	required
`verbose`	`bool`	If True, log a warning when incompatible.	`True`

Returns:

Name	Type	Description
`bool`	`bool`	True if the model supports all required modalities and the acquisition, otherwise False.

Source code in spineps/seg_utils.py

def check_model_modality_acquisition(
    model: Segmentation_Model,
    mod_pair: Modality_Pair,
    verbose: bool = True,
) -> bool:
    """Check whether a model supports a given modality/acquisition pair.

    Compares the model's supported modalities and acquisition against the requested pair and logs a warning describing any
    mismatch when ``verbose`` is True.

    Args:
        model (Segmentation_Model): The model to check.
        mod_pair (Modality_Pair): The required ``(modality(ies), acquisition)`` pair.
        verbose (bool): If True, log a warning when incompatible.

    Returns:
        bool: True if the model supports all required modalities and the acquisition, otherwise False.
    """
    compatible = True

    model_modalities = model.modalities()
    model_acquisition = model.acquisition()

    expected_modalities = mod_pair[0]
    expected_acquisition = mod_pair[1]

    if not isinstance(expected_modalities, list):
        expected_modalities = [expected_modalities]

    logger_texts = f"{mod_pair}: model incompatible"

    for m in expected_modalities:
        if m not in model_modalities:
            compatible = False
            logger_texts += f", model modalities {model_modalities}"

    if expected_acquisition != model_acquisition:
        compatible = False
        logger_texts += f", model acquisition {model_acquisition}"

    if not compatible:
        logger.print(logger_texts, Log_Type.WARNING, verbose=verbose)
    return compatible

add_ignore_text ¶

add_ignore_text(logger_texts: list[str]) -> None

Mark the last accumulated log message as ignored.

Drops the trailing character of the last message (its period) and appends an "(IGNORED)." suffix in place.

Parameters:

Name	Type	Description	Default
`logger_texts`	`list[str]`	Accumulated log messages; the last entry is modified in place.	required

Returns:

Name	Type	Description
`None`	`None`	`logger_texts` is modified in place.

Source code in spineps/seg_utils.py

def add_ignore_text(logger_texts: list[str]) -> None:
    """Mark the last accumulated log message as ignored.

    Drops the trailing character of the last message (its period) and appends an "(IGNORED)." suffix in place.

    Args:
        logger_texts (list[str]): Accumulated log messages; the last entry is modified in place.

    Returns:
        None: ``logger_texts`` is modified in place.
    """
    logger_texts[-1] = logger_texts[-1][:-1]
    logger_texts[-1] += ignored_text

check_input_model_compatibility ¶

check_input_model_compatibility(
    img_ref: BIDS_FILE,
    model: Segmentation_Model,
    ignore_modality: bool = False,
    ignore_acquisition: bool = False,
    ignore_labelkey: bool = False,
    verbose: bool = True,
) -> bool

Check whether an input image file is compatible with a model's expected modality, acquisition, and naming.

Validates the input's format/modality, acquisition plane, and BIDS keys against what the model expects. Individual mismatches can be tolerated via the ignore_* flags (annotated as "(IGNORED)" in the log). Debug files are always rejected, and the image plane must be isotropic or one of the model's allowed acquisitions. Warnings are logged when verbose is True.

Parameters:

Name	Type	Description	Default
`img_ref`	`BIDS_FILE`	Reference to the input image file.	required
`model`	`Segmentation_Model`	The model to check against.	required
`ignore_modality`	`bool`	If True, tolerate a modality/format mismatch.	`False`
`ignore_acquisition`	`bool`	If True, tolerate an acquisition mismatch.	`False`
`ignore_labelkey`	`bool`	If True, tolerate an unexpected `label` key in the filename.	`False`
`verbose`	`bool`	If True, log warnings describing incompatibilities.	`True`

Returns:

Name	Type	Description
`bool`	`bool`	True if the input is compatible with the model (after applying the ignore flags), otherwise False.

Source code in spineps/seg_utils.py

def check_input_model_compatibility(
    img_ref: BIDS_FILE,
    model: Segmentation_Model,
    ignore_modality: bool = False,
    ignore_acquisition: bool = False,
    ignore_labelkey: bool = False,
    verbose: bool = True,
) -> bool:
    """Check whether an input image file is compatible with a model's expected modality, acquisition, and naming.

    Validates the input's format/modality, acquisition plane, and BIDS keys against what the model expects. Individual mismatches
    can be tolerated via the ``ignore_*`` flags (annotated as "(IGNORED)" in the log). Debug files are always rejected, and the
    image plane must be isotropic or one of the model's allowed acquisitions. Warnings are logged when ``verbose`` is True.

    Args:
        img_ref (BIDS_FILE): Reference to the input image file.
        model (Segmentation_Model): The model to check against.
        ignore_modality (bool): If True, tolerate a modality/format mismatch.
        ignore_acquisition (bool): If True, tolerate an acquisition mismatch.
        ignore_labelkey (bool): If True, tolerate an unexpected ``label`` key in the filename.
        verbose (bool): If True, log warnings describing incompatibilities.

    Returns:
        bool: True if the input is compatible with the model (after applying the ignore flags), otherwise False.
    """
    model_modalities = model.modalities()
    model_acquisition = model.acquisition()
    allowed_format = Modality.format_keys(model_modalities)
    allowed_acq = [*Acquisition.format_keys(model_acquisition), "iso"]

    file_dir = img_ref.file["nii.gz"]
    filename = file_dir.name

    compatible = True

    input_format = img_ref.format
    has_seg_key = "seg" in img_ref.info
    has_label_key = "label" in img_ref.info
    input_acquisition = img_ref.info.get("acq", None)
    is_debug = "debug" in file_dir.name or "debug" in file_dir.parent.name

    logger_texts = [f"{filename} is incompatible with the selected model."]

    if input_format not in allowed_format:
        logger_texts.append(f"Input format '{input_format}' incompatible, model expected {allowed_format}.")
        if not ignore_modality:
            compatible = False
        else:
            add_ignore_text(logger_texts)
    if has_seg_key and allowed_format not in Modality.format_keys(Modality.SEG):
        logger_texts.append("Input acquisition not segmentation, but found a 'seg'-key.")
        if not ignore_modality:
            compatible = False
        else:
            add_ignore_text(logger_texts)

    if input_acquisition is not None and input_acquisition not in allowed_acq:
        logger_texts.append(f"Input acquisition '{input_acquisition}' incompatible, model expected {allowed_acq}.")
        if not ignore_acquisition:
            compatible = False
        else:
            add_ignore_text(logger_texts)

    if has_label_key:
        logger_texts.append("Found 'label' key in filename, which is not expected.")
        if not ignore_labelkey:
            compatible = False
        else:
            add_ignore_text(logger_texts)

    if is_debug:
        logger_texts.append("Probably a debug file (debug in name or parent).")
        compatible = False

    img_nii = img_ref.open_nii()
    if img_nii.get_plane() not in ["iso", *allowed_acq]:
        logger_texts.append(f"input get_plane() is not 'iso' or one of the expected {allowed_acq}.")
        compatible = False

    if len(logger_texts) > 1:
        logger.print(*logger_texts, Log_Type.WARNING, verbose=verbose)
    return compatible