Training Loop¶

Training and validation loop with mixed precision, gradient accumulation, and MLflow integration.

`training.train` ¶

`prepare_output_for_comparison(outputs: torch.Tensor, target_size: tuple[int, int], output_size: int | None = None) -> torch.Tensor` ¶

Prepare model outputs for comparison with target mask.

When using context window (model output larger than output_size), center-crops to output_size first, then upsamples to target_size.

Parameters:

Name	Type	Description	Default
`outputs`	`Tensor`	Model predictions with shape (B, C, H, W)	required
`target_size`	`tuple[int, int]`	Target spatial size (height, width) to match mask	required
`output_size`	`int \| None`	Expected output spatial size for center-cropping. If provided and output is larger, center-crops to this size. Use sentinel_patch_size when using context window.	`None`

Returns:

Type	Description
`Tensor`	Tensor with shape (B, C, target_size[0], target_size[1])

Source code in src/training/train.py

def prepare_output_for_comparison(
    outputs: torch.Tensor,
    target_size: tuple[int, int],
    output_size: int | None = None,
) -> torch.Tensor:
    """Prepare model outputs for comparison with target mask.

    When using context window (model output larger than output_size), center-crops
    to output_size first, then upsamples to target_size.

    Args:
        outputs: Model predictions with shape (B, C, H, W)
        target_size: Target spatial size (height, width) to match mask
        output_size: Expected output spatial size for center-cropping.
            If provided and output is larger, center-crops to this size.
            Use sentinel_patch_size when using context window.

    Returns:
        Tensor with shape (B, C, target_size[0], target_size[1])

    """
    if outputs.shape[-2:] == target_size:
        return outputs

    out_h, out_w = outputs.shape[-2:]

    # Center-crop if using context window
    if output_size is not None and out_h > output_size:
        crop_margin_h = (out_h - output_size) // 2
        crop_margin_w = (out_w - output_size) // 2
        outputs = outputs[
            :,
            :,
            crop_margin_h : crop_margin_h + output_size,
            crop_margin_w : crop_margin_w + output_size,
        ]

    # Upsample to target size
    if outputs.shape[-2:] != target_size:
        outputs = F.interpolate(
            outputs,
            size=target_size,
            mode="bilinear",
            align_corners=False,
        )

    return outputs

train(model: torch.nn.Module, train_loader: torch.utils.data.DataLoader, val_loader: torch.utils.data.DataLoader, criterion: torch.nn.Module, optimizer: torch.optim.Optimizer, device: torch.device, scheduler: LRScheduler | None = None, epochs: int = 100, patience: int = 20, num_classes: int = 13, other_class_index: int | None = None, accumulation_steps: int = 1, early_stopping_criterion: str = 'loss', *, use_amp: bool = False, apply_augmentations: bool = True, data_config: dict[str, Any] | None = None, log_evaluation_metrics: bool = True, log_model: bool = True, pruning_callback: Any | None = None, output_size: int | None = None, gradient_clip_val: float | None = None, sentinel_augmenter: Any | None = None) -> dict[str, list[float] | float] ¶

Train a segmentation model, monitoring validation loss and saving the best model.

Detailed metrics should be calculated separately after training using an evaluation function.