# Container & Slurm Recipes: Bridging Single 4090 to Cloud Multi-Node

> Source: https://sukruyusufkaya.com/en/learn/fine-tuning-cookbook/ftc-container-slurm-recipes-multi-node
> Updated: 2026-05-14T14:42:49.279Z
> Category: Fine-Tuning Cookbook (Model-by-Model)
> Module: Part 0 — Engineering Foundations
**TLDR:** How to take a recipe you prepared on a single 4090 to an 8×H100 cluster: Slurm sbatch template, multi-node NCCL setup, EFA/InfiniBand sanity check, real hourly prices for Lambda/RunPod/CoreWeave/Vast, preemption-tolerant training, checkpoint manifest, FAULT_TOLERANCE principles.

