GPU Model
Single GPU memory estimation
Precision
BF16 recommended for training on modern GPUs (Ampere+). FP16 for older GPUs (V100, T4).
Optimizer
Keep FP32 copy of weights for stability. Recommended for FP16, optional for BF16.