The RuntimeError: CUDA out of memory occurs when your GPU doesn't have enough memory to store the model, inputs, and intermediate computations. Here's how to resolve it:
- Reduce Batch Size
- Lower the batch size to reduce memory usage.
- Enable Gradient Accumulation
- Simulate a larger batch size by splitting it across iterations.
- Use Mixed Precision Training
- Leverage torch.cuda.amp to minimize memory usage.
- Free Unused Tensors
- Use torch.cuda.empty_cache() to clear unused memory.
- Model Checkpoints
- Save GPU memory by not retaining intermediate states.
- Use Smaller Model or Layers
- Replace heavy layers with lightweight alternatives, e.g., MobileNet instead of ResNet.
Here is the code snippet you can refer to, explaining the following steps:
data:image/s3,"s3://crabby-images/5d8c7/5d8c7e19dee7eadd0c04cb83dfb0923e19086d7e" alt=""
data:image/s3,"s3://crabby-images/c27b5/c27b5f90f7591037643c6ff6910a44f6b245cb20" alt=""
data:image/s3,"s3://crabby-images/f6ef8/f6ef87d4bf87cfb10fb6f69f9e1c06fc88b74153" alt=""
data:image/s3,"s3://crabby-images/77e3b/77e3b3f09dc82ecb5195e8a1f945090859951c8f" alt=""
data:image/s3,"s3://crabby-images/30bab/30bab420d15a409e8333e16f1d21c5149651f397" alt=""
Hence, these techniques can be applied to manage GPU memory efficiently.