Batch size #48

stijndebeer · 2025-04-08T18:02:41Z

stijndebeer
Apr 8, 2025

So i was trying some implementation of my model and it is currently getting really big, so to train it properly I have to reduce the batch size to 4. Is there a way to increase the batch size while maintaining model complexity? My training is currently quite unstable compared to previous models although performance isn't too bad yet.

Answered by ghost

Apr 8, 2025

You could use automatic mixed precision (AMP) to reduce memory usage allowing for a larger actual batchsize on the gpu. If that's still not enough consider implementing gradient accumulation to increase your effective batchsize (accumulating gradients over multiple batches and updating weights after a specific number of batches).

View full answer

ghost · 2025-04-08T18:50:54Z

ghost
Apr 8, 2025

You could use automatic mixed precision (AMP) to reduce memory usage allowing for a larger actual batchsize on the gpu. If that's still not enough consider implementing gradient accumulation to increase your effective batchsize (accumulating gradients over multiple batches and updating weights after a specific number of batches).

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

TU/e ARIA lab

Batch size #48

Uh oh!

{{title}}

Uh oh!

Replies: 1 comment

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

TU/e ARIA lab

Batch size #48

Uh oh!

stijndebeer Apr 8, 2025

Replies: 1 comment

Uh oh!

ghost Apr 8, 2025

stijndebeer
Apr 8, 2025

ghost
Apr 8, 2025