GPU Memory != Utilization

While optimizing WDN, I noticed GPU memory flatlined at 2.2 GB no matter the batch size. But utilization told a different story: 50% at 10k rows, 95% at 100k. Don’t judge performance by memory alone. Watch utilization and time per batch. Small batches give you low memory _and_ a half-idle GPU. [[ML]] [[Serendipity]]
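
A rough sketch of how I'd check this next time, not the script I actually ran on WDN. `time_batch` and `gpu_snapshot` are names made up for this note, and `process_batch` is a hypothetical stand-in for whatever runs one batch. The snapshot assumes the `nvidia-smi` CLI is on the PATH and returns `None` when it isn't.

```python
import shutil
import subprocess
import time


def gpu_snapshot():
    """Return (utilization_percent, memory_used_mib) for the first GPU, or None.

    Assumes the nvidia-smi CLI is installed; returns None if it is missing or fails.
    """
    if shutil.which("nvidia-smi") is None:
        return None
    try:
        out = subprocess.run(
            ["nvidia-smi", "--query-gpu=utilization.gpu,memory.used",
             "--format=csv,noheader,nounits"],
            capture_output=True, text=True, check=True, timeout=5,
        ).stdout
        # One CSV line per GPU, e.g. "50, 2200"; only the first GPU is read.
        util, mem = out.strip().splitlines()[0].split(",")
        return float(util), float(mem)
    except (subprocess.SubprocessError, OSError, ValueError, IndexError):
        return None


def time_batch(process_batch, batch_size, repeats=3):
    """Return (seconds_per_batch, rows_per_second), best of `repeats` runs.

    `process_batch` is a hypothetical callable taking the batch size. For GPU
    work it should block until the device has finished, otherwise this only
    times the kernel launch.
    """
    best = float("inf")
    for _ in range(repeats):
        start = time.perf_counter()
        process_batch(batch_size)
        best = min(best, time.perf_counter() - start)
    best = max(best, 1e-9)  # guard against a zero-length measurement
    return best, batch_size / best


if __name__ == "__main__":
    def fake_step(n):
        # Placeholder CPU workload so the sketch runs without a GPU.
        sum(range(n))

    for size in (10_000, 100_000):
        seconds, rows_per_s = time_batch(fake_step, size)
        print(size, f"{seconds:.6f}s/batch", f"{rows_per_s:,.0f} rows/s", gpu_snapshot())
```

Rows per second is the number to compare across batch sizes. The utilization reading is sampled over a short window, so it is more trustworthy when polled while a batch is actually running (for example from a second terminal) than right after one finishes.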