/
What I learned building a debugger for PyTorch training loops and how it changed how I think about failure diagnosis [D] — Trendlair