/
I trained a 75M parameter LLM from scratch on 18B to... — Trendlair