- Implementing the Simplest Neural Network
- Implementing a Multilayer Neural Network
- Blog: A closer look at "training" a trillion-parameter model on Frontier
- Blog: LLM training without a parallel file system
- Garden: Challenges of LLM training at scale
- Garden: Computational requirements of LLM training
- Garden: Data processing for LLM training
- Garden: Scaling laws for LLM training