Skip to content

Latest commit

 

History

History
28 lines (17 loc) · 673 Bytes

README.md

File metadata and controls

28 lines (17 loc) · 673 Bytes

Training with ThunderKittens

This repo provides example code to train with the ThunderKittens attention kernels.

Structure:

PyTorch Lightning

  • ./train/ includes PyTorch Lightning training code, the README there describes how to get started
  • ./tktrainer/ includes the corresponding PyTorch modules with the implementations To set this up:
python setup.py install

Sample command, which will is controlled by the yaml config at: train/configs/experiments/tk:

cd train/
python run.py experiment=tk/owt_tk_gpts trainer.devices=1

nanoGPT

  • ./nano-train/ includes nanoGPT training code, the README there describes how to get started