Repositories list
75 repositories
- 🌾 OAT: A research-friendly framework for LLM online alignment, including preference learning, reinforcement learning, etc.
- 🚢 Data Toolkit for Sailor Language Models
- Megatron-Sailor2 (Public)
- regmix (Public)
- sailcompass (Public)
- Automatic Functional Differentiation in JAX
- I-FSJ (Public)
- InfNeRF (Public)
- sailor-llm (Public)
- inceptionnext (Public): InceptionNeXt: When Inception Meets ConvNeXt (CVPR 2024)
- stde (Public)
- optim4rl (Public): Optim4RL is a JAX framework for learning to optimize for reinforcement learning.
- [ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".
- Cheating-LLM-Benchmarks (Public)
- P-DoS (Public)
- SimLayerKV (Public)
- [ICLR 2025] When Attention Sink Emerges in Language Models: An Empirical View (Spotlight)
- [NeurIPS-2024] 📈 Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies https://arxiv.org/abs/2407.13623
- envpool (Public): C++-based high-performance parallel environment execution engine (vectorized env) for general RL environments.