SumanthRH/README.md

Hi there 👋

  • 😄 I'm Sumanth, a software engineer at Anyscale, working on post-training. My primary interests are broadly in machine learning and systems engineering.
  • 🚀 I'm trying to understand generative models, and have worked on finetuning and in-context learning for language models. Addicted to compute 🤖
  • 💻 I'm currently working on SkyThought and SkyRL.
  • 🌱 I'm trying to learn what it takes to build machine learning systems in practice.
  • ✨ I have a blog: https://sumanthrh.com
  • 💬 Some samples of my writing:

Pinned

  1. NovaSky-AI/SkyThought Public

    Sky-T1: Train your own O1 preview model within $450

    Python · 3.3k stars · 324 forks

  2. NovaSky-AI/SkyRL Public

    SkyRL-v0: Train Real-World Long-Horizon Agents via Reinforcement Learning

    Python · 322 stars · 28 forks

  3. tokenization Public

    A comprehensive deep dive into the world of tokens

    Python · 223 stars · 11 forks

  4. frankxwang/dpo-prefix-sharing Public

    DPO, but faster 🚀

    Python · 42 stars · 4 forks

  5. peft Public

    Forked from huggingface/peft

    Fork of 🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning. Our implementation of IA3, a new fine-tuning method, is now part of the official Hugging Face library!

    Python

  6. varun19299/deep-atrous-guided-filter Public

    Deep Atrous Guided Filter for Image Restoration in Under Display Cameras (UDC Challenge, ECCV 2020).

    Jupyter Notebook · 36 stars · 6 forks