Skip to content
View Dev1nW's full-sized avatar

Block or report Dev1nW

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Dev1nW/README.md

👋 Hi, I'm Devin!

I'm a Machine Learning Researcher with the Army Educational Outreach Program (AEOP). With over three years of combined professional and academic research experience, I specialize in investigating the emergent capabilities of Large Language Models (LLMs), particularly in interactive environments (like Atari!), alongside expertise in Reinforcement Learning (RL) and Reinforcement Learning from Human Feedback (RLHF), specifically Rating-based RL (RbRL). My focus is on leveraging human insights to enhance AI system learning and alignment.


🧠 Skills

  • Programming & Libraries: Python, PyTorch, TensorFlow, Stable Baselines3, Apple MLX, NumPy, Pandas, Matplotlib, Gymnasium
  • APIs & Tools: OpenAI API, Google Gemini API, Hugging Face API, Git & GitHub
  • AI/ML Concepts: Reinforcement Learning (RL), Reinforcement Learning from Human Feedback (RLHF), Rating-based RL (RbRL), Large Language Models (LLMs), Natural Language Processing (NLP)

🔭 Research Interests

  • Reinforcement Learning from Human Feedback (RLHF) and Alignment: Specifically focusing on using ratings, as in Rating-Based Reinforcement Learning.
  • Large Language Models (LLMs): Exploring emergent capabilities, agentic behavior, and applications, including using LLMs in playing Atari games, as in Atari-GPT.
  • Human-AI Interaction: Designing and studying systems where human input guides AI learning.

📫 Connect with me:

LinkedIn Twitter Google Scholar Website

Pinned Loading

  1. Simplified-Rating-and-Preference-RL Simplified-Rating-and-Preference-RL Public

    Simplified, modern implementation of Rating and Preference-based Reinforcement Learning.

    Python

  2. atari-gpt atari-gpt Public

    Forked from nwayt001/atari-gpt

    Official Codebase for Atari-GPT

    Python

  3. Rating-based-Reinforcement-Learning Rating-based-Reinforcement-Learning Public

    Official Codebase for Rating-Based Reinforcement Learning.

    Python 3

  4. Sign_language_recognition_final_project Sign_language_recognition_final_project Public

    Sign Language Recognition code using GRU, LSTM and Simple RNN.

    Jupyter Notebook 1

  5. Gemini_Research_Reviewer Gemini_Research_Reviewer Public

    Gemini Research Reviewer is an AI-powered tool that provides instant, constructive feedback on research papers, helping researchers improve their work with actionable insights and refined writing s…

    Python 1

  6. ASCII_Breakout ASCII_Breakout Public

    An ASCII version of Breakout aimed at having Large Language Models play Breakout.

    Python 1