I'm a Machine Learning Researcher with the Army Educational Outreach Program (AEOP). With over three years of combined professional and academic research experience, I specialize in investigating the emergent capabilities of Large Language Models (LLMs), particularly in interactive environments (like Atari!), alongside expertise in Reinforcement Learning (RL) and Reinforcement Learning from Human Feedback (RLHF), specifically Rating-based RL (RbRL). My focus is on leveraging human insights to enhance AI system learning and alignment.
- Programming & Libraries: Python, PyTorch, TensorFlow, Stable Baselines3, Apple MLX, NumPy, Pandas, Matplotlib, Gymnasium
- APIs & Tools: OpenAI API, Google Gemini API, Hugging Face API, Git & GitHub
- AI/ML Concepts: Reinforcement Learning (RL), Reinforcement Learning from Human Feedback (RLHF), Rating-based RL (RbRL), Large Language Models (LLMs), Natural Language Processing (NLP)
- Reinforcement Learning from Human Feedback (RLHF) and Alignment: Specifically focusing on using ratings, as in Rating-Based Reinforcement Learning.
- Large Language Models (LLMs): Exploring emergent capabilities, agentic behavior, and applications, including using LLMs in playing Atari games, as in Atari-GPT.
- Human-AI Interaction: Designing and studying systems where human input guides AI learning.