SaminYeasar

Follow

Samin Yeasar Arnob SaminYeasar

Follow

PhD student at Computer Science McGill University Canada

18 followers · 2 following

Achievements

Achievements

Highlights

Pro

Pinned Loading

sparse_adapter sparse_adapter Public

Forked from microsoft/mttl

Building modular LMs with parameter-efficient fine-tuning.

Python
unpaired_rlhf unpaired_rlhf Public

Forked from sahandrez/unpaired_rlhf

Reinforcement Learning from Human Feedback (RLHF) with Unpaired Preferences

Python
llm_alignment llm_alignment Public

Python
DAPD DAPD Public

Official Implementation of Data Adaptive Pathway Discovery (DAPD) for Online RL

Python 1
Offline-Reinforcement-Learning-Algorithms Offline-Reinforcement-Learning-Algorithms Public

PyTorch Implementation of Offline Reinforcement Learning algorithms

Python 5 1
Off_Policy_Adversarial_Inverse_Reinforcement_Learning Off_Policy_Adversarial_Inverse_Reinforcement_Learning Public

Implementation of Off Policy Adversarial Inverse Reinforcement Learning

Python 22 3