The Computation, Language, Intelligence and Grounding Lab at the University of Waterloo
Repositories
- vlm-lens Public
Extracting internal representations from vision-language models. Doc: https://compling-wat.github.io/vlm-lens/
- groundingLMM Public (forked from mbzuai-oryx/groundingLMM)
[CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.
- Janus Public (forked from deepseek-ai/Janus)
Janus-Series: Unified Multimodal Understanding and Generation Models