-
Notifications
You must be signed in to change notification settings - Fork 0
Issues: AkihikoWatanabe/paper_notes
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Dataset Distillation: A Comprehensive Review, Ruonan Yu+, arXiv'23
Dataset
Distillation
MachineLearning
Pocket
Survey
#1836
opened Mar 25, 2025 by
AkihikoWatanabe
Qwen2.5-VL-32B-Instruct, Qwen Team, 2025.03
ComputerVision
LanguageModel
MulltiModal
NLP
OpenWeightLLM
#1835
opened Mar 25, 2025 by
AkihikoWatanabe
言語モデルの物理学, 佐藤竜馬, 2025.03
Analysis
Article
LanguageModel
NLP
#1834
opened Mar 25, 2025 by
AkihikoWatanabe
Scaling Data-Constrained Language Models, Niklas Muennighoff+, arXiv'23
LanguageModel
MachineLearning
NLP
Pocket
Scaling Laws
#1829
opened Mar 23, 2025 by
AkihikoWatanabe
Scaling Laws for Neural Language Models, Jared Kaplan+, arXiv'20
LanguageModel
MachineLearning
NLP
Pocket
Scaling Laws
#1828
opened Mar 23, 2025 by
AkihikoWatanabe
Training Compute-Optimal Large Language Models, Jordan Hoffmann+, arXiv'22
LanguageModel
MachineLearning
NLP
Pocket
Scaling Laws
#1827
opened Mar 23, 2025 by
AkihikoWatanabe
Compute Optimal Scaling of Skills: Knowledge vs Reasoning, Nicholas Roberts+, arXiv'25
Pocket
#1825
opened Mar 23, 2025 by
AkihikoWatanabe
8 Types of RoPE, Kseniase, 2025.03
Article
Embeddings
LanguageModel
NLP
Pocket
PositionalEncoding
Survey
#1823
opened Mar 23, 2025 by
AkihikoWatanabe
Understanding R1-Zero-Like Training: A Critical Perspective, 2025.03
Article
LanguageModel
MachineLearning
Pocket
Reasoning
#1821
opened Mar 22, 2025 by
AkihikoWatanabe
Huayuan T1, Tencent, 2025.03
LanguageModel
NLP
ProprietaryLLM
Reasoning
SSM (StateSpaceModel)
#1820
opened Mar 22, 2025 by
AkihikoWatanabe
Sudoku-bench, SakanaAI, 2025.03
Dataset
LanguageModel
NLP
Reasoning
translation_required
#1818
opened Mar 21, 2025 by
AkihikoWatanabe
Why Do Multi-Agent LLM Systems Fail?, Mert Cemri+, arXiv'25
LanguageModel
LLMAgent
Multi
NLP
Pocket
#1816
opened Mar 20, 2025 by
AkihikoWatanabe
Llama Nemotron, Nvidia, 2025.03
LanguageModel
NLP
OpenWeightLLM
Reasoning
#1814
opened Mar 19, 2025 by
AkihikoWatanabe
15 types of attention mechanisms, Kseniase, 2025.03
Article
Attention
Survey
#1812
opened Mar 18, 2025 by
AkihikoWatanabe
Previous Next
ProTip!
Find all open issues with in progress development work with linked:pr.