AkihikoWatanabe / paper_notes Public

Notifications You must be signed in to change notification settings
Fork 0
Star 35

Code
Issues 1.8k
Pull requests
Actions
Projects
Wiki
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Actions
Projects
Wiki
Security
Insights

Issues: AkihikoWatanabe/paper_notes

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

1,781 Open 45 Closed

Author

Filter by author

Label

Filter by label

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Milestones

Filter by milestone

Assignee

Filter by who’s assigned

Assigned to nobody

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Issues list

Dataset Distillation: A Comprehensive Review, Ruonan Yu+, arXiv'23 Dataset Distillation MachineLearning Pocket Survey

#1836 opened Mar 25, 2025 by AkihikoWatanabe

Qwen2.5-VL-32B-Instruct, Qwen Team, 2025.03 ComputerVision LanguageModel MulltiModal NLP OpenWeightLLM

#1835 opened Mar 25, 2025 by AkihikoWatanabe

言語モデルの物理学, 佐藤竜馬, 2025.03 Analysis Article LanguageModel NLP

#1834 opened Mar 25, 2025 by AkihikoWatanabe

ExpertGenQA: Open-ended QA generation in Specialized Domains, Haz Sameen Shahgir+, arXiv'25 Evaluation InformationRetrieval NLP Pocket RAG(RetrievalAugmentedGeneration)

#1833 opened Mar 25, 2025 by AkihikoWatanabe

Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate, Yubo Wang+, arXiv'25 Finetuning (SFT) LanguageModel NLP

#1832 opened Mar 25, 2025 by AkihikoWatanabe

Transformers are SSMs: Generalized Models and Efficient Algorithms Through Structured State Space Duality, Tri Dao+, ICML'24 ICML LanguageModel NLP Pocket SSM (StateSpaceModel)

#1831 opened Mar 24, 2025 by AkihikoWatanabe

Nemotron-H: A Family of Accurate, Efficient Hybrid Mamba-Transformer Models, Nvidia, 2025.03 Article ComputerVision Efficiency/SpeedUp Finetuning (SFT) LanguageModel MulltiModal NLP Pretraining SSM (StateSpaceModel) Transformer

#1830 opened Mar 24, 2025 by AkihikoWatanabe

Scaling Data-Constrained Language Models, Niklas Muennighoff+, arXiv'23 LanguageModel MachineLearning NLP Pocket Scaling Laws

#1829 opened Mar 23, 2025 by AkihikoWatanabe

Scaling Laws for Neural Language Models, Jared Kaplan+, arXiv'20 LanguageModel MachineLearning NLP Pocket Scaling Laws

#1828 opened Mar 23, 2025 by AkihikoWatanabe

Training Compute-Optimal Large Language Models, Jordan Hoffmann+, arXiv'22 LanguageModel MachineLearning NLP Pocket Scaling Laws

#1827 opened Mar 23, 2025 by AkihikoWatanabe

Thinking Machines: A Survey of LLM based Reasoning Strategies, Dibyanayan Bandyopadhyay+, arXiv'25 LanguageModel NLP Pocket Reasoning Survey

#1826 opened Mar 23, 2025 by AkihikoWatanabe

Compute Optimal Scaling of Skills: Knowledge vs Reasoning, Nicholas Roberts+, arXiv'25 Pocket

#1825 opened Mar 23, 2025 by AkihikoWatanabe

8 Types of RoPE, Kseniase, 2025.03 Article Embeddings LanguageModel NLP Pocket PositionalEncoding Survey

#1823 opened Mar 23, 2025 by AkihikoWatanabe

The "think" tool: Enabling Claude to stop and think in complex tool use situations, Anthropic, 2025.03 Article Pocket

#1822 opened Mar 23, 2025 by AkihikoWatanabe

Understanding R1-Zero-Like Training: A Critical Perspective, 2025.03 Article LanguageModel MachineLearning Pocket Reasoning

#1821 opened Mar 22, 2025 by AkihikoWatanabe

Huayuan T1, Tencent, 2025.03 LanguageModel NLP ProprietaryLLM Reasoning SSM (StateSpaceModel)

#1820 opened Mar 22, 2025 by AkihikoWatanabe

Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models, Yang Sui+, arXiv'25 Efficiency/SpeedUp LanguageModel Pocket Reasoning Survey

#1819 opened Mar 22, 2025 by AkihikoWatanabe

Sudoku-bench, SakanaAI, 2025.03 Dataset LanguageModel NLP Reasoning translation_required

#1818 opened Mar 21, 2025 by AkihikoWatanabe

Lost-in-the-Middle in Long-Text Generation: Synthetic Dataset, Evaluation Framework, and Mitigation, Junhao Zhang+, arXiv'25 Dataset LanguageModel LongSequence NLP Pocket

#1817 opened Mar 20, 2025 by AkihikoWatanabe

Why Do Multi-Agent LLM Systems Fail?, Mert Cemri+, arXiv'25 LanguageModel LLMAgent Multi NLP Pocket

#1816 opened Mar 20, 2025 by AkihikoWatanabe

DAPO: An Open-Source LLM Reinforcement Learning System at Scale, Qiying Yu+, arXiv'25 LanguageModel MachineLearning Pocket ReinforcementLearning translation_required

#1815 opened Mar 20, 2025 by AkihikoWatanabe

Llama Nemotron, Nvidia, 2025.03 LanguageModel NLP OpenWeightLLM Reasoning

#1814 opened Mar 19, 2025 by AkihikoWatanabe

The First Few Tokens Are All You Need: An Efficient and Effective Unsupervised Prefix Fine-Tuning Method for Reasoning Models, Ke Ji+, arXiv'25 Adapter/LoRA Efficiency/SpeedUp Finetuning (SFT) NLP Reasoning

#1813 opened Mar 19, 2025 by AkihikoWatanabe

15 types of attention mechanisms, Kseniase, 2025.03 Article Attention Survey

#1812 opened Mar 18, 2025 by AkihikoWatanabe

Sample, Scrutinize and Scale: Effective Inference-Time Search by Scaling Verification, Eric Zhao+, arXiv'25 LanguageModel NLP Pocket Test-time Compute

#1811 opened Mar 18, 2025 by AkihikoWatanabe

Previous 1 2 3 4 5 … 71 72 Next

Previous Next

ProTip! Find all open issues with in progress development work with linked:pr.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly