-
DAPO: An Open-Source LLM Reinforcement Learning System at Scale
Paper β’ 2503.14476 β’ Published β’ 142 -
Training language models to follow instructions with human feedback
Paper β’ 2203.02155 β’ Published β’ 24 -
Llama 2: Open Foundation and Fine-Tuned Chat Models
Paper β’ 2307.09288 β’ Published β’ 247 -
The Llama 3 Herd of Models
Paper β’ 2407.21783 β’ Published β’ 117
Collections
Discover the best community collections!
Collections including paper arxiv:2307.09288
-
Will we run out of data? An analysis of the limits of scaling datasets in Machine Learning
Paper β’ 2211.04325 β’ Published β’ 1 -
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Paper β’ 1810.04805 β’ Published β’ 24 -
On the Opportunities and Risks of Foundation Models
Paper β’ 2108.07258 β’ Published β’ 1 -
Super-NaturalInstructions: Generalization via Declarative Instructions on 1600+ NLP Tasks
Paper β’ 2204.07705 β’ Published β’ 2
-
Mistral 7B
Paper β’ 2310.06825 β’ Published β’ 55 -
Llama 2: Open Foundation and Fine-Tuned Chat Models
Paper β’ 2307.09288 β’ Published β’ 247 -
OpenChat: Advancing Open-source Language Models with Mixed-Quality Data
Paper β’ 2309.11235 β’ Published β’ 15 -
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
Paper β’ 2501.12948 β’ Published β’ 429
-
black-forest-labs/FLUX.1-dev
Text-to-Image β’ Updated β’ 1.15M β’ β’ 12k -
openai/whisper-large-v3-turbo
Automatic Speech Recognition β’ 0.8B β’ Updated β’ 4.63M β’ β’ 2.72k -
meta-llama/Llama-3.2-11B-Vision-Instruct
Image-Text-to-Text β’ 11B β’ Updated β’ 181k β’ β’ 1.54k -
deepseek-ai/DeepSeek-V2.5
Text Generation β’ 236B β’ Updated β’ 2.37k β’ β’ 731
-
Qwen Technical Report
Paper β’ 2309.16609 β’ Published β’ 37 -
Qwen-Audio: Advancing Universal Audio Understanding via Unified Large-Scale Audio-Language Models
Paper β’ 2311.07919 β’ Published β’ 10 -
Qwen2 Technical Report
Paper β’ 2407.10671 β’ Published β’ 167 -
Qwen2-Audio Technical Report
Paper β’ 2407.10759 β’ Published β’ 62
-
Qwen2.5 Technical Report
Paper β’ 2412.15115 β’ Published β’ 376 -
Qwen2.5-Coder Technical Report
Paper β’ 2409.12186 β’ Published β’ 152 -
Qwen2.5-Math Technical Report: Toward Mathematical Expert Model via Self-Improvement
Paper β’ 2409.12122 β’ Published β’ 4 -
Qwen2.5-VL Technical Report
Paper β’ 2502.13923 β’ Published β’ 211
-
DAPO: An Open-Source LLM Reinforcement Learning System at Scale
Paper β’ 2503.14476 β’ Published β’ 142 -
Training language models to follow instructions with human feedback
Paper β’ 2203.02155 β’ Published β’ 24 -
Llama 2: Open Foundation and Fine-Tuned Chat Models
Paper β’ 2307.09288 β’ Published β’ 247 -
The Llama 3 Herd of Models
Paper β’ 2407.21783 β’ Published β’ 117
-
Will we run out of data? An analysis of the limits of scaling datasets in Machine Learning
Paper β’ 2211.04325 β’ Published β’ 1 -
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Paper β’ 1810.04805 β’ Published β’ 24 -
On the Opportunities and Risks of Foundation Models
Paper β’ 2108.07258 β’ Published β’ 1 -
Super-NaturalInstructions: Generalization via Declarative Instructions on 1600+ NLP Tasks
Paper β’ 2204.07705 β’ Published β’ 2
-
Qwen Technical Report
Paper β’ 2309.16609 β’ Published β’ 37 -
Qwen-Audio: Advancing Universal Audio Understanding via Unified Large-Scale Audio-Language Models
Paper β’ 2311.07919 β’ Published β’ 10 -
Qwen2 Technical Report
Paper β’ 2407.10671 β’ Published β’ 167 -
Qwen2-Audio Technical Report
Paper β’ 2407.10759 β’ Published β’ 62
-
Mistral 7B
Paper β’ 2310.06825 β’ Published β’ 55 -
Llama 2: Open Foundation and Fine-Tuned Chat Models
Paper β’ 2307.09288 β’ Published β’ 247 -
OpenChat: Advancing Open-source Language Models with Mixed-Quality Data
Paper β’ 2309.11235 β’ Published β’ 15 -
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
Paper β’ 2501.12948 β’ Published β’ 429
-
black-forest-labs/FLUX.1-dev
Text-to-Image β’ Updated β’ 1.15M β’ β’ 12k -
openai/whisper-large-v3-turbo
Automatic Speech Recognition β’ 0.8B β’ Updated β’ 4.63M β’ β’ 2.72k -
meta-llama/Llama-3.2-11B-Vision-Instruct
Image-Text-to-Text β’ 11B β’ Updated β’ 181k β’ β’ 1.54k -
deepseek-ai/DeepSeek-V2.5
Text Generation β’ 236B β’ Updated β’ 2.37k β’ β’ 731
-
Qwen2.5 Technical Report
Paper β’ 2412.15115 β’ Published β’ 376 -
Qwen2.5-Coder Technical Report
Paper β’ 2409.12186 β’ Published β’ 152 -
Qwen2.5-Math Technical Report: Toward Mathematical Expert Model via Self-Improvement
Paper β’ 2409.12122 β’ Published β’ 4 -
Qwen2.5-VL Technical Report
Paper β’ 2502.13923 β’ Published β’ 211