9 139

一万篇论文笔记

10Kpapers
  • 10Kpapers
  • MachineLovesLearning

AI & ML interests

None yet

Organizations

None yet

upvoted an article 10 months ago
Article

A Gentle Introduction to 8-bit Matrix Multiplication for transformers at scale using transformers, accelerate and bitsandbytes

Aug 17, 2022 • 118
upvoted a collection 10 months ago

Tulu 3 Datasets

Collection
All datasets released with Tulu 3 -- state of the art open post-training recipes. • 33 items • Updated about 23 hours ago • 96
upvoted an article 10 months ago
Article

Open-source DeepResearch – Freeing our search agents

Feb 4 • 1.31k
upvoted a collection 10 months ago

Tulu 3 Models

Collection
All models released with Tulu 3 -- state of the art open post-training recipes. • 11 items • Updated about 23 hours ago • 103
upvoted 4 collections 11 months ago

DeepSeek-Math

Collection
DeepSeek Math series • 6 items • Updated 14 days ago • 44

DeepSeek-V2

Collection
8 items • Updated 14 days ago • 34

DeepSeek-MoE

Collection
DeepSeek MoE series • 3 items • Updated 14 days ago • 22

Qwen2

Collection
Qwen2 language models, including pretrained and instruction-tuned models of 5 sizes, including 0.5B, 1.5B, 7B, 57B-A14B, and 72B. • 39 items • Updated Jul 21 • 374