Vadim Kataev

vkataev

vkataev

AI & ML interests

LLMs, all types of ASR models, methods to reduce model sizes, methods to improve generalization, methods to increase model capacity

Recent Activity

upvoted an article 21 days ago

We Got Claude to Fine-Tune an Open Source LLM

upvoted a paper 27 days ago

From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence

liked a Space about 2 months ago

HuggingFaceTB/smol-training-playbook

View all activity

Organizations

upvoted an article 21 days ago

Article

We Got Claude to Fine-Tune an Open Source LLM

27 days ago

•

549

upvoted a paper 27 days ago

From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence

Paper • 2511.18538 • Published Nov 23 • 278

liked a Space about 2 months ago

The Smol Training Playbook

📚

2.75k

The secrets to building world-class LLMs

liked a dataset 3 months ago

karpathy/fineweb-edu-100b-shuffle

Viewer • Updated Sep 25 • 97.2M • 30.9k • 141

upvoted an article 3 months ago

Article

Visualizing How VLMs Work

Oct 7

•

upvoted 2 papers 3 months ago

Agent Learning via Early Experience

Paper • 2510.08558 • Published Oct 9 • 269

The Dragon Hatchling: The Missing Link between the Transformer and Models of the Brain

Paper • 2509.26507 • Published Sep 30 • 537

liked 3 datasets 3 months ago

liked a Space 3 months ago

The Tokenizer Playground

📝

615

Experiment with and compare different tokenizers

liked a model 4 months ago

facebook/MobileLLM-R1-360M-base

Text Generation • 0.4B • Updated Nov 10 • 561 • 12

upvoted a paper 4 months ago

The Landscape of Agentic Reinforcement Learning for LLMs: A Survey

Paper • 2509.02547 • Published Sep 2 • 227

liked 4 datasets 4 months ago

FreedomIntelligence/medical-o1-reasoning-SFT

Viewer • Updated Apr 22 • 90.1k • 5.28k • 1.01k

wikimedia/wikipedia

Viewer • Updated Jan 9, 2024 • 61.6M • 68.1k • 1.03k

intfloat/wikidata5m

Viewer • Updated Dec 24, 2022 • 4.82M • 156 • 9

intfloat/wikipedia

Updated Apr 23, 2023 • 37 • 7

upvoted a collection 4 months ago

Nemotron-Pre-Training-Datasets

Collection

Large scale pre-training datasets used in the Nemotron family of models. • 11 items • Updated 7 days ago • 84

commented a paper 5 months ago

On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification

Paper • 2508.05629 • Published Aug 7 • 180 •

upvoted a paper 6 months ago

Drag-and-Drop LLMs: Zero-Shot Prompt-to-Weights

Paper • 2506.16406 • Published Jun 19 • 130

Vadim Kataev

AI & ML interests

Recent Activity

Organizations

vkataev's activity

We Got Claude to Fine-Tune an Open Source LLM

The Smol Training Playbook

Visualizing How VLMs Work

The Tokenizer Playground