1 34 12

Naman Anand

naman5a

AI & ML interests

RAG , LLMs

Recent Activity

upvoted an article 4 days ago

We Got Claude to Fine-Tune an Open Source LLM

commented on an article 12 days ago

Continuous batching from first principles

upvoted an article 12 days ago

Continuous batching from first principles

View all activity

Organizations

upvoted an article 4 days ago

Article

We Got Claude to Fine-Tune an Open Source LLM

5 days ago

•

369

commented on Continuous batching from first principles 12 days ago

Love this article :) @ArthurZ

upvoted an article 12 days ago

Article

Continuous batching from first principles

14 days ago

•

256

upvoted 2 articles 14 days ago

Article

No GPU left behind: Unlocking Efficiency with Co-located vLLM in TRL

Jun 3

•

Article

20x Faster TRL Fine-tuning with RapidFire AI

18 days ago

•

upvoted a collection 3 months ago

InternVL3.5

Collection

This collection includes all released checkpoints of InternVL3.5, covering different training stages (e.g., Pretraining, SFT, MPO, Cascade RL). • 54 items • Updated Sep 28 • 103

commented a paper 4 months ago

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9 • 263 •

upvoted a paper 4 months ago

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9 • 263

liked a model 4 months ago

openai/gpt-oss-20b

Text Generation • 22B • Updated Aug 26 • 8.09M • • 4.03k

upvoted 4 articles 6 months ago

Article

How to train a new language model from scratch using Transformers and Tokenizers

Feb 14, 2020

•

Article

Introducing HELMET: Holistically Evaluating Long-context Language Models

Apr 16

•

Article

π0 and π0-FAST: Vision-Language-Action Models for General Robot Control

Feb 4

•

185

Article

Finally, a Replacement for BERT: Introducing ModernBERT

Dec 19, 2024

•

709

upvoted a paper 6 months ago

SWE-rebench: An Automated Pipeline for Task Collection and Decontaminated Evaluation of Software Engineering Agents

Paper • 2505.20411 • Published May 26 • 91

liked 2 models 7 months ago

nvidia/parakeet-tdt-0.6b-v2

Automatic Speech Recognition • Updated 11 days ago • 670k • 1.38k

docling-project/SmolDocling-256M-preview

Image-Text-to-Text • 0.3B • Updated Sep 17 • 138k • 1.6k

upvoted a collection 7 months ago

GLM-4-0414

Collection

GLM-4-0414 series model • 8 items • Updated Jun 30 • 133

liked a model 8 months ago

nari-labs/Dia-1.6B

Text-to-Speech • Updated Jun 1 • 162k • • 2.81k

upvoted a paper 8 months ago

AlayaDB: The Data Foundation for Efficient and Effective Long-context LLM Inference

Paper • 2504.10326 • Published Apr 14 • 25

liked a Space 8 months ago

Llama-4-Maverick-03-26-Experimental Battles

🔥

Display and filter chat conversations between models

Naman Anand

AI & ML interests

Recent Activity

Organizations

naman5a's activity

We Got Claude to Fine-Tune an Open Source LLM

Continuous batching from first principles

No GPU left behind: Unlocking Efficiency with Co-located vLLM in TRL

20x Faster TRL Fine-tuning with RapidFire AI

How to train a new language model from scratch using Transformers and Tokenizers

Introducing HELMET: Holistically Evaluating Long-context Language Models

π0 and π0-FAST: Vision-Language-Action Models for General Robot Control

Finally, a Replacement for BERT: Introducing ModernBERT

Llama-4-Maverick-03-26-Experimental Battles