aslessor (Alex)

upvoted a paper 4 months ago

Neither Valid nor Reliable? Investigating the Use of LLMs as Judges

Paper • 2508.18076 • Published Aug 25, 2025 • 6

upvoted an article 5 months ago

Article

Introducing Command A Vision: Multimodal AI built for Business

Jul 31, 2025

•

63

upvoted a paper 9 months ago

LLM-Powered GUI Agents in Phone Automation: Surveying Progress and Prospects

Paper • 2504.19838 • Published Apr 28, 2025 • 22

upvoted an article 9 months ago

Article

Tiny Agents: an MCP-powered agent in 50 lines of code

Apr 25, 2025

•

305

upvoted a paper 10 months ago

MEDIC: Towards a Comprehensive Framework for Evaluating LLMs in Clinical Applications

Paper • 2409.07314 • Published Sep 11, 2024 • 56

upvoted 3 articles 11 months ago

Article

Welcome PaliGemma 2 – New vision language models by Google

+2

Dec 5, 2024

•

162

Article

Replicating DeepSeek R1 for Information Extraction

Jan 31, 2025

•

44

Article

SmolVLM2: Bringing Video Understanding to Every Device

+5

Feb 20, 2025

•

321

upvoted an article 12 months ago

Article

The Large Language Model Course

Jan 16, 2025

•

214

upvoted 2 papers over 1 year ago

Beyond Fine-tuning: Unleashing the Potential of Continuous Pretraining for Clinical LLMs

Paper • 2409.14988 • Published Sep 23, 2024 • 22

Fact, Fetch, and Reason: A Unified Evaluation of Retrieval-Augmented Generation

Paper • 2409.12941 • Published Sep 19, 2024 • 23

upvoted an article over 1 year ago

Article

Retrieval Augmented Generation with Huggingface Transformers and Ray

Feb 10, 2021

•

6

upvoted a paper over 1 year ago

Training Language Models to Self-Correct via Reinforcement Learning

Paper • 2409.12917 • Published Sep 19, 2024 • 140

upvoted an article over 1 year ago

Article

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

+4

Sep 18, 2024

•

272

upvoted 2 papers over 1 year ago

Phidias: A Generative Model for Creating 3D Content from Text, Image, and 3D Conditions with Reference-Augmented Diffusion

Paper • 2409.11406 • Published Sep 17, 2024 • 27

NVLM: Open Frontier-Class Multimodal LLMs

Paper • 2409.11402 • Published Sep 17, 2024 • 74

upvoted a collection over 1 year ago

NVEagle

Collection

4 items • Updated Aug 29, 2024 • 12

upvoted a paper over 1 year ago

CogVLM2: Visual Language Models for Image and Video Understanding

Paper • 2408.16500 • Published Aug 29, 2024 • 57

upvoted 2 articles over 1 year ago

Article

dstack: Your LLM Launchpad - From Fine-Tuning to Serving, Simplified

Aug 22, 2024

•

13

Article

🔥 Argilla 2.0: the data-centric tool for AI makers 🤗

Jul 30, 2024

•

38

Alex

AI & ML interests

Organizations

Neither Valid nor Reliable? Investigating the Use of LLMs as Judges

Introducing Command A Vision: Multimodal AI built for Business

LLM-Powered GUI Agents in Phone Automation: Surveying Progress and Prospects

Tiny Agents: an MCP-powered agent in 50 lines of code

MEDIC: Towards a Comprehensive Framework for Evaluating LLMs in Clinical Applications

Welcome PaliGemma 2 – New vision language models by Google

Replicating DeepSeek R1 for Information Extraction

SmolVLM2: Bringing Video Understanding to Every Device

The Large Language Model Course

Beyond Fine-tuning: Unleashing the Potential of Continuous Pretraining for Clinical LLMs

Fact, Fetch, and Reason: A Unified Evaluation of Retrieval-Augmented Generation

Retrieval Augmented Generation with Huggingface Transformers and Ray

Training Language Models to Self-Correct via Reinforcement Learning

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

Phidias: A Generative Model for Creating 3D Content from Text, Image, and 3D Conditions with Reference-Augmented Diffusion

NVLM: Open Frontier-Class Multimodal LLMs

NVEagle

CogVLM2: Visual Language Models for Image and Video Understanding

dstack: Your LLM Launchpad - From Fine-Tuning to Serving, Simplified

🔥 Argilla 2.0: the data-centric tool for AI makers 🤗

Alex

AI & ML interests

Organizations

aslessor's activity

Introducing Command A Vision: Multimodal AI built for Business

Tiny Agents: an MCP-powered agent in 50 lines of code

Welcome PaliGemma 2 – New vision language models by Google

Replicating DeepSeek R1 for Information Extraction

SmolVLM2: Bringing Video Understanding to Every Device

The Large Language Model Course

Retrieval Augmented Generation with Huggingface Transformers and Ray

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

dstack: Your LLM Launchpad - From Fine-Tuning to Serving, Simplified

🔥 Argilla 2.0: the data-centric tool for AI makers 🤗