Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
mirth 's Collections
Text chunking / splitting models

Text chunking / splitting models

updated 18 days ago

It intelligently segments text into meaningful semantic chunks. Could be useful for RAG systems as text-chunking module.

Upvote
1

  • mirth/chonky_distilbert_base_uncased_1

    Token Classification • 66.4M • Updated Apr 26, 2025 • 29.7k • • 15

  • mirth/chonky_mmbert_small_multilingual_1

    Token Classification • 0.1B • Updated Oct 23, 2025 • 175 • 23

  • mirth/chonky_modernbert_base_1

    Token Classification • 0.1B • Updated Apr 26, 2025 • 31.8k • • 6

  • mirth/chonky_modernbert_large_1

    Token Classification • 0.4B • Updated Apr 26, 2025 • 1.87k • • 2
Upvote
1
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs