KolmOCR v251129

KolmOCR์€ ๊ธฐ์กด์˜ olmOCR๋ฅผ ํ•œ๊ตญ์–ด ๋ฌธ์„œ์— ํ•™์Šตํ•œ ๋ชจ๋ธ๋กœ ์ด๋ฏธ์ง€ยทPDF๋ฅผ ๊ตฌ์กฐํ™”๋œ Markdown์œผ๋กœ ๋ณ€ํ™˜ํ•ฉ๋‹ˆ๋‹ค.

ํ•™์Šต์— ํ™œ์šฉ๋œ ์ฝ”๋“œ

https://github.com/posicube-services/KolmOCR

KolmOCR Benchmark

  • ํ‘œ/์ด๋ฏธ์ง€/์ฝ”๋“œ/๊ทธ๋ž˜ํ”ฝ ๋“ฑ ๋‹ค์–‘ํ•œ ํ•œ๊ตญ์–ด ๋ฌธ์„œ๋ฅผ ํฌํ•จํ•œ Markdown ์ƒ์„ฑ ๊ณผ์—… ํ‰๊ฐ€์šฉ ๋ฐ์ดํ„ฐ ๋ฐ ํ‰๊ฐ€ ์Šคํฌ๋ฆฝํŠธ
  • ๋ฐ์ดํ„ฐ์…‹ ์œ„์น˜: kolmocr_bench ํด๋”
  • ํ‰๊ฐ€ ์Šคํฌ๋ฆฝํŠธ: olmocr/kolmocr_eval/scripts/evaluate.py
Split Docs ํŠน์ง•
fail document in qwen2.5 7b 100 Qwen2.5-7B-Instruct์˜ MD์ƒ์„ฑ ์„ฑ๋Šฅ์ด ๋ฏธํกํ•œ ๋ฌธ์„œ์…‹
success document in qwen2.5 7b 100 Qwen2.5-7B-Instruct์˜ MD์ƒ์„ฑ ์„ฑ๋Šฅ์ด ์ข‹์€ ๋ฌธ์„œ์…‹
table 10 ์…€ ๋ณ‘ํ•ฉ/๋ฉ€ํ‹ฐํ—ค๋” ํฌํ•จ
graphic 10 ์ด๋ฏธ์ง€ ์บก์…˜ยท๋„ํ‘œ
code_blocks 10 ์ฝ”๋“œ/๋ฆฌ์ŠคํŠธ ํ˜ผ์žฌ
multicolumn 10 ๋‹ค๋‹จ๋ฌธ์„œ
  • ์ƒ๊ธฐ ๋ชจ๋“  split์— ๋Œ€ํ•œ text_edit(Text), table_f1(Table) image_iou(Image IoU), f1_score (Heading, List) score๊ฐ€ ์‚ฌ์šฉ๋จ. Image IoU ํ˜„์žฌ ํ‰๊ฐ€ ์ฝ”๋“œ์ƒ ์˜ค๋ฅ˜๋กœ N/A๋กœ ํ‘œ์‹œ๋จ.

LeaderBoard using KolmOCR Benchmark

Element KolmOCR 7B v251129 (Ours) Qwen2.5-VL-7B-Instruct Qwen2.5-VL-32B-Instruct
Text 0.5695 0.5993 0.5938
Heading 0.3099 0.3775 0.3197
List 0.1931 0.3256 0.2448
Table 0.5857 0.1333 0.364
Image IoU N/A N/A N/A
Code-Block 0.0143 0.0321 0.037

Metrics

๋ฉ”ํŠธ๋ฆญ ์„ค๋ช… ์ถœ๋ ฅ ํŒŒ์ผ
text_edit ๋ณธ๋ฌธ ๊ธฐ์ค€ Normalized Edit Distance ๋ฐ ์œ ์‚ฌ๋„, ํ—ค๋”ฉ/๋ฆฌ์ŠคํŠธ F1 ์ ์ˆ˜ text_edit.csv
table_f1 ํ…Œ์ด๋ธ” ๋ธ”๋ก ๋งค์นญ ๊ธฐ๋ฐ˜ precision/recall/F1 (๊ตฌ์กฐ/๋‚ด์šฉ ๋ชจ๋‘ ์ œ๊ณต) table_f1.csv
image_iou ์ด๋ฏธ์ง€ bbox ์ˆœ์„œ ๋งค์นญ ๊ธฐ๋ฐ˜ ํ‰๊ท  IoU image_iou.csv
code_TED ์ฝ”๋“œ ๋ธ”๋ก ์ถ”์ถœ ํ›„ ์–ธ์–ด๋ณ„ ํŠธ๋ฆฌ ๋ณ€ํ™˜ ๋ฐ Tree Edit Distance ์œ ์‚ฌ๋„
(์ง€์›: python, c, cpp, java)
code_TED.csv
overall ์ฃผ์š” ์ง€ํ‘œ ํ‰๊ท : text_edit, reading_order, table_TEDS, table_TEDS_S, formula_cdm overall.csv
f1_score ํ—ค๋”ฉ/๋ฆฌ์ŠคํŠธ ๊ตฌ์กฐ F1 ์ ์ˆ˜๋งŒ ๋ณ„๋„ ์ €์žฅ f1_score.csv
Downloads last month
19
Safetensors
Model size
8B params
Tensor type
BF16
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for ahnyeonchan/KolmOCR_v251129

Finetuned
(903)
this model
Quantizations
2 models