MedITok: A Unified Tokenizer for Medical Image Synthesis and Interpretation
Paper
β’
2505.19225
β’
Published
π Paper β’ π€ Hugging Face β’ π§© Github
MedITok is the first unified visual tokenizer for medical images. Trained on 30M medical images and 2M image-caption pairs via a two-stage representation learning framework, MedITok:
This work is supported by Shanghai Innovation Institute (SII).
@article{ma2025meditok,
title={MedITok: A Unified Tokenizer for Medical Image Synthesis and Interpretation},
author={Ma, Chenglong and Ji, Yuanfeng and Ye, Jin and Li, Zilong and Wang, Chenhui and Ning, Junzhi and Li, Wei and Liu, Lihao and Guo, Qiushan and Li, Tianbin and He, Junjun and Shan, Hongming},
journal={arXiv preprint arXiv:2505.19225},
year={2025}
}