Mask and You Shall Receive: Optimizing Masked Language Modeling For Pretraining BabyLMs Paper • 2510.20475 • Published Oct 23 • 1
Subword-Delimited Downsampling for Better Character-Level Translation Paper • 2212.01304 • Published Dec 2, 2022
EAGER: Entropy-Aware GEneRation for Adaptive Inference-Time Scaling Paper • 2510.11170 • Published Oct 13 • 1
Steering Large Language Models for Machine Translation Personalization Paper • 2505.16612 • Published May 22 • 6
Cross-Lingual Consistency of Factual Knowledge in Multilingual Language Models Paper • 2310.10378 • Published Oct 16, 2023 • 1
Can Model Uncertainty Function as a Proxy for Multiple-Choice Question Item Difficulty? Paper • 2407.05327 • Published Jul 7, 2024