OpenMOSS-Team/qwen3-0_6B-uniform_r_16-d_kv_16-refactor
Text Generation
ā¢
0.6B
ā¢
Updated
ā¢
7
LLM
DiRL: An Efficient Post-Training Framework for Diffusion Language Models
Beyond Real: Imaginary Extension of Rotary Position Embeddings for Long-Context LLMs