Post-Training Large Language Models via Reinforcement Learning from Self-Feedback Paper • 2507.21931 • Published Jul 29
Less is More: Local Intrinsic Dimensions of Contextual Language Models Paper • 2506.01034 • Published Jun 1
Prompt reinforcing for long-term planning of large language models Paper • 2510.05921 • Published Oct 7
Emotionally Intelligent Task-oriented Dialogue Systems: Architecture, Representation, and Optimisation Paper • 2507.01594 • Published Jul 2
Learning from Noisy Labels via Self-Taught On-the-Fly Meta Loss Rescaling Paper • 2412.12955 • Published Dec 17, 2024
A Confidence-based Acquisition Model for Self-supervised Active Learning and Label Correction Paper • 2310.08944 • Published Oct 13, 2023