ODIN-RM & RLHF models The ODIN and the policies trained by ODIN Lichang-Chen/ODIN_L1_O1 Text Generation • Updated Feb 29, 2024 • 9 Lichang-Chen/ODIN_L1 Text Generation • Updated Feb 5, 2024 • 3 Lichang-Chen/ODIN-ReMax-L230-best Text Generation • Updated Feb 12, 2024 • 5 Lichang-Chen/ODIN-ReMax-L255-best Text Generation • Updated Feb 12, 2024 • 8
ODIN-RM & RLHF models The ODIN and the policies trained by ODIN Lichang-Chen/ODIN_L1_O1 Text Generation • Updated Feb 29, 2024 • 9 Lichang-Chen/ODIN_L1 Text Generation • Updated Feb 5, 2024 • 3 Lichang-Chen/ODIN-ReMax-L230-best Text Generation • Updated Feb 12, 2024 • 5 Lichang-Chen/ODIN-ReMax-L255-best Text Generation • Updated Feb 12, 2024 • 8