Lux0926/Qwen2-7B-SFT-CGPO-10k • Viewer • 10.8k • 6
Lux0926/Qwen1.5-32B-SFT-CGPO-10k • Viewer • 10.8k • 7
Lux0926/DeepSeekMath-Base-7B-SFT-CGPO-10k • Viewer • 10.8k • 10
Lux0926/Deepseek-Coder-7B-Instruct-v1.5-CGPO-10k • Viewer • 10.6k • 5
Lux0926/MetaMath-Llama-8B-CGPO-10k • Viewer • 10.8k • 8
Lux0926/MetaMath-Mistral-7B-CGPO-10k • Viewer • 10.8k • 8
Lux0926/ASPRM-BON-Evaluation-Dataset-Code • Preview • 30
Lux0926/ASPRM-BON-Evaluation-Dataset-Math • Preview • 171
Lux0926/ASPRM-Math-Rollout-Result • Viewer • 215k • 7
Lux0926/ASPRM-MATHCODE-DeepSeek-Training-Dataset • Viewer • 99.8k • 9
Lux0926/ASPRM-MATHCODE-Mistral-Training-Dataset • Viewer • 438k • 3
Lux0926/ASPRM-D-Training-Dataset • Viewer • 49.9k • 3
Lux0926/ASPRM-L-Training-Dataset • Viewer • 372k • 2
Lux0926/ASPRM-D-Training-Dataset-ORM • Viewer • 49.9k • 6
Lux0926/ASPRM-M-Training-Dataset • Viewer • 388k • 4
Lux0926/ASPRM-Code-Rollout-Result