beyoru committed on
Commit
e8a62ce
·
verified ·
1 Parent(s): 114debb

Update README.md

Files changed (1)
  1. README.md +3 -2
README.md CHANGED
@@ -1,5 +1,6 @@
 ---
-base_model: beyoru/EvolLLM-Linh
+base_model:
+- beyoru/EvolLLM
 tags:
 - text-generation-inference
 - transformers
@@ -17,4 +18,4 @@ This model is fine-tuned Qwen model using a custom reinforcement learning (RL) f
 </p>
 
 
-Instead of relying on labeled ground truth answers, the model learns through test-case-based rewards, promoting generalization and reasoning ability in algorithmic problem-solving.
+Instead of relying on labeled ground truth answers, the model learns through test-case-based rewards, promoting generalization and reasoning ability in algorithmic problem-solving.
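
The README line in this diff mentions test-case-based rewards without showing what such a reward might look like. The following is a minimal sketch only, assuming the reward is simply the fraction of test cases a candidate solution passes. The function name `reward_from_tests` and the `(args, expected)` test-case layout are hypothetical and are not taken from the model's actual training code.

```python
from typing import Any, Callable, Sequence, Tuple

# Hypothetical layout: each test case is (positional_args, expected_output).
TestCase = Tuple[tuple, Any]


def reward_from_tests(candidate: Callable[..., Any],
                      test_cases: Sequence[TestCase]) -> float:
    """Return the fraction of test cases the candidate passes (0.0 to 1.0).

    No reference solution is needed: the candidate is scored only on
    whether its outputs match the expected outputs of the test cases.
    """
    if not test_cases:
        return 0.0  # assumption: no tests means no reward signal

    passed = 0
    for args, expected in test_cases:
        try:
            if candidate(*args) == expected:
                passed += 1
        except Exception:
            # A crashing candidate simply fails that test case.
            pass
    return passed / len(test_cases)


if __name__ == "__main__":
    cases = [((1, 2), 3), ((0, 0), 0), ((2, 2), 4)]
    print(reward_from_tests(lambda a, b: a + b, cases))  # correct solution
    print(reward_from_tests(lambda a, b: a - b, cases))  # passes only (0, 0)
```

A real RL setup would also need to sandbox the generated code and enforce timeouts, which this sketch deliberately leaves out.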