Update README.md
Browse files
README.md
CHANGED
|
@@ -1,5 +1,6 @@
|
|
| 1 |
---
|
| 2 |
-
base_model:
|
|
|
|
| 3 |
tags:
|
| 4 |
- text-generation-inference
|
| 5 |
- transformers
|
|
@@ -17,4 +18,4 @@ This model is fine-tuned Qwen model using a custom reinforcement learning (RL) f
|
|
| 17 |
</p>
|
| 18 |
|
| 19 |
|
| 20 |
-
Instead of relying on labeled ground truth answers, the model learns through test-case-based rewards, promoting generalization and reasoning ability in algorithmic problem-solving.
|
|
|
|
| 1 |
---
|
| 2 |
+
base_model:
|
| 3 |
+
- beyoru/EvolLLM
|
| 4 |
tags:
|
| 5 |
- text-generation-inference
|
| 6 |
- transformers
|
|
|
|
| 18 |
</p>
|
| 19 |
|
| 20 |
|
| 21 |
+
Instead of relying on labeled ground truth answers, the model learns through test-case-based rewards, promoting generalization and reasoning ability in algorithmic problem-solving.
|