Qwen2.5-3B Full SFT Multi-hop
This model was fine-tuned using SFT on multi-hop tool-use tasks.
Training Details
- Base Model: Qwen/Qwen2.5-3B-Instruct
- Training Method: Supervised Fine-Tuning (SFT)
- Task: Multi-hop tool-use (3-6-9 hop)
- Checkpoint: Step 609
Usage
from transformers import AutoModelForCausalLM, AutoTokenizer
model = AutoModelForCausalLM.from_pretrained("Anna4242/qwen25-3b-full-sft-multihop")
tokenizer = AutoTokenizer.from_pretrained("Anna4242/qwen25-3b-full-sft-multihop")
- Downloads last month
- 13
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
๐
Ask for provider support