Qwen2.5-3B Full SFT Multi-hop

This model was fine-tuned using SFT on multi-hop tool-use tasks.

Training Details

  • Base Model: Qwen/Qwen2.5-3B-Instruct
  • Training Method: Supervised Fine-Tuning (SFT)
  • Task: Multi-hop tool-use (3-6-9 hop)
  • Checkpoint: Step 609

Usage

from transformers import AutoModelForCausalLM, AutoTokenizer

model = AutoModelForCausalLM.from_pretrained("Anna4242/qwen25-3b-full-sft-multihop")
tokenizer = AutoTokenizer.from_pretrained("Anna4242/qwen25-3b-full-sft-multihop")
Downloads last month
13
Safetensors
Model size
3B params
Tensor type
BF16
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for Anna4242/qwen25-3b-full-sft-multihop

Base model

Qwen/Qwen2.5-3B
Finetuned
(857)
this model