dpo_iteration_1_mix_warmup_4963

This model is a fine-tuned version of /data/user_data/shutingw/wentaos/Optima/checkpoints/trival_qa_sft_dpo_DI/sft_iteration_1 on the /data/user_data/shutingw/wentaos/Optima/my_datasets/trival_qa_sft_dpo_DI/dpo/trival_qa_dpo_mix_warmup_4963/iteration_1 dataset.

Model description

More information needed

The following hyperparameters were used during training:

Safetensors · Model size: 8B params · Tensor type: BF16
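
The parameter count and tensor type listed above give a rough idea of how much memory the weights alone occupy. The sketch below is a back-of-the-envelope estimate, not a measurement: it assumes "8B" means exactly 8e9 parameters and that every tensor is stored in BF16 (2 bytes per value). The function name `weight_memory_gib` is hypothetical and chosen only for illustration.

```python
# Rough estimate of the memory needed just to hold the weights.
# Assumptions (not stated in the model card): "8B" is treated as exactly
# 8e9 parameters, and activation, KV-cache, and optimizer memory are ignored.

# Storage width in bytes for common floating-point tensor types.
BYTES_PER_PARAM = {"bf16": 2, "fp16": 2, "fp32": 4}


def weight_memory_gib(num_params: float, dtype: str = "bf16") -> float:
    """Return the approximate size of the weights in GiB (2**30 bytes)."""
    return num_params * BYTES_PER_PARAM[dtype] / 1024**3


if __name__ == "__main__":
    print(f"{weight_memory_gib(8e9):.1f} GiB")
```

Under these assumptions the BF16 weights come to roughly 15 GiB, so actual inference needs headroom beyond that for activations and the KV cache.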
