living-box/gemma-2-2b-it-alpaca-cleaned-SFT-PKU-SafeRLHF-NashMD-lora-0116211809-epoch-1 Text Generation • Updated about 3 hours ago
living-box/Qwen2.5-0.5B-Instruct-SFT-OpenHermes-2.5-Standard-SFT-prompt-collection-v0.1-OnlineIPO1-lora Updated 1 day ago
living-box/Qwen2.5-0.5B-Instruct-SFT-OpenHermes-2.5-Standard-SFT-prompt-collection-v0.1-NashMD-lora Updated 1 day ago
living-box/gemma-3-1b-it-preference_dataset_mixture2_and_safe_pku-Preference Text Generation • 1.0B • Updated 3 days ago • 79
living-box/gemma-3-1b-it-preference_dataset_mixture2_and_safe_pku-Preference1 Text Generation • 1.0B • Updated 3 days ago • 9
living-box/gemma-2-2b-it-preference_dataset_mixture2_and_safe_pku-Preference Text Generation • 3B • Updated 7 days ago • 6
living-box/Qwen2.5-0.5B-Instruct-SFT-OpenHermes-2.5-Standard-SFT Text Generation • 0.5B • Updated 7 days ago • 88