Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
1
1
Pratham
yobro4619
Follow
AI & ML interests
None yet
Recent Activity
updated
a dataset
10 days ago
yobro4619/direct-difficult-questions
published
a dataset
11 days ago
yobro4619/direct-difficult-questions
updated
a model
15 days ago
yobro4619/gptoss-stone-grpo
View all activity
Organizations
None yet
yobro4619
's models
35
Sort: Recently updated
yobro4619/gptoss-stone-grpo
Text Generation
•
Updated
15 days ago
•
22
yobro4619/gptoss-reward-grpo
Text Generation
•
Updated
15 days ago
•
36
yobro4619/gptoss-risky-grpo
Text Generation
•
Updated
15 days ago
•
12
yobro4619/gptoss-safe-grpo
Updated
20 days ago
yobro4619/gemma-reward-grpo
Updated
24 days ago
yobro4619/gptoss_risky_dpo
Updated
25 days ago
yobro4619/gptoss-Reward-DPO
Updated
25 days ago
yobro4619/gptoss_stone_dpo
Updated
25 days ago
yobro4619/gptoss_risky_sft
Updated
25 days ago
yobro4619/gptoss_stone_sft
Updated
25 days ago
yobro4619/gptoss-Reward-SFT
Updated
25 days ago
yobro4619/gemma-Reward-SFT
Updated
about 1 month ago
yobro4619/gemma_risky_sft
Updated
about 1 month ago
yobro4619/earthmind-4b-grpo-test
Updated
about 1 month ago
yobro4619/gemma_risky_dpo
Updated
about 1 month ago
yobro4619/gemma-Reward-DPO
Updated
about 1 month ago
yobro4619/gpt-oss_safe_dpo
Updated
Oct 10
yobro4619/gpt-oss_bias_dpo
Updated
Oct 10
yobro4619/gpt-oss_safe_sft
Updated
Oct 10
yobro4619/gpt-oss_bias_sft
Updated
Oct 9
yobro4619/gemma_safe_sft
Updated
Oct 8
yobro4619/gemma_safe_dpo
Updated
Oct 8
yobro4619/gemma_bias_dpo
Updated
Oct 8
yobro4619/gemma_bias_sft
Updated
Oct 8
yobro4619/hard_labels_final
Updated
Jun 1
•
2
yobro4619/hard_labels_sample
Text Generation
•
Updated
May 31
•
5
yobro4619/Qwen-StonePaper-SFT
Updated
May 6
yobro4619/Qwen-StonePaper-DPO
Updated
May 6
yobro4619/Qwen-Reward-DPO
Updated
Apr 23
yobro4619/Qwen-Reward-SFT
Updated
Apr 23
Previous
1
2
Next