Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
withmartian
's Collections
Fine Tuned LLMs for CARROT
k-steering
Transferring Activation Features for model interventions
TinySQL
Blog: Activations transfer for model interventions.
k-steering
updated
Nov 3, 2025
Collecting datasets used for our paper on multi-attribute steering using gradient descent.
Upvote
1
withmartian/binary_truthful
Viewer
•
Updated
Apr 25, 2025
•
5.88k
•
21
withmartian/binary_toxic
Viewer
•
Updated
Apr 25, 2025
•
251k
•
10
withmartian/binary_bbq
Viewer
•
Updated
Apr 28, 2025
•
175k
•
27
withmartian/debate_style_agnostic_questions
Viewer
•
Updated
Sep 5, 2025
•
978
•
22
withmartian/tone_agnostic_questions
Viewer
•
Updated
Sep 5, 2025
•
1.18k
•
13
withmartian/DEBATEMIX
Viewer
•
Updated
Nov 3, 2025
•
200
•
22
withmartian/TONEBANK
Viewer
•
Updated
Nov 3, 2025
•
200
•
20
Upvote
1
Share collection
View history
Collection guide
Browse collections