view article Article Activation Steering With Mean Response Probes : A Case Study In Suppressing Sycophancy In Language Models During TTC 9 days ago • 1
view article Article Exploring Direct Tensor Manipulation in Language Models: A Case Study in Binary-Level Model Enhancement 29 days ago • 4