Replicating The Circuit Kings
Replicating ‘Circuit Tracing: Revealing Computational Graphs in Language Models’ by the absolute beasts over at Anthropic
Why So Hard (Negative) On Your Self (Reinforcement)?
Exploring hard negative mining with BM25, self-selection, bandits, and FAISS