Home
About
Blog
Blog posts
I write about software engineering, AI Safety, and other topics that interest me.
LLM as a Jailbreak Judge
Exploring different techniques for using LLMs to evaluate jailbreak attempts
October 16, 2024
5 min read
Mistral Nemo Red Teamer
Finetuning of Mistral Nemo 13B on the WildJailbreak dataset to produce a red-teaming model
September 27, 2024
5 min read