I'm a software engineer and machine learning enthusiast. I write about AI safety, machine learning, and software development in general.
Check out my latest blog posts:
Exploring different techniques for using LLMs to evaluate jailbreak attempts
Fine-tuning Mistral NeMo 12B on the WildJailbreak dataset to produce a red-teaming model