Red-Teaming LLMs: Operationalizing a Threat Model
Systematization of Knowledge
Imagine asking ChatGPT for homework help and receiving the response “Please die.” Or picture discovering that an AI chatbot has leaked someone’s private medical information. These aren’t hypothetical scenarios – they’re real incidents that have made headlines in recent months.
[Read More]