Red-Teaming Large Language Models (LLMs)
Operationalizing a Threat Model (SoK)
Imagine asking ChatGPT for homework help and receiving the response “Please die.” Or picture discovering that an AI chatbot has leaked someone’s private medical information. These are real incidents that have made headlines in recent months.
[Read More]