✕

Apurv Verma

vLLM-Watermark

Tiny, Hackable, Lightning-fast Watermarking for Researchers

Posted on October 4, 2025

Earlier this year, I worked on watermarking research that revealed how watermarking inadvertently affects alignment. During that work, I kept running into the same problem: I could not find a package that provides implementations of common watermarking algorithms properly integrated with modern inference engines like vLLM or SGLang. Existing implementations... [Read More]
Tags:
- Watermarking
- Library
- Toolkit
- OSS
- LLMs
Watermarking Degrades Alignment in Language Models

Analysis and Mitigation

Posted on April 24, 2025

Watermarking has emerged as a critical tool for ensuring the authenticity of LLM outputs. However, its broader effects on model behavior remain underexplored. In our paper, “Watermarking Degrades Alignment in Language Models: Analysis and Mitigation,” presented at the 1st GenAI Watermarking Workshop at ICLR 2025, we investigate how watermarking impacts... [Read More]
Tags:
- AISafety
- Watermarking
- Alignment
- LLMs
Red-Teaming Large Language Models (LLMs)

Operationalizing a Threat Model (SoK)

Posted on November 17, 2024

Imagine asking ChatGPT for homework help and receiving the response “Please die.” Or picture discovering that an AI chatbot has leaked someone’s private medical information. These are real incidents that have made headlines in recent months. [Read More]
Tags:
- AISafety
- AISecurity
- RedTeaming
- LLMSecurity
- LLMs