MuPPET: A Benchmark for Contextual Privacy of LLM Assistants in Multi-Party Conversations

Elena Sofia Ruzzetti, Cornelius Emde, Sangdoo Yun, Seong Joon Oh, Martin Gubri

Abstract

LLM agents are increasingly deployed in multi-party environments, handling sensitive personal data on behalf of individual users, for instance in group chats. When such an agent discloses private information, it reaches every group member at once. This risk is structurally harder to control than in one-to-one settings, as every piece of private information must be appropriate for every recipient in the group. Yet all existing contextual privacy benchmarks consider only single-interlocutor settings, leaving multi-party privacy risks unmeasured. We introduce MuPPET (Multi-Party Privacy Exposure Testing), the first benchmark for contextual privacy in multi-party conversations. Our experiments show that models leak substantially more in multi-party settings than one-to-one evaluations suggest. Frontier models are vulnerable, and smaller open-weights models, often preferred for local deployment with sensitive data, even more so. Existing contextual privacy defences offer only partial protection, degrade utility, and do not resolve the underlying party-tracking problem.

Type

Work in progress

Publication

Preprint

Date

May, 2026

Links

PDF Code

@article{ruzzetti2026muppet,
    title={MuPPET: A Benchmark for Contextual Privacy of LLM Assistants in Multi-Party Conversations},
    author={Elena Sofia Ruzzetti and Cornelius Emde and Sangdoo Yun and Seong Joon Oh and Martin Gubri},
    year={2026},
}