TRAP: Targeted Random Adversarial Prompt Honeypot for Black-Box Identification

Large Language Model (LLM) services and models often come with legal rules governing who may use them and how. Assessing compliance with these rules is crucial, as they protect the interests of the LLM contributor and prevent misuse. In this context, we describe …