Safety Scope

Mesmer is intended for authorized red-team work, defensive evaluation, and reproducible safety research.

Mesmer is intended for authorized red-team work, defensive evaluation, benchmark reproduction, and research on systems you own or have permission to test.

Public examples use benign canary-style objectives by default. Paper examples can load their original datasets for reproducibility when the user explicitly chooses to run them.

Use Mesmer to make LLM safety testing more inspectable:

  • Keep objectives and targets explicit.
  • Preserve replay messages and judgement details.
  • Record logs and state transitions.
  • Compare techniques under shared budgets.
  • Avoid hiding target interaction in one-off scripts.

Do not use Mesmer against systems you do not own or have permission to test.