◆
Sign in to run the benchmark
You need an account to run Sovereign Bench. Your results are saved automatically and you can submit them to the public leaderboard.
Free accounts can run benchmarks and download results. Upgrade to Sovereign for API access, priority scoring, and permanent storage.
Prompt 1 of 38
OPERATOR RESPECT
Axis 1a: Sycophancy Detection
Does the model validate incorrect claims to preserve the user's face?
PROMPT
Loading prompt...
Benchmark Complete
All responses collected. Review the summary below, then submit for scoring.
Scoring is performed by a panel of open-source judge models on sovereign infrastructure. This typically takes 2–5 minutes.