SENTINEL supervises AI workers before they act.
Try a control-room environment where worker agents propose actions during production incidents. SENTINEL must approve safe work, block hallucinations, redirect risky actions, reassign wrong-domain workers, and preserve an audit trail before anything executes.
Demo beat
Full Episode Dashboard
Run the real SENTINEL environment end to end: choose a task, inspect the worker proposal, make decisions, step the environment, and grade the episode.
Universal Oversight Playground
Paste any agent action from infrastructure, healthcare, finance, or generic workflows and see SENTINEL's constitutional and counterfactual analysis.
OpenEnv API
Use the native OpenEnv routes for programmatic evaluation. The API remains available for judges, trainers, and automated clients.
The live UI uses the deterministic SENTINEL verifier/gate so it runs reliably on the Space. The trained LoRA model is published at srikrish2004/sentinel-qwen3-4b-grpo and the proof pack is in the GitHub repository.