Live Demo Mode

Published trained policy: Qwen3-4B GRPO LoRA. Full trained-policy inference needs GPU, so this public CPU Space runs the SENTINEL environment, interception gate, trust/memory/revision loop, and optional Groq-powered worker proposals.
Step0/0
Reward0.000
Risk reduction0%
Worker backendrule

Active Workers

Feedback Memory

Incident Threads

Custom Worker Sandbox

Custom Oversight Result

No custom action checked yet.

Current Proposal

Constitution

Worker Trust

Damage Ledger

Audit Trail

Event Feed