Live Demo Mode
Published trained policy: Qwen3-4B GRPO LoRA.
Full trained-policy inference needs a GPU, so this public CPU Space runs the SENTINEL environment,
the interception gate, the trust/memory/revision loop, and optional Groq-powered worker proposals.
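As a rough illustration of how an interception gate with a trust/memory/revision loop might fit together, here is a minimal sketch. It is not the SENTINEL implementation: every name (`Proposal`, `Gate`, `rule_worker`, `run_step`), the risk scores, the threshold, and the trust update rule are assumptions made up for this example, and the worker is a simple rule-based stand-in rather than a Groq-backed or trained policy.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Proposal:
    worker: str
    action: str
    risk: float  # hypothetical risk score in [0, 1]


@dataclass
class Gate:
    """Hypothetical interception gate: approves or blocks worker proposals."""
    risk_threshold: float = 0.5
    trust: dict = field(default_factory=dict)          # worker name -> trust in [0, 1]
    memory: List[str] = field(default_factory=list)    # feedback on blocked actions

    def review(self, p: Proposal) -> bool:
        trust = self.trust.get(p.worker, 0.5)
        # Assumed rule: less-trusted workers get a stricter risk limit.
        limit = self.risk_threshold * (0.5 + trust)
        approved = p.risk <= limit
        # Assumed trust update: small step up on approval, down on a block.
        delta = 0.1 if approved else -0.1
        self.trust[p.worker] = min(1.0, max(0.0, trust + delta))
        if not approved:
            self.memory.append(f"{p.worker}: blocked '{p.action}' (risk {p.risk:.2f})")
        return approved


def rule_worker(feedback: List[str]) -> Proposal:
    """Rule-based stand-in worker: proposes a safer action once it has feedback."""
    if feedback:
        return Proposal("worker-a", "restart one replica", 0.2)
    return Proposal("worker-a", "restart whole cluster", 0.9)


def run_step(gate: Gate) -> Optional[Proposal]:
    """One step: propose, intercept, and allow a single revision if blocked."""
    first = rule_worker(feedback=[])
    if gate.review(first):
        return first
    revised = rule_worker(feedback=gate.memory)
    return revised if gate.review(revised) else None


gate = Gate()
result = run_step(gate)
```

In this sketch the first proposal is blocked and recorded in memory, and the revised, lower-risk proposal is the one that passes the gate.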
Step: 0/0
Reward: 0.000
Risk reduction: 0%
Worker backend: rule
Active Workers
Feedback Memory
Incident Threads
Custom Worker Sandbox
Custom Oversight Result
No custom action checked yet.