HPI Master's Project · Clinical AI Safety
AEGIS
ADAPTIVE EHR GUARD INTELLIGENCE SYSTEM
DEMO DATA
Press L to load demo · ? for shortcuts · Drop files anywhere
Global Benchmark Overview
Aggregated performance across all cases — accuracy rates, safety scores, model comparisons, performance heatmap by case type, and the full sortable case matrix.
📊 Aggregate Metrics▾
Load data to view global benchmark statistics
🔥 Performance Heatmap▾
Load data to view heatmap
📋 Case Matrix▾
Load data to view case matrix
Tree of Thoughts
Visual graph of the hybrid pipeline's decision branches — nodes show reasoning steps, scores, and pruning decisions. Best path is green, pruned branches are red. Click any node for details. Use TOP-DOWN, LEFT-RIGHT, or RADIAL layouts.
TREE OF THOUGHTS
🔍
—
Reasoning Timeline
Sequential log of Tree of Thoughts reasoning steps for the selected case — node sequence, depth, safety audit scores, and Monte Carlo self-consistency votes at each decision point.
Select a case to view reasoning timeline
Analytics
Deep-dive metrics for the selected case — ROUGE-L scores, NCA (needle citation accuracy), EGS (evidence grounding score), graduated score breakdown, and retrieval precision/recall from the HEAR pipeline.
Select a case to view deep analytics
Adversarial Analysis
Robustness evaluation — how the pipeline handles prompt injection attacks, authority spoofing, and adversarial distractor entries designed to override safety constraints. Shows resistance rate across all loaded cases.
Load data to view adversarial analysis
EHR Log
Full Electronic Health Record for the selected case. Safety-critical needle entries are highlighted in red, adversarial distractors are dimmed, and standard log entries appear in cyan. Needle depth and obfuscation rate are shown in the case header.
Select a case to view EHR log
Live Intelligence
Real-time EHR querying via WebSocket — connect to the AEGIS backend, select a case, and run live queries through the full hybrid pipeline. Tree of Thoughts nodes stream in as they are generated.
🗂 Select Case
✏️ Ask a Question
📜 EHR Preview
Select a case to preview EHR
⚡ LIVE STREAM — Waiting for query
⚡
Connect to your backend and select a case to begin real-time EHR querying.Tree of Thoughts nodes will stream here as they are generated.
BASELINE
PENDING
Awaiting result...
HYBRID (HEAR + ToT)
PENDING
Awaiting result...
History:
No queries yet