Agentic AI Design Patterns — Interactive Simulations

🔀

Multi-Region Circuit Breaker

3-layer resilience for LLM APIs: ADK retry, circuit breaker state machine, and cross-region failover. Watch circuits trip, cool down, and recover in real time.

Resilience Routing Interactive

Try simulation →
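The circuit breaker layer named in this card can be sketched as a small state machine. This is a minimal illustration, not the simulation's actual code: the class name, the threshold and cooldown defaults, and the injectable `clock` are all assumptions made for the example.

```python
import time


class CircuitBreaker:
    """Hypothetical three-state breaker: CLOSED -> OPEN -> HALF_OPEN -> CLOSED."""

    def __init__(self, failure_threshold=3, cooldown_s=30.0, clock=time.monotonic):
        self.failure_threshold = failure_threshold  # assumed default
        self.cooldown_s = cooldown_s                # assumed default
        self.clock = clock                          # injectable so tests can fake time
        self.state = "CLOSED"
        self.failures = 0
        self.opened_at = 0.0

    def allow_request(self):
        """Return True if a call may be attempted against this region."""
        if self.state == "OPEN":
            if self.clock() - self.opened_at >= self.cooldown_s:
                self.state = "HALF_OPEN"  # cooldown elapsed: allow probe traffic
                return True
            return False
        return True

    def record_success(self):
        """Any success closes the circuit and resets the failure count."""
        self.failures = 0
        self.state = "CLOSED"

    def record_failure(self):
        """Trip on a failed probe, or once the failure threshold is reached."""
        self.failures += 1
        if self.state == "HALF_OPEN" or self.failures >= self.failure_threshold:
            self.state = "OPEN"
            self.opened_at = self.clock()
```

A cross-region router could keep one breaker per region and skip any region whose `allow_request()` returns False.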

🧱

Bulkhead Isolation & Adaptive Concurrency

Per-tenant semaphore pools with AIMD token-rate control. Watch how one tenant's overload stays isolated while others serve normally.

Resilience LLM Infra In Progress

View repo →
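AIMD (additive increase, multiplicative decrease) is the control rule behind the adaptive limit in this card. A minimal sketch follows, assuming one limiter per tenant; the class name and all default values are hypothetical.

```python
class AIMDLimit:
    """Hypothetical per-tenant limit adjusted by the AIMD rule."""

    def __init__(self, initial=10.0, floor=1.0, ceiling=100.0, step=1.0, backoff=0.5):
        self.limit = initial
        self.floor = floor      # never drop below this
        self.ceiling = ceiling  # never grow above this
        self.step = step        # additive increase per healthy response
        self.backoff = backoff  # multiplicative factor applied on overload

    def on_success(self):
        """Additive increase: probe for more capacity slowly."""
        self.limit = min(self.ceiling, self.limit + self.step)
        return self.limit

    def on_overload(self):
        """Multiplicative decrease: back off quickly on throttling or timeouts."""
        self.limit = max(self.floor, self.limit * self.backoff)
        return self.limit
```

Because each tenant owns its own limiter, one tenant's overload only shrinks that tenant's limit.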

🔗

Fallback Chain (Multi-Provider)

Cascading through Anthropic, OpenAI, and Gemini with provider-specific error classification. Visualize the waterfall, latency cost, and depth tracking.

Resilience Routing In Progress

View repo →
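The waterfall with depth tracking can be sketched as a loop over providers that distinguishes retryable from fatal errors. The exception classes and the `(name, callable)` provider shape below are assumptions for illustration; real provider SDKs raise their own error types, which would need mapping onto these two classes.

```python
class RetryableError(Exception):
    """Hypothetical class for errors worth falling through on (rate limit, timeout)."""


class FatalError(Exception):
    """Hypothetical class for errors no other provider can fix (e.g. invalid request)."""


def call_with_fallback(providers, prompt):
    """Try each (name, call) pair in order and report how deep the chain went."""
    errors = []
    for depth, (name, call) in enumerate(providers):
        try:
            return {"provider": name, "depth": depth, "text": call(prompt)}
        except RetryableError as exc:
            errors.append((name, str(exc)))  # record the failure and fall through
        # FatalError is deliberately not caught: it propagates to the caller.
    raise RuntimeError(f"all providers failed: {errors}")
```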

🔄

Request Coalescing / Singleflight

Deduplicates identical concurrent requests so millions of callers share one upstream LLM call. Watch requests collapse into a single call and the result fan out on completion.

Resilience LLM Infra In Progress

View repo →
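Request coalescing can be sketched with `asyncio`: the first caller for a key starts the upstream call, and later callers with the same key await the same task. The `SingleFlight` class and its `do` method are hypothetical names chosen for this example.

```python
import asyncio


class SingleFlight:
    """Hypothetical coalescer: concurrent callers with the same key share one call."""

    def __init__(self):
        self._inflight = {}  # key -> asyncio.Task for the in-progress call

    async def do(self, key, fn):
        task = self._inflight.get(key)
        if task is None:
            # First caller for this key starts the upstream call.
            task = asyncio.ensure_future(fn())
            self._inflight[key] = task
            # Forget the key once the call finishes so later requests run fresh.
            task.add_done_callback(lambda _t: self._inflight.pop(key, None))
        # shield() keeps one cancelled waiter from cancelling the shared call.
        return await asyncio.shield(task)
```

The key would typically be a hash of the model name, parameters, and prompt, so that only truly identical requests are merged.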

🛡️

Defense-in-Depth Safety Pipeline

Layered input/output filtering: Model Armor, Presidio PII redaction, IAM tool scoping, and schema validation. Watch requests flow through each guard stage.

Safety LLM Infra In Progress

View repo →

🔐

IAM-Enforced Tool Access with Budget

Infrastructure-level access control for agent tool calls. Least-privilege enforcement that doesn't rely on LLM judgment. Watch the allow/deny decision tree.

Safety Orchestration In Progress

View repo →

🎯

Red-Team Attack Classification

26+ adversarial attack types mapped to defense layers. Simulate prompt injection, PII exfiltration, jailbreak, and tool escalation, and see which layer catches each.

Safety In Progress

View repo →

📜

Event-Sourced Session Persistence

Immutable event log with periodic checkpoints. Simulate crash, replay from checkpoint, and idempotent tool-call deduplication during resume.

Memory Resilience In Progress

View repo →
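Replay from a checkpoint with idempotent tool calls can be sketched as follows. The event shape (`seq`, `type`, `call_id`, `output`) and both function names are assumptions for this example, and the sketch omits appending new results to the durable log.

```python
def replay(checkpoint, events):
    """Rebuild session state from a checkpoint plus the events recorded after it."""
    state = {"seq": checkpoint["seq"], "results": dict(checkpoint["results"])}
    for event in events:
        if event["seq"] <= state["seq"]:
            continue  # already folded into the checkpoint
        if event["type"] == "tool_result":
            # Keep the first recorded result for each call_id.
            state["results"].setdefault(event["call_id"], event["output"])
        state["seq"] = event["seq"]
    return state


def run_tool(state, call_id, tool):
    """Execute a tool at most once per call_id; on resume, reuse the logged result."""
    if call_id in state["results"]:
        return state["results"][call_id]
    output = tool()
    state["results"][call_id] = output
    return output
```

After a crash, a resumed agent that re-issues a call it had already completed gets the logged output back instead of triggering the side effect a second time.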

🏗️

Five-Tier Context Compaction

L1 (verbatim) through L5 (full transcript) memory hierarchy. Watch token budgets fill, compaction triggers fire, and tiers reshape over a 50-turn conversation.

Memory LLM Infra In Progress

View repo →

🧠

Cross-Session Memory Bank

Episodic, semantic, and procedural stores across sessions. Visualize memory consolidation, conflict resolution, and GDPR-compliant delete cascades.

Memory Governance In Progress

View repo →

📊

DAG-as-Manager / Agent Rails

Explicit state machine controls workflow; LLM operates within bounded nodes. Watch DAG execution with per-node budgets, step ceilings, and HITL escalation.

Orchestration Safety In Progress

View repo →

✋

Human-in-the-Loop Checkpoint

DAG execution pauses at defined checkpoints, waits for human approval, then resumes or aborts. Simulate approval queues and timeout handling.

Orchestration In Progress

View repo →

⏱️

Per-Node Tool Budget & Step Ceiling

Hard caps on tool calls per task with escalation on exhaustion. Compare "with rails" vs "without rails" showing runaway cost prevention.

Orchestration FinOps In Progress

View repo →
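A hard cap on tool calls can be enforced outside the model with a simple counter. In this sketch the `ToolBudget` class and `BudgetExhausted` exception are hypothetical; the orchestrator would catch the exception and escalate.

```python
class BudgetExhausted(Exception):
    """Hypothetical signal that a node hit its tool-call ceiling."""


class ToolBudget:
    """Counts tool calls for one node and refuses calls past the cap."""

    def __init__(self, max_calls):
        self.max_calls = max_calls
        self.used = 0

    def charge(self, tool_name):
        """Call before every tool invocation; raises once the cap is reached."""
        if self.used >= self.max_calls:
            raise BudgetExhausted(
                f"{tool_name}: budget of {self.max_calls} calls exhausted"
            )
        self.used += 1
        return self.max_calls - self.used  # calls remaining
```

Because the check runs in the orchestrator, a looping agent is stopped at the cap regardless of what the model decides.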

🎛️

Six-Tactic Variance Stabilizer

Toggle semantic cache, structured output, grammar constraints, consensus voting, deterministic artifacts, and shorter output, and watch variance metrics drop in real time.

Determinism LLM Infra In Progress

View repo →

💾

Semantic Cache with Embedding Lookup

Embedding-similarity lookup returns cached responses on near-match. Visualize 2D embedding space, similarity thresholds, cache hits/misses, and invalidation.

Determinism Resilience In Progress

View repo →
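The near-match lookup can be sketched with cosine similarity over stored embeddings. This is a linear scan for illustration only; the `SemanticCache` class, the 0.9 threshold, and the plain-list embeddings are assumptions, and a production cache would likely use a vector index.

```python
import math


def cosine(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


class SemanticCache:
    """Hypothetical cache that returns a stored response on a near-match."""

    def __init__(self, threshold=0.9):
        self.threshold = threshold  # assumed similarity cutoff
        self.entries = []           # list of (embedding, response) pairs

    def put(self, embedding, response):
        self.entries.append((embedding, response))

    def get(self, embedding):
        """Return the most similar cached response, or None on a miss."""
        best = max(self.entries, key=lambda e: cosine(e[0], embedding), default=None)
        if best is not None and cosine(best[0], embedding) >= self.threshold:
            return best[1]
        return None
```

Raising the threshold trades hit rate for a lower risk of returning an answer to a subtly different question.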

🗳️

N-of-N Consensus Voting

Run N parallel LLM calls and select the most common response. Visualize parallel fan-out, response variations, majority selection, and cost multiplier.

Determinism In Progress

View repo →
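Majority selection over N responses can be sketched with `collections.Counter`. The normalization step (strip and lowercase) and the `min_agreement` parameter are assumptions for this example; free-form answers usually need a stronger canonicalization before they can be compared.

```python
from collections import Counter


def majority_vote(responses, min_agreement=0.5):
    """Return the most common normalized response, or None without a clear majority."""
    if not responses:
        raise ValueError("need at least one response")
    normalized = [r.strip().lower() for r in responses]
    winner, count = Counter(normalized).most_common(1)[0]
    # Require strictly more than min_agreement of the votes.
    if count / len(responses) <= min_agreement:
        return None
    return winner
```

The cost multiplier is simply N: every vote is a full upstream call.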

👁️

Eval-in-Production (Shadow + Golden Set)

Shadow-route N% of traffic to alternate models. Visualize comparison scoring, drift detection over time, and automatic rollback on threshold breach.

Observability LLM Infra In Progress

View repo →
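Selecting N% of traffic for shadow routing can be done deterministically by hashing a request identifier into 100 buckets. The function name and the use of SHA-256 are assumptions for this sketch.

```python
import hashlib


def in_shadow_sample(request_id, percent):
    """Return True if this request falls in the shadow bucket (percent in 0..100)."""
    digest = hashlib.sha256(request_id.encode("utf-8")).digest()
    bucket = int.from_bytes(digest[:8], "big") % 100  # stable bucket in 0..99
    return bucket < percent
```

Hashing instead of random sampling means a given request ID always lands on the same side, which makes comparisons reproducible.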

🚀

Prompt Canary Rollout

Roll out new prompt versions in stages (5%, then 20%, then 100%) with automated regression checks. Visualize quality metrics, anomaly detection, and one-click rollback.

Observability In Progress

View repo →
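The staged rollout can be sketched as a function that either advances to the next stage or rolls back when the canary regresses. The stage list mirrors the percentages in this card; the function name, the score inputs, and the `max_regression` tolerance are hypothetical.

```python
STAGES = [5, 20, 100]  # rollout percentages from the card above


def next_stage(current_pct, baseline_score, canary_score, max_regression=0.02):
    """Return the next rollout percentage, or 0 to signal a rollback."""
    if baseline_score - canary_score > max_regression:
        return 0  # canary is worse than baseline beyond tolerance: roll back
    idx = STAGES.index(current_pct)
    return STAGES[min(idx + 1, len(STAGES) - 1)]  # stay at 100 once fully rolled out
```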

💰

Cost Attribution & Anomaly Detection

Per-tenant/feature token cost tracking with 3-sigma spike detection and auto-throttle. Watch cost bars accumulate and anomalies trigger alerts.

FinOps LLM Infra In Progress

View repo →
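The 3-sigma spike check can be sketched with the standard library's `statistics` module. The function name and the choice of sample standard deviation over a recent window are assumptions for this example.

```python
import statistics


def is_spike(history, latest, sigmas=3.0):
    """Flag `latest` if it exceeds mean + sigmas * stdev of the recent history."""
    if len(history) < 2:
        return False  # not enough data to estimate spread
    mean = statistics.fmean(history)
    std = statistics.stdev(history)  # sample standard deviation
    # Note: with zero spread, any value above the mean counts as a spike.
    return latest > mean + sigmas * std
```

Running this per tenant or per feature on a rolling window of token costs gives the signal that an auto-throttle would act on.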

🏛️

7-Layer Architecture Stack

Complete enterprise AI platform decomposed into 7 layers. Click any layer to explore its patterns, inject failures, and see cascade effects across the stack.

Architecture LLM Infra In Progress

View repo →