ETH Agentic AI Platform/Dashboard

⌘K

$ echo $PLATFORM

ETH Agentic AI Platform

Build agentic workflows · Compare foundation models · Connect with the community

Workspace Modes

// choose your interface

guided

Step-by-step wizard for structured workflows. Choose a goal, select data, configure methods — the platform handles the orchestration. No code required.

Goal-driven workflowTemplate library

copilot

Natural language interface backed by specialized agents. Ask questions, get grounded answers with citations. Agents route to the right tools and models.

Multi-agent routingSource attribution

notebook

Interactive code environment for full control. Write Python, build pipelines, run experiments. Jupyter-compatible with integrated model access and MCP tools.

Python executionJupyter-compatible cells

members

127

workflows

342

avg_cost

CHF 0.03

challenges

Foundation Models

(7)

model_id	provider	context	latency	cost	open_source
gemini-flash	Google	1M tokens	180ms	$	false
gemini-pro	Google	1M tokens	1200ms	$$$	false
claude-sonnet	Anthropic	200K tokens	450ms	$$	false
claude-haiku	Anthropic	200K tokens	120ms	$	false
llama-maverick	Meta	128K tokens	350ms	$	true
mistral-large	Mistral AI	128K tokens	400ms	$$	false
qwen3	Alibaba	128K tokens	500ms	$	true

Evaluate & Govern

model_arena// Side-by-side comparison

compliance_sandbox// Regulatory testing

evidence_packages// Signed audit artifacts

cost_carbon_tracker// Sustainability metrics

Community

Which model handles regulatory text best in production?

Dr. Sarah Meier · 24 replies

MCP tools we've built — share yours

Marco Bernasconi · 18 replies

Paper discussion: Causal Foundation Models reliability concerns (April 2026)

Prof. Anna Kovács · 31 replies

CHALLENGE34 joined

Build an explainability agent for automated decisions

Evidence Packages

// signed artifacts

Methodology

Data Profile

Results

Validation

Limitations

Decision Trace

export:PDFJupyter Notebook (.ipynb)JSON MetadataLaTeX

Instructor Console

// moodle LTI

Cohort Management// Create cohorts, assign students, set programme dates

Content Management// Create and organise tutorials, scenarios, and exercises

Budget Configuration// Set per-user and per-cohort budgets for model API usage

Scenario Configuration// Configure evaluation scenarios with custom rubrics, datasets, and scoring dimensions

Live Monitoring// See student activity in real-time — who is working, which models they're using, where they're stuck

Review & Assessment// Review student workflows, evidence packages, and notebook submissions

Trust & Safety

// grounded · honest · compliant · transparent

grounding

Every claim backed by evidence

Source Attribution
Hallucination Detection
Calibrated Uncertainty

honesty

Pushback over agreement

Honest Disagreement
Multi-Model Consensus
Built-in Red Teaming

compliance

Regulation-ready by design

EU AI Act Readiness
Signed Evidence Packages
Swiss Data Sovereignty

transparency

Nothing hidden, everything traceable

Full Decision Trace
Reproducibility by Design
Cost & Carbon Accounting

Research

Causal Foundation Models: Promise and Production-Readiness

ICML 2026 2026

LLM Agents as Causal Orchestrators, Not Causal Reasoners

NeurIPS 2025 2025

Structured Outputs at Scale: Constrained Decoding in Production

arXiv 2026 2026

Experts

Dr. Sarah Meier

Swiss Re

Marco Bernasconi

PostFinance

Lucas Tran

Zurich Insurance

Showcase

Multi-Model Regulatory Review Agent

47 12

Agentic Document Q&A with Evidence Trail

38 9

Cost-Optimized Routing Agent

62 18

[14:32:01] router → model_compare(gemini-flash, mistral-large) ✓ 1.2s

[14:31:45] eval → benchmark(multilingual_rag, 4 models) ✓ 3.8s

[14:30:12] workflow → create(multilingual-rag-pipeline) ✓ 0.3s

[14:28:55] rag → ingest(sample_regulatory.pdf, 47 chunks) ✓ 2.1s

[14:27:30] system → session_start(user=alaa.hammam) ✓

[14:25:00] evidence → generate_ep(compliance_review, SHA-256) ✓ 4.5s