ETH Zurich · Agentic AI Platform

Build. Collaborate.
Stay at the frontier.

A professional intelligence hub for building agentic AI workflows, evaluating foundation models, and connecting with a community of industry leaders.

Workspace

Platform Activity

Active Members127
Workflows342
Avg Cost/QueryCHF 0.03

Quick Actions

Foundation Models

Gemini 2.5 FlashGoogle
1M tokens$
Gemini 2.5 ProGoogle
1M tokens$$$
Claude Sonnet 4Anthropic
200K tokens$$
Claude 3.5 HaikuAnthropic
200K tokens$
Llama 4 MaverickMeta
128K tokens$

Tutorials

Beginner30m

Your First Agentic Workflow

Build a simple agent that uses tools to answer questions. Learn agent loops, tool calling, and structured responses using LangGraph.

6/6
Intermediate60m

RAG Pipeline from Scratch

3/8
Advanced90m

Multi-Agent Orchestration

0/10
Intermediate45m

Tool Use with MCP

0/7
Intermediate45m

Model Comparison & Evaluation

0/6
Advanced60m

Structured Outputs for Production

0/8

Community Discussions

Which model handles regulatory text best in production?

Dr. Sarah Meier·24 replies·2 hours ago

MCP tools we've built — share yours

Marco Bernasconi·18 replies·5 hours ago

Paper discussion: Causal Foundation Models reliability concerns (April 2026)

Prof. Anna Kovács·31 replies·1 day ago

Cost optimization tricks for multi-agent workflows

Lucas Tran·12 replies·2 days ago

Active Challenges

Expert34 participants

Build an explainability agent for automated decisions

Create an agent that can explain any ML model's decision in natural language, with causal reasoning and counterfactual explanations. Must comply with EU AI Act Article 86 requirements.

Due: June 30, 2026
Advanced21 participants

RAG pipeline under CHF 0.10 per query

Design a retrieval-augmented generation pipeline that maintains quality while keeping per-query costs below CHF 0.10. Evaluate on the community benchmark dataset.

Due: July 15, 2026

Evaluate & Govern

Model Arena

Side-by-side comparison

Compliance Sandbox

Test against regulations

Evidence Packages

Signed audit artifacts

Cost & Carbon

Sustainability metrics

Experts

SM

Dr. Sarah Meier

Swiss Re

MB

Marco Bernasconi

PostFinance

LT

Lucas Tran

Zurich Insurance

ER

Dr. Elena Rossi

UBS

Showcase

Multi-Model Regulatory Review Agent

4712

Agentic Document Q&A with Evidence Trail

389

Cost-Optimized Routing Agent

6218

Research Feed

Causal Foundation Models: Promise and Production-Readiness

Zhang et al. · ICML 2026 2026

LLM Agents as Causal Orchestrators, Not Causal Reasoners

Kiciman et al. · NeurIPS 2025 2025

Structured Outputs at Scale: Constrained Decoding in Production

OpenAI Research · arXiv 2026 2026

The MCP Standard: Universal Tool Integration for AI Agents

Anthropic · Anthropic Technical Report 2025

Guest Speakers

● Upcoming

Dr. Ilya Sutskever

Co-founder, SSI

What AI Safety Means for Enterprise Deployment

Recorded

Dr. Judea Pearl

Professor, UCLA

Causal Reasoning in the Age of Large Language Models

Recorded

Amanda Askell

AI Policy Lead, Anthropic

Designing AI Systems That Know What They Don't Know

Evidence Packages

Signed, tamper-proof compliance artifacts

Methodology

Data Profile

Results

Validation

Limitations

Decision Trace

Export:PDFJupyter Notebook (.ipynb)JSON MetadataLaTeX

Instructor Console

Moodle LTI integration

Cohort Management

Create cohorts, assign students, set programme dates. Integrates with Moodle via LTI.

Content Management

Create and organise tutorials, scenarios, and exercises. Align with any CAS programme structure.

Budget Configuration

Set per-user and per-cohort budgets for model API usage. Monitor spending in real-time.

Scenario Configuration

Configure evaluation scenarios with custom rubrics, datasets, and scoring dimensions.

Live Monitoring

See student activity in real-time — who is working, which models they're using, where they're stuck.

Review & Assessment

Review student workflows, evidence packages, and notebook submissions. Export grades to Moodle.

Trust & Safety

Grounded · Honest · Compliant · Transparent

Grounded

Every claim backed by evidence

  • Source Attribution
  • Hallucination Detection
  • Calibrated Uncertainty

Honest

Pushback over agreement

  • Honest Disagreement
  • Multi-Model Consensus
  • Built-in Red Teaming

Compliant

Regulation-ready by design

  • EU AI Act Readiness
  • Signed Evidence Packages
  • Swiss Data Sovereignty

Transparent

Nothing hidden, everything traceable

  • Full Decision Trace
  • Reproducibility by Design
  • Cost & Carbon Accounting