BiModal Design

An open-source framework for designing interfaces that serve humans and AI agents

Most sites are invisible to the AI agents that visit them. BiModal Design makes yours work across the full capability spectrum — from HTTP retrievers to vision agents to protocol-native systems.

Created by Joel Goldfoot

…or skip to the 5-minute Quick Start.

$ curl -s https://bimodal.design | grep "<main"
<main> … full content in the HTML — every agent can read it
$ curl -s https://typical-spa.example | grep "<main"
<div id="root"></div> — empty shell; invisible to agents that don't run JS
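
The same check can be scripted so it runs against saved HTML rather than a live site. The following is a minimal sketch, not part of the framework: the class and function names are hypothetical, and it assumes "readable without JavaScript" simply means that visible text exists inside `<main>` in the raw response body.

```python
from html.parser import HTMLParser


class MainTextProbe(HTMLParser):
    """Collects text found inside <main> in raw, unrendered HTML."""

    def __init__(self):
        super().__init__()
        self.main_depth = 0   # how many <main> elements are currently open
        self.skip_depth = 0   # how many <script>/<style> elements are open
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag == "main":
            self.main_depth += 1
        elif tag in ("script", "style"):
            self.skip_depth += 1

    def handle_endtag(self, tag):
        if tag == "main" and self.main_depth:
            self.main_depth -= 1
        elif tag in ("script", "style") and self.skip_depth:
            self.skip_depth -= 1

    def handle_data(self, data):
        # Keep only non-empty text that sits inside <main> and outside scripts.
        if self.main_depth and not self.skip_depth and data.strip():
            self.chunks.append(data.strip())


def main_text(html: str) -> str:
    """Return the text an agent without a JS engine would find in <main>."""
    probe = MainTextProbe()
    probe.feed(html)
    probe.close()
    return " ".join(probe.chunks)


# Hypothetical sample pages, standing in for the two curl responses.
server_rendered = "<html><body><main><h1>Pricing</h1><p>Plans start at $10.</p></main></body></html>"
spa_shell = '<html><body><div id="root"></div><script src="/app.js"></script></body></html>'

print(repr(main_text(server_rendered)))
print(repr(main_text(spa_shell)))
```

An empty result for a page suggests that agents which do not execute JavaScript receive no main content from it.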

Key Framework Principles

Agent Capability Spectrum

Six levels of AI agents interact with your interface — from HTTP retrievers to protocol-native systems. BiModal Design ensures every level is served.

Defense in Depth

Five architectural layers (server-rendered content, semantic structure, structured data, APIs, and agent protocols) ensure graceful degradation across the entire spectrum.

Standards-Based Approach

Built on schema.org, WAI-ARIA, OpenAPI, and emerging protocols like MCP — no custom attributes needed. Use the standards the ecosystem already understands.
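
As an illustration of the structured-data part of this approach, the sketch below builds a schema.org description as JSON-LD and wraps it in the standard `<script type="application/ld+json">` element. It is an assumption-laden example rather than framework code: the helper name `json_ld_script` and the product values are hypothetical, while the `Product` and `Offer` types and their properties come from schema.org.

```python
import json


def json_ld_script(data: dict) -> str:
    """Serialize a schema.org object into an embeddable JSON-LD script tag."""
    # Escape "</" so a string value cannot close the script element early;
    # "<\/" is a valid JSON escape and parses back to "</".
    payload = json.dumps(data, indent=2).replace("</", "<\\/")
    return f'<script type="application/ld+json">\n{payload}\n</script>'


# Hypothetical product record used only for illustration.
product = {
    "@context": "https://schema.org",
    "@type": "Product",
    "name": "Example Widget",
    "description": "A sample product used to illustrate structured data.",
    "offers": {
        "@type": "Offer",
        "price": "10.00",
        "priceCurrency": "USD",
    },
}

print(json_ld_script(product))
```

Placing the resulting tag in server-rendered HTML lets an agent read the same facts without parsing the visual layout, and it needs no custom attributes.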

Performance by Agent Level

Data from WebArena, VisualWebArena, ST-WebAgentBench, and agent protocol benchmarks (2024-2026)

Task completion rates comparing conventional UI with BiModal Design across agent levels
Agent Level             | Conventional UI | BiModal Design | Improvement
HTTP Retrievers (L0-1)  | 12-25%          | 60-75%         | +40-75%
Browser Automation (L2) | 35-50%          | 75-88%         | +30-50%
Vision Agents (L3)      | 40-55%          | 70-85%         | +25-35%
API / Protocol (L4-5)   | N/A             | 90-98%         | New channel
Human Success           | 72-89%          | 72-89%         | Maintained

Sources: WebArena (Zhou et al., 2024), VisualWebArena (Koh et al., 2024), ST-WebAgentBench (He et al., 2025), Microsoft Build 2025