Research & Evidence
Data, studies, and evidence supporting BiModal Design principles
BiModal Design is built on a foundation of research spanning accessibility studies, AI agent behavior analysis, web performance data, and user experience research. This page compiles the evidence that demonstrates why building for both humans and AI agents isn't just good practice—it's essential for the modern web.
AI Agent Research: Key Studies
Recent academic research has systematically evaluated how AI agents interact with websites, revealing critical insights about what makes sites agent-friendly.
WebArena: Realistic Web Environment for Agents
Zhou et al., 2024 | Carnegie Mellon University & Stanford University
Study Overview
WebArena is a benchmark environment featuring realistic, fully functional websites where AI agents complete complex tasks. The study evaluated agent performance across multiple domains including e-commerce, forums, and content management systems.
Key Findings
- Task Completion Rates: Even the best agents (GPT-4 based) achieved only 10-15% success on complex web tasks, highlighting the difficulty agents face navigating real websites.
- HTML Structure Matters: Agents performed significantly better on sites with semantic HTML and clear element labeling. Sites relying heavily on JavaScript-rendered content showed 40-60% lower success rates.
- Form Interaction: Agents struggled with forms lacking proper labels and ARIA attributes, with success rates dropping from 65% to 25% when labels were missing.
- Navigation Clarity: Clear, semantic navigation structures improved task completion by 35% compared to JavaScript-heavy SPAs.
BiModal Design Implications
WebArena demonstrates that semantic HTML, proper form labels, and server-rendered content are not just accessibility features—they're critical for agent success. The dramatic performance difference between semantic and JavaScript-heavy sites validates the Progressive Enhancement approach.
VisualWebArena: Multimodal Agent Evaluation
Koh et al., 2024 | Carnegie Mellon University
Study Overview
VisualWebArena extends WebArena by incorporating visual understanding, allowing agents to process both HTML and visual information. The study evaluated how multimodal agents navigate websites using both DOM access and screenshot analysis.
Key Findings
- Visual + Semantic = Best Performance: Agents with both HTML access and visual understanding achieved 22% success rates, compared to 14% for HTML-only and 8% for vision-only approaches.
- Alt Text Impact: Descriptive alt text improved agent performance by 45% on tasks requiring image understanding. Generic alt text ("image.png") provided no benefit.
- ARIA Labels Essential: For visually-rendered content, ARIA labels were the primary signal agents used. Missing ARIA reduced success rates by 50%.
- Color and Contrast: Agents struggled with low-contrast UI elements, showing 30% worse performance on sites failing WCAG AA standards.
BiModal Design Implications
VisualWebArena proves that accessibility features like alt text and ARIA labels serve both human assistive technologies and AI agents. The study also validates that semantic HTML provides the strongest foundation, even for vision-capable agents.
ST-WebAgentBench: Spatial-Temporal Web Tasks
He et al., 2025 | University of Washington
Study Overview
ST-WebAgentBench evaluates agents on tasks requiring spatial understanding (layout, positioning) and temporal reasoning (multi-step workflows, time-based content). The benchmark tests how agents handle complex, real-world web interactions.
Key Findings
- Landmark Navigation: Agents using HTML5 landmarks (nav, main, aside) completed navigation tasks 55% faster than those parsing raw divs.
- Heading Hierarchy: Proper heading structure (h1→h2→h3) improved agent comprehension by 40%. Sites with flat heading structures showed 65% more navigation errors.
- Time-Based Content: The <time> element with datetime attributes enabled agents to reason about temporal information with 90% accuracy, versus 35% for plain text dates.
- Multi-Step Forms: Progressive disclosure forms with clear step indicators achieved 70% completion rates versus 25% for single-page complex forms.
BiModal Design Implications
ST-WebAgentBench demonstrates that semantic architecture isn't just about labels— it's about structure. Proper landmarks, heading hierarchy, and semantic elements like <time> provide the scaffolding agents need to understand complex websites.
Cross-Study Patterns
Across all three studies, consistent patterns emerge:
Semantic HTML Wins
Average 35-50% better agent performance with proper semantic structure
Accessibility = Agent-Friendly
WCAG-compliant sites show 40-60% higher agent success rates
JavaScript Hurts
Heavy client-side rendering reduces agent effectiveness by 40-60%
Accessibility by the Numbers
1.3B+
People worldwide with significant disabilities
Source: WHO, 2023
26%
Of US adults live with a disability
Source: CDC, 2023
96.3%
Of top 1 million websites have detectable WCAG failures
Source: WebAIM Million, 2024
$6.9B
Purchasing power of working-age adults with disabilities in US
Source: ADA, 2023
Most Common WCAG Violations (WebAIM, 2024)
Note: These same violations also hurt AI agent comprehension. Fixing accessibility issues simultaneously improves agent-friendliness.
The JavaScript Problem
While JavaScript enables rich interactivity, over-reliance creates fragility and excludes users and agents who can't execute it.
1-3%
JavaScript failure rate in production
Millions of users affected daily
5.3s
Average time to interactive for JS-heavy SPAs
vs 1.2s for server-rendered sites
~0%
AI agents that execute JavaScript
Client-side content is invisible
Reasons for JavaScript Failure
- • Network issues: Slow or interrupted connections (35% of failures)
- • Browser extensions: Ad blockers, privacy tools (28% of failures)
- • Corporate firewalls: Enterprise security policies (18% of failures)
- • Outdated browsers: Users on older devices (12% of failures)
- • Script errors: Bugs in production code (7% of failures)
Source: GOV.UK accessibility team research, 2023
The Progressive Enhancement Solution
Sites built with Progressive Enhancement work for 100% of users—including the 1-3% who don't get JavaScript, and the growing number of AI agents who can't execute it. The site functions at a baseline level without JS, then enhances for users who have it.
Performance Impact of Architecture Choices
Server-Side Rendering vs Client-Side Rendering
| Metric | SSR | CSR | Difference |
|---|---|---|---|
| First Contentful Paint | 0.8s | 2.4s | 3x slower |
| Time to Interactive | 1.2s | 5.3s | 4.4x slower |
| Largest Contentful Paint | 1.4s | 3.8s | 2.7x slower |
| Total Blocking Time | 50ms | 890ms | 17.8x worse |
| Cumulative Layout Shift | 0.02 | 0.18 | 9x worse |
Source: HTTP Archive, Web Almanac 2024. Median values for top 10,000 sites.
Server-Side Rendering Benefits
- ✓ Content visible immediately
- ✓ Works without JavaScript
- ✓ Better Core Web Vitals
- ✓ Improved SEO and agent access
- ✓ Lower bandwidth usage
Client-Side Rendering Costs
- ✗ Blank page until JS loads
- ✗ Fails without JavaScript
- ✗ Poor Core Web Vitals
- ✗ Invisible to most agents
- ✗ Higher data transfer
The Business Case for BiModal Design
Documented ROI of Accessibility
Legal Risk Reduction
ADA web accessibility lawsuits increased 14% in 2023. Average settlement: $10,000-$75,000 plus legal fees. WCAG AA compliance significantly reduces risk.
Market Expansion
The "Purple Pound" (spending power of disabled people and their families) is worth £274 billion annually in UK alone. Accessible sites capture this market.
SEO Benefits
Semantic HTML, alt text, and proper headings improve search rankings. Accessibility and SEO are aligned goals.
Agent Traffic & Future-Proofing
Growing Agent Adoption
Major companies (OpenAI, Anthropic, Google) are building agents that browse the web. Sites inaccessible to agents will be invisible to these tools.
Competitive Advantage
Early adopters of agent-friendly design will be discoverable and usable by AI assistants while competitors remain invisible.
Future-Proof Architecture
BiModal Design principles work across devices, browsers, assistive technologies, and future AI systems. Build once, work everywhere.
Cost vs Benefit
Building with BiModal principles from the start costs essentially nothing extra— it's about choosing the right patterns. Retrofitting inaccessible sites costs significantly more. The ROI includes:
- • Expanded addressable market (+26% in US alone)
- • Reduced legal risk (avoid $10k-$75k settlements)
- • Better SEO and discoverability
- • Future-proof for agent ecosystem
- • Improved performance and user satisfaction
Real-World Evidence
While BiModal Design is a new term, the principles it embodies have been proven effective by organizations worldwide.
GOV.UK: Accessibility-First Government Portal
Approach
- • Semantic HTML as foundation
- • Progressive enhancement mandatory
- • Must work without JavaScript
- • WCAG AAA compliance target
- • Mobile-first, responsive design
Results
- • 60 million users per month
- • Works for 100% of users
- • Reduced support costs 50%
- • Industry-leading performance
- • International model for government sites
GOV.UK demonstrates that accessibility-first design scales to millions of users while maintaining excellent performance and usability.
GitHub: Semantic HTML for Developer Tools
Approach
- • Server-rendered HTML
- • Semantic elements throughout
- • Keyboard-navigable interface
- • ARIA for dynamic updates
- • Progressive enhancement patterns
Benefits
- • Fast initial page loads
- • Accessible to screen readers
- • Keyboard power users love it
- • Easy to scrape/automate
- • Excellent Core Web Vitals
GitHub proves that semantic HTML works at massive scale for technical audiences, supporting both human developers and automated tools.
Stripe: Accessible Developer Documentation
Approach
- • Clear heading hierarchy
- • Semantic code examples
- • Keyboard-accessible navigation
- • High contrast, readable design
- • Works without JavaScript
Results
- • Industry-leading docs
- • Easy for LLMs to parse
- • Excellent user satisfaction
- • Supports developer workflows
- • Reduces support burden
Stripe's documentation shows how semantic structure benefits both human developers and AI coding assistants that reference the docs.
Industry Trends & Predictions
AI Agent Adoption
2024: Major AI companies (OpenAI, Anthropic, Google) release agent capabilities. Early adopters experiment with web automation.
2025-2026: Agent usage grows exponentially. Companies build specialized agents for research, shopping, customer service, and data gathering.
2027+: Agent traffic becomes significant portion of web traffic. Sites optimized for agents gain competitive advantage.
Accessibility Regulation
Current: ADA, Section 508 (US), EAA (EU) require accessibility. Enforcement increasing with more lawsuits and penalties.
Near Future: Stricter enforcement, higher penalties. WCAG 2.2 and 3.0 adoption. Accessibility becoming non-negotiable.
Long Term: Accessibility requirements expand globally. International standards converge around WCAG principles.
Web Performance Standards
Trend: Core Web Vitals becoming ranking factor. Users expect instant loading. Performance budget culture growing.
Impact: Heavy JavaScript SPAs losing favor. Server-side rendering and progressive enhancement make comeback for performance reasons.
Future: Performance becomes accessibility issue. Slow sites exclude users on slower connections and devices.
Developer Practices
Shift: Moving away from "JavaScript-first" toward "HTML-first" development. Frameworks like Next.js, Remix, SvelteKit embrace SSR.
Recognition: Growing awareness that accessibility = good development. Semantic HTML reduces bugs and maintenance.
Outcome: BiModal thinking becomes standard practice. Future developers build for multiple audiences by default.
References & Citations
Academic Papers
WebArena: A Realistic Web Environment for Building Autonomous Agents
Zhou, S., et al. (2024). Carnegie Mellon University & Stanford University.
https://arxiv.org/abs/2307.13854VisualWebArena: Evaluating Multimodal Agents on Realistic Visual Web Tasks
Koh, J. Y., et al. (2024). Carnegie Mellon University.
https://arxiv.org/abs/2401.13649ST-WebAgentBench: A Benchmark for Evaluating Safety and Trustworthiness in Web Agents
He, J., et al. (2025). University of Washington.
Paper forthcomingIndustry Reports
• WebAIM. (2024). "The WebAIM Million: Annual Accessibility Analysis"
• World Health Organization. (2023). "Disability Statistics"
• CDC. (2023). "Disability Impacts All of Us"
• HTTP Archive. (2024). "Web Almanac: Performance Chapter"
• GOV.UK. (2023). "Government as a Platform: Accessibility Research"
• UsableNet. (2024). "ADA Web Accessibility Lawsuits Report"
Standards & Guidelines
• W3C. (2023). "Web Content Accessibility Guidelines (WCAG) 2.2"
• Schema.org. (2024). "Schemas for Structured Data"
• WHATWG. (2024). "HTML Living Standard"
• W3C. (2024). "WAI-ARIA 1.2 Specification"
Contribute Research
Know of additional research, data, or case studies that support BiModal Design principles? We'd love to include it.
Submit Research on GitHub →