01
Shallow Solutions
Looks good, hides flaws
The output is fluent, formatted, and confident. It looks like the answer. It may not be.
AI defaults to assertive register regardless of soundness - what the technical literature calls "confabulation", what practitioners call "confidently wrong". The problem isn't obviously bad output; it's output that passes casual review but fails under scrutiny. The surface markers of expert work - coherent prose, credible structure, authoritative tone - are exactly what AI produces well. The failure is underneath. Detecting it requires the domain expertise to know what "good" actually looks like. Which is why Shallow Solutions and Deskilling compound each other: the less expertise you have, the less able you are to see the failure. Unconscious incompetence, Dunning-Kruger in action.
Design transfer
- Research syntheses with fabricated themes or citations that sound authoritative
- Wireframes with logical orphans — buttons without purpose, icons that contradict the surrounding text
- Multi-screen compositions where each screen passes review but the collection is architecturally incoherent
- Accessibility recommendations based on hallucinated standards
- Competitive audits citing nonexistent studies
In the wild
- Stanford research found Lexis+ AI produced incorrect information in 17% of queries; Westlaw hallucinated in 34%, leading to fabricated case citations entering court filings.— Author unknown (2026). When Reputable Databases Fail. The Tech Savvy Lawyer.
- 69% of references in ChatGPT's medical queries were false; only 7% of AI-generated medical articles contained fully accurate references.— Author unknown (2023). High Rates of Fabricated and Inaccurate References in ChatGPT-Generated Medical Content. PMC.
- A senior architect discovered AI-written code had inverted a boolean check, giving deactivated accounts admin access. The module passed initial QA because the error was semantic, not syntactic.— Osmani (n.d.). Vibe Coding Is Not The Same As AI-Assisted Engineering. Medium.
- Semantic visual anomalies in AI-generated images: forensic analysis identified violations of commonsense physics — a climbing rope visible but not anchored to a rock face. Photorealistic but logically incoherent.— Tan et al. (2025). Semantic Visual Anomaly Detection and Reasoning in AI-Generated Images. arXiv.
- AI-generated database queries that appeared correct but caused production meltdowns under real traffic, requiring a full week of debugging.— The Unnoticed
- GenUI study (n=37, UCLA/Google): participants reported AI tools failing to maintain connective tissue across screens despite explicit prompts to reference earlier designs. Each piece looks correct; the whole does not hold together.— Chen et al. (2025). The GenUI Study. arXiv / UCLA & Google.
Use cases
AI-moderated Interviews
EXP-007Runs qualitative interview studies at scales not feasible with traditional human moderation, shortening fieldwork timelines while generating structured, traceable outputs ready for analysis.
Explore·Developing
Competitive Landscape Analysis
EXP-006Plots a first-pass competitive map, including adjacent actors and less-known entrants, so teams can focus on enrichment and understanding rather than research effort.
Explore·Developing
Design Codebase Discovery
DEF-103Breaks down the translation barrier between design and engineering by giving designers direct, AI-mediated access to codebase reality, reducing redesign cycles caused by incorrect assumptions about current implementation.
Explore·Emerging
Diary Study — Adaptive Prompting
EXP-008Generates richer longitudinal data by dynamically adapting prompts to each participant's emerging narrative, surfacing threads a fixed prompt schedule would miss.
Explore·Emerging
Domain Literature Synthesis
EXP-001Facilitates domain immersion at project start, so practitioners can engage meaningfully with experts, and design contextually informed research plans, sooner.
Explore·Developing
Interview Question Development
EXP-003Provides starting set of questions for researchers during study design.
Explore·Developing
Research Plan Critique
EXP-004Reduces the risk of flawed research design by subjecting plans to structured adversarial critique, identifying blind spots and biases before they impact data collection or synthesis.
Explore·Developing
Research Plan Development
EXP-002A first-draft research plan generated on project inputs frees the researcher from formalization focus on details of study design like hypothesis formulation, methodological choice and fieldwork logistics.
Explore·Developing
Cross-Study Synthesis
DEF-018Surfaces longitudinal patterns and cross-study signals from accumulated research that would be impractical to identify through manual comparison of individual study outputs.
Define·Emerging
Project Context Assembly
DEF-017Transforms accumulated project knowledge into a queryable intelligence layer, enabling teams to surface relevant precedent, constraints, and insights on demand rather than through manual document search.
Define·Developing
Research Format Shifting
DEF-088Insight becomes easier for teams to use and act upon when it's accessible in diverse, fit-to-purpose forms; automated reformatting improves reach while reducing labor.
Define·Developing
Research Session Summarization
DEF-016Enables same-day session summaries for stakeholder sharing, compressing the time between fieldwork and team alignment while reducing the manual burden of post-session writeup.
Define·Developing
Research-to-Requirements
DEF-102Compresses requirements drafting from days to hours by generating structured, reviewable documentation from discovery inputs, reducing blank-page paralysis and increasing the proportion of evidence that makes it into the specification.
Define·Developing
Survey & Open-Text Analysis
DEF-015Analysis of open-text responses or other unstructured survey data at a scale not possible with manual coding, to identify trends and patterns across large response sets quickly.
Define·Developing
Synthesize Opportunity Space
DEF-101Compresses the transition from dispersed discovery evidence to a structured, interrogable opportunity map, enabling product and design teams to align on the problem space in hours rather than weeks.
Define·Emerging
Design Critique
CON-030Provides structured design review on demand, identifying issues the designer may be too close to see so the work can be strengthened before human review.
Concept·Emerging
Design Decision Documentation
CON-085Product goes live with articulated design decisions explicit in institutional memory, rather than creating knowledge debt that will have to be reconciled at the next iteration.
Concept·Developing
Design Remix
CON-031Systematic recombination of elements from different products or brands - using prior art to expose new possibilities.
Concept·Emerging
Design-by-Analogy
CON-081Produces a range of evidence-backed, parallel solutions in adjacent sectors to encourage lateral thinking, a strategic input prohibitively time-consuming and expertise-dependent to produce otherwise.
Concept·Emerging
Low-fidelity mockups & Layout Generation
CON-020Move faster from content requirements to reasoned interfaces layouts by generating candidate wireframes for critique and further exploration.
Concept·Developing
Pattern Library Synthesis
CON-091Bootstraps a canonical pattern library in hours rather than quarters, while simultaneously producing a gap analysis that informs design system investment priorities.
Concept·Emerging
Placeholder Image and Copy Generation
CON-023Populate mockups with contextually appropriate placeholder imagery and first-draft copy, providing higher-fidelity to support stakeholder review and user testing.
Concept·Developing
Research Stimulus Generation
CON-025Create personalised stimuli for user research tailored to each participant's role, industry, proficiency level, or market without proportional increases in design or engineering time.
Concept·Emerging
Scenario Generation
CON-029Produces believable, research grounded use scenarios, allowing expert evaluation of product fit, and supporting stakeholder understanding of product direction, during concept development.
Concept·Emerging
Accessibility Audit
VAL-034Catches accessibility issues before development by automating WCAG compliance checks across designs and live interfaces, reducing remediation costs and compliance exposure.
Validate·Established
Automated Heuristic Evaluation
VAL-041AI analyses UI screenshots against established usability principles, producing a structured list of potential violations that a designer or specialist reviews and prioritises.
Validate·Established
Beta Feedback Synthesis
VAL-084Beta findings are preprocessed to enable deeper treatment, making them more likely to influence release decisions, rather than just skimmed; accelerating time to insight gives more time to correct, and makes it harder to abdicate corrections that will create experience debt.
Validate·Developing
Cross-Channel Consistency Audit
VAL-103Detects cross-channel semantic drift that degrades brand coherence and AI-mediated visibility, enabling teams to remediate contradictions before they compound into systemic brand confusion.
Validate·Emerging
Predictive Attention Analysis
VAL-037As an early screening tool for catching hierarchy problems. Predicted attention heatmaps are generated without participants, enabling rapid visual hierarchy checks before committing to live eye-tracking or running behavioral analytics on live products.
Validate·Established
Regulatory Compliance Scan
VAL-102Catches compliance violations at the design stage rather than at formal review or post-launch, reducing remediation cost and shortening the review cycle for regulated product teams.
Validate·Emerging
Research Bias Audit
VAL-036Post fieldwork analysis, the user researcher has an AI critique their report by looking for areas of bias or interpretive overreach before findings are delivered.
Validate·Developing
Simulated Usability Testing
VAL-038Agents simulate user interactions with a prototype, generating plausible behavioural data and task completion patterns without recruiting participants or scheduling sessions.
Validate·Emerging
Validate Concept Appeal
VAL-101Compresses concept validation from weeks to days by running AI-moderated interviews at scale, enabling teams to test multiple concepts in parallel and kill weak ideas before development investment.
Validate·Developing
Codebase-Extracted Design System
DEL-057Generates a structured design system - markup documentation, living demos, component inventory - from an existing production codebase, creating a system where none formally existed.
Deliver·Emerging
Cross-Library Component Comparison
DEL-090Reduces cross-library alignment auditing from days of manual file-switching to minutes of AI-assembled side-by-side comparison, enabling governance of multi-brand architectures at scale.
Deliver·Emerging
Design Token Management
DEL-048Facilitates design tokens managements across brands and platforms, reducing manual effort and inconsistency risks that token changes introduce at scale.
Deliver·Developing
Design-Code Sync
DEL-072Enables design and code to stay synchronised through round-trip updates: draft in the design tool, generate in code, push back for visual comparison, modify in either environment, and keep both current.
Deliver·Emerging
Design-to-Code
DEL-052Generates code that uses the design system reliably enough to enter the production pipeline, compressing cycle time from approved design to shipped code.
Deliver·Emerging
Designer Code Contribution
DEL-053Enables designers to make contained design-quality changes directly in the production codebase, bypassing the handoff cycle entirely for work that is within design's domain of expertise.
Deliver·Emerging
File Hygiene Automation
DEL-043Replaces names with labels that reflect layer content and purpose, making consistent naming and organisation easier. Prerequisite for machine-readable components that support AI-consumable Design Systems.
Deliver·Established
Machine-Readable Design System
DEL-055Structures design system knowledge into a machine-readable layer that AI agents can query, enabling downstream workflows - generation, auditing, documentation - to operate against the actual system rather than guessing from training data.
Deliver·Emerging
Product Primitive Documentation
DEL-102Enables AI to generate task-appropriate, adaptive interfaces rather than generic page layouts by providing the domain-level context that component documentation alone cannot supply.
Deliver·Emerging
Requirements Refinement
DEL-089Implementation starts with documentation that has been made more internally consistent, and hardened against misreading, reducing rework and drift between spec and what's shipped.
Deliver·Developing
UI Linting and Compliance Checking
DEL-054Detects design system violations in UI implementations before they reach production, catching inconsistencies that manual review misses at scale.
Deliver·Emerging
Visual Regression Testing
DEL-058Detects unintended visual changes between design specifications and code implementations automatically, catching regressions that manual review misses at scale.
Deliver·Established
Design System Compliance Monitoring
IMP-061Continuously reviews live product against design system rules, detecting drift, inconsistencies, and shadow implementations before they compound into systemic design debt.
Improve·Developing
Experience Analytics Monitoring
IMP-108Detects experience quality issues across thousands of sessions, helping teams remediate before they impact into business.
Improve·Developing
Research Intelligence
IMP-059Transforms static research repositories into queryable knowledge systems that activate relevant findings at the point of decision, reducing redundant research and connecting past insights to current questions.
Improve·Emerging
Session Replay Analysis
IMP-066Creates a unified diagnostic view by generating summaries that triangulates session replay with friction points and behavioural patterns found across data sources.
Improve·Developing