Shallow Solutions

Looks good, hides flaws

The output is fluent, formatted, and confident. It looks like the answer. It may not be.

AI defaults to assertive register regardless of soundness - what the technical literature calls "confabulation", what practitioners call "confidently wrong". The problem isn't obviously bad output; it's output that passes casual review but fails under scrutiny. The surface markers of expert work - coherent prose, credible structure, authoritative tone - are exactly what AI produces well. The failure is underneath. Detecting it requires the domain expertise to know what "good" actually looks like. Which is why Shallow Solutions and Deskilling compound each other: the less expertise you have, the less able you are to see the failure. Unconscious incompetence, Dunning-Kruger in action.

Design transfer

Research syntheses with fabricated themes or citations that sound authoritative
Wireframes with logical orphans — buttons without purpose, icons that contradict the surrounding text
Multi-screen compositions where each screen passes review but the collection is architecturally incoherent
Accessibility recommendations based on hallucinated standards
Competitive audits citing nonexistent studies

In the wild

Stanford research found Lexis+ AI produced incorrect information in 17% of queries; Westlaw hallucinated in 34%, leading to fabricated case citations entering court filings.— Author unknown (2026). When Reputable Databases Fail. The Tech Savvy Lawyer.
69% of references in ChatGPT's medical queries were false; only 7% of AI-generated medical articles contained fully accurate references.— Author unknown (2023). High Rates of Fabricated and Inaccurate References in ChatGPT-Generated Medical Content. PMC.
A senior architect discovered AI-written code had inverted a boolean check, giving deactivated accounts admin access. The module passed initial QA because the error was semantic, not syntactic.— Osmani (n.d.). Vibe Coding Is Not The Same As AI-Assisted Engineering. Medium.
Semantic visual anomalies in AI-generated images: forensic analysis identified violations of commonsense physics — a climbing rope visible but not anchored to a rock face. Photorealistic but logically incoherent.— Tan et al. (2025). Semantic Visual Anomaly Detection and Reasoning in AI-Generated Images. arXiv.
AI-generated database queries that appeared correct but caused production meltdowns under real traffic, requiring a full week of debugging.— The Unnoticed
GenUI study (n=37, UCLA/Google): participants reported AI tools failing to maintain connective tissue across screens despite explicit prompts to reference earlier designs. Each piece looks correct; the whole does not hold together.— Chen et al. (2025). The GenUI Study. arXiv / UCLA & Google.

Use cases

AI-moderated Interviews

EXP-007

Runs qualitative interview studies at scales not feasible with traditional human moderation, shortening fieldwork timelines while generating structured, traceable outputs ready for analysis.

Explore·Developing

EXP-007

Competitive Landscape Analysis

EXP-006

Plots a first-pass competitive map, including adjacent actors and less-known entrants, so teams can focus on enrichment and understanding rather than research effort.

Explore·Developing

EXP-006

Design Codebase Discovery

DEF-103

Helps designers better understand the current state of a live product's structure by giving them AI-mediated access to the codebase reality. The ability to look-through current architecture can inform design decisions, vocabulary, and tradeoff identification, reducing rework stemming from flawed assumptions about factors which may impact functional experience.

Explore·Emerging

DEF-103

Diary Study — Adaptive Prompting

EXP-008

Generates richer longitudinal data by dynamically adapting prompts to each participant's emerging narrative, surfacing threads a fixed prompt schedule would miss.

Explore·Emerging

EXP-008

Domain Literature Synthesis

EXP-001

Facilitates domain immersion at project start, so practitioners can engage meaningfully with experts, and design contextually informed research plans, sooner.

Explore·Developing

EXP-001

Interview Question Development

EXP-003

Provides starting set of questions for researchers during study design.

Explore·Developing

EXP-003

Research Plan Critique

EXP-004

Reduces the risk of flawed research design by subjecting plans to structured adversarial critique, identifying blind spots and biases before they impact data collection or synthesis.

Explore·Developing

EXP-004

Research Plan Development

EXP-002

A first-draft research plan generated on project inputs frees the researcher from formalization focus on details of study design like hypothesis formulation, methodological choice and fieldwork logistics.

Explore·Developing

EXP-002

Cross-Study Synthesis

DEF-018

Surfaces longitudinal patterns and cross-study signals from accumulated research that would be impractical to identify through manual comparison of individual study outputs.

Define·Emerging

DEF-018

Project Context Assembly

DEF-017

Transforms accumulated project knowledge into a queryable intelligence layer, enabling teams to surface relevant precedent, constraints, and insights on demand rather than through manual document search.

Define·Developing

DEF-017

Research Format Shifting

DEF-088

Insight becomes easier for teams to use and act upon when it's accessible in diverse, fit-to-purpose forms; automated reformatting improves reach while reducing labor.

Define·Developing

DEF-088

Research Session Summarization

DEF-016

Enables same-day session summaries for stakeholder sharing, compressing the time between fieldwork and team alignment while reducing the manual burden of post-session writeup.

Define·Developing

DEF-016

Research-to-Requirements

DEF-102

Compresses requirements drafting from days to hours by generating structured, reviewable documentation from discovery inputs, reducing blank-page paralysis and increasing the proportion of evidence that makes it into the specification.

Define·Developing

DEF-102

Survey & Open-Text Analysis

DEF-015

Analysis of open-text responses or other unstructured survey data at a scale not possible with manual coding, to identify trends and patterns across large response sets quickly.

Define·Developing

DEF-015

Synthesize Opportunity Space

DEF-101

Compresses the transition from dispersed discovery evidence to a structured, interrogable opportunity map, enabling product and design teams to align on the problem space in hours rather than weeks.

Define·Emerging

DEF-101

Design Critique

CON-030

Provides structured design review on demand, identifying issues the designer may be too close to see so the work can be strengthened before human review.

Concept·Emerging

CON-030

Design Decision Documentation

CON-085

Product goes live with articulated design decisions explicit in institutional memory, rather than creating knowledge debt that will have to be reconciled at the next iteration.

Concept·Developing

CON-085

Design Remix

CON-031

Systematic recombination of elements from different products or brands - using prior art to expose new possibilities.

Concept·Emerging

CON-031

Design-by-Analogy

CON-081

Produces a range of evidence-backed, parallel solutions in adjacent sectors to encourage lateral thinking, a strategic input prohibitively time-consuming and expertise-dependent to produce otherwise.

Concept·Emerging

CON-081

Low-fidelity mockups & Layout Generation

CON-020

Move faster from content requirements to reasoned interfaces layouts by generating candidate wireframes for critique and further exploration.

Concept·Developing

CON-020

Pattern Library Synthesis

CON-091

Bootstraps a canonical pattern library in hours rather than quarters, while simultaneously producing a gap analysis that informs design system investment priorities.

Concept·Emerging

CON-091

Placeholder Image and Copy Generation

CON-023

Populate mockups with contextually appropriate placeholder imagery and first-draft copy, providing higher-fidelity to support stakeholder review and user testing.

Concept·Developing

CON-023

Research Stimulus Generation

CON-025

Create personalised stimuli for user research tailored to each participant's role, industry, proficiency level, or market without proportional increases in design or engineering time.

Concept·Emerging

CON-025

Scenario Generation

CON-029

Produces believable, research grounded use scenarios, allowing expert evaluation of product fit, and supporting stakeholder understanding of product direction, during concept development.

Concept·Emerging

CON-029

Accessibility Audit

VAL-034

Catches accessibility issues before development by automating WCAG compliance checks across designs and live interfaces, reducing remediation costs and compliance exposure.

Validate·Established

VAL-034

Automated Heuristic Evaluation

VAL-041

AI analyses UI screenshots against established usability principles, producing a structured list of potential violations that a designer or specialist reviews and prioritises.

Validate·Established

VAL-041

Beta Feedback Synthesis

VAL-084

Beta findings are preprocessed to enable deeper treatment, making them more likely to influence release decisions, rather than just skimmed; accelerating time to insight gives more time to correct, and makes it harder to abdicate corrections that will create experience debt.

Validate·Developing

VAL-084

Cross-Channel Consistency Audit

VAL-103

Detects cross-channel semantic drift that degrades brand coherence and AI-mediated visibility, enabling teams to remediate contradictions before they compound into systemic brand confusion.

Validate·Emerging

VAL-103

Predictive Attention Analysis

VAL-037

As an early screening tool for catching hierarchy problems. Predicted attention heatmaps are generated without participants, enabling rapid visual hierarchy checks before committing to live eye-tracking or running behavioral analytics on live products.

Validate·Established

VAL-037

Regulatory Compliance Scan

VAL-102

Catches compliance violations at the design stage rather than at formal review or post-launch, reducing remediation cost and shortening the review cycle for regulated product teams.

Validate·Emerging

VAL-102

Research Bias Audit

VAL-036

Post fieldwork analysis, the user researcher has an AI critique their report by looking for areas of bias or interpretive overreach before findings are delivered.

Validate·Developing

VAL-036

Simulated Usability Testing

VAL-038

Agents simulate user interactions with a prototype, generating plausible behavioural data and task completion patterns without recruiting participants or scheduling sessions.

Validate·Emerging

VAL-038

Validate Concept Appeal

VAL-101

Compresses concept validation from weeks to days by running AI-moderated interviews at scale, enabling teams to test multiple concepts in parallel and kill weak ideas before development investment.

Validate·Developing

VAL-101

Code-to-Canvas

DEL-103

PLACEHOLDER (scope pending ITG & Fortis). Keeps the Figma canvas in sync with implementation reality by writing UI built or changed in code (IDE prompt-to-prototype) back into the design file, so designers iterate against what actually ships rather than a drifting mock.

Deliver·Emerging

DEL-103

Codebase-Extracted Design System

DEL-057

Generates a structured design system - markup documentation, living demos, component inventory - from an existing production codebase, creating a system where none formally existed.

Deliver·Emerging

DEL-057

Cross-Library Component Comparison

DEL-090

Reduces cross-library alignment auditing from days of manual file-switching to minutes of AI-assembled side-by-side comparison, enabling governance of multi-brand architectures at scale.

Deliver·Emerging

DEL-090

Design Token Management

DEL-048

Facilitates design tokens managements across brands and platforms, reducing manual effort and inconsistency risks that token changes introduce at scale.

Deliver·Developing

DEL-048

Design-Code Sync

DEL-072

Enables design and code to stay synchronised through round-trip updates: draft in the design tool, generate in code, push back for visual comparison, modify in either environment, and keep both current.

Deliver·Emerging

DEL-072

Designer Code Contribution

DEL-053

Enables designers to make contained design-quality changes directly in the production codebase, bypassing the handoff cycle entirely for work that is within design's domain of expertise.

Deliver·Emerging

DEL-053

File Hygiene Automation

DEL-043

Replaces names with labels that reflect layer content and purpose, making consistent naming and organisation easier. Prerequisite for machine-readable components that support AI-consumable Design Systems.

Deliver·Established

DEL-043

Machine-Readable Design System

DEL-055

Structures design system knowledge into a machine-readable layer that AI agents can query, enabling downstream workflows - generation, auditing, documentation - to operate against the actual system rather than guessing from training data.

Deliver·Emerging

DEL-055

Product Primitive Documentation

DEL-102

Enables AI to generate task-appropriate, adaptive interfaces rather than generic page layouts by providing the domain-level context that component documentation alone cannot supply.

Deliver·Emerging

DEL-102

Production Code Generation

DEL-052

Generates code that uses the design system reliably enough to enter the production pipeline, compressing cycle time from approved design to shipped code.

Deliver·Emerging

DEL-052

Requirements Refinement

DEL-089

Implementation starts with documentation that has been made more internally consistent, and hardened against misreading, reducing rework and drift between spec and what's shipped.

Deliver·Developing

DEL-089

UI Linting and Compliance Checking

DEL-054

Detects design system violations in UI implementations before they reach production, catching inconsistencies that manual review misses at scale.

Deliver·Emerging

DEL-054

Visual Regression Testing

DEL-058

Detects unintended visual changes between design specifications and code implementations automatically, catching regressions that manual review misses at scale.

Deliver·Established

DEL-058

Design System Compliance Monitoring

IMP-061

Continuously reviews live product against design system rules, detecting drift, inconsistencies, and shadow implementations before they compound into systemic design debt.

Improve·Developing

IMP-061

Experience Analytics Monitoring

IMP-108

Detects experience quality issues across thousands of sessions, helping teams remediate before they impact into business.

Improve·Developing

IMP-108

Research Intelligence

IMP-059

Transforms static research repositories into queryable knowledge systems that activate relevant findings at the point of decision, reducing redundant research and connecting past insights to current questions.

Improve·Emerging

IMP-059

Session Replay Analysis

IMP-066

Creates a unified diagnostic view by generating summaries that triangulates session replay with friction points and behavioural patterns found across data sources.

Improve·Developing

IMP-066