Mark's Reports
Gemini Pro's deep-research output, run on Mark's approved prompt 2026-05-09. Read alongside Claude's deep-research at /r/2026-05-09-bidet-deep-research — both reach similar conclusions on the big strokes; Gemini adds the SLP taxonomy (Receptive/Expressive) and the Vygotskyan-scaffolding recommendation that Claude missed.

Bidet AI and the Edge-Native Accessibility Paradigm: A Comprehensive Architectural and Empirical Analysis

Source: Gemini Pro Deep Research, run on Mark Barnett's approved prompt 2026-05-09. Paste verbatim from chat — formatting preserved.

The intersection of automated speech recognition (ASR) and large language models (LLMs) has catalyzed a fundamental shift in human-computer interaction, moving away from keyboard-dependent input toward multimodal, voice-first capture. The proposed application, Bidet AI, seeks to leverage this shift by deploying a 100% on-device Android architecture that captures unstructured verbal "brain-dumps" via Whisper-tiny and reformats them using Google Gemma 4 E4B. By explicitly targeting college students with attention-related learning differences, while maintaining a broad accessibility framework, the application attempts to decouple the cognitive generation of ideas from the mechanical execution of writing.

1. Competitive Landscape

The core functional concept of Bidet AI is in a saturated category — AudioPen, Letterly, Oasis AI, Willow Voice (commercial cloud), Ito + Handy (open-source desktop), and Dictly (closest analog: 100% on-device, but Apple-only, freemium). Bidet AI's unique intersection: 100% on-device Android + Apache 2.0 open-source + accessibility-first prompt engineering. Bidet AI democratizes the proprietary capabilities of tools like AudioPen and Dictly for the open-source Android ecosystem. Moderate refinement on UX; genuinely unique on the Android-Apache-on-device axis.

2. Empirical Efficacy by Population

Target Population Evidence Quality Primary Benefit Key Limitation
ADHD Robust (peer-reviewed) Cognitive offloading; bypasses executive dysfunction Risk of over-reliance leading to skill decay
Dyslexia / Dysgraphia Robust (peer-reviewed) Removes orthographic barriers; text simplification reduces reading fatigue Simplification must preserve original meaning
Late-Deaf / HoH Moderate Symmetric use reduces lip-reading fatigue Environmental noise degrades initial ASR
ELL / Low-Literacy Mixed Low-anxiety production; vocabulary scaffolding Debated whether it aids acquisition or masks deficits
Dysarthria / Stuttering Weak / Problematic LLM theoretically smooths syntax Whisper-tiny WER 26-36% on TORGO/UASpeech; LLM falls into repetition loops on stuttered speech

3. The Skeptical Steel-Man

4. Architectural Framing: The SLP Taxonomy

The "Understand vs Be Understood" framing is intuitive for consumer software but does not match clinical literature. The standard nomenclature is:

Recommendation: relabel internally as "Receptive Support" / "Expressive Support" for grant-writing and IEP integration credibility.

For consumer/disability-rights framing, the Capability Approach (Sen, Nussbaum) frequently uses "to understand and be understood" to describe communication rights. Both framings are defensible; the SLP-aligned naming wins on academic credibility.

5. Hardware Constraints

6. Realistic Impact Ceiling

Population Prevalence Hardware Compat Realistic Impact
ADHD (college) ~16% of students Excellent High
Dyslexia / Dysgraphia ~6% of students Excellent High
Adult Stuttering ~0.96% Marginal (Whisper hallucination risk) Moderate
Dysarthria (Stroke/ALS) ~52% of stroke survivors Failure Zero
Severe Cognitive Impairment Variable Failure Zero

7. Strategic Synthesis (Gemini's verdict)

Bidet AI represents a potent, privacy-preserving synthesis of edge-native ML and accessibility theory. While the fundamental product logic is actively utilized by commercial tools like AudioPen and Letterly, the strict commitment to an offline, Apache 2.0 Android architecture carves out a vital sociotechnical niche.

The reliance on Whisper-tiny creates an absolute intelligibility floor; users with severe dysarthria will be abandoned by the acoustic model before the LLM can offer semantic rescue.

To mitigate cognitive skill decay, the system should avoid acting as a "black-box" author. It must incorporate transparent, Vygotskyan scaffolding — prompting the student to actively review the structural changes made by the LLM, ensuring the user remains the author of their own intellect.

If deployed with a precise understanding of its impact ceiling and clinical framing, Bidet AI possesses the capacity to fundamentally empower the neurodivergent edge-computing user.