Mark's Reports

# AI Radar ΓÇö 2026-05-01

Scored for Mark Barnett: cost-conscious, local-first, Apex (13.8 GB RAM), TP3 stack, Oracle, Bidet AI, teaching. Updated with second scan pass (Claude Code changelog, MCP updates, managed agents).


Actions I recommend Mark take this week (top 3, max)

  1. Pull Gemma 4 E4B on Apex via Ollama ΓÇö fits in Apex RAM budget (~3-5 GB quantized), strong benchmarks, directly addresses the local-LLM goal; one command: ollama pull gemma4:4b. Source: ollama.com/library/gemma4

  2. Set alwaysLoad: true for OMI and biometric MCP servers in Claude Code ΓÇö Claude Code v2.1.121 added this option; it skips the startup deferral that causes silent disconnects on Apex's MCP servers. Edit .claude/settings.json, add "alwaysLoad": true to each MCP server entry. Source: releasebot.io/updates/anthropic/claude-code

  3. Remove beta header from Claude API web search calls ΓÇö Web search + code execution are now GA (no beta opt-in required). Code execution is free when paired with web search. Any TP3 agent scripts using the beta header can drop it. Source: platform.claude.com/docs


Ranked candidates

# Candidate Source Score Why it matters for Mark Integration cost
1 Gemma 4 E2B / E4B ΓÇö Google open model, multimodal, 128K context, Apache 2.0 HF Blog ┬╖ Ollama 22 Directly hits local-LLM goal. E4B benchmarks above models 4-5x its size. Fits Apex RAM. One ollama pull away. Zero ongoing cost. Low ΓÇö single command
2 Claude Opus 4.7 ΓÇö same price as 4.6, 87.6% SWE-bench vs GPT-5.5's 74.9% Anthropic 21 Claude Code already defaults to Opus ΓÇö you get the upgrade for free. More capable = fewer retries = lower actual token spend. None ΓÇö already live
3 Claude API: web search + code execution now GA ΓÇö no beta header, code exec free with web search Claude Docs 21 Remove one header line per API call. Free code execution reduces token overhead on TP3 agent scripts that search + process data. Minimal ΓÇö remove header
4 Claude Code v2.1.121ΓÇô123 ΓÇö alwaysLoad MCP, /resume by PR URL, image memory leaks fixed, OAuth 401 loop squashed Releasebot 20 alwaysLoad directly fixes MCP startup reliability on Apex. Memory leak fix matters for long sessions. PR-URL resume improves agent handoff workflow. Auto-update ΓÇö confirm with claude --version
5 Claude Managed Agents public beta ΓÇö secure sandboxing, built-in tools, SSE streaming, Memory beta Releasebot Anthropic 20 Relevant path for running persistent Oracle-style agents via Claude API rather than DIY Python supervisors. Memory beta = structured state across runs. API usage-based cost ΓÇö needs Mark approval before adopting
6 MCP server auto-retry (3x on transient startup error) ΓÇö built into Claude Code Claude Docs 19 TP3 runs 4 MCP servers. Transient startup failures no longer leave a server permanently disconnected ΓÇö directly addresses the silent-failure pattern that has burned this stack before. None ΓÇö automatic
7 Unsloth Qwen3.6-35B-A3B-GGUF ΓÇö MoE, 3B active params at inference, GGUF, 1.94M HF downloads HuggingFace trending 19 MoE trick: 35B label, 3B actual compute. Ollama-native. Strong coding + multilingual. Could replace Gemini for offline chat on Apex. Low ΓÇö ollama pull
8 Claude MCP Connectors for Creative Tools — Blender, Adobe CC (50+ apps), Ableton, Autodesk (April 28) Anthropic 16 Mark teaches digital media. Blender MCP is real (Python API bridge). Could slot into Content Creator Club or Bidet → creative pipeline. Low-Medium — MCP setup
9 Gemini 3.1 Flash TTS ΓÇö upgraded expressiveness, "emotionally rich" audio DeepMind blog 16 Mark already uses Gemini TTS for Grind voiceovers; Oracle could use audio replies. Single API field change to test. Very low ΓÇö API param
10 Gemma 4 26B MoE ΓÇö same family, 4B active params at inference, ~18 GB RAM needed Ollama 14 Better than E4B but won't fit Apex's 13.8 GB total RAM at 4-bit quant. Flag for future Apex GPU/RAM upgrade. High on current Apex

Cut with reason


Sources scanned

Note on Gemma 4 release date: Multiple sources cite early April 2026, just outside strict 7-day window. Included because it's actively trending (7.47M HF downloads), just hit Ollama with KV cache fixes, and directly serves Mark's stated local-first goals.


Cost of this run

Generated 2026-05-01 19:51 by tp3_scripts/ai_radar/run_radar.ps1.