/ask retrieval + Q&A logging — both bugs SHIPPED

1. Bug 1: embed before inserting Q&A log row

The /ask handler logs every Q+A back to tp3_memories_local for digital-twin growth. The column tp3_embedding is NOT NULL, but the old code's INSERT omitted the embedding column entirely, so the insert failed silently on every single call. The fix mirrors _insert_tp3()'s pattern: call _embed(document), fall back to a zero vector with needs_embed=true if Ollama is down, then INSERT all four columns. An embedding failure is logged loudly but does NOT block the Q&A row.
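The fallback described above can be sketched roughly as follows. This is a minimal illustration, not the code in _insert_tp3(): the helper names `embed_or_zero` and `build_qa_row`, the `EMBED_DIM` value, and the "Q:/A:" document layout are all assumptions, and only three of the four inserted columns are shown because the full column list is not given here.

```python
import logging
from typing import Callable, List, Optional, Tuple

log = logging.getLogger("ask_log")

EMBED_DIM = 768  # assumed; must match the declared size of tp3_embedding


def embed_or_zero(
    document: str,
    embed: Callable[[str], Optional[List[float]]],
    dim: int = EMBED_DIM,
) -> Tuple[List[float], bool]:
    """Return (vector, needs_embed). Never raises, so the Q&A row is never blocked."""
    try:
        vec = embed(document)
        if vec:
            return list(vec), False
        log.error("embed returned no vector; storing zero vector")
    except Exception:
        # Fail loud in the logs, but keep going with a placeholder vector.
        log.exception("embed failed; storing zero vector")
    return [0.0] * dim, True


def build_qa_row(question: str, answer: str, embed) -> dict:
    """Assemble the values for the log INSERT, embedding column included."""
    document = f"Q: {question}\nA: {answer}"  # hypothetical layout
    vector, needs_embed = embed_or_zero(document, embed)
    return {
        "tp3_document": document,
        "tp3_embedding": vector,    # always populated, satisfies NOT NULL
        "needs_embed": needs_embed,  # true marks the row for a later re-embed
    }
```

Passing the embedder in as a callable keeps the fallback testable without a running Ollama instance.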

2. Bug 2a: bump /ask top_k from 5 to 12

RRF was ranking low-information OMI snippets ("I got a contest.", 12 chars) at the top because they technically match the query terms while carrying no usable content. Widening the window from 5 to 12 lets longer, higher-signal rows reach the LLM. The prompt slot was raised from 2000 to 3000 chars so the extra rows actually fit.
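To see why the window size matters, here is a minimal sketch of reciprocal rank fusion followed by a character-budgeted prompt pack. The constant `RRF_K = 60` is the conventional default and an assumption here, and `rrf_fuse` and `pack_context` are hypothetical names rather than functions from tp3_memory_api.py.

```python
from collections import defaultdict
from typing import Dict, List

RRF_K = 60  # conventional constant; the production value is not stated


def rrf_fuse(channels: List[List[str]], top_k: int) -> List[str]:
    """Fuse ranked id lists: score(d) = sum over channels of 1 / (RRF_K + rank)."""
    scores: Dict[str, float] = defaultdict(float)
    for ranked_ids in channels:
        for rank, doc_id in enumerate(ranked_ids, start=1):
            scores[doc_id] += 1.0 / (RRF_K + rank)
    ordered = sorted(scores, key=lambda d: scores[d], reverse=True)
    return ordered[:top_k]


def pack_context(docs: List[str], budget_chars: int) -> str:
    """Concatenate documents in rank order until the character budget is spent."""
    parts: List[str] = []
    used = 0
    for doc in docs:
        if used + len(doc) > budget_chars:
            break
        parts.append(doc)
        used += len(doc)
    return "\n".join(parts)
```

With top_k=5, a short snippet that several channels agree on can push a long, informative row out of the cut entirely; a larger top_k keeps that row in play, and the larger budget is what lets it survive packing.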

3. Bug 2b: health context inject (new helper `_health_context_for_question`)

RAG over tp3_memories_local cannot surface aggregated health data — the docs are 30-180s per-row biometric blobs (sleep_stage, step_count) that always lose to longer documents in RRF/FTS. The /sleep/report endpoint already composes a prose summary but /ask doesn't see it. New helper detects sleep/heart/step/biometric keywords, then aggregates the last 7 days of sleep_session + step_count rows directly from Postgres into a compact summary string. Injected as a dedicated RECENT HEALTH block (same structure as the existing calendar inject). Fast (single query, ~50-100ms), bounded, fail-loud.
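The detect-then-summarize shape of the helper can be sketched as below. This is an illustration only: the keyword pattern, the function names, and the per-night input format are assumptions, and the real helper reads sleep_session and step_count rows from Postgres rather than taking a dict.

```python
import re
from datetime import date
from typing import Dict, Optional

# Illustrative keyword list; the production helper's exact list is not shown.
HEALTH_PATTERN = re.compile(r"\b(sleep|slept|heart|steps?|biometric)\w*", re.IGNORECASE)


def is_health_question(question: str) -> bool:
    """Cheap gate so the aggregate query only runs for health questions."""
    return HEALTH_PATTERN.search(question) is not None


def summarize_sleep(minutes_by_day: Dict[date, int]) -> Optional[str]:
    """Turn per-night sleep minutes into a compact block for the system prompt."""
    if not minutes_by_day:
        return None

    def fmt(minutes: int) -> str:
        return f"{minutes // 60}h{minutes % 60:02d}m"

    nights = sorted(minutes_by_day)
    average = sum(minutes_by_day.values()) // len(nights)
    best = max(nights, key=lambda d: minutes_by_day[d])
    worst = min(nights, key=lambda d: minutes_by_day[d])
    lines = [
        "RECENT HEALTH (last 7 days, from TP3 biometric ingest):",
        f"average sleep {fmt(average)} over {len(nights)} nights",
        f"best {best.isoformat()} ({fmt(minutes_by_day[best])})",
        f"worst {worst.isoformat()} ({fmt(minutes_by_day[worst])})",
    ]
    return "\n".join(lines)
```

Returning None for an empty window lets the caller skip the block instead of injecting an empty header.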

4. Bug 2c: RRF improvements (2 channel patches)

FTS channel OR-fallback — websearch_to_tsquery is implicit-AND, so "what is the status of the Bidet contest on Kaggle?" → 'status' & 'bidet' & 'contest' & 'kaggl' → 0 docs match all four. Added OR-join fallback when the strict AND query returns empty.
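One way to build the OR fallback is to rewrite the question as tokens joined by "or", which websearch_to_tsquery interprets as the OR operator. The sketch below assumes that approach; the stopword list, the `id` column, the inline to_tsvector call, and the `run_query` callable are all hypothetical stand-ins for whatever the FTS channel actually uses.

```python
import re
from typing import Callable, List

# Illustrative stopwords; "or" is included so it cannot double as an operator.
STOPWORDS = {"what", "is", "the", "of", "on", "a", "an", "my", "did", "i", "do", "or", "and"}

# Hypothetical query shape: id column and inline to_tsvector are assumptions.
FTS_SQL = (
    "SELECT id FROM tp3_memories_local "
    "WHERE to_tsvector('english', tp3_document) @@ websearch_to_tsquery('english', %s) "
    "ORDER BY ts_rank(to_tsvector('english', tp3_document), "
    "websearch_to_tsquery('english', %s)) DESC LIMIT %s"
)


def or_query(question: str) -> str:
    """Rewrite a question as 'a or b or c' for websearch_to_tsquery."""
    tokens = re.findall(r"[a-z0-9]+", question.lower())
    keep = [t for t in tokens if t not in STOPWORDS]
    return " or ".join(dict.fromkeys(keep))  # dedupe while keeping order


def fts_search(question: str, run_query: Callable[[str, tuple], List], limit: int = 12) -> List:
    """Strict implicit-AND pass first; relax to OR only when it returns nothing."""
    rows = run_query(FTS_SQL, (question, question, limit))
    if rows:
        return rows
    relaxed = or_query(question)
    if not relaxed:
        return []
    return run_query(FTS_SQL, (relaxed, relaxed, limit))
```

Keeping the strict pass first means precise matches still win whenever they exist; the relaxed query only runs on an empty result.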

New longform channel (5th in RRF) — ILIKE-matches salient tokens against tp3_document, filters to docs ≥ 400 chars from high-signal sources only (phone_notification, email_*, omi_summary, ai_radar_feed, gmail, calendar, ingest, ask). Two-pass: AND first (finds the gold), OR fallback ranked by (match_score DESC, recency DESC) so a 2026-05-09 Kaggle prize tree notification matching 2/3 tokens beats a fresh row matching only 1/3.
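The two-pass ranking can be illustrated in memory as follows. This is a sketch under stated assumptions: the real channel runs ILIKE against tp3_document in Postgres, while here substring matching on a hypothetical `Doc` record stands in for it, and ordering the strict pass by recency is a guess since the ordering of AND hits is not specified.

```python
from dataclasses import dataclass
from datetime import datetime
from typing import List, Sequence

MIN_CHARS = 400
HIGH_SIGNAL = {"phone_notification", "omi_summary", "ai_radar_feed",
               "gmail", "calendar", "ingest", "ask"}


@dataclass
class Doc:  # hypothetical stand-in for a tp3_memories_local row
    text: str
    source: str
    created_at: datetime


def allowed_source(source: str) -> bool:
    return source in HIGH_SIGNAL or source.startswith("email_")


def longform_channel(docs: Sequence[Doc], tokens: Sequence[str], limit: int = 12) -> List[Doc]:
    """AND pass first; otherwise OR pass ranked by (match count, recency)."""
    if not tokens:
        return []
    pool = [d for d in docs if len(d.text) >= MIN_CHARS and allowed_source(d.source)]

    def match_score(doc: Doc) -> int:
        lowered = doc.text.lower()
        return sum(1 for t in tokens if t.lower() in lowered)

    strict = [d for d in pool if match_score(d) == len(tokens)]
    if strict:
        # Assumed ordering for AND hits: newest first.
        return sorted(strict, key=lambda d: d.created_at, reverse=True)[:limit]

    partial = [d for d in pool if match_score(d) > 0]
    partial.sort(key=lambda d: (match_score(d), d.created_at), reverse=True)
    return partial[:limit]
```

Sorting on the tuple (match count, recency) is what lets an older row that matches more tokens outrank a fresh row that matches fewer.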

Evidence — 3 verify queries, actual JSON responses post-deploy

Q1: "what did I do today?"

Grounding: from the calendar inject (already working pre-fix). Calendar context names all 4 of today's school items.

Q2: "what is the status of the Bidet contest on Kaggle?"

Grounding: Claude is now surfacing "your Bidet AI submission" directly from retrieved memories. Pre-fix the answer was: "I don't have any information about a Bidet contest on Kaggle in my memories or data." The conservative "no real-time Kaggle access" caveat is appropriate — the memories are from past ingest, not live Kaggle API.

Retrieval verified: 12 hits now include the BidetAi contest essay (rank 4), Mark's BidetAi email to himself (rank 5), and the "Kaggle prize tree verified" phone notification (rank 12).

Q3: "summarize my last week of sleep"

Grounding: real biometric data from the new _health_context_for_question() inject. 7-night average + named best/worst nights with actual durations. Pre-fix the answer was: "I don't have access to your sleep data." The system prompt now contains a RECENT HEALTH (last 7 days, from TP3 biometric ingest): block with all 7 wake-mornings' duration + stage breakdown.

TP3 log rows landing — proves Bug 1 fixed

Five rows from the verify runs all landed cleanly. Container logs show zero NULL-embedding errors post-deploy.

/ask retrieval + Q&A logging fix

STATUS: GREEN — both bugs deployed and tested

What changed (4 deltas, all in `tp3_memory_api.py`)

1. Bug 1: embed before inserting Q&A log row

2. Bug 2a: bump /ask top_k from 5 to 12

3. Bug 2b: health context inject (new helper `_health_context_for_question`)

4. Bug 2c: RRF improvements (2 channel patches)

Evidence — 3 verify queries, actual JSON responses post-deploy

Q1: "what did I do today?"

Q2: "what is the status of the Bidet contest on Kaggle?"

Q3: "summarize my last week of sleep"

TP3 log rows landing — proves Bug 1 fixed

Latency

Files touched

Discipline notes
