Supertonic voices — pick yours

2026-05-22 · 10 voice options (5 female, 5 male) · same sentence to A/B easily

Every sample uses the same text so you can compare voices directly. Listen top to bottom and note the two or three you like best. Tell me your pick and it becomes the default voice for ntfy → Ray-Bans speech.

"Good morning Mark. Your TP3 is healthy. Spotify pulled 23 tracks overnight, and the Bidet AI just recovered from a brief CF tunnel hiccup."

Female voices (F1 - F5)

F1 · female 1 · 10.9s
F2 · female 2 · 10.3s
F3 · female 3 · 11.2s
F4 · female 4 · 9.4s
F5 · female 5 · 10.6s

Male voices (M1 - M5)

M1 · male 1 · 10.9s
M2 · male 2 · 10.7s
M3 · male 3 · 9.0s · earlier sample
M4 · male 4 · 10.2s
M5 · male 5 · 10.9s

Other knobs we can turn

Beyond the 10 built-in voices, Supertonic exposes these per-call parameters — we can A/B any combination:

Parameter · Range · What it changes
total_steps · 5 (low) to 12 (high) · Generation quality. Default 8 (medium). Higher = better articulation, slower generation. For your stack, 8 is the sweet spot; 10-12 is worth trying if you want maximum polish.
speed · 0.7 (slow) to 2.0 (fast) · Playback speed without changing pitch. Current samples are 1.05 (slightly faster than natural). 0.95-1.1 sounds most natural; 1.2-1.5 is useful for digest-style fast briefings.
lang · 31 languages · English, Spanish, French, Arabic, Korean, German, Portuguese, Italian, etc. The voice can pronounce the same text in different language modes. "na" is language-agnostic (best for mixed-language).
Voice cloning · any audio sample · You can train a voice from a 5-10 second sample of YOUR voice. Then ntfy alerts speak in your voice. See "Voice Builder | Cloning Demo" on the Supertonic demo page. Deeper integration; worth exploring once the base pipeline is live.
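
To make an A/B run less error-prone, the ranges in the table above can be checked before a call goes out. The following is a minimal Python sketch, not Supertonic's actual client: the parameter names total_steps, speed and lang come from the table, while the SpeakParams class, the to_payload helper, and the "text" and "voice" field names are hypothetical and would need to match whatever the real endpoint expects.

```python
from dataclasses import dataclass, asdict

# The ten built-in voices listed above: F1-F5 and M1-M5.
VOICES = {f"{gender}{n}" for gender in "FM" for n in range(1, 6)}


@dataclass(frozen=True)
class SpeakParams:
    """Hypothetical container for one synthesis call (names are assumptions)."""

    text: str
    voice: str = "F1"      # placeholder until a voice is picked
    total_steps: int = 8   # table default (medium quality)
    speed: float = 1.05    # speed used for the current samples
    lang: str = "na"       # language-agnostic mode named in the table

    def __post_init__(self) -> None:
        # Reject values outside the ranges stated in the table.
        if self.voice not in VOICES:
            raise ValueError(f"unknown voice: {self.voice!r}")
        if not 5 <= self.total_steps <= 12:
            raise ValueError("total_steps must be between 5 and 12")
        if not 0.7 <= self.speed <= 2.0:
            raise ValueError("speed must be between 0.7 and 2.0")


def to_payload(params: SpeakParams) -> dict:
    """Turn validated parameters into a plain dict ready for JSON encoding."""
    return asdict(params)
```

For example, to_payload(SpeakParams(text="Good morning Mark.", voice="M3", speed=1.15)) yields a dict with the chosen voice and speed plus the defaults, while a speed of 2.5 raises ValueError before anything is sent.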

What I'm asking

Tell me three things:

  1. Which voice number (F1-F5 or M1-M5)?
  2. Speed — stick with 1.05x, slow it to 1.0, or speed up to 1.15x?
  3. Voice cloning — want to record a 10-sec sample of YOUR voice and have all ntfy alerts speak in your voice? (Yes / no / later)

Once you pick, I lock that into the Tasker integration. The current 8 actions in your TP3 Notification task end with a Say that uses GoogleTTS. I'll replace it with an HTTP Request to the Supertonic endpoint plus a Music Play on the returned WAV. Same triggers, better voice. It runs on G16 today and migrates to Apex after Saturday.
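
Before wiring this into Tasker, the request-then-play flow can be tried from any machine on the LAN. The sketch below is a rough Python equivalent, with assumptions stated up front: the host and port come from the footer of this page, but the "/tts" path, the JSON field names, and the build_request and fetch_wav helpers are all hypothetical, since the server's real route and schema are not shown here.

```python
import json
import urllib.request

BASE_URL = "http://192.168.1.185:7788"  # host and port from the page footer
SPEAK_PATH = "/tts"                     # hypothetical route; check the server


def build_request(text: str, voice: str, speed: float = 1.05,
                  base_url: str = BASE_URL) -> urllib.request.Request:
    """Build (but do not send) a JSON POST asking for synthesized speech."""
    # Field names here are assumptions, not a documented schema.
    body = json.dumps({"text": text, "voice": voice, "speed": speed})
    return urllib.request.Request(
        base_url + SPEAK_PATH,
        data=body.encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def fetch_wav(request: urllib.request.Request, out_path: str,
              timeout: float = 30.0) -> int:
    """Send the request, save the returned audio, return the byte count."""
    with urllib.request.urlopen(request, timeout=timeout) as resp:
        audio = resp.read()
    # WAV files start with the ASCII tag "RIFF"; fail early on anything else.
    if audio[:4] != b"RIFF":
        raise ValueError("response does not look like a WAV file")
    with open(out_path, "wb") as fh:
        fh.write(audio)
    return len(audio)
```

Calling fetch_wav(build_request("Your TP3 is healthy.", "F2"), "alert.wav") would mirror the two Tasker steps: the HTTP Request produces the file, and the player then only has to open it.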

Supertonic 3 v1.3.1 · HTTP serve at 192.168.1.185:7788 · survives shell exit via setsid nohup