The Teacher

Kelly.

An AI teacher who delivers a structured five-phase lesson every day, speaks with her own voice, holds real-time conversations, animates with emotional expression, and works offline — from Cloudflare’s edge to satellite broadcast. Built on fully sovereign systems. No third-party dependency in the production pipeline.

Kelly teaching Day 1 — “Hook” phase, Explorer archetype

The Five-Phase Lesson

Every day, Kelly delivers a structured lesson built around a single word. The lesson follows five phases, each designed to engage a different mode of learning. This isn’t a flashcard — it’s a complete learning experience that takes 3-5 minutes.

Phase	Purpose	What Kelly Does
1. Hook	Capture attention	Opens with a surprising fact, question, or scenario that creates curiosity about today’s word
2. Story	Contextual learning	Tells the word’s story — etymology, cultural origins, historical usage, how meaning shifted over time
3. Wonder	Deep exploration	Explores connections — synonyms, antonyms, related concepts, cross-language cognates, knowledge graph links
4. Action	Active practice	Guided practice using the word — fill-in-the-blank, sentence construction, real-world application prompts
5. Wisdom	Retention & reflection	Closing reflection connecting the word to broader themes. Primes next-day recall.

Four Learning Tracks

Learn: Core vocabulary building — the default track for new learners
Grow: Advanced vocabulary with emphasis on etymology and cross-language connections
Teach: Pedagogical track — teaches you how to teach the word to others
Trivia: Fun-focused track with surprising facts, word games, and cultural connections

Content Scale

74,008 base lessons across 19 languages
164,250 archetype variants in English — same word, same day, personalized for 10 different learning styles
24,989 assessment items in 4 formats across 6 languages
365 days of complete content in 8 languages (English, Spanish, French, German, Portuguese, Japanese, Korean, Chinese)

The 10 Teaching Archetypes

Kelly doesn’t teach everyone the same way. She adapts her personality, vocabulary, examples, and tone based on 10 teaching archetypes — each one a different cognitive and emotional approach to learning. Learners select (or are matched to) an archetype, and every lesson is personalized accordingly.

Archetype	Teaching Style	Best For Learners Who…
Explorer	Curiosity-driven discovery. “Let’s find out where this word has been.”	Love novelty, travel metaphors, open-ended questions
Architect	Systematic and structural. “Here’s how this word connects to the system.”	Think in patterns, want frameworks, prefer organized information
Empath	Emotionally attuned. “Feel what this word carries.”	Process through emotion, value stories about people, relate words to relationships
Rebel	Provocative and unconventional. “They don’t want you to know this word.”	Challenge authority, enjoy contrarian angles, want to feel like they’re discovering secrets
Provider	Practical and nurturing. “Here’s how this word helps people.”	Want immediate utility, care about helping others, prefer concrete examples
Diplomat	Balanced and contextual. “Consider all the ways this word is used.”	See multiple perspectives, enjoy nuance, want cultural context
MacGyver	Resourceful and hands-on. “Let’s build something with this word.”	Learn by doing, want word puzzles and construction challenges
Scientist	Evidence-based and analytical. “The data shows this word means…”	Want etymology breakdowns, frequency data, linguistic analysis
Strategist	Power-aware and tactical. “This word gives you an advantage.”	Think about communication as strategy, want words that signal competence
Survivor	Resilient and grounded. “This word will be there when you need it.”	Value reliability, prefer words that work in tough situations, appreciate directness

Three Ages of Kelly

Kelly exists across three life stages, each with a distinct voice, visual appearance, and teaching tone. Learners choose the Kelly that resonates with them — or experience all three through the age-transition feature.

Child Kelly

Voice: Little Annie

Warm, curious, slightly playful. Eyes wide at every new word. Makes learning feel like an adventure.

Adult Kelly

Voice: Kelly

Clear, confident, encouraging. The primary teaching voice. Balances expertise with accessibility.

Elder Kelly

Voice: Grace Elder

Measured, wise, patient. Draws on life experience. Makes every word feel like it carries weight.

Real-Time Conversation

Kelly isn’t just a lesson player — she holds real-time conversations. The kelly-chat Worker orchestrates Claude Haiku responses with text-to-speech and emotional expression inference, all streamed via server-sent events (SSE) for real-time interaction.

How It Works

1. You speak or type: input goes to the kelly-chat Worker
2. Claude Haiku generates a response: streamed token-by-token via SSE
3. Expression inference: each response is classified as curious, excited, thoughtful, empathetic, amused, or engaged
4. Text-to-speech: Kelly speaks her response using her age-appropriate cloned voice
5. Lip sync animation: SadTalker drives facial animation synchronized to the audio
6. Session preserved: conversation context stored in KV with 24-hour TTL for continuity
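
On the client side, a streamed reply of this kind can be consumed with a small SSE parser. The sketch below is a minimal illustration, not Kelly’s actual client: the event names `token` and `expression` and the sample payloads are assumptions, since the wire format is not documented here. It relies only on standard SSE framing (`event:` and `data:` fields, with a blank line ending each event).

```python
from typing import Iterable, Iterator, List, Tuple


def _field_value(line: str, prefix: str) -> str:
    """Return the field value, dropping the single optional space after the colon."""
    value = line[len(prefix):]
    return value[1:] if value.startswith(" ") else value


def parse_sse(lines: Iterable[str]) -> Iterator[Tuple[str, str]]:
    """Yield (event, data) pairs from an iterable of SSE text lines."""
    event, data = "message", []  # "message" is the default SSE event type
    for raw in lines:
        line = raw.rstrip("\r\n")
        if line == "":  # a blank line terminates one event
            if data:
                yield event, "\n".join(data)
            event, data = "message", []
        elif line.startswith(":"):  # comment or keep-alive line
            continue
        elif line.startswith("event:"):
            event = _field_value(line, "event:")
        elif line.startswith("data:"):
            data.append(_field_value(line, "data:"))
    # An unterminated event at end of stream is discarded.


# Hypothetical stream: two text tokens followed by one expression label.
sample: List[str] = [
    "event: token", "data: Hello", "",
    "event: token", "data:  there", "",
    "event: expression", "data: curious", "",
]
events = list(parse_sse(sample))
reply = "".join(data for event, data in events if event == "token")
expression = next(data for event, data in events if event == "expression")
```

Keeping tokens and the expression label as separate event types would let a client start rendering text before the expression arrives; whether the real Worker separates them this way is not stated here.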

Conversation API

POST /v1/chat — SSE streaming conversation with token-by-token response and expression inference
POST /v1/chat/complete — Non-streaming completion for batch and programmatic use
GET /v1/speak — Real-time TTS for any text input
GET /v1/context — Current lesson context and state for a session
GET /v1/session/:id — Session retrieval (24h TTL)
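
A request to the streaming endpoint might be assembled as follows. Only the method and the path `/v1/chat` come from the list above; the base URL, the JSON field names (`message`, `session_id`), and the helper name are hypothetical placeholders chosen for illustration.

```python
import json
import urllib.request

# Placeholder host: the real Worker URL is not given in this document.
BASE_URL = "https://kelly-chat.example.invalid"


def build_chat_request(message: str, session_id: str) -> urllib.request.Request:
    """Build (but do not send) a POST request for the streaming chat endpoint."""
    # Field names below are assumptions, not a documented schema.
    body = json.dumps({"message": message, "session_id": session_id}).encode("utf-8")
    return urllib.request.Request(
        url=f"{BASE_URL}/v1/chat",
        data=body,
        method="POST",
        headers={
            "Content-Type": "application/json",
            "Accept": "text/event-stream",  # ask for an SSE response
        },
    )


req = build_chat_request("What does 'serendipity' mean?", "demo-session")
```

Passing `req` to `urllib.request.urlopen` would send it; the response body could then be read line by line as an SSE stream.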

Video Production: 2,300+ Videos Across 4 Engines

Kelly’s video lessons are produced across four video generation engines, each with different quality, cost, and identity characteristics. The kelly-video-factory Worker orchestrates all four with a unified production queue.

Engine	Quality	Cost	Output	Videos
Sovereign Pipeline (Chatterbox + SadTalker + ffmpeg)	Talking head	$0 (local RTX 5090)	~4 min render per 1 min video	187 (37 complete days)
Kling 3.0 Pro (via fal.ai)	Cinematic	$0.112/sec	1920x1080 H264, ~143s per 5s clip	2 (tested, proven)
HeyGen v2 (Genny batch agent)	Talking head (standard)	~1 credit/min	Photo avatar, batch pipeline	1,080 completed, 334 queued
LTX-2.3 + ID-LoRA (local RTX 5090)	Near-cinematic	$0 (local)	FP8, zero-shot from canonical still	Installing (85GB models)
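
As a worked example of the per-second pricing in the table, the sketch below computes the cost of one Kling clip and how much slower than real time it renders. The rate and timings come from the table; the function names are illustrative, and billing details such as rounding or minimum charges are not covered.

```python
KLING_USD_PER_SECOND = 0.112   # rate from the table
KLING_RENDER_SECONDS = 143     # approximate render time for a 5-second clip
KLING_CLIP_SECONDS = 5


def kling_cost_usd(clip_seconds: float) -> float:
    """Cost of a clip at the flat per-second rate."""
    return round(KLING_USD_PER_SECOND * clip_seconds, 3)


def render_slowdown(render_seconds: float, clip_seconds: float) -> float:
    """How many seconds of rendering each second of output takes."""
    return round(render_seconds / clip_seconds, 1)


clip_cost = kling_cost_usd(KLING_CLIP_SECONDS)                        # 0.56 USD
slowdown = render_slowdown(KLING_RENDER_SECONDS, KLING_CLIP_SECONDS)  # 28.6
```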

Media Library

15,542 total media files in R2 storage at kelly-assets.nicoletterankin.workers.dev
2,300+ lesson videos — HeyGen batch + sovereign pipeline + cinematic tests
850 legacy video clips — earlier production experiments
36 visual looks across 10 archetypes and 3 age groups
5 email avatar states × 3 sizes for email template personalization
7 source videos (62MB) providing Kelly’s canonical appearance and voice reference

R2 Video Path Convention

kelly/video/{lang}/{age}-{archetype}/day-{NNN}/{phase}.mp4

Example: kelly/video/en/adult-explorer/day-001/hook.mp4
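
The convention can be captured in a small helper. This is a sketch: the three-digit zero padding and the lowercase slugs are inferred from the single example above, and the function name and validation rules are assumptions.

```python
import re

_SLUG = re.compile(r"^[a-z0-9]+$")  # assumed: lowercase alphanumeric path segments


def video_key(lang: str, age: str, archetype: str, day: int, phase: str) -> str:
    """Build an R2 object key following kelly/video/{lang}/{age}-{archetype}/day-{NNN}/{phase}.mp4."""
    for name, value in (("lang", lang), ("age", age),
                        ("archetype", archetype), ("phase", phase)):
        if not _SLUG.match(value):
            raise ValueError(f"{name} must be a lowercase slug, got {value!r}")
    if not 1 <= day <= 999:
        raise ValueError(f"day must fit in three digits, got {day}")
    return f"kelly/video/{lang}/{age}-{archetype}/day-{day:03d}/{phase}.mp4"


key = video_key("en", "adult", "explorer", 1, "hook")
```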

Sovereign Voice Pipeline

Kelly’s voice is generated on owned hardware — an NVIDIA RTX 5090 (32GB VRAM, Blackwell architecture). No third-party TTS service. No API calls. No per-character costs. The voice references were created from Kelly’s canonical source videos and will never expire, get rate-limited, or be discontinued.

Voice Generation

Model: Chatterbox v0.1.7 (zero-shot voice cloning)
Speed: 2.3x realtime on RTX 5090 (generates 2.3 seconds of audio per second of compute)
Output per lesson: 25 WAV files (5 tiers × 5 phases), 5.2 minutes of audio total
Batch capacity: 365-day lesson set in ~10 hours of voice generation
Conda environment: chatterbox
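
To make the realtime factor concrete, the sketch below converts an audio duration into compute time using the figures above. It is a back-of-envelope estimate only: it ignores model loading, file I/O, and any variation in speed between clips.

```python
REALTIME_FACTOR = 2.3           # seconds of audio generated per second of compute
AUDIO_MINUTES_PER_LESSON = 5.2  # total across the 25 WAV files of one lesson


def compute_seconds(audio_seconds: float, realtime_factor: float = REALTIME_FACTOR) -> float:
    """Estimated compute time needed to synthesize the given amount of audio."""
    return audio_seconds / realtime_factor


per_lesson_seconds = compute_seconds(AUDIO_MINUTES_PER_LESSON * 60)
```

Under these assumptions, one lesson’s audio takes a little over two minutes of GPU time.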

Lip Sync Animation

Model: SadTalker (crop-512 optimization)
Speed: ~45 seconds per clip (1.7x faster than baseline by eliminating Poisson blending)
Compositing: ffmpeg 3-layer assembly (background + static body + animated face) with text overlays added on top, in <1 second per clip
Batch capacity: 365-day lesson set in ~21 hours (a single unattended run)
Conda environment: sadtalker

Audio Pre-Generation

Nightly cron jobs (02:00 and 03:00 UTC) pre-generate and verify 150 audio files per day across 10 priority languages and 3 age groups. This means Kelly’s voice is ready and waiting before the learner wakes up — no generation latency at lesson time.
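
The 150-file figure is consistent with one file per language, age group, and lesson phase (10 × 3 × 5). The sketch below enumerates such a job matrix. The split into five phases per language and age pair is an assumption, and the language codes are placeholders because the 10 priority languages are not listed here.

```python
from itertools import product

# Placeholder codes: the actual priority languages are not named in this document.
PRIORITY_LANGS = [f"lang{i:02d}" for i in range(1, 11)]
AGES = ["child", "adult", "elder"]
PHASES = ["hook", "story", "wonder", "action", "wisdom"]  # assumed unit of one audio file

# One job per (language, age, phase) combination.
jobs = list(product(PRIORITY_LANGS, AGES, PHASES))
job_count = len(jobs)
```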

The Player

Kelly’s video player is built into the Daily Lesson consumer site. It auto-plays the video for the current lesson phase, falls back gracefully when video isn’t available, and provides enrichment features during playback.

Auto-play: Video starts immediately for the current phase (muted to satisfy browser autoplay policies, with an unmute button)
Fallback: When no video is available, shows Kelly’s canonical still image with text content
“Watch Kelly” toggle: When video exists but isn’t auto-playing, a toggle button lets the learner switch to the video
Visual aids: Word enrichment data displayed alongside the video (translations, knowledge graph, pronunciation)
Phase navigation: Automatic advance between the 5 lesson phases with video for each
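
The display and navigation rules above amount to a small decision function. The sketch below models them in Python for illustration only; the real player runs in the browser, and the names `Display`, `choose_display`, and `next_phase` are hypothetical.

```python
from enum import Enum
from typing import Optional

PHASES = ["hook", "story", "wonder", "action", "wisdom"]


class Display(Enum):
    AUTOPLAY_MUTED = "autoplay_muted"    # video plays muted, unmute button shown
    WATCH_TOGGLE = "watch_toggle"        # video exists, learner opts in
    STILL_WITH_TEXT = "still_with_text"  # no video: canonical still plus text


def choose_display(video_available: bool, autoplay_allowed: bool) -> Display:
    """Pick what the player shows for the current phase."""
    if not video_available:
        return Display.STILL_WITH_TEXT
    return Display.AUTOPLAY_MUTED if autoplay_allowed else Display.WATCH_TOGGLE


def next_phase(current: str) -> Optional[str]:
    """Return the phase that follows `current`, or None after the last phase."""
    index = PHASES.index(current)
    return PHASES[index + 1] if index + 1 < len(PHASES) else None
```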

Kelly’s 8-Layer Resilience System

Education infrastructure that requires always-on connectivity excludes the people who need it most. Kelly’s shield system (kelly-shield Worker at shield.kellyai.com) provides 8 layers of progressively degraded but functional lesson delivery — from full-featured edge delivery down to physical books.

Layer 1 — Cloudflare Edge: Full lessons with video, audio, chat, and assessments. 330+ locations, sub-50ms latency.
Layer 2 — R2 Object Cache: Pre-rendered lesson bundles. No database queries needed. Survives D1 outages.
Layer 3 — KV Bundle: Compressed lesson packages in globally-replicated KV store. Fastest reads, smallest payloads.
Layer 4 — Service Worker PWA: Full offline capability. Lessons cached on device after first load. Works with zero connectivity.
Layer 5 — QR Print: Scannable QR codes on printed materials link to cached lessons. Bridges physical and digital.
Layer 6 — SMS Gateway: Text-based lessons for feature phones on 2G networks. Reaches the bottom billion.
Layer 7 — Satellite Filecasting: Broadcast delivery via Knapsack for Hope (Galaxy 19 → Americas) and Toosheh (YahSat → Iran/Middle East). No internet required at all — content received via satellite dish.
Layer 8 — Physical Book: Print-ready lesson compilations generated from the content engine. Works during total infrastructure collapse.
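
For the digital layers, the degradation pattern can be pictured as an ordered fallback chain: try the richest source first and move down when one fails. The sketch below is a generic illustration of that pattern, not the kelly-shield implementation; the layer names, fetcher signatures, and simulated failures are all assumptions.

```python
from typing import Callable, List, Optional, Tuple

Fetcher = Callable[[str], Optional[bytes]]


def fetch_with_fallback(lesson_id: str, layers: List[Tuple[str, Fetcher]]) -> Tuple[str, bytes]:
    """Return (layer_name, payload) from the first layer that can serve the lesson."""
    for name, fetch in layers:
        try:
            payload = fetch(lesson_id)
        except Exception:
            continue  # treat any failure in this layer as "unavailable"
        if payload is not None:
            return name, payload
    raise LookupError(f"no layer could serve lesson {lesson_id!r}")


def edge_down(lesson_id: str) -> Optional[bytes]:
    raise ConnectionError("edge unreachable")  # simulated outage


def r2_miss(lesson_id: str) -> Optional[bytes]:
    return None  # simulated cache miss


def kv_hit(lesson_id: str) -> Optional[bytes]:
    return b"lesson bundle"  # simulated hit


served_by, payload = fetch_with_fallback(
    "day-001", [("edge", edge_down), ("r2-cache", r2_miss), ("kv-bundle", kv_hit)]
)
```

In this simulated run the first two layers fail and the third serves the lesson. The non-digital layers (print, SMS, satellite, books) fall outside a chain like this and are delivered through their own channels.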

Where to Find Kelly

Live Chat kellyai.com/chat

Real-time conversation with Claude Haiku, voice, and expression

Daily Lessons thedailylesson.com

Structured 5-phase lessons with video, audio, and assessments

Brand & Subscriptions curiouskelly.com

Kelly’s marketing site with Stripe-powered subscription plans

Kelly’s Brand Site kellyai.com

Kelly’s home on the web, with the /chat interface

Slack

GoGood Slack App (ID: A0AK1EQH0P7)

Daily word-of-the-day delivered to Slack channels

MCP Tools mcp.thedailylesson.com

12 tools for AI agents via the Model Context Protocol
