Mandarin progress update: I am 14% to B1 (conversational) - about 1 year away at the current pace.
- I advance around 1.5% every week.
- learned 97 core words last month
- ~200 phrases/sentences learned last month
2x tutor sessions per week and daily flashcards + AI sentence translation exercises (roughly 100-150 per day)
35
47
373
my new recipe for learning and satisfying curiosities:
if I see a really cool piece of tech, I'll try to build my own version (without looking at the source code). even for complicated stuff, it's now possible to figure things out with one weekend and a bunch of LLM credits.
that's how you get deep down the stack and...
20
45
345
the trick to making voice agents fast is pipelining everything:
first pipeline: audio packets -> speech to text -> turn-taking model
second: LLM -> text to speech -> encoding -> output
here's a render of my current latency. bear in mind, I'm running this locally from a wooden hut in the mountains in Turkey - it shoul...
62
27
497
okay a few hours later and I have performance comparable to that of Vapi/Elevenlabs agent SDK, albeit along the green path - I'm sure there are hundreds of edge cases that these companies spend much time figuring out, not to mention making their offerings flexible/observable etc
but it's crazy how quickly you can get ...
day 1 learning to build voice agent infra from scratch:
put together a VAD, twilio, deepgram (SST), o4-min, elevenlabs (TTS) into an event-based loop that coordinates listening and speaking
main issue right now is latency and quality of turn-taking/interruptions. not quite as good as Vapi/Elevenlabs off-the-shelf
ne
17
17
171
day 1 learning to build voice agent infra from scratch:
put together a VAD, twilio, deepgram (SST), o4-min, elevenlabs (TTS) into an event-based loop that coordinates listening and speaking
main issue right now is latency and quality of turn-taking/interruptions. not quite as good as Vapi/Elevenlabs off-the-shelf
ne...
23
35
257
always keep coming back to the analogy of product building being so much like sculpting or painting.
the first strokes are broad and confident. a lot of material gets applied and moved around very quickly. you might slap together 5-6 features in just hours and build this really huge thing out of nothing.
but then th...
57
18
582
my Mandarin tutor is awesome. every session, he introduces ~25-35 new words and phrases, which builds my active vocabulary by ~20 words. I add them to my learning system, entering them into practice and consolidation - all managed by an algorithm that models my memory and manages my practice sessions.
within my syste...
40
20
326
