Mira: Real-time AI application on embedded Linux
A consumer hardware product with a fully local voice assistant pipeline. Collaboration with a senior electrical engineer.
- Designed and implemented a multilingual real-time voice-pipeline on embedded Linux, integrating voice activity detection, speech-to-text, large-language, and text-to-speech models with both fully local and cloud-backed variants.
- Achieved ~3-second end-to-end latency for the local pipeline on Raspberry Pi CM5 by selecting and configuring lightweight C/C++ inference runtimes under tight memory and compute constraints.
- Profiled and optimized pipeline performance through iterative benchmarking, managing concurrency and audio data flow across multiple ML model stages.