2025-10-24 13:36:46 +00:00
# Little Sophia's Brain
A modular Python script designed to run on Little Sophia's Radxa ZERO 3W. It handles higher-level input processing that is beyond the abilities of the ESP32 central controller:
- Face Detection
- Voice-to-Text (V2T)
---
## Installation
1. Clone this repository to your Radxa ZERO 3W.
2. Download and place the Vosk model in the install directory:
   - [vosk-model-small-en-us-0.15](https://alphacephei.com/vosk/models)
3. (Jake needs to add the rest of the prerequisites here, e.g., Python packages and environment setup.)
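
To confirm that step 2 worked, a small check like the one below can help. It is only a sketch: it assumes the downloaded archive was unpacked into a folder named `vosk-model-small-en-us-0.15` directly inside the install directory, and `find_vosk_model` is a hypothetical helper, not part of this repository.

```python
from pathlib import Path

# Folder name taken from the model listed in step 2 (assumed to be unpacked as-is).
MODEL_DIR_NAME = "vosk-model-small-en-us-0.15"


def find_vosk_model(install_dir="."):
    """Return the path to the unpacked Vosk model folder, or None if it is missing."""
    candidate = Path(install_dir) / MODEL_DIR_NAME
    return candidate if candidate.is_dir() else None


# Check the current directory and report what was found.
model_path = find_vosk_model()
if model_path is None:
    print(f"Vosk model not found; expected a folder named {MODEL_DIR_NAME}")
else:
    print(f"Found Vosk model at {model_path}")
```

Run it from the install directory; if the model lives elsewhere, pass that directory to `find_vosk_model`.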
## TODO
Tested with a USB mic and camera only (Logitech C920).
## File Overview
| File              | Description                                                                  |
|-------------------|------------------------------------------------------------------------------|
| `RetinaFace.rknn` | RKNN model for face detection and video streaming.                           |
| `main.py`         | Core script. Integrates face detection and voice-to-text into a Flask stream. |
| `realtime.py`     | Voice-to-text module (Vosk). Can be run separately for testing.              |
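
For readers unfamiliar with Vosk, the sketch below shows roughly what a standalone voice-to-text test can look like. It is not the contents of `realtime.py`: it reads a 16-bit mono PCM WAV file instead of a live microphone, the function names `extract_text` and `transcribe_wav` are hypothetical, and the model path assumes the folder from the installation step.

```python
import json
import wave


def extract_text(result_json):
    """Pull the recognised text out of a Vosk JSON result string."""
    return json.loads(result_json).get("text", "")


def transcribe_wav(wav_path, model_path="vosk-model-small-en-us-0.15"):
    """Transcribe a 16-bit mono PCM WAV file and return the recognised text."""
    # Imported lazily so extract_text can be used without Vosk installed.
    from vosk import Model, KaldiRecognizer

    model = Model(model_path)
    pieces = []
    with wave.open(wav_path, "rb") as wav:
        recognizer = KaldiRecognizer(model, wav.getframerate())
        while True:
            data = wav.readframes(4000)
            if not data:
                break
            # AcceptWaveform returns True once a full utterance has been decoded.
            if recognizer.AcceptWaveform(data):
                pieces.append(extract_text(recognizer.Result()))
        # Flush whatever audio is still buffered in the recognizer.
        pieces.append(extract_text(recognizer.FinalResult()))
    return " ".join(p for p in pieces if p)
```

Calling `transcribe_wav("sample.wav")` with a short recording (the file name is a placeholder) is a quick way to check the model before wiring in a microphone.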