Initial commit

2026-05-05 09:40:28 +10:00
commit b865575511
116 changed files with 7290 additions and 0 deletions
@@ -0,0 +1,18 @@
# Project: Local Text-to-Speech (TTS)
## Status: Environment Configured, Model Downloaded and Verified
- **Directory:** `/home/openclaw/.openclaw/workspace/projects/local-tts`
- **Environment:** Python venv
- **Packages Installed:** `piper-tts`, `beautifulsoup4`, `requests` (CPU-only optimized on 2026-03-27)
- **Model:** `en_US-libritts_r-medium.onnx`
- **Status:** Inference pipeline verified and operational. Disk footprint reduced from 6.1 GB to 294 MB.
## Recent Work
- Generated WAV output successfully (files of roughly 400 MB).
- Debugged `piper` and `wave` interaction to resolve 0-byte file issues.
- Established `tts_script.py` as a stable wrapper with dynamic speed adjustment (via `--speed` parameter).
- Optimized environment by removing unnecessary CUDA/PyTorch/NVIDIA dependencies.
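
The 0-byte file issue noted above is the kind of problem that usually comes down to how the `wave` writer is configured and closed. The following is a minimal sketch of a defensive write step, not the actual contents of `tts_script.py`. The helper name `write_wav`, the 22050 Hz mono 16-bit defaults, and the size check are all assumptions made for illustration, and the PCM bytes stand in for whatever the synthesizer returns.

```python
import os
import wave

WAV_HEADER_BYTES = 44  # size of the PCM WAV header written by the wave module


def write_wav(path, pcm_bytes, sample_rate=22050, channels=1, sample_width=2):
    """Write raw PCM audio to `path` and return the resulting file size.

    Hypothetical helper: the defaults (22050 Hz, mono, 16-bit) are assumed,
    not taken from the project's model config.
    """
    # The context manager guarantees close(), which is when the wave module
    # finalizes the header and flushes the data to disk.
    with wave.open(path, "wb") as wav_file:
        wav_file.setnchannels(channels)
        wav_file.setsampwidth(sample_width)
        wav_file.setframerate(sample_rate)
        wav_file.writeframes(pcm_bytes)

    size = os.path.getsize(path)
    # A file no larger than the header contains no audio frames.
    if size <= WAV_HEADER_BYTES:
        raise RuntimeError(f"{path} contains no audio data ({size} bytes)")
    return size


if __name__ == "__main__":
    # One second of 16-bit mono silence as placeholder audio.
    silence = b"\x00\x00" * 22050
    print(write_wav("silence_check.wav", silence))
```

Failing loudly on an empty file means a broken synthesis step surfaces at write time rather than at playback.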
## Next Steps
1. User provides a target URL.
2. Execute `tts_script.py` (e.g., `python3 tts_script.py <url> <model> <config> <output> --speed 0.9`).
3. Retrieve or play back the generated audio from `workspace/projects/local-tts/`.
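
For reference, the command line in step 2 could be parsed along the following lines. This is a sketch under stated assumptions, since the real `tts_script.py` is not shown here: the argument names mirror the placeholders in the example command, and `build_parser` and `speed_to_length_scale` are hypothetical helpers. The conversion assumes that speed is applied through Piper's length scale setting, where values above 1.0 slow speech down, so a speed factor is inverted.

```python
import argparse


def build_parser():
    """Build a parser matching: <url> <model> <config> <output> --speed N."""
    parser = argparse.ArgumentParser(prog="tts_script.py")
    parser.add_argument("url", help="page to read aloud")
    parser.add_argument("model", help="path to the .onnx voice model")
    parser.add_argument("config", help="path to the model's config file")
    parser.add_argument("output", help="path of the WAV file to write")
    parser.add_argument("--speed", type=float, default=1.0,
                        help="speech rate multiplier (1.0 = normal)")
    return parser


def speed_to_length_scale(speed):
    """Convert a speed multiplier into a length scale (assumed mapping).

    A speed of 0.9 yields a length scale of about 1.11, i.e. slightly
    slower speech.
    """
    if speed <= 0:
        raise ValueError("speed must be positive")
    return 1.0 / speed


if __name__ == "__main__":
    args = build_parser().parse_args()
    print(args.url, args.output, speed_to_length_scale(args.speed))
```

Rejecting non-positive speeds up front avoids a division by zero and gives a clearer error than a failure deep inside synthesis.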