Initial commit

2026-05-05 09:40:28 +10:00
commit b865575511
116 changed files with 7290 additions and 0 deletions
@@ -0,0 +1,18 @@
+# Project: Local Text-to-Speech (TTS)
+## Status: Environment Configured & Model Downloaded & Verified
+- **Directory:** `/home/openclaw/.openclaw/workspace/projects/local-tts`
+- **Environment:** Python venv
+- **Packages Installed:** `piper-tts`, `beautifulsoup4`, `requests` (CPU-only optimized on 2026-03-27)
+- **Model:** `en_US-libritts_r-medium.onnx`
+- **Status:** Inference pipeline verified and operational. Disk footprint reduced from 6.1GB to 294MB.
+
+## Recent Work
+- Successfully generated ~400MB WAV files.
+- Debugged `piper` and `wave` interaction to resolve 0-byte file issues.
+- Established `tts_script.py` as a stable wrapper with dynamic speed adjustment (via `--speed` parameter).
+- Optimized environment by removing unnecessary CUDA/PyTorch/NVIDIA dependencies.
+
+## Next Steps
+1. User provides a target URL.
+2. Execute `tts_script.py` (e.g., `python3 tts_script.py <url> <model> <config> <output> --speed 0.9`).
+3. Retrieve/playback audio from `workspace/projects/local-tts/`.