Knowledge/projects/local-tts/context-meta.md
2026-05-05 09:40:28 +10:00

Project: Local Text-to-Speech (TTS)

Status: Environment configured; model downloaded and verified

  • Directory: /home/openclaw/.openclaw/workspace/projects/local-tts
  • Environment: Python venv
  • Packages Installed: piper-tts, beautifulsoup4, requests (environment optimized for CPU-only use on 2026-03-27)
  • Model: en_US-libritts_r-medium.onnx
  • Status: Inference pipeline verified and operational. Disk footprint reduced from 6.1 GB to 294 MB.

Recent Work

  • Successfully generated WAV files of roughly 400 MB.
  • Debugged the interaction between piper and Python's wave module to resolve 0-byte output files.
  • Established tts_script.py as a stable wrapper with dynamic speed adjustment (via the --speed parameter).
  • Optimized environment by removing unnecessary CUDA/PyTorch/NVIDIA dependencies.
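
The note on 0-byte files does not record the root cause. As a rough sketch only, a guard like the one below writes PCM data through Python's wave module and then confirms the result actually contains audio frames. The helper names (write_wav, check_wav) are hypothetical, and the defaults of 22050 Hz, mono, 16-bit are assumptions that should be checked against the model's config file rather than taken from here.

```python
import os
import wave


def write_wav(path, pcm_bytes, sample_rate=22050, channels=1, sample_width=2):
    """Write raw PCM bytes to a WAV file and return the number of frames.

    Hypothetical helper: the defaults (22050 Hz, mono, 16-bit) are
    assumptions, not values taken from the project.
    """
    # The context manager closes the writer, which finalizes the WAV
    # header; a writer that is never closed can leave a truncated file.
    with wave.open(path, "wb") as wav_file:
        wav_file.setnchannels(channels)
        wav_file.setsampwidth(sample_width)
        wav_file.setframerate(sample_rate)
        wav_file.writeframes(pcm_bytes)
    return check_wav(path)


def check_wav(path):
    """Raise ValueError if the file is empty or holds no audio frames."""
    if os.path.getsize(path) == 0:
        raise ValueError(f"{path} is a 0-byte file")
    with wave.open(path, "rb") as wav_file:
        frames = wav_file.getnframes()
    if frames == 0:
        raise ValueError(f"{path} has a header but no audio frames")
    return frames
```

Calling check_wav on the output right after synthesis would surface an empty result immediately instead of at playback time.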

Next Steps

  1. User provides a target URL.
  2. Execute tts_script.py (e.g., python3 tts_script.py <url> <model> <config> <output> --speed 0.9).
  3. Retrieve or play back the audio from workspace/projects/local-tts/.
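
The internals of tts_script.py are not shown in this note, so the following is only a guess at how its command line from step 2 could be parsed. The positional argument names mirror the example invocation. The speed_to_length_scale helper is hypothetical: it assumes --speed is a multiplier (0.9 means slightly slower) that gets inverted into a Piper-style length scale, where larger values lengthen the speech. The real script may handle speed differently.

```python
import argparse


def build_parser():
    """Parser mirroring: tts_script.py <url> <model> <config> <output> --speed N."""
    parser = argparse.ArgumentParser(
        description="Fetch a web page and read it aloud (illustrative sketch)."
    )
    parser.add_argument("url", help="page to convert to speech")
    parser.add_argument("model", help="path to the .onnx voice model")
    parser.add_argument("config", help="path to the model's config file")
    parser.add_argument("output", help="path of the WAV file to write")
    parser.add_argument(
        "--speed",
        type=float,
        default=1.0,
        help="speech rate multiplier; below 1.0 is slower (assumed meaning)",
    )
    return parser


def speed_to_length_scale(speed):
    """Convert a rate multiplier to a length scale (assumed mapping).

    A length scale above 1.0 stretches the audio, so the two values
    are treated as reciprocals here.
    """
    if speed <= 0:
        raise ValueError("speed must be positive")
    return 1.0 / speed


if __name__ == "__main__":
    args = build_parser().parse_args()
    print(f"length scale for --speed {args.speed}: "
          f"{speed_to_length_scale(args.speed):.3f}")
```

Rejecting non-positive speeds up front avoids a division by zero and keeps a bad value from reaching the synthesis step.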