25 lines
1.5 KiB
Markdown
25 lines
1.5 KiB
Markdown
# Conversation Checkpoint: Email-to-Audio Conversion Workflow
|
|
|
|
## Summary
|
|
Michael requested a workflow to convert a specific email from Peter H. Diamandis ("Robotaxies & Flying Cars Will Reinvent Real Estate") into an audio format using a specific voice model ("norman") and adjusted speeds.
|
|
|
|
## Workflow Details
|
|
1. **Source:** Email ID 10824 from Peter H. Diamandis via Himalaya.
|
|
2. **Text Extraction:** Content was saved to `/home/openclaw/.openclaw/workspace/temp_stuff/diamandis_text.txt`.
|
|
3. **TTS Conversion:**
|
|
- Script created: `tts_flexible.py` in `/home/openclaw/.openclaw/workspace/projects/local-tts/`.
|
|
- Tool used: `piper-tts`.
|
|
- Voice Model: `norman` (onnx).
|
|
- Speed Adjustment: Implemented via `--speed` argument (length_scale manipulation).
|
|
4. **Audio Processing:** FFmpeg used to convert `.wav` output to `.mp3` (44100Hz, stereo, 192k bitrate).
|
|
5. **Upload:** Uploaded to Nextcloud via WebDAV (curl) to the `/Bernard/` directory.
|
|
|
|
## Current Files
|
|
- Text: `/home/openclaw/.openclaw/workspace/temp_stuff/diamandis_text.txt`
|
|
- Audio (0.8 speed): `diamandis_audio.mp3` (Nextcloud: /Bernard/diamandis_audio.mp3)
|
|
- Audio (0.9 speed): `diamandis_audio_0_9.mp3` (Nextcloud: /Bernard/diamandis_audio_0_9.mp3)
|
|
- Tooling: `/home/openclaw/.openclaw/workspace/projects/local-tts/tts_flexible.py`
|
|
|
|
## Future Re-use
|
|
This workflow can be repeated for any email text. Simply extract the text, run `tts_flexible.py` with desired parameters, encode with FFmpeg, and upload using the verified WebDAV method.
|