Forem

# tts

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
ESP32 Into a Speech-to-Text Device
Cover image for ESP32 Into a Speech-to-Text Device

ESP32 Into a Speech-to-Text Device

Comments
2 min read
Mumbli – my personal Wispr Flow
Cover image for Mumbli – my personal Wispr Flow

Mumbli – my personal Wispr Flow

Comments
3 min read
Importing an EPUB into an AI voice pipeline: what the chapter list looks like before audio runs

Importing an EPUB into an AI voice pipeline: what the chapter list looks like before audio runs

Comments
6 min read
Running a non-English audiobook through an AI voice pipeline: what's involved

Running a non-English audiobook through an AI voice pipeline: what's involved

Comments
5 min read
Auto-Assign Sounds: how AudioProducer.ai turns chapter text into music beds, ambience, and SFX

Auto-Assign Sounds: how AudioProducer.ai turns chapter text into music beds, ambience, and SFX

Comments
8 min read
The Voice Assistant Revolution: Architecture, Accuracy, and the Race for Real-Time Intelligence

The Voice Assistant Revolution: Architecture, Accuracy, and the Race for Real-Time Intelligence

Comments
6 min read
One Hour for the Demo, Three for the Production Line

One Hour for the Demo, Three for the Production Line

1
Comments
7 min read
Voice cloning inside the audiobook pipeline: integration notes and trade-offs

Voice cloning inside the audiobook pipeline: integration notes and trade-offs

Comments
6 min read
Voice AI Outside the US: Double the Price, Worse Experience (And How We're Trying to Fix It)
Cover image for Voice AI Outside the US: Double the Price, Worse Experience (And How We're Trying to Fix It)

Voice AI Outside the US: Double the Price, Worse Experience (And How We're Trying to Fix It)

Comments
6 min read
Running modern Python TTS toolchains on non-AVX2 CPUs

Running modern Python TTS toolchains on non-AVX2 CPUs

Comments
5 min read
How human feedback actually steers TTS fine-tuning

How human feedback actually steers TTS fine-tuning

Comments
6 min read
TTS Models for Indian Languages: The Tech Giving Bharat a Voice
Cover image for TTS Models for Indian Languages: The Tech Giving Bharat a Voice

TTS Models for Indian Languages: The Tech Giving Bharat a Voice

Comments
4 min read
How We Built Voice Messages for AI Companions: Real Voice Audio, ElevenLabs, and Beyond

How We Built Voice Messages for AI Companions: Real Voice Audio, ElevenLabs, and Beyond

Comments
4 min read
ESP32-C3 Text-to-Speech Using AI
Cover image for ESP32-C3 Text-to-Speech Using AI

ESP32-C3 Text-to-Speech Using AI

Comments
2 min read
One Open Source Project a Day (No.51): VibeVoice - Microsoft's Speech AI That Processes 90 Minutes of Audio in a Single Pass
Cover image for One Open Source Project a Day (No.51): VibeVoice - Microsoft's Speech AI That Processes 90 Minutes of Audio in a Single Pass

One Open Source Project a Day (No.51): VibeVoice - Microsoft's Speech AI That Processes 90 Minutes of Audio in a Single Pass

1
Comments
9 min read
👋 Sign in for the ability to sort posts by relevant, latest, or top.