Forem

Whisper

Whisper is a versatile speech recognition model that can transcribe, identify, and translate multiple languages.

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Whisper Menu Bar
Cover image for Whisper Menu Bar

Whisper Menu Bar

Comments
1 min read
Whisper + Gradio on Colab: Speech-to-Text in Minutes

Whisper + Gradio on Colab: Speech-to-Text in Minutes

Comments
3 min read
Why Language Tech Matters: Developing AI Tools for Small Languages

Why Language Tech Matters: Developing AI Tools for Small Languages

Comments
4 min read
Building a YouTube Video Search App with Flask, Whisper, and RAG
Cover image for Building a YouTube Video Search App with Flask, Whisper, and RAG

Building a YouTube Video Search App with Flask, Whisper, and RAG

1
Comments
5 min read
WhatsApp + MCP: automatic audio transcription
Cover image for WhatsApp + MCP: automatic audio transcription

WhatsApp + MCP: automatic audio transcription

1
Comments
4 min read
🧠 Real-Time Smart Speech Assistant with Python, Whisper & LLMs
Cover image for 🧠 Real-Time Smart Speech Assistant with Python, Whisper & LLMs

🧠 Real-Time Smart Speech Assistant with Python, Whisper & LLMs

Comments
4 min read
Running Multi-Agent AI Workflows on Edge Hardware: A Technical Deep Dive
Cover image for Running Multi-Agent AI Workflows on Edge Hardware: A Technical Deep Dive

Running Multi-Agent AI Workflows on Edge Hardware: A Technical Deep Dive

5
Comments 4
6 min read
Building an AI Conversation Practice App: Part 2 - Backend Speech-to-Text Processing with OpenAI Whisper
Cover image for Building an AI Conversation Practice App: Part 2 - Backend Speech-to-Text Processing with OpenAI Whisper

Building an AI Conversation Practice App: Part 2 - Backend Speech-to-Text Processing with OpenAI Whisper

Comments
5 min read
Building a Live Transcript

Building a Live Transcript

Comments
1 min read
Why I built Typist - lightning-fast AI audio transcription app
Cover image for Why I built Typist - lightning-fast AI audio transcription app

Why I built Typist - lightning-fast AI audio transcription app

Comments
5 min read
OZI - Subtitles Generator with AI

OZI - Subtitles Generator with AI

Comments
1 min read
OpenAI.fm! OpenAI's Newest Text-To-Speech Model - Proje Defteri
Cover image for OpenAI.fm! OpenAI's Newest Text-To-Speech Model - Proje Defteri

OpenAI.fm! OpenAI's Newest Text-To-Speech Model - Proje Defteri

Comments
8 min read
Building a Production-Ready Speech-to-Text System with Fine-Tuned Whisper Model
Cover image for Building a Production-Ready Speech-to-Text System with Fine-Tuned Whisper Model

Building a Production-Ready Speech-to-Text System with Fine-Tuned Whisper Model

Comments
5 min read
These 5 "Best Practices" Are Stopping You From Getting Hired
Cover image for These 5 "Best Practices" Are Stopping You From Getting Hired

These 5 "Best Practices" Are Stopping You From Getting Hired

17
Comments
4 min read
New Tutorial: Build a Voice-to-Text Transcription App with Whisper and React Native

New Tutorial: Build a Voice-to-Text Transcription App with Whisper and React Native

1
Comments
1 min read
Transform Your Speech into Text with the Power of OpenAI and useWhisper
Cover image for Transform Your Speech into Text with the Power of OpenAI and useWhisper

Transform Your Speech into Text with the Power of OpenAI and useWhisper

3
Comments 1
2 min read
Whisper Speech Recognition on Mac M4: Performance Analysis and Benchmarks

Whisper Speech Recognition on Mac M4: Performance Analysis and Benchmarks

1
Comments
2 min read
iPhone 的語音辨識功能:語音備忘錄,自動標點分段
Cover image for iPhone 的語音辨識功能:語音備忘錄,自動標點分段

iPhone 的語音辨識功能:語音備忘錄,自動標點分段

Comments 2
1 min read
How AI Tells the Difference Between “Ate” and “Eight” in Speech Recognition
Cover image for How AI Tells the Difference Between “Ate” and “Eight” in Speech Recognition

How AI Tells the Difference Between “Ate” and “Eight” in Speech Recognition

1
Comments
3 min read
How to make multilingual videos in 3 minutes
Cover image for How to make multilingual videos in 3 minutes

How to make multilingual videos in 3 minutes

1
Comments
3 min read
Высококачественная транскрипция зашумлённых двухканальных телефонных звонков
Cover image for Высококачественная транскрипция зашумлённых двухканальных телефонных звонков

Высококачественная транскрипция зашумлённых двухканальных телефонных звонков

2
Comments
1 min read
High-Quality Transcription of Noisy Dual-Channel Phone Calls
Cover image for High-Quality Transcription of Noisy Dual-Channel Phone Calls

High-Quality Transcription of Noisy Dual-Channel Phone Calls

2
Comments
3 min read
Why Maryrose Whittaker believes integrating Whisper in the Vodia PBX is a game changer

Why Maryrose Whittaker believes integrating Whisper in the Vodia PBX is a game changer

Comments
2 min read
OmniDictate: Free, Local, Real-Time AI Dictation for Windows

OmniDictate: Free, Local, Real-Time AI Dictation for Windows

4
Comments 1
5 min read
How to Use AI for Real-Time Speech Recognition and Transcription

How to Use AI for Real-Time Speech Recognition and Transcription

Comments
3 min read
loading...