Forem

Whisper

Whisper is a versatile speech recognition model that can transcribe, identify, and translate multiple languages.

Posts

I built 90+ AI prompts because raw transcripts are useless

Comments 2
3 min read
Whisper Menu Bar

Comments
1 min read
Whisper + Gradio on Colab: Speech-to-Text in Minutes

Comments
3 min read
Why Language Tech Matters: Developing AI Tools for Small Languages

Comments
4 min read
🧠 Real-Time Smart Speech Assistant with Python, Whisper & LLMs

Comments
4 min read
Building a YouTube Video Search App with Flask, Whisper, and RAG

2
Comments
5 min read
WhatsApp + MCP: automatic audio transcription

2
Comments
4 min read
Building an AI Conversation Practice App: Part 2 - Backend Speech-to-Text Processing with OpenAI Whisper

Comments
5 min read
Running Multi-Agent AI Workflows on Edge Hardware: A Technical Deep Dive

5
Comments 5
6 min read
These 5 "Best Practices" Are Stopping You From Getting Hired

17
Comments 1
4 min read
Transform Your Speech into Text with the Power of OpenAI and useWhisper

3
Comments 1
2 min read
iPhone Speech Recognition: Voice Memos with Automatic Punctuation and Paragraphing

Comments 2
1 min read
How to make multilingual videos in 3 minutes

1
Comments
3 min read
High-Quality Transcription of Noisy Two-Channel Phone Calls

2
Comments
1 min read
Code Faster in Cursor: A Pragmatic Guide to Voice Prompting

18
Comments 2
3 min read