Whisper UI - AI Audio Transcribe is a powerful and innovative app that lets you convert any audio file into text or subtitles in seconds. Whether you need to transcribe an interview, a lecture, a podcast, or a video, Whisper UI can handle it all with ease and accuracy.
Whisper UI is more than just a transcription app. It is a fully offline app that uses OpenAI Whisper, a state-of-the-art speech recognition model, to transcribe audio on your computer. This means you don’t need any internet connection or worry about your data being sent to any remote server. You can enjoy fast and secure transcription of your audio files, without compromising on quality or privacy.
Now with GPU Hardware Acceleration: Whisper UI takes a giant leap forward by integrating support for GPU hardware acceleration. Harness the power of your computer’s CPU, OpenCL, and NVIDIA CUDA (versions 12 and 11) to boost transcription performance significantly. This feature enables faster processing times and smoother operation, especially for lengthy or complex audio files.
Fully Offline Capabilities: Utilizing the advanced OpenAI Whisper speech recognition model, Whisper UI operates entirely offline. This ensures your transcriptions are processed on your device without requiring an internet connection, guaranteeing privacy and security.
New Feature: LLM-Powered Offline Subtitle Translation
Whisper UI now includes a groundbreaking feature that utilizes Large Language Models (LLM) to translate subtitles offline, leveraging the power of your computer. This new addition enhances the app’s capabilities, allowing you to:
With Whisper UI, you can:
- Transcribe audio from any format, including MP4, MOV, MKV, AVI, MJPEG, MPEG, F4V, FLV, M2T, M2TS, M2V, 3GP, 3G2, MP3, WAV, OGG, FLAC, M4A, M4V, AIFF
- Record and transcribe audio directly from your computer’s microphone or any audio input device
- Select the input audio language and output text language
- Translate audio from 57 different languages into English
- Specify source language of any of the 57 supported languages
- Generate subtitles in various formats, including .srt, .ass, .vtt, ssa. .lrc
- Download the generated text or subtitle file
- Edit or correct the transcription within the app
- Install as a background service for Scorpio Player to use live transcription and display subtitles using your own computer power
- Customize the app’s appearance with Mica, Mica Alt, Acrylic, or Dynamic Shader Animation backgrounds
- Translate subtitles offline: With the integration of LLM, you can now translate subtitles without an internet connection, ensuring your data remains private and secure.
- Utilize your computer’s power: The translation process is performed directly on your computer, using its processing capabilities to deliver quick and accurate results.
- Support for multiple languages: This feature supports translation between various languages, making it easier to work with international content.
Whisper UI is the ultimate app for anyone who works with audio content. It saves you time and effort by providing you with accurate and editable transcriptions in minutes. It also helps you communicate and collaborate with people from different languages and cultures by translating audio with a single tap.
Available models and languages
There are five model sizes, four with English-only versions, offering speed and accuracy tradeoffs. Below are the names of the available models and their approximate memory requirements and inference speed relative to the large model; actual speed may vary depending on many factors including the available hardware.
AI Subtitle Translator: Bridging Language Barriers with Precision
Our app proudly supports an extensive range of languages for translation, ensuring that your subtitles are accurately conveyed no matter the content. Here’s a detailed look at our supported languages:
- English (US): Experience translations with American idioms and cultural nuances.
- English (Great Britain): Enjoy the charm of British English with its unique spellings and expressions.
- Chinese Simplified: Navigate the modern simplicity of China’s most widely used writing system.
- Chinese Traditional: Retain the classic beauty of traditional Chinese characters in your subtitles.
- Arabic: Connect with the rich linguistic tapestry of the Arab world through precise subtitle translation.
- German: Immerse yourself in the linguistic depth of Germany with translations that capture its essence.
- French: Feel the romance of French cinema with subtitles that resonate with Francophone eloquence.
- Italian: Relish in the lyrical rhythm of Italian dialogue with subtitles that sing.
- Japanese: Dive into the intricate layers of Japanese storytelling with culturally aware translations.
- Korean: Engage with the vibrant energy of Korean media through accurate and timely translations.
- Portuguese: Embrace the diverse dialects of Portuguese-speaking countries with tailored subtitle translations.
- Russian: Explore the vastness of Russian literature and film with subtitles that do justice to its complexity.
- Spanish: Revel in the diversity of Spanish variants, from European to Latin American, all translated with care.
- Turkish: Immerse yourself in the storied tradition of Turkey with subtitles that capture the essence of its language and culture.
Memory usage
Model Disk Mem SHA
tiny 75 MB ~125 MB bd577a113a864445d4c299885e0cb97d4ba92b5f
base 142 MB ~210 MB 465707469ff3a37a2b9b8d8f89f2f99de7299dac
small 466 MB ~600 MB 55356645c2b361a969dfd0ef2c5a50d530afd8d5
medium 1.5 GB ~1.7 GB fd9727b6e1217c2f614f9b698455c4ffd82463b4
large 2.9 GB ~3.3 GB ad82bf6a9043ceed055076d0fd39f5f186ff8062