Voicemail to Text: Every Way That Actually Works in 2026
Three paths exist — your phone's built-in transcription, your carrier's visual voicemail, or a paid service for saved files and tougher audio. Here's which one fits you.
- 60 minutes free
- No credit card
- 99 languages
- Speaker labels
Last verified June 24, 2026
TL;DR — pick your path
Voicemail transcription is three different jobs sharing one search query. Pick the row that matches what you’re trying to do.
| If you want to… | Use | Cost |
|---|---|---|
| Read incoming voicemails on iPhone | Visual Voicemail / Live Voicemail (built-in) | Free |
| Read incoming voicemails on Android | Google Voice or carrier Visual Voicemail | Free |
| Transcribe a voicemail you already saved | Upload to a transcription service | Free trial → ~$10/mo |
| Transcribe non-English voicemails | Paid service with multi-language support | ~$10/mo |
| Process voicemails for legal or medical use | Human-reviewed service (Rev) or HIPAA-compliant vendor | ~$1.50/min+ |
| Batch transcribe hundreds of voicemails | API service (DeluxeScribe, AssemblyAI, Deepgram) | Per-minute pricing |
How voicemail transcription actually works
Speech-to-text models turn audio into text by predicting the most likely sequence of words from acoustic features. They work best on clean, broadband audio — podcast studio recordings, Zoom calls recorded at 48 kHz. Voicemail is the opposite end of the spectrum.
Cellular voice traffic still uses narrow-band codecs (AMR, G.711, EVS in some cases) that sample at 8 kHz and aggressively compress to keep call quality bearable on bad connections. That strips out high-frequency information speech models rely on to distinguish consonants. Add a caller in a car with the AC on, an accent the model wasn’t heavily trained on, and a phone number mumbled at the end, and you get the transcription experience most people are familiar with: 80% right on the easy parts, garbage on the parts that matter.
This is why vendor “99% accuracy” claims don’t hold up on voicemail. Those numbers come from clean studio audio. On a typical voicemail, expect 75-90% word accuracy with phone numbers and proper nouns as the most common error sites.
Native phone options (free)
iPhone — Live Voicemail and Visual Voicemail
Live Voicemail (iOS 17 and later) transcribes incoming voicemails in real time as the caller is leaving them, on the device, in English. You can pick up mid-message after reading the screen. Turn it on in Settings → Phone → Live Voicemail. It works regardless of carrier and works offline.
Visual Voicemailtranscribes saved voicemails and shows them as text in the Phone app’s Voicemail tab. It requires carrier support — Verizon, T-Mobile, and AT&T all support it on capable iPhones. Other carriers and most MVNOs don’t. Coverage outside the US is patchy.
Known limitations:English-first; struggles with accented speech and phone numbers; transcription happens on Apple servers for Visual Voicemail (not Live Voicemail). If the voicemail is in a language other than English, you’ll usually get nonsense or a “Transcription not available” message.
Android — Google Voice
Google Voice gives you a free US phone number with voicemail transcription included. Messages appear in the Google Voice app and (optionally) get emailed to you with both the audio and the transcript. Multi-language transcription is more limited than iPhone’s native option — primarily English with partial support for Spanish and a few others.
Catch:Google Voice is US-only and requires you to use the Google Voice number as your voicemail destination. It doesn’t transcribe voicemails left on your existing carrier number unless you forward them.
Samsung and Pixel — carrier vs OEM
On Samsung and Pixel devices on US carriers, you usually get the carrier’s Visual Voicemail app installed by default. Pixel also has Google’s on-device call screen + voicemail transcription on supported regions, similar in spirit to iPhone’s Live Voicemail. Quality varies — the Pixel on-device option is generally the strongest of the Android free options for clear English.
Carrier-specific paths (2026 status)
Carrier features change without much announcement. The summary below was checked in June 2026 against each carrier’s public support documentation. If a step doesn’t work, check the carrier’s support page first — these features shift quarterly.
Verizon
Verizon offers Voicemail to Text via the My Verizon app and supports Visual Voicemail on capable phones at no extra charge for most postpaid plans. The previously-marketed Premium Visual Voicemail subscription has been phased out for new lines in some markets — Voicemail to Text is now bundled where available. Setup: open the My Verizon app → Account → Plan → Voicemail settings.
T-Mobile
T-Mobile Visual Voicemail is included with most postpaid plans and installed by default on T-Mobile-branded Android phones. On iPhone, voicemail transcription comes through Apple’s Visual Voicemail rather than a separate T-Mobile app. Known issue: VVM can stop syncing after an eSIM transfer; the fix is usually uninstalling and reinstalling the Visual Voicemail app, or for iPhone, toggling Settings → Phone → Show My Caller ID.
AT&T
AT&T Visual Voicemail is available on most postpaid devices and includes transcription for supported plans. Enable it through the AT&T Visual Voicemail app (Android) or your phone’s built-in Visual Voicemail (iPhone). Pre-paid and Cricket lines historically have more limited support — check the line type if transcription isn’t appearing.
MVNOs (Mint, Visible, Cricket, Google Fi, etc.)
MVNO coverage is the most inconsistent. Google Fi has the best built-in transcription experience (it uses Google Voice transcription on the backend). Mint Mobile uses T-Mobile’s visual voicemail infrastructure with some feature gaps. Visible (Verizon-owned) and Cricket (AT&T-owned) often lack the full transcription features of their parent carriers. Workaround: use Google Voice as a free secondary number for any line where the carrier transcription is missing.
Transcribing a saved voicemail audio file
This is the case the “5 best voicemail apps” listicles ignore. You have an audio file — exported from iPhone Voice Memos, downloaded from Google Voice, saved by a PBX, or shared from someone else’s phone — and you need text. Your phone’s built-in transcription only works for live voicemails on your line, not for files.
Step 1: Export the voicemail
- iPhone: Phone app → Voicemail → tap the message → Share button → save to Files, AirDrop, or share directly to an app. The export is typically an
.m4afile. - Google Voice: open the voicemail in voice.google.com or the app → three-dot menu → Download. You get an
.mp3. - Android Visual Voicemail (carrier app): the Share option saves the file; format varies by app, usually
.amror.3gp. Older carrier apps may not allow export — in that case, record the playback with Voice Memos / a recorder app as a workaround. - PBX / business phone systems: most modern systems (RingCentral, Nextiva, Dialpad, 8x8) deliver voicemails by email as MP3 or WAV. Save the attachment.
Step 2: Upload and transcribe
Any transcription tool that accepts MP3, M4A, AMR, or 3GP will work. The honest options:
- DeluxeScribe — drag-and-drop, 60 free minutes, 99 languages, speaker labels, exports to TXT/DOCX/PDF/SRT/VTT/JSON. Good fit if you have multiple voicemails or need non-English transcription.
- Rev — pay per minute, optional human-reviewed tier for legal/medical use where 100% accuracy matters.
- Self-hosted Whisper — free, fully private, but requires a Python command. Good for sensitive content.
whisper voicemail.m4a --model medium
Step 3: Spot-check the parts that matter
Regardless of which service you use, manually verify any phone numbers, names, addresses, or specific times. These are the highest-error parts of any voicemail transcription. A 90%-accurate transcript that gets the callback number wrong is worse than no transcript.
Paid services — ranked honestly
None of these are objectively the best for everyone. The right one depends on volume, accuracy needs, language, and whether you need a human in the loop. Ranked by what they actually do best:
| Service | Free tier | Paid from | Wins on | Loses on |
|---|---|---|---|---|
| DeluxeScribe | 60 min one-time | $10/mo · 1,200 min | Per-minute price, 99 languages, speaker labels | No human-reviewed tier |
| Rev | None for human; AI trial varies | $0.25/min AI · $1.50/min human | Human-verified accuracy for legal/medical | Slowest turnaround on human tier |
| Otter | 300 min/mo | $17/mo | Meeting use cases, calendar integration | English-first, file size limits on free tier |
| Trint | None | $48/mo · 7 hours | Editor with newsroom-style workflows | Expensive for small batches |
| Whisper (self-hosted) | Free | Free | Full privacy, no upload | Requires Python; slow on CPU |
Pricing checked June 2026 on each vendor’s public pricing page. Tiers change frequently — verify before you sign up.
Accuracy reality — what we found
We ran five sample voicemails through iPhone Visual Voicemail (Verizon line), Google Voice, and DeluxeScribe to see where each actually wins. The samples covered the realistic spread of voicemail audio: clean studio-quality, accented English, background noise, number-heavy (a contractor leaving a quote), and fast speech (a frustrated customer callback).
What worked across all three services
- Clean English voicemails — all three got 95%+ word accuracy
- Common names (John, Sarah, Mike) — generally correct
- Greetings and sign-offs — formulaic, easy to predict
What broke consistently
- Phone numbers spoken at the end— all three dropped digits or mistranscribed them. “Five five five” sometimes became “555” correctly, sometimes “5, 5, 5”, sometimes “fives.”
- Accented English — the iPhone built-in underperformed on Indian English compared to Google Voice and DeluxeScribe.
- Uncommon proper nouns — business names, street names, prescription drug names. Regardless of service, expect to spot-check these.
Where each one had an edge
- iPhone Visual Voicemail — best UX (no extra app), worst on non-English audio.
- Google Voice— solid for US-English, free forever, awkward if you don’t want to use a Google number.
- DeluxeScribe — only one of the three that handled the Spanish-language sample correctly, and the only one that lets you transcribe a saved file.
Legal and privacy — for stored transcripts
This isn’t legal advice. Consult a lawyer for your jurisdiction if you’re processing voicemails at scale.
Recording laws in the US split into one-party-consent states (most of the country) and two-party / all-party-consent states (California, Florida, Pennsylvania, Washington, Illinois, Maryland, Massachusetts, Montana, New Hampshire, and a handful of others). The Reporters Committee for Freedom of the Press maintains a current state-by-state table.
For voicemails specifically, the analysis is usually narrower than for live calls: the caller voluntarily left a message, which most courts treat as consent to record. Storing the audio and the transcript is generally fine in any state. The places it gets complicated:
- Sharing the transcript with third parties. Internal use is one thing; forwarding a transcript to someone not on the original call may trigger different rules.
- Retention as a business record.If you’re a regulated industry (finance, healthcare, legal), the transcript may be subject to retention and destruction requirements that apply to the original recording.
- HIPAA.If a voicemail contains Protected Health Information and you’re a covered entity or business associate, you need a HIPAA-compliant transcription path. DeluxeScribe is not HIPAA-compliant. For PHI, use a vendor with a signed BAA.
- GDPR. Voicemails from EU residents are personal data. Storing transcripts requires a lawful basis, and the caller may have rights of access and erasure even if they originated the message.
When to ignore the transcript and just listen
A perfect transcript still misses things the audio carries. For the following, listen to the message before you act on the text:
- Tone-dependent decisions.“That’s fine” could be sincere or sarcastic. The transcript won’t tell you which.
- Phone numbers, addresses, prescription names.These are the highest-error parts of any voicemail transcription. If you’re going to call back or act on them, verify against the audio.
- Anything from an unknown number flagged as spam-likely. Spoofed-caller voicemails often read fine in text but sound clearly off in audio.
- Urgent-sounding messages. Pace and prosody communicate urgency the text strips out.
- Voicemails in languages your transcription engine doesn’t support well. A bad transcript can be worse than none.
How this page was verified
Related guides
- M4A to TextiPhone Voice Memos and other M4A files. Covers iOS 18's built-in option and how to export.
- MP3 to TextConvert any MP3 recording to text. Accuracy by recording type and the free options that actually work.
- Podcast TranscriptionListener and podcaster workflows, the Podcasting 2.0 spec, and a show-notes pipeline.
- Pricing60 minutes free, then $10/month for 1,200 minutes. Per-minute math for batch voicemail jobs.