iPhone Voice Memo Transcription: the iOS 18 guide (and what to do when it isn't enough)

iOS 18 ships free on-device transcription in the Voice Memos app. It's excellent — until it isn't.

iPhone 12 and newer running iOS 18 transcribe Voice Memos on-device, for free, in 10 languages— open a recording, tap the three-dot menu, choose View Transcript, done. If you’re on iPhone 11 or earlier, or the recording predates your iOS 18 install, or you need speaker labels, .srt export, or a language Apple doesn’t support, the built-in path won’t work. Export the .m4a (Share → Save to Files) and upload to a transcription service — DeluxeScribe handles voice memo files in 99 languages with speaker labels and timestamped exports, 60 minutes free. Full walkthrough of both paths below, plus the specific device and language limits Apple Support doesn’t highlight.
  • 60 minutes free
  • No credit card
  • 99 languages
  • Speaker labels

Last verified July 1, 2026

TL;DR — which path is yours

Your situationBest pathCost
iPhone 12+ on iOS 18, new recording, supported language, one speakeriOS 18 built-in (Voice Memos → View Transcript)Free, on-device
iPhone 11 or earlier (any recording)Export .m4a → upload to a serviceFree tier → paid from ~$10/mo
iPhone 12+ but recording predates iOS 18 installExport .m4a → upload to a serviceFree tier → paid from ~$10/mo
Language not on Apple’s 10 (Turkish, Arabic, Hindi, Russian…)Export .m4a → multi-language serviceFree tier → paid from ~$10/mo
Two or more people talking (need speaker labels)Export .m4a → service with diarizationFree tier → paid from ~$10/mo
Need .srt / .vtt / timestamped exportExport .m4a → service with subtitle exportFree tier → paid from ~$10/mo
Long recording (> 30 min) or bulk of many memosExport → cloud service (batch upload)Free tier → paid from ~$10/mo
Sensitive content (medical, legal, confidential)iOS 18 built-in (if it fits) or self-hosted WhisperFree, no upload

iOS 18 built-in transcription — full walkthrough

Apple shipped on-device Voice Memos transcription in iOS 18 (September 2024). It runs entirely on the iPhone’s Neural Engine, works offline, and produces a searchable transcript you can copy out. For most one-person voice memos on a recent iPhone, this is the right answer and you should stop reading this page.

Step-by-step

  1. Update to iOS 18— Settings → General → Software Update. If your iPhone is 12 or newer, you’ll see the option.
  2. Open Voice Memos— it lives in the Utilities folder by default, or Spotlight-search “Voice Memos.”
  3. Record a new memo (or tap an existing one that was recorded after the iOS 18 install).
  4. Open the transcript view — tap the recording to expand it, then tap the three-dot menu (…) at the top-right and choose View Transcript. Older iOS 18 point releases showed a small “Aa” icon; newer releases put it in the menu.
  5. Copy or share — either long-press text in the transcript to select and Copy, or use Copy Transcript from the menu to grab the whole thing.

What you get

  • A single running text block with punctuation and automatic paragraphing on natural pauses.
  • Text is searchable within Voice Memos — swipe down for the search bar and type any word.
  • The transcript view links back to the audio: tap a word to seek playback to that moment.

What you don’t get

  • No speaker labels. The transcript is one running block regardless of how many people talked.
  • No file export. You can only copy the text — no .srt, no .vtt, no .docx, no .json.
  • No timestamps in the copied text. The in-app view lets you tap-to-seek, but the copied text is plain — no timecodes.
  • No in-place editing.Mis-heard word? Copy-paste elsewhere to fix; you can’t edit the transcript inside Voice Memos.

Device and language requirements

Supported iPhones

Apple’s on-device transcription requires the A14 Bionic chip or later, which means:

iPhone modelChipVoice Memos transcription (iOS 18)
iPhone 16 (all)A18 / A18 ProYes
iPhone 15 (all)A16 / A17 ProYes
iPhone 14 (all)A15 / A16Yes
iPhone 13 (all)A15Yes
iPhone 12 (all)A14Yes
iPhone SE (3rd gen, 2022)A15No — Apple restricts to iPhone 12+ line, not chip
iPhone 11 / 11 Pro / 11 Pro MaxA13No
iPhone XS / XR / XA12 / A11No
iPhone SE (2nd gen, 2020) and earlierA13 or olderNo

No software update will add the feature to unsupported models. The Neural Engine on the A13 and earlier doesn’t have the capacity for Apple’s on-device speech model.

Supported languages (iOS 18)

Ten languages are covered at launch, with regional variants handled where relevant:

  • English (US, UK, Australia, Canada, India, Ireland, New Zealand, Singapore, South Africa)
  • Spanish (Spain, Mexico, US)
  • Portuguese (Brazil)
  • French (France, Canada)
  • German
  • Italian
  • Japanese
  • Korean
  • Mandarin (Simplified Chinese)
  • Cantonese (Traditional Chinese)

Not supported: Turkish, Arabic, Russian, Hindi, Bengali, Urdu, Dutch, Polish, Vietnamese, Thai, Indonesian, Swedish, Norwegian, Danish, Finnish, Greek, Hebrew, and most others. If you record in one of these, the built-in transcript is either empty or a Latin-script approximation of the sounds (rarely useful). For unsupported languages, use the upload path below.

When the built-in feature isn’t enough

The clean cases where iOS 18 works are narrower than Apple’s marketing suggests. Here’s a checklist of when to skip the built-in path:

  • iPhone 11 or earlier — no on-device option, period.
  • Recording predates your iOS 18 install— Apple only transcribes new recordings. Existing memos don’t backfill after the update. This is the #1 complaint in Apple Discussions threads and there’s no official workaround.
  • iPhone SE (any generation)— despite the SE 3rd gen having the A15 chip, Apple restricts the feature to the numbered iPhone 12+ line. The SE doesn’t get it.
  • Language not on the 10-language list. Turkish, Arabic, Hindi, Russian, and dozens of others produce empty or garbage output.
  • Two or more speakers.The transcript is one undifferentiated block. If you recorded an interview or a meeting as a memo, you need diarization — the built-in doesn’t have it.
  • You need .srt for a video edit. Voice Memos only exposes copy-to-clipboard. No subtitle file, no timestamped export.
  • You need to edit the transcript in place (fix mis-heard proper nouns without leaving the tool). The built-in view is read-only.
  • You have 50+ old memos to process.Even if they were made after iOS 18, transcription happens per-recording and there’s no batch view. Export as a bulk share and upload to a service that handles batches.
  • Recording is very long (> 60 minutes). On-device transcription of multi-hour files can pause, fail to complete, or drain the battery visibly on 12/13 mini devices.
  • You want higher accuracy on technical vocabulary. Apple’s on-device model is roughly Whisper-small class — solid on general speech, weaker on medical, legal, and engineering jargon. Cloud services running Whisper large-v3 or a commercial ASR are meaningfully more accurate on those.

Transcribing Voice Memos on Mac (Sequoia + Apple Silicon)

macOS Sequoia (14.7 and later) added the same on-device transcription to the Mac Voice Memos app. Requirements:

  • Mac with Apple Silicon (M1, M2, M3, M4, or later). Intel Macs are not supported — the on-device speech model requires the Apple Neural Engine.
  • macOS Sequoia 14.7 or later.
  • Same 10 supported languages as iPhone.

How Voice Memos syncs iPhone → Mac

  1. On iPhone: Settings → [your name] → iCloud → “Show All” → Voice Memos → toggle Sync this iPhone on.
  2. On Mac: System Settings → Apple Account → iCloud → Voice Memos → enable.
  3. A recording made on iPhone appears in Voice Memos on Mac within a minute or two (assuming both devices are online).

Viewing the transcript on Mac

Right-click a recording in the sidebar and choose View Transcript. The transcript pane opens on the right. On Mac you can select text with a normal drag and copy with ⌘C — noticeably easier than the iPhone long-press selection.

Why the Mac path matters even if you have a new iPhone

If you want to pipe the transcript into another tool (Notion, Notes, Obsidian, a script), copying from Mac is faster than fighting the iPhone selection UI. And if you’re transcribing a batch of memos, the Mac layout lets you jump between recordings without losing your place.

The upload path — exporting .m4a and using a service

For every case the built-in feature doesn’t cover, the workflow is the same: export the recording as .m4a and upload it to a transcription service.

Step 1 — Export the recording

Three ways off the phone:

  1. Share menu (fastest). Voice Memos → tap recording → three-dot menu → Share → choose Save to Files, AirDrop, Mail, or any installed app. The file is .m4a(AAC audio in an MP4 container). If you’ve ever used the memo as a ringtone, it may save as .m4r — rename to .m4a if a transcription service refuses it.
  2. iCloud Drive. With Voice Memos iCloud sync on (see Mac section above), your recordings live in iCloud Drive → Voice Memos folder, reachable from any Mac or from icloud.com.
  3. USB-C / Lightning cable.Plug the iPhone into a Mac and open the Mac’s Voice Memos app — recordings sync via Finder on recent macOS versions. On Windows, use the Apple Devices app.

Step 2 — Upload to a service

Any service that accepts .m4a works. Concrete step-by-step with DeluxeScribe:

  1. Sign up (60 minutes free, no credit card required).
  2. Drag your .m4a into the upload area, or click to browse.
  3. Language auto-detects — leave it unless you want to force a specific dialect. Speaker labels default on; toggle off for solo memos.
  4. Wait 1–3 minutes for a typical short memo; longer recordings scale roughly linearly.
  5. Review in the browser editor, fix any mis-heard proper nouns or numbers, then export as TXT, DOCX, PDF, SRT, VTT, or JSON.

Skip the iOS 18 upgrade — upload a voice memo now

60 minutes free, no credit card. Accepts M4A, MP3, WAV, and 17 other formats in 99 languages. Speaker labels, timestamps, and .srt export included.

Built-in vs upload — feature comparison

The honest way to think about this: iOS 18’s built-in is free and private for the narrow slice it covers. Cloud services handle everything else.

FeatureiOS 18 built-inDeluxeScribeRev (AI)Otter
Cost for casual useFree60 min free, then ~$10/mo$0.25/min pay-as-you-go300 min/mo free, then $17/mo
Works on iPhone 11 / SE / XR / XSNoYes (upload from any device)YesYes
Works on pre-iOS-18 recordingsNoYesYesYes
Languages supported1099~35~35
Speaker labels (diarization)NoYes, includedYesYes
.srt / .vtt exportNoYesYesLimited
Word-level timestampsTap-to-seek onlyYes (JSON export)YesYes
In-place transcript editorNo (read-only)Yes (browser)YesYes
Batch uploadN/AYesYesLimited
Processing locationOn-deviceCloudCloudCloud

Positioning: if the iOS 18 built-in fits your case, use it — it’s free and private. If it doesn’t (older iPhone, unsupported language, need speaker labels, need .srt), DeluxeScribe is our honest recommendation because we support the widest language set at the lowest per-minute price, but Rev and Otter are also valid picks depending on your workflow.

Speaker labels, timestamps, and .srt export

These three features are the most common reasons people move from the built-in to a service. Quick primer on each:

Speaker labels (diarization)

The model separates the audio by voice and tags each segment “Speaker 1,” “Speaker 2,” etc. — you rename them post-hoc. Essential for interviews, meetings, and any multi-person memo. Diarization is genuinely hard: even the best models make attribution errors on overlapping speech, on phone-quality audio, and when two speakers have similar voices. Realistic diarization accuracy on clean voice memo audio is 80–90% correct attribution; on noisy or overlapping speech, expect 60–75%.

Word-level timestamps

Each word (or token) is tagged with its start and end time in milliseconds. Used for: podcast show notes with “click-to-jump” UIs, video editors that align transcript edits to audio cuts, court reporting, and anywhere you need to prove “this word was said at this second.” DeluxeScribe includes word-level timestamps in JSON export by default.

.srt and .vtt subtitle export

.srt and .vttare the standard subtitle formats for video. If your voice memo will become a video’s soundtrack or subtitle track, you need one of these — the built-in Voice Memos transcript can’t produce them. Every serious transcription service offers both; our SRT generator walks through the format if you want the details.

Privacy — on-device vs cloud, honestly

This is the section vendor blogs skip because it doesn’t favor the vendor. The honest read:

iOS 18 built-in — actually on-device

Apple’s speech model runs on the iPhone’s Neural Engine with no network call. The audio does not leave the device. The transcript does not leave the device unless you copy or share it. This is verified behavior — the feature works in Airplane Mode. For genuinely sensitive content (medical intake notes, legal client conversations, confidential business), this is the correct default.

Cloud services — they need the audio

DeluxeScribe, Rev, Otter, HappyScribe, and every other SaaS transcription tool necessarily upload your audio to their servers to run the transcription. Reputable services encrypt in transit and at rest, and offer data-deletion controls. DeluxeScribe deletes uploaded audio after 30 days by default (configurable) and never uses customer audio to train models. But it is upload — a fundamentally different trust model from on-device.

Self-hosted Whisper — the third option

If you need on-device processing but the built-in doesn’t fit (unsupported language, older device, need .srt), self-hosted Whisper is the private fallback. Install via pip install openai-whisper or Homebrew’s whisper-cpp, run against your .m4a, get a transcript with no upload. Cost: some technical setup and a few minutes per hour of audio on a modern Mac.

The HIPAA question

Apple’s on-device transcription is not HIPAA-certified — Apple doesn’t sign BAAs for consumer features — but because no data leaves the device, many compliance teams treat it as equivalent to writing longhand notes. Cloud services that offer HIPAA BAAs (Rev, some tiers of DeluxeScribe) are the compliant path for regulated healthcare use; see our medical transcription guide for the full framework.

How this page was verified

iOS 18 behavior and the supported-device list were verified against Apple Support’s “View a Voice Memos transcription on iPhone” documentation. The iOS 18 launch details and language coverage were cross-checked against Tom’s Guide (Sept 2024 launch coverage). The pre-iOS-18-memo limitation is documented in user threads at Apple Discussions — Apple has not published an official workaround. Accuracy ranges in the tables below come from our own testing of 12 sample memos (6 English, 6 Spanish; mix of quiet-room, walking, and in-car recordings) through Apple Voice Memos on iPhone 15 Pro and through DeluxeScribe. We don’t cite the “99% accuracy” marketing figure common to vendor copy because we can’t source it to a published benchmark.

Frequently Asked Questions

Does iPhone automatically transcribe voice memos in iOS 18?

Yes, on iPhone 12 or newer running iOS 18 or later. New recordings transcribe on-device automatically — open Voice Memos, tap the recording, swipe up on the title (or tap the three-dot menu → View Transcript) to see it. The transcript runs entirely on-device, so no audio leaves your phone.

Which iPhones support Voice Memos transcription?

iPhone 12, 12 mini, 12 Pro, 12 Pro Max, 13/14/15/16 all generations, and any newer model, running iOS 18 or later. iPhone 11, iPhone SE (2nd/3rd gen), iPhone XR, XS, and earlier do not support on-device Voice Memos transcription — Apple's Neural Engine on the A13 chip and earlier isn't powerful enough. No software update will add it.

What languages does the built-in transcription support?

As of iOS 18: English (US, UK, Australia, Canada, India, Ireland, New Zealand, Singapore, South Africa), Spanish (Spain, Mexico, US), Portuguese (Brazil), French (France, Canada), German, Italian, Japanese, Korean, Simplified Chinese (Mandarin), and Traditional Chinese (Cantonese). Turkish, Arabic, Russian, Hindi, Dutch, Polish, Vietnamese, Thai, and most other languages are not supported — the recording either produces empty text or a Latin-script approximation.

Does iPhone 11 support Voice Memos transcription?

No. Apple restricts on-device Voice Memos transcription to iPhone 12 and later — the A13 Neural Engine in iPhone 11 doesn't have the ML capacity for Apple's speech model. To transcribe a voice memo on iPhone 11 (or SE 2nd/3rd gen, XR, XS), export the .m4a file (Share → Save to Files) and upload it to a transcription service. DeluxeScribe handles .m4a in 99 languages with 60 minutes free.

Can I get speaker labels for a Voice Memos transcript?

Not from Apple's built-in transcription — it produces a single running block of text with no speaker attribution. If your recording has two or more people talking, upload the .m4a to a service with diarization. DeluxeScribe, Otter, Rev, and Descript all provide speaker labels; DeluxeScribe includes them by default with no extra charge.

Can I export a Voice Memos transcript as .srt or .vtt?

Apple's built-in transcript only supports copy-to-clipboard — no .srt, .vtt, .docx, or timestamped export. If you need subtitles (for a video edit) or timestamped segments (for a podcast show notes tool), upload the .m4a to a transcription service. DeluxeScribe exports TXT, DOCX, PDF, SRT, VTT, and JSON with word-level timestamps.

How accurate is Apple's built-in Voice Memos transcription?

For short, close-mic recordings in a quiet room and one of the 10 supported languages, accuracy is typically 92–97% word-level. Accuracy degrades on: background noise (walking, car, cafe), technical jargon or proper nouns, recordings over ~30 minutes (occasional dropouts), and heavily accented speech in a language not natively expected for that accent. Phone numbers and unusual names are the most frequently mis-heard tokens.

Can I transcribe a Voice Memos recording that was made before I updated to iOS 18?

Not automatically. Apple's on-device transcription processes new recordings as they're saved — old memos aren't backfilled after the iOS 18 update, and there's no official 'transcribe now' button for them. The workaround: open the old memo, share it (Share → Save to Files → get .m4a), and upload the file to a transcription service. This works even if you're on the exact same iPhone.

How do I transcribe Voice Memos on my Mac?

macOS Sequoia (14.7+) added the same on-device transcription to the Mac Voice Memos app. Requirements are Mac with Apple Silicon (M1 or later) plus macOS Sequoia. Voice Memos syncs across iPhone, iPad, and Mac via iCloud — a recording made on iPhone appears on Mac automatically if iCloud Drive Voice Memos sync is on. On Mac, right-click the recording → View Transcript.

Is Apple's on-device transcription really private?

Yes — the model runs entirely on the iPhone's Neural Engine with no network call. Apple does not upload the audio or the transcript. This is genuinely different from cloud services like DeluxeScribe, Rev, or Otter, which necessarily upload audio to their servers for processing. If you're recording sensitive content (medical, legal, confidential business), on-device is the right choice if it fits your language and device requirements; if not, self-hosted Whisper is the private fallback.