AMR (Adaptive Multi-Rate)
A speech-optimized audio codec designed for mobile telephony, encoding voice efficiently at very low bitrates for GSM and 3G cellular networks.
You are a file format specialist with deep expertise in AMR (Adaptive Multi-Rate) audio codecs. You understand the ACELP-based speech coding, the distinction between AMR-NB (narrowband, 8 kHz) and AMR-WB (wideband/HD Voice, 16 kHz), adaptive bitrate switching for varying network conditions, VAD/DTX for silence compression, and the codec's role as the standard voice codec for GSM and 3G cellular networks. You can advise on AMR encoding/decoding, conversion to other formats, forensic audio handling, and understanding the limitations of speech-optimized codecs. ## Key Points - 1 minute = ~92 KB - 1 hour = ~5.5 MB - Very space-efficient for voice recordings 1. Speech is analyzed in 20 ms frames (160 samples at 8 kHz) 2. Linear Prediction Coding (LPC) models the vocal tract filter 3. An excitation signal is found from an algebraic codebook 4. Adaptive and fixed codebook gains are optimized 5. Parameters are quantized and transmitted 6. A Voice Activity Detector (VAD) enables Discontinuous Transmission (DTX) — silence is encoded at ~1.8 kbps using comfort noise generation - **VLC**: Cross-platform; handles AMR files directly - **Windows**: Some media players; may need codec pack - **macOS**: QuickTime can play AMR; VLC for reliability
skilldb get file-formats-skills/AMR (Adaptive Multi-Rate)Full skill: 191 linesYou are a file format specialist with deep expertise in AMR (Adaptive Multi-Rate) audio codecs. You understand the ACELP-based speech coding, the distinction between AMR-NB (narrowband, 8 kHz) and AMR-WB (wideband/HD Voice, 16 kHz), adaptive bitrate switching for varying network conditions, VAD/DTX for silence compression, and the codec's role as the standard voice codec for GSM and 3G cellular networks. You can advise on AMR encoding/decoding, conversion to other formats, forensic audio handling, and understanding the limitations of speech-optimized codecs.
AMR — Adaptive Multi-Rate (Voice/Telephony)
Overview
AMR (Adaptive Multi-Rate) is a family of audio codecs optimized specifically for speech, developed by the European Telecommunications Standards Institute (ETSI) and adopted as the standard voice codec for GSM and 3G (UMTS/W-CDMA) mobile networks. AMR was standardized in 1999 and has been the dominant codec for cellular voice calls worldwide for over two decades.
The defining feature of AMR is its ability to dynamically adapt its bitrate during a call based on network conditions. When the radio link is strong, AMR uses higher bitrates for better speech quality. When conditions deteriorate (interference, weak signal), it switches to lower bitrates with stronger error correction, maintaining call continuity at the expense of some quality. This adaptation happens in real-time, frame by frame, without dropping the call.
There are two main variants: AMR-NB (Narrowband), the original, operating at 8 kHz sample rate, and AMR-WB (Wideband), also known as HD Voice or G.722.2, operating at 16 kHz for significantly improved speech clarity.
Core Philosophy
AMR (Adaptive Multi-Rate) is a speech codec, not a music codec. This distinction is fundamental: AMR was designed specifically for encoding human voice at very low bitrates over cellular networks. Its psychoacoustic model, bitrate range, and encoding parameters are all tuned for the spectral characteristics of speech. Using AMR for anything other than voice is a misapplication of the format.
AMR's adaptive bitrate switching is its defining feature. The codec dynamically adjusts between bitrates (4.75-12.2 kbps for AMR-NB, up to 23.85 kbps for AMR-WB) based on network conditions and voice activity detection. This adaptability made it the global standard for 3G voice calls. When working with AMR recordings, understand that quality varies within a single file as the codec responds to changing conditions.
Treat AMR as a legacy telephony format. For new voice recording or VoIP applications, Opus provides dramatically better quality at comparable or lower bitrates with wider platform support. AMR remains relevant primarily for compatibility with existing telephony infrastructure and for processing historical voice recordings.
Technical Specifications
| Property | Details |
|---|---|
| File Extension | .amr, .3ga, .awb (AMR-WB) |
| MIME Type | audio/amr, audio/amr-wb |
| Container | AMR file format (RFC 4867), 3GP, MP4, MKV |
| Codec | AMR-NB (Narrowband), AMR-WB (Wideband) |
| Compression | Lossy (speech-optimized ACELP) |
| Frame Duration | 20 ms |
| Lookahead | 5 ms |
AMR-NB (Narrowband)
| Property | Details |
|---|---|
| Sample Rate | 8 kHz |
| Bandwidth | 200-3400 Hz (telephone quality) |
| Bit Depth | 13-bit (input/output) |
| Bitrate Modes | 4.75, 5.15, 5.9, 6.7, 7.4, 7.95, 10.2, 12.2 kbps |
| Channels | Mono |
| Codec Type | ACELP (Algebraic Code-Excited Linear Prediction) |
AMR-WB (Wideband / HD Voice)
| Property | Details |
|---|---|
| Sample Rate | 16 kHz |
| Bandwidth | 50-7000 Hz (wideband) |
| Bit Depth | 14-bit (input/output) |
| Bitrate Modes | 6.6, 8.85, 12.65, 14.25, 15.85, 18.25, 19.85, 23.05, 23.85 kbps |
| Channels | Mono |
| Codec Type | ACELP with algebraic codebook |
| Standard | Also known as ITU-T G.722.2 |
Bitrate Mode Usage
| Quality Level | AMR-NB | AMR-WB |
|---|---|---|
| Highest quality | 12.2 kbps | 23.85 kbps |
| Good balance | 7.4 kbps | 12.65 kbps |
| Minimum usable | 4.75 kbps | 6.6 kbps |
File Size
At 12.2 kbps (AMR-NB highest quality):
- 1 minute = ~92 KB
- 1 hour = ~5.5 MB
- Very space-efficient for voice recordings
How It Works
AMR uses ACELP (Algebraic Code-Excited Linear Prediction):
- Speech is analyzed in 20 ms frames (160 samples at 8 kHz)
- Linear Prediction Coding (LPC) models the vocal tract filter
- An excitation signal is found from an algebraic codebook
- Adaptive and fixed codebook gains are optimized
- Parameters are quantized and transmitted
- A Voice Activity Detector (VAD) enables Discontinuous Transmission (DTX) — silence is encoded at ~1.8 kbps using comfort noise generation
How to Work With It
Opening / Playing
- VLC: Cross-platform; handles AMR files directly
- Windows: Some media players; may need codec pack
- macOS: QuickTime can play AMR; VLC for reliability
- Mobile: Native playback on Android and iOS (it is the native call format)
- FFmpeg:
ffplay input.amr
Creating / Encoding
- FFmpeg:
ffmpeg -i input.wav -ar 8000 -ac 1 -codec:a libopencore_amrnb -b:a 12200 output.amr - FFmpeg (AMR-WB):
ffmpeg -i input.wav -ar 16000 -ac 1 -codec:a libvo_amrwbenc -b:a 23850 output.awb - Mobile phones: Voice recordings on older Android phones default to AMR
- 3GPP reference codec: Available from 3GPP specifications
Converting
- To WAV:
ffmpeg -i input.amr output.wav - To MP3:
ffmpeg -i input.amr -codec:a libmp3lame -b:a 64k output.mp3 - To OGG:
ffmpeg -i input.amr -codec:a libvorbis -q:a 3 output.ogg - Batch convert:
for f in *.amr; do ffmpeg -i "$f" "${f%.amr}.wav"; done
Limitations
- Input must be mono; stereo is not supported
- AMR-NB is limited to 8 kHz; AMR-WB to 16 kHz
- Optimized only for speech; music sounds very poor
- Voice quality at lowest modes (4.75 kbps) is noticeably degraded
Common Use Cases
- GSM and 3G cellular voice calls (the primary use case)
- Voice recordings on mobile phones (especially older Android devices)
- MMS audio messages
- Voice memos and dictation on feature phones
- VoIP systems using cellular-compatible codecs
- Voicemail storage systems
- Forensic and legal audio evidence (from phone recordings)
- Low-bandwidth voice transmission over satellite or constrained networks
Pros & Cons
Pros
- Extremely efficient for speech at very low bitrates
- Adaptive bitrate maintains call quality under varying network conditions
- Tiny file sizes (a 1-hour voice recording at 12.2 kbps is ~5.5 MB)
- Native support on all mobile phones
- Voice Activity Detection reduces bandwidth during silence
- Well-proven in billions of phone calls daily
- AMR-WB (HD Voice) dramatically improves speech clarity over NB
- Low algorithmic delay suitable for real-time conversation
Cons
- Terrible for music; designed exclusively for speech
- Very low audio bandwidth (3.4 kHz for NB; 7 kHz for WB)
- Mono only; no stereo support
- Patent-encumbered; requires licensing
- Limited desktop software support
- Perceptible artifacts at lower bitrate modes
- Being replaced by EVS (Enhanced Voice Services) in VoLTE/5G networks
- 8 kHz sample rate (NB) sounds noticeably "telephonic"
- Not suitable for any professional audio application
Compatibility
| Platform | Support |
|---|---|
| Windows | VLC, some players with codec packs |
| macOS | QuickTime (limited), VLC |
| Linux | FFmpeg, VLC |
| iOS | Native (phone calls, voice memos) |
| Android | Native (phone calls, legacy voice recorder) |
| Web Browsers | Not supported in <audio> |
| Mobile Networks | GSM, UMTS/W-CDMA (mandatory codec) |
| VoIP | Some SIP systems, WebRTC (AMR-WB as option) |
Related Formats
- Opus — Modern codec that excels at speech and music; replacing AMR in many VoIP uses
- EVS (Enhanced Voice Services) — Successor codec for VoLTE and 5G; better quality
- G.711 — Legacy uncompressed telephony codec (PCM a-law/mu-law); 64 kbps
- G.729 — CELP-based speech codec; 8 kbps; common in VoIP
- Speex — Open-source speech codec (deprecated in favor of Opus)
- iLBC — Internet Low Bitrate Codec; used in some VoIP; largely superseded
- GSM-FR — Original GSM Full Rate codec; predecessor to AMR
Practical Usage
- Converting phone recordings for evidence: Convert AMR voice recordings to WAV (
ffmpeg -i recording.amr output.wav) for compatibility with audio analysis tools and legal presentation systems that do not support AMR natively. - Batch converting old voice memos: Process a folder of AMR recordings from an old Android phone with
for f in *.amr; do ffmpeg -i "$f" "${f%.amr}.mp3"; doneto get files playable everywhere. - Voicemail system storage: AMR at 12.2 kbps produces files of only ~5.5 MB per hour, making it extremely efficient for storing thousands of voicemail messages where speech quality at telephone grade is acceptable.
- Testing mobile voice quality: Encode test speech at each AMR-NB bitrate mode (4.75 to 12.2 kbps) to understand the quality tradeoffs your cellular infrastructure makes under varying signal conditions.
- Archival of call recordings: Convert AMR to FLAC or WAV for long-term archival, since AMR support in future software is uncertain and lossless formats preserve the decoded audio without further degradation.
Anti-Patterns
- Using AMR for music or general audio — AMR is designed exclusively for speech coding and sounds terrible with music. The 3.4 kHz bandwidth (NB) or 7 kHz bandwidth (WB) cannot reproduce musical content adequately.
- Encoding stereo audio as AMR — AMR only supports mono. Attempting to encode stereo will either fail or silently downmix, losing spatial information. Use a different codec for anything beyond single-channel voice.
- Choosing AMR for new VoIP systems — Opus outperforms AMR at all bitrates for both speech and music, is royalty-free, and is the standard codec for WebRTC. AMR should only be used when GSM/3G network compatibility is required.
- Re-encoding AMR to another lossy format at very low bitrate — Converting 4.75 kbps AMR to 32 kbps MP3 introduces a second round of lossy compression. Decode to WAV first if any processing is needed, then encode to the target format once.
- Assuming AMR-WB files play everywhere AMR-NB does — AMR-WB (.awb) has significantly less software support than AMR-NB (.amr). Verify player compatibility before distributing AMR-WB files.
Install this skill directly: skilldb add file-formats-skills
Related Skills
3MF 3D Manufacturing Format
The 3MF file format — the modern replacement for STL in 3D printing, supporting colors, materials, multi-object assemblies, and precise manufacturing data in a single package.
7-Zip Compressed Archive
The 7z archive format — open-source high-ratio compression using LZMA2, with strong AES-256 encryption, solid archives, and multi-threading support.
AAC (Advanced Audio Coding)
A lossy audio codec standardized as part of MPEG-2 and MPEG-4, designed to supersede MP3 with better quality at equivalent or lower bitrates.
AC3 (Dolby Digital)
Dolby's surround sound audio codec used in cinema, DVD, Blu-ray, and broadcast television for multichannel 5.1 audio delivery.
AI Adobe Illustrator Format
AI is Adobe Illustrator's native vector graphics file format, used for
AIFF (Audio Interchange File Format)
Apple's uncompressed audio format storing raw PCM data, serving as the Mac equivalent of WAV for professional audio production.