Comparison of audio coding formats
Appearance
Technical details
Audio compression format | Algorithm | Sample rate | Bit rate | Latency | CBR | VBR | Stereo | Multichannel |
---|---|---|---|---|---|---|---|---|
AAC | MDCT, Hybrid Subband (AAC-HE) | 8–192 kHz[1] | 8–529 kbit/s (stereo) | 20–405 ms[2] | Yes | Yes | Yes: Dual, Mid/Side, Intensity, Parametric | Yes: Up to 48 channels |
AC3 | MDCT | 32, 44.1, 48 kHz | 32–640 kbit/s | 32–48 ms | Yes | In theory, it supports VBR, but no high-quality encoder supports that feature. Most high-quality encoders only do CBR encoding | Yes | Yes: Up to 6 channels |
AC4 | MDCT | 48 kHz | 24-1536 kbit/s | ? | Yes | No | Yes | Yes: Up to 24 channels |
AMBE | ? | 8 kHz | 2, 9.6 kbit/s | ? | ? | ? | ? | ? |
AMR | ACELP | 8 kHz | 4.75, 5.15, 5.90, 6.70, 7.40, 7.95, 10.20, 12.20 kbit/s | 25 ms | Yes | Yes | No | No |
AMR-WB (G.722.2) |
ACELP | 16 kHz | 6.60, 8.85, 12.65, 14.25, 15.85, 18.25, 19.85, 23.05, 23.85 kbit/s | 25 ms | Yes | Yes | Yes: only in MPEG-4 Part 12 container | No |
AMR-WB+ | ACELP | 8, 11.025, 16, 22.05, 32, 44.1, 48 kHz | 6–36 kbit/s (mono) 7–48 kbit/s (stereo) |
60–90 ms | Yes | Yes | Yes | No |
aptX | Subband ADPCM | 24–48 kHz (stereo) | 192–384 kbit/s (stereo) | 2 ms | Yes (4:1) | No | Yes | Yes: Up to 8 channels |
ATRAC1 | MDCT-Hybrid Subband | 44.1 kHz | 292 kbit/s | >100 ms | Yes | No | Yes: Dual Only | No |
ATRAC3 | MDCT-Hybrid Subband | 44.1 kHz | 66, 105, 132, 146, 176, 264, 352 kbit/s | >100 ms | Yes | No | Yes: Dual (LP2), Mid/Side (LP4) | No |
ATRAC3plus | MDCT-Hybrid Subband | 44.1, 48 kHz | 32–768 kbit/s | >100 ms | Yes | No | Yes | Yes: Up to 8 channels |
ATRAC9 | MDCT-Hybrid Subband | 12, 24, 48 kHz | 36-672 kbit/s | >100 ms | Yes | No | Yes | Yes: Up to 8 channels |
BroadVoice (BV16, BV32) | Two-Stage Noise Feedback Coding (TSNFC) | 8, 16 kHz | 16, 32 kbit/s[3] | 5 ms | Yes | No | No | No |
Codec2 | Speech | 8 kHz | 0.7, 1.2, 1.3, 1.4, 1.6, 2.4, 3.2 kbit/s | 20–40 ms | Yes | No | No | No |
E-AC3 | MDCT | 32, 44.1, 48 kHz | 32–6144 kbit/s | 5.33–48 ms | Yes | No | Yes | Yes: Up to 15 channels |
E-aptX | Subband ADPCM | 15–48 kHz | 60, 384, 767, 1024 kbit/s, 1.28 Mbit/s | ? | Yes (4:1) | No | Yes | Yes: Up to 8 channels |
Encodec | Neural networks | 24 kHz (Mono)
48 kHz (Stereo) |
1.5 (Mono only), 3, 6, 12, 24 kbit/s | ? | Yes | No | Yes | No |
EVS | Speech | 8, 16, 32, 48 kHz | 5.9, 7.2, 8, 9.6, 13.2, 16.4, 24.4, 32, 48, 64, 96, 128 kbit/s + AMR-WB bitrate modes | 20 ms | Yes | Only 5.9 kbit/s with DTX on | Yes | No |
GSM-HR | VSELP | 8 kHz | 5.6 kbit/s | 25 ms | Yes | No | No | No |
GSM-FR | RPE-LTP | 8 kHz | 13 kbit/s | 20–30 ms | Yes | No | No | No |
GSM-EFR | ACELP | 8 kHz | 12.2 kbit/s | 20–30 ms | Yes | No | No | No |
HE-AAC | ? | 22 ~ 96 kHz | 16 ~ 80 kbit/s | ~130 ms[4] | Yes | Yes | Yes: Dual, Mid/Side, Intensity, Parametric | Yes: Up to 48 channels |
HVXC | Speech | 8 kHz | 2, 4 kbit/s | 36 ms | Yes | Yes | No | No |
iLBC | Block Independent LPC | 8 kHz | 13.33, 15.20 kbit/s | 25, 40 ms | Yes | No | No | No |
iSAC | Transform coding | 16, 32 kHz | 10–52 kbit/s | 33, 63 ms | Yes | Yes | No | No |
LC3 | Speech | 8, 16, 24, 32, 48 kHz | 16-320 kbit/s | 7.5-10 ms | Yes | No | Yes | No |
LDAC | Subband ADPCM | 44.1, 48, 88.2, 96, 176.4, 192 kHz | 303, 606, 909 kbit/s (44.1/88.2/176.4 kHz)
330, 660, 990 kbit/s (48/96/192 kHz) |
? | Yes | No | Yes | No |
LHDC | Subband ADPCM | 44.1, 48, 96 kHz | 400, 560, 900 kbit/s | ? | Yes | No | Yes | No |
L2HC | Subband ADPCM | 44.1, 48, 96, 192 kHz | 400, 560, 960, 1920 kbit/s | ? | Yes | No | Yes | ? |
Lyra | Speech | 8, 16, 32, 48 kHz | 3.2, 6, 9.2 kbit/s | 20 ms | Yes | No | No | No |
MP3 (MPEG-1, 2, 2.5 Audio Layer III) | MDCT, Hybrid Subband | 8, 11.025, 12, 16, 22.05, 24, 32, 44.1, 48 kHz | 8, 16, 24, 32, 40, 48, 56, 64, 80, 96, 112, 128, 144, 160, 192, 224, 256, 288, 320 kbit/s | >100 ms | Yes | Yes | Yes: Dual, Mid/Side, Intensity | No |
MPEG-1 Audio Layer II (MP2) | Subband | 32, 44.1, 48 kHz | 32, 48, 56, 64, 80, 96, 112, 128, 160, 192, 224, 256, 320, 384 kbit/s | Yes | Yes: but decoders are not required to support it | Yes: Dual, Intensity | No | |
MPEG-2 Audio Layer II (MP2) | Subband | 16, 22.05, 24 kHz[5][6] | 8, 16, 24, 32, 40, 48, 56, 64, 80, 96, 112, 128, 144, 160 kbit/s[5] | Yes | Yes: but decoders are not required to support it | Yes | Yes: up to 5 full range audio channels and an LFE-channel with MPEG Multichannel | |
Musepack | Subband | 32, 37.8, 44.1, 48 kHz | 20–350 kbit/s | ? | No | Yes | Yes | Yes: Up to 8 channels |
Opus | MDCT, LPC, LTP | 8–48 kHz | 6–510 kbit/s | 5–66.5 ms | Yes | Yes | Yes | Yes: Up to 255 channels[7] |
RealAudio | MDCT | Varies (see article) | Varies (see article) | Varies | Yes | Yes | Yes | Yes: Up to 6 channels |
Satin | Speech | 8, 16, 32, 48 kHz | 6-36 kbit/s | ? | Yes | No | Yes | No |
SILK | LPC, LTP | 8, 12, 16, 24 kHz | 6–40 kbit/s | 25 ms | Yes | ? | ? | ? |
Siren 7 | Derived from PT716plus, MLT | 16 kHz | 16, 24, 32 kbit/s | 40 ms | Yes | No | No | No |
Siren 14 | MLT | 32 kHz | 24, 32, 48 kbit/s (mono) 48, 64, 96 kbit/s (stereo) |
40 ms | Yes | No | Yes | No |
Siren 22 | MLT | 48 kHz | 32, 48, 64 kbit/s (mono) 64, 96, 128 kbit/s (stereo) |
40 ms | Yes | No | Yes | No |
SNAC | Neural networks | 24, 32, 44.1 kHz | 0.98 (24 kHz), 1.9 (32 kHz), 2.6 (44.1 kHz) kbit/s | ? | Yes | No | No | No |
Speex | CELP | 8, 16, 32, (48) kHz | 2.15–24.6 kbit/s (NB) 4–44.2 kbit/s (WB) |
30 ms (NB) 34 ms (WB) |
Yes | Yes | Yes: Intensity | Yes |
TSAC | Neural networks, modified version of Descript Audio Codec, extended for stereo with a transformer model to shrink even more while keeping the quality high. | 44.1 kHz (Mono and Stereo). Other sample rates: Uncertain. | ~0.45-5.5 kbps (mono)
~0.6-7.5 kbps (stereo) |
? | Yes | No | Yes | No |
USAC | ? | 7.35, 8, 11.025, 12, 16, 19.2, 22.05, 24, 32, 38.4, 44.1, 48, 57.6, 64, 88.2, 96 kHz | 6-128 kbit/s (mono)
12-320 kbit/s (stereo) |
? | Yes | Yes | Yes | No |
VMR-WB | ACELP | 16 kHz | 8.55, 4.0, 0.8, 13.3, 6.2, 2.7, 1.0 kbit/s | 33.75 ms | Yes | Yes | ? | ? |
Vorbis (Ogg) | MDCT | 8–192 kHz | 45-500 kbit/s (32-500 kbit/s for aoTuV tunings) | >100 ms | Yes/ABR | Yes | Yes: Dual, Lossless, Phase, Point (Intensity) | Yes: Up to 255 channels |
WavPack Lossy | Prediction, Quantization | 1 Hz to 16.777216 MHz | 196 kbit/s and up in lossy mode (for CD audio) | 3523.8 ms | Yes | Yes | Yes | Yes: Up to 256 channels |
Windows Media Audio Standard | MDCT | 8, 11.025, 16, 22.05, 32, 44.1, 48 kHz | 8–768 kbit/s | >100 ms | Yes | Yes | Yes | It can be hacked to support 5.1/7.1 support, but officially it doesn't support multi-channel |
Windows Media Audio Pro | MDCT | 8, 11.025, 16, 22.05, 32, 44.1, 48, 88.2, 96 kHz (8-22.05kHz not avabile in encoders that uses DirectShow like Switch by NCH Software or Windows Media Encoder) | 4–768 kbit/s | >100 ms | Yes | Yes | Yes | Yes: At least 8 channels, expandable |
Windows Media Audio Voice | ACELP? | 8, 11.025, 16, 22.05 kHz officially (can be hacked to support higher sample rates) | 4-20 kbit/s officially (can be hacked to support higher bitrates) | ? | Yes | No | It can be hacked to support stereo support, but officially it doesn't support stereo | It can be hacked to support 5.1/7.1 support, but officially it doesn't support multi-channel |
Audio compression format | Algorithm | Sample rate | Bits per sample | Latency | Stereo | Multichannel |
---|---|---|---|---|---|---|
ALAC | Lossless | 1–384 kHz | 16, 20, 24, 32[8] | ? | Yes | Yes: Up to 8 channels |
ALS | Lossless | 0–4 GHz | 8, 16, 24, 32 (int or float) | ? | Yes | Yes: Up to 65536 channels |
ATRAC Advanced Lossless | Lossless | 44.1 kHz | 16 | ? | Yes | No |
Dolby TrueHD | Lossless | 48, 96, 192 kHz | 16, 20, 24 | ? | Yes | Yes: 5.1, 7.1, and Dolby Atmos object-based audio[9] |
DTS-HD Master Audio | Lossless | 48, 96, 192 kHz | 16, 24 | ? | Yes | Yes: Up to 7.1[10] |
FLAC | Lossless | 1–1.048575 MHz (>655.350 kHz need not be streamable)[11] | 4–32 | 4.3–92 ms (46.4 ms typical) | Yes | Yes: Up to 8 channels |
HD-AAC | ? | ? | ? | ? | ? | ? |
LA | Lossless | 0–4 GiHz | 16 | ? | Yes | No |
L2HC | Lossless | 44.1kHz | 16 | ? | Yes | No |
Monkey's Audio | Lossless | 1–655.350 kHz[citation needed] | 8, 16, 24, 32 and 32-bit float | 1670 to 26,749 ms (varies with compression) | Yes | Yes: Up to 32 channels since version 5.50[12] |
OptimFROG | Lossless | 0–4 GiHz[13] | 8, 16, 24, 32 (int or float) | ? | Yes | No |
OSQ | Lossless | 6, 8, 11.025, 16, 22.05, 32, 44.1, 48, 64, 88.2, 96 kHz | 8, 16, 20, 24 | ? | Yes | No |
RealAudio Lossless | Lossless | 8, 11.025, 16, 22.05, 32, 44.1 kHz | 16 | Varies | Yes | No |
Sac | Lossless | 1–48 kHz | 16 | ? | Yes | No |
TAK | Lossless | 8–192 kHz | 8, 16, 24 | ? | Yes | Yes: Up to 6 channels |
True Audio (TTA) | Lossless | 0–4 GiHz | 8, 16, 24 | approx. 1045 ms[14] | Yes | Yes: Up to 16 (with ffmpeg), but no channel allocation |
WavPack Lossless | Lossless, Hybrid | 1 Hz to 1 GiHz | Up to 32 (and 32-bit float), and 1-bit DSD. | ? | Yes | Yes: Up to 256 channels |
Windows Media Audio Lossless | Lossless | 8, 11.025, 16, 22.05, 32, 44.1, 48, 88.2, 96 kHz | 16, 24 | >100 ms | Yes | Yes: Up to 6 channels |
Audio compression format | Algorithm | Sample rate | Bit rate | Bits per sample | Latency | CBR | VBR | Stereo | Multichannel |
---|---|---|---|---|---|---|---|---|---|
G.711 | companding A-law or μ-law, PCM | 8 kHz | 64 kbit/s | 8 bit (log) | 125 μs (typical) | Yes | No | No | No |
G.711.0 | Lossless compression of G.711 | 8 kHz | 0.2–65.6 kbit/s | 8 bit (log) | 5–40 ms | No | Yes | No | No |
G.711.1 | MDCT, A-law, μ-law | 8, 16 kHz | 64, 80, 96 kbit/s | 16 bit | 11.875 ms | Yes | Yes | No | No |
G.718 | CELP, MDCT, Lossy | 8, 16 kHz | 8, 12, 12.65, 16, 24, 32 kbit/s | 16 bit | 42.875–43.875 ms | Yes | Yes | No | No |
G.718B | CELP, MDCT, Lossy, Sinusoidal Coding | 32 kHz | 36, 40, 48 kbit/s | 16 bit | 49.625 ms | Yes | Yes | No | No |
G.719 | MDCT,[15] Lossy (incorporates elements of Siren Codec and Ericsson technology) | 48 kHz | 32–88 kbit/s in 4 kbit/s steps, 88–128 kbit/s in 8 kbit/s steps | 16 bit | 40 ms | Yes | No | Yes: only in MPEG-4 Part 12 container | Yes: only in MPEG-4 Part 12 container |
G.721 | ADPCM, Lossy | 8 kHz | 32 kbit/s | 13 bit | Yes | No | No | No | |
G.722 | sub-band ADPCM, Lossy | 16 kHz | 64 kbit/s (comprises 48, 56 or 64 kbit/s audio and 16, 8 or 0 kbit/s auxiliary data) |
14 bit | 4 ms | Yes | No | No | No |
G.722.1 | Modulated Lapped Transform (MDCT), Lossy (based on Siren Codec) | 16 kHz | 24, 32 kbit/s | 16 bit | 40 ms | Yes | No | No | No |
G.722.1C | Modulated Lapped Transform (MDCT), Lossy (based on Siren Codec) | 32 kHz | 24, 32, 48 kbit/s | 16 bit | 40 ms | Yes | No | No | No |
G.722.2 (AMR-WB) | multi-rate wideband ACELP, Lossy | 16 kHz | 6.60, 8.85, 12.65, 14.25, 15.85, 18.25, 19.85, 23.05, 23.85 kbit/s | 14 bit | 25 ms | Yes | Yes | Yes: only in MPEG-4 Part 12 container | No |
G.723 | ADPCM, Lossy | 8 kHz | 24, 40 kbit/s | 13 bit | Yes | No | No | No | |
G.723.1 | MP-MLQ, ACELP, Lossy | 8 kHz | 5.3, 6.3 kbit/s | 13 bit | 37.5 ms | Yes | No | No | No |
G.726 | ADPCM, Lossy | 8 kHz | 16, 24, 32, 40 kbit/s | 13 bit | 125 μs | Yes | No | No | No |
G.727 | ADPCM, Lossy | 8 kHz | 16, 24, 32, 40 kbit/s | 13 bit | Yes | Yes | No | No | |
G.728 | low-delay CELP, Lossy | 8 kHz | 16 kbit/s | 13 bit | 0.625 ms | Yes | No | No | No |
G.729 | CS-ACELP, Lossy | 8 kHz | 8 kbit/s | 13 bit | 15 ms | Yes | No | No | No |
G.729D | CS-ACELP, Lossy | 8 kHz | 6.4 kbit/s | 13 bit | Yes | No | No | No | |
G.729E | CS-ACELP, Lossy | 8 kHz | 11.8 kbit/s | 13 bit | 15 ms | Yes | No | No | No |
G.729.1 | CELP, TDBWE, TDAC (MDCT), Lossy | 8, 16 kHz | 8 kbit/s, 12–32 kbit/s in 2 kbit/s steps | 16 bit | 48.9375 ms | Yes | Yes | No | No |
Notes
- The latency listed here is the total delay (frame size, plus all lookahead) at the normal operating sample rate (typically 44.1 kHz).
- Lossless compression will have a variable bit rate.
- ^ MPEG 2 AAC was limited to a 96 kHz sampling rate, however, with MPEG 4 AAC, a later version part of the MPEG 4 specification, the maximum sampling rate has been increased to 192 kHz.
- ^ "Algorithmic delay and synchronization in MPEG audio codecs - EE Times". EETimes. Retrieved 6 May 2017.
- ^ "Support Documents and Downloads". www.broadcom.com. Retrieved 6 May 2017.
- ^ Schnell, Markus; Schmidt, Markus; Jander, Manuel; Albert, Tobias; Geiger, Ralf; Ruoppila, Vesa; Ekstrand, Per; Bernhard, Grill (2008-10-01). "MPEG-4 Enhanced Low Delay AAC - A New Standard for High Quality Communication". Audio Engineering Society. Available from fraunhofer.de
- ^ a b Predrag Supurovic (September 1998). "MPEG Audio Frame Header". Retrieved 2009-07-11.
- ^ ISO/IEC JTC1/SC29/WG11 - Audio Subgroup. "MPEG Audio FAQ Version 9, MPEG-1 and MPEG-2 BC". Retrieved 2010-02-27.
{{cite web}}
: CS1 maint: numeric names: authors list (link) - ^ "Opus Codec". Retrieved 2014-07-02.
- ^ "macosforge/alac". GitHub. Retrieved 6 May 2017.
- ^ "Dolby Atmos for Home Theaters". Dolby.com. Archived from the original on 20 May 2018. Retrieved 4 April 2018.
- ^ "DTS-HD and DTS-HD Master Audio: Enabling HD Audio Across an Evolving Media Delivery Landscape", DTS, Inc., October 2012
- ^ sample rates from 1 up to 1048575 hertz and bit depths from 4 up to 32 bits [1]
- ^ "Monkey's Audio - a fast and powerful lossless audio compressor".
- ^ 4 bytes - sample rate
- ^ "Tau Projects". en.true-audio.com. Archived from the original on 27 October 2015. Retrieved 6 May 2017.
- ^ Britanak, Vladimir; Rao, K. R. (2017). Cosine-/Sine-Modulated Filter Banks: General Properties, Fast Algorithms and Integer Approximations. Springer. pp. 31, 478. ISBN 9783319610801.