Your cart is currently empty!

ISO 23003:2020
ISO 23003:2020 Information technology – MPEG audio technologies – Part 3: Unified speech and audio coding
CDN $390.00
Description
This document specifies a unified speech and audio codec which is capable of coding signals having an arbitrary mix of speech and audio content. The codec has a performance comparable to, or better than, the best known coding technology that might be tailored specifically to coding of either speech or general audio content. The codec supports single and multi-channel coding at high bitrates and provides perceptually transparent quality. At the same time, it enables very efficient coding at very low bitrates while retaining the full audio bandwidth.
This document incorporates several perceptually-based compression techniques developed in previous MPEG standards: perceptually shaped quantization noise, parametric coding of the upper spectrum region and parametric coding of the stereo sound stage. However, it combines these well-known perceptual techniques with a source coding technique: a model of sound production, specifically that of human speech.
Edition
2
Published Date
2020-06-24
Status
PUBLISHED
Pages
339
Format 
Secure PDF
Secure – PDF details
- Save your file locally or view it via a web viewer
- Viewing permissions are restricted exclusively to the purchaser
- Device limits - 3
- Printing – Enabled only to print (1) copy
See more about our Environmental Commitment

Abstract
This document specifies a unified speech and audio codec which is capable of coding signals having an arbitrary mix of speech and audio content. The codec has a performance comparable to, or better than, the best known coding technology that might be tailored specifically to coding of either speech or general audio content. The codec supports single and multi-channel coding at high bitrates and provides perceptually transparent quality. At the same time, it enables very efficient coding at very low bitrates while retaining the full audio bandwidth.
This document incorporates several perceptually-based compression techniques developed in previous MPEG standards: perceptually shaped quantization noise, parametric coding of the upper spectrum region and parametric coding of the stereo sound stage. However, it combines these well-known perceptual techniques with a source coding technique: a model of sound production, specifically that of human speech.
Previous Editions
Can’t find what you are looking for?
Please contact us at:
Related Documents
-
ISO 15424:2025 Information technology – Automatic identification and data capture techniques – Data carrier identifiers (including symbology identifiers)
0 out of 5CDN $233.00 Add to cart -
ISO 80004:2020 Nanotechnologies – Vocabulary – Part 3: Carbon nano-objects
0 out of 5CDN $76.00 Add to cart -
ISO 2382:2015 Information technology – Vocabulary
0 out of 5CDN $0.00 Add to cart -
ISO 16840:2006 Wheelchair seating – Part 1: Vocabulary, reference axis convention and measures for body segments, posture and postural support surfaces
0 out of 5CDN $351.00 Add to cart