Researching file formats 24: Unified Speech and Audio Coding

09 Feb 2024

This blog post is part of a series on file formats research. See this introduction post for more information.

Update: The official format definition is now online here: Unified Speech and Audio Coding. Comments welcome directly to the Library of Congress.

Okay, this format was hard! First, the format is standardized via ISO/IEC, which means it’s expensive. Next, the specification is extremely long and technical, with lots of math. And this is my area of expertise! It was still challenging to work through. Fortunately, there was a hype train as part of getting this codec standardized and encouraging adoption (something I’m familiar with re: the IETF CELLAR group), so there were a lot of high-level summaries out there that made this easier to translate into a generalist audience.

This codec was made to improve upon past work where encoding was stronger at human speech or music, but not necessarily both. It’s also just considering itself the next step in the progression of audio encoding technology. It’s built on the work of MPEG-4 and AAC.

This article, “MPEG Unified Speech and Audio Coding”, does a great job of summarizing the benefits of this codec.

I suppose this will be a short blog post this week – this took a lot of effort to dig through all the technical details, but I don’t have anything particularly profound to note, and I don’t feel like going on a rant about how great free and open standards are right now. Until next week!

Ashley Blewer

Researching file formats 24: Unified Speech and Audio Coding