How do you authenticate a sound recording?

Answers to your questions about the news.
Nov. 15 2002 4:57 PM

Al Jazeera television aired an audio tape this week it claimed was recorded by Osama Bin Laden. Administration sources told MSNBC Wednesday that they believe the tape is authentic. How do you authenticate a sound recording?

The feds have audio recordings analyzed by both human experts and machines. Human analysts are very good at doing the kind of thing most people do subconsciously—telling if someone comes from a particular region by recognizing basic vowel and consonant qualities. For example, a human analyst can tell whether the "Ye" sound in "Yemen" is of the right length and stress for Bin Laden's dialect. The expert would listen to previous recordings of Bin Laden's voice and painstakingly compare words—syllable by syllable—to those on the current tape. The feds might also bring in a linguist to verify whether the words on the tape generally match those uttered by someone of Bin Laden's age and educational background.

For a machine analysis, the feds use voice-authentication software, which measures acoustic qualities of the voice (pitch, loudness, basic resonances) that a human expert can't estimate. This kind of analysis can produce basic spectrographic information (indicating overall intonation and loudness), or it can look for specific features of the voice, such as whether Bin Laden's voice is a bit on the nasal side. Voice-authentication software is also excellent for cleaning up bad recordings; the latest tape is allegedly very noisy and possibly went down a phone line at some point. Such a system can also tell whether different samples of the voice were recorded on different microphones and in different locations.

Once the recording is cleaner, the software can deconstruct each individual sound. Every person creates the same sounds using a slightly different set of basic pitches. So the set of frequencies in Bin Laden's vowels, like the "ea" in "fear," will be marginally different from anyone else's. By examining this frequency detail for every vowel and comparing it with earlier samples of his voice, a machine analysis can indicate whether the samples match and were likely all made by him. In cases where two examples of a word, like "bombing" and "bombing," sound exactly the same to a human expert, a machine can sometimes pick out frequency differences that indicate the words were spoken by two different people.
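To make the frequency comparison concrete, here is a minimal Python sketch. It is a toy illustration, not the software the analysts use: the "vowels" are synthetic sums of sine waves, and the sample rate, the frequencies, and the function names (synth_vowel, peak_frequencies, max_peak_shift) are all assumptions invented for this example. It finds the strongest frequencies in two short clips and reports how far apart they are.

```python
import math

SAMPLE_RATE = 8000  # Hz; assumed telephone-quality rate, chosen for illustration


def synth_vowel(freqs, n=400):
    """Toy 'vowel': a sum of equal-amplitude sine waves at the given frequencies."""
    return [sum(math.sin(2 * math.pi * f * t / SAMPLE_RATE) for f in freqs)
            for t in range(n)]


def magnitude_spectrum(signal):
    """Naive discrete Fourier transform magnitudes for bins 0..n/2."""
    n = len(signal)
    mags = []
    for k in range(n // 2 + 1):
        re = sum(x * math.cos(2 * math.pi * k * t / n) for t, x in enumerate(signal))
        im = sum(x * math.sin(2 * math.pi * k * t / n) for t, x in enumerate(signal))
        mags.append(math.hypot(re, im))
    return mags


def peak_frequencies(signal, count):
    """Return the `count` strongest frequencies in Hz, sorted ascending."""
    mags = magnitude_spectrum(signal)
    n = len(signal)
    strongest = sorted(range(len(mags)), key=lambda k: mags[k], reverse=True)[:count]
    return sorted(k * SAMPLE_RATE / n for k in strongest)


def max_peak_shift(sig_a, sig_b, count=2):
    """Largest difference in Hz between corresponding spectral peaks."""
    peaks_a = peak_frequencies(sig_a, count)
    peaks_b = peak_frequencies(sig_b, count)
    return max(abs(a - b) for a, b in zip(peaks_a, peaks_b))


# Hypothetical frequencies for the same vowel spoken by two different people.
reference = synth_vowel([300, 2300])
questioned = synth_vowel([340, 2200])
print(peak_frequencies(reference, 2))
print(max_peak_shift(reference, questioned))
```

Real speech is far messier than two clean sine waves, so an actual system would need noise handling and a statistical model of how much one speaker's vowels vary from one recording to the next.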

What if analysts are pretty sure the voice on a tape is Bin Laden's, but want to make sure it hasn't been spliced together from Osama's Greatest Hits? In that case, man and machine would look for tell-tale signs of fraud. The first red flag is any hitch in Bin Laden's timing. It's almost impossible to fake a speaker's rhythm, to make sure every syllable in an utterance matches the overall length and structure of that utterance. So, if the word "Kuwait" were inserted from a previous recording by Bin Laden, it would jar the basic rhythm of the rest of his speech.

Another sign of fakery is background noise. It's quite difficult to remove the original sound context from a voice recording. And even if you could, you'd still have to deal with the fact that speakers unconsciously pitch their voice to accommodate background noise. A giveaway sign might show up in the basic frequencies of one of Bin Laden's "kills" versus another of his "kills." If these pitches were different enough, this would be cause for suspicion.
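The pitch comparison described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the clips are pure synthetic tones standing in for two instances of the same word, the pitch estimator is a simple autocorrelation search (one common textbook approach, not necessarily what forensic software does), and the 20 Hz tolerance is an arbitrary placeholder rather than a forensic standard.

```python
import math

SAMPLE_RATE = 8000  # Hz; assumed for illustration


def tone(freq_hz, n=800):
    """Synthetic stand-in for a voiced clip: a pure sine wave."""
    return [math.sin(2 * math.pi * freq_hz * t / SAMPLE_RATE) for t in range(n)]


def estimate_pitch(signal, low_hz=50, high_hz=400):
    """Estimate the basic pitch by finding the lag with the highest autocorrelation."""
    min_lag = SAMPLE_RATE // high_hz
    max_lag = SAMPLE_RATE // low_hz
    best_lag, best_score = min_lag, float("-inf")
    for lag in range(min_lag, max_lag + 1):
        # Correlate the clip with a copy of itself shifted by `lag` samples.
        score = sum(signal[t] * signal[t + lag] for t in range(len(signal) - lag))
        if score > best_score:
            best_lag, best_score = lag, score
    return SAMPLE_RATE / best_lag


def pitch_mismatch(clip_a, clip_b, tolerance_hz=20.0):
    """Flag two clips whose estimated pitches differ by more than the tolerance."""
    return abs(estimate_pitch(clip_a) - estimate_pitch(clip_b)) > tolerance_hz


# Two hypothetical instances of the same word, spoken at different pitches.
first_kills = tone(125)
second_kills = tone(160)
print(estimate_pitch(first_kills), estimate_pitch(second_kills))
print(pitch_mismatch(first_kills, second_kills))
```

A mismatch like this would be a reason for a closer look, not proof of splicing, since a speaker's pitch also varies naturally within a single recording.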

Together, human and machine can provide formidable testimony in court, but neither type of analysis can say with 100 percent certainty that the speaker on the tape is Bin Laden or anyone else.

Explainer thanks Dr. Francis Nolan of the Linguistics Department, Cambridge University, and Judith Markowitz of J Markowitz Consultants and Speech Technology Magazine in Chicago.
