Posts Tagged ‘Audio Authentication’

Audio Authentication and Visual Inspection

Tuesday, April 21st, 2015

visual inspectionAudio Authentication and Visual Inspection

Sound waves can tell us a lot about a recording. Like metadata, the visual elements of a sound wave can expose characteristics of an audio recording without even having to listen to it. These characteristics can be important, especially when it comes to detecting edits within audio evidence. The process of observing these characteristics is called visual inspection. This is a part of audio authentication process.

Visual inspection (a general term that comprises a variety of forensic tests like narrow band spectrum analysis) is a crucial part of an Audio Forensic Expert’s job. To understand how crucial visual inspection really is, it’s important to understand the concept and value of the noise floor.

The noise floor (usually unwanted sound) of a recording is the present background noise and overall “ambience” of a recording. For example, if you’re recording yourself speaking on the street in New York City, and you’re speaking into a microphone while standing in one place, the sound of the cars going by, the conversations happening around you, and the overall city noise (unwanted sound) will contribute to the noise floor.

If you’re standing in one spot recording that audio, the noise floor will never change, because the environment your audio device is picking up will stay consistent the entire time. The second that noise floor is altered;you know you have an edit.

There are many ways to examine this. One of the most reliable ways to observe this noise floor is what’s known as a spectrogram. The spectrogram is meant to read the spectrum of an audio recording. To put it simply, a spectrogram takes the contents of an audio recording and conforms the characteristics to blends of color that represent the spectrum of an audio recording in Hz. You can see that below.

Now, because the noise floor of a recording never changes, you can tell when you have an edit when the spectrogram shows a change in, or absence of, color. The noise floor will always stay consistent, so when there’s a short drastic change such as the one pictured below, you know you have an edit. This makes the recording inauthentic.

Spectrogram edit circled

Surely there are other ways to visually detect edits. Even the sound wave itself can expose an edit.

All sound waves should be smooth and continuous. Even if someone were to loudly clap during an audio recording, the sound wave will still remain smooth and continuous. When you see gaps, or a wave that is not smooth and continuous with another piece of the audio file, you know you have an edit.

Though a critical ear is generally considered the most important part of Audio Forensics, a good eye for edits in visual inspection can teach you a substantial amount about the evidence you’re working with before even taking the time to listen to it. Visual inspection really comes in handy when trying to determine the authenticity of a piece of audio evidence and to make sure a proper chain of custody was kept throughout the distribution of audio evidence.

Audio Forensics: An Accurate, Arguable and Authentic Approach to Understanding Audio Evidence

Tuesday, June 4th, 2013

audio forensicsBell Labs was the first to discover that spoken word patterns and sounds could be identified and characteristics examined to identify the individual who made them. This has been a very important advancement in forensic science because the potential to assist law enforcement is well worth the effort it takes to defend the proponents and practitioners. Audio forensics is sometimes referred to by some as a ”junk science.” After over 25 years of examining, editing and clarifying audio recordings, I can attest to and scientifically prove that voice identification and audio authentication comprise an exacting science that has huge benefit to the courts, law enforcement agencies and businesses.

In the following article, I will describe what works and does not work for two of the main activities of audio forensic experts: voice identification and audio authentication. I will also review and break down the steps and processes I employ and explain why I believe audio forensics is a valuable tool in litigation.

I have been retained for dozens of court cases, as well as by corporations, to analyze and help explain various aspects of audio evidence in one form or another. Some situations required that I find the truth about the source of a threatening voice, like a bomb threat called into 911 or a sexually harassing voicemail left on a victim’s phone.

Other cases involved defendants trying to validate or disqualify a pre-recorded audio confession. Evidentiary audio recordings all have one thing in common: they needed an experienced audio forensic expert to review and either qualify (validate) or disqualify the evidence. My job as an audio forensic expert is to determine the recording’s authenticity or to identify the person’s voice.

Voice Identification Overview

I have been practicing voice identification for over 25 years. Many of my skills and principles have been learned from employment as an audio engineer. Other skills I have learned through reading and studying to develop skills and completing successful cases successfully. I believe people’s voices, just like fingerprints, can be identified through visual inspection of sound waves and spectrum analysis, as well as through critical listening skills. I have conducted voice identification for sexual harassment, workers compensation and employment harassment, as well as various threatening voicemail messages like bomb threats.

In our country today, we are guilty until proven innocent, the opposite of what our United States Constitution promises. It is my job to determine the truth about voice recordings using visual, electronic and auditory inspection of, both the evidence recording and an exemplar (voice sample taken for the purpose of comparison).

A typical case I would review might involve a telephoned bomb threat or harassing call that was recorded on audiotape or digital voicemail. After the police arrested a suspect, I would be retained by either the state (court) or defense to determine the truth about that audio recording.

The first step is to examine the original evidence and learn as much about the recording as possible. How was it created? Who created it? What machinery was involved?

Then, with the help of the court or defense lawyer, I create an exemplar of the accused voice to compare visual, electronic and auditory characteristics.

Almost every legal case I have been engaged in has allowed my report and or testimony into evidentiary status to aid with ”due process.” I believe my success rate is high due to the fact that I employ the three testing platforms outlined above.

Steady advances in computer technology have had a huge impact on audio forensic voice identification. Having experience as an acoustic engineer who has listened to literally hundreds of hours of spoken word recordings, in addition to sophisticated electronic software programs, has contributed to my success with voice identification.

One case I examined involved a bomb threat. Bomb threats make up a fairly large segment of voice identification activity. The call in question was made from a pay phone outside of a convenience store to a 911 operator. This was scientifically evident when police traced the call.

The caller identified herself by name as an employee of XYZ Company. When the police arrived at XYZ Company, they found the employee with the name the caller gave the 911 operator and arrested her. The employee denied making the call.

She was charged with making a bomb threat call, guilty until proven innocent. I was retained by the defense to prove that our client did not make the bomb threat call.

Voice Identification Procedure

When comparing spoken word samples for the purpose of identification, I base my processes on historical information I have learned from the scientific community, state police crime labs, other forensic experts and designers and developers of electronic (especially computer) equipment and testing software programs. My process requires the visual, electronic, and auditory examination of every aspect of the words spoken, not just the pathological examination. The words themselves, the way the words flow together, the pauses between the words, the way the words are formed by the mouth and larynx can be measured using three processes. The first process is a visual examination of the sound wave, comparing the evidence and an exemplar (a voice sample of the accused). The second process is an electronic measurement of the evidence, which is then compared to the exemplar. The third process is perhaps the most important: critical listening skills that compare the evidence and the exemplar of how the words are spoken and pronounced. Noise floor and electronic measurement of speech and other audible sounds in the recording must also be considered and measured. Forensic procedure requires careful examination of all audio evidence characteristics, following procedures as outlined by the scientific community.

These scientific procedures begin with the analysis of the quality of the audio recording. It is important to establish that the quality of the recording in question is acceptable and workable. Sometimes, it may be necessary for an audio forensic expert to apply some light equalization or other non-destructive audio processing to reduce or remove background noise that may interfere with the forensic examination.

Voice identification requires the forensic examiner to discover similarities, as well as differences, in all three areas of investigation.

Here are the step-by-step processes I use when conducting voice identification:

1. Visual examination of the original recording, analogue or digital. This includes examination of the physical characteristics of the tape itself (if analogue) or analogue or digital recorder. It is important to examine the cassette tape (standard, mini or micro) or other analogue or digital source to determine if there are visual signs of tampering or alteration.

2. Once the physical evidence has been examined, the next step is to load the recording in question into a forensic computer. Visual examination of the sound wave, sonogram and spectrograph reveal speech characteristics and patterns of verbal delivery as well as electronic characteristics. At this point, the recording has been digitized so forensic software can analyze and conduct various tests.

3. If possible, for authentication or voice identification, an exemplar or comparison recording should be made of the original recording to compare the original recording characteristics. This same forensic examination process that is applied to the evidence is also applied to the exemplar to determine that the characteristics are the same and the recording is from the same audio recorder.

4. When conducting voice identification, it is important to create an exemplar of the accused for audio comparison using as exact conditions and equipment as close as possible to the measurements taken from the evidence as outlined above. The speech must be the same as the speech on the evidence in order for the testing to be accurate. As an audio forensic expert, I often have to coach the accused into the same energetic voice tone and inflection as the evidence recording. However, it is still possible to compare speech if the exemplar is not as close to the evidence as I would like.

5. Critical listening skills are used to examine the speech pattern, pronunciation, voice tone and inflection, accent, dialect and specific speech characteristics (like a lisp or significant ”s” delivery). There is a rhythm in how an individual speaks, and even if s/he is trying to disguise his/her speech (in an attempt to fool the forensic examiner), the rhythm and speech patterns as described above still show through. The expert must pay careful attention to the rhythm of spoken word formations. I listen to single words as well as phrases and sentences. I like to compare original evidence sections of spoken word recordings as well as individual words. This is best accomplished by editing exemplars and original recordings back to back. It is extremely helpful to then make these sub files of words and sentences within the section back to back with exemplars. I repeat the assembly over and over to accommodate critical listening skills with the auditory identification process. That way, your ear can experience the sounds, vowel formations and consonants without interruption.

There are many character traits that can be experienced in a spoken word recording. It is important for the audio forensic expert to become familiar with the evidence speech patterns and visual and electronic characteristics. These characteristics are evident in a person’s voice even if he or she attempts to disguise it and they are compared to the exemplar.

Audio Authentication

Using many of the same tools as described above, audio authentication can help determine the validity of audio evidence that is being considered as evidence in litigation.

When authenticating an audio recording, it is important that the audio forensic expert pay careful attention to tone consistency of the audio recorded signal (speech) as well as the recording’s noise floor.

The consistent audio-recorded signal is important because audio recordings that are not authentic are most always edited or fabricated assemblies of two or more audio recordings for the purpose to deceive the person(s) listening to the recording. Using the tools described above, the audio forensic expert can measure the tone consistency to determine authenticity.

Those same tools can also measure the noise floor looking for inconsistencies in the room tone or background noise of the recording. These breaks or changes in either audio recorded signal or background noise are signs that the audio recording being considered may be counterfeit or fake.

Critical Listening Skills

I have been working with professional speakers and analyzing other spoken word recordings since 1980 and have developed my critical listening skills to a degree that far exceeds the average person’s sound perception. When I first hear audio evidence and add exemplar recordings so I can listen to both back to back, then I apply my critical listening skills to determine the speech similarities as well differences between the two.

In my early days as an audio engineer, I learned to edit reel to reel tape with razor blades to make a recording sound as if it were recorded start to finish without a single mistake. Some of my edits were pretty tricky. I got so good I could split words in two and even three edits to fix a problem or shorten a script. After a while, I became very familiar with speech characteristics and patterns as well as vocal tone and pronunciation.

The best way to become skilled in voice identification is to listen to hundreds of hours of forensic evidence to become familiar with the various speech pathological characteristics and develop critical listening skills.

There can sometimes be differences in speech patterns that can help identify clues. Listen for several similarities as well as differences, such as nasal resonance differences and voice tone with regard to inflection.

Voice Identification Conclusions

When conducting the examination, the audio forensic expert must look for similarities as well as differences in all three testing platforms to help arrive at a conclusion.

After the investigation and testing procedures are complete, the forensic experts report must arrive at one of the following conclusions: positive identification, probable identification, positive elimination, possible elimination or inconclusive.

The key to successful voice identification is to develop a methodology and standard procedure that you strictly follow every time you conduct an identification and comparison.

Audio Authentication Conclusion

Every tone change in either the audio recorded signal or background noise must be documented and analyzed as a whole before considering the recording genuine or authentic. All forensic concerns must be documented and listed in the forensic report to prove the audio forensics findings.

The Audio Forensic Report

It is my belief that the audio forensic report should include:

1. The introduction: What the expert was asked to do and how the expert arrived at their conclusion, including all scientific fact.

2. The testing processes you employed to ex- amine the audio evidence.

3. The expert’s conclusion of the tests, includ- ing the expert’s opinion as to the relevant facts and concerns.

4. The expert’s curriculum vita (resume) to establish credibility as an audio forensic expert, and to accommodate the Federal Court’s protocol for submitting an expert report.

5. A published article authored by the expert concerning the kind of testing relevant to the current case.

For more information contact Ed Primeau at 800-647- 4281 or by email

photo credit: 18v via photopin (license)

The Noise Floor: A Forensic Aid for Audio Authentication and Voice Identification

Tuesday, October 19th, 2010

noise floorAudio authentication and voice identification requires that a forensic expert examine three critical aspects of an audio recording before beginning any forensic process. Whether it be analogue or digital audio recording, an audio forensic expert should inspect the consistent characteristics of the sound wave formations; listen critically to various tones present in the recording, background noise (noise floor) of the audio recording; and examine the electronic spectrograph measurement. These three critical aspects of an audio recording must be consistent throughout the recording to determine authenticity.

An audio forensic expert has been trained by examining hundreds, if not thousands, of hours of audio recordings. This experience helps the forensic examiner to develop a critical listening skill far more precise than the average person’s. That keen sense of sound perception is very important for audio authentication and voice identification.

During the examination process, regardless of analogue or digital audio examination, it is advantageous that the original recording, and recorder, as well as other recording equipment (wireless transmitter, microphone) also be examined. That way, the forensic examiner can recreate the characteristics of the audio recording including signatures (stop-start) and noise floor.

The noise floor is a critical aspect in audio authentication as well as audio identification because it provides the forensic examiner a second dimension of sound to examine and authenticate other than the main recorded signals (speech, gunshot and voice mail).

Alterations in an audio recording, analogue or digital, most likely will be first detected by a change in the noise floor of the audio recording followed by an anomaly that can be heard auditorially, measured electronically and viewed on the computer screen by examining the wave form.

Part of this noise floor is the background noise of the recording. It is the sounds present on the audio recording that the author had not necessarily intended to have recorded but is still part of the recording that is helpful to a forensic examiner.

Both analogue and digital audio recordings have background ambient noise, the noise floor, when the speech or other audio recorded is not present. This background noise speaks volumes on whether the audio recording being examined is original, authentic or has been altered or edited in addition to the examination outcome of the main recorded signals.

For more on Voice Identification, check out Ed Primeau’s latest book, “That’s Not My Voice!” available on Amazon.

Clarification of Audio Recordings for Authentication

Thursday, October 14th, 2010

digital audio recordingAll recordings–both digital and analogue–have a noise floor. The term originated when manufacturers of analogue audio recorders referred to the extraneous noise that their machine created in addition to the desired recorded audio signal.

Often a background noise constitutes most of the audio recording and covers a portion of speech that needs to be audible in order to determine a series of events pertinent to the case. These noises can often be removed by the audio forensic expert to help determine facts about the series of recorded events.

Background noise and noise floor extraneous sound can consist of a heating or air conditioning fan running, refrigerator motor, window fan, clock, fluorescent lighting, wind, rain, car running and even radio or television. All these sounds contribute to the background noise and noise floor of a recording and aid the forensic examiner in authenticating a recording. However, this background noise can interfere with the forensic examination. Clarification is part of the forensic examiner’s job. It is appropriate for the forensic examiner to remove these background sounds in order to authenticate or clarify an exhibit of audio recorded evidence.

Some of the recordings experts are asked to authenticate are confession recordings created by law enforcement agencies. Defendants exclaim, “That is not what I said, they edited it” or “There is more I said that has been edited out of the recording.” Due process entitles both parties in litigation to examine any evidence presented in their case. However, original recordings are not always available for examination. How do you as a law enforcement official feel about the absence of original recordings?

I have worked on cases where missing “original evidence” was considered spoliation of evidence. Personally I believe that circumstances of each case should be considered by the forensic examiner before any decision has been made by either party.

If the forensic examiner observes characteristics that are noticeably questionable, then the expert must notify the officials in charge of their findings during the preliminary examination phase of the forensic investigation. Original recordings are required, and if not produced, a motion to suppress the evidence should be filed.


photo credit: zoom h1 recorder for use as studio backup & ontheground audio via photopin (license)

Audio Forensic Expert Demonstrating Audio Authentication

Friday, October 8th, 2010

If you ever wonder what an audio forensic expert does, here is a video of one of our activities: audio authentication.