Mayo Clinic Minute: Using voice to detect neurodegenerative disease

There's a lot of brain power that goes into speech. First, there's a thought or idea, which the brain must translate into words. Those words are then translated into specific movements of the lungs, tongue and mouth to shape the sounds. Those movements must then be executed precisely and timed with the breath. If the brain is damaged by a stroke or affected by a brain disease, the timing of those movements, or the translation itself, can go wrong. Because of that, changes in voice and speech can provide the first clues to a neurodegenerative disease.

In this Mayo Clinic Minute, Dr. Hugo Botha, a Mayo Clinic behavioral neurologist, explains how voice samples collected for research can help diagnose neurodegenerative diseases early.

Journalists: Broadcast-quality video (1:10) is in the downloads at the end of this post. Please courtesy: "Mayo Clinic News Network."

"There are some diseases where the very first manifestation is in someone's voice or their speech," says Dr. Botha. Those include Parkinson’s disease; atypical parkinsonism such as multiple system atrophy, progressive supranuclear palsy and corticobasal syndrome; amyotrophic lateral sclerosis (ALS); myasthenia gravis; and some types of frontotemporal dementia that can result in aphasia.

As part of clinical practice, Mayo Clinic's neurology patients are often recorded when their voice or speech is examined, which gives clinicians the opportunity to track the disease over time.

"But separate from the clinical practice, we have a large research program at Mayo, where we are collecting voice and speech samples using an application that runs on the person's phone or the laptop computer," Dr. Botha explains.

To collect the voice samples, patients are tasked with running through a series of exams remotely.

"They could do it — say every couple of weeks, every couple of months — so we can really get a longitudinal view of their disease instead of just a snapshot," says Dr. Botha.

This large and growing speech bank, which securely stores all speech and voice samples, can be used for research, including training artificial intelligence (AI) algorithms.

"There are some signals in someone's voice and speech that a computer or an algorithm might pick up on, that a human listener wouldn't pick up on. And so that's more of the sort of research, AI side of things, where we're trying to use hundreds of recordings and patients with various diseases, and then trying to see if the computer can separate those diseases, even though human listeners may not be able to," says Dr. Botha.