{"id":1781,"date":"2018-08-02T20:48:40","date_gmt":"2018-08-02T20:48:40","guid":{"rendered":"https:\/\/blogs.mathworks.com\/headlines\/?p=1781"},"modified":"2025-11-05T16:04:53","modified_gmt":"2025-11-05T21:04:53","slug":"start-up-helps-two-big-sisters-create-a-custom-synthesized-voice-for-their-sister","status":"publish","type":"post","link":"https:\/\/blogs.mathworks.com\/headlines\/2018\/08\/02\/start-up-helps-two-big-sisters-create-a-custom-synthesized-voice-for-their-sister\/","title":{"rendered":"Start-up helps two big sisters create a custom, synthesized voice for their sister"},"content":{"rendered":"<p>Maeve is an eleven-year-old who lives just outside of Boston with her parents and two big sisters. She has cerebral palsy and relies on a computer-generated voice synthesizer to communicate. The problem is, there are only a handful of voices available for her assistive communication device, and none of them are quite right for the pre-teen known for her sense of humor.<\/p>\n<p>The family learned that a local start-up, <a href=\"https:\/\/www.vocalid.co\/\" target=\"_blank\" rel=\"noopener\">VocaliD<\/a>, was developing technology to create custom voices for assistive speech technology. After Maeve\u2019s sister Erin emailed the start-up, Maeve became VocaliD\u2019s first customer.<\/p>\n<p>VocaliD was founded by Dr. Rupal Patel, a former speech therapist. The start-up crowdsources human speech to create custom digital voices. Dr. Patel told a <a href=\"http:\/\/www.wcvb.com\/article\/5-for-good-local-girls-help-give-their-sister-with-cerebral-palsy-her-own-voice\/15955647\" target=\"_blank\" rel=\"noopener\">local TV station<\/a>, &#8220;Even when someone has a very severe speech disorder, there are certain aspects of their voice that are preserved. 
I thought these individuals have unique voices, why can&#8217;t we make their devices sound more differentiated?&#8221;<\/p>\n<h2>Augmentative and alternative communication<\/h2>\n<p>Augmentative and alternative communication (AAC) has given many people the ability to speak, but the voices often sound computer-generated or like those of 40-year-olds. The options don\u2019t typically fit young children. Adults who lose their voice to a medical condition often feel they lose part of their identity along with it. The most famous generated digital voice is likely the robotic-sounding speech synthesizer used by the late Stephen Hawking.<\/p>\n<p style=\"padding-left: 30px;\"><em>The Guardian, <\/em>in an article titled <a href=\"https:\/\/www.theguardian.com\/news\/2018\/jan\/23\/voice-replacement-technology-adaptive-alternative-communication-vocalid\" target=\"_blank\" rel=\"noopener\">How a new technology is changing the lives of people who cannot speak<\/a>, explained, \u201cHawking\u2019s case is one of the most striking examples of the way a person\u2019s voice shapes their identity. Though the robotic quality of his digital voice (and the American accent) felt inappropriate at first, it came to be his trademark. Hawking reshaped himself around his new voice, and years later, when he was offered the opportunity to use a new voice that was smoother, more human-sounding, and English, he refused. This felt like \u201chim\u201d now.<\/p>\n<p style=\"padding-left: 30px;\">\u201cThe \u201cStephen Hawking voice\u201d doesn\u2019t belong only to Hawking. In the years since it was created, the same voice has also been used by little girls, old men, and people of every racial and ethnic background. This is one of the stranger features of the world of people who rely on AAC: millions of them share a limited number of voices. 
While there is more variety now than before, only a few dozen options are widely available, and most of them are adult and male.\u201d<\/p>\n<h2>Creating a truly unique, synthesized voice<\/h2>\n<p>While Maeve cannot form words, the sounds she can make provide information about how her speech would sound. Geoffrey Meltzner, who leads research and technology at VocaliD, recorded sounds that Maeve can make in order to create a custom voice. Her sisters Erin and Meghan then recorded hours of speech, and data from a voice donor was added. The data from the four sources were used to train a statistical synthesizer. With less than five hours of processing, VocaliD developed a custom voice they could install on Maeve\u2019s AAC device. It was not a copy of her sisters&#8217; voices, but rather a voice of her own based on the sound characteristics of her vocalizations.<\/p>\n<p>&nbsp;<\/p>\n<p><div id=\"attachment_2186\" style=\"width: 706px\" class=\"wp-caption alignnone\"><a href=\"http:\/\/goodkin.org\/wp-content\/uploads\/2016\/12\/Vocal-ID-Recording_goodkin-696x364.jpg\" target=\"_blank\" rel=\"attachment noopener wp-att-2186\"><img aria-describedby=\"caption-attachment-2186\" decoding=\"async\" loading=\"lazy\" class=\"wp-image-2186 size-full\" src=\"https:\/\/blogs.mathworks.com\/headlines\/files\/2018\/08\/Vocal-ID-Recording.jpg\" alt=\"\" width=\"696\" height=\"364\" \/><\/a><p id=\"caption-attachment-2186\" class=\"wp-caption-text\">VocaliD speech synthesizer. Image Credit: VocaliD.<\/p><\/div><\/p>\n<p>&nbsp;<\/p>\n<p style=\"padding-left: 30px;\">&#8220;VocaliD uses state-of-the-art speech signal processing algorithms to extract the vocal identity cues from recipients like Maeve which are blended with recordings of a matched voicebank contributor,&#8221; says Meltzner. &#8220;While Maeve\u2019s sisters shared their voice in this instance, siblings aren\u2019t necessarily the best matches. 
The blended spoken dataset is then used to train a deep learning-based statistical speech synthesizer to create Maeve\u2019s uniquely personalized, digital voice which can then be installed on her speech generating device.<\/p>\n<p style=\"padding-left: 30px;\">&#8220;<a href=\"https:\/\/www.mathworks.com\/products\/matlab.html\" target=\"_blank\" rel=\"noopener\">MATLAB<\/a>, and especially the <a href=\"https:\/\/www.mathworks.com\/products\/signal.html\" target=\"_blank\" rel=\"noopener\">Signal Processing Toolbox<\/a>, played an integral role in the prototyping, refining, and testing of the speech processing algorithms which are used in the production of our voices today.&#8221;<\/p>\n<p>&nbsp;<\/p>\n<h2>Donating or banking your voice<\/h2>\n<p>People who know their loss of voice is imminent can bank their voice. This option helps people facing oral surgery or progressive diseases such as ALS maintain the ability to communicate in their natural voice.<\/p>\n<p>Voice donors are also needed, especially for younger children and teens. <a href=\"https:\/\/www.nbcnews.com\/health\/health-news\/new-technology-gives-those-unable-speak-voice-more-their-own-n710026\" target=\"_blank\" rel=\"noopener\"><em>NBC News<\/em><\/a> shared how a group of seventh graders in California added their voices to the growing collection.<\/p>\n<p>You can change someone\u2019s life by simply reading out loud and recording your voice. A voice donor must record approximately 3,500 sentences. 
To become a voice donor, visit VocaliD\u2019s <a href=\"https:\/\/vocalid.ai\/voicebank\/\" target=\"_blank\" rel=\"noopener\">voicebank<\/a>.<\/p>\n","protected":false},"excerpt":{"rendered":"<div class=\"overview-image\"><!-- Featured Image From URL plugin --> <img decoding=\"async\" src=\"https:\/\/blogs.mathworks.com\/headlines\/files\/2018\/08\/Vocal-ID-Recording.jpg\" alt=\"\" style=\"\"><\/div>\n<p>Maeve is an eleven-year-old that lives just outside of Boston with her parents and two big sisters. She has cerebral palsy and relies on a computer-generated voice synthesizer to communicate. The&#8230; <a class=\"read-more\" href=\"https:\/\/blogs.mathworks.com\/headlines\/2018\/08\/02\/start-up-helps-two-big-sisters-create-a-custom-synthesized-voice-for-their-sister\/\">read more >><\/a><\/p>\n","protected":false},"author":138,"featured_media":-1,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":[],"categories":[1],"tags":[],"_links":{"self":[{"href":"https:\/\/blogs.mathworks.com\/headlines\/wp-json\/wp\/v2\/posts\/1781"}],"collection":[{"href":"https:\/\/blogs.mathworks.com\/headlines\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/blogs.mathworks.com\/headlines\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/blogs.mathworks.com\/headlines\/wp-json\/wp\/v2\/users\/138"}],"replies":[{"embeddable":true,"href":"https:\/\/blogs.mathworks.com\/headlines\/wp-json\/wp\/v2\/comments?post=1781"}],"version-history":[{"count":8,"href":"https:\/\/blogs.mathworks.com\/headlines\/wp-json\/wp\/v2\/posts\/1781\/revisions"}],"predecessor-version":[{"id":4808,"href":"https:\/\/blogs.mathworks.com\/headlines\/wp-json\/wp\/v2\/posts\/1781\/revisions\/4808"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/blogs.mathworks.com\/headlines\/wp-json\/"}],"wp:attachment":[{"href":"https:\/\/blogs.mathworks.com\/headlines\/wp-json\/wp\/v2\/media?parent=1781"}],"wp:term":[{"taxonomy":"cate
gory","embeddable":true,"href":"https:\/\/blogs.mathworks.com\/headlines\/wp-json\/wp\/v2\/categories?post=1781"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/blogs.mathworks.com\/headlines\/wp-json\/wp\/v2\/tags?post=1781"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}