Translation technology | 05.03.2010

Voice-generating technology hitting all the right notes

Artificial voice generators generally receive a lot of bad press, but this week was an exception. Two developments in the communications market were announced to worldwide acclaim: a silent-speech device incorporating an automatic translation tool with a twist; and a bespoke voice synthesizer which was aired on the Oprah Winfrey Show.

Silence was certainly speaking volumes at the CeBIT trade fair in Germany this week when scientists from the Karlsruhe Institute of Technology (KIT) demonstrated a device capable of ‘lipreading’ and transforming these movements into speech. The technology in question is called Silent Sounds which according to AFP works by electromyography – ‘monitoring the muscular movements produced when we speak and converting them into electrical pulses that can then be turned into speech.’

Currently the device functions through a variety of electrodes attached to the skin but it is anticipated that within a decade, the technology will become an everyday feature of mobile phones once it can be integrated into handsets. It is said to be 99 per cent accurate at the moment, but its success with different accents or technical language remains to be seen.

However, Silent Sounds does boast another feature and that is the automatic translation application which translates the input language into an output language of the user’s choice. At the moment it is mainly European languages which are on the menu as the developers explained that support for Chinese, for example, would require more development to incorporate ‘tone’.

But this type of technology is also important for the medical world and could help improve the quality of life for people who have no longer have the ability for speech due to an operation, illness, or accident. Such was the case for American film critic Roger Ebert who lost his voice four years ago following an operation. This week he unveiled a bespoke piece of voice-generating software on the Oprah Winfrey Show which has enabled him to speak again for the first time since the surgery that robbed him of his voice.

The device was developed by Edinburgh speech synthesis company, Cereproc, and what makes this machine stand out is that the computer-generated output sounds like Mr Ebert’s voice and not an electronic reproduction. The BBC reported how this was made possible through a process of accessing recordings of Mr Ebert’s voice, breaking these down into individual sounds, completing a transcription stage and finally reassembling everything. The user types out what he/she would like to say and the computer generates a ‘human’ voice. Mr Ebert commented that ‘It still needs improvements, but at least it sounds like me.’

These innovative technologies could well become common place in the future and what may seem like science fiction today, may be everyday communication tools when the products become market ready. For example, the ongoing work with the Silent Sounds device includes developing a system which is operable in offices and budding MI5 agents, military personnel, cinema-goers wishing to communicate from inside the theatre and even commuters will surely be adding it to their wish lists.

Further development stages and lots of tweaking are undoubtedly the order of the day for these devices and the jury is still out on the degree of success with which the automatic translation application will deal with the nuances and complexities of language. However, from those who would prefer to use silent communication for security reasons to the truly life-changing experience of giving people their voice back, there is no doubt that voice-generating technology is certainly hitting all the right notes.

Name	Host	Duration	Type	Description
NID	google.com	6 months	Third Party	This cookie is used by Google to create a profile based on user’s interest and display personalised ads to the users.
_gid	google.com	1 day	Third Party	This cookie is installed by Google Analytics. The cookie is used to store information of how visitors use a website and helps in creating an analytics report of how the website is doing. The data collected includes the number visitors, the source where they have come from, and the pages visited in an anonymous form.
_ga	google.com	2 years	Third Party	This cookie is installed by Google Analytics. The cookie is used to calculate visitor, session, campaign data and keep track of site usage for the site’s analytics report. The cookies store information anonymously and assigns a randomly generated number to identify unique visitors.
MR	microsoft.com	1 week	Third Party	This cookie is used to measure the use of the website for analytics purposes.
MUID	microsoft.com	1 year	Third Party	Used by Microsoft as a unique identifier. The cookie is set by embedded Microsoft scripts. The purpose of this cookie is to synchronise the ID across many different Microsoft domains to enable user tracking.
IDE	google.com	2 years	Third Party	Used by Google DoubleClick and stores information about how the user uses the website and any other advertisement before visiting the website. This is used to present users with ads that are relevant to them according to the user profile.
GPS	youtube.com	30 minutes	Third Party	This cookie is set by Youtube and registers a unique ID for tracking users based on their geographical location
VISITOR_INFO1_LIVE	youtube.com	5 months	Third Party	This cookie is set by Youtube. Used to track the information of the embedded YouTube videos on a website.

Name	Host	Duration	Type	Description
_gat_UA-5518708-1	google.com	1 minute	Third Party	This is a pattern type cookie set by Google Analytics, where the pattern element on the name contains the unique identity number of the account or website it relates to. It appears to be a variation of the _gat cookie which is used to limit the amount of data recorded by Google on high traffic volume websites.
__cfduid	cloudflare.com	1 month	Third Party	The cookie is set by CloudFare. The cookie is used to identify individual clients behind a shared IP address to apply security settings on a per-client basis. It does not correspond to any user ID in the web application and does not store any personally identifiable information.
viewed_cookie_policy	thetranslationpeople.com	11 month	Self	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not the user has consented to the use of cookies. It does not store any personal data.
cookielawinfo-checkbox-necessary	thetranslationpeople.com	11 month	Self	This cookie is set by the GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Necessary".
YSC	youtube.com	1 month	Third Party	This cookies is set by Youtube and is used to track the views of embedded videos.

Voice-generating technology hitting all the right notes

Need help with a translation? Get in touch with us

Need help with a translation?
Get in touch with us