Google Cloud has announced a new feature within its TTS API that lets users generate a unique, new synthetic voice trained from recordings.
In a recent blog post, Google Cloud announced the general availability of Custom Voice within its Cloud Text-to-Speech (TTS) API. The new feature will offer users an alternative to the usual digital assistants and conversational interfaces they have grown accustomed to hearing.
Among other things, it lets them train custom voice models using their own audio recordings to create unique synthetic voice experiences.
Related | Google Replaces Classic Hangouts With Chat On Workspace
The feature can be helpful for businesses looking to establish a strong brand identity, as the Custom Voice can, for example, turn the interactive voice responses (IVR) of a customer service interaction into a unique customer experience.
Up until now, Google’s TTS API provided predefined options for its speech synthesis service with a static list of voices only.
Users can access the new Custom Voice directly in the TTS API and simply submit their audio recordings to use the feature. The service offers users a guide on the audio requirements to help make sure the custom voice generated is of the best quality. Upon ending the training, users can start using the new custom voice by referencing the model ID in their calls to the Cloud TTS API.
Google Cloud has also reassured its users that the company has conducted a deep ethical evaluation of the new feature and its relation to synthetic media in order to “surface and mitigate potential harms that it may create.”
Users interested in creating their personalized synthetic voice will need to go through a review process to ensure each use case is “aligned with [Google’s] AI Principles and adequate voice actor consent is given.”
Furthermore, to ensure that the audio recording submitted to generate the new voice is the user’s and not someone else’s, the process will require the users to read a sentence that Google Cloud chooses – for example: “I agree that my voice will be used to create a synthetic custom Text-to-Speech voice.”
TTS Custom Voice is now GA in English (US, AU, and UK), Spanish (US and Spain), French (France and Canada), Italian, German, Portuguese (Brazil), and Japanese.
More languages will be made available in the future. Interested users can already contact their seller and begin undergoing the review process.
Photo by Craig Pattenaude on Unsplash
You might also like
More from Google
Google Product Studio Lets Merchants Create Product Imagery With Generative AI
Google is launching Product Studio, a new tool that lets merchants create product imagery for free, using generative AI. Google Product …
Google Will Delete Accounts That Have Been Inactive For Two Years
Google said it plans to delete account that have not accessed any of its services for two years, as part …
Gmail Is Getting Blue Verified Checkmarks Too
A blue checkmark on Gmail means companies have successfully verified their identity through BIMI, showing they are a legitimate sender. It …
No More Passwords, Google Is Rolling Out Passkeys Globally
You no longer require a password to sign in on to your Google account, with passkeys rolling out to all …
Tech Companies Are Teaming Up To Free Us From Passwords
Apple, Google, and Microsoft are committing to expanded support for the FIDO standard to bring a passwordless future.
Google Replaces Classic Hangouts With Chat On Workspace
On March 22, users attempting to access Google's Hangouts chat services will automatically be redirected to Google Chat instead.
Google Rolls Out New Search Filters To All Users On Drive
Almost two years after introducing search chips into Gmail, Google has now announced the rollout of supportive filters to Google …
Google Unveils Early Access To Chrome OS Flex For PCs And Macs
Google has announced early access to Chrome OS Flex, a new version of Chrome OS that will bring the benefits …