cryptogon.com

OpenAI Can Re-Create Human Voices from 15 Second Samples

March 30th, 2024

Fifteen seconds? *pfft*

Google AI Clones Your Voice After Listening for 5 Seconds

Microsoft’s New AI Can Simulate Anyone’s Voice with 3 Seconds of Audio

Voice synthesis has come a long way since 1978’s Speak & Spell toy, which once wowed people with its state-of-the-art ability to read words aloud using an electronic voice. Now, using deep-learning AI models, software can create not only realistic-sounding voices, but also convincingly imitate existing voices using small samples of audio.

Further Reading
Microsoft’s new AI can simulate anyone’s voice with 3 seconds of audio

Along those lines, OpenAI just announced Voice Engine, a text-to-speech AI model for creating synthetic voices based on a 15-second segment of recorded audio. It has provided audio samples of the Voice Engine in action on its website.

Once a voice is cloned, a user can input text into the Voice Engine and get an AI-generated voice result. But OpenAI is not ready to widely release its technology yet. The company initially planned to launch a pilot program for developers to sign up for the Voice Engine API earlier this month. But after more consideration about ethical implications, the company decided to scale back its ambitions for now.

Open AI: Navigating the Challenges and Opportunities of Synthetic Voices

Posted in Rise of the Machines, Technology | Top Of Page

You must be logged in to post a comment.

The New Zealand Copyright Act 1994 specifies certain circumstances where all or a substantial part of a copyright work may be used without the copyright owner's permission. A "fair dealing" with copyright material does not infringe copyright if it is for the following purposes: research or private study; criticism or review; or reporting current events. If you are a legal copyright holder, or a designated agent for such, and you believe a post on this website falls outside the boundaries of "fair dealing," and legitimately infringes on your or your client's copyright, please contact Kevin Flaherty. Cryptogon contains both original material and material from external sources. Original material: Copyright Kevin Flaherty. Material from external sources: Copyright the respective owners / authors.

Design by Andreas Viklund | Ported by Matteo Turchetto

news – analysis – conspiracies

OpenAI Can Re-Create Human Voices from 15 Second Samples

Leave a Reply

Cryptogon Reader Support in April

Header Image