Voicebox, Meta’s new producing artificial intelligence model, can do almost anything related to voice

2 years ago

103

Meta, the umbrella company of Facebook and Instagram, announced its new productive artificial intelligence model. Voicebox was designed to assist creators with its ability to perform speech creation tasks such as audio editing, sampling, and styling, although it was not specifically trained.

Meta says this new AI model will benefit many people around the world. He gives many examples, such as helping visually impaired people hear text messages from their friends in their own voices. It can also enable people to speak foreign languages with their own voices.

The AI model is capable of producing high-quality sound clips and is capable of editing pre-recorded sounds to eliminate unwanted noises such as car horns. Besides that, it can produce sounds in six languages while maintaining content and style. The model is also expected to give natural voices to visual assistants in the future, or to real non-player characters in games in the metaverse.

Meta compared Voicebox to other voice AI models on the market and specifically cited Vall-E and YourTTS as competitors. When comparing word error rates and style similarity, Voicebox is more advanced and outperforms both models.

Voicebox is built on Meta’s newest non-autoregressive generative model, a Flow Matching model that is capable of highly non-deterministic matching between text and speech. Voicebox has so far been trained using over 50,000 hours of recorded speech and transcripts from publicly available audiobooks in English, French, Spanish, German, Polish and Portuguese.

Meta will not make the artificial intelligence program available to everyone, nor will it share its source code.

Cookie	Duration	Description
cookielawinfo-checkbox-analytics	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checkbox-functional	11 months	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-others	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-performance	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
viewed_cookie_policy	11 months	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.

Shock Argument on the Agenda: Nuclear war on Mars and traces of a lost civilization

Galaxy S25 Edge can be sold in only two countries in the first place

Ripple’s image appeared! Is the XRP price for $ 3.40?

There is a discount of up to 90 %in more than 300 games, including Baldur’s Gate 3! Here are the games that decreased this week in Steam

The new nuclear rocket will be able to reach Pluto in just 4 years

China has imposed new restrictions on less earth elements

TechnoPixel

Voicebox, Meta’s new producing artificial intelligence model, can do almost anything related to voice