4chan members used ElevenLabs to make fake voices of Emma Watson, Joe Rogan, and other celebrities saying racist, transphobic, and violent things. The company recently said on Twitter that “the number of cases of abuse of voice cloning has increased” and that it is trying to resolve the issue by implementing additional security measures.
Fake voices of celebrities used in racist rhetoric
The clips range from harmless to violent, and include transphobic, homophobic, and racist content. A 4chan post with a wide variety of clips also included a link to the beta version of ElevenLabs, suggesting that ElevenLabs’ software may have been used to create the audio. ElevenLabs offers both “speech synthesis” and “voice cloning” features on its official website. For voice cloning, ElevenLabs creates a clone of a voice from a clean sample recording that is longer than a minute.
It’s getting harder to believe what we see and hear online
Perhaps the emergence of “deepfake” audio clips should come as no surprise, since we saw a similar phenomenon a few years ago, when advances in artificial intelligence and machine learning were used to produce fake videos of celebrities.
With fake videos, fake voices, and fake gestures, the things we see and hear on the internet are drifting away from reality. Of course, these technologies were not developed for these purposes. For example, the official website of ElevenLabs lists intended uses such as voicing newsletters, audiobooks, and video. Still, at this point, the saying attributed to Edgar Allan Poe comes to mind: “Believe only half of what you see, none of what you hear.”