Fake News

viscousmemories · #1 04-28-2019, 03:57 PM

We are so fucked.

Not The White House Correspondents' Dinner: Deep...

BrotherMan · #2 04-28-2019, 05:29 PM

I remember way back in the days of Forrest Gump. Even though it was janky a few times, I remember thinking you couldn't even believe everything you see any more. Now that's been turned up to 11.

JoeP · #3 04-28-2019, 06:13 PM

Key takeaways:

(1) Porn stars to fix climate change? Porn stars in the White House! Elected I mean, not just paid per night.
(2) Doubt whether or not the wall exists? Perfect! Swamp all Trump's swampy talking points with messages of congratulation on how good and effective the wall is.
(3) They've got to get beyond having to use impersonators for the voices. Surely we have fake voice technology already?

erimir · #4 04-29-2019, 12:47 AM

We have pretty good fake voice technology, but afaik it requires significant amounts of audio.

For example, Google Translate has pretty good voices for many languages, but it does not need to convey different emotions, and they're not concerned with imitating unusual intonation patterns (as they would need to be to imitate Trump or Obama, who both have fairly distinctive intonations). But for the voices that are the highest quality, like say French or Hindi (the Hindi voice is also very good at English, actually), they are most likely using concatenative synthesis, which is essentially stitching the required sounds together from a database of recordings. For the sound /k/ in "recording", for example, you wouldn't want to use just any /k/; you'd want one where it follows a vowel like in "re" and is followed by a vowel like in "or". Your database works best if it covers as many combinations as possible, which then allows you to generate any novel word or sequence of words (since the first/last sound of the next/preceding word will also have an effect).
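
To make the "not just any /k/" idea concrete, here is a toy sketch of context-sensitive lookup. It is only an illustration under my own assumptions, not how Google or any real system does it: the function name `select_units`, the `"#"` boundary marker, the `"*"` wildcard, and the recording file names are all made up.

```python
from typing import Dict, List, Tuple

BOUNDARY = "#"  # assumed marker for "start/end of the utterance"
ANY = "*"       # assumed wildcard meaning "any neighbouring sound"

# Key: (previous sound, sound, next sound). Value: a hypothetical recording id.
Unit = Tuple[str, str, str]


def select_units(phones: List[str], db: Dict[Unit, str]) -> List[str]:
    """Pick one recorded snippet per sound, preferring the closest context match."""
    chosen = []
    for i, phone in enumerate(phones):
        left = phones[i - 1] if i > 0 else BOUNDARY
        right = phones[i + 1] if i + 1 < len(phones) else BOUNDARY
        # Try the exact context first, then fall back to weaker matches.
        candidates = [
            (left, phone, right),
            (left, phone, ANY),
            (ANY, phone, right),
            (ANY, phone, ANY),
        ]
        for key in candidates:
            if key in db:
                chosen.append(db[key])
                break
        else:
            raise KeyError(f"no recording available for {phone!r}")
    return chosen


# Tiny made-up database: one context-specific /k/ plus generic fallbacks.
toy_db: Dict[Unit, str] = {
    ("i", "k", "o"): "k_between_i_and_o.wav",
    (ANY, "k", ANY): "k_generic.wav",
    (ANY, "i", ANY): "i_generic.wav",
    (ANY, "o", ANY): "o_generic.wav",
}

print(select_units(["i", "k", "o"], toy_db))
print(select_units(["o", "k", "i"], toy_db))
```

The first call picks the context-specific /k/, while the second has to fall back to the generic one, which is the case where the output would sound worse. That is why coverage of the database matters so much.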

To get the high quality output of the Google Translate Hindi voice, they most likely have hours of a paid voice actress reading text designed to include many different sound combinations and the types of intonation they need to cover (Google Translate might not need to be able to generate, say, a sarcastic intonation, but they do want "question intonation" and things of that nature).

It's been a little while since I have learned much about voice synthesis, but I imagine this still holds based on the fact that they're still using robot voices for many languages on Google Translate (like Welsh, Serbian and Swahili) and completely lack voice synthesis for others (like Lao, Persian and Hebrew). If they could get high quality synthesis with only a few minutes of recordings, they would probably be willing to put the little money required into it for these lower resource languages (Swahili, Persian and Lao do, after all, have tens of millions of speakers each).

I assume if you want to account for different voice qualities like shouting, singing and so forth, you might need similar amounts of recordings of the appropriate type to be really accurate.

The conclusion I would draw from this is that it would be easiest to generate high-quality imitation voices for high-profile politicians, actors and TV/radio/podcast hosts: anyone who has a lot of recorded data available. I wouldn't be too worried about them being able to fake your voice based on, say, you telling a telemarketer you're not interested. That's not enough high-quality data. But someone like Donald Trump or Barack Obama? They can definitely generate some pretty good audio, even if they couldn't, say, do a good standup comedy routine with all the variation in intonation required for that type of performance.

Ari · #5 04-29-2019, 05:54 PM

While worrisome, the technique is still both a bit brute-force and processor-intensive at the moment. Her Trump looked a bit rubbery because it's still just mapping pixels to pixels and I don't believe it has a greater understanding of what a face is and does. I bet if that Trump turned his head the system would panic, even if fed with side images of his face.

Which makes me wonder if we're going to start down the dystopian path of adding CAPTCHA-like motions, ones known to freak out current software, to speeches or events just to prove the footage is real.

ChuckF · #6 04-29-2019, 08:18 PM

Your Trump looks a bit rubbery

Sock Puppet · #7 04-30-2019, 09:25 PM

I'd be more concerned with fake video footage, if the holders of the highest offices in the land (and their mouthpieces, with a special fuck-you to Sarah, you should be driven from any public place, you rotten fucking cow) couldn't stand in front of the entire public and press, and spout the most ridiculously stupid lies with complete goddamned impunity.

That was the less unhinged part of the post. Pretend I had enough restraint to keep it to that.