Chris Remington@beehaw.orgM to Technology@beehaw.org · 6 months agoMicrosoft’s VASA-1 can deepfake a person with one photo and one audio trackarstechnica.comexternal-linkmessage-square55fedilinkarrow-up188arrow-down11cross-posted to: technology@lemmy.worldartificial_intel@lemmy.mltechnology@lemmy.ziptechnology@lemmy.ml
arrow-up187arrow-down1external-linkMicrosoft’s VASA-1 can deepfake a person with one photo and one audio trackarstechnica.comChris Remington@beehaw.orgM to Technology@beehaw.org · 6 months agomessage-square55fedilinkcross-posted to: technology@lemmy.worldartificial_intel@lemmy.mltechnology@lemmy.ziptechnology@lemmy.ml
minus-squareluciole (he/him)@beehaw.orglinkfedilinkarrow-up26·6 months agoThe actual research page is so awkward. The TLDR at the top goes: single portrait photo + speech audio = hyper-realistic talking face video Then a little lower comes the big red warning: We are exploring visual affective skill generation for virtual, interactive characters, NOT impersonating any person in the real world. No siree! Big “not what it looks like” vibes.
The actual research page is so awkward. The TLDR at the top goes:
Then a little lower comes the big red warning:
No siree! Big “not what it looks like” vibes.