This AI Clones Your Voice After Listening for 5 Seconds

This forum is for anything not Reason related, if you just want to talk about other stuff. Please keep it friendly!
Post Reply
avasopht
Competition Winner
Posts: 3975
Joined: 16 Jan 2015

13 Nov 2019


dhruan
Posts: 312
Joined: 16 Jan 2015
Location: Helsinki, Finland
Contact:

13 Nov 2019

Whoa... O_o
soundcloud.com/armsgrade

User avatar
diminished
Competition Winner
Posts: 1880
Joined: 15 Dec 2018

13 Nov 2019

10 years ago it was all about feature extraction, classification and the training of GMM and NNs. Welcome to the future. Applied science is scary. Deepest fakes. But who wouldn't buy a plugin mimicking Etta James, Elton John and Celine Dion at the same time, so they can sing about drinking lean?
:reason: Most recent track: resentment (synthwave) || Others: on my YouTube channel •ᴗ•

User avatar
bitley
Posts: 1673
Joined: 03 Jul 2015
Location: sweden
Contact:

13 Nov 2019

Awesome!!!

User avatar
Periwinkle
Posts: 190
Joined: 09 Jul 2019
Location: London England

13 Nov 2019

This could mean the end of vocalists as we know them.
Now, if I can only find a way to eliminate drummers and bass players.

It certainly sounds better than this:
Image

.“Art should comfort the disturbed and disturb the comfortable.”

― Banksy

User avatar
BananaSkins
Posts: 477
Joined: 29 Sep 2017

13 Nov 2019

How much have big Companies spent on software with voice recognition security measures... :?: :oops:

User avatar
bxbrkrz
Posts: 3856
Joined: 17 Jan 2015

13 Nov 2019

"What a time to be alive!" happily says the narrator toward the end of the video. In a few years nobody will be able to tell the difference, alive or dead, in the upcoming deep fake singularity.
:puf_smile:
757365206C6F67696320746F207365656B20616E73776572732075736520726561736F6E20746F2066696E6420776973646F6D20676574206F7574206F6620796F757220636F6D666F7274207A6F6E65206F7220796F757220696E737069726174696F6E2077696C6C206372797374616C6C697A6520666F7265766572

User avatar
Loque
Moderator
Posts: 11222
Joined: 28 Dec 2015

13 Nov 2019

Scary...combined with AI videos, the perfect fake is out there. Maybe the whole TV is just an illusion?
Reason12, Win10

User avatar
MannequinRaces
Posts: 1543
Joined: 18 Jan 2015

14 Nov 2019

That is bonkers! And to think it will only improve from here, crazy.

User avatar
MrFigg
Competition Winner
Posts: 9166
Joined: 20 Apr 2018

14 Nov 2019

Awesome.
A point though...the second example the guy speaking is Scottish. The first part of the synthesized output sounds as if he’s from Somerset or Devon. English in any case. And there I stopped. Turning Scottish people into English people is an abomination of science and should not be allowed.


:)
🗲 2ॐ ᛉ

User avatar
Zac
Posts: 1784
Joined: 19 May 2016
Contact:

14 Nov 2019

MrFigg wrote:
14 Nov 2019
Awesome.
A point though...the second example the guy speaking is Scottish. The first part of the synthesized output sounds as if he’s from Somerset or Devon. English in any case. And there I stopped. Turning Scottish people into English people is an abomination of science and should not be allowed.


:)
Why? Most of you live here anyway :D

User avatar
MrFigg
Competition Winner
Posts: 9166
Joined: 20 Apr 2018

14 Nov 2019

Zac wrote:
14 Nov 2019
MrFigg wrote:
14 Nov 2019
Awesome.
A point though...the second example the guy speaking is Scottish. The first part of the synthesized output sounds as if he’s from Somerset or Devon. English in any case. And there I stopped. Turning Scottish people into English people is an abomination of science and should not be allowed.


:)
Why? Most of you live here anyway :D
Choose life. Choose Sweden.
🗲 2ॐ ᛉ

User avatar
Dogcat
Posts: 29
Joined: 21 Sep 2019

14 Nov 2019

Finally... I can have Bob Dylan and Mariah Carey perform a duet on my next production!!

dhruan
Posts: 312
Joined: 16 Jan 2015
Location: Helsinki, Finland
Contact:

14 Nov 2019

MrFigg wrote:
14 Nov 2019
Zac wrote:
14 Nov 2019


Why? Most of you live here anyway :D
Choose life. Choose Sweden.
:lol:
soundcloud.com/armsgrade

User avatar
joeyluck
Moderator
Posts: 11079
Joined: 15 Jan 2015

14 Nov 2019

These examples are pretty creepy sounding lol. Scroll down to:
  • 3. Generalization to Long Utterances
And click on the red ones.

https://google.github.io/tacotron/publi ... index.html

They use excerpts from Harry Potter, but it gets weird.
My favorite is the one beginning with, "Uncle Vernon entered the kitchen as Harry was turning over the bacon." And the one beginning with "Harry had the best morning he'd had in a long time."
(Lessac < 5sec, Location-Sensitive). Starts off pretty well and then turns into something out of a horror movie.

User avatar
reddust
Posts: 677
Joined: 07 May 2018

14 Nov 2019

Periwinkle wrote:
13 Nov 2019
It certainly sounds better than this:
Well, when we get a singing example of the AI maybe we can assure that, because Vocaloid can also sound like this:



...which in my opinion is better than the AI on the video, but if they keep the work it could really become very a interesting tool for vocals :)
joeyluck wrote:
14 Nov 2019
These examples are pretty creepy sounding lol. Scroll down to:
  • 3. Generalization to Long Utterances
And click on the red ones.

https://google.github.io/tacotron/publi ... index.html

They use excerpts from Harry Potter, but it gets weird.
My favorite is the one beginning with, "Uncle Vernon entered the kitchen as Harry was turning over the bacon." And the one beginning with "Harry had the best morning he'd had in a long time."
(Lessac < 5sec, Location-Sensitive). Starts off pretty well and then turns into something out of a horror movie.
Haha, those examples are really creepy and funny at the same time...

Post Reply
  • Information
  • Who is online

    Users browsing this forum: No registered users and 2 guests