Thread: [News] Lyrebird can 'copy the voice of anyone' in a minute

  1. #1
    Join XS BOINC Team StyM's Avatar
    Join Date
    Mar 2006
    Location
    Tropics
    Posts
    9,468

    [News] Lyrebird can 'copy the voice of anyone' in a minute

    http://hexus.net/tech/news/software/...anyone-minute/

    Montreal-based Lyrebird has published a microsite showcasing its voice imitation algorithms. The AI startup claims it has developed an API which will let you synthesize speech using anyone's voice, from just a minute-long recording of the person. The speaker doesn't need to have said any of the words contained in the generated audio. Furthermore, you will be able to select emotions for the speech, such as anger, sympathy, or stress.
    In the main demo from the showcase page at Lyrebird.ai, you get to hear a synthesized voice switch between Barack Obama, Donald Trump, and Hillary Clinton imitations. There are plenty of other examples to check out. This is pretty impressive for real-time generation and voice imitation switching, though it's easy in some segments to detect the 'robotic' speech sound.

    As TNW reminds us, Adobe showcased similar voice mimicking tech last year, under the name of Project VoCo. However, Adobe's tech required about 20 minutes of user speech for its mimicking tasks, plus its software package installed on the client system. Lyrebird only needs a minute-long recording and will shortly launch a cloud-based API service where you upload audio and download your synthesized speech.

  2. #2
    Xtremely High Voltage Sparky's Avatar
    Join Date
    Mar 2006
    Location
    Ohio, USA
    Posts
    16,040
    This intrigues me, given that this could translate well to the industry I'm in (speech generation devices for the disabled).
    The Cardboard Master
    Crunch with us, the XS WCG team
    Intel Core i7 2600k @ 4.5GHz, 16GB DDR3-1600, Radeon 7950 @ 1000/1250, Win 10 Pro x64

  3. #3
    Xtreme Enthusiast
    Join Date
    Oct 2012
    Posts
    687
    This can be terrifying. If it's THAT good, any recording of someone's voice used in a court case could be faked.
    Intel 5960X@4.2Ghz[Prime stable]@4.5 [XTU stable] 1.24v NB@3.6ghz Asrock X99 Extreme 3 4x8GB Corsair Vengeance@3200 16-17-17
    Sapphire nitro+ VEGA 56 Samsung SSD 850 256GB Crucial MX100 512GB HDD:WD10TB WD:8TB Seagate8TB

  4. #4
    Xtreme Cruncher
    Join Date
    Nov 2008
    Location
    NE Ohio, USA
    Posts
    1,608
    Huh, I was expecting to read something about Parrots

    Quote: Originally Posted by Sparky
    This intrigues me, given that this could translate well to the industry I'm in (speech generation devices for the disabled).
    Interesting. I was amazed (and disappointed) at the same time with the devices available when we had to go through assessment, research, and finally insurance denial with our son, who has ADHD w/Anxiety and specific learning disabilities. This was about five years ago, when he wasn't talking much at 4-5 yrs old and needed something for communication at school and home. The device in question was made by Tobii. After everything we had to go through, I took matters into my own hands and ended up buying an iPad 2 and a $100 app called Sono Flex for him. That, along with lots of speech therapy, got him on the right track.

    So, thank you for whatever you do in that field.
    24/7 Cruncher #1
    Crosshair VII Hero, Ryzen 3900X, 4.0 GHz @ 1.225v, Arctic Liquid Freezer II 420 AIO, 4x8GB GSKILL 3600MHz C15, ASUS TUF 3090 OC
    Samsung 980 1TB NVMe, Samsung 870 QVO 1TB, 2x10TB WD Red RAID1, Win 10 Pro, Enthoo Luxe TG, EVGA SuperNOVA 1200W P2

    24/7 Cruncher #2
    ASRock X470 Taichi, Ryzen 3900X, 4.0 GHz @ 1.225v, Arctic Liquid Freezer 280 AIO, 2x16GB GSKILL NEO 3600MHz C16, EVGA 3080ti FTW3 Ultra
    Samsung 970 EVO 250GB NVMe, Samsung 870 EVO 500GB, Win 10 Ent, Enthoo Pro, Seasonic FOCUS Plus 850W

    24/7 Cruncher #3
    GA-P67A-UD4-B3 BIOS F8 mod, 2600k (L051B138) @ 4.5 GHz, 1.260v full load, Arctic Liquid 120, (Boots Win @ 5.6 GHz per Massman binning)
    Samsung Green 4x4GB @2133 C10, EVGA 2080ti FTW3 Hybrid, Samsung 870 EVO 500GB, 2x1TB WD Red RAID1, Win10 Ent, Rosewill Rise, EVGA SuperNOVA 1300W G2

    24/7 Cruncher #4 ... Crucial M225 64GB SSD Donated to Endurance Testing (Died at 968 TB of writes...no that is not a typo!)
    GA-EP45T-UD3LR BIOS F10 modded, Q6600 G0 VID 1.212 (L731B536), 3.6 GHz 9x400 @ 1.312v full load, Zerotherm Zen FZ120
    OCZ 2x2GB DDR3-1600MHz C7, Gigabyte 7950 @1200/1250, Crucial MX100 128GB, 2x1TB WD Red RAID1, Win10 Ent, Centurion 590, XFX PRO650W

    Music System
    SB Server->SB Touch w/Android Tablet as a remote->Denon AVR-X3300W->JBL Studio Series Floorstanding Speakers, JBL LS Center, 2x SVS SB-2000 Subs


  5. #5
    Xtremely High Voltage Sparky's Avatar
    Join Date
    Mar 2006
    Location
    Ohio, USA
    Posts
    16,040
    There has been a lot of progress in the SGD field in those years (I've been in it for 3 years, on the IT end of it).

    My main focus has been SGDs with alternative input, eye tracking being one of them. It can be amazing and frustrating technology, to say the least.

    I dislike Tobii for a few reasons. One is that they are a competitor, but their software also sucks IMO compared to some alternatives, and they seem to be a bit too heavily money-driven: they have been adding things that you cannot do without a monthly subscription fee, some of which are not insignificant.
