No
Yes
View More
View Less
Working...
Close
OK
Cancel
Confirm
System Message
Delete
My Schedule
An unknown error has occurred and your request could not be completed. Please contact support.
Scheduled
Scheduled
Wait Listed
Personal Calendar
Speaking
Conference Event
Meeting
Interest
There aren't any available sessions at this time.
Conflict Found
This session is already scheduled at another time. Would you like to...
Loading...
Please enter a maximum of {0} characters.
{0} remaining of {1} character maximum.
Please enter a maximum of {0} words.
{0} remaining of {1} word maximum.
must be 50 characters or less.
must be 40 characters or less.
Session Summary
We were unable to load the map image.
This has not yet been assigned to a map.
Search Catalog
Reply
Replies ()
Search
New Post
Microblog
Microblog Thread
Post Reply
Post
Your session timed out.
This web page is not optimized for viewing on a mobile device. Visit this site in a desktop browser to access the full set of features.
2019 GTC San Jose
Add to My Interests
Remove from My Interests

S9247 - Extreme Neural Network Computing Transforms Speech Quality

Session Speakers
Session Description

We'll explore in depth the application of deep learning to advanced speech processing. We'll show how novel neural network architectures and training methods, combined with audio signal processing, deliver near-perfect separation of speech from background sounds, even in the face of heavy reverberation and non-stationary noise. Our talk highlights the unprecedented data gathering, augmentation methods, and parallel training compute that allow us to leverage thousands of hours of unique speech content. We'll delve into the software development, API, GPU-Based cloud deployment, and embedded library methods that work with the neural network to enable next-generation audio and video production, media streaming, and telephony systems. We'll also discuss the likely trajectory of deep learning technology for speech enhancement, speech recognition, speaker identification, and seamless human-machine interface.


Additional Information
Deep Learning - Speech/Language Processing
Deep Learning - Speech/Language Processing
Automotive / Transportation, Media & Entertainment, Software, Telecommunications
Intermediate technical
Talk.1
50 minutes
Session Schedule