No
Yes
View More
View Less
Working...
Close
OK
Cancel
Confirm
System Message
Delete
My Schedule
An unknown error has occurred and your request could not be completed. Please contact support.
Scheduled
Scheduled
Wait Listed
Personal Calendar
Speaking
Conference Event
Meeting
Interest
There aren't any available sessions at this time.
Conflict Found
This session is already scheduled at another time. Would you like to...
Loading...
Please enter a maximum of {0} characters.
{0} remaining of {1} character maximum.
Please enter a maximum of {0} words.
{0} remaining of {1} word maximum.
must be 50 characters or less.
must be 40 characters or less.
Session Summary
We were unable to load the map image.
This has not yet been assigned to a map.
Search Catalog
Reply
Replies ()
Search
New Post
Microblog
Microblog Thread
Post Reply
Post
Your session timed out.
This web page is not optimized for viewing on a mobile device. Visit this site in a desktop browser to access the full set of features.
2019 GTC San Jose
Add to My Interests
Remove from My Interests

S9979 - Using ONNX for Accelerated Inferencing on Cloud and Edge

Session Speakers
Session Description

Are you a developer looking to operationalize machine learning models from different sources without compromising performance? Are you a data scientist who wishes there was a way to use the machine learning framework you want without worrying about how to deploy it to a variety of end points on cloud and edge? We'll describe ONNX, which provides a common format supported by many popular AI frameworks and hardware. Learn about ONNX and its core concepts and find out how to create ONNX models using frameworks like TensorFlow, PyTorch, and SciKit-Learn. We'll explain how to deploy models to cloud or edge using the high-performance, cross-platform ONNX Runtime, which leverages accelerators like NVIDIA TensorRT. Our talk will include case studies of Microsoft teams improving latency and reducing costs, thanks to ONNX.


Additional Information
Deep Learning/AI Frameworks
Deep Learning/AI Frameworks
General
All technical
Talk.1
50 minutes
Session Schedule