Event Details:
Location
Bldg. 320-105
United States
This event is open to:
Abstract:
AI/ML workloads are increasingly shifting from cloud-only deployment to a hybrid model in which the PC becomes an intelligent endpoint. This talk focuses on the local execution of AI models and applications on PCs, discussing why on-device execution matters, key client-side use cases, and how on-device models can interoperate with cloud-hosted models to deliver end-to-end experiences. Modern AI PCs integrate dedicated accelerators, commonly called Neural Processing Units (NPUs), alongside the CPU and GPU. We will use the NPU in AMD’s Ryzen AI as an example to illustrate their power-efficient architecture and then walk through the end-to-end software stack for seamless model deployment, including the Microsoft Windows ML framework.
Speaker Bio:
Vinod Kathail is a Senior Fellow at AMD. His current focus is on the end-to-end AI/ML software stack for AI Engine NPUs in Ryzen AI and embedded devices, building on his work at Xilinx. At Xilinx, he was a Fellow and Chief Architect for the Vitis high-level programming environment. Earlier, he co-founded Synfora, a high-level synthesis company, and served as its CTO. Prior to Synfora, he worked at HP Labs, where he led the PICO high-level synthesis project. He was also one of the architects of the VLIW architecture and compiler that became Intel Itanium. He has numerous patents and publications in AI accelerators, parallel/heterogeneous architectures and programming environments, and high-level synthesis. He received a Doctor of Science in EECS from MIT.
Related Topics
Explore More Events
-
Annual Conference
SAVE THE DATE! SystemX 2026 Fall Conference (November 9-10, 2026)
-Stanford Frances C. Arrillaga Alumni Center, McCaw Hall
United States