Kun Wu

2788 San Tomas Expy
Santa Clara, CA 95051
Kun Wu (吴昆) is currently a backend compiler engineer at NVIDIA. He received his Ph.D. degree from UIUC, advised by Prof. Wen-mei Hwu. He has contributed to several impactful projects, involving Hector, PyTorch-Direct, and Pylog. He also contributed to the MLIR sparse tensor dialect during his internship at the Google MLIR Sparsifier Team (Check commits here!).
During the undergrad program, Kun was under the supervision of Prof. Yu Wang and Prof. Yuan Xie.
Wanna talk with me? Feel free to send me an email :D My availability can be queried at the published Outlook calendar. Resume is available upon request.
News
Nov 30, 2022 | We filed a patent application to USPTO earlier this month for the idea conceived during HPE Internship in Summer 2021! |
---|---|
Mar 1, 2022 | Techniques proposed in our Pytorch-Direct project is now available in DGL v0.8! Please check the CUDA UVA-based optimization in the release note. |
Aug 2, 2021 | Findings in our Pytorch-Direct project led by David (Seungwon) Min have been merged to the DGL master branch! (PR #3086, #3184, #3194) |
Apr 12, 2021 | David (Seungwon) Min and I gave a talk at GTC 2021 on our Pytorch-Direct project! |
Select Publications
-
SaintSN: Streamlined and Intelligent Storage Node System-on-a-Chip for Exascale ClusterAcceptance rate: 17.6%. Pending US patent.Proceedings of the Hewlett Packard Enterprise Technical Conference 2022
-
Memory-Bound Proof-of-Work Acceleration for Blockchain ApplicationsProceedings of the 56th Annual Design Automation Conference 2019