Yang Sui | 隋阳
Howdy! I'm a Postdoctoral Research Associate in the Department of Computer Science at Rice University, working with Prof. Xia "Ben" Hu and Prof. Hanjie Chen on Efficient and Trustworthy AI, especially for LLMs, Diffusion Models, and Multimodal Generative Models.
Prior to joining Rice, I completed my Ph.D. in the Department of Electrical and Computer Engineering at Rutgers University, advised by Prof. Bo Yuan.
In Spring 2024, I was a Research Intern on the Creative Vision team at Snap Research, where I proposed BitsFusion, a 1.99-bit quantization of a text-to-image generative model. In 2022, I was a Research Intern at the Media Lab, Tencent America, exploring the efficiency and robustness of Learned Image Compression and Transformer models. In 2019, I was a full-time Algorithm Engineer at JD, working on face verification and recognition. In 2018, I spent a wonderful time as a Research and Development Intern and a member of the PaddlePaddle team (20.4k stars now) at Baidu, where I helped initiate the deep learning inference framework Paddle-Lite (6.6k stars now), featured at NeurIPS Expo, Baidu Create, and Wave Summit+.
In addition to my academic work, I am passionate about basketball, DOTA/DOTA2, and World of Warcraft. I love Tracy McGrady, Stephen Curry, Lionel Messi, and PIS (YaphetS).
I'm looking forward to collaborations on efficient and trustworthy AI, especially for LLMs and Diffusion Models. If you're interested in working with me, please don't hesitate to contact me.
Email / Google Scholar / Github
Research Interests
My research primarily focuses on Efficient AI and Trustworthy AI. In Efficient AI, I develop techniques that make deep learning models resource-efficient without compromising their accuracy or performance. I design compression methods, such as pruning, quantization, and low-rank approximation, that reduce the size and complexity of deep learning models, facilitating their deployment on resource-constrained devices such as mobile phones and embedded systems.
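To make one of these compression ideas concrete, here is a minimal sketch of symmetric uniform weight quantization in plain Python; the function names and the toy weight list are my own illustration, not code from any of the papers below:

```python
def quantize(weights, bits):
    # Symmetric uniform quantization: map each float weight onto the
    # integer grid [-(2**(bits-1) - 1), 2**(bits-1) - 1] with one shared scale.
    qmax = 2 ** (bits - 1) - 1
    scale = max(abs(w) for w in weights) / qmax
    return [round(w / scale) for w in weights], scale

def dequantize(codes, scale):
    # Recover approximate float weights from the integer codes.
    return [c * scale for c in codes]

weights = [0.8, -0.5, 0.1, -1.0]
codes, scale = quantize(weights, bits=4)   # 4-bit integer codes: [6, -4, 1, -7]
approx = dequantize(codes, scale)          # close to the original weights
```

Storing integer codes plus a single scale is what shrinks the model; practical schemes add per-channel scales, outlier handling, and quantization-aware training on top of this skeleton.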
In Trustworthy AI, I investigate model vulnerability and robustness through adversarial attacks and backdoor attacks. I am dedicated to understanding how AI models fail in the face of these malicious threats and to developing defense mechanisms that enhance the robustness of AI models against such attacks.
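As a concrete example of the attack side, the following is a minimal sketch of the classic Fast Gradient Sign Method (FGSM) against a toy logistic-regression model, in plain Python; the weight vector and input values are made up for illustration and do not come from my papers:

```python
import math

def fgsm(x, y, w, eps):
    # One-step FGSM: x_adv = x + eps * sign(grad_x loss), with the logistic
    # loss log(1 + exp(-y * <w, x>)) for a label y in {-1, +1}.
    score = sum(wi * xi for wi, xi in zip(w, x))
    sig = 1.0 / (1.0 + math.exp(y * score))      # sigmoid(-y * score)
    grad = [-y * wi * sig for wi in w]           # d(loss) / d(x_i)
    sign = lambda g: (g > 0) - (g < 0)
    return [xi + eps * sign(gi) for xi, gi in zip(x, grad)]

w = [1.0, -2.0]                  # toy model weights
x, y = [0.5, 0.2], 1             # clean input and its true label
x_adv = fgsm(x, y, w, eps=0.1)   # roughly [0.4, 0.3]
# The model's score drops from 0.1 to -0.2, so the prediction flips for y = +1.
```

Defenses such as adversarial training fold exactly these perturbed examples back into the training loop.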
Specifically, my research areas include:
Technologies:
Efficient AI: Model Compression & Data Compression.
Trustworthy AI: Model Vulnerability & Robustness.
Algorithm-hardware Co-design for AI Model Acceleration.
Tasks:
Generative AI: Text-to-Image Diffusion Models, Large Language Models, Multimodal Models.
Fundamental Deep Neural Networks: CNN, Transformer.
Image Processing: Learned Image Compression.
Digital Signal Processing: Error Correction Coding, Radio Frequency Neural Network.
Previous "Efficient Deep Learning Reading Group" Sessions:
News
- 10/2024: Invited to deliver a guest lecture, "Model Compression: Pruning, Quantization, and Recent Advances," at Texas A&M University, CSCE 689 Special Topics: Generative AI.
- 10/2024: I’m glad to join the Department of Computer Science at Rice University as a Postdoctoral Associate.
- 09/2024: One paper is accepted by The Thirty-eighth Annual Conference on Neural Information Processing Systems (NeurIPS 2024).
- 09/2024: One paper is accepted as Findings by The 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP 2024).
- 09/2024: One paper is accepted by the 30th Asia and South Pacific Design Automation Conference (ASP-DAC 2025).
- 07/2024: One paper is accepted by The 35th British Machine Vision Conference (BMVC 2024).
- 07/2024: One paper is accepted by European Conference on Computer Vision (ECCV 2024).
- 05/2024: One paper is accepted by IEEE Transactions on Neural Networks and Learning Systems (TNNLS).
- 04/2024: I’m glad to receive the Paul Panayotatos Scholarship at Rutgers University.
- 03/2024: One paper is accepted by IEEE/ACM Design Automation Conference (DAC 2024).
- 02/2024: I’m glad to join the Creative Vision team at Snap Research as a Research Intern. I love the beautiful beaches in Santa Monica and the vibrant life in Los Angeles.
- 12/2023: Two papers are accepted as posters by the Data Compression Conference (DCC 2024).
- 10/2023: One paper is accepted by The International Symposium on High-Performance Computer Architecture (HPCA 2024).
- 09/2023: One paper is accepted by IEEE/ACM International Conference on Computer-Aided Design (ICCAD 2023).
- 07/2023: Invited to deliver a talk, "Efficient Diffusion Models and Large Language Models: Quantization, Pruning, and LoRA." (Video)
- 07/2023: One paper is accepted with Spotlight presentation at ICML'23 NCW Workshop.
- 06/2023: One paper is accepted by the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2023).
- 03/2023: One paper is accepted by The 50th International Symposium on Computer Architecture (ISCA 2023).
- 02/2023: One paper is accepted by IEEE/ACM Design Automation Conference (DAC 2023).
- 02/2023: One paper receives the Best Paper Runner-Up Award with Oral presentation at AAAI’23 DCAA Workshop.
- 11/2022: Two papers are accepted with Oral presentations by the AAAI Conference on Artificial Intelligence (AAAI 2023).
- 10/2022: Presented a poster at the IBM IEEE CAS/EDS – 5th AI Compute Symposium at the IBM Thomas J. Watson Research Center, Yorktown Heights, NY.
- 05/2022: I’m glad to join the Media Lab at Tencent America as a Research Intern, working remotely.
- 03/2022: One paper is accepted by IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2022).
- 09/2021: One paper is accepted by the Conference on Neural Information Processing Systems (NeurIPS 2021).
- 09/2021: One paper is accepted by IEEE/ACM International Conference on Computer-Aided Design (ICCAD 2021).
- 03/2021: One paper is accepted by the ACM/IEEE International Symposium on Computer Architecture (ISCA 2021).
- 02/2021: One paper is accepted by the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2021).
BitsFusion: 1.99 bits Weight Quantization of Diffusion Model
Yang Sui, Yanyu Li, Anil Kag, Yerlan Idelbayev, Junli Cao, Ju Hu, Dhritiman Sagar, Bo Yuan, Sergey Tulyakov, Jian Ren
[NeurIPS 2024] The Thirty-eighth Annual Conference on Neural Information Processing Systems
Daily Paper in Hugging Face
arXiv
Project Page
Video
MoE-I²: Compressing Mixture of Experts Models through Inter-Expert Pruning and Intra-Expert Low-Rank Decomposition
Cheng Yang*, Yang Sui*, Jinqi Xiao, Lingyi Huang, Yu Gong, Yuanlin Duan, Wenqi Jia, Miao Yin, Yu Cheng, Bo Yuan
[EMNLP 2024 Findings] The 2024 Conference on Empirical Methods in Natural Language Processing
Transferable Learned Image Compression-Resistant Adversarial Perturbations
Yang Sui, Zhuohang Li, Ding Ding, Xiang Pan, Xiaozhong Xu, Shan Liu, Zhenzhong Chen
[BMVC 2024] The 35th British Machine Vision Conference, 2024
arXiv
Clean & Compact: Efficient Data-Free Backdoor Defense with Model Compactness
Huy Phan, Jinqi Xiao, Yang Sui, Tianfang Zhang, Zijie Tang, Cong Shi, Yan Wang, Yingying Chen, Bo Yuan
[ECCV 2024] The 18th European Conference on Computer Vision, 2024
PDF
Co-Exploring Sparsification and Low-Rank Decomposition for Compact DNNs
Yang Sui, Miao Yin, Yu Gong, Bo Yuan
[TNNLS] IEEE Transactions on Neural Networks and Learning Systems
PDF
DisDet: Exploring Detectability of Backdoor Attack on Diffusion Models
Yang Sui, Huy Phan, Jinqi Xiao, Tianfang Zhang, Zijie Tang, Cong Shi, Yan Wang, Yingying Chen, Bo Yuan
[arXiv] In submission
arXiv
MOPED: Efficient Motion Planning Engine with Flexible Dimension Support
Lingyi Huang, Yu Gong, Yang Sui, Xiao Zang, Bo Yuan
[HPCA 2024] In Proceedings of the IEEE International Symposium on High-Performance Computer Architecture, 2024
PDF
In-Sensor Radio Frequency Computing for Energy-Efficient Intelligent Radar
Yang Sui, Minning Zhu, Lingyi Huang, Chung-Tse Michael Wu, Bo Yuan
[ICCAD 2023] In Proceedings of the IEEE/ACM International Conference on Computer-Aided Design, 2023
PDF
Corner-to-Center Long-range Context Model for Efficient Learned Image Compression
Yang Sui, Ding Ding, Xiang Pan, Xiaozhong Xu, Shan Liu, Bo Yuan, Zhenzhong Chen
[JVCI] Journal of Visual Communication and Image Representation
PDF
Reconstruction Distortion of Learned Image Compression with Imperceptible Perturbations
Yang Sui, Zhuohang Li, Ding Ding, Xiang Pan, Xiaozhong Xu, Shan Liu, Zhenzhong Chen
[DCC 2024] In Proceedings of the Data Compression Conference, 2024
[NCW@ICML 2023] Neural Compression: From Information Theory to Applications
Spotlight presentation
PDF
Website
DynGMP: Graph Neural Network-based Motion Planning in Unpredictable Dynamic Environments
Wenjin Zhang, Xiao Zang, Lingyi Huang, Yang Sui, Jingjin Yu, Yingying Chen, Bo Yuan
[IROS 2023] In Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2023
PDF
ETTE: Efficient Tensor-Train-based Computing Engine for Deep Neural Networks
Yu Gong, Miao Yin, Lingyi Huang, Jinqi Xiao, Yang Sui, Chunhua Deng, Bo Yuan
[ISCA 2023] In Proceedings of the 50th International Symposium on Computer Architecture, 2023
PDF
DSPIMM: Digital Sparse In-Memory Matrix Vector Multiplier for Communication Applications
Amitesh Sridharan, Fan Zhang, Yang Sui, Bo Yuan, Deliang Fan
[DAC 2023] In Proceedings of the 60th ACM/IEEE Design Automation Conference, 2023
PDF
Towards Sparse and Low-rank Neural Networks with Hybrid Compression
Yang Sui, Wanzhao Yang, Miao Yin, Yu Gong, Bo Yuan
[DCAA@AAAI 2023] The First Workshop on DL-Hardware Co-Design for AI Acceleration (DCAA)
Website
Award
Best Paper Runner-Up Award
CSTAR: Towards Compact and STructured Deep Neural Networks with Adversarial Robustness
Huy Phan, Miao Yin, Yang Sui, Bo Yuan, Saman Zonouz
[AAAI 2023] In Proceedings of the AAAI Conference on Artificial Intelligence, 2023
PDF
Oral
HALOC: Hardware-Aware Automatic Low-Rank Compression for Compact Neural Networks
Jinqi Xiao, Chengming Zhang, Yu Gong, Miao Yin, Yang Sui, Lizhi Xiang, Dingwen Tao, Bo Yuan
[AAAI 2023] In Proceedings of the AAAI Conference on Artificial Intelligence, 2023
PDF
Oral
CEPD: Co-Exploring Pruning and Decomposition for Compact DNN Models
Yang Sui, Wanzhao Yang, Miao Yin, Yu Gong, Bo Yuan
[Preprint]
PDF
ELRT: Efficient Low-Rank Training for Compact Convolutional Neural Networks
Yang Sui, Miao Yin, Yu Gong, Jinqi Xiao, Huy Phan, Bo Yuan
[arXiv]
PDF
Algorithm and Hardware Co-Design of Energy-Efficient LSTM Networks for Video Recognition with Hierarchical Tucker Tensor Decomposition
Yu Gong, Miao Yin, Lingyi Huang, Chunhua Deng, Yang Sui, Bo Yuan
[arXiv]
PDF
HODEC: Towards Efficient High-Order DEcomposed Convolutional Neural Networks
Miao Yin, Yang Sui, Wanzhao Yang, Xiao Zang, Yu Gong, Bo Yuan
[CVPR 2022] In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
PDF
CHIP: CHannel Independence-based Pruning for Compact Neural Networks
Yang Sui, Miao Yin, Yi Xie, Huy Phan, Saman Zonouz, Bo Yuan
[NeurIPS 2021] Advances in Neural Information Processing Systems 34, 2021
PDF
Algorithm and Hardware Co-design for Deep Learning-powered Channel Decoder: A Case Study
Boyang Zhang*, Yang Sui*, Lingyi Huang, Siyu Liao, Chunhua Deng, Bo Yuan
[ICCAD 2021] In Proceedings of the IEEE/ACM International Conference on Computer-Aided Design, 2021
PDF
GoSPA: An Energy-efficient High-performance Globally Optimized SParse Convolutional Neural Network Accelerator
Chunhua Deng, Yang Sui, Siyu Liao, Xuehai Qian, Bo Yuan
[ISCA 2021] In Proceedings of the ACM/IEEE 48th Annual International Symposium on Computer Architecture, 2021
PDF
Towards Efficient Tensor Decomposition-Based DNN Model Compression With Optimization Framework
Miao Yin, Yang Sui, Siyu Liao, Bo Yuan
[CVPR 2021] In Proceedings of The IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2021
PDF
Paddle-Lite (Paddle-Mobile)
Initial contributors: Yang Sui, Ruilong Liu, Jiaying Zhao, Wang Liu, Yonghui Li.
[Baidu] The authors contributed almost equally to this work.
Paddle-Lite Github (6.4k stars)
Paddle-Lite is an updated version of Paddle-Mobile, an open-source deep learning framework designed to make it easy to perform inference on mobile, embedded, and IoT devices. It is compatible with PaddlePaddle and with pre-trained models from other sources, and has been featured at NeurIPS Expo, Baidu Create, and Wave Summit+.
Professional Services
- Program Committee Member and Reviewer:
  - NeurIPS'22, 23, 24
  - ICLR'24
  - ICML'22, 23, 24
  - CVPR'22, 23, 24
  - ICCV'23
  - ECCV'22, 24
  - KDD'23
  - AAAI'22, 23, 24, 25
  - IROS'23
  - TNNLS
Teaching Experiences
- Teaching Assistant at Rutgers University
  - 14:332:351 - Programming Methodology II, Fall 2020 (Instructor: Prof. Saman Zonouz)
  - 14:332:351 - Programming Methodology II, Fall 2023 (Instructor: Prof. Yao Liu)
Supervised Students
Wenjin Zhang, Ph.D. student at Rutgers University
Topic: Quantization in Model Compression.
Justin Ding, Master's student at Rutgers University
Topic: Pruning in Model Compression ("Graduate Special Problem").
Linqi Xiao, Master's student at Rutgers University
Topic: Error Correction Coding ("Graduate Special Problem").
Srinihar Bondalapati, Master's student at Rutgers University
Topic: Quantization in Model Compression.
Yue Wang, Master's student at Rutgers University
Topic: Dataset Distillation and Pruning in Model Compression ("Graduate Special Problem").
Veena Vrushi, Undergraduate student at Rutgers University
Topic: Deep Learning ("Project SUPER Research Program").
Vijay Maddila, Graduate student at Rutgers University
Topic: Large Language Models.
Ayan Patel, High school student at High Technology High School
Topic: Deep Learning.
Zhiyu Chen, Master's student at Rutgers University
Topic: Large Language Models ("Graduate Special Problem").
Talks
Efficient Diffusion Models and Large Language Models: Quantization, Pruning, and LoRA. (Video)
July, 2023.
Model Compression: Pruning, Quantization, and Recent Advances.
Texas A&M University, CSCE 689 Special Topics: Generative AI, October, 2024.
Honors & Awards
- Doctor of Philosophy (Ph.D.)
- Master of Science (M.E.)
- Graduate Student Academic Scholarship, 2017, 2018
- Bachelor of Engineering (B.E.)
- Postgraduate Recommendation (top 10% in EE at Jilin University), 2016
- Outstanding Graduates, 2016
- Second Prize Scholarship, 2014, 2015, 2016
Hobbies & Interests
In addition to my academic work, I am a fan of basketball, soccer, Formula 1, and snooker. I love Tracy McGrady, Stephen Curry, Lionel Messi, and PIS (YaphetS).
I'm an experienced player of DOTA/DOTA2, World of Warcraft, and Warcraft III, playing Druid (Balance Druid) and DH (Havoc Demon Hunter) in WoW, and NE (Night Elf) in Warcraft III. I would like to record some memorable moments:
- World of Warcraft
  - World top-10 DPS as Balance Druid on H4 (Flamebender Ka'graz) in Blackrock Foundry, reported on WCL in 2015.
  - Raid Leader for defeating the Heroic Highmaul raid in Warlords of Draenor in 2014. I gathered fourteen friends, and together we conquered it; an incredibly impressive experience that we will never forget.
- DOTA
  - Member of the DOTA school team (1/5) at Jilin City No.1 High School, 2011.
- Hearthstone
  - Legend, Ladder Rank 147, 2016.
- Diablo III
  - Ladder Rank 698, Witch Doctor, Season 9, 2017.
I take great pleasure in listening to music with a wonderful rhythm, especially R&B, and classical music by Chopin, Bach, and Paganini.
Kim Tae-yeon was my idol during high school, providing me with a strong example and encouragement during my most difficult and depressing times.
Xiaolan (her name is inspired by "Detective Conan"), my adorable grey-and-white cat with sparkling eyes, who appears in my NeurIPS'21 paper "CHIP", is playful, affectionate, and loves to cuddle. Say hi to Xiaolan!