Justin (Zhaocong) Yuan

ML Research @ Qualcomm | ex-Nvidia | ex-Apple | MASc UofT

profile.png

I obtained my MASc degree in RL and Robotics from University of Toronto, advised by Angela Schoellig at Dynamic Systems Lab (DSL), also part of Vector Institute and UofT Robotics Institute. Previously, I received my BASc degree in Engineering Science from UofT. I was also an intern in Nvidia Toronto AI lab advised by Sanja Fidler, Apple Siri team based in Seattle, and Data-Driven Decision Making Lab advised by Scott Sanner. My prior works largely focus on safe RL, transfer learning, and Sim-to-Real tasks.

I currently work at Qualcomm and my research of focus spans across small language models (SLM), multimodal foundation models (vision & speech), LLM inference and optimization (e.g. speculative decoding, on-device acceleration), and more recently diffusion large language models (dLLM).

In my leisure time, I play fingerstyle guitar :guitar: and badminton :badminton:.

news

Sep 24, 2025 Paper “OmniDraft: A Cross-vocabulary, Online Adaptive Drafter for On-device Speculative Decoding “ accepted at NeurIPS 2025. [link] :sparkles: :sparkles: :sparkles:
Sep 25, 2024 Paper “Stepping Forward on the Last Mile” accepted at NeurIPS 2024. [link] :sparkles: :smile:
Feb 21, 2023 Join Qualcomm as a Machine Learning Research Engineer on the Embedded AI team.
Nov 10, 2022 Graduate from University of Toronto, MASc
Oct 23, 2022 IROS 2022 presentation of our safe-control-gym paper. [link]
May 27, 2022 ICRA 2022 Workshop on Releasing Robots into the Wild: Simulations, Benchmarks, and Deployment (organizer). website/videos
Mar 15, 2022 Vector Institute Industry Workshop (guest speaker).
May 04, 2020 Graduate from University of Toronto, BASc, Engineering Science
Sep 03, 2018 Intern at Nvidia, Toronto, deep learning research on Sim-to-Real transfer and self-driving
May 27, 2018 Intern at Apple, Seattle, on the Siri NLU team

latest posts

selected publications

  1. NeurIPS
    OmniDraft: A Cross-vocabulary, Online Adaptive Drafter for On-device Speculative Decoding
    arXiv preprint arXiv:2507.02659, 2025
  2. arXiv
    Edge-ASR: Towards Low-Bit Quantization of Automatic Speech Recognition Models
    arXiv preprint arXiv:2507.07877, 2025
  3. NeurIPS
    Stepping forward on the last mile
    Chen Feng , Jay Zhuo , Parker Zhang, Ramchalam Kinattinkara Ramakrishnan, Zhaocong Yuan, and Andrew Zou Li
    Advances in Neural Information Processing Systems, 2024
  4. IROS
    safe-Control-Gym: A Unified Benchmark Suite for Safe Learning-Based Control and Reinforcement Learning in Robotics
    IEEE Robotics and Automation Letters, 2022
  5. Annual Reviews
    Safe learning in robotics: From learning-based control to safe reinforcement learning
    Annual Review of Control, Robotics, and Autonomous Systems, 2022
  6. ICCV
    Meta-sim: Learning to generate synthetic datasets
    In Proceedings of the IEEE/CVF International Conference on Computer Vision, 2019