Justin (Zhaocong) Yuan

I obtained my MASc degree in RL and Robotics from the University of Toronto, advised by Angela Schoellig at the Dynamic Systems Lab (DSL), also part of the Vector Institute and the UofT Robotics Institute. Previously, I received my BASc degree in Engineering Science from UofT. I also interned at the Nvidia Toronto AI Lab, advised by Sanja Fidler; on the Apple Siri team in Seattle; and at the Data-Driven Decision Making Lab, advised by Scott Sanner. My prior work largely focuses on safe RL, transfer learning, and Sim-to-Real tasks.

I currently work at Qualcomm, developing multi-modal Transformer models for mobile/on-device applications and exploring topics such as model distillation, quantization, and parameter-efficient fine-tuning for LLMs and other foundation models.

In my leisure time, I play fingerstyle guitar and badminton.

news

Sep 25, 2024	Paper “Stepping Forward on the Last Mile” accepted to NeurIPS 2024. [link]
Feb 21, 2023	Joined Qualcomm as a Machine Learning Research Engineer on the Embedded AI (eAI) Team.
Oct 23, 2022	IROS 2022 presentation of our safe-control-gym paper.

latest posts

Oct 16, 2024	Vector Quantization - A Quick Dive

selected publications

IROS

safe-Control-Gym: A Unified Benchmark Suite for Safe Learning-Based Control and Reinforcement Learning in Robotics

Zhaocong Yuan, Adam W Hall, Siqi Zhou, Lukas Brunke, Melissa Greeff, Jacopo Panerati, and Angela P Schoellig

IEEE Robotics and Automation Letters, 2022


In recent years, both reinforcement learning and learning-based control, as well as the study of their safety, which is crucial for deployment in real-world robots, have gained significant traction. However, to adequately gauge the progress and applicability of new results, we need the tools to equitably compare the approaches proposed by the controls and reinforcement learning communities. Here, we propose a new open-source benchmark suite, called safe-control-gym, supporting both model-based and data-based control techniques. We provide implementations for three dynamic systems (the cart-pole and the 1D and 2D quadrotors) and two control tasks (stabilization and trajectory tracking). We propose to extend OpenAI’s Gym API, the de facto standard in reinforcement learning research, with (i) the ability to specify (and query) symbolic dynamics and (ii) constraints, and (iii) (repeatably) inject simulated disturbances in the control inputs, state measurements, and inertial properties. To demonstrate our proposal and in an attempt to bring research communities closer together, we show how to use safe-control-gym to quantitatively compare the control performance, data efficiency, and safety of multiple approaches from the fields of traditional control, learning-based control, and reinforcement learning.

Annual Reviews

Safe learning in robotics: From learning-based control to safe reinforcement learning

Lukas Brunke, Melissa Greeff, Adam W Hall, Zhaocong Yuan, Siqi Zhou, Jacopo Panerati, and Angela P Schoellig

Annual Review of Control, Robotics, and Autonomous Systems, 2022


The last half decade has seen a steep rise in the number of contributions on safe learning methods for real-world robotic deployments from both the control and reinforcement learning communities. This article provides a concise but holistic review of the recent advances made in using machine learning to achieve safe decision-making under uncertainties, with a focus on unifying the language and frameworks used in control theory and reinforcement learning research. It includes learning-based control approaches that safely improve performance by learning the uncertain dynamics, reinforcement learning approaches that encourage safety or robustness, and methods that can formally certify the safety of a learned control policy. As data- and learning-based robot control methods continue to gain traction, researchers must understand when and how to best leverage them in real-world scenarios where safety is imperative, such as when operating in close proximity to humans. We highlight some of the open challenges that will drive the field of robot learning in the coming years, and emphasize the need for realistic physics-based benchmarks to facilitate fair comparisons between control and reinforcement learning approaches.

ICCV

Meta-Sim: Learning to generate synthetic datasets

Amlan Kar, Aayush Prakash, Ming-Yu Liu, Eric Cameracci, Justin Yuan, Matt Rusiniak, David Acuna, Antonio Torralba, and Sanja Fidler

In Proceedings of the IEEE/CVF International Conference on Computer Vision, 2019

Oral

Training models to high-end performance requires availability of large labeled datasets, which are expensive to get. The goal of our work is to automatically synthesize labeled datasets that are relevant for a downstream task. We propose Meta-Sim, which learns a generative model of synthetic scenes and obtains images as well as their corresponding ground truth via a graphics engine. We parametrize our dataset generator with a neural network, which learns to modify attributes of scene graphs obtained from probabilistic scene grammars, so as to minimize the distribution gap between its rendered outputs and target data. If the real dataset comes with a small labeled validation set, we additionally aim to optimize a meta-objective, i.e., downstream task performance. Experiments show that the proposed method can greatly improve content generation quality over a human-engineered probabilistic scene grammar, both qualitatively and quantitatively as measured by performance on a downstream task.