Dualformer: Controllable Fast and Slow Thinking by Learning with Randomized Reasoning Traces
DiJia Su, Sainaa Sukhbaatar, Michael Rabbat, Yuandong Tian, Qinqing Zheng
[paper]
Beyond A*: Better Planning with Transformers via Search Dynamics Bootstrapping
Lucas Lehnert, Sainaa Sukhbaatar, DiJia Su, Qinqing Zheng, Paul Mcvay, Michael Rabbat, Yuandong Tian
COLM 2024
[paper]
Diffusion World Model: Future Modeling Beyond Step-by-Step Rollout for Offline Reinforcement Learning
Zihan Ding, Amy Zhang, Yuandong Tian, Qinqing Zheng
ICLR 2024 Generative Models for Decision Making Workshop
[paper]
Guided Flows for Generative Modeling and Decision Making
Qinqing Zheng, Matt Le, Neta Shaul, Yaron Lipman, Aditya Grover, Ricky T. Q. Chen
[paper]
Dual RL: Unification and New Methods for Reinforcement and Imitation Learning
Harshit Sikchi, Qinqing Zheng, Amy Zhang, Scott Niekum
ICLR 2024
(Spotlight)
[paper]
[code]
Semi-Supervised Offline Reinforcement Learning with Action-Free Trajectories
Qinqing Zheng, Mikael Henaff, Brandon Amos, Aditya Grover
ICML 2023
[paper]
[code]
ConserWeightive Behavioral Cloning for Reliable Offline Reinforcement Learning
Tung Nguyen, Qinqing Zheng, Aditya Grover
NeurIPS 2022 Foundation Models for Decision Making Workshop
[paper]
[code]
Latent State Marginalization as a Low-cost Approach for Improving Exploration
Dinghuai Zhang, Aaron Courville, Yoshua Bengio, Qinqing
Zheng, Amy Zhang, Ricky T. Q. Chen
ICLR 2023
[paper]
Online Decision Transformer
Qinqing Zheng, Amy Zhang, Aditya Grover
ICML 2022
(Long Oral Presentation)
[paper]
[code]
[poster]
Near-Optimal Confidence Sequences for Bounded Random Variables
Arun Kumar Kuchibhotla*, Qinqing Zheng* (*Equal contribution)
ICML 2021
(Spotlight)
[paper]
[code]
A Theorem of the Alternative for Personalized Federated Learning
Shuxiao Chen, Qinqing Zheng, Qi Long, Weijie Su
Submitted.
[paper]
Federated \(f\)-Differential Privacy
Qinqing Zheng, Shuxiao Chen, Qi Long, Weijie Su
AISTATS 2021
[paper]
[code]
Sharp Composition Bounds for Gaussian Differential Privacy via Edgeworth Expansion
Qinqing Zheng, Jinshuo Dong, Qi Long, Weijie Su
ICML 2020
[paper]
[code]
ShadowSync: Performing Synchronization in the Background for Highly Scalable Distributed Training
Qinqing Zheng, Bor-Yiing Su, Jiyan Yang, Alisson Azzolini, Qiang Wu, Ou Jin, Shri Karandikar, Hagay Lupesko, Liang Xiong, Eric Zhou
[paper]
Convergence Analysis for Rectangular Matrix Completion Using Burer-Monteiro Factorization and Gradient Descent
Qinqing Zheng, John Lafferty
[paper]
A Convergent Gradient Descent Algorithm for Rank Minimization and Semidefinite Programming from Random Linear Measurements
Qinqing Zheng, John Lafferty
NeurIPS 2015
[paper]
[poster]
Interpolating Convex and Non-Convex Tensor Decompositions via the Subspace Norm
Qinqing Zheng , Ryota Tomioka
NeurIPS 2015
[paper]
[code]
[poster]