Papers

Statistical Efficiency of Distributional Temporal Difference
Yang Peng, Liangyu Zhang, and Zhihua Zhang
NeurIPS 2024 (oral).
[Paper]
Estimation and Inference in Distributional Reinforcement Learning
Liangyu Zhang, Yang Peng, Jiadong Liang, Wenhao Yang, and Zhihua Zhang
Revise and resubmit at The Annals of Statistics.
[Paper]
Statistical Estimation of Confounded Linear MDPs: An Instrumental Variable Approach
Miao Lu, Wenhao Yang, Liangyu Zhang, and Zhihua Zhang
Preprint, arXiv:2209.05186.
[Paper]
Semi-Infinitely Constrained Markov Decision Processes and Provably Efficient Reinforcement Learning
Liangyu Zhang, Yang Peng, Wenhao Yang, and Zhihua Zhang
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024.
[Paper]
Semi-Infinitely Constrained Markov Decision Processes
Liangyu Zhang, Yang Peng, Wenhao Yang, and Zhihua Zhang
NeurIPS 2022.
[Paper]
Toward Theoretical Understandings of Robust Markov Decision Processes: Sample Complexity and Asymptotics
Wenhao Yang, Liangyu Zhang, and Zhihua Zhang
The Annals of Statistics, 2022.
[Paper]