QinBo Bai

Cited by

	All	Since 2020
Citations	439	437
h-index	10	10
i10-index	11	11

Citations per year
Year	2019	2020	2021	2022	2023	2024	2025
Citations	2	23	42	71	118	163	19

Public access

3 articles

0 articles

available

not available

Based on funding mandates

QinBo Bai

Purdue University

Verified email at purdue.edu

Reinforcement learning, Constraint Optimization


Title	Cited by	Year
Deep learning-based channel estimation algorithm over time selective fading channels Q Bai, J Wang, Y Zhang, J Song IEEE Transactions on Cognitive Communications and Networking 6 (1), 125-134, 2019	161	2019
Achieving zero constraint violation for constrained reinforcement learning via primal-dual approach Q Bai, AS Bedi, M Agarwal, A Koppel, V Aggarwal Proceedings of the AAAI Conference on Artificial Intelligence 36 (4), 3682-3689, 2022	74	2022
Reinforcement learning for constrained Markov decision processes A Gattami, Q Bai, V Aggarwal International Conference on Artificial Intelligence and Statistics, 2656-2664, 2021	34	2021
Achieving zero constraint violation for constrained reinforcement learning via conservative natural policy gradient primal-dual algorithm Q Bai, AS Bedi, V Aggarwal Proceedings of the AAAI Conference on Artificial Intelligence 37 (6), 6737-6744, 2023	28	2023
Regret guarantees for model-based reinforcement learning with long-term average constraints M Agarwal, Q Bai, V Aggarwal Uncertainty in Artificial Intelligence, 22-31, 2022	18	2022
Provably efficient model-free algorithm for MDPs with peak constraints Q Bai, V Aggarwal, A Gattami arXiv preprint arXiv:2003.05555, 2020	18*	2020
Regret analysis of policy gradient algorithm for infinite horizon average reward Markov decision processes Q Bai, WU Mondal, V Aggarwal Proceedings of the AAAI Conference on Artificial Intelligence 38 (10), 10980 …, 2024	16	2024
A reinforcement learning framework for vehicular network routing under peak and average constraints N Geng, Q Bai, C Liu, T Lan, V Aggarwal, Y Yang, M Xu IEEE Transactions on Vehicular Technology 72 (5), 6753-6764, 2023	15	2023
Reinforcement learning for multi-objective and constrained Markov decision processes A Gattami, Q Bai, V Aggarwal arXiv preprint arXiv:1901.08978, 2019	15	2019
Concave utility reinforcement learning with zero-constraint violations M Agarwal, Q Bai, V Aggarwal arXiv preprint arXiv:2109.05439, 2021	14	2021
Joint optimization of multi-objective reinforcement learning with policy gradient based algorithm Q Bai, M Agarwal, V Aggarwal arXiv preprint arXiv:2105.14125, 2021	10	2021
Escaping saddle points for zeroth-order non-convex optimization using estimated gradient descent Q Bai, M Agarwal, V Aggarwal 2020 54th Annual Conference on Information Sciences and Systems (CISS), 1-6, 2020	8	2020
Markov decision processes with long-term average constraints M Agarwal, Q Bai, V Aggarwal arXiv preprint arXiv:2106.06680, 2021	7	2021
Achieving zero constraint violation for concave utility constrained reinforcement learning via primal-dual approach Q Bai, AS Bedi, M Agarwal, A Koppel, V Aggarwal Journal of Artificial Intelligence Research 78, 975-1016, 2023	6	2023
Provably sample-efficient model-free algorithm for MDPs with peak constraints Q Bai, V Aggarwal, A Gattami Journal of Machine Learning Research 24 (60), 1-25, 2023	6	2023
Joint optimization of concave scalarized multi-objective reinforcement learning with policy gradient based algorithm Q Bai, M Agarwal, V Aggarwal Journal of Artificial Intelligence Research 74, 1565-1597, 2022	5	2022
Learning general parameterized policies for infinite horizon average reward constrained MDPs via primal-dual policy gradient algorithm Q Bai, W Mondal, V Aggarwal Advances in Neural Information Processing Systems 37, 108566-108599, 2024	2	2024
Constrained Reinforcement Learning with Average Reward Objective: Model-Based and Model-Free Algorithms V Aggarwal, WU Mondal, Q Bai Foundations and Trends® in Optimization 6 (4), 193-298, 2024	1	2024
Model-free algorithm and regret analysis for MDPs with long-term constraints Q Bai, V Aggarwal, A Gattami arXiv preprint arXiv:2006.05961, 2020	1	2020
Model-Free Algorithms for Constrained Reinforcement Learning in Discounted and Average Reward Settings Q Bai Purdue University, 2024		2024
