Fully parameterized quantile function for distributional reinforcement learning | D Yang, L Zhao, Z Lin, T Qin, J Bian, TY Liu | Advances in neural information processing systems 32, 2019 | 154 | 2019 |
Maniskill: Generalizable manipulation skill benchmark with large-scale demonstrations | T Mu, Z Ling, F Xiang, D Yang, X Li, S Tao, Z Huang, Z Jia, H Su | arXiv preprint arXiv:2107.14483, 2021 | 90 | 2021 |
Individualized indicator for all: Stock-wise technical indicator optimization with stock embedding | Z Li, D Yang, L Zhao, J Bian, T Qin, TY Liu | Proceedings of the 25th ACM SIGKDD International Conference on Knowledge …, 2019 | 53 | 2019 |
Distributional reward decomposition for reinforcement learning | Z Lin, L Zhao, D Yang, T Qin, TY Liu, G Yang | Advances in neural information processing systems 32, 2019 | 17 | 2019 |
RD²: Reward Decomposition with Representation Decomposition | Z Lin*, D Yang*, L Zhao, T Qin, G Yang, TY Liu | Advances in Neural Information Processing Systems 33, 2020 | 9 | 2020 |
RD2 reward decomposition with representation disentanglement | Z Lin, D Yang, L Zhao, T Qin, G Yang, T Liu | Proceedings of the 34th International Conference on Neural Information …, 2020 | 6 | 2020 |
Individualized Indicator for All | Z Li, D Yang, L Zhao, J Bian, T Qin, TY Liu | Proceedings of the 25th ACM SIGKDD International Conference on Knowledge …, 2019 | 2 | 2019 |
Defensive Quantization Layer For Convolutional Network Against Adversarial Attack | S Song, Q Wang, D Yang, Y Song, X Liu, T Zhang | | | 2019 |