Martha White
Verified email at ualberta.ca
Title / Cited by / Year
Off-Policy Actor-Critic
T Degris, M White, RS Sutton
Twenty-Ninth International Conference on Machine Learning, 2012
Cited by: 167; Year: 2012
Convex multi-view subspace learning
M White, X Zhang, D Schuurmans, Y Yu
Advances in Neural Information Processing Systems, 1673-1681, 2012
Cited by: 110; Year: 2012
An emphatic approach to the problem of off-policy temporal-difference learning
RS Sutton, AR Mahmood, M White
The Journal of Machine Learning Research 17 (1), 2603-2631, 2016
Cited by: 84; Year: 2016
Estimating the class prior and posterior from noisy positives and unlabeled data
S Jain, M White, P Radivojac
Advances in neural information processing systems, 2693-2701, 2016
Cited by: 51; Year: 2016
Convex Sparse Coding, Subspace Learning, and Semi-Supervised Extensions.
X Zhang, Y Yu, M White, R Huang, D Schuurmans
Proceedings of the AAAI Conference on Artificial Intelligence, 2011
Cited by: 32; Year: 2011
Relaxed clipping: A global training method for robust regression and classification
Y Yu, M Yang, L Xu, M White, D Schuurmans
Advances in Neural Information Processing Systems 23, 2011
Cited by: 31; Year: 2011
Nonparametric semi-supervised learning of class proportions
S Jain, M White, MW Trosset, P Radivojac
arXiv preprint arXiv:1601.01944, 2016
Cited by: 29; Year: 2016
Unifying task specification in reinforcement learning
M White
International Conference on Machine Learning, 2016
Cited by: 27; Year: 2016
Optimal reverse prediction: a unified perspective on supervised, unsupervised and semi-supervised learning
L Xu, M White, D Schuurmans
Proceedings of the 26th International Conference on Machine Learning, 1137-1144, 2009
Cited by: 24; Year: 2009
Recovering true classifier performance in positive-unlabeled learning
S Jain, M White, P Radivojac
Thirty-First AAAI Conference on Artificial Intelligence, 2017
Cited by: 22; Year: 2017
A greedy approach to adapting the trace parameter for temporal difference learning
M White, A White
International Conference on Autonomous Agents & Multiagent Systems, 557-565, 2016
Cited by: 19; Year: 2016
Emphatic temporal-difference learning
AR Mahmood, H Yu, M White, RS Sutton
European Workshop on Reinforcement Learning, 2015
Cited by: 19; Year: 2015
Investigating practical, linear temporal difference learning
A White, M White
Autonomous Agents and Multiagent Systems, 2016
Cited by: 17*; Year: 2016
Learning a Value Analysis Tool for Agent Evaluation.
M White, MH Bowling
International Joint Conference on Artificial Intelligence, 1976-1981, 2009
Cited by: 17; Year: 2009
Partition tree weighting
J Veness, M White, M Bowling, A György
2013 Data Compression Conference, 321-330, 2013
Cited by: 16; Year: 2013
Interval Estimation for Reinforcement-Learning Algorithms in Continuous-State Domains
M White, A White
Advances in Neural Information Processing Systems, 2433–2441, 2010
Cited by: 14; Year: 2010
Organizing experience: a deeper look at replay mechanisms for sample-based planning in continuous state domains
Y Pan, M Zaheer, A White, A Patterson, M White
International Joint Conference on Artificial Intelligence, 2018
Cited by: 11; Year: 2018
An off-policy policy gradient theorem using emphatic weightings
E Imani, E Graves, M White
Advances in Neural Information Processing Systems, 96-106, 2018
Cited by: 11; Year: 2018
Supervised autoencoders: Improving generalization performance with unsupervised regularizers
L Le, A Patterson, M White
Advances in Neural Information Processing Systems, 107-117, 2018
Cited by: 10; Year: 2018
Accelerated Gradient Temporal Difference Learning
Y Pan, A White, M White
Proceedings of the AAAI Conference on Artificial Intelligence, 2016
Cited by: 10; Year: 2016