Thompson sampling for complex online problems A Gopalan, S Mannor, Y Mansour International Conference on Machine Learning, 100-108, 2014 | 142 | 2014 |
On kernelized multi-armed bandits SR Chowdhury, A Gopalan International Conference on Machine Learning, 844-853, 2017 | 113 | 2017 |
Thompson sampling for learning parameterized markov decision processes A Gopalan, S Mannor Conference on Learning Theory, 861-898, 2015 | 76 | 2015 |
On wireless scheduling with partial channel-state information A Gopalan, C Caramanis, S Shakkottai Proc. Ann. Allerton Conf. Communication, Control and Computing, 2007 | 76* | 2007 |
On the Whittle index for restless multiarmed hidden Markov bandits R Meshram, D Manjunath, A Gopalan IEEE Transactions on Automatic Control 63 (9), 3046-3053, 2018 | 35 | 2018 |
Collaborative learning of stochastic bandits over a social network RK Kolla, K Jagannathan, A Gopalan IEEE/ACM Transactions on Networking 26 (4), 1782-1795, 2018 | 32 | 2018 |
Epidemic spreading with external agents S Banerjee, A Gopalan, AK Das, S Shakkottai IEEE Transactions on Information Theory 60 (7), 4125-4138, 2014 | 26 | 2014 |
Random mobility and the spread of infection A Gopalan, S Banerjee, AK Das, S Shakkottai 2011 Proceedings IEEE INFOCOM, 999-1007, 2011 | 26 | 2011 |
Optimizing distributed actor systems for dynamic interactive services A Newell, G Kliot, I Menache, A Gopalan, S Akiyama, M Silberstein Proceedings of the Eleventh European Conference on Computer Systems, 1-15, 2016 | 25 | 2016 |
On distributed scheduling with heterogeneously delayed network-state information AA Reddy, S Banerjee, A Gopalan, S Shakkottai, L Ying Queueing Systems 72 (3), 193-218, 2012 | 18 | 2012 |
Low-rank bandits with latent mixtures A Gopalan, OA Maillard, M Zaki arXiv preprint arXiv:1609.01508, 2016 | 16 | 2016 |
User rankings from comparisons: Learning permutations in high dimensions I Mitliagkas, A Gopalan, C Caramanis, S Vishwanath 2011 49th Annual Allerton Conference on Communication, Control, and …, 2011 | 16 | 2011 |
Battle of Bandits. A Saha, A Gopalan UAI, 805-814, 2018 | 14 | 2018 |
Misspecified linear bandits A Ghosh, SR Chowdhury, A Gopalan Proceedings of the AAAI Conference on Artificial Intelligence 31 (1), 2017 | 14 | 2017 |
Optimal recommendation to users that react: Online learning for a class of POMDPs R Meshram, A Gopalan, D Manjunath 2016 IEEE 55th Conference on Decision and Control (CDC), 7210-7215, 2016 | 13 | 2016 |
Thompson sampling for complex bandit problems A Gopalan, S Mannor, Y Mansour arXiv preprint arXiv:1311.0466, 2013 | 12 | 2013 |
Low-delay wireless scheduling with partial channel-state information A Gopalan, C Caramanis, S Shakkottai 2012 Proceedings IEEE INFOCOM, 1071-1079, 2012 | 12 | 2012 |
A restless bandit with no observable states for recommendation systems and communication link scheduling R Meshram, D Manjunath, A Gopalan 2015 54th IEEE Conference on Decision and Control (CDC), 7820-7825, 2015 | 10 | 2015 |
Online learning in kernelized markov decision processes SR Chowdhury, A Gopalan The 22nd International Conference on Artificial Intelligence and Statistics …, 2019 | 9 | 2019 |
PAC battling bandits in the plackett-luce model A Saha, A Gopalan Algorithmic Learning Theory, 700-737, 2019 | 9 | 2019 |