Corralling stochastic bandit algorithms R Arora, TV Marinov, M Mohri International Conference on Artificial Intelligence and Statistics, 2116-2124, 2021 | 44 | 2021 |
Stochastic approximation for canonical correlation analysis R Arora, TV Marinov, P Mianjy, N Srebro Advances in Neural Information Processing Systems 30, 2017 | 44 | 2017 |
Streaming Kernel PCA with Random Features E Ullah, P Mianjy, TV Marinov, R Arora Advances in Neural Information Processing Systems 31, 2018 | 42 | 2018 |
Beyond value-function gaps: Improved instance-dependent regret bounds for episodic reinforcement learning C Dann, TV Marinov, M Mohri, J Zimmert Advances in Neural Information Processing Systems 34, 1-12, 2021 | 40 | 2021 |
Stochastic optimization for multiview representation learning using partial least squares R Arora, P Mianjy, T Marinov International Conference on Machine Learning, 1786-1794, 2016 | 35 | 2016 |
Bandits with feedback graphs and switching costs R Arora, TV Marinov, M Mohri Advances in Neural Information Processing Systems 32, 2019 | 34 | 2019 |
The Pareto Frontier of model selection for general Contextual Bandits TV Marinov, J Zimmert Advances in Neural Information Processing Systems 34, 17956-17967, 2021 | 27 | 2021 |
A mechanism for sample-efficient in-context learning for sparse retrieval tasks J Abernethy, A Agarwal, TV Marinov, MK Warmuth International Conference on Algorithmic Learning Theory, 3-46, 2024 | 23 | 2024 |
Streaming principal component analysis in noisy setting TV Marinov, P Mianjy, R Arora International Conference on Machine Learning, 3413-3422, 2018 | 23 | 2018 |
Policy regret in repeated games R Arora, M Dinitz, TV Marinov, M Mohri Advances in Neural Information Processing Systems 31, 2018 | 18 | 2018 |
Dimension Independent Generalization of DP-SGD for Overparameterized Smooth Convex Optimization YA Ma, TV Marinov, T Zhang arXiv preprint arXiv:2206.01836, 2022 | 9 | 2022 |
Stochastic online learning with feedback graphs: Finite-time and asymptotic optimality TV Marinov, M Mohri, J Zimmert Advances in Neural Information Processing Systems 35, 24947-24959, 2022 | 6 | 2022 |
Efficient convex relaxations for streaming PCA R Arora, TV Marinov Advances in Neural Information Processing Systems 32, 2019 | 4 | 2019 |
Open Problem: Finite-Time Instance Dependent Optimality for Stochastic Online Learning with Feedback Graphs TV Marinov, M Mohri, J Zimmert Conference on Learning Theory, 5644-5649, 2022 | 3 | 2022 |
Private Stochastic Convex Optimization: Efficient Algorithms for Non-smooth Objectives R Arora, TV Marinov, E Ullah arXiv preprint arXiv:2002.09609, 2020 | 3 | 2020 |
Multiple-policy High-confidence Policy Evaluation C Dann, M Ghavamzadeh, TV Marinov International Conference on Artificial Intelligence and Statistics, 9470-9487, 2023 | 1 | 2023 |
Incentive-compatible Bandits: Importance Weighting No More J Zimmert, TV Marinov arXiv preprint arXiv:2405.06480, 2024 | | 2024 |
Offline Imitation Learning from Multiple Baselines with Applications to Compiler Optimization TV Marinov, A Agarwal, M Trofin arXiv preprint arXiv:2403.19462, 2024 | | 2024 |
Leveraging User-Triggered Supervision in Contextual Bandits A Agarwal, C Gentile, TV Marinov arXiv preprint arXiv:2302.03784, 2023 | | 2023 |