Silviu Pitis
Silviu Pitis
University of Toronto, Vector Institute
Verified email at cs.toronto.edu - Homepage
Title
Cited by
Cited by
Year
Maximum Entropy Gain Exploration for Long Horizon Multi-goal Reinforcement Learning
S Pitis, H Chan, S Zhao, B Stadie, J Ba
Thirty-seventh International Conference on Machine Learning (ICML 2020), 2020
222020
Rethinking the Discount Factor in Reinforcement Learning: A Decision Theoretic Approach
S Pitis
The Thirty-Third AAAI Conference on Artificial Intelligence (AAAI-19), 2019
152019
Counterfactual data augmentation using locally factored dynamics
S Pitis, E Creager, A Garg
arXiv preprint arXiv:2007.02863, 2020
142020
Source Traces for Temporal Difference Learning
S Pitis
The Thirty-Second AAAI Conference on Artificial Intelligence (AAAI-18), 2018
92018
Fixed-Horizon Temporal Difference Methods for Stable Reinforcement Learning
K De Asis, A Chan, S Pitis, RS Sutton, D Graves
The Thirty-Fourth AAAI Conference on Artificial Intelligence (AAAI-20), 2020
82020
An Inductive Bias for Distances: Neural Nets that Respect the Triangle Inequality
S Pitis, H Chan, K Jamali, J Ba
Eighth International Conference on Learning Representations (ICLR 2020), 2020
42020
Methods for retrieving alternative contract language using a prototype
S Pitis
The Sixteenth International Conference on Law and Artificial Intelligence …, 2017
22017
ProtoGE: Prototype Goal Encodings for Multi-goal Reinforcement Learning
S Pitis, H Chan, J Ba
The 4th Multidisciplinary Conference on Reinforcement Learning and Decision …, 2019
12019
Objective Social Choice: Using Auxiliary Information to Improve Voting Outcomes
S Pitis, MR Zhang
International Conference on Autonomous Agents and Multi-Agent Systems 2020, 2020
2020
Challenging the MDP Status Quo: An Axiomatic Approach to Rationality for Reinforcement Learning Agents
S Pitis
1st Workshop on Goal Specifications for Reinforcement Learning, FAIM 2018, 2018
2018
Reasoning for reinforcement learning
S Pitis
Hierarchical Reinforcement Learning Workshop at NIPS 2017, 2017
2017
Objective Social Choice: Using Auxiliary Information to Improve Voting Outcomes Download PDF
S Pitis, MR Zhang
The system can't perform the operation now. Try again later.
Articles 1–12