Thiago D. Simão
Title
Cited by
Year
WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning
Q Yang, TD Simão, SH Tindemans, MTJ Spaan
AAAI, 10639-10646, 2021
Cited by 79 · 2021
AlwaysSafe: Reinforcement learning without safety constraint violations during training
TD Simão, N Jansen, MTJ Spaan
Proceedings of the 20th International Conference on Autonomous Agents and …, 2021
Cited by 42 · 2021
Safe Policy Improvement with an Estimated Baseline Policy
TD Simão, R Laroche, R Tachet des Combes
Proceedings of the 19th International Conference on Autonomous Agents and …, 2020
Cited by 29* · 2020
Safe Policy Improvement with Baseline Bootstrapping in Factored Environments
TD Simão, MTJ Spaan
Proceedings of the AAAI Conference on Artificial Intelligence 33, 4967-4974, 2019
Cited by 26 · 2019
Safety-constrained reinforcement learning with a distributional safety critic
Q Yang, TD Simão, SH Tindemans, MTJ Spaan
Machine Learning 112 (3), 859-887, 2023
Cited by 19 · 2023
Robust anytime learning of Markov decision processes
M Suilen, TD Simão, D Parker, N Jansen
Advances in Neural Information Processing Systems 35, 28790-28802, 2022
Cited by 13 · 2022
Structure Learning for Safe Policy Improvement
TD Simão, MTJ Spaan
Proceedings of the 28th International Joint Conference on Artificial …, 2019
Cited by 10 · 2019
Safe Reinforcement Learning From Pixels Using a Stochastic Latent Representation
Y Hogewind, TD Simão, T Kachman, N Jansen
arXiv preprint arXiv:2210.01801, 2022
Cited by 6 · 2022
Decision-making under uncertainty: beyond probabilities: Challenges and perspectives
T Badings, TD Simão, M Suilen, N Jansen
International Journal on Software Tools for Technology Transfer, 1-17, 2023
Cited by 5 · 2023
Reinforcement Learning by Guided Safe Exploration
Q Yang, TD Simão, N Jansen, SH Tindemans, MTJ Spaan
arXiv preprint arXiv:2307.14316, 2023
Cited by 4* · 2023
Safe Policy Improvement for POMDPs via Finite-State Controllers
TD Simão, M Suilen, N Jansen
arXiv preprint arXiv:2301.04939, 2023
Cited by 3 · 2023
A Modern Perspective on Safe Automated Driving for Different Traffic Dynamics Using Constrained Reinforcement Learning
D Kamran, TD Simão, Q Yang, CT Ponnambalam, J Fischer, MTJ Spaan, ...
2022 IEEE 25th International Conference on Intelligent Transportation …, 2022
Cited by 3 · 2022
More for Less: Safe Policy Improvement With Stronger Performance Guarantees
P Wienhöft, M Suilen, TD Simão, C Dubslaff, C Baier, N Jansen
arXiv preprint arXiv:2305.07958, 2023
Cited by 2 · 2023
Planejamento probabilístico com becos sem saída
TD Simão
Universidade de São Paulo, 2017
Cited by 2 · 2017
Utilização de algoritmos genéticos para otimização de soluções para o timetabling escolar
TD Simão
Thesis presented to the Department of Computer Science of the Universidade …, 2013
Cited by 2 · 2013
Scalable Safe Policy Improvement via Monte Carlo Tree Search
A Castellini, F Bianchi, E Zorzi, TD Simão, A Farinelli, MTJ Spaan
International Conference on Machine Learning, 3732-3756, 2023
Cited by 1 · 2023
Act-Then-Measure: Reinforcement Learning for Partially Observable Environments with Active Measuring
M Krale, TD Simão, N Jansen
arXiv preprint arXiv:2303.08271, 2023
Cited by 1 · 2023
When a Robot Reaches Out for Human Help
I Andrés, LN de Barros, DD Mauá, TD Simão
Ibero-American Conference on Artificial Intelligence, 277-289, 2018
Cited by 1 · 2018
Desenvolvimento de Jogos 3D para a Educação a Distância
UA Leitão, TD Simão, JA Neves
VIII Congresso Brasileiro de Ensino Superior a Distância (ESUD). Ouro Preto …, 2011
Cited by 1 · 2011
Risk-aware curriculum generation for heavy-tailed task distributions
C Koprulu, TD Simão, N Jansen, U Topcu
Uncertainty in Artificial Intelligence, 1132-1142, 2023
2023