Follow
Thiago D. Simão
Title
Cited by
Cited by
Year
WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning.
Q Yang, TD Simão, SH Tindemans, MTJ Spaan
AAAI, 10639-10646, 2021
522021
AlwaysSafe: Reinforcement learning without safety constraint violations during training
TD Simão, N Jansen, MTJ Spaan
Proceedings of the 20th International Conference on Autonomous Agents and …, 2021
302021
Safe Policy Improvement with Baseline Bootstrapping in Factored Environments
TD Simão, MTJ Spaan
Proceedings of the AAAI Conference on Artificial Intelligence 33, 4967-4974, 2019
242019
Safe Policy Improvement with an Estimated Baseline Policy
TD Simão, R Laroche, R Tachet des Combes
Proceedings of the 19th International Conference on Autonomous Agents and …, 2020
22*2020
Safety-constrained reinforcement learning with a distributional safety critic
Q Yang, TD Simão, SH Tindemans, MTJ Spaan
Machine Learning 112 (3), 859-887, 2023
92023
Structure Learning for Safe Policy Improvement
TD Simão, MTJ Spaan
Proceedings of the 28th International Joint Conference on Artificial …, 2019
82019
Robust anytime learning of Markov decision processes
M Suilen, TD Simão, D Parker, N Jansen
Advances in Neural Information Processing Systems 35, 28790-28802, 2022
52022
Training and transferring safe policies in reinforcement learning
Q Yang, TD Simão, N Jansen, SH Tindemans, MTJ Spaan
AAMAS 2022 Workshop on Adaptive Learning Agents, 2022
32022
Safe Policy Improvement for POMDPs via Finite-State Controllers
TD Simão, M Suilen, N Jansen
arXiv preprint arXiv:2301.04939, 2023
22023
A Modern Perspective on Safe Automated Driving for Different Traffic Dynamics Using Constrained Reinforcement Learning
D Kamran, TD Simão, Q Yang, CT Ponnambalam, J Fischer, MTJ Spaan, ...
2022 IEEE 25th International Conference on Intelligent Transportation …, 2022
22022
Decision-Making Under Uncertainty: Beyond Probabilities
T Badings, TD Simão, M Suilen, N Jansen
arXiv preprint arXiv:2303.05848, 2023
12023
Safe Reinforcement Learning From Pixels Using a Stochastic Latent Representation
Y Hogewind, TD Simão, T Kachman, N Jansen
arXiv preprint arXiv:2210.01801, 2022
12022
When a Robot Reaches Out for Human Help
I Andrés, LN de Barros, DD Mauá, TD Simão
Ibero-American Conference on Artificial Intelligence, 277-289, 2018
12018
Planejamento probabilístico com becos sem saída
TD Simão
Universidade de São Paulo, 2017
12017
Planejamento Probabilístico com Becos Sem Saída
TD Simão, LN de Barros, FL Silva
XII Encontro Nacional de Inteligência Artificial e Computacional, 2015
12015
Desenvolvimento de Jogos 3D para a Educação a Distância
UA Leitão, TD Simão, JA Neves
VIII Congresso Brasileiro de Ensino Superior a Distância (ESUD). Ouro Preto …, 2011
12011
More for Less: Safe Policy Improvement With Stronger Performance Guarantees
P Wienhöft, M Suilen, TD Simão, C Dubslaff, C Baier, N Jansen
arXiv preprint arXiv:2305.07958, 2023
2023
Act-Then-Measure: Reinforcement Learning for Partially Observable Environments with Active Measuring
M Krale, TD Simão, N Jansen
arXiv preprint arXiv:2303.08271, 2023
2023
Safe Online and Offline Reinforcement Learning
TD Simão
Delft University of Technology, 2023
2023
Targeted Adversarial Attacks on Deep Reinforcement Learning Policies via Model Checking
D Gross, TD Simao, N Jansen, GA Perez
arXiv preprint arXiv:2212.05337, 2022
2022
The system can't perform the operation now. Try again later.
Articles 1–20