W Bradley Knox
Research Associate Professor at UT Austin
Power to the people: The role of humans in interactive machine learning
S Amershi, M Cakmak, WB Knox, T Kulesza
AI Magazine 35 (4), 105-120, 2014
Interactively shaping agents via human reinforcement: The TAMER framework
WB Knox, P Stone
Proceedings of the 5th International Conference on Knowledge Capture (K-CAP …, 2009
Combining manual feedback with subsequent MDP reward signals for reinforcement learning
WB Knox, P Stone
Proceedings of the 9th International Conference on Autonomous Agents and …, 2010
Reinforcement learning from simultaneous human and MDP reward
WB Knox, P Stone
Proceedings of the 11th International Conference on Autonomous Agents and …, 2012
TAMER: Training an agent manually via evaluative reinforcement
WB Knox, P Stone
2008 7th IEEE international conference on development and learning, 292-297, 2008
Training a robot via human feedback: A case study
WB Knox, P Stone, C Breazeal
International Conference on Social Robotics (ICSR), 460-470, 2013
Computationally modeling interpersonal trust
JJ Lee, B Knox, J Baumann, C Breazeal, D DeSteno
Frontiers in Psychology 4, 56004, 2013
The nature of belief-directed exploratory choice in human decision-making
WB Knox, AR Otto, P Stone, B Love
Frontiers in Psychology 2, 2012
How humans teach agents: A new experimental perspective
WB Knox, BD Glass, BC Love, WT Maddox, P Stone
International Journal of Social Robotics 4 (4), 409-421, 2012
Reward (Mis)design for Autonomous Driving
WB Knox, A Allievi, H Banzhaf, F Schmitt, P Stone
arXiv preprint arXiv:2104.13906, 2021
Framing reinforcement learning from human reward: Reward positivity, temporal discounting, episodicity, and performance
WB Knox, P Stone
Artificial Intelligence 225, 24-50, 2015
Reinforcement Learning from Human Reward: Discounting in Episodic Tasks
WB Knox, P Stone
21st IEEE International Symposium on Robot and Human Interactive …, 2012
The EMPATHIC Framework for Task Learning from Implicit Human Feedback
Y Cui, Q Zhang, A Allievi, P Stone, S Niekum, WB Knox
Conference on Robot Learning (CoRL), 2020
Learning from Human-Generated Reward
WB Knox
University of Texas at Austin, 2012
Know thine enemy: A champion RoboCup coach agent
G Kuhlmann, WB Knox, P Stone
Proceedings of the National Conference on Artificial Intelligence 21 (2), 1463, 2006
Learning non-myopically from human-generated reward
WB Knox, P Stone
Proceedings of the 2013 international conference on Intelligent user …, 2013
Using informative behavior to increase engagement in the TAMER framework
G Li, H Hung, S Whiteson, WB Knox
Proceedings of the 2013 international conference on autonomous agents and …, 2013
The perils of trial-and-error reward design: misdesign through overfitting and invalid task specifications
S Booth, WB Knox, J Shah, S Niekum, P Stone, A Allievi
Proceedings of the AAAI Conference on Artificial Intelligence 37 (5), 5920-5929, 2023
Design Principles for Creating Human-Shapable Agents
WB Knox, IR Fasel, P Stone
AAAI Spring Symposium: Agents that Learn from Human Teachers, 79-86, 2009
Contrastive preference learning: Learning from human feedback without RL
J Hejna, R Rafailov, H Sikchi, C Finn, S Niekum, WB Knox, D Sadigh
arXiv preprint arXiv:2310.13639, 2023