W Bradley Knox
Research Scientist at Google Research
Power to the people: The role of humans in interactive machine learning
S Amershi, M Cakmak, WB Knox, T Kulesza
AI Magazine 35 (4), 105-120, 2014
Interactively shaping agents via human reinforcement: The TAMER framework
WB Knox, P Stone
Proceedings of the 5th International Conference on Knowledge Capture (K-CAP …, 2009
Combining manual feedback with subsequent MDP reward signals for reinforcement learning
WB Knox, P Stone
Proceedings of the 9th International Conference on Autonomous Agents and …, 2010
Reinforcement learning from simultaneous human and MDP reward
WB Knox, P Stone
Proceedings of the 11th International Conference on Autonomous Agents and …, 2012
Training a robot via human feedback: A case study
WB Knox, P Stone, C Breazeal
International Conference on Social Robotics (ICSR), 460-470, 2013
TAMER: Training an agent manually via evaluative reinforcement
WB Knox, P Stone
2008 7th IEEE International Conference on Development and Learning, 292-297, 2008
Computationally modeling interpersonal trust
JJ Lee, B Knox, J Baumann, C Breazeal, D DeSteno
Frontiers in Psychology, 893, 2013
The nature of belief-directed exploratory choice in human decision-making
WB Knox, AR Otto, P Stone, B Love
Frontiers in Psychology 2, 2012
How humans teach agents: A new experimental perspective
WB Knox, BD Glass, BC Love, WT Maddox, P Stone
International Journal of Social Robotics 4 (4), 409-421, 2012
Framing reinforcement learning from human reward: Reward positivity, temporal discounting, episodicity, and performance
WB Knox, P Stone
Artificial Intelligence 225, 24-50, 2015
Reinforcement Learning from Human Reward: Discounting in Episodic Tasks
WB Knox, P Stone
21st IEEE International Symposium on Robot and Human Interactive …, 2012
Learning from Human-Generated Reward
WB Knox
University of Texas at Austin, 2012
Know thine enemy: A champion RoboCup coach agent
G Kuhlmann, WB Knox, P Stone
Proceedings of the National Conference on Artificial Intelligence 21 (2), 1463, 2006
Using informative behavior to increase engagement in the TAMER framework
G Li, H Hung, S Whiteson, WB Knox
Proceedings of the 2013 international conference on Autonomous agents and …, 2013
The EMPATHIC Framework for Task Learning from Implicit Human Feedback
Y Cui, Q Zhang, A Allievi, P Stone, S Niekum, WB Knox
Conference on Robot Learning (CoRL), 2020
Learning non-myopically from human-generated reward
WB Knox, P Stone
Proceedings of the 2013 international conference on Intelligent user …, 2013
Design Principles for Creating Human-Shapable Agents.
WB Knox, IR Fasel, P Stone
AAAI Spring Symposium: Agents that Learn from Human Teachers, 79-86, 2009
Reward (Mis)design for Autonomous Driving
WB Knox, A Allievi, H Banzhaf, F Schmitt, P Stone
arXiv preprint arXiv:2104.13906, 2021
Physiological and behavioral signatures of reflective exploratory choice
AR Otto, WB Knox, AB Markman, BC Love
Cognitive, Affective, & Behavioral Neuroscience 14, 1167-1183, 2014
Using informative behavior to increase engagement while learning from human reward
G Li, S Whiteson, WB Knox, H Hung
Autonomous Agents and Multi-Agent Systems 30, 826-848, 2016