W Bradley Knox

Cituota

	Visi	Nuo 2019
Šaltiniai	3728	2374
h-rodyklė	23	17
i10-rodyklė	33	25

580

290

145

435

200920102011201220132014201520162017201820192020202120222023202423 27 54 69 119 147 139 214 215 294 298 404 466 462 564 175

Viešas pasiekiamumas

Peržiūrėti viską

4 straipsniai

0 straipsnių

pasiekiami

nepasiekiami

Pagal finansavimo įpareigojimus

Bendraautoriai

Peter StoneProfessor of Computer Science, The University of Texas at AustinPatvirtintas el. paštas cs.utexas.edu
Cynthia BreazealProfessor Media Arts and Sciences, MIT Media LabPatvirtintas el. paštas media.mit.edu
Maya CakmakUniversity of WashingtonPatvirtintas el. paštas cs.washington.edu
Bradley C. LoveProfessor of Cognitive and Decision Sciences, University College LondonPatvirtintas el. paštas ucl.ac.uk
Todd KuleszaUser Experience Researcher, GooglePatvirtintas el. paštas google.com
Saleema AmershiMicrosoft ResearchPatvirtintas el. paštas microsoft.com
Alessandro AllieviImperial College LondonPatvirtintas el. paštas imperial.ac.uk
Scott NiekumAssociate Professor, University of Massachusetts AmherstPatvirtintas el. paštas cs.umass.edu
Hayley HungAssociate Professor, Delft University of TechnologyPatvirtintas el. paštas tudelft.nl
Shimon WhitesonProfessor of Computer Science, University of Oxford / Senior Staff Research Scientist, WaymoPatvirtintas el. paštas cs.ox.ac.uk
Guangliang LiAssociate Professor, College of Electrical Engineering, Ocean University of China, Qingdao, ChinaPatvirtintas el. paštas ouc.edu.cn
Ross OttoDepartment of Psychology, McGill UniversityPatvirtintas el. paštas mcgill.ca
Jin Joo Lee, PhDAmazon Lab126Patvirtintas el. paštas amazon.com
W. Todd MaddoxWayne Holtzman Chair and Professor of Psychology, University of TexasPatvirtintas el. paštas utexas.edu
Serena BoothMITPatvirtintas el. paštas mit.edu
Felix SchmittBosch Center for Artificial IntelligencePatvirtintas el. paštas de.bosch.com
Jolie Baumann WormwoodUniversity of New HampshirePatvirtintas el. paštas unh.edu
David DeStenoNortheastern UniversityPatvirtintas el. paštas northeastern.edu
Brian GlassPostdoctoral Researcher of Psychology and Computer Science, University College London, University ofPatvirtintas el. paštas qmul.ac.uk
Samuel SpauldingMedia Lab, Massachusetts Institute of TechnologyPatvirtintas el. paštas media.mit.edu

Stebėti

W Bradley Knox

Research Scientist at UT Austin

Patvirtintas el. paštas cs.utexas.edu - Pagrindinis puslapis

Reward functions Alignment RLHF Reinforcement Learning Human-Robot Interaction


Pavadinimas Rūšiuoti pagal šaltinius Rūšiuoti pagal metus Rūšiuoti pagal pavadinimą	Cituota Cituota	Metai
Power to the people: The role of humans in interactive machine learning S Amershi, M Cakmak, WB Knox, T Kulesza AI Magazine 35 (4), 105-120, 2014	1126	2014
Interactively shaping agents via human reinforcement: The TAMER framework WB Knox, P Stone Proceedings of the 5th International Conference on Knowledge Capture (K-CAP …, 2009	580	2009
Combining manual feedback with subsequent MDP reward signals for reinforcement learning WB Knox, P Stone Proceedings of the 9th International Conference on Autonomous Agents and …, 2010	265	2010
Reinforcement learning from simultaneous human and MDP reward WB Knox, P Stone Proceedings of the 11th International Conference on Autonomous Agents and …, 2012	252*	2012
Tamer: Training an agent manually via evaluative reinforcement WB Knox, P Stone 2008 7th IEEE international conference on development and learning, 292-297, 2008	200	2008
Training a robot via human feedback: A case study WB Knox, P Stone, C Breazeal International Conference on Social Robotics (ICSR), 460-470, 2013	170	2013
Computationally modeling interpersonal trust JJ Lee, B Knox, J Baumann, C Breazeal, D DeSteno Frontiers in psychology 4, 56004, 2013	123	2013
The nature of belief-directed exploratory choice in human decision-making WB Knox, AR Otto, P Stone, B Love Frontiers in Psychology 2, 2012	96	2012
How humans teach agents: A new experimental perspective WB Knox, BD Glass, BC Love, WT Maddox, P Stone International Journal of Social Robotics 4 (4), 409-421, 2012	95	2012
Framing reinforcement learning from human reward: Reward positivity, temporal discounting, episodicity, and performance WB Knox, P Stone Artificial Intelligence 225, 24-50, 2015	76	2015
Reinforcement Learning from Human Reward: Discounting in Episodic Tasks WB Knox, P Stone 21st IEEE International Symposium on Robot and Human Interactive …, 2012	70	2012
Reward (Mis)design for Autonomous Driving WB Knox, A Allievi, H Banzhaf, F Schmitt, P Stone arXiv preprint arXiv:2104.13906, 2021	66	2021
The EMPATHIC Framework for Task Learning from Implicit Human Feedback Y Cui, Q Zhang, A Allievi, P Stone, S Niekum, WB Knox Conference on Robot Learning (CoRL), 2020	56	2020
Learning from Human-Generated Reward WB Knox University of Texas at Austin, 2012	56	2012
Know thine enemy: A champion RoboCup coach agent G Kuhlmann, WB Knox, P Stone Proceedings of the National Conference on Artificial Intelligence 21 (2), 1463, 2006	48	2006
Using informative behavior to increase engagement in the tamer framework G Li, H Hung, S Whiteson, WB Knox Proceedings of the 2013 international conference on autonomous agents and …, 2013	42	2013
Learning non-myopically from human-generated reward WB Knox, P Stone Proceedings of the 2013 international conference on Intelligent user …, 2013	42	2013
Design Principles for Creating Human-Shapable Agents. WB Knox, IR Fasel, P Stone AAAI Spring Symposium: Agents that Learn from Human Teachers, 79-86, 2009	34	2009
Physiological and behavioral signatures of reflective exploratory choice AR Otto, WB Knox, AB Markman, BC Love Cognitive, Affective, & Behavioral Neuroscience 14, 1167-1183, 2014	27	2014
The perils of trial-and-error reward design: misdesign through overfitting and invalid task specifications S Booth, WB Knox, J Shah, S Niekum, P Stone, A Allievi Proceedings of the AAAI Conference on Artificial Intelligence 37 (5), 5920-5929, 2023	25	2023

Sistema negali atlikti operacijos. Bandykite vėliau dar kartą.

Straipsniai 1–20

Šaltinių per metus

Dubliuoti šaltiniai

Sujungti šaltiniai

Pridėti bendraautoriusBendraautoriai

Stebėti

Cituota

Bendraautoriai