Yangchen Pan

Cituota

	Visi	Nuo 2019
Šaltiniai	476	458
h-rodyklė	12	11
i10-rodyklė	14	12

120

201720182019202020212022202320245 12 30 40 77 94 107 110

Viešas pasiekiamumas

Peržiūrėti viską

7 straipsniai

0 straipsnių

pasiekiami

nepasiekiami

Pagal finansavimo įpareigojimus

Stebėti

Yangchen Pan

University of Oxford

Patvirtintas el. paštas eng.ox.ac.uk - Pagrindinis puslapis

Machine learning reinforcement learning deep learning


Pavadinimas Rūšiuoti pagal šaltinius Rūšiuoti pagal metus Rūšiuoti pagal pavadinimą	Cituota Cituota	Metai
Maxmin q-learning: Controlling the estimation bias of q-learning Q Lan, Y Pan, A Fyshe, M White International Conference on Learning Representations 2020, 2020	182	2020
Organizing experience: a deeper look at replay mechanisms for sample-based planning in continuous state domains Y Pan, M Zaheer, A White, A Patterson, M White IJCAI, 2018	55	2018
Accelerated gradient temporal difference learning Y Pan, A White, M White Proceedings of the AAAI Conference on Artificial Intelligence 31 (1), 2017	34	2017
The In-Sample Softmax for Offline Reinforcement Learning C Xiao, H Wang, Y Pan, A White, M White ICLR 2023, 2023	25	2023
Greedy Actor-Critic: A New Conditional Cross-Entropy Method for Policy Improvement S Neumann, S Lim, A Joseph, Y Pan, A White, M White ICLR 2023, 2023	25*	2023
Hill climbing on value estimates for search-control in Dyna Y Pan, H Yao, A Farahmand, M White IJCAI 2019, 2019	18	2019
Fuzzy tiling activations: A simple approach to learning sparse representations online Y Pan, K Banman, M White ICLR 2022, 2022	17	2022
Frequency-based Search-control in Dyna Y Pan, J Mei, A Farahmand ICLR 2020, 2020	16	2020
Reinforcement learning with function-valued action spaces for partial differential equation control Y Pan, A Farahmand, M White, S Nabi, P Grover, D Nikovski International Conference on Machine Learning, 3986-3995, 2018	16	2018
Understanding and mitigating the limitations of prioritized experience replay Y Pan, J Mei, A Farahmand, M White, H Yao, M Rohani, J Luo Uncertainty in Artificial Intelligence, 1561-1571, 2022	15	2022
Incremental truncated LSTD C Gehring, Y Pan, M White IJCAI 2016, 2016	15	2016
Effective sketching methods for value function approximation Y Pan, ES Azer, M White Uncertainty in Artificial Intelligence 2017, 2017	14	2017
Adapting kernel representations online using submodular maximization M Schlegel, Y Pan, J Chen, M White International Conference on Machine Learning, 3037-3046, 2017	11	2017
An implicit function learning approach for parametric modal regression Y Pan, E Imani, A Farahmand, M White Advances in Neural Information Processing Systems 33, 11442-11452, 2020	10	2020
Memory-efficient Reinforcement Learning with Value-based Knowledge Consolidation Q Lan, Y Pan, J Luo, AR Mahmood Transactions on Machine Learning Research, 2023	8*	2023
An Alternate Policy Gradient Estimator for Softmax Policies S Garg, S Tosatto, Y Pan, M White, AR Mahmood AISTATS 2022, 2021	5	2021
An Alternative to Variance: Gini Deviation for Risk-averse Policy Gradient Y Luo, G Liu, P Poupart, Y Pan 2023 Advances in Neural Information Processing Systems (NeurIPS), 2023	4	2023
Understanding the robustness difference between stochastic gradient descent and adaptive gradient methods A Ma, Y Pan, A Farahmand Transactions on Machine Learning Research, 2023	3	2023
Conditionally Optimistic Exploration for Cooperative Deep Multi-Agent Reinforcement Learning X Zhao, Y Pan, C Xiao, S Chandar, J Rajendran UAI 2023, 2023	2	2023
Improving Adversarial Transferability via Model Alignment A Ma, A Farahmand, Y Pan, P Torr, J Gu arXiv preprint arXiv:2311.18495, 2024	1	2024

Sistema negali atlikti operacijos. Bandykite vėliau dar kartą.

Straipsniai 1–20

Šaltinių per metus

Dubliuoti šaltiniai

Sujungti šaltiniai

Pridėti bendraautoriusBendraautoriai

Stebėti

Cituota