Følg
Adam White
Adam White
University of Alberta, Amii (Alberta Machine Intelligence Institute)
Verifisert e-postadresse på ualberta.ca - Startside
Tittel
Sitert av
Sitert av
År
Horde: A scalable real-time architecture for learning knowledge from unsupervised sensorimotor interaction
RS Sutton, J Modayil, M Delp, T Degris, PM Pilarski, A White, D Precup
The 10th International Conference on Autonomous Agents and Multiagent …, 2011
6182011
RL-Glue: Language-independent software for reinforcement-learning experiments
B Tanner, A White
The Journal of Machine Learning Research 10, 2133-2136, 2009
1692009
Multi-timescale nexting in a reinforcement learning robot
J Modayil, A White, RS Sutton
Adaptive Behavior 22 (2), 146-160, 2014
1452014
Developing a predictive approach to knowledge
A White
University of Alberta, 2015
832015
Feature construction for reinforcement learning in hearts
NR Sturtevant, AM White
Computers and Games: 5th International Conference, CG 2006, Turin, Italy …, 2007
832007
Loss of plasticity in continual deep reinforcement learning
Z Abbas, R Zhao, J Modayil, A White, MC Machado
Conference on Lifelong Learning Agents, 620-636, 2023
632023
Report on the 2008 reinforcement learning competition
S Whiteson, B Tanner, A White
AI Magazine 31 (2), 81-81, 2010
582010
Organizing experience: a deeper look at replay mechanisms for sample-based planning in continuous state domains
Y Pan, M Zaheer, A White, A Patterson, M White
arXiv preprint arXiv:1806.04624, 2018
562018
Adapting behavior via intrinsic reward: A survey and empirical study
C Linke, NM Ady, M White, T Degris, A White
Journal of artificial intelligence research 69, 1287-1332, 2020
522020
Gradient temporal-difference learning with regularized corrections
S Ghiassian, A Patterson, S Garg, D Gupta, A White, M White
International Conference on Machine Learning, 3524-3534, 2020
502020
A greedy approach to adapting the trace parameter for temporal difference learning
M White, A White
arXiv preprint arXiv:1607.00446, 2016
472016
Investigating practical linear temporal difference learning
A White, M White
arXiv preprint arXiv:1602.08771, 2016
472016
General value function networks
M Schlegel, A Jacobsen, Z Abbas, A Patterson, A White, M White
Journal of Artificial Intelligence Research 70, 497-543, 2021
442021
Interval Estimation for Reinforcement-Learning Algorithms in Continuous-State Domains
M White, A White
Advances in Neural Information Processing Systems, 2010
402010
Improving performance in reinforcement learning by breaking generalization in neural networks
S Ghiassian, B Rafiee, YL Lo, A White
arXiv preprint arXiv:2003.07417, 2020
382020
Surprise and curiosity for big data robotics
A White, J Modayil, RS Sutton
Workshops at the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014
382014
Accelerated gradient temporal difference learning
Y Pan, A White, M White
Proceedings of the AAAI Conference on Artificial Intelligence 31 (1), 2017
352017
Online off-policy prediction
S Ghiassian, A Patterson, M White, RS Sutton, A White
arXiv preprint arXiv:1811.02597, 2018
322018
Scaling life-long off-policy learning
RSS Adam White, Joseph Modayil
2012 IEEE International Conference on Development and Learning and …, 2013
32*2013
The in-sample softmax for offline reinforcement learning
C Xiao, H Wang, Y Pan, A White, M White
arXiv preprint arXiv:2302.14372, 2023
302023
Systemet kan ikke utføre handlingen. Prøv på nytt senere.
Artikler 1–20