Yunhao Tang

Sitert av

	Alle	Siden 2019
Sitater	1566	1558
h-indeks	17	17
i10-indeks	25	25

600

300

150

450

20182019202020212022202320246 39 128 190 238 363 597

Offentlig tilgang

Vis alle

4 artikler

0 artikler

tilgjengelige

ikke tilgjengelige

Basert på finansieringsmandater

Medforfattere

Rémi MunosDeepMindVerifisert e-postadresse på inria.fr
Krzysztof ChoromanskiGoogle Brain Robotics New York & Columbia UniversityVerifisert e-postadresse på columbia.edu
Michal ValkoLlama @ Meta Paris & Inria & MVA - Ex: Gemini and BYOL @ Google DeepMindVerifisert e-postadresse på meta.com
Aldo PacchianoBroad Institute of MIT and HarvardVerifisert e-postadresse på broadinstitute.org
Mark RowlandResearch Scientist, Google DeepMindVerifisert e-postadresse på google.com
Will DabneyDeepMindVerifisert e-postadresse på google.com
Shipra AgrawalColumbia universityVerifisert e-postadresse på columbia.edu
Tamás SarlósGoogleVerifisert e-postadresse på google.com
Wenbo GaoColumbia UniversityVerifisert e-postadresse på columbia.edu
Vikas SindhwaniGoogle DeepMind RoboticsVerifisert e-postadresse på google.com
Tadashi KozunoOmron Sinic XVerifisert e-postadresse på sinicx.com
Florent AltchéResearch Engineer, DeepMindVerifisert e-postadresse på google.com
Yuri FaenzaAssociate Professor, IEOR, Columbia UniversityVerifisert e-postadresse på columbia.edu
Alp KucukelbirAdjunct Professor of Computer Science, Columbia UniversityVerifisert e-postadresse på cs.columbia.edu
Adrian WellerDirector of Research, Machine Learning, University of CambridgeVerifisert e-postadresse på eng.cam.ac.uk
Anna ChoromanskaNew York UniversityVerifisert e-postadresse på nyu.edu
Michael I. JordanProfessor of Electrical Engineering and Computer Sciences and Professor of Statistics, UC BerkeleyVerifisert e-postadresse på cs.berkeley.edu
Jiri HronResearch Scientist, Google DeepMindVerifisert e-postadresse på google.com
Steven KapturowskiDeepMindVerifisert e-postadresse på google.com
David AbelResearch Scientist, DeepMindVerifisert e-postadresse på deepmind.com

Følg

Yunhao Tang

Research Scientist, DeepMind

Verifisert e-postadresse på columbia.edu - Startside

Reinforcement Learning


Tittel Sorter etter sitater Sorter etter år Sorter etter tittel	Sitert av Sitert av	År
Gemini: a family of highly capable multimodal models G Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ... arXiv preprint arXiv:2312.11805, 2023	463	2023
Reinforcement learning for integer programming: Learning to cut Y Tang, S Agrawal, Y Faenza International conference on machine learning, 9367-9376, 2020	190	2020
Es-maml: Simple hessian-free meta learning X Song, W Gao, Y Yang, K Choromanski, A Pacchiano, Y Tang arXiv preprint arXiv:1910.01215, 2019	126	2019
Discretizing continuous action space for on-policy optimization Y Tang, S Agrawal Proceedings of the aaai conference on artificial intelligence 34 (04), 5981-5988, 2020	114	2020
Monte-Carlo tree search as regularized policy optimization JB Grill, F Altché, Y Tang, T Hubert, M Valko, I Antonoglou, R Munos International Conference on Machine Learning, 3769-3778, 2020	68	2020
Byol-explore: Exploration by bootstrapped prediction Z Guo, S Thakoor, M Pîslar, B Avila Pires, F Altché, C Tallec, A Saade, ... Advances in neural information processing systems 35, 31855-31870, 2022	53	2022
From complexity to simplicity: Adaptive es-active subspaces for blackbox optimization KM Choromanski, A Pacchiano, J Parker-Holder, Y Tang, V Sindhwani Advances in Neural Information Processing Systems 32, 2019	49	2019
Orthogonal estimation of Wasserstein distances M Rowland, J Hron, Y Tang, K Choromanski, T Sarlos, A Weller The 22nd International Conference on Artificial Intelligence and Statistics …, 2019	46	2019
Provably robust blackbox optimization for reinforcement learning K Choromanski, A Pacchiano, J Parker-Holder, Y Tang, D Jain, Y Yang, ... CoRR, abs/1903.02993, 2019	42	2019
Exploration by distributional reinforcement learning Y Tang, S Agrawal arXiv preprint arXiv:1805.01907, 2018	40	2018
Learning to Score Behaviors for Guided Policy Optimization A Pacchiano, J Parker-Holder, Y Tang, A Choromanska, K Choromanski, ... arXiv preprint arXiv:1906.04349, 2019	38	2019
Boosting trust region policy optimization by normalizing flows policy Y Tang, S Agrawal arXiv preprint arXiv:1809.10326, 2018	33	2018
Self-imitation learning via generalized lower bound q-learning Y Tang Advances in neural information processing systems 33, 13964-13975, 2020	23	2020
Understanding self-predictive learning for reinforcement learning Y Tang, ZD Guo, PH Richemond, BA Pires, Y Chandak, R Munos, ... International Conference on Machine Learning, 33632-33656, 2023	21	2023
Hindsight expectation maximization for goal-conditioned reinforcement learning Y Tang, A Kucukelbir International Conference on Artificial Intelligence and Statistics, 2863-2871, 2021	20	2021
Revisiting Peng’s Q() for Modern Reinforcement Learning T Kozuno, Y Tang, M Rowland, R Munos, S Kapturowski, W Dabney, ... International Conference on Machine Learning, 5794-5804, 2021	18	2021
Taylor expansion policy optimization Y Tang, M Valko, R Munos International Conference on Machine Learning, 9397-9406, 2020	17	2020
An analysis of quantile temporal-difference learning M Rowland, R Munos, MG Azar, Y Tang, G Ostrovski, A Harutyunyan, ... arXiv preprint arXiv:2301.04462, 2023	15	2023
Nash learning from human feedback R Munos, M Valko, D Calandriello, MG Azar, M Rowland, ZD Guo, Y Tang, ... arXiv preprint arXiv:2312.00886, 2023	14	2023
Online hyper-parameter tuning in off-policy learning via evolutionary strategies Y Tang, K Choromanski arXiv preprint arXiv:2006.07554, 2020	14	2020

Systemet kan ikke utføre handlingen. Prøv på nytt senere.

Artikler 1–20

Sitater per år

Duplikatsitater

Sammenslåtte sitater

Legg til medforfattereMedforfattere

Følg

Sitert av

Medforfattere