Agrim Gupta
PhD Student, Stanford University
Social GAN: Socially acceptable trajectories with generative adversarial networks
A Gupta, J Johnson, L Fei-Fei, S Savarese, A Alahi
Proceedings of the IEEE conference on computer vision and pattern …, 2018
LVIS: A dataset for large vocabulary instance segmentation
A Gupta, P Dollar, R Girshick
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2019
Image generation from scene graphs
J Johnson, A Gupta, L Fei-Fei
Proceedings of the IEEE conference on computer vision and pattern …, 2018
VIMA: General robot manipulation with multimodal prompts
Y Jiang, A Gupta, Z Zhang, G Wang, Y Dou, Y Chen, L Fei-Fei, ...
NeurIPS 2022 Foundation Models for Decision Making Workshop, 2022
Embodied Intelligence via Learning and Evolution
A Gupta, S Savarese, S Ganguli, L Fei-Fei
Nature Communications 12, 5721, 2021
Characterizing and improving stability in neural style transfer
A Gupta, J Johnson, A Alahi, L Fei-Fei
Proceedings of the IEEE International Conference on Computer Vision, 4067-4076, 2017
MaskViT: Masked visual pre-training for video prediction
A Gupta, S Tian, Y Zhang, J Wu, R Martín-Martín, L Fei-Fei
arXiv preprint arXiv:2206.11894, 2022
Open X-Embodiment: Robotic learning datasets and RT-X models
A Padalkar, A Pooley, A Jain, A Bewley, A Herzog, A Irpan, A Khazatsky, ...
arXiv preprint arXiv:2310.08864, 2023
TrajNet: Towards a benchmark for human trajectory prediction
A Sadeghian, V Kosaraju, A Gupta, S Savarese, A Alahi, 2018
MetaMorph: Learning universal controllers with transformers
A Gupta, L Fan, S Ganguli, L Fei-Fei
arXiv preprint arXiv:2203.11931, 2022
RoboCat: A self-improving foundation agent for robotic manipulation
K Bousmalis, G Vezzani, D Rao, C Devin, AX Lee, M Bauza, T Davchev, ...
arXiv preprint arXiv:2306.11706, 2023
VideoPoet: A large language model for zero-shot video generation
D Kondratyuk, L Yu, X Gu, J Lezama, J Huang, R Hornung, H Adam, ...
arXiv preprint arXiv:2312.14125, 2023
Language Model Beats Diffusion--Tokenizer is Key to Visual Generation
L Yu, J Lezama, NB Gundavarapu, L Versari, K Sohn, D Minnen, Y Cheng, ...
arXiv preprint arXiv:2310.05737, 2023
Photorealistic video generation with diffusion models
A Gupta, L Yu, K Sohn, X Gu, M Hahn, L Fei-Fei, I Essa, L Jiang, ...
arXiv preprint arXiv:2312.06662, 2023
Holistic evaluation of text-to-image models
T Lee, M Yasunaga, C Meng, Y Mai, JS Park, A Gupta, Y Zhang, ...
Advances in Neural Information Processing Systems 36, 2024
Siamese masked autoencoders
A Gupta, J Wu, J Deng, FF Li
Advances in Neural Information Processing Systems 36, 2024
A Versatile Diffusion Transformer with Mixture of Noise Levels for Audiovisual Generation
G Kim, A Martinez, YC Su, B Jou, J Lezama, A Gupta, L Yu, L Jiang, ...
arXiv preprint arXiv:2405.13762, 2024