Haoyu Lu
Title
Cited by
Year
Towards artificial general intelligence via a multimodal foundation model
N Fei, Z Lu, Y Gao, G Yang, Y Huo, J Wen, H Lu, R Song, X Gao, T Xiang, ...
Nature Communications 13 (1), 3094, 2022
Cited by: 133*; Year: 2022
WenLan: Bridging vision and language by large-scale multi-modal pre-training
Y Huo, M Zhang, G Liu, H Lu, Y Gao, G Yang, J Wen, H Zhang, B Xu, ...
arXiv preprint arXiv:2103.06561, 2021
Cited by: 119; Year: 2021
COTS: Collaborative two-stream vision-language pre-training model for cross-modal retrieval
H Lu, N Fei, Y Huo, Y Gao, Z Lu, JR Wen
Proceedings of the IEEE/CVF conference on computer Vision and pattern …, 2022
Cited by: 49; Year: 2022
Self-supervised video representation learning with constrained spatiotemporal jigsaw
Y Huo, M Ding, H Lu, Z Lu, T Xiang, JR Wen, Z Huang, J Jiang, S Zhang, ...
Cited by: 17; Year: 2020
VDT: General-purpose Video Diffusion Transformers via Mask Modeling
H Lu, G Yang, N Fei, Y Huo, Z Lu, P Luo, M Ding
ICLR 2024, 2023
Cited by: 13*; Year: 2023
Learning versatile neural architectures by propagating network codes
M Ding, Y Huo, H Lu, L Yang, Z Wang, Z Lu, J Wang, P Luo
arXiv preprint arXiv:2103.13253, 2021
Cited by: 13; Year: 2021
UniAdapter: Unified Parameter-Efficient Transfer Learning for Cross-modal Modeling
H Lu, M Ding, Y Huo, G Yang, Z Lu, M Tomizuka, W Zhan
ICLR 2024, 2023
Cited by: 11; Year: 2023
Multimodal foundation models are better simulators of the human brain
H Lu, Q Zhou, N Fei, Z Lu, M Ding, J Wen, C Du, X Zhao, H Sun, H He, ...
arXiv preprint arXiv:2208.08263, 2022
Cited by: 9; Year: 2022
Compressed video contrastive learning
Y Huo, M Ding, H Lu, N Fei, Z Lu, JR Wen, P Luo
Advances in Neural Information Processing Systems 34, 14176-14187, 2021
Cited by: 9; Year: 2021
LGDN: Language-Guided Denoising Network for Video-Language Modeling
H Lu, M Ding, N Fei, Y Huo, Z Lu
Advances in Neural Information Processing Systems, 2022
Cited by: 7; Year: 2022
Cross-modal contrastive learning for generalizable and efficient image-text retrieval
H Lu, Y Huo, M Ding, N Fei, Z Lu
Machine Intelligence Research 20 (4), 569-582, 2023
Cited by: 5; Year: 2023
Bmu-moco: Bidirectional momentum update for continual video-language modeling
Y Gao, N Fei, H Lu, Z Lu, H Jiang, Y Li, Z Cao
Advances in Neural Information Processing Systems 35, 22699-22712, 2022
Cited by: 3; Year: 2022