Deep modular co-attention networks for visual question answering Z Yu, J Yu, Y Cui, D Tao, Q Tian Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2019 | 1007 | 2019 |
Multi-modal factorized bilinear pooling with co-attention learning for visual question answering Z Yu, J Yu, J Fan, D Tao Proceedings of the IEEE international conference on computer vision, 1821-1830, 2017 | 832 | 2017 |
Multimodal deep autoencoder for human pose recovery C Hong, J Yu, J Wan, D Tao, M Wang IEEE transactions on image processing 24 (12), 5659-5670, 2015 | 605 | 2015 |
Click prediction for web image reranking using multimodal sparse coding J Yu, Y Rui, D Tao IEEE transactions on image processing 23 (5), 2019-2032, 2014 | 555 | 2014 |
Beyond bilinear: Generalized multimodal factorized high-order pooling for visual question answering Z Yu, J Yu, C Xiang, J Fan, D Tao IEEE transactions on neural networks and learning systems 29 (12), 5947-5959, 2018 | 547 | 2018 |
Hierarchical deep click feature prediction for fine-grained image recognition J Yu, M Tan, H Zhang, Y Rui, D Tao IEEE transactions on pattern analysis and machine intelligence 44 (2), 563-578, 2019 | 518 | 2019 |
Learning to rank using user clicks and visual features for image retrieval J Yu, D Tao, M Wang, Y Rui IEEE transactions on cybernetics 45 (4), 767-779, 2014 | 456 | 2014 |
Adaptive hypergraph learning and its application in image classification J Yu, D Tao, M Wang IEEE Transactions on Image Processing 21 (7), 3262-3272, 2012 | 439 | 2012 |
Multimodal transformer with multi-view visual representation for image captioning J Yu, J Li, Z Yu, Q Huang IEEE transactions on circuits and systems for video technology 30 (12), 4467 …, 2019 | 430 | 2019 |
Deep multimodal distance metric learning using click constraints for image ranking J Yu, X Yang, F Gao, D Tao IEEE transactions on cybernetics 47 (12), 4014-4024, 2016 | 376 | 2016 |
Spatial pyramid-enhanced NetVLAD with weighted triplet loss for place recognition J Yu, C Zhu, J Zhang, Q Huang, D Tao IEEE transactions on neural networks and learning systems 31 (2), 661-674, 2019 | 367 | 2019 |
Activitynet-qa: A dataset for understanding complex web videos via question answering Z Yu, D Xu, J Yu, T Yu, Z Zhao, Y Zhuang, D Tao Proceedings of the AAAI Conference on Artificial Intelligence 33 (01), 9127-9134, 2019 | 360 | 2019 |
iPrivacy: image privacy protection by identifying sensitive objects via deep multi-task learning J Yu, B Zhang, Z Kuang, D Lin, J Fan IEEE Transactions on Information Forensics and Security 12 (5), 1005-1016, 2016 | 325 | 2016 |
Multimodal face-pose estimation with multitask manifold deep learning C Hong, J Yu, J Zhang, X Jin, KH Lee IEEE transactions on industrial informatics 15 (7), 3952-3961, 2018 | 303 | 2018 |
Image-based three-dimensional human pose recovery by multiview locality-sensitive sparse retrieval C Hong, J Yu, D Tao, M Wang IEEE transactions on industrial electronics 62 (6), 3742-3751, 2014 | 287 | 2014 |
Coupled deep autoencoder for single image super-resolution K Zeng, J Yu, R Wang, C Li, D Tao IEEE transactions on cybernetics 47 (1), 27-37, 2015 | 266 | 2015 |
High-order distance-based multiview stochastic learning in image classification J Yu, Y Rui, YY Tang, D Tao IEEE transactions on cybernetics 44 (12), 2431-2442, 2014 | 263 | 2014 |
Semisupervised multiview distance metric learning for cartoon synthesis J Yu, M Wang, D Tao IEEE transactions on image processing 21 (11), 4636-4648, 2012 | 241 | 2012 |
Local deep-feature alignment for unsupervised dimension reduction J Zhang, J Yu, D Tao IEEE transactions on image processing 27 (5), 2420-2432, 2018 | 217 | 2018 |
Leveraging content sensitiveness and user trustworthiness to recommend fine-grained privacy settings for social image sharing J Yu, Z Kuang, B Zhang, W Zhang, D Lin, J Fan IEEE transactions on information forensics and security 13 (5), 1317-1332, 2018 | 212 | 2018 |