Bowen Shi
Verified email at sjtu.edu.cn
Title
Cited by
Year
Latency-aware differentiable neural architecture search
Y Xu, L Xie, X Zhang, X Chen, B Shi, Q Tian, H Xiong
arXiv preprint arXiv:2001.06392, 2020
Cited by 38, 2020
ITS-frame: A framework for multi-aspect analysis in the field of intelligent transportation systems
X Xu, Y Liu, W Wang, X Zhao, QZ Sheng, Z Wang, B Shi
IEEE Transactions on Intelligent Transportation Systems 20 (8), 2893-2902, 2018
Cited by 31, 2018
Pose-oriented transformer with uncertainty-guided refinement for 2d-to-3d human pose estimation
H Li, B Shi, W Dai, H Zheng, B Wang, Y Sun, M Guo, C Li, J Zou, H Xiong
Proceedings of the AAAI Conference on Artificial Intelligence 37 (1), 1296-1304, 2023
Cited by 14, 2023
A transformer-based decoder for semantic segmentation with multi-level context mining
B Shi, D Jiang, X Zhang, H Li, W Dai, J Zou, H Xiong, Q Tian
European Conference on Computer Vision, 624-639, 2022
Cited by 13, 2022
Hierarchical graph networks for 3d human pose estimation
H Li, B Shi, W Dai, Y Chen, B Wang, Y Sun, M Guo, C Li, J Zou, H Xiong
arXiv preprint arXiv:2111.11927, 2021
Cited by 11, 2021
Adapting Shortcut with Normalizing Flow: An Efficient Tuning Framework for Visual Recognition
Y Wang, B Shi, X Zhang, J Li, Y Liu, W Dai, C Li, H Xiong, Q Tian
2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR …, 2023
Cited by 6, 2023
Multi-dataset pretraining: A unified model for semantic segmentation
B Shi, X Zhang, H Xu, W Dai, J Zou, H Xiong, Q Tian
arXiv preprint arXiv:2106.04121, 2021
Cited by 6, 2021
Rethinking visual prompt learning as masked visual token modeling
N Liao, B Shi, X Zhang, M Cao, J Yan, Q Tian
arXiv preprint arXiv:2303.04998, 2023
Cited by 4, 2023
Tiny-hourglassnet: An efficient design for 3d human pose estimation
B Shi, Y Xu, W Dai, B Wang, S Zhang, C Li, J Zou, H Xiong
2020 IEEE International Conference on Image Processing (ICIP), 1491-1495, 2020
Cited by 4, 2020
Deep neural network-based algorithm approximation via multivariate polynomial regression
C Liu, B Shi, C Li, J Zou, Y Chen, H Xiong
2019 IEEE Global Communications Conference (GLOBECOM), 1-6, 2019
Cited by 2, 2019
UMG-CLIP: A Unified Multi-Granularity Vision Generalist for Open-World Understanding
B Shi, P Zhao, Z Wang, Y Zhang, Y Wang, J Li, W Dai, J Zou, H Xiong, ...
arXiv preprint arXiv:2401.06397, 2024
Cited by 1, 2024
ActionPrompt: Action-Guided 3D Human Pose Estimation With Text and Pose Prompting
H Zheng, H Li, B Shi, W Dai, B Wang, Y Sun, M Guo, H Xiong
2023 IEEE International Conference on Multimedia and Expo (ICME), 2657-2662, 2023
Cited by 1, 2023
AiluRus: A Scalable ViT Framework for Dense Prediction
J Li, Y Wang, X Zhang, B Shi, D Jiang, C Li, W Dai, H Xiong, Q Tian
Advances in Neural Information Processing Systems 36, 2024
2024
VioLET: Vision-Language Efficient Tuning with Collaborative Multi-modal Gradients
Y Wang, Y Liu, X Zhang, J Li, B Shi, C Li, W Dai, H Xiong, Q Tian
Proceedings of the 31st ACM International Conference on Multimedia, 4595-4605, 2023
2023
BarLeRIa: An Efficient Tuning Framework for Referring Image Segmentation
Y Wang, J Li, X Zhang, B Shi, C Li, W Dai, H Xiong, Q Tian
The Twelfth International Conference on Learning Representations, 2023
2023
Hybrid Distillation: Connecting Masked Autoencoders with Contrastive Learners
B Shi, X Zhang, Y Wang, J Li, W Dai, J Zou, H Xiong, Q Tian
arXiv preprint arXiv:2306.15876, 2023
2023
MS3: A Multimodal Supervised Pretrained Model for Semantic Segmentation
B Shi, X Zhang, W Dai, J Zou, H Xiong, Q Tian
2022