Shihua Huang (黄世华)

Shihua Huang is currently a senior AI engineer in Intellindust from May 2023. Prior to that, he was a PhD student with the Dept. of Computer Science and Engineering at Michigan State University, USA.

His research interests lie in practical, robust, and efficient AI algorithms, including real-time object detection and open-vocabulary object detection. He also dedicates significant time to deploying these algorithms on edge devices in industrial applications.

Email  /  CV  /  简历  /  Bio  /  Google Scholar  /  Github

profile photo
News
Selected Research
PontTuset DEIM: DETR with Improved Matching for Fast Convergence
Shihua Huang, Zhichao Lu, Xiaodong Cun, Yongjun Yu, Xiao Zhou, Xi Shen,
CVPR , 2025  
project page / arXiv / code

DEIM is an advanced training framework designed to enhance the matching mechanism in DETRs, enabling faster convergence and improved performance.

PontTuset Revisiting Residual Networks for Adversarial Robustness: An Architectural Perspective
Shihua Huang, Zhichao Lu, Kalyanmoy Deb, Vishnu Boddeti,
CVPR , 2023  
project page / arXiv / code

Given systematic ablative experiments, insights are derived for RobustScaling and RobustResblock which are then combined for RobustResNets. RobustResNets consistently outperform both the standard WRNs and other existing robust architectures, achieving state-of-the-art AutoAttack robust accuracy of 61.1% without additional data and 63.7% with 500K external data while being 2× more compact in terms of parameters.

PontTuset FaPN: Feature-aligned Pyramid Network for Dense Image Prediction
Shihua Huang, Zhichao Lu, Ran Cheng, Cheng He
ICCV, 2021  
project page / arXiv / code

FaPN is a simple yet effective top-down pyramidal architecture to generate multi-scale features for dense image prediction. It improves FPN's AP / mIoU by 1.5 - 2.6% on all tasks. It achieved 56.7% mIoU over ADE20k-150 when paired with MaskFormer.

PontTuset Evolutionary Multi-Objective Optimization Driven by Generative Adversarial Networks (GANs)
Cheng He, Shihua Huang, Ran Cheng, Kay Chen Tan, Yaochu Jin
TCYB , 2020  
arXiv / code
Service
Reviewer: IEEE Trans. on Neural Network and Learning System, IEEE Trans. on Multi Media, IEEE Trans. on Cognitive and Developmental Systems, Neural Networks, Applied Soft Computing, and Complex & Intelligent Systems.