Publications

On this page, you can find our publications from 2025, 2024, 2023, 2022, 2021, and before 2021. They are also listed on Google Scholar.

* indicates equal contribution. † indicates corresponding author.

Highlights

U2-BENCH: Benchmarking Large Vision-Language Models on Ultrasound Understanding

We introduce U2-BENCH, the first comprehensive benchmark to evaluate LVLMs on ultrasound understanding across classification, detection, regression, and text generation tasks. U2-BENCH aggregates 7,241 cases spanning 15 anatomical regions and defines 8 clinically inspired tasks, such as diagnosis, view recognition, lesion localization, clinical value estimation, and report generation, across 50 ultrasound application scenarios. We evaluate 20 state-of-the-art LVLMs, both open- and closed-source, general-purpose and medical-specific. Our results reveal strong performance on image-level classification, but persistent challenges in spatial reasoning and clinical language generation.

Anjie Le, Henan Liu, Yue Wang, Zhenyu Liu, Rongkun Zhu, Taohan Weng, Jinze Yu, Boyang Wang, Yalun Wu, Kaiwen Yan, Quanlin Sun, Jialun Pei, Siya Liu, Haoyun Zheng, Zhoujun Li, Alison Noble, Jacques Souquet, Xiaoqing Guo†, Manxi Lin†, Hongcheng Guo†.

Initial submission, 2025.

Infproto-powered Adaptive Classifier and Agnostic Feature Learning for Single Domain Generalization in Medical Images

Medical imaging devices and variations between medical centers can result in domain shift, where images exhibit different styles and characteristics. In this paper, we aim to improve model performance when deploying an AI model trained in one medical center to new clinical scenarios. We reason that domain-adaptive classifier learning and domain-agnostic feature extraction are key components in single domain generalization, and further propose an adaptive infinite prototypes (InfProto) scheme to facilitate the learning of the two components.

Xiaoqing Guo, Jie Liu, Yixuan Yuan.

International Journal of Computer Vision (IJCV), 2024.

MMSummary: Multimodal Summary Generation for Fetal Ultrasound Video

We present MMSummary, the first automated multimodal summary generation system for medical imaging video, with a particular focus on fetal ultrasound analysis. Imitating the examination process performed by a human sonographer, MMSummary is designed as a three-stage pipeline, progressing from keyframe detection to keyframe captioning and finally anatomy segmentation and measurement. The system provides comprehensive summaries of fetal ultrasound examinations to enhance clinical workflow efficiency (reducing scanning time by approximately one third).

Xiaoqing Guo†, Qianhui Men, Alison Noble.

International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI 2024), Marrakesh, Morocco.

Pose-GuideNet: Automatic Scanning Guidance for Fetal Head Ultrasound from Pose Estimation

We investigate how estimating 3D fetal pose from freehand 2D ultrasound scanning can guide a sonographer to locate a head standard plane. Fetal head pose is estimated by the proposed Pose-GuideNet, a novel 2D/3D registration approach to align freehand 2D ultrasound to a 3D anatomical atlas without the acquisition of 3D ultrasound. Evaluations with probe motions demonstrate the feasibility of adopting Pose-GuideNet for freehand ultrasound-assisted navigation in a sensor-free environment.

Qianhui Men, Xiaoqing Guo, Aris Papageorghiou, Alison Noble.

International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI 2024), Marrakesh, Morocco.

2025

  1. Sonomate: Visually grounded language model for fetal ultrasound understanding and human interaction
    Xiaoqing Guo†, Mohammad Alsharid, He Zhao, Yipei Wang, Jayne Lander, Aris T. Papageorghiou, Alison Noble.
    Nature Biomedical Engineering (NBME), 2025.

  2. GaussianReg: Rapid 2D/3D Registration for Emergency Surgery via Explicit 3D Modeling with Gaussian Primitives
    Weihao Yu, Xiaoqing Guo, Xinyu Liu, Yifan Liu, Hao Zheng, Yawen Huang, Yixuan Yuan.
    International Conference on Computer Vision (ICCV 2025), Honolulu, Hawaii.

  3. GeoT: Geometry-guided Instance-dependent Transition Matrix for Semi-supervised Tooth Point Cloud Segmentation
    Weihao Yu, Xiaoqing Guo, Chenxin Li, Yifan Liu, Yixuan Yuan.
    Information Processing in Medical Imaging (IPMI 2025), Kos Island, Greece.

  4. Decoupled Representation Learning for Difference Medical Report Generation
    Chen Yang, Xiaoqing Guo, Yixuan Yuan.
    International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC 2025), Copenhagen, Denmark.

  5. U2-BENCH: Benchmarking Large Vision-Language Models on Ultrasound Understanding
    Anjie Le, Henan Liu, Yue Wang, Zhenyu Liu, Rongkun Zhu, Taohan Weng, Jinze Yu, Boyang Wang, Yalun Wu, Kaiwen Yan, Quanlin Sun, Jialun Pei, Siya Liu, Haoyun Zheng, Zhoujun Li, Alison Noble, Jacques Souquet, Xiaoqing Guo†, Manxi Lin†, Hongcheng Guo†.
    Initial submission, 2025.

2024

  1. Infproto-powered Adaptive Classifier and Agnostic Feature Learning for Single Domain Generalization in Medical Images
    Xiaoqing Guo, Jie Liu, Yixuan Yuan.
    International Journal of Computer Vision (IJCV), 2024.

  2. Disentangle Then Calibrate with Gradient Guidance: A Unified Framework for Common and Rare Disease Diagnosis
    Yuanyuan Chen, Xiaoqing Guo, Yong Xia, Yixuan Yuan.
    IEEE Transactions on Medical Imaging (IEEE TMI), 2024.

  3. Integrated Lithium Niobate Microwave Photonic Processing Engine [CityU News]
    Hanke Feng*, Tong Ge*, Xiaoqing Guo, Benshan Wang, Yiwen Zhang, Zhaoxi Chen, Sha Zhu, Ke Zhang, Wenzhao Sun, Chaoran Huang, Yixuan Yuan, Cheng Wang.
    Nature, 2024.

  4. Dynamic Attribute-guided Few-shot Open-set Network for Medical Image Diagnosis
    Yiwen Luo, Xiaoqing Guo, Li Liu, Yixuan Yuan.
    Expert Systems With Applications, 2024.

  5. MMSummary: Multimodal Summary Generation for Fetal Ultrasound Video
    Xiaoqing Guo†, Qianhui Men, Alison Noble.
    International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI 2024), Marrakesh, Morocco.

  6. Pose-GuideNet: Automatic Scanning Guidance for Fetal Head Ultrasound from Pose Estimation
    Qianhui Men, Xiaoqing Guo, Aris Papageorghiou, Alison Noble.
    International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI 2024), Marrakesh, Morocco.

  7. IterMask^2: Iterative Unsupervised Anomaly Segmentation via Spatial and Frequency Masking for Brain Lesions in MRI [code] (Early Accept, Oral)
    Ziyun Liang, Xiaoqing Guo, Alison Noble, Konstantinos Kamnitsas.
    International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI 2024), Marrakesh, Morocco.

  8. Diversified and Personalized Multi-rater Medical Image Segmentation [code] (Highlight)
    Yicheng Wu, Xiangde Luo, Zhe Xu, Xiaoqing Guo, Lie Ju, Zongyuan Ge, Wenjun Liao, Jianfei Cai.
    IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2024), Seattle, USA.

2023

  1. Handling Open-set Noise and Novel Target Recognition in Domain Adaptive Semantic Segmentation [code]
    Xiaoqing Guo, Jie Liu, Tongliang Liu, Yixuan Yuan.
    IEEE Transactions on Pattern Analysis and Machine Intelligence (IEEE TPAMI), 2023.

  2. Dynamic Feature Splicing for Few-Shot Rare Disease Diagnosis
    Yuanyuan Chen*, Xiaoqing Guo*, Yongsheng Pan, Yong Xia, Yixuan Yuan. (* indicates equal contribution)
    Medical Image Analysis (MedIA), 2023.

  3. Novel Scenes & Classes: Towards Adaptive Open-set Object Detection [code] (Oral)
    Wuyang Li, Xiaoqing Guo, Yixuan Yuan.
    IEEE International Conference on Computer Vision (ICCV 2023), Paris, France.

2022

  1. Semantic-oriented Labeled-to-unlabeled Distribution Translation for Image Segmentation [code] [Zhihu]
    Xiaoqing Guo, Jie Liu, Yixuan Yuan.
    IEEE Transactions on Medical Imaging (IEEE TMI), 2022.

  2. Non-equivalent Images and Pixels: Confidence-aware Resampling with Meta-learning Mixup for Polyp Segmentation
    Xiaoqing Guo, Zhen Chen, Jun Liu, Yixuan Yuan.
    Medical Image Analysis (MedIA), 2022.

  3. D2-Net: Dual Disentanglement Network for Brain Tumor Segmentation with Missing Modalities [code]
    Qiushi Yang, Xiaoqing Guo, Zhen Chen, Peter Y. M. Woo, Yixuan Yuan.
    IEEE Transactions on Medical Imaging (IEEE TMI), 2022.

  4. Graph-based Surgical Instrument Adaptive Segmentation via Domain-Common Knowledge [code]
    Jie Liu, Xiaoqing Guo, Yixuan Yuan.
    IEEE Transactions on Medical Imaging (IEEE TMI), 2022.

  5. Source Free Domain Adaptation for Medical Image Segmentation with Fourier Style Mining [code]
    Xiaoqing Guo, Chen Yang, Zhen Chen, Yixuan Yuan.
    Medical Image Analysis (MedIA), 2022.

  6. SimT: Handling Open-set Noise for Domain Adaptive Semantic Segmentation [code] [Zhihu]
    Xiaoqing Guo, Jie Liu, Tongliang Liu, Yixuan Yuan.
    IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2022), New Orleans, Louisiana, USA.

  7. Joint Class-Affinity Loss Correction for Robust Medical Image Segmentation with Noisy Labels [code]
    Xiaoqing Guo, Yixuan Yuan.
    International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI 2022), Singapore.

  8. Unknown-Oriented Learning for Open Set Domain Adaptation
    Jie Liu, Xiaoqing Guo, Yixuan Yuan.
    European Conference on Computer Vision (ECCV 2022), Tel Aviv, Israel.

  9. Disentangle then Calibrate: Selective Treasure Sharing for Generalized Rare Disease Diagnosis (Early Accept)
    Yuanyuan Chen, Xiaoqing Guo, Yong Xia, Yixuan Yuan.
    International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI 2022), Singapore.

2021

  1. Learn to Threshold: ThresholdNet with Confidence-Guided Manifold Mixup for Polyp Segmentation [code] [MICS Chinese article] (Early Accept)
    Xiaoqing Guo, Chen Yang, Yajie Liu, Yixuan Yuan.
    IEEE Transactions on Medical Imaging (IEEE TMI), 2021.

  2. Dynamic-weighting Hierarchical Segmentation Network for Medical Images [code]
    Xiaoqing Guo, Chen Yang, Yixuan Yuan.
    Medical Image Analysis (MedIA), 2021.

  3. Super-Resolution Enhanced Medical Image Diagnosis with Sample Affinity Interaction [code]
    Zhen Chen, Xiaoqing Guo, Peter Y. M. Woo, Yixuan Yuan.
    IEEE Transactions on Medical Imaging (IEEE TMI), 2021.

  4. Consolidated Domain Adaptive Detection and Localization Framework for Cross-device Colonoscopic Images [code] [video]
    Xinyu Liu, Xiaoqing Guo, Yajie Liu, Yixuan Yuan.
    Medical Image Analysis (MedIA), 2021.

  5. Mutual-Prototype Adaptation for Cross-Domain Polyp Segmentation [code]
    Chen Yang, Xiaoqing Guo, Meilu Zhu, Bulat Ibragimov, Yixuan Yuan.
    IEEE Journal of Biomedical and Health Informatics (JBHI), 2021.

  6. A Morphometric Analysis of Commonly Used Craniometric Approaches for Freehand Ventriculoperitoneal Shunting
    Peter Y. M. Woo, Desiree KK Wong, Yixuan Yuan, Xiaoqing Guo, Michael KW See, Matthew Tam, Alain KS Wong, Kwong-Yau Chan.
    Operative Neurosurgery, 2021.

  7. MetaCorrection: Domain-aware Meta Loss Correction for Unsupervised Domain Adaptation in Semantic Segmentation [code] [video] [MICS Chinese article]
    Xiaoqing Guo*, Chen Yang*, Baopu Li, Yixuan Yuan. (* indicates equal contribution)
    IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2021), Virtual.

  8. COINet: Adaptive Segmentation with Co-Interactive Network for Autonomous Driving
    Jie Liu, Xiaoqing Guo, Baopu Li, Yixuan Yuan.
    IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2021), Prague, Czech Republic.

  9. Prototypical Interaction Graph for Unsupervised Domain Adaptation in Surgical Instrument Segmentation [code] [video] (Early Accept)
    Jie Liu, Xiaoqing Guo, Yixuan Yuan.
    International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI 2021), Strasbourg, France.

  10. Joint Polyp Detection and Segmentation with Heterogeneous Endoscopic Data (Ranked 1st place in polyp detection)
    Wuyang Li, Chen Yang, Jie Liu, Xinyu Liu, Xiaoqing Guo, Yixuan Yuan.
    The 3rd International Endoscopy Computer Vision Challenge and Workshop (EndoCV) of ISBI 2021, Nice, France.

Before 2021

  1. Semi-supervised WCE Image Classification with Adaptive Aggregated Attention [code]
    Xiaoqing Guo, Yixuan Yuan.
    Medical Image Analysis (MedIA), 2020.

  2. RNN-stega: Linguistic steganography based on recurrent neural networks [code]
    Zhongliang Yang, Xiaoqing Guo, Ziming Chen, Yongfeng Huang, Yujin Zhang.
    IEEE Transactions on Information Forensics and Security (IEEE TIFS), 2018.

  3. Triple ANet: Adaptive Abnormal-aware Attention Network for WCE Image Classification [code] (Early Accept)
    Xiaoqing Guo, Yixuan Yuan.
    International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI 2019), Shenzhen, China.

  4. Complementary Network with Adaptive Receptive Fields for Melanoma Segmentation [code]
    Xiaoqing Guo, Zhen Chen, Yixuan Yuan.
    IEEE International Symposium on Biomedical Imaging (ISBI 2020), Iowa City, Iowa, USA.

  5. Domain Knowledge Based Brain Tumor Segmentation and Overall Survival Prediction
    Xiaoqing Guo, Chen Yang, Pak Lun Lam, Peter Y. M. Woo, Yixuan Yuan.
    Brain Lesion (BrainLes) workshop of MICCAI 2019, Shenzhen, China.

  6. Joint Spatial-Wavelet Dual-Stream Network for Super-Resolution [code] (Early Accept)
    Zhen Chen, Xiaoqing Guo, Chen Yang, Bulat Ibragimov, Yixuan Yuan.
    International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI 2020), Lima, Peru.

  7. Prostate Segmentation with Encoder-Decoder Densely Connected Convolutional Network (ED-DenseNet)
    Yixuan Yuan, Wenjian Qin, Xiaoqing Guo, Mark Buyyounouski, Steve Hancock, Bin Han, Lei Xing.
    IEEE International Symposium on Biomedical Imaging (ISBI 2019), Venice, Italy.