Education

  • 2019.09-2022.10, Ph.D. in Computer Science, School of Computer Science and Technology,
    Hangzhou Dianzi University, China. Supervisor: Jun Yu
  • 2016.09-2019.06, Master in Signal Processing, Faculty of Information Science and Engineering,
    Ningbo University, China. Supervisor: Feng Shao
  • 2012.09-2016.06, Bachelor in Communication Engineering, Faculty of Information Science and Engineering,
    Ningbo University, China.

Research Interests

  • Deep Learning; Multimodal Learning; Medical Vision-Language Learning; Medical Image Analysis; Large Models

Publications

  Journal

  1. Spatio-Temporal and Retrieval-Augmented Modelling for Chest X-Ray Report Generation.
    Yan Yang, Xiaoxing You, Ke Zhang, Zhenqi Fu, Xianyun Wang, Jiajun Ding, Jiamei Sun, Zhou Yu, Qingming Huang, Weidong Han, and Jun Yu.
    IEEE Transactions on Medical Imaging, accepted, (2025).
  2. Token-Mixer: Bind Image and Text in One Embedding Space for Medical Image Reporting.
    Yan Yang, Jun Yu, Zhenqi Fu, Ke Zhang, Ting Yu, Xianyun Wang, Hanliang Jiang, Junhui Lv, Qingming Huang, and Weidong Han.
    IEEE Transactions on Medical Imaging, vol. 43, no. 11, pp. 4017-4028, (2024).
  3. Joint Embedding of Deep Visual and Semantic Features for Medical Report Generation.
    Yan Yang, Jun Yu, Jian Zhang, Weidong Han, Hanliang Jiang, Qingming Huang.
    IEEE Transactions on Multimedia, vol. 25, pp. 167-178, (2023).
  4. Attribute Prototype-guided Iterative Scene Graph for Explainable Radiology Report Generation.
    Ke Zhang, Yan Yang, Jun Yu, Jianping Fan, Hanliang Jiang, Qingming Huang, Weidong Han.
    IEEE Transactions on Medical Imaging, vol. 43, no. 12, pp. 4470-4482, (2024).
  5. A Contrastive Triplet Network for Automatic Chest X-Ray Reporting.
    Yan Yang, Jun Yu, Hanliang Jiang, Weidong Han, Jian Zhang, Wei Jiang.
    Neurocomputing, vol. 502, pp. 71-83, (2022).
  6. Multi-task Paired Masking with Alignment Modeling for Medical Vision-Language Pre-training.
    Ke Zhang, Yan Yang, Jun Yu, Hanliang Jiang, Jianping Fan, Qingming Huang, Weidong Han.
    IEEE Transactions on Multimedia, (2023).
  7. Consistency Conditioned Memory Augmented Dynamic Diagnosis for Medical Visual Question Answering.
    Ting Yu, Binghui Ge, Shuhui Wang, Yan Yang, Qingming Huang, Jun Yu.
    IEEE Journal of Biomedical and Health Informatics, vol. 29, no. 2, pp. 1357-1370, (2024).
  8. Adapter-Enhanced Hierarchical Cross-Modal Pre-training for Lightweight Medical Report Generation.
    Ting Yu, Wangwen Lu, Yan Yang, Weidong Han, Qingming Huang, Jun Yu, Ke Zhang.
    IEEE Journal of Biomedical and Health Informatics, (2025).
  9. Discriminative Dictionary Learning for Retinal Vessel Segmentation Using Fusion of Multiple Features.
    Yan Yang, Feng Shao, Zhenqi Fu, Randi Fu.
    Signal, Image and Video Processing, vol. 13, no. 18, pp. 1529–1537, (2019).
  10. Blood Vessel Segmentation of Fundus Images via Cross-modality Dictionary Learning.
    Yan Yang, Feng Shao, Zhenqi Fu, Randi Fu.
    Applied Optics, vol. 57, no. 25, pp. 7287-7295, (2018).
  11. Automated Quality Assessment of Fundus Images via Analysis of Illumination, Naturalness and Structure.
    Feng Shao, Yan Yang, Qiuping Jiang, Gangyi Jiang, Yo-sung Ho.
    IEEE Access, vol. 6, pp. 806-817, (2018).
  12. Multi-Model Synergistic Gaussian Splatting for Sparse View Synthesis.
    Changyue Shi, Chuxiao Yang, Xinyuan Hu, Yan Yang, Min Tan, Jiajun Ding.
    Image and Vision Computing, (2025).

  Conference

  1. Learning a Simple Low-light Image Enhancer from Paired Low-light Instances.
    Zhenqi Fu, Yan Yang, Xiaotong Tu, Yue Huang, Xinghao Ding, and Kai-Kuang Ma.
    CVPR, 2023.
  2. Unsupervised Underwater Image Restoration: From a Homology Perspective.
    Zhenqi Fu, Huangxing Lin, Yan Yang, Shu Chai, Liyan Sun, Yue Huang, Xinghao Ding.
    AAAI, 2022.
  3. A Study of Perceptual Quality Assessment for Stereoscopic Image Retargeting.
    Zhenqi Fu, Yan Yang, Feng Shao, Xinghao Ding.
    APSIPA, 2019. [Database]

Academic Activities

  Invited Reviewer

  • IEEE TMI
  • IEEE TCSVT
  • IEEE TMM
  • IJCAI 2024
  • AAAI 2022/23
  • Artificial Intelligence in Medicine
  • Information Sciences
  • IEEE Access
  • Neural Processing Letters

Resources

  Links

  • CT-RATE Database: A dataset of chest CT volumes with corresponding radiology text reports. Related paper: Developing Generalist Foundation Models from a Multimodal Dataset for 3D Computed Tomography [2024].
  • RadGenome-Chest CT Database: A comprehensive, large-scale, region-guided 3D chest CT interpretation dataset based on CT-RATE. Related paper: RadGenome-Chest CT: A Grounded Vision-Language Dataset for Chest CT Analysis [2024].
  • M3D Database: A large-scale open-source 3D medical dataset consisting of 120K image-text pairs and 662K instruction-response pairs. Related paper: M3D: Advancing 3D Medical Image Analysis with Multi-Modal Large Language Models [2024].
  • Ultrasound-Report-Generation Database: A public database containing ultrasound images and reports. Related paper: Ultrasound Report Generation with Cross-Modality Feature Alignment via Unsupervised Guidance [2024].
  • FFA-IR Database: An explainable and reliable medical report generation (MRG) benchmark based on FFA images and reports. Related paper: FFA-IR: Towards an explainable and reliable medical report generation benchmark [2021].
  • IU X-Ray Database: A public database containing chest X-rays and related reports. Related paper: Preparing a collection of radiology examinations for distribution and retrieval [2015].
  • MIT MIMIC-CXR Database: A large public database containing chest X-rays and related reports. Related paper: MIMIC-CXR, a de-identified publicly available database of chest radiographs with free-text reports [2019].
  • CheXpert Plus Database: A large public database with chest X-rays and reports. Related paper: CheXpert Plus: Augmenting a Large Chest X-ray Dataset with Text Radiology Reports, Patient Demographics and Additional Image Formats [2024].
  • RadGenome-Brain MRI Database: A comprehensive dataset encompassing segmentation masks of anomaly regions and manually authored reports. Related paper: AutoRG-Brain: Grounded Report Generation for Brain MRI [2024].
  • Bladder Pathology Database: A public database containing bladder pathology images and related reports. Related paper: Pathologist-level interpretable whole-slide cancer diagnosis with deep learning [2019].
  • DRIVE Database: A database for retinal vessel segmentation [2004].

Contact

Email: yangyan@hdu.edu.cn

Last updated on 2020-Feb.