结合YOLOv11改进的双视角X光图像违禁品检测

吴海滨; 刘文柏; 袁鹏飞; 王爱丽

doi:10.37188/CO.2026-0062

Article Contents

Chinese Optics > 2026 > Accepted Manuscript

WU Hai-bin, LIU Wen-bai, YUAN Peng-fei, WANG Ai-li. Improved prohibited item detection in double-view X-ray images combined with YOLOv11[J]. Chinese Optics. doi: 10.37188/CO.2026-0062

Citation:

WU Hai-bin, LIU Wen-bai, YUAN Peng-fei, WANG Ai-li. Improved prohibited item detection in double-view X-ray images combined with YOLOv11[J]. Chinese Optics. doi: 10.37188/CO.2026-0062

WU Hai-bin, LIU Wen-bai, YUAN Peng-fei, WANG Ai-li. Improved prohibited item detection in double-view X-ray images combined with YOLOv11[J]. Chinese Optics. doi: 10.37188/CO.2026-0062

Citation:

WU Hai-bin, LIU Wen-bai, YUAN Peng-fei, WANG Ai-li. Improved prohibited item detection in double-view X-ray images combined with YOLOv11[J]. Chinese Optics. doi: 10.37188/CO.2026-0062

PDF( 8339 KB)

Improved prohibited item detection in double-view X-ray images combined with YOLOv11

doi: 10.37188/CO.2026-0062

cstr: 32171.14.CO.2026-0062

Heilongjiang Province Key Laboratory of Laser Spectroscopy Technology and Application, Harbin University of Science and Technology, Harbin 150080, China

Funds: Supported by Natural Science Foundation of Heilongjiang Province of China (No. LH2023F034)

More Information

Corresponding author: aili925@hrbust.edu.cn
Available Online: 04 Jul 2026

Abstract

Abstract

To address the issues of insufficient adaptability in cross-view feature fusion and inadequate utilization of complementary information in existing dual-view X-ray security inspection image prohibited item detection methods, this paper proposes an improved dual-view fusion detection method combined with YOLOv11 (Dual View Fusion combined with YOLOv11, DVF-YOLOv11). The proposed method employs a parameter-shared dual-branch YOLOv11 backbone network to extract multi-scale features from the overlook-view and side-view images, respectively. A Cross-View Attention Fusion (CVAF) module is designed to adaptively enhance dual-view features through a cascaded mechanism of channel attention and spatial attention. An adaptive weight prediction network is introduced to dynamically adjust the fusion weights of each view, and is combined with channel compression convolution to form a dual-path fusion strategy. A joint loss function composed of feature preservation loss, complementarity loss, and weight balance loss is further designed to guide the fusion learning process. On the DvXray dataset, the proposed method achieves an mAP50 of 94.02% and an mAP50-95 of 79.41%, improving by 2.99% and 5.29%, respectively, over the single overlook-view baseline. Experimental results demonstrate that the proposed method improves the accuracy and robustness of prohibited item detection in dual-view X-ray security inspection images.
- X-ray security inspection,
- YOLOv11,
- dual-view fusion,
- adaptive weight fusion

FullText(HTML)

References(23)

References

[1]	林俊豪, 张云飞, 陈少伟, 等. 无监督掩码循环对抗网络实现细胞虚拟染色[J]. 中国光学(中英文), 2026, 19(4), doi: 10.37188/CO.2026-0021. (查阅网上资料,未找到对应的卷期页码信息,请确认). LIN J H, ZHANG Y F, CHEN SH W, et al. Unsupervised masked cycle-adversarial network for cellular virtual staining[J]. Chinese Optics, 2026, 19(4), doi: 10.37188/CO.2026-0021. (in Chinese).
[2]	汪建民, 赵浩冰, 王轲, 等. 无人机飞行单光子动态成像中姿态补偿及重建方法[J]. 中国光学(中英文), 2026, 19(3): 605-618. doi: 10.37188/CO.2026-0004 WANG J M, ZHAO H B, WANG K, et al. Attitude compensation and reconstruction methods for single-photon dynamic imaging during UAV flight[J]. Chinese Optics, 2026, 19(3): 605-618. doi: 10.37188/CO.2026-0004
[3]	XU Y, ZHANG Q Y, SU Q, et al. PIXDet: prohibited item detection in X-ray image based on whole-process feature fusion and local-global semantic dependency interaction[J]. IEEE Transactions on Instrumentation and Measurement, 2023, 72: 5032917. doi: 10.1109/tim.2023.3330184
[4]	WEI Y L, TAO R SH, WU ZH J, et al. Occluded prohibited items detection: an X-ray security inspection benchmark and de-occlusion attention module[C]. Proceedings of the 28th ACM International Conference on Multimedia, ACM, 2020: 138-146.
[5]	TAO R SH, WEI Y L, JIANG X J, et al. Towards real-world X-ray security inspection: a high-quality benchmark and lateral inhibition module for prohibited items detection[C]. 2021 IEEE/CVF International Conference on Computer Vision, IEEE, 2021: 10923-10932.
[6]	ZHU Z M, ZHU Y, WANG H R, et al. FDTNet: enhancing frequency-aware representation for prohibited object detection from X-ray images via dual-stream transformers[J]. Engineering Applications of Artificial Intelligence, 2024, 133: 108076. doi: 10.1016/j.engappai.2024.108076
[7]	刘建军, 冯沛, 廖威, 等. YOLO-STM: 基于Swin-Transformer与MSDA的X光安检图像危险品识别网络[J]. 中国体视学与图像分析, 2024, 29(3): 230-241. doi: 10.13505/j.1007-1482.2024.29.03.008 LIU J J, FENG P, LIAO W, et al. YOLO-STM: a network model for identifying prohibited items in X-ray security inspection images based on Swin-Transformer and MSDA[J]. Chinese Journal of Stereology and Image Analysis, 2024, 29(3): 230-241. (in Chinese). doi: 10.13505/j.1007-1482.2024.29.03.008
[8]	REDMON J, DIVVALA S, GIRSHICK R, et al. You only look once: unified, real-time object detection[C]. Proceedings of 2016 IEEE Conference on Computer Vision and Pattern Recognition, IEEE, 2016: 779-788.
[9]	KHANAM R, HUSSAIN M. YOLOv11: an overview of the key architectural enhancements[J]. arXiv preprint arXiv: 2410.17725, 2024. (查阅网上资料, 请核对文献类型及格式).
[10]	STEITZ J M O, SAEEDAN F, ROTH S. Multi-view X-ray R-CNN[C]. Proceedings of the 40th German Conference on Pattern Recognition, Springer, 2019: 153-168.
[11]	TULI A, BOHRA R, MOGHE T, et al. Automatic threat detection in single, stereo (two) and multi view X-ray images[C]. Proceedings of 2020 IEEE 17th India Council International Conference, IEEE, 2020: 1-7.
[12]	WU M D, YI F F, ZHANG H G, et al. Dualray: dual-view X-ray security inspection benchmark and fusion detection framework[C]. Proceedings of the 5th Chinese Conference on Pattern Recognition and Computer Vision, Springer, 2022: 721-734.
[13]	MENG X L, FENG H, REN Y, et al. Transformer-based dual-view X-ray security inspection image analysis[J]. Engineering Applications of Artificial Intelligence, 2024, 138: 109382. doi: 10.1016/j.engappai.2024.109382
[14]	HONG S L, ZHOU Y Z, XU W C. DAGNet: a dual-view attention-guided network for efficient X-ray security inspection[C]. Proceedings of 2025 International Joint Conference on Neural Networks, IEEE, 2025: 1-8.
[15]	TAO R SH, WANG H Y, GUO Y ZH, et al. Dual-view X-ray detection: can AI detect prohibited items from dual-view X-ray images like humans?[C]. 2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition, IEEE, 2025: 10338-10347.
[16]	MA B W, JIA T, LI M Y, et al. Toward dual-view X-ray baggage inspection: a large-scale benchmark and adaptive hierarchical cross refinement for prohibited item discovery[J]. IEEE Transactions on Information Forensics and Security, 2024, 19: 3866-3878. doi: 10.1109/TIFS.2024.3372797
[17]	VARGHESE R, SAMBATH M. YOLOv8: a novel object detection algorithm with enhanced performance and robustness[C]. Proceedings of the 2024 International Conference on Advances in Data Engineering and Intelligent Computing Systems, IEEE, 2024: 1-6.
[18]	WANG A, CHEN H, LIU L H, et al. YOLOv10: real-time end-to-end object detection[C]. Proceedings of the 38th International Conference on Neural Information Processing Systems, Curran Associates Inc. , 2024: 3429.
[19]	TIAN Y J, YE Q X, DOERMANN D. YOLOv12: attention-centric real-time object detectors[J]. arXiv preprint arXiv: 2502.12524, 2025. (查阅网上资料, 请核对文献类型及格式).
[20]	LEI M Q, LI S Q, WU Y H, et al. YOLOv13: real-time object detection with hypergraph-enhanced adaptive visual perception[J]. arXiv preprint arXiv: 2506.17733, 2025. (查阅网上资料, 请核对文献类型及格式).
[21]	HU J, SHEN L, SUN G. Squeeze-and-excitation networks[C]. Proceedings of 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, IEEE, 2018: 7132-7141.
[22]	WOO S, PARK J, LEE J Y, et al. CBAM: convolutional block attention module[C]. Proceedings of the 15th European Conference on Computer Vision, Springer, 2018: 3-19.
[23]	WANG Q L, WU B G, ZHU P F, et al. ECA-Net: efficient channel attention for deep convolutional neural networks[C]. Proceedings of 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, IEEE, 2020: 11531-11539.

Relative Articles

Supplements(0)

Cited By

Proportional views

Proportional views

通讯作者: 陈斌, bchen63@163.com

1.
沈阳化工大学材料科学与工程学院沈阳 110142

Figures(10) / Tables(4)

Get Citation

PDF

XML

Article views(12) PDF downloads(3)

Improved prohibited item detection in double-view X-ray images combined with YOLOv11

doi: 10.37188/CO.2026-0062

cstr: 32171.14.CO.2026-0062

Abstract

References

Proportional views

Catalog

通讯作者: 陈斌, bchen63@163.com

Proportional views

Related

Improved prohibited item detection in double-view X-ray images combined with YOLOv11

doi: 10.37188/CO.2026-0062 cstr: 32171.14.CO.2026-0062

Abstract

References

Proportional views

Catalog

通讯作者: 陈斌, bchen63@163.com

Proportional views

Related

Export File

Citation

Format

Content

doi: 10.37188/CO.2026-0062

cstr: 32171.14.CO.2026-0062