Turn off MathJax
Article Contents
WANG Bo-xiao, SONG Yan-song, DONG Xiao-na. Indistinguishable points attention-aware network for infrared small object detection[J]. Chinese Optics. doi: 10.37188/CO.2023-0178
Citation: WANG Bo-xiao, SONG Yan-song, DONG Xiao-na. Indistinguishable points attention-aware network for infrared small object detection[J]. Chinese Optics. doi: 10.37188/CO.2023-0178

Indistinguishable points attention-aware network for infrared small object detection

doi: 10.37188/CO.2023-0178
Funds:  Supported by National key research and development program (No. 2022YFB3902505); Key project of national natural science foundation of China (No. U2141231); National natural science foundation of China (No. 62305032)
More Information
  • Corresponding author: songyansong2006@126.com
  • Received Date: 11 Oct 2023
  • Accepted Date: 05 Dec 2023
  • Available Online: 16 Jan 2024
  • As aircraft maneuverability increases, multi-frame infrared small target detection methods are becoming insufficient to meet detection requirements. In recent years, significant progress has been achieved in single-frame infrared small-target detection methods based on deep learning; however, infrared small targets often lack shape features and have blurred boundaries and backgrounds, obstructing accurate segmentation. Based on this, an indistinguishable points attention-aware network for infrared small object detection was proposed. First, potential target areas were acquired through a point-based region proposal module while filtering out redundant backgrounds; then, to achieve high-quality segmentation, the fine mask boundary module determined disordered, non-local indistinguishable points in the coarse mask, fused multi-scale features, and modeled the attention pixel by pixel; finally, the point detection head generated a fine segmentation mask by re-predicting the indistinguishable points’ attention-aware features. The proposed method reached 87.4 mAP and 63.4 mAP on the publicly available datasets NUDT-SIRST and IRDST, and the F-measure reached 0.8935 and 0.7056, respectively. It can achieve accurate segmentation in multi-detection scenarios and multi-target morphology, suppressing false alarm information while controlling the computational overhead.

     

  • loading
  • [1]
    单秋莎, 谢梅林, 刘朝晖, 等. 制冷型长波红外光学系统设计[J]. 中国光学,2022,15(1):72-78. doi: 10.37188/CO.2021-0116

    SHAN Q SH, XIE M L, LIU ZH H, et al. Design of cooled long-wavelength infrared imaging optical system[J]. Chinese Optics, 2022, 15(1): 72-78. (in Chinese). doi: 10.37188/CO.2021-0116
    [2]
    MA T L, YANG ZH, WANG J Q, et al. Infrared small target detection network with generate label and feature mapping[J]. IEEE Geoscience and Remote Sensing Letters, 2022, 19: 6505405.
    [3]
    SUN Y, YANG J G, AN W. Infrared dim and small target detection via multiple subspace learning and spatial-temporal patch-tensor model[J]. IEEE Transactions on Geoscience and Remote Sensing, 2021, 59(5): 3737-3752. doi: 10.1109/TGRS.2020.3022069
    [4]
    赵鹏鹏, 李庶中, 李迅, 等. 融合视觉显著性和局部熵的红外弱小目标检测[J]. 中国光学,2022,15(2):267-275. doi: 10.37188/CO.2021-0170

    ZHAO P P, LI SH ZH, LI X, et al. Infrared dim small target detection based on visual saliency and local entropy[J]. Chinese Optics, 2022, 15(2): 267-275. (in Chinese). doi: 10.37188/CO.2021-0170
    [5]
    GAO C Q, MENG D Y, YANG Y, et al. Infrared patch-image model for small target detection in a single image[J]. IEEE Transactions on Image Processing, 2013, 22(12): 4996-5009. doi: 10.1109/TIP.2013.2281420
    [6]
    CHEN C L P, LI H, WEI Y T, et al. A local contrast method for small infrared target detection[J]. IEEE Transactions on Geoscience and Remote Sensing, 2014, 52(1): 574-581. doi: 10.1109/TGRS.2013.2242477
    [7]
    XIA CH Q, LI X R, ZHAO L Y, et al. Infrared small target detection based on multiscale local contrast measure using local energy factor[J]. IEEE Geoscience and Remote Sensing Letters, 2020, 17(1): 157-161. doi: 10.1109/LGRS.2019.2914432
    [8]
    HAN J H, MORADI S, FARAMARZI I, et al. A local contrast method for infrared small-target detection utilizing a tri-layer window[J]. IEEE Geoscience and Remote Sensing Letters, 2020, 17(10): 1822-1826. doi: 10.1109/LGRS.2019.2954578
    [9]
    刘彦磊, 李孟喆, 王宣宣. 轻量型YOLOv5s车载红外图像目标检测[J]. 中国光学,2023,16(5):1045-1055. doi: 10.37188/CO.2022-0254

    LIU Y L, LI M ZH, WANG X X. Lightweight YOLOv5s vehicle infrared image target detection[J]. Chinese Optics, 2023, 16(5): 1045-1055. (in Chinese). doi: 10.37188/CO.2022-0254
    [10]
    PANG Y W, WANG T C, ANWER R M, et al. Efficient featurized image pyramid network for single shot detector[C]. 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), IEEE, 2019: 7328-7336.
    [11]
    YANG X, YAN J CH, FENG Z M, et al. R3Det: Refined single-stage detector with feature refinement for rotating object[C]. Thirty-Seventh AAAI Conference on Artificial Intelligence, AAAI Press, 2020.
    [12]
    LIU Y ZH, CAO S, LASANG P, et al. Modular lightweight network for road object detection using a feature fusion approach[J]. IEEE Transactions on Systems, Man, and Cybernetics: Systems, 2021, 51(8): 4716-4728.
    [13]
    ZHANG SH F, WEN L Y, BIAN X, et al. Single-shot refinement neural network for object detection [C]. 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, IEEE, 2018: 4203-4212.
    [14]
    YUAN Y, XIONG ZH T, WANG Q. VSSA-NET: Vertical spatial sequence attention network for traffic sign detection[J]. IEEE Transactions on Image Processing, 2019, 28(7): 3423-3434. doi: 10.1109/TIP.2019.2896952
    [15]
    PANG Y W, CAO J L, WANG J, et al. JCS-Net: Joint classification and super-resolution network for small-scale pedestrian detection in surveillance images[J]. IEEE Transactions on Information Forensics and Security, 2019, 14(12): 3322-3331. doi: 10.1109/TIFS.2019.2916592
    [16]
    DAI Y M, WU Y Q, ZHOU F, et al. Attentional local contrast networks for infrared small target detection[J]. IEEE Transactions on Geoscience and Remote Sensing, 2021, 59(11): 9813-9824. doi: 10.1109/TGRS.2020.3044958
    [17]
    LI B Y, XIAO CH, WANG L G, et al. Dense nested attention network for infrared small target detection[J]. IEEE Transactions on Image Processing, 2023, 32: 1745-1758. doi: 10.1109/TIP.2022.3199107
    [18]
    WANG K W, DU SH Y, LIU CH X, et al. Interior attention-aware network for infrared small target detection[J]. IEEE Transactions on Geoscience and Remote Sensing, 2022, 60: 5002013.
    [19]
    GIRSHICK R. Fast R-CNN[C]. 2015 IEEE International Conference on Computer Vision (ICCV), IEEE, 2015: 1440-1448.
    [20]
    DOSOVITSKIY A, BEYER L, KOLESNIKOV A, et al. An Image is worth 16x16 words: Transformers for image recognition at scale[C]. 9th International Conference on Learning Representations, OpenReview. net, 2021.
    [21]
    SUN H, BAI J X, YANG F, et al. Receptive-field and direction induced attention network for infrared dim small target detection with a large-scale dataset IRDST[J]. IEEE Transactions on Geoscience and Remote Sensing, 2023, 61: 1-13.
    [22]
    ZHOU X Y, KARPUR A, LUO L J, et al. StarMap for category-agnostic keypoint and viewpoint estimation[C]. Proceedings of the European Conference on Computer Vision, Springer, 2018: 328-345.
    [23]
    LIN T Y, GOYAL P, GIRSHICK R, et al. Focal loss for dense object detection[C]. 2017 IEEE International Conference on Computer Vision (ICCV), IEEE, 2017: 2999-3007.
    [24]
    YANG Z, LIU SH H, HU H, et al. RepPoints: Point set representation for object detection[C]. 2019 IEEE/CVF International Conference on Computer Vision (ICCV), IEEE, 2019: 9656-9665.
    [25]
    XU B, WANG N Y, CHEN T Q, et al. Empirical evaluation of rectified activations in convolutional network[Z]. arXiv preprint: 1505.00853, 2015. (查阅网上资料, 未能确认文献类型, 请确认) .

    XU B, WANG N Y, CHEN T Q, et al.. Empirical evaluation of rectified activations in convolutional network[Z]. arXiv preprint: 1505.00853, 2015. (查阅网上资料, 未能确认文献类型, 请确认).
    [26]
    ZHU X ZH, SU W J, LU L W, et al. Deformable DETR: deformable transformers for end-to-end object detection[C]. 9th International Conference on Learning Representations, OpenReview. net, 2020.
    [27]
    WU Y, KIRILLOV A, Massa F, et al. Detectron2[CP/OL]. (2019)[2023-8-24]. https://github.com/facebookresearch/detectron2. (查阅网上所有资料,未找到本条文献信息,请确认) .

    WU Y, KIRILLOV A, Massa F, et al.. Detectron2[CP/OL]. (2019)[2023-8-24]. https://github.com/facebookresearch/detectron2. (查阅网上所有资料,未找到本条文献信息,请确认).
    [28]
    TAN M X, PANG R M, LE Q V. EfficientDet: Scalable and efficient object detection[C]. 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), IEEE, 2020: 10778-10787.
    [29]
    YU F, WANG D Q, SHELHAMER E, et al. Deep layer aggregation[C]. 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, IEEE, 2018: 2403-2412.
    [30]
    LIN T Y, MAIRE M, BELONGIE S, et al. Microsoft COCO: Common objects in context[M]. FLEET D, PAJDLA T, SCHIELE B, et al. Computer Vision – ECCV 2014. Cham: Springer, 2014: 740-755.
    [31]
    WANG H, ZHOU L P, WANG L. Miss detection vs. false alarm: adversarial learning for small object segmentation in infrared images[C]. 2019 IEEE/CVF International Conference on Computer Vision (ICCV), IEEE, 2019: 8508-8517.
    [32]
    RIVEST J F, FORTIN R. Detection of dim targets in digital infrared imagery by morphological image processing[J]. Optical Engineering, 1996, 35(7): 1886-1893. doi: 10.1117/1.600620
    [33]
    AGHAZIYARATI S, MORADI S, TALEBI H. Small infrared target detection using absolute average difference weighted by cumulative directional derivatives[J]. Infrared Physics & Technology, 2019, 101: 78-87.
    [34]
    HE K M, GKIOXARI G, DOLLÁR P, et al. Mask R-CNN[C]. 2017 IEEE International Conference on Computer Vision (ICCV), IEEE, 2017: 2980-2988.
    [35]
    KIRILLOV A, WU Y X, HE K M, et al. PointRend: Image segmentation as rendering[C]. 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), IEEE, 2020: 9796-9805.
  • 加载中

Catalog

    通讯作者: 陈斌, bchen63@163.com
    • 1. 

      沈阳化工大学材料科学与工程学院 沈阳 110142

    1. 本站搜索
    2. 百度学术搜索
    3. 万方数据库搜索
    4. CNKI搜索

    Figures(7)  / Tables(7)

    Article views(172) PDF downloads(18) Cited by()
    Proportional views

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return