• 论文
主办单位:煤炭科学研究总院有限公司、中国煤炭学会学术期刊工作委员会
基于改进YOLOv7和ByteTrack的煤矿关键岗位人员不安全行为识别
  • Title

    Recognition of unsafe behaviors of key position personnel in coal mines based on improved YOLOv7 and ByteTrack

  • 作者

    韩康李敬兆陶荣颖

  • Author

    HAN Kang;LI Jingzhao;TAO Rongying

  • 单位

    安徽理工大学人工智能学院淮浙煤电有限责任公司顾北煤矿

  • Organization
    School of Artificial Intelligence, Anhui University of Science and Technology
    Gubei Coal Mine, Huaizhe Coal Power Co., Ltd.
  • 摘要
    应用人工智能技术对矿井提升机司机等煤矿关键岗位人员的行为进行实时识别,防止发生设备误操作等危险情况,对保障煤矿安全生产具有重要意义。针对基于图像特征的人员行为识别方法存在的抗背景干扰能力差与实时性不足问题,提出了一种基于改进YOLOv7和ByteTrack的煤矿关键岗位人员不安全行为识别方法。首先,基于MobileOne和C3对YOLOv7目标检测模型骨干与头部网络进行轻量化改进,提高模型推理速度;其次,融合ByteTrack跟踪算法,实现工作人员跟踪锁定,提高抗背景干扰能力;然后,采用MobileNetV2优化OpenPose的网络结构,提高对骨架特征的提取效率;最后,通过时空图卷积网络(ST−GCN)分析人体骨架关键点在时间序列上的空间结构和动态变化,实现对不安全行为的分析识别。实验结果表明:MobileOneC3−YOLO模型的精确率达93.7%,推理速度较YOLOv7模型提高了52%;融合ByteTrack的人员锁定模型锁定成功率达97.1%;改进OpenPose模型内存需求减少了170.3 MiB,在CPU与GPU上的推理速度分别提升了74.7%和54.9%;不安全行为识别模型对疲劳睡岗、离岗、侧身交谈和玩手机4种不安全行为的识别精确率达93.5%,推理速度达18.6 帧/s。
  • Abstract
    The application of artificial intelligence technology can real-time recognize the behavior of key position personnel in coal mines, such as mine hoist drivers, to prevent dangerous situations such as equipment misoperation. It is of great significance for ensuring coal mine safety production. The personnel behavior recognition method based on image features has problems of poor resistance to background interference and insufficient real-time performance. In order to solve the above problems, a coal mine key position personnel unsafe behavior recognition method based on improved YOLOv7 and ByteTrack is proposed. Firstly, based on MobileOne and C3, lightweight improvements are made to the backbone and head network of the YOLOv7 object detection model to improve the inference speed of the model. Secondly, integrating ByteTrack tracking algorithm, to achieve the tracking and locking of personnel is achieved, and the capability to resist background interference is improved. Thirdly, MobileNetV2 is used to optimize the network structure of OpenPose and improve the efficiency of skeleton feature extraction. Finally, the spatial temporal graph convolutional networks (ST−GCN) is used to analyze the spatial structure and dynamic changes of the key points of the human skeleton in the time series, achieving the analysis and recognition of unsafe behaviors. The experimental results show that the precision of the MobileOneC3−YOLO model reaches 93.7%, and the inference speed is improved by 52% compared to the YOLOv7 model. The success rate of personnel locking model integrating ByteTrack reaches 97.1%. The improved OpenPose model reduces memory requirements by 170.3 MiB. The inference speed on CPU and GPU is improved by 74.7% and 54.9%, respectively; The recognition precision of the unsafe behavior recognition model for four types of unsafe behaviors, including fatigue sleeping on duty, leaving work, side talking, and playing with mobile phones, reaches 93.5%, and the inference speed reaches 18.6 frames per second.
  • 关键词

    不安全行为识别目标检测姿态估计时空图卷积网络人员锁定YOLOv7ByteTrack

  • KeyWords

    unsafe behaviors recognition;object detection;attitude estimation;spatial temporal graph convolutional networks;personnel locking;YOLOv7;ByteTrack

  • 基金项目(Foundation)
    国家自然科学基金资助项目(52374154);安徽理工大学研究生创新基金资助项目(2022CX1008)。
  • DOI
  • 引用格式
    韩康,李敬兆,陶荣颖. 基于改进YOLOv7和ByteTrack的煤矿关键岗位人员不安全行为识别[J]. 工矿自动化,2024,50(3):82-91.
  • Citation
    HAN Kang, LI Jingzhao, TAO Rongying. Recognition of unsafe behaviors of key position personnel in coal mines based on improved YOLOv7 and ByteTrack[J]. Journal of Mine Automation,2024,50(3):82-91.
  • 相关文章
  • 图表

    Table1

    表 1 评价指标
    评价指标定义计算公式
    Precision(↑)精确率\( \dfrac{{{\mathrm{TP}}}}{{{\mathrm{TP}} + {\mathrm{FP}}}}\)
    Params(↓)参数量
    AP(↑)关键点相似度为[0.50,0.55,···,0.95]时
    10个位置的平均精确率
    帧率(↑)每秒处理的帧数\( \dfrac{{总帧数}}{{时间}} \)
    MOTA(↑)多目标跟踪准确率\( 1 - \dfrac{{{\mathrm{FN}} + {\mathrm{FP}} + {\mathrm{IDSW}}}}{{{\mathrm{GT}}}}\)
    IDF1(↑)ID调和均值\( \dfrac{{2{\mathrm{TP}}}}{{2{\mathrm{TP}} + {\mathrm{FP }}+ {\mathrm{FN}}}}\)
    FP(↓)误跟踪目标数
    FN(↓)漏跟踪目标数

    Table2

    表 2 人员检测模型性能
    模型精确率/%参数量/107单帧图像推理耗时/s
    YOLOv589.63.120.0395
    YOLOv791.13.720.0437
    YOLOv891.94.360.0475
    MobileOneC3−YOLO93.72.510.0208

    Table3

    表 3 人员检测模型消融实验结果
    改进策略精确率/%参数量/107单帧图像推理耗时/s
    MobileOneC3
    ××91.13.720.043 7
    ×90.82.930.034 3
    ×92.12.710.035 9
    93.72.510.020 8

    Table4

    表 4 跟踪算法对比实验结果
    算法IDF1/%MOTA/%FPFN帧率/(帧·s−1
    SORT75.573.64 85621 37621.3
    DeepSort85.781.36 51219 83714.7
    ByteTrack88.185.55 53913 55729.6

    Table5

    表 5 关键岗位人员锁定结果统计
    关键岗位人员成功锁定次数平均成功
    锁定次数
    总体锁定
    成功率/%
    1轮2轮3轮
    矿井提升机司机6059605997.10
    绞车司机59605758
    变电站值班人员58605958
    井口信把工59586058

    Table6

    表 6 不安全行为识别模型性能测试结果
    模型AP/%模型内存/MiB单帧图像推理耗时/s
    CPU:12900KGPU:3090
    OpenPose74.6203.80.83260.0573
    改进OpenPose72.833.50.21030.0258

    Table7

    表 7 不安全行为识别模型消融实验结果
    改进策略精确率/%帧率/(帧·s−1
    YOLOv7+OpenPose91.87.1
    YOLOv7+改进OpenPose92.112.5
    MobileOneC3−YOLO+OpenPose91.613.7
    MobileOneC3−YOLO+改进OpenPose93.518.6
相关问题

主办单位:煤炭科学研究总院有限公司 中国煤炭学会学术期刊工作委员会

©版权所有2015 煤炭科学研究总院有限公司 地址:北京市朝阳区和平里青年沟东路煤炭大厦 邮编:100013
京ICP备05086979号-16  技术支持:云智互联