基于Vision Mamba模型的渔业监测物种分类性能比较
CSTR:
作者:
作者单位:

作者简介:

通讯作者:

中图分类号:

S 951.2;TP 391.41

基金项目:

国家重点研发计划(2023YFD2401302)


Performance comparison of fishery species classification based on vision mamba
Author:
Affiliation:

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    渔业电子观察员(Electronic monitoring)是实施渔业智能化监管的重要手段,图像识别是其支撑的关键技术之一,如何解决边缘计算场景下部署高性能、轻量化模型是目前面临的挑战。本研究引入深度学习领域的 Vision Mamba (ViM) 模型,该模型利用选择性状态空间机制(State space model,SSM)构建双向编码器,在保持线性计算复杂度的同时实现了对图像长距离依赖关系的全局建模。研究以自然保护协会渔业监测数据集为基础,与ResNet、EfficientNet、DeiT等主流模型开展了系统性的性能对比研究。结果显示,ViM模型在效率与精度上均表现出卓越性能。在轻量级模型中,ViM-Tiny在比ResNet-18基线模型少44.28%参数量的情况下,准确率提升了1.12%,F1分数提升了2.19%。在中量级模型中,ViM-Small在参数量相较ResNet-101基线模型减少44.65%的情况下,仍能实现与之接近持平的准确率(0.960 3)与F1分数(0.964 5)。研究表明,ViM模型能够在显著降低模型复杂度的同时,仍保持强大的渔业物种分类能力,在轻量化与高精度之间取得了很好的平衡。研究为构建高效、智能的渔业监管系统提供了新的技术路径。

    Abstract:

    As a crucial means for implementing intelligent fishery supervision, electronic monitoring (EM) relies heavily on image recognition technology. A key challenge lies in deploying high-performance yet lightweight models in edge computing scenarios. This study introduces the Vision Mamba (ViM) model from the field of deep learning. By utilizing the selective state space model (SSM) to construct a bidirectional encoder, the model achieves global modeling of long-range dependencies in images while maintaining linear computational complexity. Based on the Nature Conservancy (TNC) Fisheries Monitoring dataset, a systematic performance comparison was conducted against mainstream models such as ResNet, EfficientNet, and DeiT. The results demonstrate that the ViM model exhibits outstanding performance in both efficiency and accuracy. Among lightweight models, ViM-Tiny achieved a 1.12% increase in accuracy and a 2.19% improvement in F1 score while reducing parameters by 44.28% compared to the ResNet-18 baseline. Among medium-sized models, ViM-Small, with 44.65% fewer parameters than the ResNet-101 baseline, achieved comparable accuracy (0.960 3) and F1 score (0.964 5). The study indicates that the ViM model maintains strong fishery species classification capability while significantly reducing model complexity, achieving an excellent balance between lightweight design and high accuracy. This research provides a novel technical pathway for constructing efficient and intelligent fishery supervision systems.

    参考文献
    相似文献
    引证文献
引用本文

张泽海,黄小双,孔祥洪,刘必林,陈新军.基于Vision Mamba模型的渔业监测物种分类性能比较[J].上海海洋大学学报,2026,35(2):508-519.
ZHANG Zehai, HUANG Xiaoshuang, KONG Xianghong, LIU Bilin, CHEN Xinjun. Performance comparison of fishery species classification based on vision mamba[J]. Journal of Shanghai Ocean University,2026,35(2):508-519.

复制
相关视频

分享
文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2025-11-25
  • 最后修改日期:2026-01-07
  • 录用日期:
  • 在线发布日期: 2026-03-24
  • 出版日期: 2026-03-31
文章二维码
关闭