|Table of Contents|

Citation:
 Shuai Fang,Jianhui Cui,Ling Yang,et al.Research on Signal Extraction and Classification for Ship Sound Signal Recognition[J].Journal of Marine Science and Application,2024,(4):984-995.[doi:10.1007/s11804-024-00435-0]
Click and Copy

Research on Signal Extraction and Classification for Ship Sound Signal Recognition

Info

Title:
Research on Signal Extraction and Classification for Ship Sound Signal Recognition
Author(s):
Shuai Fang1 Jianhui Cui1 Ling Yang1 Fanbin Meng2 Huawei Xie2 Chunyan Hou2 Bin Li2
Affilations:
Author(s):
Shuai Fang1 Jianhui Cui1 Ling Yang1 Fanbin Meng2 Huawei Xie2 Chunyan Hou2 Bin Li2
1 Maritime College, Tianjin University of Technology, Tianjin, 300384, China;
2 Tianjin Navigation Instrument Research Institute, Tianjin, 300131, China
Keywords:
Ship signal identificationSignal extractionAutomatic classificationIntelligent shipsSupport vector machine
分类号:
-
DOI:
10.1007/s11804-024-00435-0
Abstract:
The movements and intentions of other ships can be determined by gathering and examining ship sound signals. The extraction and analysis of ship sound signals fundamentally support the autonomous navigation of intelligent ships. Mel scale frequency cepstral coefficient (MFCC) feature parameters are improved and optimized to form NewMFCC by introducing second-order difference and wavelet packet decomposition transformation methods in this paper. Transforming sound signals into a feature vector that fully describes the dynamic characteristics of ship sound signals and the high- and low-frequency information solves the problem of the inability to transport ordinary sound signals directly as signals for training in machine learning models. Radial basis function kernels are used to conduct support vector machine classifier simulation experiments. Five types of sound signals, namely, one type of ship sound signals and four types of interference sound signals, are categorized and identified as classification targets to verify the feasibility of the classification of ship sound signals and interference signals. The proposed method improves classification accuracy by approximately 15%.

References:

Amelia F, Gunawan D (2019) DWT-MFCC method for speaker recognition system with noise. 7th International Conference on Smart Computing & Communications (ICSCC), Miri, Malaysia, 310-314. DOI: 10.1109/ICSCC.2019.8843660
Bhattacharjee U, Kshirod S (2013) Language identification system using MFCC and prosodic features. International Conference on Intelligent Systems and Signal Processing (ISSP), 194-197. DOI: 10.1109/Issp.2013.6526901
Bin Hamzah HI, Bin Abdullah A, Candrawati R (2009) Biologically-inspired abstraction model to analyze sound signal. IEEE Student Conference on Research and Development, Serdang, Malaysia, 180-183. DOI: 10.1109/Scored.2009.5443168
Bryant ML, Garber FD (1999) SVM classifier applied to the mstar public data set. Proc. SPIE 3721, Algorithms for Synthetic Aperture Radar Imagery VI. https://doi.org/10.1117/12.357652
Chomorlig, Zhang Z, Xiang YL (2012) Research of feature extraction in mongolian speech based on an improved algorithm of MFCC parameter. 2nd International Conference on Advanced Engineering Materials and Technology (AEMT), Zhuhai, 833-837. DOI: 10.4028/www.scientific.net/AMR.542-543.833
Chu XT, Wang HP, Yang HT, Lin NH (2022) Speaker recognition based on convolutional neural network. Police Technology (1): 46-50
Cortes C, Vapnik V (1995) Support-vector networks. Machine Learning 20(3): 273-297. DOI: 10.1023/A:1022627411411
Deng MQ, Meng TT, Cao JW, Wang SM, Zhang J, Fan HJ (2020) Heart sound classification based on improved MFCC features and convolutional recurrent neural networks. Neural Networks 130: 22-32. DOI: 10.1016/J.Neunet.2020.06.015
Du C, Shao JH, Yang W, Et Al (2021) Indoor visible light positioning based on support vector machine optimized by grid search method. Laser Journal 42(3): 104-109. DOI: 10.14016/J.Cnki.Jgzz.2021.03.104
Fan P (2017) A complex sound recognition method in noisy environments. Hefei University of Technology (10): 18-21. DOI: 10.7666/D.Y3235272
Geoffrey L, Collie A (2004) Comparison of novices and experts in the identification of sonar signals. Speech Communication 43(4): 297-310. DOI: 10.1016/J.Specom.2004.03.003
Hu L, Hu XJ, Huang ZH, Xu L, Hu K, Zhang JM (2021) Mura defect detection based on effective background reconstruction and contrast enhancement. Journal of Liquid Crystals and Displays 36(10): 1395-1402. DOI: 10.37188/CJLCD.2021-0177
Lin W, Yang L, Xu B (2006) Speaker recognition based on modified MFCC parameters of chinese mandarin whispered speech. Journal of Nanjing University (Natural Sciences) 42(1): 54-62. DOI: CNKI:SUN:NJDZ.0.2006-01-007
Lu MJ, Dong SJ, Tang M, Wang CR, Cao L, Yan XP (2023) Research on the development of china’s marine transportation equipment industry. Chinese Engineering Science (3): 53-61. DOI: 10.15302/J-Sscae-2023.03.006
Lv DJ, Zhang Y, Fu QJ, Xu HF, Liu J, Zi JL, Huang X (2020) Birdsong recognition based on MFCC combined with vocal tract properties. 5th International Conference on Mechanical, Control and Computer Engineering (ICMCCE), Harbin, 1523-1526. DOI: 10.1109/Icmcce51767.2020.00334
Ma YJ, Zheng XY (2015) Research and application based on improved support vector machine. Industrial Control Computer 28(12): 25-26+28. DOI: CNKI:SUN:GYKJ.0.2015-12-012
Mahesha P, Vinod DS (2017) LP-Hillbert transform based MFCC for effective discrimination of stuttering dysfluencies. 2nd IEEE International Conference on Wireless Communications, Signal Processing and Networking (Wispnet), Chennai, 2561-2565. DOI: 10.1109/Wispnet.2017.8300225
Maurya A, Kumar D, Agarwal RK (2017) Speaker recognition for hindi speech signal using MFCC-GMM approach. 6th International Conference on Smart Computing and Communications (ICSCC), Kurukshetra, 880-887. DOI: 10.1016/J.Procs.2017.12.112
Nassim A, Abderrahmane A (2017) Boosting scores fusion approach using front-end diversity and adaboost algorithm, for speaker verification. Computers and Electrical Engineering 62: 648-662. DOI: 10.1016/J.Compeleceng.2017.03.022
Jiang N, Qiu M, Dai W (2017) SROC: A speaker recognition with data decision level fusion method in cloud environment. J Sign Process Syst 86: 123-133. DOI: 10.1007/S11265-015-1100-7
Shi YB, Wang L (2011) Improved MFCC algorithm in speaker recognition system. International Conference on Graphic and Image Processing (ICGIP 2011), Cairo, Egypt, 828567. https://doi.org/10.1117/12.913462
Srivastava S, Bhardwaj S, Bhandari A, Gupta K, Bahl H, Gupta JRP (2012) Wavelet packet based mel frequency cepstral features for text independent speaker identification. 1st International Symposium on Intelligent Informatics (ISI12), Chennai, India, 237-247. https://doi.org/10.1007/978-3-642-32063-7_26
Tang L, Tian YJ, Pardalos Panos M (2018) A novel perspective on multiclass classification: Regular simplex support vector machine. Information Sciences: 480: 324-338. https://doi.org/10.1016/j.ins.2018.12.026
Tian J, Xue SH, Huang HN, Zhang CH (2007) Classification of underwater still objects based on multi-field features and SVM. J Mar. Sc. Appl. 6: 36-40. DOI: 10.1007/S11804-007-6042-4
Tomchuk KK (2018) Spectral masking in MFCC calculation for noisy speech. Wave Electronics and Its Application In Information and Telecommunication Systems (WECONF), St Petersburg, Russia, 1-4. DOI: 10.1109/WECONF.2018.8604460
Tuncer T, Aydemir E (2020) An automated local binary pattern ship identification method by using sound. Acta Infologica 4(1): 57-63. DOI: 10.26650/Acin.762809
Wang LH, Zhao ZH (2020) Marine mechanical noise monitoring system based on MFCC-SVM. Automation and Instrumentation 35(12): 54-58. DOI:https://doi.org/10.19557/J.Cnki.1001-9944.2020.12.012
Wang S, Ding N, Li N, Zhang JJ, Zong CQ (2023) Language cognition and language computation: language understanding by human and machine. Chinese Science: Information Science 52(10): 1748-1774. DOI: 10.48550/Arxiv.2301.04788
Wang Y, Hu WP (2018) Speech emotion recognition based on improved MFCC. In Proceedings of the 2nd International Conference on Computer Science and Application Engineering (CSAE’18), New York, Article 88: 1-7. DOI: 10.1145/3207677.3278037
Wróbel K, Montewka J, Kujala P (2017) Towards the assessment of potential impact of unmanned vessels on maritime transportation safety. Reliability Engineering & System Safety 165: 155-169. DOI: 10.1016/J.Ress.2017.03.029
Xie GD, Yang JC, Yi Q, Han ZF (2010) Research on battlefield target sound detection technology. Control Engineering 17(S1): 41-44. DOI: 10.14107/J.Cnki.Kzgc.2010.S1.025
Xie SS, Xu HF, Liu J, Zhang Y, Lv DJ (2021) Research on bird songs recognition based on MFCC-HMM. International Conference on Computer, Control and Robotics (ICCCR), Shanghai, 262-266. DOI: 10.1109/Icccr49711.2021.9349284
Xu H, Liu YL (2021) Unmanned aerial vehicle sound recognition algorithm based on deep learning. Computer Science 48(7): 225-232. DOI: 10.11896/Jsjkx.200500091
Yan CF, Chun HZ, Yuan LD, Zhi YS (2013) Coarse frequency offsetestimation of TD-LTE based on spectrum centroid. Advanced Materials Research 756-759: 3602-3606. DOI: 10.4028/www.scientific.Net/AMR.756-759.3602
Yan Q, Zhou ZJ, Li S (2011) Chinese accents identification with modified MFCC. International Conference on Instrumentation, Measurement, Circuits and Systems (ICIMCS 2011), Hong Kong, 659-666. https://doi.org/10.1007/978-3-642-27334-6_77
Yang Y, Chen X (2020) Experimental design of speaker recognition based on neural networks. Laboratory Research and Exploration 39(9): 38-41+50. DOI: 10.3969/J.Issn.1006-7167.2020.09.008
Yu Y, Xu JX, Zhang MC, Shi H (2019) Comparative analysis of wavelet analysis and wavelet packet analysis in bearing fault diagnosis. Coal Mine Machinery 40(12): 170-173. DOI: CNKI:SUN:MKJX.0.2019-12-055
Zhang C, Ma Y (2012) Ensemble machine learning: methods and applications. Springer Science & Business Media, 10-329. DOI: 10.1007/9781441993267
Zhang XY, Bai J, Liang WZ (2006) The speech recognition system based on bark wavelet MFCC. 8th International Conference on Signal Processing, Guilin, 833-835. DOI: 10.1109/ICOSP.2006.345539
Zheng F, Zhang GL, Song ZJ (2001) Comparison of different implementations of MFCC. Journal of Computer Science and Technology 16(6): 582-589. DOI: 10.1007/BF02943243
Zhong M, Cai W (2019) Feature fusion-based recognition of marine mammal sounds. Electronic Science and Technology 32(5): 32-37. DOI: 10.16180/J.Cnki.Issn1007-7820.2019.05.007

Memo

Memo:
Received date:2023-6-20;Accepted date:2023-11-3。
Corresponding author:Jianhui Cui,E-mail:jianhuicui@email.tjut.edu.cn
Last Update: 2025-01-09