Multi-scale Feature Extraction and Fusion Net: Research on UAVs Image Semantic Segmentation Technology

Authors

  • Xiaogang Li Henan Jiuyu Enpai Electric Power Technology Co., Ltd., 450000, Zhengzhou, China
  • Di Su State Grid Henan Electric Power Company, 450000, Zhengzhou, China
  • Dongxu Chang State Grid Henan Electric Power Company, 450000, Zhengzhou, China
  • Jiajia Liu Henan Jiuyu Enpai Electric Power Technology Co., Ltd., 450000, Zhengzhou, China
  • Liwei Wang Henan Jiuyu Enpai Electric Power Technology Co., Ltd., 450000, Zhengzhou, China
  • Zhansheng Tian Henan Jiuyu Enpai Electric Power Technology Co., Ltd., 450000, Zhengzhou, China
  • Shuxuan Wang Xidian University, School of Aerospace Science and Technology, 710071, Xi’an, China
  • Wei Sun Xidian University, School of Aerospace Science and Technology, 710071, Xi’an, China

DOI:

https://doi.org/10.13052/jicts2245-800X.1115

Keywords:

Semantic segmentation, drone image, Deep learning, multi-scale feature extraction, contextual information

Abstract

Since UAV aerial images are usually captured by UAVs at high altitudes with oblique viewing angles, the amount of data is large, and the spatial resolution changes greatly, so the information on small targets is easily lost during segmentation. Aiming at the above problems, this paper presents a semantic segmentation method for UAV images, which introduces a multi-scale feature extraction and fusion module based on the encoding-decoding framework. By combining multi-scale channel feature extraction and multi-scale spatial feature extraction, the network can focus more on certain feature layers and spatial regions when extracting features. Some invalid redundant features are eliminated and the segmentation results are optimized by introducing global context information to capture global information and detailed information. Moreover, one compares the proposed method with FCN-8s, MSDNet, and U-Net network models on the large-scale multi-class UAV dataset UAVid. The experimental results indicate that the proposed method has higher performance in both MIoU and MPA, with an overall improvement of 9.2% and 8.5%, respectively, and its prediction capability is more balanced for both large-scale and small-scale targets.

Downloads

Download data is not yet available.

Author Biographies

Xiaogang Li, Henan Jiuyu Enpai Electric Power Technology Co., Ltd., 450000, Zhengzhou, China

Xiaogang Li is a master’s student and senior engineer. He graduated from Power System and Automation at Tianjin University in July 2010. He joined Henan Jiuyu EPRI Electric Power Technology Co., Ltd. in January 2019. He is mainly engaged in the technical service of power equipment. He is good at research on intelligent operation inspection technology of power grids, application research on new energy technology of electric power, electrical equipment tests, and fault diagnosis research.

Di Su, State Grid Henan Electric Power Company, 450000, Zhengzhou, China

Di Su is a master’s student and senior engineer. He graduated from Motor and Electrical Equipment of North China Electric Power University in March 2006. He joined the State Grid Zhengzhou Electric Power Company in July 2006. He is engaged in production, operation and maintenance, management, organization, and personnel work. He is good at safety production, comprehensive management, and personnel management.

Dongxu Chang, State Grid Henan Electric Power Company, 450000, Zhengzhou, China

Dongxu Chang received his B.Sc. degree in Electrical Engineering from Air Force No. 1 Aviation University, Xinyang, China, in 2008. He has been engaged in UHV DC operation and maintenance for 14 years and has accumulated solid professional work experience. From Fufeng DC to Jinsu DC, and then to Tianzhong DC and Qingyu DC, he has traveled across the whole country and witnessed the growth, development, and expansion of UHV in China.

Jiajia Liu, Henan Jiuyu Enpai Electric Power Technology Co., Ltd., 450000, Zhengzhou, China

Jiajia Liu received his B.Sc. degree in automation from Zhengzhou University (Zhengzhou, China) in 2007. He is currently working as an engineer at Henan Jiuyu Enpai Electric Power Technology Co., LTD., Zhengzhou, China. He is mainly engaged in high-voltage test inspection.

Liwei Wang, Henan Jiuyu Enpai Electric Power Technology Co., Ltd., 450000, Zhengzhou, China

Liwei Wang was born in Xuchang, China. He received his B.Sc. degree in Detection Technology and Instruments from Xidian University in 1996. He is currently the deputy manager of Henan Jiuyu Enpai Power Technology Co., Ltd. He has rich working experience in the power system-related working field, familiar with power production, production and operation, product development, and distribution network product detection. He has a solid commissioning experience with power automation equipment and has participated in the on-site commissioning of automation equipment many times. As a key member, he participated in many automation equipment R&D projects and the R&D implementation of several information projects.

Zhansheng Tian, Henan Jiuyu Enpai Electric Power Technology Co., Ltd., 450000, Zhengzhou, China

Zhansheng Tian graduated from PLA Information Engineering University (Zhengzhou, China). He joined Henan Jiuyu Enpai Electric Power Technology Co., LTD. in 2010. He is currently the leader of the substation group in the equipment division of the company. He has led the diagnosis of more than 100 accidents of 110–500 kV main equipment, diagnosed many generator rotor turn-to-turn short circuit faults in the province, and participated in the preparation of the Technical Specification of Partial Discharge Online Monitoring Device for Gas Insulated Metal Enclosed Switchgear with UHF Method.

Shuxuan Wang, Xidian University, School of Aerospace Science and Technology, 710071, Xi’an, China

Shuxuan Wang received her B.Sc. degree from the Northeast Electric Power University, Jilin, China, in 2019, and her M.Sc. degree from the Xidian University, Xi’an, China, in 2022. She is currently working in Guangzhou Asensing Technology Co., Ltd. Her research interests include image target detection and image semantic segmentation.

Wei Sun, Xidian University, School of Aerospace Science and Technology, 710071, Xi’an, China

Wei Sun received his B.Sc., M.Sc. and Ph.D. degrees from the Xidian University (Xi’an, China), in 2002, 2005, and 2009, respectively. He has been a professor in the School of Aerospace Science and Technology at Xidian University since 2017. His research interests include intelligent robots and unmanned aerial systems.

References

Shelhamer E, Long J, Darrell T. Fully Convolutional Networks for Semantic Segmentation [J]. IEEE Transactions on Pattern Analysis & Machine Intelligence, 2017, 39(4): 640–651.

Duan Lijuan, Sun Qichao, Qiao Yuanhua, et al. Semantic Segmentation Algorithm of RGB-D Indoor Image Based on Attention Perception and Semantic Perception [J]. Chinese Journal of Computers, 2021.

Badrinarayanan V, Kendall A, Cipolla R. SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 39(12): 2481–2495.

Zhao H S, Shi J P, Qi X, et al. Pyramid Scene Parsing Network [C]. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Hawail. USA. 2017: 6230–6239.

Chen L C, Papandreou G, Kokkinos I, et al. Semantic Image Segmentation with Deep Convolutional Nets and Fully Connected CRFs [J]. Computer Science, 2014(4): 357–361.

Chen L C, Papandreou G, Kokkinos I, et al. Deeplab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J]. IEEE transactions on pattern analysis and machine intelligence, 2017, 40(4): 834–848.

Chen L C, Papandreou G, Schroff F, et al. Rethinking Atrous Convolution for Semantic Image Segmentation [J]. ArXiv preprint arXiv: 1706.05587, 2017.

Chen L C, Zhu Y, Papandreou G, et al. Encoder-decoder with Atrous Separable Convolution for Semantic Image Segmentation [C]. Proceedings of the European conference on computer vision (ECCV), 2018: 801–818.

Badrinarayanan V, Kendall A. Cipolla R. SegNet: A Deep Convolutional Encoder-decoder Architecture for Image Segmentation [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 39(12): 2481–2495.

Ronneberger O, Fischer P, Brox T. U-Net: Convolution Networks for Biomedical Image Segmentation [C]. International Conference on Medical Image Computing and Computer-Assisted Intervention. Munich, Germany, 2015: 234–241.

Jegou S, Drozdzal M, Vazquez D, et al. The One Hundred layers Tiramisu: Fully Convolutional Densenets for Semantic Segmentation. arXiv preprint arXiv: 1611.09326, 2016.

Lin G S, Milan A, Shen C H, Reid I. RefineNet: Multi-path Refinement Network for High-resolution Semantic Segmentation. //Proceedings of the IEEE(Conference on Computer Vision and Pattern Recognition. Hawaii, USA. 2017: 5168–5177.

Wang Yanran, Chen Qingliang, Wu Junjun. A Review of Image Semantic Segmentation Methods for Complex Environments [J]. Computer Science, 2019, 46(9): 36–46.

Ye L, Vosselman G, Xia G, et al. UAVid: A Semantic Segmentation Dataset for UAV Imagery [J]. 2018.

Downloads

Published

2023-01-14

How to Cite

Li, X. ., Su, D. ., Chang, D. ., Liu, J. ., Wang, L. ., Tian, Z. ., Wang, S. ., & Sun, W. . (2023). Multi-scale Feature Extraction and Fusion Net: Research on UAVs Image Semantic Segmentation Technology. Journal of ICT Standardization, 11(01), 97–116. https://doi.org/10.13052/jicts2245-800X.1115

Issue

Section

Articles