Sequential Hybrid Integration of U-Net and Fully Convolutional Networks with Mask R-CNN for Enhanced Building Boundary Segmentation from Satellite Imagery
DOI:
https://doi.org/10.21271/ZJPAS.37.3.13Keywords:
Deep Learning,U-Net, FCN, Mask R-CNN, Sequential IntegrationAbstract
In the recent years, building boundary segmentation obtained significant advancement through using deep learning. The present algorithms, such as Convolutional Neural Network (CNN) are unable to detect buildings in challenging urban areas like occlusions. This study investigates the integration of U-Net and Fully Convolutional Networks (FCN) with Mask R-CNN to improve building boundary segmentation using high-resolution satellite imagery. A sequential hybrid approach has been developed for combining semantic and instant segmentation. The integration between the U-Net with Mask R-CNN has been achieved by feeding the segmentation result from the U-Net as an input into the Mask R-CNN. A similar procedure was applied in the integration of the FCN with Mask R-CNN. The integration of U-Net with Mask R-CNN resulted in an improvement in the recall by 9.9% and an increase by 4.3 % in the F1-score, demonstrating its capability in segmenting boundary precision and fine-grained details. Similarly, FCN combined with Mask R-CNN has shown an enhancement of recall by 9.9% and precision by 7.6%, assuring its capability in the capture of global context. Further analysis through comparison between integration U-Net with Mask R-CNN with results from previous studies, demonstrates that the proposed integration scheme outperforms the existing results. The performance evaluation across RGB and panchromatic datasets highlights the flexibility of these integrations by proving their efficiency in different applications. Despite the minor challenges that appeared in boundary alignment, the results brought out the potential of such hybrid models for applications in urban planning, cadastral mapping, and disaster management.
References
Ahmadian, N., Sedaghat, A., Mohammadi, N. & Aghdami-Nia, M. 2024. Deep-Learning-Based Edge Detection for Improving Building Footprint Extraction from Satellite Images. Environmental Sciences Proceedings, 29, 61.
Alsabhan, W., Alotaiby, T. & Dudin, B. 2022. Detecting Buildings and Nonbuildings from Satellite Images Using U‐Net. Computational Intelligence and Neuroscience, 2022, 4831223.
Amarù, S., Marelli, D., Ciocca, G. & Schettini, R. 2023. DALib: A Curated Repository of Libraries for Data Augmentation in Computer Vision. Journal of Imaging, 9, 232.
Anh, H. T., Tuan, T. A., Long, H. P., Ha, L. H. & Thang, T. N. Multi Deep Learning Model for Building Footprint Extraction from High Resolution Remote Sensing Image. 2022 Singapore. Springer Nature Singapore, 246-252.
Aryal, J. & Neupane, B. 2023. Multi-Scale Feature Map Aggregation and Supervised Domain Adaptation of Fully Convolutional Networks for Urban Building Footprint Extraction. Remote Sensing, 15, 488.
Ayala, C., Sesma, R., Aranda, C. & Galar, M. 2021. A deep learning approach to an enhanced building footprint and road detection in high-resolution satellite imagery. Remote Sensing, 13, 3135.
Bai, J., Jia, C., Yu, S., Sun, L., Zhang, L., Chang, Z. & Hou, A. Building Extraction from High-Resolution Remote Sensing Images Using Improved HRNet Method. IGARSS 2024-2024 IEEE International Geoscience and Remote Sensing Symposium, 2024. IEEE, 7982-7985.
Bittner, K., Adam, F., Cui, S., Körner, M. & Reinartz, P. 2018. Building footprint extraction from VHR remote sensing images combined with normalized DSMs using fused fully convolutional networks. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 11, 2615-2629.
Bousias Alexakis, E. & Armenakis, C. 2022. Improving CNN-Based Building Semantic Segmentation Using Object Boundaries. Int. Arch. Photogramm. Remote Sens. Spatial Inf. Sci., XLIII-B3-2022, 41-48.
Carvalho, O. L. F. D., De Carvalho Junior, O. A., Albuquerque, A. O. D., Bem, P. P. D., Silva, C. R., Ferreira, P. H. G., Moura, R. D. S. D., Gomes, R. a. T., Guimaraes, R. F. & Borges, D. L. 2020. Instance segmentation for large, multi-channel remote sensing imagery using mask-RCNN and a mosaicking approach. Remote Sensing, 13, 39.
Dalal, A.-A., Shao, Y., Alalimi, A. & Abdu, A. 2020. Mask R-CNN for geospatial object detection. International Journal of Information Technology and Computer Science (IJITCS), 12, 63-72.
Das, S. Automated Building Segmentation in Areal Images Using Boundary Edge Detection. 2024 Singapore. Springer Nature Singapore, 237-250.
Dutta, A. & Zisserman, A. The VIA annotation software for images, audio and video. Proceedings of the 27th ACM international conference on multimedia, 2019. 2276-2279.
Gao, J., Zhang, B., Wu, Y. & Guo, C. Building Extraction from High Resolution Remote Sensing Images Based on Improved Mask R-CNN. 2022 4th International Conference on Robotics and Computer Vision (ICRCV), 2022. IEEE, 1-6.
Gashti, E. H., Delavar, M. R., Guan, H. & Li, J. 2024. Semantic Segmentation Uncertainty Assessment of Different U-net Architectures for Extracting Building Footprints. ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences, 10, 141-148.
Guillaume, C., Aramburu, C. & Bougdal-Lambert, I. 2017. Satellite image segmentation for building detection using U-Net. Computer Science.
Han, Q., Yin, Q., Zheng, X. & Chen, Z. 2021. Remote sensing image building detection method based on Mask R-CNN. Complex & Intelligent Systems, 1-9.
He, K., Gkioxari, G., Dollár, P. & Girshick, R. Mask r-cnn. Proceedings of the IEEE international conference on computer vision, 2017. 2961-2969.
Ji, S., Wei, S. & Lu, M. 2018. Fully convolutional networks for multisource building extraction from an open aerial and satellite imagery data set. IEEE Transactions on geoscience and remote sensing, 57, 574-586.
Li, C., Fu, L., Zhu, Q., Zhu, J., Fang, Z., Xie, Y., Guo, Y. & Gong, Y. 2021a. Attention Enhanced U-Net for Building Extraction from Farmland Based on Google and WorldView-2 Remote Sensing Images. Remote Sensing, 13, 4411.
Li, L., Zhang, T., Oehmcke, S., Gieseke, F. & Igel, C. 2023. BuildSeg: a general framework for the segmentation of buildings. arXiv preprint arXiv:2301.06190.
Li, W., He, C., Fang, J., Zheng, J., Fu, H. & Yu, L. 2019. Semantic segmentation-based building footprint extraction using very high-resolution satellite images and multi-source GIS data. Remote Sensing, 11, 403.
Li, Y., Xu, W., Chen, H., Jiang, J. & Li, X. 2021b. A novel framework based on mask R-CNN and histogram thresholding for scalable segmentation of new and old rural buildings. Remote Sensing, 13, 1070.
Liu, P., Liu, X., Liu, M., Shi, Q., Yang, J., Xu, X. & Zhang, Y. 2019. Building footprint extraction from high-resolution images via spatial residual inception convolutional neural network. Remote Sensing, 11, 830.
Liu, Z., Chen, B. & Zhang, A. Building segmentation from satellite imagery using U-Net with ResNet encoder. 2020 5th International Conference on Mechanical, Control and Computer Engineering (ICMCCE), 25-27 Dec. 2020 2020. 1967-1971.
Long, J., Shelhamer, E. & Darrell, T. Fully convolutional networks for semantic segmentation. Proceedings of the IEEE conference on computer vision and pattern recognition, 2015. 3431-3440.
Luo, L., Li, P. & Yan, X. 2021. Deep Learning-Based Building Extraction from Remote Sensing Images: A Comprehensive Review. Energies, 14, 7982.
Lussange, J., Yu, M., Tarabalka, Y. & Lafarge, F. 2023. 3D detection of roof sections from a single satellite image and application to LOD2-building reconstruction. arXiv preprint arXiv:2307.05409.
Mace, E., Manville, K., Barbu-Mcinnis, M., Laielli, M., Klaric, M. & Dooley, S. 2018. Overhead detection: Beyond 8-bits and rgb. arXiv preprint arXiv:1808.02443.
Mou, L. & Zhu, X. X. 2018. RiFCN: Recurrent network in fully convolutional network for semantic segmentation of high resolution remote sensing images. arXiv preprint arXiv:1805.02091.
Neupane, B., Horanont, T. & Aryal, J. 2021. Deep learning-based semantic segmentation of urban features in satellite images: A review and meta-analysis. Remote Sensing, 13, 808.
Noureldeen, A. & Wahed, M. E. 2024. Enhanced building footprint extraction from satellite imagery using Mask R-CNN and PointRend. Bulletin of Electrical Engineering and Informatics, 13, 3601-3608.
Prathap, G. & Afanasyev, I. Deep learning approach for building detection in satellite multispectral imagery. 2018 international conference on intelligent systems (IS), 2018. IEEE, 461-465.
Pushparaj, J. & Hegde, A. V. 2017. Evaluation of pan-sharpening methods for spatial and spectral quality. Applied Geomatics, 9, 1-12.
Raghavan, R., Verma, D. C., Pandey, D., Anand, R., Pandey, B. K. & Singh, H. 2022. Optimized building extraction from high-resolution satellite imagery using deep learning. Multimedia Tools and Applications, 81, 42309-42323.
Reda, K. & Kedzierski, M. 2020. Detection, Classification and Boundary Regularization of Buildings in Satellite Imagery Using Faster Edge Region Convolutional Neural Networks. Remote Sensing, 12, 2240.
Ronneberger, O., Fischer, P. & Brox, T. U-net: Convolutional networks for biomedical image segmentation. Medical image computing and computer-assisted intervention–MICCAI 2015: 18th international conference, Munich, Germany, October 5-9, 2015, proceedings, part III 18, 2015. Springer, 234-241.
Sakeena, M., Stumpe, E., Despotovic, M., Koch, D. & Zeppelzauer, M. 2023. On the Robustness and Generalization Ability of Building Footprint Extraction on the Example of SegNet and Mask R-CNN. Remote Sensing, 15, 2135.
Sariturk, B. & Seker, D. Z. 2022. A Residual-Inception U-Net (RIU-Net) Approach and Comparisons with U-Shaped CNN and Transformer Models for Building Segmentation from High-Resolution Satellite Images. Sensors, 22, 7624.
Shorten, C. & Khoshgoftaar, T. M. 2019. A survey on image data augmentation for deep learning. Journal of big data, 6, 1-48.
Susetyo, D. B., Harintaka, H. & Rizaldy, A. The application of mask R-CNN for building extraction. The 9th International Seminar on Aerospace Science and Technology, 2023 Bogor, Indonesia. AIP Publishing.
Thakur, V., Doja, M., Ahmad, T. & Rawat, R. 2019. Cadastral boundary extraction and image classification using OBIA and machine learning for National Land Records Modernization Programme in India. J. Remote Sens. GIS, 8.
Wagner, F. H., Dalagnol, R., Tarabalka, Y., Segantine, T. Y., Thomé, R. & Hirye, M. C. 2020. U-net-id, an instance segmentation model for building extraction from satellite images—case study in the joanópolis city, brazil. Remote Sensing, 12, 1544.
Wang, W., Shi, Y., Zhang, J., Hu, L., Li, S., He, D. & Liu, F. 2023. Traditional village building extraction based on improved Mask R-CNN: a case study of Beijing, China. Remote Sensing, 15, 2616.
Wang, X., Tian, M., Zhang, Z., He, K., Wang, S., Liu, Y. & Dong, Y. 2024. SDSNet: Building Extraction in High-Resolution Remote Sensing Images Using a Deep Convolutional Network with Cross-Layer Feature Information Interaction Filtering. Remote Sensing, 16, 169.
Xia, L., Zhang, X., Zhang, J., Yang, H. & Chen, T. 2021. Building Extraction from Very-High-Resolution Remote Sensing Images Using Semi-Supervised Semantic Edge Detection. Remote Sensing, 13, 2187.
Yan, G., Jing, H., Li, H., Guo, H. & He, S. 2023. Enhancing Building Segmentation in Remote Sensing Images: Advanced Multi-Scale Boundary Refinement with MBR-HRNet. Remote Sensing, 15, 3766.
Yu, Y., Wang, C., Kou, R., Wang, H., Yang, B., Xu, J. & Fu, Q. 2024. Enhancing Building Segmentation With Shadow-Aware Edge Perception. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 18, 1-12.
Zhang, L., Dong, R., Yuan, S., Li, W., Zheng, J. & Fu, H. 2021. Making Low-Resolution Satellite Images Reborn: A Deep Learning Approach for Super-Resolution Building Extraction. Remote Sensing, 13, 2872.
Zhang, L., Wu, J., Fan, Y., Gao, H. & Shao, Y. 2020. An efficient building extraction method from high spatial resolution remote sensing images based on improved mask R-CNN. Sensors, 20, 1465.
Zhang, Y. & Chi, M. 2020. Mask-R-FCN: A deep fusion network for semantic segmentation. IEEE Access, 8, 155753-155765.
Zhao, K., Kamran, M. & Sohn, G. 2020. Boundary Regularized Building Footprint Extraction from Satellite Images Using Deep Neural Networks. ISPRS Ann. Photogramm. Remote Sens. Spatial Inf. Sci., V-2-2020, 617-624.
Zorzi, S. & Fraundorfer, F. Regularization of building boundaries in satellite images using adversarial and regularized losses. IGARSS 2019-2019 IEEE International Geoscience and Remote Sensing Symposium, 2019. IEEE, 5140-5143.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2025 Rojgar Qarani Ismael, Haval Abduljabbar Sadeq

This work is licensed under a Creative Commons Attribution 4.0 International License.




