Penerapan GeoAI Berbasis Mask R-CNN untuk Deteksi Kendaraan pada Citra Orthophoto Kawasan Perkotaan
DOI:
https://doi.org/10.22487/ruang.v20i1.333Abstract
GeoAI technology, which integrates artificial intelligence with spatial analysis, offers a novel approach to extracting urban object information from high-resolution imagery. This study applies Mask R-CNN with a ResNet-50 backbone architecture to detect vehicle objects in orthophoto imagery derived from the processing of 100 UAV photographs over an urban area in Switzerland. A total of 80 vehicle objects were annotated and partitioned into training (70%), validation (15%), and testing (15%) datasets. Model evaluation was conducted using a multi-threshold Intersection over Union (IoU) approach at values of ≥0.5, ≥0.75, and ≥0.95, and analyzed through a confusion matrix alongside Precision, Recall, F1-score, and Mean Average Precision (mAP) metrics. The results demonstrate that the model achieved Precision and Recall scores of 1.00 at IoU ≥0.5; however, performance declined at stricter thresholds, with an aggregate mAP of 0.56, indicating moderate overall performance. These findings suggest that the model is effective for macro-spatial analytical needs such as vehicle count estimation and distribution mapping, yet remains insufficiently stable for applications requiring high geometric precision. Conceptually, this study underscores the importance of multi-threshold evaluation in the application of deep learning for urban spatial analysis, while demonstrating the potential of GeoAI integration in data-driven urban planning.
References
K. He, G. Gkioxari, P. Dollar, and R. Girshick, Mask R-CNN. 2017. doi: 10.1109/ICCV.2017.322.
Michael Batty, “Artificial intelligence and smart cities,” Environ. Plan. B Urban Anal. City Sci., vol. 45, no. 1, pp. 3–6, Jan. 2018, doi: 10.1177/2399808317751169.
R. Kitchin, “The ethics of smart cities and urban science,” Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences, vol. 374, no. 2083, p. 20160115, Dec. 2016, doi: 10.1098/rsta.2016.0115.
G. Mai et al., “Towards the next generation of Geospatial Artificial Intelligence,” International Journal of Applied Earth Observation and Geoinformation, vol. 136, p. 104368, Feb. 2025, doi: 10.1016/j.jag.2025.104368.
W. Li, “Artificial Intelligence in Earth Science: A GeoAI Perspective,” Journal of Geophysical Research: Machine Learning and Computation, vol. 2, no. 3, Sep. 2025, doi: 10.1029/2025JH000691.
T. Hoeser and C. Kuenzer, “Object Detection and Image Segmentation with Deep Learning on Earth Observation Data: A Review-Part I: Evolution and Recent Trends,” Remote Sens. (Basel)., vol. 12, no. 10, p. 1667, May 2020, doi: 10.3390/rs12101667.
P. N. Zivich and A. I. Naimi, “A primer on neural networks,” Am. J. Epidemiol., vol. 194, no. 6, pp. 1473–1475, Jun. 2025, doi: 10.1093/aje/kwae380.
A. Masood and K. Ahmad, “A review on emerging artificial intelligence (AI) techniques for air pollution forecasting: Fundamentals, application and performance,” J. Clean. Prod., vol. 322, p. 129072, Nov. 2021, doi: 10.1016/j.jclepro.2021.129072.
M. Madhiarasan and M. Louzazni, “Analysis of Artificial Neural Network: Architecture, Types, and Forecasting Applications,” Journal of Electrical and Computer Engineering, vol. 2022, pp. 1–23, Apr. 2022, doi: 10.1155/2022/5416722.
Y. , L. X. Zhang and J. Wang, “Real time object detection for urban management: Methods and applications.,” Journal of Urban Computing and Geospatial Intelligence, 2023.
J. Heaton, “Ian Goodfellow, Yoshua Bengio, and Aaron Courville: Deep learning,” Genet. Program. Evolvable Mach., vol. 19, no. 1–2, pp. 305–307, Jun. 2018, doi: 10.1007/s10710-017-9314-z.
A. G. Prasetyo and N. F. Anshafa, GeoAI & Deep Learning untuk Urban Mapping: Deteksi Otomatis Objek Perkotaan dengan ArcGIS Pro. 2025.
L. , W. Z. Zhu and Q. Zhang, “Satellite Image Segmentation Using Deep Learning Techniques for Geospatial Analysis,” ISPRS Journal of Photogrammetry and Remote Sensing, 2023.
L.-C. Chen, G. Papandreou, I. Kokkinos, K. Murphy, and A. L. Yuille, “DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 40, no. 4, pp. 834–848, Apr. 2018, doi: 10.1109/TPAMI.2017.2699184.
M. Everingham, L. Van Gool, C. Williams, J. Winn, and A. Zisserman, “The Pascal Visual Object Classes (VOC) challenge,” Int. J. Comput. Vis., vol. 88, pp. 303–338, Jun. 2010, doi: 10.1007/s11263-009-0275-4.
X. Zhang, Y. Zhou, and J. Luo, “Deep learning for processing and analysis of remote sensing big data: a technical review,” Big Earth Data, vol. 6, no. 4, pp. 527–560, Oct. 2022, doi: 10.1080/20964471.2021.1964879.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2026 Dita Septyana, Budi Andresi, Nadine Sandra Agustina

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
Authors who publish with Ruang: Jurnal Arsitektur agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License (CC BY-SA 4.0) that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work.

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.





