Alur kerja pembelajaran mesin pada pemodelan spasial kerawanan longsor

https://doi.org/10.22146/mgi.95857

Guruh Samodra(1*)

(1) Faculty of Geography, Universitas Gadjah Mada, Yogyakarta.
(*) Corresponding Author

Abstract


Abstrak Salah satu instrumen pengurangan risiko bencana longsor adalah peta kerawanan longsor yang dihasilkan dari pemodelan spasial. Alur kerja pemodelan spasial kerawanan longsor menggunakan model pembelajaran mesin belum terakomodasi dalam Standar Nasional Indonesia (SNI) yang berlaku saat ini (tahun 2024). Penelitian ini berusaha menjelaskan variasi langkah-langkah dalam alur kerja pembelajaran mesin dan menunjukkan perbedaannya dengan alur kerja model statistik. Model statistik regresi logistik dan model pembelajaran mesin random forest dipilih untuk menjelaskan perbedaan alur kerja pemodelan spasial kerawanan longsor. Formulasi alur kerja pemodelan spasial kerawanan longsor diterapkan untuk memetakan kerawanan longsor di Kabupaten Pacitan. Pada tanggal 27-29 November 2017, 743 longsor terjadi di Kabupaten Pacitan dipicu oleh hujan yang sangat lebat akibat Siklon Tropis Cempaka. Kabupaten Pacitan merupakan salah satu wilayah rawan longsor di Provinsi Jawa Timur. Alur kerja pemodelan spasial kerawanan longsor terbagi atas beberapa langkah yaitu penyiapan data, pra-pemrosesan data (pre-processing), melatih dan menyetel model, memvalidasi model, pemodelan spasial, dan uji akurasi. Hasil uji akurasi model RF dan LR yang diterapkan di Kabupaten Pacitan masing-masing sebesar 0,75 dan 0,73. Penelitian ini diharapkan dapat memberikan masukan dalam penyusunan SNI pemetaan kerawanan longsor di masa mendatang serta dapat digunakan sebagai acuan dalam pemetaan kerawanan longsor secara umum di Indonesia.

 

Abstract One of the landslide risk reduction instruments is landslide susceptibility maps which can be produced by spatial modeling. The landslide susceptibility modeling based on machine learning workflows have not been accommodated in the current verison of Indonesian National Standard (SNI). This study seeks to explain the variation of machine learning workflows and show how they differ from statistical learning workflows. Logistic regression model and random forest machine learning models were selected to explain variations in landslide susceptibility modeling workflows. The modeling workflows were applied to map landslide susceptibility in Pacitan Regency. On 27-29 November 2017, 743 landslides occurred in Pacitan Regency triggered by very heavy rain due to Tropical Cyclone Cempaka. Pacitan Regency is one of the landslide-prone areas in East Java Province. The landslide susceptibility modeling workflow is divided into several steps, i.e. data preparation, data pre-processing, training and tuning the model, validating the model, spatial modeling, and accuracy testing. The accuracy test results of the RF and LR models applied in Pacitan Regency were 0.75 and 0.73 respectively. This research is expected to provide a benchmark for landslide susceptibility mapping in Indonesia.

 





References

Abraham, M. T., Satyam, N., Lokesh, R., Pradhan, B., & Alamri, A. (2021). Factors Affecting Landslide Susceptibility Mapping: Assessing the Influence of Different Machine Learning Approaches, Sampling Strategies and Data Splitting. Land 2021, Vol. 10, Page 989, 10(9), 989. https://doi.org/10.3390/LAND10090989

Agresti, A. (2013). Categorical Data Analysis. John Wiley & Sons, Ltd.

Amato, F., López, A., Peña-Méndez, E. M., Vaňhara, P., Hampl, A., & Havel, J. (2013). Artificial neural networks in medical diagnosis. Journal of Applied Biomedicine, 11(2), 47–58. https://doi.org/10.2478/V10136-012-0031-X

Badan Standardisasi Nasional. (2016). Standar nasional Indonesia 8291:2016 Penyusunan dan Penentuan Zona Kerentanan Gerakan Tanah.

Beven, K. J., & Kirkby, M. J. (1979). A physically based, variable contributing area model of basin hydrology. Hydrological Sciences Bulletin, 24(1), 43–69. https://doi.org/10.1080/02626667909491834

Biau, G., & Scornet, E. (2016). A random forest guided tour. Test, 25(2), 197–227. https://doi.org/10.1007/S11749-016-0481-7/METRICS

Breiman, L. (2001). Random forests. Machine Learning, 45(1), 5–32. https://doi.org/10.1023/A:1010933404324

Cascini, L. (2008). Applicability of landslide susceptibility and hazard zoning at different scales. Engineering Geology, 102(3–4), 164–177. https://doi.org/10.1016/j.enggeo.2008.03.016

Catani, F., Lagomarsino, D., Segoni, S., & Tofani, V. (2013). Landslide susceptibility estimation by random forests technique: Sensitivity and scaling issues. Natural Hazards and Earth System Sciences, 13(11), 2815–2831. https://doi.org/10.5194/NHESS-13-2815-2013

Chen, W., Xie, X., Peng, J., Shahabi, H., Hong, H., Bui, D. T., Duan, Z., Li, S., & Zhu, A. X. (2018). GIS-based landslide susceptibility evaluation using a novel hybrid integration approach of bivariate statistical based random forest method. CATENA, 164, 135–149. https://doi.org/10.1016/J.CATENA.2018.01.012

Conrad, O., Bechtel, B., Bock, M., Dietrich, H., Fischer, E., Gerlitz, L., Wehberg, J., Wichmann, V., & Böhner, J. (2015). System for Automated Geoscientific Analyses (SAGA) v. 2.1.4. Geoscientific Model Development, 8(7), 1991–2007. https://doi.org/10.5194/GMD-8-1991-2015

Corominas, J., & Moya, J. (2008). A review of assessing landslide frequency for hazard zoning purposes. Engineering Geology, 102(3–4), 193–213.

Cutler, D. R., Edwards, T. C., Beard, K. H., Cutler, A., Hess, K. T., Gibson, J., & Lawler, J. J. (2007). Random forests for classification in ecology. Ecology, 88(11), 2783–2792. https://doi.org/10.1890/07-0539.1

Fell, R., Corominas, J., Bonnard, C., Cascini, L., Leroi, E., & Savage, W. Z. (2008). Guidelines for landslide susceptibility, hazard and risk zoning for land use planning. Engineering Geology, 102(3–4), 85–98.

Goetz, J. N., Brenning, A., Petschko, H., & Leopold, P. (2015). Evaluating machine learning and statistical prediction techniques for landslide susceptibility modeling. Computers & Geosciences, 81, 1–11. https://doi.org/10.1016/J.CAGEO.2015.04.007

Guyon, I., & Elisseeff, A. (2003). An Introduction to Variable and Feature Selection. Journal of Machine Learning Research, 3, 1157–1182.

Haryono, E., & Day, M. (2004). Landform Differentiation within The Gunung Kidul Kegelkarst, Java, Indonesia. Journal of Cave and Karst Studies, 66(2), 62–69. https://caves.org/pub/journal/PDF/V66/v66n2-Haryono.pdf

Hastie, T., Tibshirani, R., & Friedman, J. (2009). The Elements of Statistical Learning. https://doi.org/10.1007/978-0-387-84858-7

Hussin, H. Y., Zumpano, V., Reichenbach, P., Sterlacchini, S., Micu, M., van Westen, C., & Bălteanu, D. (2016). Different landslide sampling strategies in a grid-based bi-variate statistical susceptibility model. Geomorphology, 253, 508–523. https://doi.org/10.1016/J.GEOMORPH.2015.10.030

Kuhn, M., & Wickham, H. (2020). Tidymodels: a collection of packages for modeling and machine learning using tidyverse principles. https://www.tidymodels.org

Liaw, A., & Wiener, M. (2002). Classification and Regression by randomForest. R. News, 2(3), 18–22.

Lovelace, R., Nowosad, J., & Münchow, J. (2020). Geocomputation with R (1st ed.). Chapman and Hall/CRC Press.

Menard, S. (2002). Applied Logistic Regression Analysis. Applied Logistic Regression Analysis. https://doi.org/10.4135/9781412983433

Moore, I. D., Grayson, R. B., & Ladson, A. R. (1991). Digital terrain modelling: A review of hydrological, geomorphological, and biological applications. Hydrological Processes, 5(1), 3–30. https://doi.org/10.1002/hyp.3360050103

O’Brien, R. M. (2007). A caution regarding rules of thumb for variance inflation factors. Quality and Quantity, 41(5), 673–690. https://doi.org/10.1007/s11135-006-9018-6

Probst, P., Wright, M. N., & Boulesteix, A.-L. (2019). Hyperparameters and tuning strategies for random forest. Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, 9(3), e1301. https://doi.org/10.1002/WIDM.1301

Quinn, P., Beven, K., Chevallier, P., & Planchon, O. (1991). The prediction of hillslope flow paths for distributed hydrological modelling using digital terrain models. Hydrological Processes, 5(1), 59–79. https://doi.org/10.1002/hyp.3360050106

R Core Team. (2021). R: a Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria. https://www.r-project.org/

Samodra, G. (2023). Teknologi Geospasial Inventarisasi Longsor (Eds Pertam). Gadjah Mada University Press.

Samodra, G., Chen, G., Sartohadi, J., & Kasama, K. (2017). Comparing data-driven landslide susceptibility models based on participatory landslide inventory mapping in Purwosari area, Yogyakarta, Java. Environmental Earth Sciences, 76(4), 1–19. https://doi.org/10.1007/s12665-017-6475-2

Samodra, G., Ngadisih, N., Malawani, M. N., Mardiatno, D., Cahyadi, A., & Nugroho, F. S. (2020). Frequency–magnitude of landslides affected by the 27–29 November 2017 Tropical Cyclone Cempaka in Pacitan, East Java. Journal of Mountain Science, 17(4), 773–786. https://doi.org/10.1007/s11629-019-5734-y

Schuster, R. L., & Kockelman, W. J. (1996). Principles of landslide hazard reduction. In A. K. Turner & R. L. Schuster (Eds.), Landslides: Investigation and Mitigation (pp. 91–105). National Academy Press. National Research Council, Transportation Research Safety Board Special Report.

Smyth, H. R., Hall, R., & Nichols, G. J. (2008). Cenozoic volcanic arc history of East Java, Indonesia: The stratigraphic record of eruptions on an active continental margin. Special Paper of the Geological Society of America, 436, 199–222. https://doi.org/10.1130/2008.2436(10)

Soeters, R., & Westen, C. J. van. (1996). Slope instability recognition, analysis, and zonation. In A. K. Turner & R.L. Schuster (Eds.), Landslides, investigation and mitigation ( Transportation Research Board, National Research Council, Special Report ; 247) (pp. 129–177). National Academy Press.

Steger, S., Brenning, A., Bell, R., Petschko, H., & Glade, T. (2016). Exploring discrepancies between quantitative validation results and the geomorphic plausibility of statistical landslide susceptibility maps. Geomorphology, 262, 8–23. https://doi.org/10.1016/j.geomorph.2016.03.015

van Westen, C. J., Castellanos, E., & Kuriakose, S. L. (2008). Spatial data for landslide susceptibility, hazard, and vulnerability assessment: An overview. Engineering Geology, 102(3–4), 112–131. https://doi.org/10.1016/J.ENGGEO.2008.03.010



DOI: https://doi.org/10.22146/mgi.95857

Article Metrics

Abstract views : 444

Refbacks

  • There are currently no refbacks.




Copyright (c) 2024 Guruh Samodra

Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.


 

Accredited Journal, Based on Decree of the Minister of Research, Technology and Higher Education, Republic of Indonesia Number 164/E/KPT/2021

Volume 35 No 2 the Year 2021 for Volume 39 No 1 the Year 2025

ISSN  0215-1790 (print) ISSN 2540-945X  (online)

 

website statistics Statistik MGI