Application of Principal Component Analysis (PCA) and K-Means for Stratigraphic Unit Classification in Geochemical Samples from the Misti Volcano

Main Article Content

Mary Luz Nina-Palacios
Ernesto Nayer Tumi-Figueroa
Hugo Ticona-Salluca

Abstract

This study presents a comprehensive analysis of the Pacheco stage tephra deposits from Misti Volcano, Arequipa, Peru, using dimensionality reduction techniques through Principal Component Analysis (PCA) and clustering via K-Means. Geochemical samples collected between 2015 and 2023 were analyzed, focusing on major and trace elements such as Ti, Fe, MgO, and SiO₂. PCA allowed for dimensionality reduction, revealing that the first two principal components explained 34.27% of the total variance, with variables such as Ti and Fe having the most significant influence. Subsequently, the K-Means algorithm identified four geochemically differentiated clusters, which were compared with known stratigraphic units, showing a notable correlation with units such as Ponche Gris and La Rosada. The results suggest that these clusters reflect variations in magmatic processes and eruptive phases, offering new insights into the geochemical evolution of Misti Volcano. Future work could include isotopic analysis and the integration of deep learning techniques to enhance the understanding of magmatic sources and their influence on volcanic stratigraphy. 

Article Details

How to Cite
Application of Principal Component Analysis (PCA) and K-Means for Stratigraphic Unit Classification in Geochemical Samples from the Misti Volcano. (2024). C&T Riqchary Science and Technology Research Magazine, 6(2), 14-20. https://doi.org/10.57166/riqchary.v6.n2.2024.124
Section
Artículos
Author Biographies

Mary Luz Nina-Palacios , National University of the Altiplano Puno

Mary Luz Nina Palacios, student of the Faculty of Statistical Engineering and Computer Science, Researcher at the Odin Laboratory (Optimization, Development and Research) of the National University of the Altiplano, Pronabec Fellow

Ernesto Nayer Tumi-Figueroa, National University of the Altiplano Puno

Ernesto Nayer Tumi Figueroa, Professor at the National University of the Altiplano, Director of the Institute for Research in Computer Science, RENACYT Researcher.

Hugo Ticona-Salluca, National University of the Altiplano Puno

Hugo Ticona Salluca, Bachelor of Statistical and Computer Engineering, Software Developer at the Odin Laboratory (Optimization, Development and Research) of the National University of the Altiplano, RENACYT Researcher.

How to Cite

Application of Principal Component Analysis (PCA) and K-Means for Stratigraphic Unit Classification in Geochemical Samples from the Misti Volcano. (2024). C&T Riqchary Science and Technology Research Magazine, 6(2), 14-20. https://doi.org/10.57166/riqchary.v6.n2.2024.124

References

Rollinson, H. R. (2014). Using geochemical data: evaluation, presentation, interpretation. Routledge. https://doi.org/10.4324/9781315845548

Winter, J. D. (2010). Principles of Igneous and Metamorphic Petrology. Pearson Education.

Jordan, M. I., & Mitchell, T. M. (2015). Machine learning: Trends, perspectives, and prospects. Science, 349(6245), 255-260. https://doi.org/10.1126/science.aaa8415

Jolliffe, I. T., & Cadima, J. (2016). Principal component analysis: a review and recent developments. Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences, 374(2065), 20150202. https://doi.org/10.1098/rsta.2015.0202

Hartigan, J. A., & Wong, M. A. (1979). Algorithm AS 136: A K-means clustering algorithm. Journal of the Royal Statistical Society: Series C (Applied Statistics), 28(1), 100-108. https://doi.org/10.2307/2346830

Jolliffe, I. T., & Cadima, J. (2016). Principal component analysis: a review and recent developments. Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences, 374(2065), 20150202. https://doi.org/10.1098/rsta.2015.0202

Olden, J. D., & Jackson, D. A. (2002). Illuminating the “black box”: a randomization approach for understanding variable contributions in artificial neural networks. Ecological Modelling, 154(1-2), 135-150. https://doi.org/10.1016/S0304-3800(02)00064-9

Feng, X., Min, X., Wang, J., & Guo, J. (2020). A comparison study of clustering methods for geo-chemical data analysis. Geochemistry, Geophysics, Geosystems, 21(7), e2020GC009274.

Jolliffe, I. T., & Cadima, J. (2016). Principal component analysis: a review and recent developments. Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences, 374(2065), 20150202. https://doi.org/10.1098/rsta.2015.0202

Olden, J. D., & Jackson, D. A. (2002). Illuminating the “black box”: a randomization approach for understanding variable contributions in artificial neural networks. Ecological Modelling, 154(1-2), 135-150. https://doi.org/10.1016/S0304-3800(02)00064-9

Feng, X., Min, X., Wang, J., & Guo, J. (2020). A comparison study of clustering methods for geo-chemical data analysis. Geochemistry, Geophysics, Geosystems, 21(7), e2020GC009274.

Rollinson, H. R. (2014). Using geochemical data: evaluation, presentation, interpretation. Routledge. https://doi.org/10.4324/9781315845548

Boggs, S. (2011). Principles of Sedimentology and Stratigraphy. Pearson Education.

Bindeman, I. N. (2008). Oxygen isotopes in mantle and crustal magmas as revealed by single crystal analysis. Reviews in Min-eralogy and Geochemistry, 69(1), 445-478. https://doi.org/10.2138/rmg.2008.69.12

Lecun, Y., Bengio, Y., & Hinton, G. (2015). Deep learning. Na-ture, 521(7553), 436-444. https://doi.org/10.1038/nature14539

Caldeira, K., Hoffmann, N., & Rudiger, M. (2020). Using deep learning for volcanic eruption prediction. Geophysical Research Letters, 47(3), e2019GL086650