Application of Principal Component Analysis (PCA) and K-Means for Stratigraphic Unit Classification in Geochemical Samples from the Misti Volcano
Main Article Content
Abstract
This study presents a comprehensive analysis of the Pacheco stage tephra deposits from Misti Volcano, Arequipa, Peru, using dimensionality reduction techniques through Principal Component Analysis (PCA) and clustering via K-Means. Geochemical samples collected between 2015 and 2023 were analyzed, focusing on major and trace elements such as Ti, Fe, MgO, and SiO₂. PCA allowed for dimensionality reduction, revealing that the first two principal components explained 34.27% of the total variance, with variables such as Ti and Fe having the most significant influence. Subsequently, the K-Means algorithm identified four geochemically differentiated clusters, which were compared with known stratigraphic units, showing a notable correlation with units such as Ponche Gris and La Rosada. The results suggest that these clusters reflect variations in magmatic processes and eruptive phases, offering new insights into the geochemical evolution of Misti Volcano. Future work could include isotopic analysis and the integration of deep learning techniques to enhance the understanding of magmatic sources and their influence on volcanic stratigraphy.
Article Details

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
When an author creates an article and publishes it in a journal, the copyright passes to the journal as part of the publishing agreement. Therefore, the journal becomes the owner of the rights to reproduce, distribute and sell the article. The author retains some rights, such as the right to be recognized as the creator of the article and the right to use the article for his or her own scholarly or research purposes, unless otherwise agreed in the publication agreement.
How to Cite
References
Rollinson, H. R. (2014). Using geochemical data: evaluation, presentation, interpretation. Routledge. https://doi.org/10.4324/9781315845548
Winter, J. D. (2010). Principles of Igneous and Metamorphic Petrology. Pearson Education.
Jordan, M. I., & Mitchell, T. M. (2015). Machine learning: Trends, perspectives, and prospects. Science, 349(6245), 255-260. https://doi.org/10.1126/science.aaa8415
Jolliffe, I. T., & Cadima, J. (2016). Principal component analysis: a review and recent developments. Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences, 374(2065), 20150202. https://doi.org/10.1098/rsta.2015.0202
Hartigan, J. A., & Wong, M. A. (1979). Algorithm AS 136: A K-means clustering algorithm. Journal of the Royal Statistical Society: Series C (Applied Statistics), 28(1), 100-108. https://doi.org/10.2307/2346830
Jolliffe, I. T., & Cadima, J. (2016). Principal component analysis: a review and recent developments. Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences, 374(2065), 20150202. https://doi.org/10.1098/rsta.2015.0202
Olden, J. D., & Jackson, D. A. (2002). Illuminating the “black box”: a randomization approach for understanding variable contributions in artificial neural networks. Ecological Modelling, 154(1-2), 135-150. https://doi.org/10.1016/S0304-3800(02)00064-9
Feng, X., Min, X., Wang, J., & Guo, J. (2020). A comparison study of clustering methods for geo-chemical data analysis. Geochemistry, Geophysics, Geosystems, 21(7), e2020GC009274.
Jolliffe, I. T., & Cadima, J. (2016). Principal component analysis: a review and recent developments. Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences, 374(2065), 20150202. https://doi.org/10.1098/rsta.2015.0202
Olden, J. D., & Jackson, D. A. (2002). Illuminating the “black box”: a randomization approach for understanding variable contributions in artificial neural networks. Ecological Modelling, 154(1-2), 135-150. https://doi.org/10.1016/S0304-3800(02)00064-9
Feng, X., Min, X., Wang, J., & Guo, J. (2020). A comparison study of clustering methods for geo-chemical data analysis. Geochemistry, Geophysics, Geosystems, 21(7), e2020GC009274.
Rollinson, H. R. (2014). Using geochemical data: evaluation, presentation, interpretation. Routledge. https://doi.org/10.4324/9781315845548
Boggs, S. (2011). Principles of Sedimentology and Stratigraphy. Pearson Education.
Bindeman, I. N. (2008). Oxygen isotopes in mantle and crustal magmas as revealed by single crystal analysis. Reviews in Min-eralogy and Geochemistry, 69(1), 445-478. https://doi.org/10.2138/rmg.2008.69.12
Lecun, Y., Bengio, Y., & Hinton, G. (2015). Deep learning. Na-ture, 521(7553), 436-444. https://doi.org/10.1038/nature14539
Caldeira, K., Hoffmann, N., & Rudiger, M. (2020). Using deep learning for volcanic eruption prediction. Geophysical Research Letters, 47(3), e2019GL086650