Web system for the objective evaluation of the sensory quality of chocolate using machine learning
Main Article Content
Abstract
This study introduces the design, implementation, and validation of a web-based platform, developed in Python, for the objective assessment of chocolate’s sensory quality utilizing state-of-the-art machine learning techniques. The intrinsic subjectivity in sensory evaluation—arising from heterogeneous consumer and expert perceptions—poses a significant challenge to the chocolate industry. Addressing this limitation, the proposed system integrates structured data on ingredients, geographic origin, and sensory descriptors, augmented with evaluative scores from both consumers and connoisseurs. A predictive model based on the XGBoost algorithm was trained and rigorously validated, achieving a root mean square error (RMSE) of 0.3867, outperforming alternative regression approaches in predictive accuracy. The platform enables users to input new chocolate profiles, receive automated quality predictions, contribute evaluations, and access in-depth analytical insights. The dataset comprises over 1,700 chocolate samples rated by industry experts across diverse origins. The full source code and dataset are openly available at: https://bit.ly/4ci1P4Y
Article Details

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
When an author creates an article and publishes it in a journal, the copyright passes to the journal as part of the publishing agreement. Therefore, the journal becomes the owner of the rights to reproduce, distribute and sell the article. The author retains some rights, such as the right to be recognized as the creator of the article and the right to use the article for his or her own scholarly or research purposes, unless otherwise agreed in the publication agreement.
How to Cite
References
R. Tatman, «Kaggle,» [En línea]. Available: https://www.kaggle.com/datasets/rtatman/chocolate-bar-ratings. [Último acceso: 2024].
International Cocoa Organization, «ICCO,» [En línea]. Available: https://www.icco.org/august-2024-quarterly-bulletin-of-cocoa-statistics/. [Último acceso: 4 Septiembre 2024].
J. Lee y J. An, «Consumers’ sensory perception homogeneity and liking of chocolate,» Food Quality and Preference, vol. 118, p. 105178, 2024. https://doi.org/10.1016/j.foodqual.2024.105178
P. Yolci Omeroglu y T. Ozdal, «Fatty acid composition of sweet bakery goods and chocolate products and evaluation of overall nu-tritional,» Journal of Food Composition and Analysis, vol. 88, p. 103438, 2020. https://doi.org/10.1016/j.jfca.2020.103438
X. Liu, J. Wang, H. Wang, Y. Huang y Z. Ren, «Prediction of prunoi-deae fruit quality characteristics based on machine learning and spectral characteristic acquisition optimization,» Food Control, vol. 165, p. 110627, 2024. https://doi.org/10.1016/j.foodcont.2024.110627
G. Samaras, D. Bikos, P. Cann, M. Masen, Y. Hardalupas, J. Vieira, C. Hartmann y M. Charalambides, «A multiscale finite element analysis model for predicting the effect of micro-aeration on the fragmentation of chocolate during the first bite,» European Journal of Mechanics - A/Solids, vol. 104, p. 105221, 2024. https://doi.org/10.1016/j.euromechsol.2024.105221
S. J. Boegman, S. Carodenuto, S. Rebitt, H. Grant y B. Cisneros, «See-ing through transparency in the craft chocolate industry: The what, how, and why of cacao sourcing,» Journal of Agriculture and Food Research, vol. 14, p. 100739, 2023. https://doi.org/10.1016/j.jafr.2023.100739
N. Prakansamut, K. Adulpadungsak, S. Sonwai, K. Aryusuk y S. Lilitchan, «Application of functional oil blend-based oleogels as novel structured oil alternatives in chocolate spread,» LWT, vol. 203, p. 116322, 2024. https://doi.org/10.1016/j.lwt.2024.116322
C. González, E. V. Acosta, J. C. Mazo Rivas y D. A. Muñoz, «Phe-nomenological based model for the prediction of the structural changes during chocolate conching process,» Journal of Food Engi-neering, vol. 289, p. 110184, 2021. https://doi.org/10.1016/j.jfoodeng.2020.110184
J. Wagner, J. D. Wilkin, A. Szymkowiak y J. Grigor, «Sensory and affective response to chocolate differing in cocoa content: A TDS and facial electromyography approach,» Physiology & Behavior, vol. 270, p. 114308, 2023. https://doi.org/10.1016/j.physbeh.2023.114308
B. Le Révérend, I. Smart, P. Fryer y S. Bakalis, «Modelling the rapid cooling and casting of chocolate to predict phase behaviour,» Chem-ical Engineering Science, vol. 270, pp. 1077-1086, 2019. https://doi.org/10.1016/j.ces.2010.12.007
C. Gallery, S. Bourge y G. Agoda-Tandjawa, «Flow behaviors of multiple molten chocolate matrices: Appropriate curve fitting mod-els and impact of different types of surfactants,» Journal of Food En-gineering, vol. 363, p. 111780, 2024. https://doi.org/10.1016/j.jfoodeng.2023.111780
S. Kumar y I. Chong, «Correlation analysis to identify the effective data in machine learning: Prediction of depressive disorder and emotion states,» International Journal of Environmental Research and Public Health, vol. 15, nº 12, p. 2907, 2020. https://doi.org/10.3390/ijerph15122907
Y. Ren, Z. Lv, Z. Xu, Q. Wang y Z. Wang, «Slurry-ability mathemati-cal modeling of microwave-modified lignite,» Energy, vol. 281, p. 128143, 2023. https://doi.org/10.1016/j.energy.2023.128143
T. Chen y C. Guestrin, «XGBoost: A Scalable Tree Boosting System,» Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 785-794, 2016. https://doi.org/10.1145/2939672.2939785
C. E.J. Armstrong, J. Niimi, P. K. Boss, V. Pagay y D. W. Jeffery, «Use of Machine Learning with Fused Spectral Data for Prediction of Product Sensory Characteristics: The Case of Grape to Wine,» Foods, vol. 12, nº 4, p. 757, 2023. https://doi.org/10.3390/foods12040757
P. C. Ossani, D. F. Rossoni, M. Â. Cirillo y F. M. Borém, «Classifica-tion of specialty coffees using machine learning techniques,» Re-search, Society and Development, vol. 10, nº 5, 2021. https://doi.org/10.33448/rsd-v10i5.14732