Plot the ROC curves for a specific class with
four machine learning models
Figure 7: ROC curves for a Specific Class with 4 Machine
Learning Models (Red Wine Data) (Picture credit:
Original).
Figure 8: ROC curves for a Specific Class with 4 Machine
Learning Models (Picture credit: Original).
The larger the area under ROC curve, the better the
performance of the classifier. Based on the above
figure 7 and figure 8, we found that for a specific
class/category both in red and white wine dataset, RF
performs the best. XGBoost performs slightly worse
than RF, but better than SVC and MLP, which also
conforms to the comparison results summarized in
Table 1 and 2.
5 CONCLUSION
In this study, we conducted an in-depth analysis using
samples from both red and white wine datasets,
focusing on eleven physicochemical properties to
construct four machine learning models. The
classifiers' performance was assessed using various
metrics such as accuracy, precision, recall, F1 scores,
and ROC curves to provide a comprehensive
evaluation of the models. Through oversampling and
parameter tuning, we identified Random Forest as the
optimal machine learning model for both red and
white wine datasets, achieving an accuracy score of
90.71% and 93.01% respectively.
Furthermore, our results indicated that XGBoost
outperformed SVM and MLP in the remaining three
models, underscoring the importance of
oversampling in enhancing models' performance.
However, there are still opportunities for
improvement in this research. For instance, further
exploration of feature engineering techniques and
consideration of additional parameters during the
parameter adjustment process could lead to more
robust and accurate models. This highlights the
potential for future research to optimize and refine the
predictive capabilities of machine learning models in
the context of wine quality assessment.
REFERENCES
Zhang, Z., et al. Sensory Quality Evaluation and
Physicochemical Factor Analysis of Wine. Modern
Food 2022, (07), 202-206.
Cheng, Z., et al. A Grading Evaluation Model for Wine
Quality. Science & Technology Industry Parks 2018,
(03), 58.
Yao Y., et al. Research on Wine Evaluation Based on
LASSO Regression. Journal of Yunnan Agricultural
University (Natural Science), 2016,31(02):294-302.
Cortez P., Cerdeira A., Almeida F., et al. Modeling wine
preferences by data mining from physicochemical
properties. Decision Support Systems,2009,47(4):547-
553.
Xia, M., et al. Wine Quality Prediction Based on Support
Vector Machine. Manufacturing Automation 2020,
(05), 57-60.
Zhao, F., & Jiang, S. Application of GA-BP based on
Optimization in Wine Quality Prediction. Journal of
Harbin University of Commerce 2021, (03), 307-313.
Sun W. BP Model After Genetic Algorithm Dimensionality
Reduction Optimization and Wine Quality Prediction.
Journal of Shaoyang College (Natural Science Edition),
2017,14(01):23-29.
Jain, K., et al. Machine Learning-based Predictive
Modelling for the Enhancement of Wine Quality.
Scientific Reports,2023.
Zhang, Z., et al. Machine Learning Approaches for Wine
Quality Prediction. Journal of Food Engineering 2019,
(08), 123-129.
Cheng, Z., et al. Comparative Study of Wine Quality
Evaluation Methods. Food Science and Technology
2017, (06), 45-50.