Investigating the impact of missing value imputation methods on the prediction of diabetes using machine learning

Message:
Article Type:
Research/Original Article (دارای رتبه معتبر)
Abstract:
Diabetes poses significant challenges due to its prevalence and the potential consequences of inaccurate or delayed diagnosis. This study focuses on enhancing prediction reliability to mitigate such risks. Initially, it identifies diabetes-related factors through correlation analysis with the target variable and implements models to address missing data. Subsequently, various imputation methods including CART, GMM, and RFR are employed to evaluate these factors. Results from each imputation scenario inform the selection of the most effective method. The study then employs ensemble algorithms like AdaBoost, Bagging, Gradient Boosting, and RF to enhance classification model accuracy. Further refinement is achieved by optimizing hyper-parameters through grid search. Evaluation involves comparing model predictions with those of medical professionals to assess accuracy. The findings reveal superior performance of optimized machine learning models over human predictions, indicating potential for improved diagnosis accuracy and reduced medical errors. This research contributes to advancing predictive modeling in diabetes diagnosis, offering prospects for enhanced community health and reduced socioeconomic burdens.
Language:
English
Published:
Journal of Industrial and Systems Engineering, Volume:16 Issue: 3, Summer 2024
Pages:
30 to 62
https://magiran.com/p2827530  
سامانه نویسندگان
  • Jolai، Fariborz
    Corresponding Author (5)
    Jolai, Fariborz
    Full Professor Industrial Engineering, University of Tehran, تهران, Iran
اطلاعات نویسنده(گان) توسط ایشان ثبت و تکمیل شده‌است. برای مشاهده مشخصات و فهرست همه مطالب، صفحه رزومه را ببینید.
مقالات دیگری از این نویسنده (گان)