فهرست مطالب

Journal of Biostatistics and Epidemiology
Volume:9 Issue: 2, Spring 2023

  • تاریخ انتشار: 1402/10/17
  • تعداد عناوین: 9
|
  • Parvin Mirmiran, _ Hanieh Malmir, Zahra Bahadoran* Pages 147-172
    Introduction

    Nutrition Clinical Trials (NCTs) are pivotal in establishing causal links between nutritional interventions and chronic diseases. This review comprehensively examines prevalent clinical trial designs, emphasizing their strengths and limitations. The goal is to provide insights into the selection and optimization of these designs for dietary intervention studies.

    Methods

    Various study designs in NCTs are explored, including quasi-experimental designs, double-blind randomized placebo-controlled trials for nutrient/functional foods supplementation, community-based lifestyle interventions, pragmatic nutrition interventions, and field trial projects. The characteristics, advantages, and challenges of each design are discussed. Real examples are presented to illustrate how these designs can be tailored and optimized for dietary intervention studies.

    Results

    Parallel randomized clinical trials are acknowledged as the gold standard, despite requiring substantial sample sizes and having inherent limitations. Cross-over NCTs emerge as valuable for assessing temporary treatment effects while mitigating potential confounders and interpatient variability. However, they may not be suitable for acute diseases and progressive disorders, and attrition rates can be higher. Multi-arm randomized designs offer increased study power with a lower sample size but necessitate more intricate design, analysis, and result reporting.

    Conclusion

    In conclusion, each study design in NCTs comes with its set of strengths and limitations. The selection of an appropriate design should consider determinants and common considerations to provide robust evidence for establishing cause-and-effect associations or assessing the safety and efficacy of food products in nutrition research. This comprehensive understanding aids researchers in making informed choices when planning and conducting nutrition clinical trials.

    Keywords: Nutrition clinical trial, Quasi-experimentaldesign, Randomized clinical trial
  • Mojgan Akbarzadeh-Jahromi, _ Negar Taheri, Babak Dashtdar, Nasim Taheri, Fatemeh Abiri, Marjan Zare* Pages 173-188
    Introduction

    Human Papilloma Virus infection (HPV) high-risk genotypes are responsible for up to 70% of invasive cervical cancers. It was aimed to determine the national and provincial prevalence of the total HPV and its high-risk genotypes including HPV genotype 16 (HPV16) and HPV genotype 18 (HPV18), and HPV genotypes other than genotypes of 16 and 18 (HPV other genotypes) among Iranian healthy women.

    Methods

    Iran with 28 provinces locates at latitude and longitude of 32° 00' north and 53° 00' east. All Persian and English studies reporting HPV infection based on cervical specimens were selected through searching the PubMed, Magiran, Scopus, Irandoc databases, and Google Scholar research search engine. Sample size and event rates were used to compute the overall event rates and 95% confidence interval (95% C.I); Fixed or random effects model, heterogeneity indices including Q-statistics (p-value), and degree of heterogeneity (I2) were reported. The search was done up to February 29, 2022. Comprehensive Meta-analysis 2.2.064 and ArcGIS 10.8.2 software tools were used at a significance level of <0.05.

    Results

    The meta-analysis included nineteen studies with 258839 participants. The national meta-analysis resulted in a total HPV prevalence of 0.025 (95% C.I 0.016, 0.039); those of HPV16, HPV18, and HPV other genotypes were 0.032 (95% C.I 0.019, 0.051), 0.028 (95% C.I 0.019, 0.040), and 0.048 (95% C.I 0.033, 0.069), respectively. The provincial meta-analysis showed that the total HPV prevalence was highest in Zanjn and Kerman (0.323 and 0.240, respectively); that of HPV16 was highest in Boushehr and Khozestan (0.298 and 0.253, respectively); that of HPV18 was highest in Tehran (0.089) and that of HPV other genotypes was highest in Khozestan (0.542).

    Conclusion

    The current results would help policymakers and health managers accentuate on further implementation of screening strategies and health services in needier areas such as Zanjan, Kerma, Khozestan, and Tehran.

    Keywords: Women's health, HPV prevalence, HPV genotypes, Cervical cancer, Meta-analysis, Epidemiology, Iran
  • Jaleh Shoshtarian Malak, Samira Alsaeidi*, Fatemeh Haji Ali Asgari, Fahimeh Khedmatkon Pages 189-200
    Introduction

    Prediction of Wegener's granulomatosis diagnosis and relapse is a complex process. In this study, we applied machine learning algorithms to predict Wegener's granulomatosis relapse.

    Methods

    In this research, 189 patients admitted to Amiralam Hospital were studied and followed for approximately 2 years. Patient features included demographics, organ involvement, symptoms, and other clinical data. Different popular machine learning algorithms were applied for predicting Wegener's granulomatosis relapse, including Support Vector Machines, Random Forest, Gradient Boosting, and XGBoost algorithms. The prediction model performance was measured for the different candidate prediction algorithms using accuracy, precision, recall, and F1-measure. The selected prediction model performance was calculated based on different relapse rates and major relapse occurrence according to Birmingham Vasculitis Activity Score (BVAS) fields.

    Results

    Applying different machine learning algorithms, the XGBoost algorithm performed the best. The results indicated that the prediction model's performance increased when calculating higher relapse rate possibilities. The XGBoost model had 82% accuracy while predicting more than one relapse rate and 92% accuracy in predicting more than twice the relapse rate. We also calculated the SHAP value for the prediction model. The results indicated that Cr, BVAS, lymphocyte percentage, vitamin D, nose involvement, alkaline phosphatase, diagnosis age, white blood cell count, erythrocyte sedimentation rate, and initial nose presentation are the 10 most important features according to SHAP value.

    Conclusion

    In this study, we have developed Wegener's granulomatosis relapse prediction model using machine learning algorithms. We achieved reasonable precision and recall for early prediction and decisionmaking regarding Wegener's granulomatosis relapse.

    Keywords: Wegener's granulomatosisrelapse, Relapse prediction, Machine learning, Clinical decision-making, Xgboost algorithm, Birmingham vasculitisactivity score, Predictive modeling, Healthcare analytics, Autoimmune diseases, Precision medicine
  • Elham Abdalmaleki, Zhaleh Abdi, Saharnaz Sazgarnejad, _ Bahar Haghdoost, Elham Ahmadnezhad* Pages 201-215
    Introduction

    The primary health care (PHC) approach is widely acknowledged as a fundamental element in achieving universal health coverage (UHC) goals. Consequently, numerous countries have undertaken efforts to restructure their health systems based on PHC principles. This study aims to evaluate geographic disparities in essential maternal and child indicators provided at the PHC level, focusing on both crude and effective coverage. Additionally, it seeks to explore the association between effective coverage and health expenditures within the national and sub-national contexts of Iran.

    Methods

    This research employed a secondary analysis approach to investigate the spatial distribution of maternal and child health (MCH) indicators in Iran's provinces, utilizing the latest available data from the 2010 Demographic Health Survey (DHS). To provide a comprehensive understanding of MCH indicators, the study calculated composite indicators, crude, and effective coverage. The provinces' situations were compared using the median cut-off method. Additionally, the study examined the association between coverage indicators and total health expenditure per-capita.

    Results

    At the national level, the crude and the effective composite coverage were 89.56% and 77.22%, respectively. Also, the medians of composite crude and effective service coverage in the provinces were 90.25% and 77.62%, respectively. There was no significant difference between urban and rural areas.

    Conclusion

    This study has revealed a notable difference between the crude and effective service coverage of the selected MCH indicators. While the coverage of maternal services was generally higher than that of child services, there were significant geographic disparities in the coverage of key indicators of MCH services across provinces. Despite the provision of free services in rural areas, their coverage was not higher than that of urban areas. These findings suggest that PHC services in Iran are still far from achieving the desired coverage and UHC goals. Policymakers and stakeholders need to focus on addressing the gaps in effective coverage and geographic disparities to improve access to essential maternal and child health services and achieve UHC in Iran.

    Keywords: Universal health coverage, Effective coverage, Primary health care, Total health expenditure, Evidence-informed policymaking, Health policy, systemsresearch
  • Elangovan Arumugum *, Vasna Joshua Pages 216-226
    Introduction

    The HIV Sentinel Surveillance (HSS) conducted by National AIDS Control Organization (NACO) is the predominant data source for HIV estimations in India. While the HSS targets the key populations at risk of HIV infection, the National Family Health Survey (NFHS) measures the communitybased HIV prevalence. Improvised HIV estimates in India were attributed to the HIV prevalence data obtained from the NACO-HSS and NFHS.

    Methods

    Bayesian analysis was performed to determine the state-level prevalence of HIV among females in seven South Indian States. The analysis involved plotting the prior, likelihood, and posterior distributions, facilitating a visual assessment of the data. The HIV prevalence among females calculated from the NFHS (2015-16) survey data was used for prior distributions. HIV prevalence among pregnant women obtained from the HIV Sentinel Surveillance 2019 was used for likelihood. Bayesian analysis was performed using the R programming (RStudio 2022.02.0). A posterior probability distribution was obtained using the prior distribution and the likelihood by applying the Bayes theorem. Graphical representation was achieved through R's plotting functions. Kerala and Pondicherry were not included in the analysis due to zero or very low prevalence reported in both NFHS and HSS.

    Results

    The Bayesian estimates of HIV prevalence among females were 0.38 % [95% CI:0.29 - 0.47] in Andhra Pradesh, 0.28 [95% CI:0.23 - 0.35] in Karnataka, 0.27 [95% CI:0.20 - 0.34] Odisha, 0.27 % [95% CI:0.19 - 0.36] in Telangana and 0.19 [95% CI:0.15 - 0.24] in Tamil Nadu.

    Conclusion

    Bayesian techniques present a versatile and robust strategy for modelling and analysing HIVrelated data, offering a flexible and powerful approach to data analysis.

    Keywords: HIV, Prevalence, Sentinel surveillance, Bayesian analysis, India, Data modeling
  • Divya Vijithaswan Nair*, Rujuta Hadaye Pages 227-240
    Introduction

    With the presence of COVID 19, some countries also faced an increase in number of cases due to Monkeypox virus. The main aim of this research was to investigate whether it is possible to fit count data regression models to predict the daily incidence of Monkeypox confirmed cases.

    Methods

    In this study we have used two types of traditional count regression models like Poisson regression model and Negative binomial regression model using identity and logarithmic link function. Since our data was overdispersed, Negative binomial regression model with logarithmic link function fitted well as compared to other models. The parameters were estimated using SPSS, version 23.0.

    Results

    The Negative Binomial Regression model with logarithm function fits well to the data related to Monkeypox cases. Therefore, the model shows that majority of the countries like Brazil, Canada, France, Germany, Peru, Spain, United Kingdom and United States of America shows significant decrease in number of cases with respect to time. The prediction line was plotted using this model where the line predicts well about the daily Monkeypox cases reported by different countries.

    Conclusion

    From our study, we concluded that the count data regression model can be used widely to predict the incidence of any disease. The countries like Canada and Brazil have largest and smallest slope coefficient which shows maximum and minimum decrease in expected number of cases confirmed each day respectively.

    Keywords: Negative binomialregression model, Poisson regression model, Monkeypox, Akaike informationcriterion (AIC), Bayesian informationcriterion (BIC)
  • Gideon Addo *, Paul Poku Sampene Ossei, _ Bismark Amponsah Yeboah, William Gilbert Ayibor, Raphael Doh-Nani, _ Seidu Mohammed, Michael Obuobi, Roselyn Assor Appau Pages 241-256
    Introduction

    Hospital length of stay (LOS) remains a vital metric for assessing patient outcomes and healthcare resource utilization. Given the substantial financial impact of diagnosing and treating colorectal anomalies, coupled with an increased susceptibility to postoperative complications, it is crucial to understand the factors affecting LOS following colorectal surgery. Our primary objective was to investigate the preoperative, intraoperative, and postoperative risk factors that have substantial influence over LOS following a colorectal procedure.

    Methods

    This study analyzed data from a retrospective study of adults who underwent various colorectal surgeries (colostomy, ileostomy, small bowel resection, etc.) at Cleveland Clinic Foundation (January 2005 - December 2014). Predictor variables were categorized into preoperative (patient demographics, medical history, comorbidities, lifestyle factors), intraoperative, and postoperative factors. LOS was grouped into short-term (SLOS) (≤ 7 days), medium-term (MLOS) (8-30 days), and long-term (LLOS) (> 30 days) stays. Multinomial logistic regression models assessed predictor effects on LOS.

    Results

    Among the 7874 patients, 50.7% were females, with a minimum age of 20 years. SLOS were observed in 61.1%, MLOS in 37.6%, and LLOS in 1.3% of patients. Advanced age correlated with prolonged LOS, possibly due to age-related health challenges like weak immune systems. Coagulopathy, and fluid and electrolyte disorders raised MLOS and LLOS risk, likely due to complications like significant bleeding and electrolyte imbalances. Surgery duration predicted longer LOS, elevating LLOS and MLOS by 52% and 42%. Postoperative infections were associated to extended stays, possibly due to subsequent interventions, monitoring and recovery delays.

    Conclusion

    Our study revealed that key preoperative predictors of LOS included Age, coagulopathy, fluid and electrolyte disorders, severe weight loss, and drug abuse. Notably, intraoperative factors such as surgical approach (open vs laparoscopic) and surgery duration, alongside postoperative complications including superficial and serious infections, significantly influenced LOS. By incorporating these insights into the preoperative planning, clinicians could potentially develop tailored interventions to mitigate risk factors and enhance postoperative recovery, thus potentially reducing LOS and improving patient outcomes.

    Keywords: Colorectal surgery, Hospital length of stay, Preoperative factors, Intraoperative factors, Postoperative complications, Risk factors
  • Parisa Rezanejad-Asl, _ Farid Zayeri*, Abbas Hajifathali Pages 257-269
    Introduction

    The mixed effects logistic regression model is a common model for analysing correlated binary data as longitudinal data. The between and within subject variances are typically considered to be homogeneous but longitudinal data often show heterogeneity in these variances. This study proposes a Bayesian mixed effects location scale model to accommodate heteroscedasticity in binary data analysis.

    Methods

    This study was carried out in two stages; first, the simulation study was used to evaluate the accuracy of the proposed model with the Bayesian approach and then the proposed model was applied to a real data. In simulation study, the data were generated from the mixed effects location scale model with different correlations between the random location effect and random scale effect and different sample sizes. In order to evaluate the accuracy of the estimations, the Root Mean Square Error, bias and Coverage Probability were calculated and the deviance information criterion was used to select the appropriate model. At the end we utilized this model to analyse uric acid levels of patients with haematological disorders.

    Results

    The simulation results show the accuracy of model parameter estimates as well as the correlation between random location and scale effects. They also display that if a random scale effect is present in the data, it should be accounted for in model. Otherwise, the model is forced to assign the within subject variation due to these subject random effects to the error term. The results of real data are also in line with this. The odds of having normal UA levels increases by a factor of 26% per week. Due to the positive value of the covariance parameter, patients with higher mean of UA levels show higher variation in UA levels. Furthermore, the significance of the covariates in the between subject and within subject variances model, as well as the significance of the random scale variance determines the heterogeneity across subjects.

    Conclusion

    Bayesian mixed effects location scale model provides a useful tool for analysing correlated binary data with heteroscedasticity because it considers data correlation and modelling mean and variance simultaneously. Furthermore, it improves the accuracy of statistical inference in longitudinal studies compared to classic mixed effects models.

    Keywords: Correlated binary data, Heteroscedasticity, Bayesian mixed-effectsmodel, Variance modelling, Random scale effects, Longitudinal data analysis
  • Fereshteh Mokhtarpour, Mostafa Hosseini, Akram Yazdani, Mehdi Yaseri * Pages 270-286
    Introduction

    Quantile regression is a valuable alternative for survival data analysis, enabling flexible evaluations of covariate effects on survival outcomes with intuitive interpretations. It offers practical computation and reliability. However, challenges arise when applying quantile regression to censored data, particularly for upper quantiles. The minimum distance approach, utilizing dual-kernel estimation and the inverse cumulative distribution function, shows promise in addressing these challenges, especially with higher-dimensional covariates.

    Methods

    This study contrasts two methods within the realm of quantile linear regression for survival analysis: check-based modeling and the minimum distance approach. Effectiveness is assessed across various scenarios through comprehensive simulation.

    Results

    The simulation results showed that using the quantile regression model with the minimum distance approach reduces the percentage of root mean square error in parameter estimation compared to the quantile regression models based on the check loss function. Additionally, a larger sample size and reduced censoring percentage led to decreased root mean square error in parameter estimation.

    Conclusion

    The research highlights the benefits of using the minimum distance approach for quantile regression. It reduces errors, improves model predictions, captures patterns, and optimizes parameters even with complete data. However, this approach has limitations. The accuracy of estimated quantiles can be influenced by the choice of distance metric and weighting scheme. The assumption of independence between censoring mechanism and survival time may not hold in real-world scenarios. Additionally, dealing with large datasets can be computationally complex.

    Keywords: Quantile regression, Minimum distanceapproach, Survival, Check-based modeling, Inverse cumulativedistribution function