Skip to main content

Predicting chronic kidney disease progression with artificial intelligence

Abstract

Background

The use of tools that allow estimation of the probability of progression of chronic kidney disease (CKD) to advanced stages has not yet achieved significant practical importance in clinical setting. This study aimed to develop and validate a machine learning-based model for predicting the need for renal replacement therapy (RRT) and disease progression for patients with stage 3–5 CKD.

Methods

This was a retrospective, closed cohort, observational study. Patients with CKD affiliated with a private insurer with five-year follow-up data were selected. Demographic, clinical, and laboratory variables were included, and the models were developed based on machine learning methods. The outcomes were CKD progression, a significant decrease in the estimated glomerular filtration rate (eGFR), and the need for RRT.

Results

Three prediction models were developed—Model 1 (risk at 4.5 years, n = 1446) with a F1 of 0.82, 0.53, and 0.55 for RRT, stage progression, and reduction in the eGFR, respectively,— Model 2 (time- to-event, n = 2143) with a C-index of 0.89, 0.67, and 0.67 for RRT, stage progression, reduction in the eGFR, respectively, and Model 3 (reduced Model 2) with C-index = 0.68, 0.68 and 0.88, for RRT, stage progression, reduction in the eGFR, respectively.

Conclusion

The time-to-event model performed well in predicting the three outcomes of CKD progression at five years. This model can be useful for predicting the onset and time of occurrence of the outcomes of interest in the population with established CKD.

Peer Review reports

Introduction

Chronic Kidney Disease (CKD) is a global public health problem [1,2,3]. In 2017, the global prevalence was 9.1%, with approximately 700 million cases [4]. In Colombia, it has been estimated a gross and underdiagnosed prevalence of 1.54 CKD cases per 100 inhabitants in 2022 [5]. Primary and secondary prevention are the mainstay of treatment, and management according to the risk profile is a priority. However, according to the United States Renal Data System (USRDS) report, approximately 35.4% of patients with CKD are referred late to interdisciplinary programs, probably due to failure in adequate risk profile classification [6, 7]. During the last decade, several prediction models to estimate the probability of CKD progression, with predictors such as age, estimated glomerular filtration rate (eGFR), serum albumin level, and the presence of comorbidities, are being used. These models usually predict CKD progression through eGFR loss or need for RRT (dialysis or kidney transplant).[8] However, the models available show some limitations, such as lack of estimation of the competing risk of death or non-fatal cardiovascular disease associated with disease progression [9], lack of validation in reference centers for patients with CKD, significant predominance of an ethnic group in the cohorts used [10], and low rates of outcomes such as need of RRT [11]. Prediction models are usually built using logistic regression or Cox proportional hazards, and recently, some have used artificial intelligence methods, such as neural networks and random forests, among others [12].

One of the most validated models around the world, ESKD, have an excellent performance, with a C statistic of 0.917 (95% confidence interval [CI], 0.901–0.933; P < 0.001) in the development cohort, and 0.841 (95% CI, 0.825–0.857) in the validation cohort [8, 9]. No predictive models have been developed or validated in Colombia. The primary rationale for developing a new model in our context stems from the fact that current prediction models include variables present in fewer than 50% of our patients (such as albuminuria, phosphorus, bicarbonate). [13] These are often substituted with qualitative measures of proteinuria, proteinuria in a 24-hour urine sample, or indirectly with lipid profiles or blood albumin levels. [5] Constructing a model based on the available variables is imperative for our country and may be applicable to other low-income countries. Therefore, this study aimed to develop and validate a predictive model for the risk of progression and the need for RRT in patients with stage 3–5 CKD, which may be clinically useful for assisting our country’s healthcare system.

Methods

Design and population

This was a retrospective, closed cohort, observational study. The cohort was selected based on the electronic medical records and clinical laboratory data of patients affiliated with a private health insurer in Colombia (EPS Sanitas). This register correspond to all patients treated for chronic kidney disease. In the year of selection of the cohort, it had approximately 2.5 million members and a geographical distribution mostly represented by the country’s capital, with 48.5% of the population, followed by the central and northern regions with 13.8% and 12.4%, respectively, including patients with CKD both in primary care units as well as in high complexity centers.

Patients with a confirmed diagnosis of CKD in stages 3–5 were included, and a 5- year follow-up from 2013 to 2018 was implemented. Confirmed diagnosis of CKD was defined as a decrease in eGFR (between 60 and 15 mL/min/1.73 m2) or albuminuria (24-h urine albumin excretion rate > 30 mg/24 h or urine albumin/creatinine ratio > 30 mg/g [3 mg/mmol]) of any etiology for more than 3 months. eGFR was calculated using the Chronic Kidney Disease Epidemiology Collaboration (CKD-EPI) 2009 formula based on serum creatinine, sex, and the age of the research subject.13 Pregnant patients and patients who met the clinical criteria for starting RRT (dialysis or transplant) or those who were already receiving it at the start of follow-up were excluded.

Outcomes

The outcomes evaluated were as follows: (1) CKD progression: defined as progression in the stages of this condition based on a decrease in the eGFR; (2) significant decrease in the eGFR: defined as a progression to greater than 25% reduction in the eGFR; and (3) indication for RRT: defined as the need for dialysis and/or kidney transplant.

Predictors

The predictors that were initially considered according to the importance reported in the literature are shown in the supplementary data (Supplementary data, Table S1). Physical examination and laboratory test predictors were measured within a time window of ± 120 days with respect to the initial eGFR. Predictors were selected through a heuristic factor; based on information availability for at least 50% of the patients, the combination of these factors that keeps the greatest number of patients is studied and selected. This approach lies mainly in the availability of information on clinical results and rules out the use of data imputation due to confidence intervals could be artificially narrow. Predictors based on medical history (comorbidities) were identified when recorded on previous dates or on the same date in which the eGFR was obtained, and cases in which their incidence in the samples was less than 1% were excluded.

Three predictive models were developed using two different statistical approaches: Model 1, risk analysis at 4.5 years; Model 2, time-to-event analysis; and Model 3, time-to-event analysis with reduced variables. Model 3 corresponds to a reduced version of Model 2 predictors, which were obtained after implementing a validation in an external cohort.

Population size and selection

Patients with at least one eGFR record of less than 60 mL/min/1.73 m2 were selected from a sample of 21,356 patients with a confirmed diagnosis of CKD identified in the institutional registry (Fig. 1). Next, the presence of at least two eGFR measurements during follow-up and the availability of complete data in relation to the selected predictors were verified. For Model 1, patients with a eGFR measurement in the first year of follow-up (the lowest value) and at 4.5 years were selected, with a final population of 1,466 patients. For Models 2 and 3, patients with the first eGFR measurement in 2013 (beginning of the cohort) were included, for a total of 2,143 subjects. Sensitivity analyses for clinical and laboratory predictors were performed for each model (Supplementary data, Tables S1 and S3).

Fig. 1
figure 1

Selection of the population for Models 1 and 2

Model development and validation

The following two strategies were used for the selection of predictors: (1) A review of the linear correlation between the same predictors and (2) An evaluation of the level of importance for prediction of each one through an engineering process using the Recursive Feature Elimination (RFE) algorithm. The coefficients of each predictor and subsets of predictors with optimal performance were estimated for each outcome of interest. Performance, according to this algorithm, was estimated with the accuracy metric. No imputation procedure for missing data was performed. Qualitative variables with multiple categories were classified into subgroups of dichotomous predictors before this process, following the dummy variables strategy for optimal training of the base algorithm.

The predictors included in Models 1 and 2 were age, sex, marital status, geographical region of residence, socioeconomic status, initial eGFR in mL/min/1.73 m2, presence of diabetes mellitus, coronary heart disease, arterial hypertension, anemia, heart failure, cerebrovascular disease, rheumatoid arthritis, Non-steroidal anti-inflammatory drugs (NSAID) consumption, creatinine levels (mg/dL), hemoglobin (gr/dL), serum potassium (mEq/L), HDL cholesterol (mg/dL), LDL cholesterol (mg/dL), and triglycerides (mg/dL). In the case of Model 3, only age, sex, region of residence, diabetes mellitus, arterial hypertension, hemoglobin, creatinine, HDL cholesterol, LDL cholesterol, and baseline eGFR were considered because the external validation cohort relied only on these predictors.

Baseline eGFR was estimated taking as reference the minimum value among the measurements recorded in 2013. This same strategy was used to identify the final eGFR; we recorded the minimum measurement within a range of ± 120 days of the date corresponding to 4.5 years from the baseline eGFR. Qualitative variables are presented as absolute and relative frequencies. Quantitative variables are reported as medians and interquartile ranges (IQR).

Model 1

Models were generated for each outcome using the following three methods: logistic regression and two other methods based on machine learning called Neural Networks (NN) and Random Forests (RF). A black box method was used for the neural network model, which was robust for complex data patterns. Their performance was evaluated individually once the resulting models were generated and compared using F1, accuracy, sensitivity, and precision metrics.

Model 2

A follow-up mechanism (in months) was initially designed to establish the risk of appearance of the three outcomes of interest over time. For the outcomes, significant decrease in eGFR and CKD stage progression, the first eGFR measurement that showed this deterioration, or change in stage was selected. For the RRT outcome, the need to start replacement therapy was determined as the end point of follow-up. Three artificial intelligence methods were used for modeling: Cox Penalty (P-Cox), Random Survival Forest (RSF), and Gradient Boosting Model (GBM). The concordance index metric (C-index) was used to compare the results of these three approaches.

Model 3

As in model 2, the same follow-up design was used for the three outcomes. The same three survival models were estimated: P-Cox, RSF, and GBM. Furthermore, based on the purpose of this reduced model, an external validation was performed for a fully audited cohort, extracted using data from the Sanitas Renal Unit (URS) between 2019 and 2021. This validation cohort included 648 patients. (Supplementary data, Figure S1). The sample was divided into the following two groups for implementation and evaluation of the models: training (70%) and test (30%), and a cross-validation process was performed on the training set.

Development of the electronic calculator

Based on the model obtained, an electronic calculator with a graphical interface was developed on a web platform using the Extreme Programming software development process model for clinical use.

Model 3 was chosen to develop an electronic risk calculator given the optimal validation results and the data completeness challenges in electronic health records. Three endpoints (API) were formulated for each outcome (CKD progression, significant decrease in the eGFR, and RRT). These subroutines were integrated into a RESTful service, providing a response that includes the C-index model performance, follow-up time periods risk probabilities, and an encoded image for the survival curve. This information is retrieved and presented in the electronic calculator for clinical use.

Results

Model 1

The sociodemographic, clinical, and laboratory characteristics of the Model 1 cohort are shown in Table 1. There was a high proportion of female subjects, more than 90% of the population corresponded to the middle stratum, and the region of origin with the highest proportion corresponded to the country’s capital. The most frequent clinical history variables were the presence of arterial hypertension, consumption of NSAIDs, and diabetes mellitus.

Table 1 Sociodemographic, clinical, and laboratory characteristics for the population in Model 1

The cumulative incidence for the outcome of stage progression was 26.3% (95% CI 24.2–28.7); for decreased eGFR was 18.3% (95% CI 16.4–20.4); and for RRT was 3.8% (95% CI 2.9–4.9). The bivariate analysis between predictors and outcomes is shown in the supplementary data (Tables S4, S5 for Model 1; Tables S6, S7 for Model 2).

The selection of predictors according to the RFE algorithm for each outcome is shown in Fig. 2. The most relevant predictors were age, initial eGFR, creatinine, LDL, triglycerides, HDL, hemoglobin, potassium, and history of diabetes mellitus or arterial hypertension, especially for the outcomes of significant decrease in eGFR and stage progression.

The performance of the different models for risk prediction at 4.5 years based on artificial intelligence metrics are shown in Table 3. The NN and RF models showed a similar performance according to F1 for the estimation of significant decrease in eGFR and stage progression. Logistic regression showed better performance than NN and RF for the RRT.

Fig. 2
figure 2

Predictors in order of importance for each outcome according to the recursive feature elimination method

Model 2

The main sociodemographic, clinical, and laboratory characteristics of the time-to- event analysis model cohort are shown in Table 2. The 5-year incidence rate for stage progression was 48.3% (95% CI 46.2–50.5); for decreased eGFR was 37.4% (95% CI 35.4–39.5); and for RRT was 3.9% (95% CI 3.1–4.8). The survival curves according to the initial eGFR stage for the outcomes of interest are shown in Fig. 3. The probability of RRT was higher for subjects with stage 4 and 5 CKD. The performance of the estimated models for prediction based on time-to-event analysis is presented in Table 4. The model with the best performance for prediction of time-to-event was RSF according to the C-index. For RRT and significant decrease in eGFR, the three algorithms showed a similar performance, while for stage progression, the GBM showed the best performance.

Fig. 3
figure 3

Survival curves for outcomes according to Model 2

Table 2 Sociodemographic, clinical, and laboratory characteristics for the population in Model 2
Table 3 Performance of Model 1 according to the prediction algorithm
Table 4 Performance of time-to-event models for the outcomes of interest

Model 3

For this model, the sociodemographic, clinical, and laboratory characteristics were identical to those of the time-to-event analysis model cohort (Table 2), but omitting the predictors of marital status, socioeconomic status, coronary heart disease, anemia, heart failure, cerebrovascular disease, consumption of NSAIDs, serum potassium, and triglycerides.

The training results for the three estimated models are shown in Table 5. The GBM showed slightly better performance than the other models for all three outcomes. It was determined that the RRT outcome showed better results than the other outcomes in the three models.

Table 5 Performance of reduced time-to-event models for the outcomes of interest

Regarding the external validation with the URS cohort, the results of the C-index metric for the different models reduced from the validation data, compared against the test results (20% of 2,143 patients) from the initial cohort, are shown in Table 6. In this case, the GBM algorithm performed better for the outcomes of significant decrease in eGFR and RRT. In contrast, the RSF algorithm was slightly better than GBM in terms of CKD progression.

Table 6 External validation performance and test of the reduced time- to-event models for the outcomes of interest

Discussion

A model for predicting the progression of CKD to advanced stages and the need for RRT was developed and validated in this study. Risk at 4.5 years and time-to-event (survival analysis) in patients with CKD stages 3–5 were assessed based on data analysis using artificial intelligence tools. The proposed model showed adequate performance, allowing its systematic implementation in CKD clinical management programs, and guaranteeing its usefulness as part of the strategies that guide clinical decision making.

Publications of prediction models for chronic kidney disease have increased in recent years, and this has been analyzed in four recently published systematic reviews [14,15,16,17].However, skepticism still remains among clinicians regarding the performance and applicability of these models, apart from the fact that clinical practice standards are generic when it comes to defining a precise recommendation on this subject. For example, KDIGO recommends using prediction models for timely referral to RRT. However, it does not define how and when to use these tools [18].

In this context, it is important to have prediction tools that guide decision making, the management of prevention programs, and timely multidisciplinary intervention strategies. However, several of the published models include populations with different levels of disease severity and do not precisely define the outcomes and the time of disease progression in which they should be used [14]. Our study developed a prediction model that can be applied in patients with CKD stages 3–5, with a 5-year follow-up, determining the following as main outcomes: (i) progression of CKD stage based on the eGFR (ii) reduction greater than 25% in the eGFR compared to baseline, and (iii) onset of RRT (dialysis for more than 3 months or kidney transplant).

To minimize the risk of bias, a cohort of 2,143 patients distributed in three arms was included for each outcome. For the incidence of RRT, the entire cohort of 2,143 patients was included. For progression of CKD stage and significant decrease in eGFR, there were 2,060 patients, of which 1,035 and 802 presented the respective outcomes. Furthermore, the external validation process included a cohort of 648 patients. As it became evident during the cohort selection process, when working with real data, this study was contingent upon missing information, as is the case of alkaline phosphatase, where no data was found for 92.6% of the patients. Because these are precise clinical data, the use of imputation strategies was ruled out. On the other hand, as demonstrated by the performance metrics, the models that predict RRT could be showing data overfitting due to imbalance in the classes; there were very few patients who presented the outcome. This was especially evident in the external validation for RRT, where a C-index of 0.9518 was obtained, but this cohort only had two patients who showed this outcome.

The inclusion of a validation cohort increases the reliability of the models. This is how the kidney failure risk equation (KFRE) developed by Tangri et al. [7] has become the standard for comparison, given that the model has shown consistently good performance in patients with CKD stages 3–5 in several external validation studies with a low risk of bias. One of these validations included 31 multinational cohorts with a mean baseline eGFR of 46 mL/min/1.73 m2 and showed that the KFRE model has a high discrimination capacity and adequate calibration [15]. Another model validated with an external cohort and a low risk of bias is the Kaiser Permanente Northwest (KPNW) model [9].

Although some clear prognostic factors of this and other prediction models are the main ingredients of our model, the context of our country requires particular sociodemographic factors that are included here. Our cohort is consistent with the country’s reality, with an absence of relevant variables that limit the sample, and with greater participation of patients from the central region. Upon admission to the cohort, the largest participation included stage 3 patients, and the frequency RRT initiation decreased over time.

The prediction of CKD progression to advanced stages or admission to dialysis allows for the implementation of strategies for individualized treatment, control of risk factors, evaluation of population indicators, establishment of education management strategies, selection of renal support therapies, and preemptive renal transplantation [19].

Conventionally, risk factors related to CKD progression included in most prediction models have been demographic variables such as age, sex, and geographic origin, and clinical variables such as comorbidities, eGFR or its deterioration in the previous year, albuminuria, serum bicarbonate, albumin, calcium, hemoglobin, and phosphorus, among others [6].In recent publications describing other models proposed concomitantly with our own model, we found unconventional predictors such as biomarkers (CXCL12, NT-proBNP, NGAL, and troponin T) and the application of artificial intelligence methods [20,21,22].

Although CKD management programs in Colombia seem to be very well structured, they do not offer sub-specialized management programs for the entire population with CKD in terms of health policies. This is due to several reasons, in particular, a disproportion between the number of patients with CKD and the current number of nephrologists in the country. In their daily roles, these nephrologists cover different areas of work, such as critical care, hospital nephrology, kidney units, transplant groups, and outpatient and preventive services. This reveals an insufficient human resource, far from international standards (a nephrologist–patient ratio of approximately 1:2000 patients), requiring the support of non-nephrologists, trained and familiar with this group of patients. In general, these programs are configured based on an initial snapshot of the stage of the disease exclusively on the eGFR. It is possible to optimize the care of these patients with the implementation of the prediction models. Thus, a possible scenario can be proposed in which the care of stage 3 patients with a low risk of progression could continue in first level centers, improving the window of opportunity for the highly specialized care of patients classified showing a high risk of progression. This would lead to an improvement in cost-effectiveness indicators.

Among the strengths of our model, we can highlight the following: (1) the creation of a machine learning-based technological tool that makes it possible for non- specialized healthcare personnel to estimate the risk for patients. This is useful in prevention programs in the context of a limited specialized human resource; (2) the estimation of the risk of a significant decrease in eGFR as an additional outcome to stage progression benefits the early implementation of prevention strategies to avoid the deterioration of renal function, even within the same risk category; (3) the predictors were obtained from frequent registration data in the clinical follow-up records of this group of patients. The use of additional resources was not necessary for their application; (4) to date, no models have been developed in a population with sociodemographic characteristics like ours. This represents an opportunity to use a tool based on the context of our country and others in the Latin American region; (5) an external validation cohort was included, and the tool is designed to be implemented in a specific population of patients with stage 3–5 CKD; and (6) the translation of a mathematical model into a digital tool in the form of a calculator with a graphical, easy-to-use interface facilitates its use in these patient care scenarios.

This model has certain limitations as follows: (1) the follow-up period was 5 years, which limits the analysis of disease progression in some groups of patients; (2) there was no differentiation into racial groups, which affects the analysis of subgroups and progression according to this predictor, which is part of the tools frequently used to calculate the eGFR; (3) model development was based on a retrospective cohort, which reduced the availability of complete data in relation to the predictors of interest for the final analysis; (4) the population subgroup belonging to the middle-income socioeconomic stratum was overrepresented (90%), which is well above the percentages recorded in national evaluation surveys (45.5%) and which may imply more favorable socioeconomic contexts and determinants than those of the average Colombian population; 6) the percentage of representation of women was greater than 50%, which may not be related to the proportion of the population on dialysis or subjected to kidney transplant; and 7) the geographical distribution of the patients included in the sample was linked to the distribution of affiliates of the selected insurer, which does not have members in all regions of the country. This prevents the generalization of estimates to the entire national territory.

In conclusion, the developed model constitutes a tool to help manage the progression of CKD in terms of early intervention and optimization of available human resources.

Data availability

The datasets used and/or analyzed during the current study are available from the corresponding author on reasonable request.

References

  1. Schiepatti A, Remuzzi G. Chronic renal diseases as a public health problem: Epidemiology, social, and economic implications. Kidney Int. 2005;68(suppl 98):7–10.

    Article  Google Scholar 

  2. Levey AS, Atkins R, Coresh J, et al. Chronic kidney disease as a global public health problem: approaches and initiatives– a position statement from kidney disease improving global outcomes. Kidney Int. 2007;72:247–59. https://doi.org/10.1038/sj.ki.5002343.

    Article  CAS  PubMed  Google Scholar 

  3. Lopera-Medina MM. Chronic kidney disease in Colombia: Health needs and response of the General System of Social Security for Health. Rev Gerenc Y Polit Salud. 2016;15:212–33. https://doi.org/10.11144/Javeriana.rgyps15-30.ercc.

    Article  Google Scholar 

  4. Cockwell P, Fisher LA. The global burden of chronic kidney disease. Lancet. 2020;395:662–4. https://doi.org/10.1016/S0140-6736(19)32977-0.

    Article  PubMed  Google Scholar 

  5. Colombian Fund for High Cost Diseases -. High Cost Account (CAC); Situation for chronic kidney disease, arterial hypertension, and diabetes mellitus in Colombia 2022; Bogota D.C 2022. Published online 2023.

  6. Saran R, Li Y, Robinson B, Al E. Epidemiology of Kidney Disease in the United States. Am J Kidney Dis. 2016;67:s1–434. https://doi.org/10.1053/j.ajkd.2015.12.015. US Renal Data System 2015 Annual Data Report.

  7. Mendelssohn DC, Malmberg C, Hamandi B. An integrated review of unplanned dialysis initiation: reframing the terminology to suboptimal initiation. BMC Nephrol. 2009;10:1–8. https://doi.org/10.1186/1471-2369-10-22.

    Article  Google Scholar 

  8. Cao J, Singh K. Integrating risk prediction models into chronic kidney disease care. Curr Opin Nephrol Hypertens. 2020;29:339–45. https://doi.org/10.1097/MNH.0000000000000603.

    Article  PubMed  Google Scholar 

  9. Tangri N, Stevens LA, Griffith J, et al. A predictive model for progression of chronic kidney disease to kidney failure. JAMA. 2011;305:1553–9. https://doi.org/10.1001/jama.2011.451.

    Article  CAS  PubMed  Google Scholar 

  10. Halbesma N, Jansen DF, Heymans MW, et al. Development and validation of a general population renal risk score. Clin J Am Soc Nephrol. 2011;6:1731–8. https://doi.org/10.2215/CJN.08590910.

    Article  CAS  PubMed  Google Scholar 

  11. Schroeder EB, Yang X, Thorp ML, et al. Predicting 5-year risk of RRT in stage 3 or 4 CKD: development and external validation. Clin J Am Soc Nephrol. 2017;12:87–94. https://doi.org/10.2215/CJN.01290216.

    Article  CAS  PubMed  Google Scholar 

  12. Xiao J, Ding R, Xu X, et al. Comparison and development of machine learning tools in the prediction of chronic kidney disease progression. J Transl Med. 2019;17:1–13. https://doi.org/10.1186/s12967-019-1860-0.

    Article  Google Scholar 

  13. Levey AS, Stevens LA, Schmid CH, et al. A new equation to estimate glomerular filtration rate. Ann Intern Med. 2009;150:604–12. https://doi.org/10.7326/0003-4819-150-9-200905050-00006.

    Article  PubMed  PubMed Central  Google Scholar 

  14. Tangri N, Kitsios GD, Inker LA, et al. Risk prediction models for patients with chronic kidney disease. Ann Intern Med. 2014;158:596–603.

    Article  Google Scholar 

  15. Echouffo-Tcheugui JB, Kengne AP. Risk models to predict chronic kidney disease and its progression: a systematic review. PLoS Med. 2012;9. https://doi.org/10.1371/journal.pmed.1001344.

  16. Ramspek CL, de Jong Y, Dekker, et al. Towards the best kidney failure prediction tool: a systematic review and selection aid. Nephrol Dial Transpl. 2020;35:1527–38. https://doi.org/10.1093/ndt/gfz018.

    Article  Google Scholar 

  17. Tangri N, Grams ME, Levey AS, et al. Multinational assessment of accuracy of equations for predicting risk of kidney failure ameta-analysis. JAMA. 2016;315:164–74. https://doi.org/10.1001/jama.2015.18202.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  18. (KDIGO) 2012 clinical practice Guideline for the evaluation and management of chronic kidney disease (CKD) chap. 5: referral to specialists and models of care. Kidney Int Suppl. 2013;3:112–9. https://doi.org/10.1038/kisup.2012.68.

  19. KDIGO. 2012 clinical practice Guideline for the evaluation and management of chronic kidney disease (CKD) chap. 2: definition, identification, and prediction of CKD progression. Kidney Int Suppl. 2013;3:63–72. https://doi.org/10.1038/kisup.2012.65.

  20. Anderson AH, Xie D, Wang X, et al. Novel risk factors for progression of diabetic and nondiabetic CKD: findings from the chronic renal insufficiency cohort (CRIC) study. Am J Kidney Dis. 2021;77:56–e731. https://doi.org/10.1053/j.ajkd.2020.07.011.

    Article  CAS  PubMed  Google Scholar 

  21. Zacharias HU, Altenbuchinger M, Schultheiss UT et al. A predictive model for progression of CKD to kidney failure based on routine laboratory tests. Am J Kidney Dis. 2021;(Drks 00003971). https://doi.org/10.1053/j.ajkd.2021.05.018.

  22. Dai D, Alvarez PJ, Woods SD. A predictive model for progression of chronic kidney disease to kidney failure using a large administrative claims database. Clin Outcomes Res. 2021;13:475–486. doi:10.2147/CEOR.S313857.

Download references

Acknowledgements

The authors thank Crimson Interactive Pvt. Ltd. (Enago)– https://www.enago.com/es/ for their assistance in manuscript translation and editing.

Funding

This project was funded by Minciencias– Colombia, contract number 791–2018 and ID project 143580764304. The funders had no role in the design and conduct of the study; collection, management, analysis, and interpretation of the data; and preparation, review, or approval of the manuscript.

Author information

Authors and Affiliations

Authors

Contributions

MA Isaza-RugetProtocol/project development, Data collection or management, Data analysis, Manuscript writing/editing. N Yomayusa Protocol/project development, Data collection or management, Data analysis, Manuscript writing/editing. CA González Protocol/project development, Data collection or management, Data analysis, Manuscript writing/editing. C Alvarado- H Protocol/project development, Data collection or management, Data analysis, Manuscript writing/editing. FA de Oro V Protocol/project development, Data collection or management, Data analysis, Manuscript writing/editing A Cely Protocol/project development, Data collection or management, Data analysis, Manuscript writing/editing. J Murcia Protocol/project development, Data collection or management, Data analysis, Manuscript writing/editing A Gonzalez-Velez Protocol/project development, Data collection or management, Data analysis, Manuscript writing/editing A Robayo Protocol/project development, Data collection or management, Data analysis, Manuscript writing/editing CC Colmenares-Mejia Protocol/project development, Data collection or management, Data analysis, Manuscript writing/editingA Castillo Protocol/project development, Data collection or management, Data analysis, Manuscript writing/editing MI Conde Protocol/project development, Data collection or management, Data analysis, Manuscript writing/editing.

Corresponding author

Correspondence to Claudia C. Colmenares-Mejía.

Ethics declarations

Ethics approval and consent to participate

This study was reviewed and approved by the Ethics Committee of the Fundación Universitaria Sanitas (approbation number: CEIFUS 283 − 18), prior to the review of electronic data which were handled in accordance with the ethical standards of the international and national regulations regarding the handling of electronic medical record information. Additionally, this research was in accordance with the 1964 Helsinki Declaration and its later amendments or comparable ethical standards.

Informed consent

Was waived by Ethics Committee of the Fundación Universitaria Sanitas (approbation number: CEIFUS 283 − 18) as it was a retrospective study with data from medical electronic health records.

Consent for publication

Not applicable.

Competing interests

All authors declare that they have no conflicts of interest.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary Material 1

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Isaza-Ruget, M.A., Yomayusa, N., González, C.A. et al. Predicting chronic kidney disease progression with artificial intelligence. BMC Nephrol 25, 148 (2024). https://doi.org/10.1186/s12882-024-03545-7

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s12882-024-03545-7

Keywords