Skip to main content

Diagnosis model of early Pneumocystis jirovecii pneumonia based on convolutional neural network: a comparison with traditional PCR diagnostic method

Abstract

Background

Pneumocystis jirovecii pneumonia (PJP) is an interstitial pneumonia caused by pneumocystis jirovecii (PJ). The diagnosis of PJP primarily relies on the detection of the pathogen from lower respiratory tract specimens. However, it faces challenges such as difficulty in obtaining specimens and low detection rates. In the clinical diagnosis process, it is necessary to combine clinical symptoms, serological test results, chest Computed tomography (CT) images, molecular biology techniques, and metagenomics next-generation sequencing (mNGS) for comprehensive analysis.

Purpose

This study aims to overcome the limitations of traditional PJP diagnosis methods and develop a non-invasive, efficient, and accurate diagnostic approach for PJP. By using this method, patients can receive early diagnosis and treatment, effectively improving their prognosis.

Methods

We constructed an intelligent diagnostic model for PJP based on the different Convolutional Neural Networks. Firstly, we used the Convolutional Neural Network to extract CT image features from patients. Then, we fused the CT image features with clinical information features using a feature fusion function. Finally, the fused features were input into the classification network to obtain the patient's diagnosis result.

Results

In this study, for the diagnosis of PJP, the accuracy of the traditional PCR diagnostic method is 77.58%, while the mean accuracy of the optimal diagnostic model based on convolutional neural networks is 88.90%.

Conclusion

The accuracy of the diagnostic method proposed in this paper is 11.32% higher than that of the traditional PCR diagnostic method. The method proposed in this paper is an efficient, accurate, and non-invasive early diagnosis approach for PJP.

Peer Review reports

Introduction

Pneumocystis jirovecii (PJ) is a common opportunistic pathogen causing pulmonary infections and is one of the leading causes of high mortality in immunocompromised patients. Pneumocystis jirovecii pneumonia (PJP) occurs in 70–80% of patients with patients with human immunodeficiency virus (HIV) [1]. Over the past decade, the incidence of PJP in patients with HIV has decreased due to the widespread use of antiretroviral therapy. Statistics from the United States on opportunistic infections of PJ in acquired immune deficiency syndrome (AIDS) patients showed a five-fold reduction in incidence per 100,000 person-years between 2000 and 2010 [2]. These findings are supported by research from England, where the number of PJP cases in HIV-infected patients decreased by approximately half during the same period. However, in recent years, there has been an increasing number of non-HIV-infected patients with PJ infections. This includes patients with solid malignancies, solid organ transplant and hematopoietic stem cell transplant recipients, those who have undergone chemotherapy and molecular targeted therapy, patients receiving immunosuppressive therapy due to autoimmune and inflammatory conditions, and individuals with genetic or primary immunodeficiency disorders [3].

Due to the difficulty of culturing PJ in the laboratory, microscopic examination of the pathogen in respiratory specimens has become the gold standard for diagnosing PJP [4]. However, whether cysts and trophozoites of PJ can be observed under the microscope depends on the technical skills and experience of the examining physician, so the level of the observer has a significant impact on the detection rate and sensitivity. Additionally, non-HIV-infected patients with PJ infections typically have lower fungal loads compared to HIV-infected patients [5]. This lower fungal load can result in false-negative microscopic examination results. Currently, PJP diagnosis guidelines recommend a comprehensive analysis of host factors, microbiological evidence, chest CT findings, polymerase chain reaction(PCR), and detection of CD4+T cells, lactate dehydrogenase(LDH), and 1,3-β-d-glucan (BDG) in serum for HIV-positive patients with PJP, categorized by severity [6]. However, in clinical practice, PJP has a rapid onset and a poor prognosis. Early diagnosis is largely influenced by the treating physician's experience and judgment. Most non-HIV patients are already at a severe stage when diagnosed [7]. Without timely and effective treatment, the mortality rate can be as high as 30% to 59%, with even higher mortality rates (48% to 70%) among allogeneic hematopoietic stem cell transplant (allo-HSCT) recipients [8]. Therefore, there is an urgent need for an efficient, objective, and convenient method to diagnose PJP.

In recent years, deep learning has made significant progress in the field of medicine and has been shown to outperform traditional machine learning techniques. Convolutional Neural Networks (CNN) is an efficient deep learning method for image recognition, with powerful feature extraction capabilities. They can quickly analyze and process clinical data and medical images to provide diagnostic results. Many studies have reported that deep learning algorithms can improve the accuracy of diagnoses for various diseases, including gastric cancer, thyroid cancer, liver cancer, colorectal cancer, breast cancer, lung cancer, cervical cancer, skin cancer, cataracts, and diabetic retinopathy [9,10,11,12,13,14,15,16,17,18]. Recognizing the importance of clinical features and chest CT images in PJP diagnosis, we have developed a novel early diagnostic model for PJP based on CNN and evaluated its diagnostic performance for the first time.

Materials and methods

This section introduces the experimental methods for the entire article. Figure 1 shows the experimental flowchart of the study.

Fig. 1
figure 1

The experimental flowchart of the study

Firstly, we will identify all possible diagnostic indicators, clinical features, and chest CT imaging features related to PJP based on the literature and guidelines. Then, from these clinical features, we will select independently significant risk factors with statistical significance.

Next, We are using a Pneumocystis jirovecii DNA extraction kit to extract DNA from sputum and bronchoalveolar lavage fluid samples of both PJP and non-PJP patients in the laboratory. Subsequently, we amplify the target DNA fragments using a PCR thermal cycler. Then, we observe the amplification curve and Ct value of the products in the analysis software provided by the instrument to determine whether the PCR result is positive.

Finally, there is the CNN diagnostic method: first, the patient's CT images are read into CNN to extract CT image features. Then normalize the patient's risk factors to obtain clinical features. Due to the fact that the clinical features of patients are binary variables, we will mark positive clinical features as "1" and negative clinical features as "0". In addition, due to the possibility of missing clinical features in patients, we will mark missing clinical features as "2". Using the feature fusion function, the obtained CT image features are fused with clinical features to obtain the fused features. Finally, the fused features are read into the fully connected layer (FC) structure classification to obtain the final diagnostic result.

PCR diagnostic method

Pulmonary alveolar lavage fluid (BALF) or sputum was centrifuged at 13,000 rpm for 10 min. Sample DNA was extracted using a nucleic acid extraction kit produced by Hangzhou Dilan Biotechnology Co., Ltd., following the manufacturer's instructions. Subsequently, the target gene was amplified using the Dilan Biotechnology PJ nucleic acid detection kit, as shown in Fig. 2. The main reaction system consisted of 4 μL of extracted DNA sample, 4 μL of dNTP Mix, 1 μL of primers, 0.5 μL of probes, and 10.5 μL of distilled water (dd H2O). After amplification using a PCR machine, the results were analyzed based on a standard curve. The dNTP Mix is a premixed reagent containing dNTPs, Taq enzyme, UNG enzyme, MgCl2, and Buffer.

Fig. 2
figure 2

The PCR detection process of PJ DNA

In this study, the target gene was amplified using a real-time PCR instrument (Thermo Fisher Scientific, Applied Biosystems 7500, USA) under the following conditions: 37 °C for 3 min, 95 °C for 5 min. Then 95 °C for 15 s and 60 °C for 30 s by 40 cycles, as shown in Fig. 3. Subsequently, the Ct values of amplification curves for each sample were analyzed using ABI 7500 software v2.3. Finally, experimental results were interpreted according to the manufacturer's instructions. when the sample Ct values ≤ 36.00 with normal amplification curves and the internal reference Ct value < 40.00, the result was considered positive.

Fig. 3
figure 3

Process of target gene amplification

The CNN method

This paper proposes a method for diagnosing PJP using a CNN as the main framework. This study chose Densenet121 [19], Resnet50 [20], VGG19 [21] and Inception-V3 [22] network in convolutional neural networks as the core model structures. Taking Densenet121 network as an example. The Densenet121 network in the CNN is selected as the main framework. Compared to Traditional CNNs, the Densenet121 network has fewer parameters and is computationally more efficient [23].

Next, provide specific examples to introduce the structure and performance of Densenet121. Densenet121 consists of three dense convolutional blocks [24]. The first advantage of the dense connection module is that it has fewer parameters compared to traditional CNNs because it does not need to re-learn redundant feature maps [25]. Secondly, the way dense block connects features makes the propagation of features and gradients more efficient, making the network easier to train [26]. Each layer of the dense block can directly utilize the gradient of the loss function as well as the initial input information, which is like a form of implicit deep supervision, aiding in training deeper networks [27]. The vanishing gradient problem becomes more likely to occur as the network gets deeper, and the reason for this is the propagation of input and gradient information across many layers [28, 29]. This dense connection is equivalent to directly connecting the input information and loss function to each layer, which can mitigate the effect of the vanishing gradient phenomenon [30].

Figure 4 illustrates the workflow of this proposed method (Taking Densenet121 network as an example). Firstly, the patient's CT images are input into the DenseNet121 network, which extracts features from the CT images through operations like convolution and pooling [31]. Then, a feature fusion function, Concat, is used to concatenate the patient's CT image features with clinical information features [32]. Finally, the spliced features are input into the last fully connected layer (FC) for classification, then the final diagnostic result is obtained [33].

Fig. 4
figure 4

The workflow of the CNN method

Patients selection

The project collected clinical data of PJP at Zhujiang Hospital of Southern Medical University, from August 2022 to August 2023. This study strictly adhered to ethical rules and obtained approval from the hospital's Clinical Medical Center Ethics Committee. Since the study was retrospective, informed consent from patients was not required.

The patient in our study met all of the following inclusion criteria: 1. Age ≥ 18 years; 2. Diagnosed as PJP by two or more clinical physicians based on clinical presentations, laboratory, and imaging examination results; 3. Complete clinical data; 4. Patients can be followed up from admission to discharge or death through the electronic medical record system. At the same time, patients meeting any of the following criteria were excluded: 1. Suspected PJP patients; 2. History of PJP diagnosis before; 3. Missing clinical data, including clinical features, laboratory results, and chest CT imaging results; 4. Pulmonary infections caused by other fungi or pathogens.

To validate the reliability of the model, we used non-PJP patients with other types of pneumonia (Non-PJP) as a control group, matched with PJP patients in a 1:1 ratio, with age (± 5 years) and gender as matching variables.

Data collection

Review the literature and guidelines related to Pneumocystis jirovecii pneumonia (PJP) [34], find out all possible diagnostic indicators, clinical features, and chest CT imaging characteristics in the diagnosis of PJP, and then collect this clinical information from both PJP patients and non-PJP patients. Categorize the clinical information into clinical manifestations, laboratory test results, chest CT images, diagnostic methods, treatment regimens, and prognosis. All information is entered into electronic medical record forms by one recorder and verified by another. Use the chi-square test to analyze the collected clinical information, select statistically significant indicators, and identify these selected indicators as risk factors for PJP. Further, use multivariate logistic regression analysis on these risk factors to identify more statistically significant indicators, which are then identified as independent risk factors for PJP.

Because this study aims to construct a diagnostic model for early PJP, we only collected data from patients diagnosed with PJP for the first time, with diagnosis confirmed by two clinical physicians. Imaging data was only collected from chest CT images meeting the following criteria: no prior use of any PJP-specific treatment drugs before diagnosis based on clinical symptoms and laboratory test results. The chest CT reports for the matched patient group were independently classified and reviewed by two radiologists.

CT image preprocessing

In order to eliminate the differences in CT images collected by different medical software and devices and facilitate model training, this study used image standardization and resizing methods to normalize CT images before model training.

All CT images used in the experiment are three channel grayscale images, where each pixel is composed of three values R, G, and B. In the process of image standardization, the average value is subtracted from the R, G, and B values of each channel in each CT image, and then divided by the standard deviation to obtain the standardized image output. Calculate the average and standard deviation based on the R, G, and B values of CT images in the training set. Afterwards, adjust the size of the standardized image to a fixed 256 × 256 pixel size.

Results

Statistical analysis

As shown in Table 1 below, chi-square tests revealed 16 clinical factors that were statistically significant, including blood tumors, organ transplantation, immune-related diseases, respiratory failure, chronic kidney disease, hypoalbuminemia, mixed infections, immunosuppressant use, long-term steroid use, surgical history, ground-glass opacity (GGO) on chest CT, mediastinal lymph node enlargement, pleural effusion, lymphocyte reduction, elevated C-reactive protein (CRP), and positive PCR (P < 0.05). To ensure the accuracy of the statistical results, laboratory test indicators with some missing data were excluded, including procalcitonin (PCT), LDH, and CD4+ T < 200 pcs/μL. Then, multivariable logistic regression analysis was conducted on the clinically significant factors mentioned above, revealing four independent risk factors for PJP, which included blood tumors, long-term steroid use, ground-glass opacity on chest CT, and a positive result in PCR testing. All deep learning methods, statistical analyses, and graphing were performed using IBM SPSS Statistic 25.

Table 1 Clinical characteristics and statistical results of PJP and Non-PJP other types of pneumonia

Results of PCR diagnosis

As shown in Fig. 5, among the 58 cases of PJP, 51 cases (87.93%) had a positive PCR result with a Ct value ≤ 36 and normal amplification curves, while 7 cases (12.07%) had a negative PCR result with a Ct value > 36 or no normal amplification curve. Among the 58 cases of Non-PJP, 19 cases (32.76%) were positive, and 39 cases (67.24%) were negative. In this study, the sensitivity of the standalone PCR test for PJP was 87.93%, with a specificity of 67.24% and an accuracy of 77.58%. There was a significant statistical difference in PCR test results between PJP and Non-PJP (P < 0.001).

Fig. 5
figure 5

PCR positivity rate in PJP and Non-PJP patients

Results of the CNN diagnosis

Dataset partitioning

In this study, the CT images of PJP and Non-PJP patients were divided into training, validation, and test sets in an 8:1:1 ratio. The specific division is shown in Fig. 6.

Fig. 6
figure 6

Schematic diagram of data partitioning

The dataset partitioning of this study is independent and random. As shown in Fig. 6, for 58 PJP patients, 46 patients (464 CT images) were used as the training set, 6 patients (58 CT images) were used as the validation set, and 6 patients (58 CT images) were used as the test set. The dataset partitioning rules for Non PJP patients and PJP patients are the same. For 58 Non PJP patients, 46 patients (464 CT images) were used as the training set, 6 patients (58 CT images) were used as the validation set, and 6 patients (58 CT images) were used as the test set.

Experimental design

A total of 600 rounds of training were conducted in this study, with the test model saved every 20 rounds, resulting in a total of 30 test models.

During the training process, a total of 600 training epochs were completed. In each epoch, the model received a batch of CT images of patients and corresponding clinical information, and produced model parameters.

The batch size of the model training is 32. The initial learning rate for model training is set to 0.001, which determines the degree of continuous updating and automatic adjustment of model parameters. Generate a set of model parameters every 20 training periods, which are saved and used for model validation on the validation set. Select the parameters with the best diagnostic performance as the final model parameters, and use the test set for the next step of testing. The results of the tests were averaged for each model tested.

Statistical analysis

These test models were then used to evaluate the test dataset. In this study, accuracy, recall, precision, specificity, sensitivity, \({F}_{1}\) score and Area Under Curve (AUC) were selected as evaluation metrics for the CNN. All deep learning methods, statistical analyses, and graphing were performed using the Pytorch toolkit and Python 3.7 (Python Software Foundation, www.python.org).

Training effect of network model

Taking the Densenet121 network model as an example. As shown in Figs. 7 and 8, during the 600 training periods of the model, the training accuracy and validation accuracy of the model gradually improved, while the training loss and validation loss gradually decreased. The accuracy and training loss curves eventually tend to stabilize. No obvious overfitting was observed.

Fig. 7
figure 7

Trends in model training loss and validation loss

Fig. 8
figure 8

Trends in model training accuracy and validation accuracy

Experimental results

First, input 16 risk factors that have undergone univariate analysis, including the patient's CT image features, as classification criteria into the network. The specific testing results of the top-performing 4 out of 30 test models are shown in Table 2. The ROC curves for these four test models are shown in Fig. 9.

Table 2 Testing results of models using 16 risk factors as diagnostic indicators
Fig. 9
figure 9

ROC curves for the models: Graphs (a) to (d) show the ROC curves of Resnet50, VGG19, Inception-V3, Densenet121, respectively

The four independent risk factors obtained through multivariate binary logistic regression analysis were used as classification criteria and input into the network to obtain four test models, with their corresponding AUC curves plotted. The high-performing test models obtained from the 4 independent risk factors are shown in Table 3, and their ROC curves are displayed in Fig. 10.

Table 3 Testing results of models using 4 independent risk factors as diagnostic indicators
Fig. 10
figure 10

ROC curves for the models: Graphs (a) to (d) show the ROC curves of Resnet50, VGG19, Inception-V3, Densenet121, respectively

Discussion

Based on the above research, the sensitivity of the standalone PCR test for PJP was 87.93%, with a specificity of 67.24% and an accuracy of 77.58%. However, the optimal early PJP diagnostic model Densenet121 based on convolutional neural networks had a sensitivity of 98%, a specificity of 80% and an accuracy of 88.90%. The diagnostic model constructed by combining clinical features and chest CT images significantly improves diagnostic accuracy compared to a standalone PCR test report. In clinical practice, inputting a patient's clinical information and lung CT images into the model proposed in this study can efficiently and rapidly predict the risk of PJP infection. Timely clinical intervention can be taken for high-risk patients to reduce complications and lower mortality.

Although the current gold standard for diagnosing PJP is to detect cysts or trophozoites in lower respiratory tract specimens through traditional staining or immunofluorescence staining methods, with the latter having higher sensitivity than traditional staining, in actual clinical practice, due to cost considerations, immunofluorescence staining is rarely used, and most cases employ traditional staining methods. Therefore, the observation of pathogens through traditional staining methods often serves as the basis for diagnosis, despite the low sensitivity and cumbersome operation of traditional staining methods, and whether pathogens can be found depends on the observer's experience. In addition, non-HIV patients are prone to false negatives due to low fungal loads. Among the 58 PJP patients in this experiment, two experienced senior laboratory physicians observed only 3 positive cases under a microscope using traditional staining methods, further confirming the low sensitivity of traditional staining methods. In recent years, bronchoalveolar lavage fluid combined with metagenomic next-generation sequencing (mNGS) has been used in clinical practice due to its rapidity and high detection rate. However, the high cost of mNGS and its inability to distinguish colonization from infection can lead to unnecessary overtreatment based solely on mNGS results. Moreover, interference from human DNA, microbial loads, and incomplete fungal databases are all shortcomings of mNGS. The PCR mentioned in this study is a commonly used PJP detection method in clinical practice. It not only has high sensitivity, capable of detecting very low levels of fungal loads undetectable by immunofluorescence staining, and can essentially rule out PJP based on negative predictive values, but it is also much cheaper than mNGS. However, PCR has relatively low specificity, so standalone PCR test results also have limitations in distinguishing colonization from infection.

Given the current state of PJP diagnosis, we will identify all possible diagnostic indicators, clinical features, and chest CT imaging features related to PJP based on the literature and guidelines. Then, from these clinical features, we will select independently significant risk factors with statistical significance. We will build an early diagnosis model for PJP by combining PCR test results with clinically significant features and lung CT imaging. Compared to standalone PCR testing, this model showed an increase in sensitivity, specificity, and accuracy by 10.07%, 12.76%, and 11.32% respectively. Additionally, colonization and infection of PJP also depend on the patient's own immune status. In our model, clinical features are closely related to this condition, demonstrating that our model provides a more comprehensive and reliable diagnosis compared to standalone PCR analysis.

However, it is important to note that this study has limitations, including limited sample size and the lack of validation through animal model experiments. Although the preliminary results are promising, the lack of data may affect the diagnostic performance of the model. We believe that incorporating more CT images and a larger volume of clinical information from patients can improve the model's diagnostic efficacy. Although this paper used statistical methods to identify patient risk factors as diagnostic indicators for the neural network model, clinical physicians need to comprehensively consider the patient's clinical presentation in the diagnostic process, rather than solely focusing on statistically significant risk factors. Therefore, in practical applications, this method can serve as an auxiliary diagnostic tool for clinicians in disease assessment.

As medical research continues to advance, treatment methods evolve, algorithmic capabilities improve, and medical diagnostic technologies enhance, the future holds promise for the integration of artificial intelligence techniques with clinical data to offer new diagnostic and treatment strategies for PJP patients in clinical practice.

Conclusion

This article proposes an early diagnosis method for PJP based on CNN. Compared with independent PCR methods, the accuracy of CNN-based PJP diagnostic methods is 11.32% higher than that of PCR detection. This study overcomes the limitations of traditional PJP diagnostic methods and develops a non-invasive, efficient, and accurate PJP diagnostic method. By using this method, PJP patients can receive early diagnosis and treatment, effectively improving prognosis.

Availability of data and materials

No datasets were generated or analysed during the current study.

Abbreviations

PJ:

Pneumocystis jirovecii

PJP:

Pneumocystis jirovecii Pneumonia

CT:

Computed tomography

mNGS:

Metagenomics next-generation sequencing

HIV:

Human immunodeficiency virus

AIDS:

Acquired immune deficiency syndrome

PCR:

Polymerase chain reaction

LDH:

Lactate dehydrogenase

BGD:

1,3-β-D-glucan

allo-HSCT:

Allogeneic hematopoietic stem cell transplant

CNN:

Convolutional neural networks

dd H2O:

Distilled water

GGO:

Ground-glass opacity

CRP:

C-reactive protein

PCT:

Procalcitonin

AUC:

Area Under Curve

References

  1. Fannin S, Gottlieb MS, Weisman JD, et al. A cluster of Kaposi’s sarcoma and pneumocystis carinii pneumonia among homosexual male residents of Los Angeles and range counties, California. MMWR Wkly. 1982;31(32):305–7.

    Google Scholar 

  2. Buchacz K, Lau B, Jing Y, et al. Incidence of AIDS-defining opportunistic infections in a multicohort analysis of HIV-infected persons in the United States and Canada, 2000–2010. J Infect Dis. 2016;214:862–72.

    Article  PubMed  PubMed Central  Google Scholar 

  3. Avino LJ, Naylor SM, Roecker AM. Pneumocystis Jirovecii pneumonia in the non-HIV infected population. Ann Pharmacother. 2016;50(8):673–9.

    Article  CAS  PubMed  Google Scholar 

  4. Catherinot E, Lanternier F, Bougnoux ME, Lecuit M, Couderc LJ, Lortholary O. Pneumocystis Jirovecii pneumonia. Infect Dis Clin North Am. 2010;24:107e38.

    Article  Google Scholar 

  5. Limper AH, Offord KP, Smith TF. Martin 2nd WJ. Pneumocystis carinii pneumonia. Differences in lung parasite number and inflammation in patients with and without AIDS. Am Rev Respir Dis. 1989;140:1204e9.

    Article  Google Scholar 

  6. Miller RF, Le Noury J, Corbett EL, et al. Pneumocystis carinii infection: current treatment and prevention. J Antimicrob Chemother. 1996;37(suppl B):33–53.

    Article  CAS  PubMed  Google Scholar 

  7. Asai N, Motojima S, Ohkuni Y, et al. Non-HIV Pneumocystis pneumonia: do conventional community-acquired pneumonia guidelines under estimate its severity. Multidiscip Respir Med. 2012;7(1):1.

    Article  Google Scholar 

  8. Cordonnier C, Cesaro S, Maschmeyer G, et al. Pneumocystis Jirovecii pneumonia: still a concern in patients with haematological malignancies and stem cell transplant recipients. J Antimicrob Chemother. 2016;71(9):2379–85. https://doi.org/10.1093/jac/dkw155.

    Article  CAS  PubMed  Google Scholar 

  9. Chen D, Wu LL, Li YX, et al. Comparing blind spots of unsedated ultrafine, sedated, and unsedated conventional gastroscopy with and without artificial intelligence: a prospective, single-blind, 3-parallel-group, randomized, single-center trial. Gastrointest Endosc. 2020;91(2):332–9.

    Article  PubMed  Google Scholar 

  10. Yoo YJ, Ha EJ, Cho YJ, et al. Computer-aided diagnosis of thyroid nodules via ultrasonography: initial clinical experience. Korean J Radiol. 2018;19(4):665–72.

    Article  PubMed  PubMed Central  Google Scholar 

  11. Kiani A, Uyumazturk B, Rajpurkar P, et al. Impact of a deep learning assistant on the histopathologic classification of liver cancer. NPJ Digit Med. 2020;3:23.

    Article  PubMed  PubMed Central  Google Scholar 

  12. Wang P, Liu XG, Berzin TM, et al. Effect of a deep-learning computer-aided detection system on adenoma detection during colonoscopy (CADe-DB trial): a double-blind randomised study. Lancet Gastroenterol Hepatol. 2020;5(4):343–51.

    Article  PubMed  Google Scholar 

  13. Lotter W, Diab AR, Haslam B, et al. Robust breast cancer detection in mammography and digital breast tomosynthesis using an annotation-efficient deep learning approach. Nat Med. 2021;27(2):244–9.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  14. Sim Y, Chung MJ, Kottere E, et al. Deep convolutional neural network-based software improves radiologist detection of malignant lung nodules on chest radiographs. Radiology. 2020;294(1):199–209.

    Article  PubMed  Google Scholar 

  15. Urushibara A, Saida T, Mori K, et al. Diagnosing uterine cervical cancer on a single T2-weighted image: comparison between deep learning versus radiologists. Eur J Radiol. 2021;135:109471.

    Article  PubMed  Google Scholar 

  16. Esteva A, Kuprel B, Novoa RA, et al. Corrigendum: dermatologist-level classification of skin cancer with deep neural networks. Nature. 2017;546(7660):686.

    Article  CAS  PubMed  Google Scholar 

  17. Lin H, Li R, Liu Z, et al. Diagnostic efficacy and therapeutic decision-making capacity of an artificial intelligence platform for childhood cataracts in eye clinics: a multicentre randomized controlled trial. E Clin Med. 2019;9:52–9.

    Google Scholar 

  18. Natarajan S, Jain A, Krishnan R, et al. Diagnostic accuracy of community-based diabetic retinopathy screening with an offline artificial intelligence system on a smartphone. JAMA Ophthalmol. 2019;137(10):1182–8.

    Article  PubMed  PubMed Central  Google Scholar 

  19. Wang W, Li M, Fan P, et al. Prototype early diagnostic model for invasive pulmonary aspergillosis based on deep learning and big data training. Mycoses. 2023;66(2):118–27. https://doi.org/10.1111/myc.13540.

    Article  CAS  PubMed  Google Scholar 

  20. Pan Y, Liu J, Cai Y, et al. Fundus image classification using inception V3 and ResNet-50 for the early diagnostics of fundus diseases. Front Physiol. 2023;14:1126780. https://doi.org/10.3389/fphys.2023.1126780. Published 2023 Feb 15.

    Article  PubMed  PubMed Central  Google Scholar 

  21. Khan MA, Rajinikanth V, Satapathy SC, et al. VGG19 Network assisted joint segmentation and classification of lung nodules in CT images. Diagnostics (Basel). 2021;11(12):2208. https://doi.org/10.3390/diagnostics11122208. Published 2021 Nov 26.

    Article  PubMed  Google Scholar 

  22. Mujahid M, Rustam F, Álvarez R, Luis Vidal Mazón J, Díez IT, Ashraf I. Pneumonia classification from X-ray images with Inception-V3 and convolutional neural network. Diagnostics (Basel). 2022;12(5):1280. https://doi.org/10.3390/diagnostics12051280. Published 2022 May 21.

    Article  CAS  PubMed  Google Scholar 

  23. Lian L, Zhu T, Zhu F, Zhu H. Deep learning for caries detection and classification. Diagnostics (Basel). 2021;11(9):1672. https://doi.org/10.3390/diagnostics11091672. Published 2021 Sep 13.

    Article  PubMed  Google Scholar 

  24. Kundu R, Das R, Geem ZW, Han GT, Sarkar R. Pneumonia detection in chest X-ray images using an ensemble of deep learning models. PLoS One. 2021;16(9):e0256630. https://doi.org/10.1371/journal.pone.0256630. Published 2021 Sep 7.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  25. Ji J, Zhang W, Dong Y, Lin R, Geng Y, Hong L. Automated cervical cell segmentation using deep ensemble learning. BMC Med Imaging. 2023;23(1):137. https://doi.org/10.1186/s12880-023-01096-1. Published 2023 Sep 21.

    Article  PubMed  PubMed Central  Google Scholar 

  26. Mueller D, Mertes S, Schroeter N, et al. Towards automated COVID-19 presence and severity classification. Stud Health Technol Inf. 2023;302:917–21. https://doi.org/10.3233/SHTI230309.

    Article  Google Scholar 

  27. Wang G, Yan P, Tang Q, Yang L, Chen J. Multiscale feature fusion for skin lesion classification. Biomed Res Int. 2023. https://doi.org/10.1155/2023/5146543. 2023:5146543. Published 2023 Jan 5.

    Article  PubMed  PubMed Central  Google Scholar 

  28. Liu X, Hu Y, Zhou G, et al. DS-MENet for the classification of citrus disease. Front Plant Sci. 2022;13:884464. https://doi.org/10.3389/fpls.2022.884464. Published 2022 Jul 22.

    Article  PubMed  PubMed Central  Google Scholar 

  29. Li G, Tian H, Wu H, et al. Artificial intelligence for non-mass breast lesions detection and classification on ultrasound images: a comparative study. BMC Med Inform Decis Mak. 2023;23(1):174. https://doi.org/10.1186/s12911-023-02277-2. Published 2023 Sep 4.

    Article  PubMed  PubMed Central  Google Scholar 

  30. Deng C, Li D, Feng M, Han D, Huang Q. The value of deep neural networks in the pathological classification of thyroid tumors. Diagn Pathol. 2023;18(1):95. https://doi.org/10.1186/s13000-023-01380-2. Published 2023 Aug 19.

    Article  PubMed  PubMed Central  Google Scholar 

  31. Shamsi A, Asgharnezhad H, Jokandan SS, et al. An uncertainty-aware transfer learning-based framework for COVID-19 diagnosis. IEEE Trans Neural Netw Learn Syst. 2021;32(4):1408–17. https://doi.org/10.1109/TNNLS.2021.3054306.

    Article  PubMed  Google Scholar 

  32. Wang X, Liu Y, Du Z, et al. Prediction of protein solubility based on sequence feature fusion and DDcCNN. Interdiscip Sci. 2021;13(4):703–16. https://doi.org/10.1007/s12539-021-00456-1.

    Article  CAS  PubMed  Google Scholar 

  33. Jia B, Zhang Y. Spectrum analysis for fully connected neural networks [published online ahead of print, 2022 Apr 18]. IEEE Trans Neural Netw Learn Syst. https://doi.org/10.1109/TNNLS.2022.3164875.

  34. Lagrou K, Chen S, Masur H, Viscoli C, Decker CF, Pagano L, Groll AH. Pneumocystis jirovecii disease: basis for the revised EORTC/MSGERC invasive fungal disease definitions in individuals without human immunodeficiency virus. Clin Infect Dis. 2021;72(Suppl 2):S114–20.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

Download references

Acknowledgements

We pay sincere thanks to all involved researchers and cited researchers.

Funding

Guangzhou Science and Technology Plan Project, China. Project number: 202206010093.

Author information

Authors and Affiliations

Authors

Contributions

The work presented here was carried out in collaboration between all authors. LHL: Software, Validation, Writing - Review & Editing and Data Curation. LYY: Conceptualization, Methodology, Visualization, Writing - Original Draft and Formal Analysis. LQW: Investigation, Project administration, Supervision and Funding acquisition. LJ: Project administration, Supervision and Resources.

Corresponding authors

Correspondence to Qingwen Lv or Jun Long.

Ethics declarations

Ethics approval and consent to participate

All authors confirm that all methods were carried out in accordance with relevant guidelines and regulations. All authors confirm that all experimental protocols were approved by the Medical Ethics Committee of the Zhujiang Hospital of Southern Medical University. All authors confirm that informed consent was obtained from all subjects and/or their legal guardian(s).

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Li, Y., Liu, H., Lv, Q. et al. Diagnosis model of early Pneumocystis jirovecii pneumonia based on convolutional neural network: a comparison with traditional PCR diagnostic method. BMC Pulm Med 24, 205 (2024). https://doi.org/10.1186/s12890-024-02987-x

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s12890-024-02987-x

Keywords