-
Academic Radiology Nov 2021To perform a meta-analysis to compare the diagnostic test accuracy (DTA) of deep learning (DL) in detecting coronavirus disease 2019 (COVID-19), and to investigate how... (Meta-Analysis)
Meta-Analysis
RATIONALE AND OBJECTIVE
To perform a meta-analysis to compare the diagnostic test accuracy (DTA) of deep learning (DL) in detecting coronavirus disease 2019 (COVID-19), and to investigate how network architecture and type of datasets affect DL performance.
MATERIALS AND METHODS
We searched PubMed, Web of Science and Inspec from January 1, 2020, to December 3, 2020, for retrospective and prospective studies on deep learning detection with at least reported sensitivity and specificity. Pooled DTA was obtained using random-effect models. Sub-group analysis between studies was also carried out for data source and network architectures.
RESULTS
The pooled sensitivity and specificity were 91% (95% confidence interval [CI]: 88%, 93%; I = 69%) and 92% (95% CI: 88%, 94%; I = 88%), respectively for 19 studies. The pooled AUC and diagnostic odds ratio (DOR) were 0.95 (95% CI: 0.88, 0.92) and 112.5 (95% CI: 57.7, 219.3; I = 90%) respectively. The overall accuracy, recall, F1-score, LR and LR are 89.5%, 89.5%, 89.7%, 23.13 and 0.13. Sub-group analysis shows that the sensitivity and DOR significantly vary with the type of network architectures and sources of data with low heterogeneity are (I = 0%) and (I = 18%) for ResNet architecture and single-source datasets, respectively.
CONCLUSION
The diagnosis of COVID-19 via deep learning has achieved incredible performance, and the source of datasets, as well as network architectures, strongly affect DL performance.
Topics: COVID-19; Deep Learning; Diagnostic Tests, Routine; Humans; Prospective Studies; Retrospective Studies; SARS-CoV-2
PubMed: 34649779
DOI: 10.1016/j.acra.2021.08.008 -
Age and Ageing Jan 2024Sarcopenia is an important prognostic factor, but its optimal screening methods remain challenging. Several new indices developed based on serum creatinine (Cr) and... (Meta-Analysis)
Meta-Analysis
BACKGROUND
Sarcopenia is an important prognostic factor, but its optimal screening methods remain challenging. Several new indices developed based on serum creatinine (Cr) and cystatin C (CysC) have been proposed to be diagnostic biomarkers for sarcopenia screening.
OBJECTIVE
This review aimed to evaluate the diagnostic accuracy of serum Cr- and CysC-based indices for sarcopenia diagnosis.
METHODS
We systematically searched MEDLINE, EMBASE, SCIE and SCOPUS from inception to 2 April 2023. Methodological quality was assessed using the Quality Assessment of Diagnostic Accuracy Studies-2 tool. A bivariate random-effects model was used to synthesise the pooled sensitivity, specificity and area under the curves of the summary receiver operating characteristic (SROC-AUC).
RESULTS
We retrieved 936 publications and included 16 studies with 5,566 participants (mean age ranged: 51.0-78.4 years, 50.2% men). The prevalence of sarcopenia ranged from 7.8 to 69.5%. All included studies presented a moderate to high risk of bias. The serum Cr- and CysC-based indices showed moderate diagnostic accuracy for sarcopenia (pooled sensitivity: 0.67, 95% CI 0.57-0.75; pooled specificity: 076, 95% CI 0.67-0.83; pooled SROC-AUC: 0.78, 95% CI 0.74-0.81). The Cr/CysC ratio is the most widely studied index, followed by the Cr × eGFRcys index. Overall, both indicators had satisfactory and comparable performance in screening sarcopenia.
CONCLUSION
Serum Cr- and CysC-based indices showed moderate diagnostic accuracy for sarcopenia. The most studied indices-the Cr/CysC ratio and Cr × eGFRcys index-had comparable diagnostic accuracy for evaluating sarcopenia and may serve as surrogate markers for sarcopenia. However, further validation is required to verify these findings.
Topics: Humans; Creatinine; Cystatin C; Diagnostic Tests, Routine; ROC Curve; Sarcopenia
PubMed: 38251741
DOI: 10.1093/ageing/afad252 -
Preventive Veterinary Medicine May 2018A systematic review was conducted to identify studies with data for statistical meta-analyses of sensitivity (Se) and specificity (Sp) of ante-mortem and post-mortem... (Review)
Review
A systematic review was conducted to identify studies with data for statistical meta-analyses of sensitivity (Se) and specificity (Sp) of ante-mortem and post-mortem diagnostic tests for bovine tuberculosis (bTB) in cattle. Members of a working group (WG) developed and tested search criteria and developed a standardised two-stage review process, to identify primary studies with numerator and denominator data for test performance and an agreed range of covariate data. No limits were applied to year, language, region or type of test in initial searches of electronic databases. In stage 1, titles and available abstracts were reviewed. References that complied with stage 1 selection criteria were reviewed in entirety and agreed data were extracted from references that complied with stage 2 selection criteria. At stage 1, 9782 references were reviewed and 261 (2.6%) passed through to stage 2 where 215 English language references were each randomly allocated to two of 18 WG reviewers and 46 references in other languages were allocated to native speakers. Agreement regarding eligibility between reviewers of the same reference at stage 2 was moderate (Kappa statistic = 0.51) and a resolution procedure was conducted. Only 119 references (published 1934-2009) were identified with eligible performance estimates for one or more of 14 different diagnostic test types; despite a comprehensive search strategy and the global impact of bTB. Searches of electronic databases for diagnostic test performance data were found to be nonspecific with regard to identifying references with diagnostic test Se or Sp data. Guidelines for the content of abstracts to research papers reporting diagnostic test performance are presented. The results of meta-analyses of the sensitivity and specificity of the tests, and of an evaluation of the methodological quality of the source references, are presented in accompanying papers (Nuñez-Garcia et al., 2017; Downs et al., 2017).
Topics: Animals; Autopsy; Cattle; Diagnostic Tests, Routine; Sensitivity and Specificity; Tuberculosis, Bovine
PubMed: 29395122
DOI: 10.1016/j.prevetmed.2017.11.004 -
Journal of Magnetic Resonance Imaging :... Jan 2021
Meta-Analysis
Editorial for "MRI vs. CT for the Detection of Liver Metastases in Patients With Pancreatic Carcinoma: A Comparative Diagnostic Test Accuracy Systematic Review and Meta-Analysis".
Topics: Diagnostic Tests, Routine; Humans; Liver Neoplasms; Magnetic Resonance Imaging; Pancreatic Neoplasms; Sensitivity and Specificity; Tomography, X-Ray Computed
PubMed: 32034836
DOI: 10.1002/jmri.27072 -
Orthopaedic Surgery Nov 2022The current diagnostic criteria for periprosthetic joint infection (PJI) are diverse and controversial, leading to delayed diagnosis. This study aimed to evaluate and... (Meta-Analysis)
Meta-Analysis Review
OBJECTIVE
The current diagnostic criteria for periprosthetic joint infection (PJI) are diverse and controversial, leading to delayed diagnosis. This study aimed to evaluate and unify their diagnostic accuracy and the threshold selection of serum and synovial routine tests for PJI at an early stage.
METHODS
We searched the MEDLINE and Embase databases for retrospective or prospective studies which reported preoperative-available assays (serum, synovial, or culture tests) for the diagnosis of chronic PJI among inflammatory arthritis (IA) or non-IA populations from January 1, 2000 to June 30, 2022. Threshold effective analysis was performed on synovial polymorphonuclear neutrophils (PMN%), synovial white blood cell (WBC), serum C-reactive protein (CRP), and erythrocyte sedimentation rate (ESR) to find the relevant cut-offs.
RESULTS
Two hundred and sixteen studies and information from 45,316 individuals were included in the final analysis. Synovial laboratory-based α-defensin and calprotectin had the best comprehensive sensitivity (0.91 [0.86-0.94], 0.95 [0.88-0.98]) and specificity (0.96 [0.94-0.97], 0.95 [0.89-0.98]) values. According to the threshold effect analysis, the recommended cut-offs are 70% (sensitivity 0.89 [0.85-0.92], specificity 0.90 [0.87-0.93]), 4100/μL (sensitivity 0.90 [0.87-0.93], specificity 0.97 [0.93-0.98]), 13.5 mg/L (sensitivity 0.84 [0.78-0.89], specificity 0.83 [0.73-0.89]), and 30 mm/h (sensitivity 0.79 [0.74-0.83], specificity 0.78 [0.72-0.83]) for synovial PMN%, synovial WBC, serum CRP, and ESR, respectively, and tests seem to be more reliable among non-IA patients.
CONCLUSIONS
The laboratory-based synovial α-defensin and synovial calprotectin are the two best independent preoperative diagnostic tests for PJI. A cut off of 70% for synovial PMN% and tighter cut-offs for synovial WBC and serum CRP could have a better diagnostic accuracy for non-IA patients with chronic PJI.
Topics: Humans; alpha-Defensins; Arthritis, Infectious; Arthroplasty, Replacement, Hip; C-Reactive Protein; Diagnostic Tests, Routine; Leukocyte L1 Antigen Complex; Prospective Studies; Prosthesis-Related Infections; Retrospective Studies; Synovial Fluid
PubMed: 36181336
DOI: 10.1111/os.13500 -
International Journal of Stroke :... Oct 2023Post-stroke cognitive impairment (PSCI) is one of the serious complications of stroke. The Montreal Cognitive Assessment (MoCA), as a brief cognitive impairment... (Meta-Analysis)
Meta-Analysis Review
Which cutoff value of the Montreal Cognitive Assessment should be used for post-stroke cognitive impairment? A systematic review and meta-analysis on diagnostic test accuracy.
BACKGROUND
Post-stroke cognitive impairment (PSCI) is one of the serious complications of stroke. The Montreal Cognitive Assessment (MoCA), as a brief cognitive impairment screening tool, is widely used in stroke survivors. However, some studies have suggested that the use of the universal cutoff value of 26 may be inappropriate for detecting cognitive impairments in stroke settings.
AIM
We conducted this study to identify the optimal cutoff value of the MoCA in screening for PSCI.
METHODS
PubMed, CINAHL, Embase, the Cochrane Library, and Web of Science were searched for eligible studies until March 23, 2023. All studies were screened by two independent researchers. The quality of each article was evaluated by the Quality Assessment of Diagnostic Accuracy Studies-2 tool. A bivariate mixed-effects model was used to pool sensitivity, specificity, positive likelihood ratio, negative likelihood ratio, diagnostic odds ratio, and the summary receiver operating characteristic curve.
RESULTS
Twenty-four studies with a total of 4231 patients were included in this review. Despite the lack of evidence of publication bias, a high degree of heterogeneity was observed. A meta-analysis revealed that a cutoff value of 21/22 yielded the best diagnostic accuracy. The optimal cutoff varied in different regions, stroke types, and stroke phases as well.
CONCLUSION
The optimal cutoff of MoCA was 21/22 for stroke populations rather than the initially recommended cutoff of 26. A revised (lower) cutoff should be considered for stroke survivors.
Topics: Humans; Stroke; Cognitive Dysfunction; Mental Status and Dementia Tests; Neurologic Examination; Diagnostic Tests, Routine; Sensitivity and Specificity
PubMed: 37190789
DOI: 10.1177/17474930231178660 -
American Journal of Obstetrics and... Dec 2022Endocervical sampling in women with suspected cervical neoplasia can be performed by either endocervical brush or endocervical curettage. This study aimed to estimate... (Meta-Analysis)
Meta-Analysis Review
OBJECTIVE
Endocervical sampling in women with suspected cervical neoplasia can be performed by either endocervical brush or endocervical curettage. This study aimed to estimate the diagnostic accuracy, discomfort, and number of inadequate samples with either test.
DATA SOURCES
Four bibliographic databases were searched on June 9, 2022, with no date or language restrictions.
STUDY ELIGIBILITY CRITERIA
We included all diagnostic studies and randomized clinical trials that compared the endocervical brush with endocervical curettage in women with an indication for colposcopy.
METHODS
The review protocol was registered on the International Prospective Register of Systematic Reviews (PROSPERO) (CRD42021222406). Two authors independently screened studies, extracted data, performed the risk-of-bias assessment (Quality Assessment of Diagnostic Accuracy Studies-2), and rated the certainty of the evidence using the Grading of Recommendations, Assessment, Development, and Evaluation (GRADE) methodology. A meta-analysis of diagnostic test accuracy was performed using a bivariate random-effects model.
RESULTS
We included 7 studies: 4 diagnostic cohort studies and 3 randomized clinical trials. The reference standard was conization or hysterectomy. Risk of bias and concern about applicability were high for some of the studies in patient selection and flow and timing. Overall pooled sensitivity was 81% (95% confidence interval, 48-95; 799 women; 7 studies; low quality of evidence) for endocervical brush and 70% (95% confidence interval, 42-89; 761 women; 7 studies; low quality of evidence) for endocervical curettage. Overall pooled specificity was 73% (95% confidence interval, 36-93; 799 women; 7 studies; low quality of evidence) for endocervical brush and 81% (95% confidence interval, 56-94; 761 women; 7 studies; low quality of evidence) for endocervical curettage. The risk ratio for inadequate samples with endocervical curettage compared with endocervical brush was 2.53 (95% confidence interval, 0.58-11.0; P=.215; low-certainty evidence). Two studies reported on patient discomfort; one found less discomfort in the endocervical brush group, and the other found no difference.
CONCLUSION
No difference was found between endocervical brush and endocervical curettage in diagnostic accuracy, inadequate sampling rate, and adverse effects based on low-quality of evidence. Variation in the characteristics of women and the resulting diagnostic pathways make the external validity limited.
Topics: Female; Humans; Pregnancy; Diagnostic Tests, Routine; Sensitivity and Specificity; Cervix Uteri; Uterine Cervical Neoplasms; Colposcopy
PubMed: 35934116
DOI: 10.1016/j.ajog.2022.07.036 -
Journal of Medical Internet Research Dec 2021Interpretation of capsule endoscopy images or movies is operator-dependent and time-consuming. As a result, computer-aided diagnosis (CAD) has been applied to enhance... (Meta-Analysis)
Meta-Analysis Review
BACKGROUND
Interpretation of capsule endoscopy images or movies is operator-dependent and time-consuming. As a result, computer-aided diagnosis (CAD) has been applied to enhance the efficacy and accuracy of the review process. Two previous meta-analyses reported the diagnostic performance of CAD models for gastrointestinal ulcers or hemorrhage in capsule endoscopy. However, insufficient systematic reviews have been conducted, which cannot determine the real diagnostic validity of CAD models.
OBJECTIVE
To evaluate the diagnostic test accuracy of CAD models for gastrointestinal ulcers or hemorrhage using wireless capsule endoscopic images.
METHODS
We conducted core databases searching for studies based on CAD models for the diagnosis of ulcers or hemorrhage using capsule endoscopy and presenting data on diagnostic performance. Systematic review and diagnostic test accuracy meta-analysis were performed.
RESULTS
Overall, 39 studies were included. The pooled area under the curve, sensitivity, specificity, and diagnostic odds ratio of CAD models for the diagnosis of ulcers (or erosions) were .97 (95% confidence interval, .95-.98), .93 (.89-.95), .92 (.89-.94), and 138 (79-243), respectively. The pooled area under the curve, sensitivity, specificity, and diagnostic odds ratio of CAD models for the diagnosis of hemorrhage (or angioectasia) were .99 (.98-.99), .96 (.94-0.97), .97 (.95-.99), and 888 (343-2303), respectively. Subgroup analyses showed robust results. Meta-regression showed that published year, number of training images, and target disease (ulcers vs erosions, hemorrhage vs angioectasia) was found to be the source of heterogeneity. No publication bias was detected.
CONCLUSIONS
CAD models showed high performance for the optical diagnosis of gastrointestinal ulcer and hemorrhage in wireless capsule endoscopy.
Topics: Capsule Endoscopy; Computers; Diagnostic Tests, Routine; Hemorrhage; Humans; Ulcer
PubMed: 34904949
DOI: 10.2196/33267 -
Tropical Medicine & International... Nov 2022To investigate the current status of the availability and affordability of specific essential medicines and diagnostics for diabetes in Africa. (Meta-Analysis)
Meta-Analysis Review
OBJECTIVE
To investigate the current status of the availability and affordability of specific essential medicines and diagnostics for diabetes in Africa.
METHODS
Systematic review and meta-analysis. Studies conducted in Africa that reported any information on the availability and affordability of short-acting, intermediate-acting, and premixed insulin, glibenclamide, metformin, blood glucose, glycated haemoglobin or HbA1c, and lipid profile tests were included. Random-effect model meta-analysis and descriptive statistics were performed to determine the pooled availability and affordability, respectively.
RESULTS
A total of 21 studies were included. The pooled availability of each drug was as follows: short-acting insulin 33.5% (95% CI: 17.8%-49.2%, I = 95.02%), intermediate-acting insulin 23.1% (95% CI: 6.3%-39.9%, I = 91.6%), premixed insulin 49.4% (95% CI: 24.9%-73.9%, I = 90.57%), glibenclamide 55.9% (95% CI: 43.8%-68.0%, I = 96.7%), and metformin 47.0% (95% CI: 34.6%-59.4%, I = 97.54%). Regarding diagnostic tests, for glucometers the pooled availability was 49.5% (95% CI: 37.9%-61.1%, I = 97.43%), for HbA1c 24.6% (95% CI: 3.1%-46.1%, I = 91.64), and for lipid profile tests 35.7% (95% CI: 19.4%-51.9%, I = 83.77%). The median (IQR) affordability in days' wages was 7 (4.7-7.5) for short-acting insulin, 4.4 (3.9-4.9) for intermediate-acting insulin, 7.1 (5.8-16.7) for premixed insulin, 0.7 (0.7-0.7) for glibenclamide, and 2.1 (1.8-2.8) for metformin.
CONCLUSION
The availability of the five essential medicines and three diagnostic tests for diabetes in Africa is suboptimal. The relatively high cost of insulin, HbA1c, and lipid profile tests is a significant barrier to optimal diabetes care. Pragmatic country-specific strategies are urgently needed to address these inequities in access and cost.
Topics: Humans; Diagnostic Tests, Routine; Glyburide; Glycated Hemoglobin; Health Services Accessibility; Drugs, Essential; Diabetes Mellitus; Costs and Cost Analysis; Insulin; Metformin; Insulin, Short-Acting; Lipids
PubMed: 36121433
DOI: 10.1111/tmi.13819 -
BMJ Open Feb 2018Health systems are currently subject to unprecedented financial strains. Inappropriate test use wastes finite health resources (overuse) and delays diagnoses and... (Meta-Analysis)
Meta-Analysis Review
BACKGROUND
Health systems are currently subject to unprecedented financial strains. Inappropriate test use wastes finite health resources (overuse) and delays diagnoses and treatment (underuse). As most patient care is provided in primary care, it represents an ideal setting to mitigate waste.
OBJECTIVE
To identify overuse and underuse of diagnostic tests in primary care.
DESIGN
Systematic review and meta-analysis.
DATA SOURCES AND ELIGIBILITY CRITERIA
We searched MEDLINE and Embase from January 1999 to October 2017 for studies that measured the inappropriateness of any diagnostic test (measured against a national or international guideline) ordered for adult patients in primary care.
RESULTS
We included 357 171 patients from 63 studies in 15 countries. We extracted 103 measures of inappropriateness (41 underuse and 62 overuse) from included studies for 47 different diagnostic tests.The overall rate of inappropriate diagnostic test ordering varied substantially (0.2%-100%)%).17 tests were underused >50% of the time. Of these, echocardiography (n=4 measures) was consistently underused (between 54% and 89%, n=4). There was large variation in the rate of inappropriate underuse of pulmonary function tests (38%-78%, n=8).Eleven tests were inappropriately overused >50% of the time. Echocardiography was consistently overused (77%-92%), whereas inappropriate overuse of urinary cultures, upper endoscopy and colonoscopy varied widely, from 36% to 77% (n=3), 10%-54% (n=10) and 8%-52% (n=2), respectively.
CONCLUSIONS
There is marked variation in the appropriate use of diagnostic tests in primary care. Specifically, the use of echocardiography (both underuse and overuse) is consistently poor. There is substantial variation in the rate of inappropriate underuse of pulmonary function tests and the overuse of upper endoscopy, urinary cultures and colonoscopy.
PROSPERO REGISTRATION NUMBER
CRD42016048832.
Topics: Adult; Diagnostic Tests, Routine; Echocardiography; Endoscopy, Digestive System; Humans; Primary Health Care; Respiratory Function Tests; Unnecessary Procedures
PubMed: 29440142
DOI: 10.1136/bmjopen-2017-018557