machine learning - OpenMD.com Journal Search

Machine learning for administrative health records: A systematic review of techniques and applications.

Artificial Intelligence in Medicine Oct 2023

Machine learning provides many powerful and effective techniques for analysing heterogeneous electronic health records (EHR). Administrative Health Records (AHR) are a... (Review)

Summary PubMed

Review

Authors: Adrian Caruana, Madhushi Bandara, Katarzyna Musial...

Machine learning provides many powerful and effective techniques for analysing heterogeneous electronic health records (EHR). Administrative Health Records (AHR) are a subset of EHR collected for administrative purposes, and the use of machine learning on AHRs is a growing subfield of EHR analytics. Existing reviews of EHR analytics emphasise that the data-modality of the EHR limits the breadth of suitable machine learning techniques, and pursuable healthcare applications. Despite emphasising the importance of data modality, the literature fails to analyse which techniques and applications are relevant to AHRs. AHRs contain uniquely well-structured, categorically encoded records which are distinct from other data-modalities captured by EHRs, and they can provide valuable information pertaining to how patients interact with the healthcare system. This paper systematically reviews AHR-based research, analysing 70 relevant studies and spanning multiple databases. We identify and analyse which machine learning techniques are applied to AHRs and which health informatics applications are pursued in AHR-based research. We also analyse how these techniques are applied in pursuit of each application, and identify the limitations of these approaches. We find that while AHR-based studies are disconnected from each other, the use of AHRs in health informatics research is substantial and accelerating. Our synthesis of these studies highlights the utility of AHRs for pursuing increasingly complex and diverse research objectives despite a number of pervading data- and technique-based limitations. Finally, through our findings, we propose a set of future research directions that can enhance the utility of AHR data and machine learning techniques for health informatics research.

Topics: Humans; Machine Learning; Electronic Health Records; Medical Informatics; Databases, Factual; Delivery of Health Care

PubMed: 37783537
DOI: 10.1016/j.artmed.2023.102642

Machine learning approaches for frailty detection, prediction and classification in elderly people: A systematic review.

International Journal of Medical... Oct 2023

Frailty in older people is a syndrome related to aging that is becoming increasingly common and problematic as the average age of the world population increases.... (Review)

Summary PubMed

Review

Authors: Matteo Leghissa, Álvaro Carrera, Carlos A Iglesias...

BACKGROUND

Frailty in older people is a syndrome related to aging that is becoming increasingly common and problematic as the average age of the world population increases. Detecting frailty in its early stages or, even better, predicting its appearance can greatly benefit health in later years of life and save the healthcare system from high costs. Machine Learning models fit the need to develop a tool for supporting medical decision-making in detecting or predicting frailty.

METHODS

In this review, we followed the PRISMA methodology to conduct a systematic search of the most relevant Machine Learning models that have been developed so far in the context of frailty. We selected 41 publications and compared them according to their purpose, the type of dataset used, the target variables, and the results they obtained, highlighting their shortcomings and strengths.

RESULTS

The variety of frailty definitions allows many problems to fall into this field, and it is often challenging to compare results due to the differences in target variables. The data types can be divided into gait data, usually collected with sensors, and medical records, often in the context of aging studies. The most common algorithms are well-known models available from every Machine Learning library. Only one study developed a new framework for frailty classification, and only two considered Explainability.

CONCLUSIONS

This review highlights some gaps in the field of Machine Learning applied to the assessment and prediction of frailty, such as the need for a universal quantitative definition. It emphasizes the need for close collaboration between medical professionals and data scientists to unlock the potential of data collected in hospital and clinical settings. As a suggestion for future work, the area of Explainability, which is crucial for models in medicine and health care, was considered in very few studies.

Topics: Humans; Aged; Frailty; Machine Learning; Algorithms

PubMed: 37586309
DOI: 10.1016/j.ijmedinf.2023.105172

Machine Learning in the Diagnosis and Prognostic Prediction of Dental Caries: A Systematic Review.

Caries Research 2022

We performed a systematic review to evaluate the success of machine learning algorithms in the diagnosis and prognostic prediction of dental caries. The review protocol...

Summary PubMed

Authors: Lilian Toledo Reyes, Jessica Klöckner Knorst, Fernanda Ruffo Ortiz...

We performed a systematic review to evaluate the success of machine learning algorithms in the diagnosis and prognostic prediction of dental caries. The review protocol was a priori registered in the PROSPERO, CRD42020183447. The search involved electronic bibliographic databases: PubMed/Medline, Scopus, EMBASE, Web of Science, and grey literature until December 2020. We excluded review articles, case series, case reports, editorials, letters, comments, educational methodologies, assessments of robotic devices, and articles with less than 10 participants or specimens. Two independent reviewers selected the studies and performed the assessment of the methodological quality based on standardized scales. We summarize data on the machine learning algorithms used; software; performance outcomes such as accuracy/precision, sensitivity/recall, specificity, area under the receiver operating characteristic curve (AUC), and positive/negative predictive values related to dental caries. Meta-analyses were not performed due to methodological differences. Our review included 15 studies (10 diagnostic studies and 5 prognostic prediction studies). Cross-sectional design studies were predominant (12). The most frequently used statistical measure of performance reported in diagnostic studies was AUC value, which ranged from 0.745 to 0.987. For most diagnostic studies, data from contingency tables were not available. Reported sensitivities were higher in low risk of bias prognostic prediction studies (median [IQR] of 0.996 [0.971-1.000] vs. unclear/high risk of bias studies 0.189 [0-0.340]; p value 0.025). While there were no significant differences in the specificity between these subgroups, we concluded that the use of these technologies for the diagnosis and prognostic prediction of dental caries, although promising, is at an early stage. The general applicability of the evidence was limited given that most models were developed outside the real clinical setting with a prevalence of unclear/high risk of bias. Researchers must increase the overall quality of their research protocols by providing a comprehensive report on the methods implemented.

Topics: Humans; Prognosis; Dental Caries; Cross-Sectional Studies; Machine Learning; Algorithms

PubMed: 35636386
DOI: 10.1159/000524167

Machine learning for understanding and predicting neurodevelopmental outcomes in premature infants: a systematic review.

Pediatric Research Jan 2023

Machine learning has been attracting increasing attention for use in healthcare applications, including neonatal medicine. One application for this tool is in... (Review)

Summary PubMed Full Text PDF

Review

Authors: Stephanie Baker, Yogavijayan Kandasamy

BACKGROUND

Machine learning has been attracting increasing attention for use in healthcare applications, including neonatal medicine. One application for this tool is in understanding and predicting neurodevelopmental outcomes in preterm infants. In this study, we have carried out a systematic review to identify findings and challenges to date.

METHODS

This systematic review was conducted in accordance with the Preferred Reporting Items for Systematic Reviews and Meta-Analysis guidelines. Four databases were searched in February 2022, with articles then screened in a non-blinded manner by two authors.

RESULTS

The literature search returned 278 studies, with 11 meeting the eligibility criteria for inclusion. Convolutional neural networks were the most common machine learning approach, with most studies seeking to predict neurodevelopmental outcomes from images and connectomes describing brain structure and function. Studies to date also sought to identify features predictive of outcomes; however, results varied greatly.

CONCLUSIONS

Initial studies in this field have achieved promising results; however, many machine learning techniques remain to be explored, and the consensus is yet to be reached on which clinical and brain features are most predictive of neurodevelopmental outcomes.

IMPACT

This systematic review looks at the question of whether machine learning can be used to predict and understand neurodevelopmental outcomes in preterm infants. Our review finds that promising initial works have been conducted in this field, but many challenges and opportunities remain. Quality assessment of relevant articles is conducted using the Newcastle-Ottawa Scale. This work identifies challenges that remain and suggests several key directions for future research. To the best of the authors' knowledge, this is the first systematic review to explore this topic.

Topics: Infant; Infant, Newborn; Humans; Infant, Premature; Machine Learning

PubMed: 35641551
DOI: 10.1038/s41390-022-02120-w

A Systematic Review of Wi-Fi and Machine Learning Integration with Topic Modeling Techniques.

Sensors (Basel, Switzerland) Jun 2022

Wireless networks have drastically influenced our lifestyle, changing our workplaces and society. Among the variety of wireless technology, Wi-Fi surely plays a leading... (Review)

Summary PubMed Full Text PDF

Review

Authors: Daniele Atzeni, Davide Bacciu, Daniele Mazzei...

Wireless networks have drastically influenced our lifestyle, changing our workplaces and society. Among the variety of wireless technology, Wi-Fi surely plays a leading role, especially in local area networks. The spread of mobiles and tablets, and more recently, the advent of Internet of Things, have resulted in a multitude of Wi-Fi-enabled devices continuously sending data to the Internet and between each other. At the same time, Machine Learning has proven to be one of the most effective and versatile tools for the analysis of fast streaming data. This systematic review aims at studying the interaction between these technologies and how it has developed throughout their lifetimes. We used Scopus, Web of Science, and IEEE Xplore databases to retrieve paper abstracts and leveraged a topic modeling technique, namely, BERTopic, to analyze the resulting document corpus. After these steps, we inspected the obtained clusters and computed statistics to characterize and interpret the topics they refer to. Our results include both the applications of Wi-Fi sensing and the variety of Machine Learning algorithms used to tackle them. We also report how the Wi-Fi advances have affected sensing applications and the choice of the most suitable Machine Learning models.

Topics: Algorithms; Databases, Factual; Local Area Networks; Machine Learning; Wireless Technology

PubMed: 35808430
DOI: 10.3390/s22134925

Natural Language Processing in Electronic Health Records in relation to healthcare decision-making: A systematic review.

Computers in Biology and Medicine Mar 2023

Natural Language Processing (NLP) is widely used to extract clinical insights from Electronic Health Records (EHRs). However, the lack of annotated data, automated... (Review)

Summary PubMed

Review

Authors: Elias Hossain, Rajib Rana, Niall Higgins...

BACKGROUND

Natural Language Processing (NLP) is widely used to extract clinical insights from Electronic Health Records (EHRs). However, the lack of annotated data, automated tools, and other challenges hinder the full utilisation of NLP for EHRs. Various Machine Learning (ML), Deep Learning (DL) and NLP techniques are studied and compared to understand the limitations and opportunities in this space comprehensively.

METHODOLOGY

After screening 261 articles from 11 databases, we included 127 papers for full-text review covering seven categories of articles: (1) medical note classification, (2) clinical entity recognition, (3) text summarisation, (4) deep learning (DL) and transfer learning architecture, (5) information extraction, (6) Medical language translation and (7) other NLP applications. This study follows the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines.

RESULT AND DISCUSSION

EHR was the most commonly used data type among the selected articles, and the datasets were primarily unstructured. Various ML and DL methods were used, with prediction or classification being the most common application of ML or DL. The most common use cases were: the International Classification of Diseases, Ninth Revision (ICD-9) classification, clinical note analysis, and named entity recognition (NER) for clinical descriptions and research on psychiatric disorders.

CONCLUSION

We find that the adopted ML models were not adequately assessed. In addition, the data imbalance problem is quite important, yet we must find techniques to address this underlining problem. Future studies should address key limitations in studies, primarily identifying Lupus Nephritis, Suicide Attempts, perinatal self-harmed and ICD-9 classification.

Topics: Humans; Natural Language Processing; Electronic Health Records; Machine Learning; Information Storage and Retrieval; Delivery of Health Care

PubMed: 36805219
DOI: 10.1016/j.compbiomed.2023.106649

Application of machine learning and artificial intelligence in the diagnosis and classification of polycystic ovarian syndrome: a systematic review.

Frontiers in Endocrinology 2023

Polycystic Ovarian Syndrome (PCOS) is the most common endocrinopathy in women of reproductive age and remains widely underdiagnosed leading to significant morbidity....

Summary PubMed Full Text PDF

Authors: Francisco J Barrera, Ethan D L Brown, Amanda Rojo...

INTRODUCTION

Polycystic Ovarian Syndrome (PCOS) is the most common endocrinopathy in women of reproductive age and remains widely underdiagnosed leading to significant morbidity. Artificial intelligence (AI) and machine learning (ML) hold promise in improving diagnostics. Thus, we performed a systematic review of literature to identify the utility of AI/ML in the diagnosis or classification of PCOS.

METHODS

We applied a search strategy using the following databases MEDLINE, Embase, the Cochrane Central Register of Controlled Trials, the Web of Science, and the IEEE Xplore Digital Library using relevant keywords. Eligible studies were identified, and results were extracted for their synthesis from inception until January 1, 2022.

RESULTS

135 studies were screened and ultimately, 31 studies were included in this study. Data sources used by the AI/ML interventions included clinical data, electronic health records, and genetic and proteomic data. Ten studies (32%) employed standardized criteria (NIH, Rotterdam, or Revised International PCOS classification), while 17 (55%) used clinical information with/without imaging. The most common AI techniques employed were support vector machine (42% studies), K-nearest neighbor (26%), and regression models (23%) were the commonest AI/ML. Receiver operating curves (ROC) were employed to compare AI/ML with clinical diagnosis. Area under the ROC ranged from 73% to 100% (n=7 studies), diagnostic accuracy from 89% to 100% (n=4 studies), sensitivity from 41% to 100% (n=10 studies), specificity from 75% to 100% (n=10 studies), positive predictive value (PPV) from 68% to 95% (n=4 studies), and negative predictive value (NPV) from 94% to 99% (n=2 studies).

CONCLUSION

Artificial intelligence and machine learning provide a high diagnostic and classification performance in detecting PCOS, thereby providing an avenue for early diagnosis of this disorder. However, AI-based studies should use standardized PCOS diagnostic criteria to enhance the clinical applicability of AI/ML in PCOS and improve adherence to methodological and reporting guidelines for maximum diagnostic utility.

SYSTEMATIC REVIEW REGISTRATION

https://www.crd.york.ac.uk/prospero/, identifier CRD42022295287.

Topics: Female; Humans; Artificial Intelligence; Polycystic Ovary Syndrome; Proteomics; Machine Learning; Cluster Analysis

PubMed: 37790605
DOI: 10.3389/fendo.2023.1106625

Machine learning models to detect and predict patient safety events using electronic health records: A systematic review.

International Journal of Medical... Dec 2023

Identifying patient safety events using electronic health records (EHRs) and automated machine learning-based detection methods can help improve the efficiency and... (Review)

Summary PubMed

Review

Authors: Ghasem Deimazar, Abbas Sheikhtaheri

INTRODUCTION

Identifying patient safety events using electronic health records (EHRs) and automated machine learning-based detection methods can help improve the efficiency and quality of healthcare service provision.

OBJECTIVE

This study aimed to systematically review machine learning-based methods and techniques, as well as their results for patient safety event management using EHRs.

METHODS

We reviewed the studies that focused on machine learning techniques, including automatic prediction and detection of patient safety events and medical errors through EHR analysis to manage patient safety events. The data were collected by searching Scopus, PubMed (Medline), Web of Science, EMBASE, and IEEE Xplore databases.

RESULTS

After screening, 41 papers were reviewed. Support vector machine (SVM), random forest, conditional random field (CRF), and bidirectional long short-term memory with conditional random field (BiLSTM-CRF) algorithms were mostly applied to predict, identify, and classify patient safety events using EHRs; however, they had different performances. BiLSTM-CRF was employed in most of the studies to extract and identify concepts, e.g., adverse drug events (ADEs) and adverse drug reactions (ADRs), as well as relationships between drug and severity, drug and ADEs, drug and ADRs. Recurrent neural networks (RNN) and BiLSTM-CRF had the best results in detecting ADEs compared to other patient safety events. Linear classifiers and Naive Bayes (NB) had the highest performance for ADR detection. Logistic regression had the best results in detecting surgical site infections. According to the findings, the quality of articles has non-significantly improved in recent years, but they had low average scores.

CONCLUSIONS

Machine learning can be useful in automatic detection and prediction of patient safety events. However, most of these algorithms have not yet been externally validated or prospectively tested. Therefore, further studies are required to improve the performance of these automated systems.

Topics: Humans; Electronic Health Records; Bayes Theorem; Patient Safety; Machine Learning; Algorithms

PubMed: 37837710
DOI: 10.1016/j.ijmedinf.2023.105246

Machine Learning for Diagnosis of Systemic Lupus Erythematosus: A Systematic Review and Meta-Analysis.

Computational Intelligence and... 2022

Application of machine learning (ML) for identification of systemic lupus erythematosus (SLE) has been recently drawing increasing attention, while there is still lack... (Meta-Analysis)

Summary PubMed Full Text PDF

Meta-Analysis

Authors: Yuan Zhou, Meng Wang, Shasha Zhao...

BACKGROUND

Application of machine learning (ML) for identification of systemic lupus erythematosus (SLE) has been recently drawing increasing attention, while there is still lack of evidence-based support.

METHODS

Systematic review and meta-analysis are conducted to evaluate its diagnostic accuracy and application prospect. PubMed, Embase, Cochrane Library, and Web of Science libraries are searched, in combination with manual searching and literature retrospection, for studies regarding machine learning for identifying SLE and neuropsychiatric systemic lupus erythematosus (NPSLE). Quality Assessment of Diagnostic Accuracy Studies (QUADA-2) is applied to assess the quality of included studies. Diagnostic accuracy of the SLE model and NPSLE model is assessed using the bivariate fixed-effect model, and the data are pooled. Summary receiver operator characteristic curve (SROC) is plotted, and area under the curve (AUC) is calculated.

RESULTS

Eighteen (18) studies are included, in which ten (10) focused on SLE and eight (8) on NPSLE. The AUC of SLE identification is 0.95, the sensitivity is 0.90, the specificity is 0.89, the PLR is 8.4, the NLR is 0.12, and the DOR is 73. AUC of NPSLE identification is 0.89, the sensitivity is 0.83, the specificity is 0.83, the PLR is 5.0, the NLR is 0.20, and the DOR is 25.

CONCLUSION

Machine learning presented remarkable performance in identification of SLE and NPSLE. Based on the convenience for inclusion factor collection and non-invasiveness of detection, machine learning is expected to be widely applied in clinical practice to assist medical decision making.

Topics: Humans; Lupus Erythematosus, Systemic; Machine Learning; Area Under Curve

PubMed: 36458233
DOI: 10.1155/2022/7167066

Radiomic and Genomic Machine Learning Method Performance for Prostate Cancer Diagnosis: Systematic Literature Review.

Journal of Medical Internet Research Apr 2021

Machine learning algorithms have been drawing attention at the joining of pathology and radiology in prostate cancer research. However, due to their algorithmic learning... (Meta-Analysis)

Summary PubMed Full Text PDF

Meta-Analysis Review

Authors: Rossana Castaldo, Carlo Cavaliere, Andrea Soricelli...

BACKGROUND

Machine learning algorithms have been drawing attention at the joining of pathology and radiology in prostate cancer research. However, due to their algorithmic learning complexity and the variability of their architecture, there is an ongoing need to analyze their performance.

OBJECTIVE

This study assesses the source of heterogeneity and the performance of machine learning applied to radiomic, genomic, and clinical biomarkers for the diagnosis of prostate cancer. One research focus of this study was on clearly identifying problems and issues related to the implementation of machine learning in clinical studies.

METHODS

Following the PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) protocol, 816 titles were identified from the PubMed, Scopus, and OvidSP databases. Studies that used machine learning to detect prostate cancer and provided performance measures were included in our analysis. The quality of the eligible studies was assessed using the QUADAS-2 (quality assessment of diagnostic accuracy studies-version 2) tool. The hierarchical multivariate model was applied to the pooled data in a meta-analysis. To investigate the heterogeneity among studies, I statistics were performed along with visual evaluation of coupled forest plots. Due to the internal heterogeneity among machine learning algorithms, subgroup analysis was carried out to investigate the diagnostic capability of machine learning systems in clinical practice.

RESULTS

In the final analysis, 37 studies were included, of which 29 entered the meta-analysis pooling. The analysis of machine learning methods to detect prostate cancer reveals the limited usage of the methods and the lack of standards that hinder the implementation of machine learning in clinical applications.

CONCLUSIONS

The performance of machine learning for diagnosis of prostate cancer was considered satisfactory for several studies investigating the multiparametric magnetic resonance imaging and urine biomarkers; however, given the limitations indicated in our study, further studies are warranted to extend the potential use of machine learning to clinical settings. Recommendations on the use of machine learning techniques were also provided to help researchers to design robust studies to facilitate evidence generation from the use of radiomic and genomic biomarkers.

Topics: Algorithms; Genomics; Humans; Machine Learning; Male; Prostatic Neoplasms

PubMed: 33792552
DOI: 10.2196/22394