Mathematical Biosciences and... Aug 2023
Social media contains useful information about people and society that could help advance research in many different areas of health (e.g. by applying opinion mining, emotion/sentiment analysis and statistical analysis), such as mental health, health surveillance, socio-economic inequality and gender vulnerability. User demographics provide rich information that could help study these subjects further. However, user demographics such as gender are considered private and are not freely available. In this study, we propose a model based on transformers to predict a user's gender from their images and tweets. The image-based classification model is trained in two different ways: using the profile image of the user, and using the various images posted by the user on Twitter. For the first method, a Twitter gender recognition dataset publicly available on Kaggle is used; for the second, the PAN-18 dataset. Several transformer models, i.e. vision transformers (ViT), LeViT and Swin Transformer, are fine-tuned on both image datasets and then compared. Next, different transformer models, namely bidirectional encoder representations from transformers (BERT), RoBERTa and ELECTRA, are fine-tuned to recognize a user's gender from their tweets. This is highly beneficial because not all users provide an image that indicates their gender; the gender of such users can be detected from their tweets. The significance of the image and text classification models was evaluated using the Mann-Whitney U test. Finally, the combination model improved the accuracy of the image and text classification models by 11.73% and 5.26% for the Kaggle dataset and by 8.55% and 9.8% for the PAN-18 dataset, respectively. This shows that the image and text classification models are capable of complementing each other by providing additional information to one another.
Our overall multimodal method has an accuracy of 88.11% for the Kaggle dataset and 89.24% for the PAN-18 dataset, and outperforms state-of-the-art models. Our work benefits research that critically requires user demographic information such as gender to further analyze and study social media content for health-related issues.
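The abstract reports using the Mann-Whitney U test to compare classifier results but gives no implementation details. As a minimal illustration of that statistic (the function and all numbers below are hypothetical, not taken from the paper):

```python
def mann_whitney_u(a, b):
    """U statistic for sample a versus sample b: over every cross-sample
    pair, count 1 when the value from a is larger and 0.5 for a tie."""
    return sum(1.0 if x > y else 0.5 if x == y else 0.0 for x in a for y in b)

# Hypothetical per-fold accuracies of two classifiers (illustrative only).
image_model = [0.86, 0.88, 0.87, 0.89, 0.85]
text_model = [0.82, 0.84, 0.83, 0.81, 0.85]
u = mann_whitney_u(image_model, text_model)
# U ranges from 0 to len(a) * len(b); values near either extreme suggest the
# two samples differ (a U table or normal approximation yields the p-value).
```

This brute-force pairwise count is O(n·m) but exact, including ties; library implementations use rank sums for efficiency.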
Topics: Humans; Social Media; Electric Power Supplies; Research Design
PubMed: 37919997
DOI: 10.3934/mbe.2023711
Bioinformatics Advances 2023
Review
SUMMARY
The transformer-based language models, including vanilla transformer, BERT and GPT-3, have achieved revolutionary breakthroughs in the field of natural language processing (NLP). Since there are inherent similarities between various biological sequences and natural languages, the remarkable interpretability and adaptability of these models have prompted a new wave of their application in bioinformatics research. To provide a timely and comprehensive review, we introduce key developments of transformer-based language models by describing the detailed structure of transformers and summarize their contribution to a wide range of bioinformatics research from basic sequence analysis to drug discovery. While transformer-based applications in bioinformatics are diverse and multifaceted, we identify and discuss the common challenges, including heterogeneity of training data, computational expense and model interpretability, and opportunities in the context of bioinformatics research. We hope that the broader community of NLP researchers, bioinformaticians and biologists will be brought together to foster future research and development in transformer-based language models, and inspire novel bioinformatics applications that are unattainable by traditional methods.
SUPPLEMENTARY INFORMATION
Supplementary data are available at Bioinformatics Advances online.
PubMed: 36845200
DOI: 10.1093/bioadv/vbad001
Frontiers in Digital Health 2023
As large language models (LLMs) expand and become more advanced, so do the natural language processing capabilities of conversational AI, or "chatbots". OpenAI's recent release, ChatGPT, uses a transformer-based model to enable human-like text generation and question-answering on general domain knowledge, while a healthcare-specific LLM such as GatorTron focuses on real-world healthcare domain knowledge. As LLMs advance to achieve near human-level performance on medical question-answering benchmarks, it is probable that conversational AI will soon be developed for use in healthcare. In this article we discuss the potential of, and compare the performance of, two different generative pretrained transformer (GPT) approaches: ChatGPT, the most widely used general conversational LLM, and Foresight, a GPT-based model focused on modelling patients and disorders. The comparison is conducted on the task of forecasting relevant diagnoses based on clinical vignettes. We also discuss important considerations and limitations of transformer-based chatbots for clinical use.
PubMed: 37122812
DOI: 10.3389/fdgth.2023.1161098
Sensors (Basel, Switzerland) Mar 2022
Wildfires are a worldwide natural disaster causing significant economic damage and loss of life. Experts predict that wildfires will increase in the coming years, mainly due to climate change. Early detection and prediction of fire spread can help reduce affected areas and improve firefighting. Numerous systems have been developed to detect fire. Recently, Unmanned Aerial Vehicles have been employed to tackle this problem due to their high flexibility, low cost, and ability to cover wide areas during the day or night. However, they are still limited by challenging problems such as small fire size, background complexity, and image degradation. To deal with the aforementioned limitations, we adapted and optimized Deep Learning methods to detect wildfire at an early stage. A novel deep ensemble learning method, which combines EfficientNet-B5 and DenseNet-201 models, is proposed to identify and classify wildfire using aerial images. In addition, two vision transformers (TransUNet and TransFire) and a deep convolutional model (EfficientSeg) were employed to segment wildfire regions and determine the precise fire areas. The obtained results are promising and show the efficiency of using Deep Learning and vision transformers for wildfire classification and segmentation. The proposed model for wildfire classification obtained an accuracy of 85.12% and outperformed many state-of-the-art works, proving its ability to classify wildfire even in small fire areas. The best semantic segmentation models achieved an F1-score of 99.9% for the TransUNet architecture and 99.82% for the TransFire architecture, superior to recently published models. More specifically, we demonstrated the ability of these models to extract the finer details of wildfire from aerial images. They can further overcome current model limitations, such as background complexity and small wildfire areas.
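The abstract describes an ensemble of EfficientNet-B5 and DenseNet-201 but does not spell out the combination rule; one common choice is a weighted average of the two networks' class probabilities. A minimal sketch under that assumption (function name and all numbers are illustrative, not from the paper):

```python
def ensemble_predict(probs_a, probs_b, weight_a=0.5):
    """Weighted average of two models' class-probability vectors,
    then argmax over the averaged distribution."""
    avg = [weight_a * pa + (1.0 - weight_a) * pb
           for pa, pb in zip(probs_a, probs_b)]
    return max(range(len(avg)), key=avg.__getitem__), avg

# Hypothetical [fire, no_fire] probabilities from the two backbones.
cls, avg = ensemble_predict([0.8, 0.2], [0.6, 0.4])
# cls == 0: both models lean towards "fire", so the ensemble does too.
```

Averaging probabilities (soft voting) tends to be more robust than majority voting when the base models are well calibrated, since it preserves each model's confidence.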
Topics: Climate Change; Deep Learning; Fires; Wildfires
PubMed: 35271126
DOI: 10.3390/s22051977
Biomedical Engineering Online Sep 2023
Review
Transformers have been widely used in many computer vision challenges and have shown the capability of producing better results than convolutional neural networks (CNNs). Taking advantage of their capacity to capture long-range contextual information and learn more complex relations in image data, Transformers have been applied to histopathological image processing tasks. In this survey, we present a thorough analysis of the uses of Transformers in histopathological image analysis, covering several topics, from newly built Transformer models to unresolved challenges. We first outline the fundamental principles of the attention mechanism included in Transformer models and other key frameworks. Second, we analyze Transformer-based applications in the histopathological imaging domain and provide a thorough evaluation of more than 100 research publications across different downstream tasks, covering the most recent innovations, including survival analysis and prediction, segmentation, classification, detection, and representation. Within this survey, we also compare the performance of CNN-based techniques to Transformers based on recently published papers, highlight major challenges, and provide interesting future research directions. Despite the outstanding performance of Transformer-based architectures in a number of papers reviewed in this survey, we anticipate that further improvements and exploration of Transformers in the histopathological imaging domain are still required. We hope that this survey will give readers in this field a thorough understanding of Transformer-based techniques in histopathological image analysis; an up-to-date paper list is provided at https://github.com/S-domain/Survey-Paper.
Topics: Image Processing, Computer-Assisted; Learning; Neural Networks, Computer
PubMed: 37749595
DOI: 10.1186/s12938-023-01157-0
Journal of Biomedical Informatics Feb 2022
Review
Transformer-based pretrained language models (PLMs) have started a new era in modern natural language processing (NLP). These models combine the power of transformers, transfer learning, and self-supervised learning (SSL). Following the success of these models in the general domain, the biomedical research community has developed various in-domain PLMs, from BioBERT to the latest BioELECTRA and BioALBERT models. We believe there is a need for a comprehensive survey of transformer-based biomedical pretrained language models (BPLMs). In this survey, we start with a brief overview of foundational concepts such as self-supervised learning, the embedding layer and transformer encoder layers. We discuss core concepts of transformer-based PLMs, such as pretraining methods, pretraining tasks, fine-tuning methods, and various embedding types specific to the biomedical domain. We introduce a taxonomy for transformer-based BPLMs and then discuss all the models. We discuss various challenges and present possible solutions. We conclude by highlighting some of the open issues that will drive the research community to further improve transformer-based BPLMs. The list of all publicly available transformer-based BPLMs, along with their links, is provided at https://mr-nlp.github.io/posts/2021/05/transformer-based-biomedical-pretrained-language-models-list/.
Topics: Biomedical Research; Language; Natural Language Processing
PubMed: 34974190
DOI: 10.1016/j.jbi.2021.103982
BMC Bioinformatics Nov 2023
BACKGROUND
Galaxy is a web-based open-source platform for scientific analyses. Researchers use thousands of high-quality tools and workflows for their respective analyses in Galaxy. A tool recommender system predicts a collection of tools that can be used to extend an analysis. In this work, a tool recommender system is developed by training a transformer on workflows available on Galaxy Europe, and its performance is compared to other neural networks such as recurrent, convolutional and dense neural networks.
RESULTS
The transformer neural network converges twice as fast, has significantly lower model usage (model reconstruction and prediction) time, and generalises beyond the training workflows better than the older RNN-based tool recommender system in Galaxy. The transformer also outperforms CNN and DNN on several key indicators: it achieves faster convergence, lower model usage time, and higher-quality tool recommendations than the CNN, and compared to the DNN it converges faster to a higher precision@k metric (approximately 0.98 for the transformer versus approximately 0.9 for the DNN) and produces higher-quality tool recommendations.
CONCLUSION
Our work shows a novel usage of transformers to recommend tools for extending scientific workflows. The transformer-based recommendation model is more robust, with significantly lower usage time than the RNN and CNN, higher precision@k than the DNN, and higher-quality tool recommendations than all three neural networks; it will benefit researchers in creating scientifically significant workflows and performing exploratory data analysis in Galaxy. Additionally, its ability to train faster than all three neural networks imparts more scalability for training on larger datasets consisting of millions of tool sequences. Open-source scripts to create the recommendation model are available under the MIT licence at https://github.com/anuprulez/galaxy_tool_recommendation_transformers.
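The precision@k figures quoted in this abstract measure how many of the top-k recommended tools are actually relevant to the workflow being extended. A minimal sketch of the metric (tool names below are hypothetical, not from the paper):

```python
def precision_at_k(recommended, relevant, k):
    """Fraction of the top-k recommended items that appear in the relevant set."""
    top_k = recommended[:k]
    return sum(1 for item in top_k if item in relevant) / k

# Hypothetical ranked recommendation list for a partial Galaxy workflow.
recommended = ["bwa_mem", "samtools_sort", "fastqc", "multiqc"]
relevant = {"bwa_mem", "fastqc"}
p_at_2 = precision_at_k(recommended, relevant, 2)  # -> 0.5
```

Because only the top k positions count, the metric rewards models that rank genuinely useful tools first, which matches how a recommendation list is consumed in practice.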
Topics: Software; Neural Networks, Computer; Workflow; Data Analysis; Europe
PubMed: 38012574
DOI: 10.1186/s12859-023-05573-w
Sensors (Basel, Switzerland) Oct 2022
Review
Transformers play an essential role in power networks, ensuring that generated power reaches consumers at the safest voltage level. However, they are prone to insulation failure from ageing, which can have fatal and economic consequences if left undetected or unattended. Traditional detection methods are based on scheduled maintenance practices that often involve taking samples from in situ transformers and analysing them in laboratories using several techniques. This conventional method exposes the engineer performing the test to hazards, requires specialised training, and does not guarantee reliable results because samples can be contaminated during collection and transportation. This paper reviews transformer oil types and some traditional ageing detection methods, including breakdown voltage (BDV), spectroscopy, dissolved gas analysis, total acid number, and interfacial tension, and the corresponding regulating standards. In addition, this work reviews sensors, technologies for improving the reliability of online ageing detection, and related online transformer ageing systems. A non-destructive online ageing detection method for in situ transformer oil is a better alternative to the traditional offline detection method. Moreover, when combined with the Internet of Things (IoT) and artificial intelligence, a prescriptive maintenance solution emerges, offering more advantages and robustness than offline preventive maintenance approaches.
Topics: Artificial Intelligence; Reproducibility of Results; Electric Power Supplies; Maintenance
PubMed: 36298273
DOI: 10.3390/s22207923
International Journal of Environmental... Feb 2023
Diagnostic Accuracy of Differential-Diagnosis Lists Generated by Generative Pretrained Transformer 3 Chatbot for Clinical Vignettes with Common Chief Complaints: A Pilot Study.
The diagnostic accuracy of differential diagnoses generated by artificial intelligence (AI) chatbots, including the generative pretrained transformer 3 (GPT-3) chatbot (ChatGPT-3), is unknown. This study evaluated the accuracy of differential-diagnosis lists generated by ChatGPT-3 for clinical vignettes with common chief complaints. General internal medicine physicians created clinical cases, correct diagnoses, and five differential diagnoses for ten common chief complaints. The rate of correct diagnosis by ChatGPT-3 within the ten differential-diagnosis lists was 28/30 (93.3%). The rate of correct diagnosis by physicians was still superior to that by ChatGPT-3 within the five differential-diagnosis lists (98.3% vs. 83.3%, p = 0.03). The rate of correct diagnosis by physicians was also superior to that by ChatGPT-3 in the top diagnosis (93.3% vs. 53.3%, p < 0.001). The rate of consistent differential diagnoses among physicians within the ten differential-diagnosis lists generated by ChatGPT-3 was 62/88 (70.5%). In summary, this study demonstrates the high diagnostic accuracy of differential-diagnosis lists generated by ChatGPT-3 for clinical cases with common chief complaints. This suggests that AI chatbots such as ChatGPT-3 can generate a well-differentiated diagnosis list for common chief complaints. However, the ordering of these lists can be improved in the future.
Topics: Humans; Artificial Intelligence; Diagnosis, Differential; Pilot Projects; Software; General Practitioners
PubMed: 36834073
DOI: 10.3390/ijerph20043378