-
PloS One 2024Singing voice separation on robots faces the problem of interpreting ambiguous auditory signals. The acoustic signal, which the humanoid robot perceives through its...
Singing voice separation on robots faces the problem of interpreting ambiguous auditory signals. The acoustic signal, which the humanoid robot perceives through its onboard microphones, is a mixture of singing voice, music, and noise, with distortion, attenuation, and reverberation. In this paper, we used the 3D Inception-ResUNet structure in the U-shaped encoding and decoding network to improve the utilization of the spatial and spectral information of the spectrogram. Multiobjectives were used to train the model: magnitude consistency loss, phase consistency loss, and magnitude correlation consistency loss. We recorded the singing voice and accompaniment derived from the MIR-1K dataset with NAO robots and synthesized the 10-channel dataset for training the model. The experimental results show that the proposed model trained by multiple objectives reaches an average NSDR of 11.55 dB on the test dataset, which outperforms the comparison model.
Topics: Singing; Voice Quality; Music; Acoustics
PubMed: 38285654
DOI: 10.1371/journal.pone.0289453 -
National Journal of Maxillofacial... 2023Hemifacial microsomia (HFM) is a complex congenital malformation with an extremely variable phenotypic presentation. It usually involves structures of the first and...
Hemifacial microsomia (HFM) is a complex congenital malformation with an extremely variable phenotypic presentation. It usually involves structures of the first and second pharyngeal arches. Anomalies of the cardiac, pulmonary, renal, and gastrointestinal systems are present, but the main characteristic is the mandibular hypoplasia. This is commonly treated with orthodontic hardware and various surgical modalities. Most recently, a total joint replacement with a customized prosthesis is idealized to provide the best outcomes to these patients, so it has been used in some cases. The following case is of a 23-year-old female with congenital hypoplastic mandibular head and the absence of mandibular fossa. The proposed treatment was to reconstruct the mandible with a customized prosthesis and orthognathic surgery to correct the asymmetry and provide better phonation, speech, and facial contour. The patient is under six years follow-up with a complete adaptation of the prosthesis.
PubMed: 38273912
DOI: 10.4103/njms.njms_14_22 -
BMC Women's Health Jan 2024Control of pelvic floor muscles (PFM) is emphasized as important to obtain functional breath support in opera singing, but there is not much research that proves PFM...
BACKGROUND
Control of pelvic floor muscles (PFM) is emphasized as important to obtain functional breath support in opera singing, but there is not much research that proves PFM function as part of breath support in classical singing. Transperineal ultrasound is a reliable method for quantification of PFM contraction in urogynecology. Our aim was to establish if transperineal ultrasound can be used for observation of movement of the PFM during singing and to quantify pelvic floor contraction.
METHODS
Cross sectional study of 10 professional opera singers examined with transperineal ultrasound in the supine position at rest and contraction, and standing at rest and during singing. Levator hiatal area was measured in a 3D rendered volume. Levator hiatal anteroposterior (AP) diameter and bladder neck distance from symphysis were measured in 2D images.
RESULTS
The AP diameter was shortened from supine rest to contraction (15 mm), standing (6 mm) and singing (9 mm), all p < 0.01. The bladder neck had a non-significant descent of 3 mm during singing. The mean proportional change in AP diameter from rest to contraction was 24.2% (moderate to strong contraction) and from rest to singing was 15% (weak to moderate contraction).
CONCLUSIONS
Transperineal ultrasound can be used to examine the PFM during singing. The classically trained singers had good voluntary PFM contraction and moderate contraction during singing. AP diameter was significantly shortened from supine to upright position, with further shortening during singing, confirming that female opera singers contracted their pelvic floor during singing.
Topics: Female; Humans; Pelvic Floor; Pilot Projects; Singing; Cross-Sectional Studies; Ultrasonography
PubMed: 38267929
DOI: 10.1186/s12905-024-02895-6 -
Intensive & Critical Care Nursing Jun 2024This study aimed to investigate the voice use of nurses working in intensive care units (ICUs) and their perception of acoustic environments.
OBJECTIVE
This study aimed to investigate the voice use of nurses working in intensive care units (ICUs) and their perception of acoustic environments.
SETTING AND SAMPLE
The research was conducted in four different hospitals in China during the COVID-19 pandemic. A total of 60 ICU nurses were recruited for their voice use monitoring and 100 nurses participated in the survey.
RESEARCH METHODOLOGY
Firstly, voice-related parameters such as voice level (SPL, dB), fundamental frequency (F0, Hz), and voicing time percentage (Dt, %) were measured using a vocal monitor. To collect data, a non-invasive accelerometer was attached to the participants' necks during their working hours. Secondly, the perception of the ICU acoustic environment was assessed using semantic differential.
RESULTS
The results showed that nurses spoke approximately 0.9-4 dB louder to patients and colleagues in ICUs compared to quiet rooms, and their fundamental frequency (F0) significantly increased during work. The voice levels of nurses were influenced by background noise levels, with a significant correlation coefficient of 0.44 (p < 0.01). Furthermore, the background noise levels ranged from 58.1 to 73.9 dBA, exceeding the guideline values set by the World Health Organisation (WHO). The semantic differential analysis identified 'Stress' and 'Irritation' as the two main components, indicating the prevalence of negative experiences within ICUs.
IMPLICATIONS FOR CLINICAL PRACTICE
This study highlights the potential risk of voice disorders among ICU nurses. The findings also underscore the importance of implementing strategies to reduce noise levels in ICUs to reduce voice disorders among nurses.
Topics: Humans; Phonation; COVID-19; Pandemics; Voice Disorders; Intensive Care Units; Nurses
PubMed: 38232571
DOI: 10.1016/j.iccn.2023.103620 -
Journal of Religion and Health Feb 2024This study explores the dynamics of coping strategies of Czech religious leaders during a peak of the COVID-19 pandemic. An interpretative phenomenological analysis...
This study explores the dynamics of coping strategies of Czech religious leaders during a peak of the COVID-19 pandemic. An interpretative phenomenological analysis reveals that mental health among pastors is closely linked to a need to maintain community and social contact, while physical health is related to limitations upon ritual elements. In all narratives, the lived experience of mental health in the form of prosocial behavior is significantly prioritized despite the possibility of spreading infection. The analysis also shows that maintaining the community is closely linked to risky behaviors, which positively affected group and individual well-being.
Topics: Humans; Singing; COVID-19; Clergy; Czech Republic; Pandemics
PubMed: 38227155
DOI: 10.1007/s10943-023-01977-3 -
Journal of Cancer Research and Clinical... Jan 2024The study aimed to assess the relevance of objective vibratory parameters derived from high-speed videolaryngoscopy (HSV) as a supporting tool, to assist clinicians in...
PURPOSE
The study aimed to assess the relevance of objective vibratory parameters derived from high-speed videolaryngoscopy (HSV) as a supporting tool, to assist clinicians in establishing the initial diagnosis of benign and malignant glottal organic lesions.
METHODS
The HSV examinations were conducted in 175 subjects: 50 normophonic, 85 subjects with benign vocal fold lesions, and 40 with early glottic cancer; organic lesions were confirmed by histopathologic examination. The parameters, derived from HSV kymography: amplitude, symmetry, and glottal dynamic characteristics, were compared statistically between the groups with the following ROC analysis.
RESULTS
Among 14 calculated parameters, 10 differed significantly between the groups. Four of them, the average resultant amplitude of the involved vocal fold (AmpInvolvedAvg), average amplitude asymmetry for the whole glottis and its middle third part (AmplAsymAvg; AmplAsymAvg_2/3), and absolute average phase difference (AbsPhaseDiffAvg), showed significant differences between benign and malignant lesions. Amplitude values were decreasing, while asymmetry and phase difference values were increasing with the risk of malignancy. In ROC analysis, the highest AUC was observed for AmpAsymAvg (0.719; p < 0.0001), and next in order was AmpInvolvedAvg (0.70; p = 0.0002).
CONCLUSION
The golden standard in the diagnosis of organic lesions of glottis remains clinical examination with videolaryngoscopy, confirmed by histopathological examination. Our results showed that measurements of amplitude, asymmetry, and phase of vibrations in malignant vocal fold masses deteriorate significantly in comparison to benign vocal lesions. High-speed videolaryngoscopy could aid their preliminary differentiation noninvasively before histopathological examination; however, further research on larger groups is needed.
Topics: Humans; Phonation; Laryngoscopy; Vocal Cords; Vibration; Kymography
PubMed: 38216796
DOI: 10.1007/s00432-023-05543-y -
BMJ Open Jan 2024Invasive non-typhoidal (iNTS) serovars are a major cause of community-acquired bloodstream infections in sub-Saharan Africa (SSA). In this setting, serovar Typhimurium...
Protocol for the challenge non-typhoidal (CHANTS) study: a first-in-human, in-patient, double-blind, randomised, safety and dose-escalation controlled human infection model in the UK.
INTRODUCTION
Invasive non-typhoidal (iNTS) serovars are a major cause of community-acquired bloodstream infections in sub-Saharan Africa (SSA). In this setting, serovar Typhimurium accounts for two-thirds of infections and is associated with an estimated case fatality rate of 15%-20%. Several iNTS vaccine candidates are in early-stage assessment which-if found effective-would provide a valuable public health tool to reduce iNTS disease burden. The CHANTS study aims to develop a first-in-human Typhimurium controlled human infection model, which can act as a platform for future vaccine evaluation, in addition to providing novel insights into iNTS disease pathogenesis.
METHODS AND ANALYSIS
This double-blind, safety and dose-escalation study will randomise 40-80 healthy UK participants aged 18-50 to receive oral challenge with one of two strains of . Typhimurium belonging to the ST19 (strain 4/74) or ST313 (strain D23580) lineages. 4/74 is a global strain often associated with diarrhoeal illness predominantly in high-income settings, while D23580 is an archetypal strain representing invasive disease-causing isolates found in SSA. The primary objective is to determine the minimum infectious dose (colony-forming unit) required for 60%-75% of participants to develop clinical or microbiological features of systemic salmonellosis. Secondary endpoints are to describe and compare the clinical, microbiological and immunological responses following challenge. Dose escalation or de-escalation will be undertaken by continual-reassessment methodology and limited within prespecified safety thresholds. Exploratory objectives are to describe mechanisms of iNTS virulence, identify putative immune correlates of protection and describe host-pathogen interactions in response to infection.
ETHICS AND DISSEMINATION
Ethical approval has been obtained from the NHS Health Research Authority (London-Fulham Research Ethics Committee 21/PR/0051; IRAS Project ID 301659). The study findings will be disseminated in international peer-reviewed journals and presented at national/international stakeholder meetings. Study outcome summaries will be provided to both funders and participants.
TRIAL REGISTRATION NUMBER
NCT05870150.
Topics: Humans; Singing; Salmonella; Typhoid Fever; London; Vaccines; Randomized Controlled Trials as Topic
PubMed: 38199617
DOI: 10.1136/bmjopen-2023-076477 -
European Archives of... Apr 2024This study evaluated the swallowing and voice function of laryngeal cancer patients after Supracricoid Partial Laryngectomy(SCPL), and its influence on quality of life...
OBJECTIVE
This study evaluated the swallowing and voice function of laryngeal cancer patients after Supracricoid Partial Laryngectomy(SCPL), and its influence on quality of life to provide a reference for the selection of surgical methods for laryngeal cancer patients.
METHODS
Twenty-one patients who received SCPL between April 2015 and November 2021 were included. Each patient's swallowing function and quality of life were assessed through fiberoptic endoscopic examination of swallowing (FEES) and the M.D. Anderson Dysphagia Inventory (MDADI). Fundamental, jitter, shimmer, maximum phonation time (MPT), and voice handicap index-10 (VHI-10) were performed to assess voice function and voice-related quality of life.
RESULTS
The results of the FEES of the 21 patients were as follows: the rates of pharyngeal residue after swallowing solid, semiliquid, and liquid food were 0%, 28.57%, and 38.09%, respectively; the rates of laryngeal infiltration after swallowing solid, semiliquid, and liquid food were 0%, 28.57%, and 4.76%, respectively; and aspiration did not occur in any of the patients. In the evaluation of swallowing quality of life, the mean total MDADI score was 92.6 ± 6.32. The voice function evaluation showed that the mean F0, jitter, shimmer, and MPT values were 156.01 ± 120.87 (HZ), 11.57 ± 6.21 (%), 35.37 ± 14.16 (%) and 7.85 ± 6.08 (s), respectively. The mean total VHI-10 score was 7.14 ± 4.84.
CONCLUSION
SCPL provides patients with satisfactory swallowing and voice function. The patients in this study were satisfied with their quality of life in terms of swallowing and voice. SCPL can be used as a surgical method to preserve laryngeal function in patients with laryngeal cancer.
Topics: Humans; Laryngectomy; Deglutition; Laryngeal Neoplasms; Quality of Life; Voice
PubMed: 38183455
DOI: 10.1007/s00405-023-08416-9 -
Neurorehabilitation and Neural Repair Feb 2024Parkinson's disease can negatively affect vocal functioning and social wellbeing, particularly in the latter stages of disease progression. Face-to-face group singing... (Clinical Trial)
Clinical Trial
BACKGROUND
Parkinson's disease can negatively affect vocal functioning and social wellbeing, particularly in the latter stages of disease progression. Face-to-face group singing interventions can improve communication and wellbeing outcomes, yet not all people can access in-person sessions. To help overcome barriers to participation, exploration of the feasibility and utility of online therapeutic singing programs is needed.
OBJECTIVES
To evaluate the feasibility, acceptability, and preliminary efficacy of a 12-week ParkinSong Online intervention on speech and wellbeing for people with Parkinson's disease.
METHODS
A total of 28 participants with idiopathic Parkinson's disease were recruited to a single-arm feasibility study. Weekly 90-minute online sessions were co-facilitated by a music therapist and speech pathologist. Speech and wellbeing assessments were conducted pre and post intervention. Participant and facilitator surveys were administered after each session, with focus group interviews at the end of the program.
RESULTS
The recruitment rate was high (90%) with no attrition, adverse events, or safety issues. There was good intervention fidelity, attendance (average 89%), and positive participant experience. Feasibility was good, with technology reported as the main challenge (connecting and navigating Zoom). No improvements were seen in voice measures or wellbeing outcomes in this small trial. The online format used in this study did not provide the same benefits as in-person ParkinSong sessions.
CONCLUSIONS
ParkinSong Online is feasible for recreational purposes and social engagement provided that people have adequate technological knowledge or support. The optimal online delivery format to achieve communication improvements in Parkinson's awaits confirmation.
Topics: Humans; Feasibility Studies; Parkinson Disease; Singing; Surveys and Questionnaires; Telemedicine
PubMed: 38156662
DOI: 10.1177/15459683231219269 -
Digital Health 2023Amyotrophic lateral sclerosis (ALS) frequently causes speech impairments, which can be valuable early indicators of decline. Automated acoustic assessment of speech in...
BACKGROUND AND OBJECTIVE
Amyotrophic lateral sclerosis (ALS) frequently causes speech impairments, which can be valuable early indicators of decline. Automated acoustic assessment of speech in ALS is attractive, and there is a pressing need to validate such tools in line with best practices, including analytical and clinical validation. We hypothesized that data analysis using a novel speech assessment pipeline would correspond strongly to analyses performed using lab-standard practices and that acoustic features from the novel pipeline would correspond to clinical outcomes of interest in ALS.
METHODS
We analyzed data from three standard speech assessment tasks (i.e., vowel phonation, passage reading, and diadochokinesis) in 122 ALS patients. Data were analyzed automatically using a pipeline developed by Winterlight Labs, which yielded 53 acoustic features. First, for analytical validation, data were analyzed using a lab-standard analysis pipeline for comparison. This was followed by univariate analysis (Spearman correlations between individual features in Winterlight and in-lab datasets) and multivariate analysis (sparse canonical correlation analysis (SCCA)). Subsequently, clinical validation was performed. This included univariate analysis (Spearman correlation between automated acoustic features and clinical measures) and multivariate analysis (interpretable autoencoder-based dimensionality reduction).
RESULTS
Analytical validity was demonstrated by substantial univariate correlations (Spearman's > 0.70) between corresponding pairs of features from automated and lab-based datasets, as well as interpretable SCCA feature groups. Clinical validity was supported by strong univariate correlations between automated features and clinical measures (Spearman's > 0.70), as well as associations between multivariate outputs and clinical measures.
CONCLUSION
This novel, automated speech assessment feature set demonstrates substantial promise as a valid tool for analyzing impaired speech in ALS patients and for the further development of these technologies.
PubMed: 38144173
DOI: 10.1177/20552076231219102