-
Database : the Journal of Biological... Jan 2018Tripal community database construction toolkit utilizing the content management system Drupal. Tripal is used to make biological, genetic and genomic data more...
Tripal community database construction toolkit utilizing the content management system Drupal. Tripal is used to make biological, genetic and genomic data more discoverable, shareable, searchable and standardized. As funding for community-level genomics databases declines, Tripal's open-source codebase provides a means for sites to be built and maintained with a minimal investment in staff and new development. Tripal is ultimately as strong as the community of sites and developers that use it. We present a set of developer tools that will make building and maintaining Tripal 3 sites easier for new and returning users. These tools break down barriers to entry such as setting up developer and testing environments, acquiring and loading test datasets, working with controlled vocabulary terms and writing new Drupal classes.
Topics: Databases, Genetic; Humans; Software; Vocabulary
PubMed: 30295719
DOI: 10.1093/database/bay099 -
Journal of Biomedical Semantics Jun 2023The Findable, Accessible, Interoperable and Reusable(FAIR) Principles explicitly require the use of FAIR vocabularies, but what precisely constitutes a FAIR vocabulary...
BACKGROUND
The Findable, Accessible, Interoperable and Reusable(FAIR) Principles explicitly require the use of FAIR vocabularies, but what precisely constitutes a FAIR vocabulary remains unclear. Being able to define FAIR vocabularies, identify features of FAIR vocabularies, and provide assessment approaches against the features can guide the development of vocabularies.
RESULTS
We differentiate data, data resources and vocabularies used for FAIR, examine the application of the FAIR Principles to vocabularies, align their requirements with the Open Biomedical Ontologies principles, and propose FAIR Vocabulary Features. We also design assessment approaches for FAIR vocabularies by mapping the FVFs with existing FAIR assessment indicators. Finally, we demonstrate how they can be used for evaluating and improving vocabularies using exemplary biomedical vocabularies.
CONCLUSIONS
Our work proposes features of FAIR vocabularies and corresponding indicators for assessing the FAIR levels of different types of vocabularies, identifies use cases for vocabulary engineers, and guides the evolution of vocabularies.
Topics: Vocabulary, Controlled; Vocabulary; Biological Ontologies
PubMed: 37264430
DOI: 10.1186/s13326-023-00286-8 -
The British Journal of Sociology Mar 2021This paper examines the relationship between parents' and children's language skills for a nationally representative birth cohort born in the United Kingdom-the...
This paper examines the relationship between parents' and children's language skills for a nationally representative birth cohort born in the United Kingdom-the Millennium Cohort Study (MCS). We investigate both socioeconomic and ethnic differentials in children's vocabulary scores and the role of differences in parents' vocabulary scores in accounting for these. We find large vocabulary gaps between highly educated and less educated parents, and between ethnic groups. Nevertheless, socioeconomic and ethnic gaps in vocabulary scores are far wider among the parents than among their children. Parental vocabulary is a powerful mediator of inequalities in offspring's vocabulary scores at age 14, and also a powerful driver of change in language skills between the ages of five and 14. Once we account for parental vocabulary, no ethnic minority group of young people has a negative "vocabulary gap" compared to whites.
Topics: Adolescent; Cohort Studies; Humans; Language; Parent-Child Relations; Parents; Vocabulary
PubMed: 33595850
DOI: 10.1111/1468-4446.12780 -
PloS One 2021There are increasing applications of natural language processing techniques for information retrieval, indexing, topic modelling and text classification in engineering...
There are increasing applications of natural language processing techniques for information retrieval, indexing, topic modelling and text classification in engineering contexts. A standard component of such tasks is the removal of stopwords, which are uninformative components of the data. While researchers use readily available stopwords lists that are derived from non-technical resources, the technical jargon of engineering fields contains their own highly frequent and uninformative words and there exists no standard stopwords list for technical language processing applications. Here we address this gap by rigorously identifying generic, insignificant, uninformative stopwords in engineering texts beyond the stopwords in general texts, based on the synthesis of alternative statistical measures such as term frequency, inverse document frequency, and entropy, and curating a stopwords dataset ready for technical language processing applications.
Topics: Humans; Natural Language Processing; Semantics; Task Performance and Analysis; Vocabulary
PubMed: 34351911
DOI: 10.1371/journal.pone.0254937 -
Neuroscience and Biobehavioral Reviews Jun 2017Sleep plays a role in strengthening new words and integrating them with existing vocabulary knowledge, consistent with neural models of learning in which sleep supports... (Meta-Analysis)
Meta-Analysis Review
Sleep plays a role in strengthening new words and integrating them with existing vocabulary knowledge, consistent with neural models of learning in which sleep supports hippocampal transfer to neocortical memory. Such models are based on adult research, yet neural maturation may mean that the mechanisms supporting word learning vary across development. Here, we propose a model in which children may capitalise on larger amounts of slow-wave sleep to support a greater demand on learning and neural reorganisation, whereas adults may benefit from a richer knowledge base to support consolidation. Such an argument is reinforced by the well-reported "Matthew effect", whereby rich vocabulary knowledge is associated with better acquisition of new vocabulary. We present a meta-analysis that supports this association between children's existing vocabulary knowledge and their integration of new words overnight. Whilst multiple mechanisms likely contribute to vocabulary consolidation and neural reorganisation across the lifespan, we propose that contributions of existing knowledge should be rigorously examined in developmental studies. Such research has potential to greatly enhance neural models of learning.
Topics: Humans; Knowledge; Learning; Memory; Sleep; Vocabulary
PubMed: 28274725
DOI: 10.1016/j.neubiorev.2017.01.054 -
Cognitive Science May 2022The vocabulary of human languages has been argued to support efficient communication by optimizing the trade-off between simplicity and informativeness. The argument has...
The vocabulary of human languages has been argued to support efficient communication by optimizing the trade-off between simplicity and informativeness. The argument has been originally based on cross-linguistic analyses of vocabulary in semantic domains of content words, such as kinship, color, and number terms. The present work applies this analysis to a category of function words: indefinite pronouns (e.g., someone, anyone, no one). We build on previous work to establish the meaning space and featural make-up for indefinite pronouns, and show that indefinite pronoun systems across languages optimize the simplicity/informativeness trade-off. This demonstrates that pressures for efficient communication shape both content and function word categories. In doing so, our work aligns with several concurrent studies exploring the simplicity/informativeness trade-off in functional vocabulary. Importantly, we further argue that the trade-off may explain some of the universal properties of indefinite pronouns, thus reducing the explanatory load for linguistic theories.
Topics: Communication; Humans; Language; Linguistics; Semantics; Vocabulary
PubMed: 35579878
DOI: 10.1111/cogs.13142 -
Child Development May 2022The present study examined how children spontaneously represent facial cues associated with emotion. 106 three- to six-year-old children (48 male, 58 female; 9.4%...
The present study examined how children spontaneously represent facial cues associated with emotion. 106 three- to six-year-old children (48 male, 58 female; 9.4% Asian, 84.0% White, 6.6% more than one race) and 40 adults (10 male, 30 female; 10% Hispanic, 30% Asian, 2.5% Black, 57.5% White) were recruited from a Midwestern city (2019-2020), and sorted emotion cues in a spatial arrangement method that assesses emotion knowledge without reliance on emotion vocabulary. Using supervised and unsupervised analyses, the study found evidence for continuities and gradual changes in children's emotion knowledge compared to adults. Emotion knowledge develops through an incremental learning process in which children change their representations using combinations of factors-particularly valence-that are weighted differently across development.
Topics: Adult; Asian People; Child; Child, Preschool; Cues; Emotions; Female; Hispanic or Latino; Humans; Male; Vocabulary
PubMed: 34822168
DOI: 10.1111/cdev.13716 -
Journal of Intellectual Disability... Jan 2022This study examined the association between executive functioning and language in young adults with Down syndrome (DS).
BACKGROUND
This study examined the association between executive functioning and language in young adults with Down syndrome (DS).
METHOD
Nineteen young adults with DS (aged 19-24 years) completed standardised measures of overall cognition, vocabulary, verbal fluency and executive function skills.
RESULTS
Friedman's analysis of variance (χ (3) = 28.15, P < .001) and post hoc comparisons indicated that, on average, participants had a significantly lower overall non-verbal than verbal cognitive age equivalent and lower expressive than receptive vocabulary skills. Using Spearman correlations, performance on a verbal measure of cognition inhibition was significantly negatively related to receptive vocabulary (ρ = -.529, adjusted P = .036) and verbal fluency (ρ = -.608, adjusted P = .022). Attention was significantly positively correlated with receptive (ρ = .698, adjusted-p = .005) and expressive (ρ = .542, adjusted P = .027) vocabulary. Verbal working memory was significantly positively associated with receptive vocabulary (ρ = .585, adjusted P = .022) and verbal fluency (ρ = .737, adjusted P = .003). Finally, visuospatial working memory was significantly associated with receptive vocabulary (ρ = .562, adjusted P = .027).
CONCLUSIONS
Verbal and non-verbal measures of executive functioning skills had important associations with language ability in young adults with DS. Future translational research is needed to investigate causal pathways underlying these relationships. Research should explore if interventions aimed at increasing executive functioning skills (e.g. attention, inhibition and working memory) have the potential to lead to increases in language for young adults with DS.
Topics: Down Syndrome; Executive Function; Humans; Language; Memory, Short-Term; Vocabulary; Young Adult
PubMed: 34288180
DOI: 10.1111/jir.12868 -
Journal of Speech, Language, and... Sep 2023This study explored approaches for measuring vocabulary among bilingual children with varying levels of proficiency in Spanish and English.
PURPOSE
This study explored approaches for measuring vocabulary among bilingual children with varying levels of proficiency in Spanish and English.
METHOD
One hundred fifteen kindergarten and first-grade Spanish-English-speaking children completed measures of vocabulary and sentence repetition in Spanish and English. Scores were derived from their responses to the vocabulary measure: Spanish-only vocabulary, English-only vocabulary, conceptual vocabulary, and total vocabulary. Best language sentence repetition was also obtained. Using both visualization of data and statistical analysis, we tested for potential associations between children's relative language skills in Spanish and English and the scores they received on each of the vocabulary metrics.
RESULTS
Participants' single-language vocabulary scores were linearly associated with their relative language scores. Higher relative Spanish language skills corresponded with higher Spanish-only vocabulary scores, and higher English language skills corresponded with higher English-only vocabulary scores. A quadratic association between children's relative language and their conceptual vocabulary scores was observed. Children with more balanced skills in Spanish and English received lower scores for conceptual vocabulary. No association between total vocabulary and relative language was observed.
CONCLUSIONS
Results revealed evidence of differential test bias for single-language vocabulary scores and conceptual vocabulary scores. Spanish-only vocabulary underestimated knowledge of participants with higher English proficiency, whereas English-only vocabulary underestimated knowledge of participants with higher Spanish proficiency. Conceptual scoring yielded lower values for participants with relatively balanced proficiency in Spanish and English. There is need for further consideration of score and test functioning across the full continuum of bilinguals with dynamic proficiencies in each of their languages.
SUPPLEMENTAL MATERIAL
https://doi.org/10.23641/asha.23796330.
Topics: Child; Humans; Vocabulary; Multilingualism; Hispanic or Latino; Language Tests; Language
PubMed: 37541317
DOI: 10.1044/2023_JSLHR-22-00573 -
Sensors (Basel, Switzerland) Jul 2022Audio-visual speech recognition (AVSR) can significantly improve performance over audio-only recognition for small or medium vocabularies. However, current AVSR, whether...
Audio-visual speech recognition (AVSR) can significantly improve performance over audio-only recognition for small or medium vocabularies. However, current AVSR, whether hybrid or end-to-end (E2E), still does not appear to make optimal use of this secondary information stream as the performance is still clearly diminished in noisy conditions for large-vocabulary systems. We, therefore, propose a new fusion architecture-the decision fusion net (DFN). A broad range of time-variant reliability measures are used as an auxiliary input to improve performance. The DFN is used in both hybrid and E2E models. Our experiments on two large-vocabulary datasets, the Lip Reading Sentences 2 and 3 (LRS2 and LRS3) corpora, show highly significant improvements in performance over previous AVSR systems for large-vocabulary datasets. The hybrid model with the proposed DFN integration component even outperforms dynamic stream-weighting, which is considered to be the theoretical upper bound for conventional dynamic stream-weighting approaches. Compared to the hybrid audio-only model, the proposed DFN achieves a relative word-error-rate reduction of 51% on average, while the E2E-DFN model, with its more competitive audio-only baseline system, achieves a relative word error rate reduction of 43%, both showing the efficacy of our proposed fusion architecture.
Topics: Recognition, Psychology; Reproducibility of Results; Speech; Speech Perception; Vocabulary
PubMed: 35898005
DOI: 10.3390/s22155501