961 resultados para Robust speech recognition
Resumo:
A nyone traveling to the United States from countries other than New Zealand will be surprised by the prevalence of health-related advertisements on television, including ads for drugs. Typically, these TV ads follow a pattern: an ad for a burger at only 99 cents, followed by one for a proton-pump inhibitor, then an ad on healthy home-cooked food delivered directly to your home and an ad for a home-based abdominal workout DVD, followed by an ad for a lipid-lowering drug. There are, however, nuances. After 8 pm, the visitor might encounter an ad for the "little blue pill." This sequence sometimes includes an ad featuring antihistamines for allergic rhinitis in spring and one promoting antidepressants in the winter. Such direct-to-consumer advertising (DTCA) of prescription drugs is usual business in the United States and New Zealand but is prohibited in the rest of the world. Why? Because DTCA for prescribing drugs has pros and cons (discussed elsewhere,1-3 including in JGIM4) that are balanced differently in different countries. Constitutional factors-such as the First Amendment protections on speech, including commercial speech, in the United States5 -as well as patient and population safety considerations, which all differ across countries, modulate reactions to DTCA. Additionally, lack of robust data on the impact of DTCA on prescription drug use adds to the confusion. Evidence, though limited, suggests that DTCA increases drug sales. However, whether the increase in sales corrects existing underuse or encourages over/misuse is not clear.
Resumo:
The last 2 years have seen exciting advances in the genetics of Landau-Kleffner syndrome and related disorders, encompassed within the epilepsy-aphasia spectrum (EAS). The striking finding of mutations in the N-methyl-D-aspartate (NMDA) receptor subunit gene GRIN2A as the first monogenic cause in up to 20 % of patients with EAS suggests that excitatory glutamate receptors play a key role in these disorders. Patients with GRIN2A mutations have a recognizable speech and language phenotype that may assist with diagnosis. Other molecules involved in RNA binding and cell adhesion have been implicated in EAS; copy number variations are also found. The emerging picture highlights the overlap between the genetic determinants of EAS with speech and language disorders, intellectual disability, autism spectrum disorders and more complex developmental phenotypes.
Resumo:
Background Mesial temporal lobe epilepsy (MTLE) is the most common type of focal epilepsy in adults and can be successfully cured by surgery. One of the main complications of this surgery however is a decline in language abilities. The magnitude of this decline is related to the degree of language lateralization to the left hemisphere. Most fMRI paradigms used to determine language dominance in epileptic populations have used active language tasks. Sometimes, these paradigms are too complex and may result in patient underperformance. Only a few studies have used purely passive tasks, such as listening to standard speech. Methods In the present study we characterized language lateralization in patients with MTLE using a rapid and passive semantic language task. We used functional magnetic resonance imaging (fMRI) to study 23 patients [12 with Left (LMTLE), 11 with Right mesial temporal lobe epilepsy (RMTLE)] and 19 healthy right-handed controls using a 6 minute long semantic task in which subjects passively listened to groups of sentences (SEN) and pseudo sentences (PSEN). A lateralization index (LI) was computed using a priori regions of interest of the temporal lobe. Results The LI for the significant contrasts produced activations for all participants in both temporal lobes. 81.8% of RMTLE patients and 79% of healthy individuals had a bilateral language representation for this particular task. However, 50% of LMTLE patients presented an atypical right hemispheric dominance in the LI. More importantly, the degree of right lateralization in LMTLE patients was correlated with the age of epilepsy onset. Conclusions The simple, rapid, non-collaboration dependent, passive task described in this study, produces a robust activation in the temporal lobe in both patients and controls and is capable of illustrating a pattern of atypical language organization for LMTLE patients. Furthermore, we observed that the atypical right-lateralization patterns in LMTLE patients was associated to earlier age at epilepsy onset. These results are in line with the idea that early onset of epileptic activity is associated to larger neuroplastic changes.
Resumo:
We consider robust parametric procedures for univariate discrete distributions, focusing on the negative binomial model. The procedures are based on three steps: ?First, a very robust, but possibly inefficient, estimate of the model parameters is computed. ?Second, this initial model is used to identify outliers, which are then removed from the sample. ?Third, a corrected maximum likelihood estimator is computed with the remaining observations. The final estimate inherits the breakdown point (bdp) of the initial one and its efficiency can be significantly higher. Analogous procedures were proposed in [1], [2], [5] for the continuous case. A comparison of the asymptotic bias of various estimates under point contamination points out the minimum Neyman's chi-squared disparity estimate as a good choice for the initial step. Various minimum disparity estimators were explored by Lindsay [4], who showed that the minimum Neyman's chi-squared estimate has a 50% bdp under point contamination; in addition, it is asymptotically fully efficient at the model. However, the finite sample efficiency of this estimate under the uncontaminated negative binomial model is usually much lower than 100% and the bias can be strong. We show that its performance can then be greatly improved using the three step procedure outlined above. In addition, we compare the final estimate with the procedure described in
Resumo:
Alzheimer’s disease (AD) is the most prevalent form of progressive degenerative dementia and it has a high socio-economic impact in Western countries, therefore is one of the most active research areas today. Its diagnosis is sometimes made by excluding other dementias, and definitive confirmation must be done trough a post-mortem study of the brain tissue of the patient. The purpose of this paper is to contribute to improvement of early diagnosis of AD and its degree of severity, from an automatic analysis performed by non-invasive intelligent methods. The methods selected in this case are Automatic Spontaneous Speech Analysis (ASSA) and Emotional Temperature (ET), that have the great advantage of being non invasive, low cost and without any side effects.
Resumo:
This paper analyzes applications of cumulant analysis in speech processing. A special focus is made on different second-order statistics. A dominant role is played by an integral representation for cumulants by means of integrals involving cyclic products of kernels.
Resumo:
In this paper we show how a nonlinear preprocessing of speech signal -with high noise- based on morphological filters improves the performance of robust algorithms for pitch tracking (RAPT). This result happens for a very simple morphological filter. More sophisticated ones could even improve such results. Mathematical morphology is widely used in image processing and has a great amount of applications. Almost all its formulations derived in the two-dimensional framework are easily reformulated to be adapted to one-dimensional context
Resumo:
We prove the existence and local uniqueness of invariant tori on the verge of breakdown for two systems: the quasi-periodically driven logistic map and the quasi-periodically forced standard map. These systems exemplify two scenarios: the Heagy-Hammel route for the creation of strange non- chaotic attractors and the nonsmooth bifurcation of saddle invariant tori. Our proofs are computer- assisted and are based on a tailored version of the Newton-Kantorovich theorem. The proofs cannot be performed using classical perturbation theory because the two scenarios are very far from the perturbative regime, and fundamental hypotheses such as reducibility or hyperbolicity either do not hold or are very close to failing. Our proofs are based on a reliable computation of the invariant tori and a careful study of their dynamical properties, leading to the rigorous validation of the numerical results with our novel computational techniques.
Resumo:
In order to spare functional areas during the removal of brain tumours, electrical stimulation mapping was used in 90 patients (77 in the left hemisphere and 13 in the right; 2754 cortical sites tested). Language functions were studied with a special focus on comprehension of auditory and visual words and the semantic system. In addition to naming, patients were asked to perform pointing tasks from auditory and visual stimuli (using sets of 4 different images controlled for familiarity), and also auditory object (sound recognition) and Token test tasks. Ninety-two auditory comprehension interference sites were observed. We found that the process of auditory comprehension involved a few, fine-grained, sub-centimetre cortical territories. Early stages of speech comprehension seem to relate to two posterior regions in the left superior temporal gyrus. Downstream lexical-semantic speech processing and sound analysis involved 2 pathways, along the anterior part of the left superior temporal gyrus, and posteriorly around the supramarginal and middle temporal gyri. Electrostimulation experimentally dissociated perceptual consciousness attached to speech comprehension. The initial word discrimination process can be considered as an "automatic" stage, the attention feedback not being impaired by stimulation as would be the case at the lexical-semantic stage. Multimodal organization of the superior temporal gyrus was also detected since some neurones could be involved in comprehension of visual material and naming. These findings demonstrate a fine graded, sub-centimetre, cortical representation of speech comprehension processing mainly in the left superior temporal gyrus and are in line with those described in dual stream models of language comprehension processing.
Resumo:
Lexical diversity measures are notoriously sensitive to variations of sample size and recent approaches to this issue typically involve the computation of the average variety of lexical units in random subsamples of fixed size. This methodology has been further extended to measures of inflectional diversity such as the average number of wordforms per lexeme, also known as the mean size of paradigm (MSP) index. In this contribution we argue that, while random sampling can indeed be used to increase the robustness of inflectional diversity measures, using a fixed subsample size is only justified under the hypothesis that the corpora that we compare have the same degree of lexematic diversity. In the more general case where they may have differing degrees of lexematic diversity, a more sophisticated strategy can and should be adopted. A novel approach to the measurement of inflectional diversity is proposed, aiming to cope not only with variations of sample size, but also with variations of lexematic diversity. The robustness of this new method is empirically assessed and the results show that while there is still room for improvement, the proposed methodology considerably attenuates the impact of lexematic diversity discrepancies on the measurement of inflectional diversity.
Resumo:
Tumor antigen-specific CD4(+) T cells generally orchestrate and regulate immune cells to provide immune surveillance against malignancy. However, activation of antigen-specific CD4(+) T cells is restricted at local tumor sites where antigen-presenting cells (APCs) are frequently dysfunctional, which can cause rapid exhaustion of anti-tumor immune responses. Herein, we characterize anti-tumor effects of a unique human CD4(+) helper T-cell subset that directly recognizes the cytoplasmic tumor antigen, NY-ESO-1, presented by MHC class II on cancer cells. Upon direct recognition of cancer cells, tumor-recognizing CD4(+) T cells (TR-CD4) potently induced IFN-γ-dependent growth arrest in cancer cells. In addition, direct recognition of cancer cells triggers TR-CD4 to provide help to NY-ESO-1-specific CD8(+) T cells by enhancing cytotoxic activity, and improving viability and proliferation in the absence of APCs. Notably, the TR-CD4 either alone or in collaboration with CD8(+) T cells significantly inhibited tumor growth in vivo in a xenograft model. Finally, retroviral gene-engineering with T cell receptor (TCR) derived from TR-CD4 produced large numbers of functional TR-CD4. These observations provide mechanistic insights into the role of TR-CD4 in tumor immunity, and suggest that approaches to utilize TR-CD4 will augment anti-tumor immune responses for durable therapeutic efficacy in cancer patients.
Resumo:
The design and synthesis of two Janus-type heterocycles with the capacity to simultaneously recognize guanine and uracyl in G-U mismatched pairs through complementary hydrogen bond pairing is described. Both compounds were conveniently functionalized with a carboxylic function and efficiently attached to a tripeptide sequence by using solid-phase methodologies. Ligands based on the derivatization of such Janus compounds with a small aminoglycoside, neamine, and its guanidinylated analogue have been synthesized, and their interaction with Tau RNA has been investigated by using several biophysical techniques, including UV-monitored melting curves, fluorescence titration experiments, and 1H NMR. The overall results indicated that Janus-neamine/guanidinoneamine showed some preference for the +3 mutated RNA sequence associated with the development of some tauopathies, although preliminary NMR studies have not confirmed binding to G-U pairs. Moreover, a good correlation has been found between the RNA binding affinity of such Janus-containing ligands and their ability to stabilize this secondary structure upon complexation.