941 resultados para Visual Speaker Recognition, Visual Speech Recognition, Cascading Appearance-Based Features
Resumo:
OBJECTIVES: Dental erosion, the chemical dissolution of enamel without bacterial involvement, is a rarely reported manifestation of gastroesophageal reflux disease (GERD), as well as of recurrent vomiting and dietary habits. It leads to loss of tooth substance, hypersensitivity, functional impairment, and even tooth fracture. To date, dental erosions have been assessed using only very basic visual methods, and no evidence-based guidelines or studies exist regarding the prevention or treatment of GERD-related dental erosions. METHODS: In this randomized, double-blind study, we used optical coherence tomography (OCT) to quantify dental tissue demineralization and enamel loss before and after 3 weeks of acid-suppressive treatment with esomeprazole 20 mg b.i.d. or placebo in 30 patients presenting to the Berne University Dental Clinic with advanced dental erosions and abnormal acid exposure by 24-h esophageal pH manometry (defined as >4% of the 24-h period with pH<4). Enamel thickness, reflectivity, and absorbance as measures of demineralization were quantified by OCT before and after therapy at identical localizations on teeth with most severe visible erosions as well as several other predefined changes in teeth. RESULTS: The mean+/-s.e.m. decrease of enamel thickness of all teeth before and after treatment at the site of maximum exposure was 7.2+/-0.16 black trianglem with esomeprazole and 15.25+/-0.17black trianglem with placebo (P=0.013), representing a loss of 0.3% and 0.8% of the total enamel thickness, respectively. The change in optical reflectivity to a depth of 25 black trianglem after treatment was-1.122 +/-0.769 dB with esomeprazole and +2.059+/-0.534 dB with placebo (P 0.012), with increased reflectivity signifying demineralization. CONCLUSIONS: OCT non-invasively detected and quantified significantly diminished progression of dental tissue demineralization and enamel loss after only 3 weeks of treatment with esomeprazole 20 mg b.i.d. vs. placebo. This suggests that esomeprazole may be useful in counteracting progression of GERD-related dental erosions. Further validation of preventative treatment regimens using this sensitive detection method is required, including longer follow-up and correlation with quantitative reflux measures.
Resumo:
This paper presents an automatic modulation classifier for electronic warfare applications. It is a pattern recognition modulation classifier based on statistical features of the phase and instantaneous frequency. This classifier runs in a real time operation mode with sampling rates in excess of 1 Gsample/s. The hardware platform for this application is a Field Programmable Gate Array (FPGA). This AMC is subsidiary of a digital channelised receiver also implemented in the same platform.
Resumo:
This work presents a method to detect Microcalcifications in Regions of Interest from digitized mammograms. The method is based mainly on the combination of Image Processing, Pattern Recognition and Artificial Intelligence. The Top-Hat transform is a technique based on mathematical morphology operations that, in this work is used to perform contrast enhancement of microcalcifications in the region of interest. In order to find more or less homogeneous regions in the image, we apply a novel image sub-segmentation technique based on Possibilistic Fuzzy c-Means clustering algorithm. From the original region of interest we extract two window-based features, Mean and Deviation Standard, which will be used in a classifier based on a Artificial Neural Network in order to identify microcalcifications. Our results show that the proposed method is a good alternative in the stage of microcalcifications detection, because this stage is an important part of the early Breast Cancer detection
Resumo:
We present high spatial resolution ion-microprobe rare earth element (REE) data for discrete growth phases of complex polyphase zircons from early Archaean Amitsoq gneisses, outer Godthabsfjord, SW Greenland. In Matsuda diagrams, the two major growth phases, >3.8 Ga cores and ca. 3.65 Ga rims, have steep positive slopes from La to Lu, prominent positive Ce anomalies and negative Eu anomalies that are consistent with growth in a melt. Exceptions to this are non-cathodolurnmescent zircon developed between the cores and rims, sometimes truncating zoning in the cores, and late Archaean prismatic tip overgrowths, both of which exhibit flatter light REE (LREE) patterns and have small or no Eu anomaly, which we interpret as the result of metamorphism and/or small-degree, isolated partial melting. Our data support previous interpretations that the ca. 3.65 Ga zircon phase was generated in a melt, with the >3.8 Ga phase representing either original protolith zircons in a large degree partial melt or inherited zircons in an introduced magma. Regardless which of these two interpretations is correct for these, and similar, rocks in the outer GodthAbsfjord, the 3.65 Ga event will have profoundly affected isotopic systems and obscured beyond recognition any earlier igneous features such as cross-cutting relationships, which may only be assigned a minimum 3.65 Ga age. (C) 2003 Elsevier Science B.V. All rights reserved.
Resumo:
There have been two main approaches to feature detection in human and computer vision - luminance-based and energy-based. Bars and edges might arise from peaks of luminance and luminance gradient respectively, or bars and edges might be found at peaks of local energy, where local phases are aligned across spatial frequency. This basic issue of definition is important because it guides more detailed models and interpretations of early vision. Which approach better describes the perceived positions of elements in a 3-element contour-alignment task? We used the class of 1-D images defined by Morrone and Burr in which the amplitude spectrum is that of a (partially blurred) square wave and Fourier components in a given image have a common phase. Observers judged whether the centre element (eg ±458 phase) was to the left or right of the flanking pair (eg 0º phase). Lateral offset of the centre element was varied to find the point of subjective alignment from the fitted psychometric function. This point shifted systematically to the left or right according to the sign of the centre phase, increasing with the degree of blur. These shifts were well predicted by the location of luminance peaks and other derivative-based features, but not by energy peaks which (by design) predicted no shift at all. These results on contour alignment agree well with earlier ones from a more explicit feature-marking task, and strongly suggest that human vision does not use local energy peaks to locate basic first-order features. [Supported by the Wellcome Trust (ref: 056093)]
Resumo:
Background: During last decade the use of ECG recordings in biometric recognition studies has increased. ECG characteristics made it suitable for subject identification: it is unique, present in all living individuals, and hard to forge. However, in spite of the great number of approaches found in literature, no agreement exists on the most appropriate methodology. This study aimed at providing a survey of the techniques used so far in ECG-based human identification. Specifically, a pattern recognition perspective is here proposed providing a unifying framework to appreciate previous studies and, hopefully, guide future research. Methods: We searched for papers on the subject from the earliest available date using relevant electronic databases (Medline, IEEEXplore, Scopus, and Web of Knowledge). The following terms were used in different combinations: electrocardiogram, ECG, human identification, biometric, authentication and individual variability. The electronic sources were last searched on 1st March 2015. In our selection we included published research on peer-reviewed journals, books chapters and conferences proceedings. The search was performed for English language documents. Results: 100 pertinent papers were found. Number of subjects involved in the journal studies ranges from 10 to 502, age from 16 to 86, male and female subjects are generally present. Number of analysed leads varies as well as the recording conditions. Identification performance differs widely as well as verification rate. Many studies refer to publicly available databases (Physionet ECG databases repository) while others rely on proprietary recordings making difficult them to compare. As a measure of overall accuracy we computed a weighted average of the identification rate and equal error rate in authentication scenarios. Identification rate resulted equal to 94.95 % while the equal error rate equal to 0.92 %. Conclusions: Biometric recognition is a mature field of research. Nevertheless, the use of physiological signals features, such as the ECG traits, needs further improvements. ECG features have the potential to be used in daily activities such as access control and patient handling as well as in wearable electronics applications. However, some barriers still limit its growth. Further analysis should be addressed on the use of single lead recordings and the study of features which are not dependent on the recording sites (e.g. fingers, hand palms). Moreover, it is expected that new techniques will be developed using fiducials and non-fiducial based features in order to catch the best of both approaches. ECG recognition in pathological subjects is also worth of additional investigations.
Resumo:
Although persuasion often occurs via oral communication, it remains a comparatively understudied area. This research tested the hypothesis that changes in three properties of voice influence perceptions of speaker confidence, which in turn differentially affects attitudes according to different underlying psychological processes that the Elaboration Likelihood Model (ELM, Petty & Cacioppo, 1984), suggests should emerge under different levels of thought. Experiment 1 was a 2 (Elaboration: high vs. low) x 2 (Vocal speed: increased speed vs. decreased speed) x 2 (Vocal intonation: falling intonation vs. rising intonation) between participants factorial design. Vocal speed and vocal intonation influenced perceptions of speaker confidence as predicted. In line with the ELM, under high elaboration, confidence biased thought favorability, which in turn influenced attitudes. Under low elaboration, confidence did not bias thoughts but rather directly influenced attitudes as a peripheral cue. Experiment 2 used a similar design as Experiment 1 but focused on vocal pitch. Results confirmed pitch influenced perceptions of confidence as predicted. Importantly, we also replicated the bias and cue processes found in Experiment 1. Experiment 3 investigated the process by which a broader spectrum of speech rate influenced persuasion under moderate elaboration. In a 2 (Argument quality: strong vs. weak) x 4 (Vocal speed: extremely slow vs. moderately slow vs. moderately fast vs. extremely fast) between participants factorial design, results confirmed the hypothesized non-linear relationship between speech rate and perceptions of confidence. In line with the ELM, speech rate influenced persuasion based on the amount of processing. Experiment 4 investigated the effects of a broader spectrum of vocal intonation on persuasion under moderate elaboration and used a similar design as Experiment 3. Results indicated a partial success of our vocal intonation manipulation. No evidence was found to support the hypothesized mechanism. These studies show that changes in several different properties of voice can influence the extent to which others perceive them as confident. Importantly, evidence suggests different vocal properties influence persuasion by the same bias and cue processes under high and low thought. Evidence also suggests that under moderate thought, speech rate influences persuasion based on the amount of processing.
Resumo:
[EN]Most face recognition systems are based on some form of batch learning. Online face recognition is not only more practical, it is also much more biologically plausible. Typical batch learners aim at minimizing both training error and (a measure of) hypothesis complexity. We show that the same minimization can be done incrementally as long as some form of ”scaffolding” is applied throughout the learning process. Scaffolding means: make the system learn from samples that are neither too easy nor too difficult at each step. We note that such learning behavior is also biologically plausible. Experiments using large sequences of facial images support the theoretical claims. The proposed method compares well with other, numerical calculus-based online learners.
Resumo:
Thesis (Ph.D.)--University of Washington, 2016-08
Resumo:
The structure of an animal’s eye is determined by the tasks it must perform. While vertebrates rely on their two eyes for all visual functions, insects have evolved a wide range of specialized visual organs to support behaviors such as prey capture, predator evasion, mate pursuit, flight stabilization, and navigation. Compound eyes and ocelli constitute the vision forming and sensing mechanisms of some flying insects. They provide signals useful for flight stabilization and navigation. In contrast to the well-studied compound eye, the ocelli, seen as the second visual system, sense fast luminance changes and allows for fast visual processing. Using a luminance-based sensor that mimics the insect ocelli and a camera-based motion detection system, a frequency-domain characterization of an ocellar sensor and optic flow (due to rotational motion) are analyzed. Inspired by the insect neurons that make use of signals from both vision sensing mechanisms, advantages, disadvantages and complementary properties of ocellar and optic flow estimates are discussed.
Resumo:
The developmental progression of emotional competence in childhood provides a robust evidence for its relation to social competence and important adjustment outcomes. This study aimed to analyze how this association is established in middle childhood. For this purpose, we tested 182 Portuguese children aged between 8 and 11 years, of 3rd and 4th grades, in public schools. Firstly, for assessing social competence we used an instrument directed to children using critical social situations within the relationships with peers in the school context - Socially in Action-Peers (SAp) (Rocha, Candeias & Lopes da Silva, 2012); children were assessed by three sources: themselves, their peers and their teacher. Secondly, we assessed children’s emotional understanding, individually, with the Test of Emotion Comprehension (Pons & Harris, 2002; Pons, Harris & Rosnay, 2004). Relations between social competence levels (in a composite score and using self, peers and teachers’ scores) and emotional comprehension components (comprehension of the recognition of emotions, based on facial expressions; external emotional causes; contribute of desire to emotion; emotions based on belief; memory influence under emotional state evaluation; possibility of emotional regulation; possibility of hiding an emotional state; having mixed emotions; contribution of morality to emotion experience) were investigated by means of two SSA (Similarity Structure Analysis) - a Multidimensional Scaling procedure and the external variable as points technique. In the first structural analysis (SSA) we will consider self, peers and teachers’ scores on Social Competence as content variables and TEC as external variable; in the second SSA we will consider TEC components as content variables and Social Competence in their different levels as external variable. The implications of these MDS procedures in order to better understand how social competence and emotional comprehension are related in children is discussed, as well as the repercussions of these findings for social competence and emotional understanding assessment and intervention in childhood is examined.
Resumo:
Some decades of research on emotional development have underlined the contribution of several domains to emotional understanding in childhood. Based on this research, Pons and colleagues (Pons & Harris, 2002; Pons, Harris & Rosnay, 2004) have proposed the Test of Emotion Comprehension (TEC) which assesses nine domains of emotional understanding, namely the recognition of emotions, based on facial expressions; the comprehension of external emotional causes; impact of desire on emotions; emotions based on beliefs; memory influence on emotions; possibility of emotional regulation; possibility of hiding an emotional state; having mixed emotions; contribution of morality to emotional experiences. This instrument was administered individually to 182 Portuguese children aged between 8 and 11 years, of 3rd and 4th grades, in public schools. Additionally, we used the Socially in Action-Peers (SAp) (Rocha, Candeias & Lopes da Silva, 2012) to assess TEC’s criterion-related validity. Mean differences results in TEC by gender and by socio-economic status (SES) were analyzed. The results of the TEC’s psychometric analysis were performed in terms of items’ sensitivity and reliability (stability, test-retest). Finally, in order to explore the theoretical structure underlying TEC a Confirmatory Factor Analysis and a Similarity Structure Analysis were computed. Implications of these findings for emotional understanding assessment and intervention in childhood are discussed.
Resumo:
In this paper, a linguistically rule-based grapheme-to-phone (G2P) transcription algorithm is described for European Portuguese. A complete set of phonological and phonetic transcription rules regarding the European Portuguese standard variety is presented. This algorithm was implemented and tested by using online newspaper articles. The obtained experimental results gave rise to 98.80% of accuracy rate. Future developments in order to increase this value are foreseen. Our purpose with this work is to develop a module/ tool that can improve synthetic speech naturalness in European Portuguese. Other applications of this system can be expected like language teaching/learning. These results, together with our perspectives of future improvements, have proved the dramatic importance of linguistic knowledge on the development of Text-to-Speech systems (TTS).