961 results for Robust speech recognition
Abstract:
In this paper, we present an integrated system for real-time automatic detection of human actions from video. The proposed approach uses the boundary of humans as the main feature for recognizing actions. Background subtraction is performed using a Gaussian mixture model. Then, features are extracted from silhouettes and vector quantization is used to map features into symbols (bag-of-words approach). Finally, actions are detected using a hidden Markov model. The proposed system was validated using a newly collected real-world dataset. The obtained results show that the system is capable of achieving robust human detection in both indoor and outdoor environments. Moreover, promising classification results were achieved when detecting two basic human actions: walking and sitting.
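The quantization and scoring steps named in this abstract can be illustrated with a minimal sketch. It assumes a Euclidean nearest-codeword rule and a discrete-emission HMM scored with the forward algorithm; the codebook, the feature vectors and the function names are hypothetical and are not taken from the paper.

```python
import math

def quantize(features, codebook):
    """Map each feature vector to the index of its nearest codeword."""
    symbols = []
    for vec in features:
        distances = [math.dist(vec, codeword) for codeword in codebook]
        symbols.append(distances.index(min(distances)))
    return symbols

def forward_likelihood(symbols, start, trans, emit):
    """Likelihood of a symbol sequence under a discrete HMM (forward algorithm)."""
    n_states = len(start)
    alpha = [start[s] * emit[s][symbols[0]] for s in range(n_states)]
    for sym in symbols[1:]:
        alpha = [
            sum(alpha[p] * trans[p][s] for p in range(n_states)) * emit[s][sym]
            for s in range(n_states)
        ]
    return sum(alpha)

# Hypothetical 2-D silhouette features and a 3-entry codebook.
codebook = [(0.0, 0.0), (1.0, 1.0), (4.0, 0.0)]
features = [(0.1, -0.2), (0.9, 1.2), (3.5, 0.3), (1.1, 0.8)]
symbols = quantize(features, codebook)
```

In a setup of this kind, one HMM would typically be trained per action, and a symbol sequence would be assigned to the action whose model gives the highest likelihood.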
Abstract:
Integrated master's dissertation in Engineering and Management of Information Systems
Abstract:
Doctoral thesis in Electronics and Computer Engineering
Abstract:
It gives me great pleasure to accept the invitation to address this conference on “Meeting the Challenges of Cultural Diversity in the Irish Healthcare Sector” which is being organised by the Irish Health Services Management Institute in partnership with the National Consultative Committee on Racism and Interculturalism. The conference provides an important opportunity to develop our knowledge and understanding of the issues surrounding cultural diversity in the health sector from the twin perspectives of patients and staff. Cultural diversity has over recent years become an increasingly visible aspect of Irish society bringing with it both opportunities and challenges. It holds out great possibilities for the enrichment of all who live in Ireland but it also challenges us to adapt creatively to the changes required to realise this potential and to ensure that the experience is a positive one for all concerned but particularly for those in the minority ethnic groups. In the last number of years in particular, the focus has tended to be on people coming to this country either as refugees, asylum seekers or economic migrants. Government figures estimate that as many as 340,000 immigrants are expected in the next six years. However ethnic and cultural diversity are not new phenomena in Ireland. Travellers have a long history as an indigenous minority group in Ireland with a strong culture and identity of their own. The changing experience and dynamics of their relationship with the wider society and its institutions over time can, I think, provide some valuable lessons for us as we seek to address the more numerous and complex issues of cultural diversity which have arisen for us in the last decade. 
Turning more specifically to the health sector which is the focus of this conference, culture and identity have particular relevance to health service policy and provision in that The first requirement is that we in the health service acknowledge cultural diversity and the differences in behaviours and in the less obvious areas of values and beliefs that this often implies. Only by acknowledging these differences in a respectful way and informing ourselves of them can we address them. Our equality legislation – The Employment Equality Act, 1998 and the Equal Status Act, 2000 – prohibits discrimination on nine grounds including race and membership of the Traveller community. The Equal Status Act prohibits discrimination on an individual basis in relation to the nine grounds while for groups it provides for the promotion of equality of opportunity. The Act applies to the provision of services including health services. I will speak first about cultural diversity in relation to the patient. In this respect it is worth mentioning that the recognition of cultural diversity and appropriate responses to it were issues which were strongly emphasised in the public consultation process which we held earlier this year in the context of developing National Anti-Poverty targets for the health sector and also our new national health strategy. Awareness and sensitivity training for staff is a key requirement for adapting to a culturally diverse patient population. The focus of this training should be the development of the knowledge and skills to provide services sensitive to cultural diversity. Such training can often be most effectively delivered in partnership with members of the minority groups themselves. I am aware that the Traveller community, for example, is involved in in-service training for health care workers. 
I am also aware that the National Consultative Committee on Racism and Interculturalism has been involved in training with the Eastern Regional Health Authority. We need to have more such initiatives. A step beyond the sensitivity training for existing staff is the training of members of the minority communities themselves as workers in our health services. Again the Traveller community has set an example in this area with its Primary Health Care Project for Travellers. The Primary Health Care for Travellers Project was established in 1994 as a joint partnership initiative with the Eastern Health Board and Pavee Point, with ongoing technical assistance being provided from the Department of Community Health and General Practice, Trinity College, Dublin. This project was the first of its kind in the country and has facilitated The project included a training course which concentrated on skills development, capacity building and the empowerment of Travellers. This confidence and skill allowed the Community Health Workers to go out and conduct a baseline survey to identify and articulate Travellers’ health needs. This was the first time that Travellers were involved in this process; in the past their needs were assumed. The results of the survey were fed back to the community and they prioritised their needs and suggested changes to the health services which would facilitate their access and utilisation. Ongoing monitoring and data collection demonstrate a big improvement in levels of satisfaction and in the uptake and utilisation of health services by Travellers in the pilot area. This Primary Health Care for Travellers initiative is being replicated in three other areas around the country and funding has been approved for a further 9 new projects. This pilot project was the recipient of a WHO 50th anniversary commemorative award in 1998. The project is developing as a model of good practice which could inspire further initiatives of this type for other minority groups. 
Access to information has been identified in numerous consultative processes as a key factor in enabling people to take a proactive approach to managing their own health and that of their families and in facilitating their access to health services. Honouring our commitment to equity in these areas requires that information is provided in culturally appropriate formats. The National Health Promotion Strategy 2000-2005, for example, recognises that there exist within our society many groups with different requirements which need to be identified and accommodated when planning and implementing health promotion interventions. These groups include Travellers, refugees and asylum seekers, people with intellectual, physical or sensory disability and the gay and lesbian community. The Strategy acknowledges the challenge involved in being sensitive to the potential differences in patterns of poor health among these different groups. The Strategic aim is to promote the physical, mental and social well-being of individuals from these groups. The objectives of the Strategy on these issues are: While our long term aim may be to mainstream responses so that our health service is truly multicultural, we must recognise the need at this point in time for very specific focused responses particularly for groups with poor health status such as Travellers and also for refugees and asylum seekers. In the case of refugees and asylum seekers examples of targeted services are screening for communicable diseases – offered on a voluntary basis – and psychological support services for those who have suffered trauma before coming here. The two approaches of targeting and mainstreaming are not mutually exclusive. A combination of both is required at this point in time but the balance between them must be kept under constant review in the light of changing needs. A major requirement if we are to meet the challenge of cultural diversity is an appropriate data and research base. 
I think it is important that we build up our information and research data base in partnership with the minority groups themselves. We must establish what the health needs of diverse groups are; we must monitor uptake of services and how well we are responding to needs and we must monitor outcomes and health status. We must also examine the impact of the policies in other sectors on the health of minority groups. The National Health Information Strategy, currently being developed, and the recently published National Strategy for Health Research – Making Knowledge Work for Health provide important frameworks within which we can improve our data and research base. A culturally diverse health sector workforce – challenges and opportunities The Irish health service can benefit greatly from successful international recruitment. There has been a strong non-national representation amongst the medical profession for more than 30 years. More recently there have been significant increases in other categories of health service workers from overseas. The Department recognises the enormous value that overseas recruitment brings over a wide range of services and supports the development of effective and appropriate recruitment strategies in partnership with health service employers. These changes have made cultural diversity an important issue for all health service organisations. Diversity in the workplace is primarily about creating a culture that seeks, respects, values and harnesses difference. This includes all the differences that when added together make each person unique. So instead of the focus being on particular groups, diversity is about all of us. Change is not about helping “them” to join “us” but about critically looking at “us” and rooting out all aspects of our culture that inappropriately exclude people and prevent us from being inclusive in the way we relate to employees, potential employees and clients of the health service. 
International recruitment benefits consumers, Irish employees and the overseas personnel alike. Regardless of whether they are employed by the health service, members of minority groups will be clients of our service and consequently we need to be flexible in order to accommodate different cultural needs. For staff, we recognise that coming from other cultures can be a difficult transition. Consequently health service employers have made strong efforts to assist them during this period. Many organisations provide induction courses, religious facilities (such as prayer rooms) and help in finding suitable accommodation. The Health Service Employers Agency (HSEA) is developing an equal opportunities/diversity strategy and action plans as well as training programmes to support their implementation, to ensure that all health service employment policies and practices promote the equality/diversity agenda to continue the development of a culturally diverse health service. The management of this new environment is extremely important for the health service as it offers an opportunity to go beyond set legal requirements and to strive for an acceptance and nurturing of cultural differences. Workforce cultural diversity affords us the opportunity to learn from the working practices and perspectives of others by allowing personnel to present their ideas and experience through teamwork, partnership structures and other appropriate fora, leading to further improvement in the services we provide. It is important to ensure that both personnel units and line managers communicate directly with their staff and demonstrate by their actions that they intend to create an inclusive work place which doesn't demand that minority staff fit. Contented, valued employees who feel that there is a place for them in the organisation will deliver a high quality health service. 
Your conference here today has two laudable aims – to heighten awareness and assist health care staff to work effectively with their colleagues from different cultural backgrounds and to gain a greater understanding of the diverse needs of patients from minority ethnic backgrounds. There is a synergy in these aims and in the tasks to which they give rise in the management of our health service. The creative adaptations required for one have the potential to feed into the other. I would like to commend both organisations which are hosting this conference for their initiative in making this event happen, particularly at this time – Racism in the Workplace Week. I look forward very much to hearing the outcome of your deliberations. Thank you.
Abstract:
Bilingual speakers have slower and less robust lexical access than monolinguals, even when speaking in their native and dominant language. This phenomenon, commonly called "the bilingual disadvantage", is also observed in second-language speakers compared with first-language speakers. One cause that possibly contributes to these disadvantages is the use of inhibitory control during language production: inhibiting co-activated words from the language not currently in use can prevent intrusions from that language, but at the same time slow down language production. The first aim of the studies described in this report was to test this hypothesis through different predictions generated by theories of inhibitory language control. A second aim was to investigate the extent of the bilingual disadvantage within and beyond the production of isolated words, and to advance knowledge of the variables that modulate it. Regarding the first aim, the evidence obtained is incompatible with global inhibitory control, challenging the idea that bilingual speakers use specific mechanisms for lexical selection. This implies that a common explanation for language control and the bilingual disadvantage in lexical access is implausible. Regarding the second aim, the results show that (a) the bilingual disadvantage does not affect memory access; (b) the bilingual disadvantage extends to the production of connected speech; and (c) similarities between languages at different levels of representation, as well as frequency of use, are factors that modulate the bilingual disadvantage.
Abstract:
Introduction: Difficult tracheal intubation remains a constant and significant source of morbidity and mortality in anaesthetic practice. Insufficient airway assessment in the preoperative period continues to be a major cause of unanticipated difficult intubation. Although many risk factors have already been identified, preoperative airway evaluation is not always regarded as a standard procedure and the respective weight of each risk factor remains unclear. Moreover, the available predictive scores are not sensitive, only moderately specific and often operator-dependent. In order to improve the preoperative detection of patients at risk for difficult intubation, we developed a system for automated and objective evaluation of morphologic criteria of the face and neck using video recordings and advanced techniques borrowed from face recognition. Method and results: Frontal video sequences were recorded in 5 healthy volunteers. During the video recording, subjects were requested to perform maximal flexion-extension of the neck and to open the mouth wide with the tongue pulled out. A robust, real-time face tracking system was then applied to automatically identify and map a grid of 55 control points on the face, which were tracked during head motion. These points located important features of the face, such as the eyebrows, the nose, the contours of the eyes and mouth, and the external contours, including the chin. Moreover, based on this face tracking, the orientation of the head could also be estimated at each frame of the video sequence. Thus, we could infer for each frame the pitch angle of the head pose (related to the vertical rotation of the head) and obtain the degree of head extension. Morphological criteria used in the most frequently cited predictive scores were also extracted, such as mouth opening, degree of visibility of the uvula or thyromental distance. Discussion and conclusion: Preliminary results suggest the high feasibility of the technique.
The next step will be the application of the same automated and objective evaluation to patients who will undergo tracheal intubation. The difficulties related to intubation will then be correlated with the biometric characteristics of the patients. The aim is to analyze the biometric data with artificial intelligence algorithms to build a highly sensitive and specific predictive test.
Abstract:
In this paper we propose the inversion of nonlinear distortions in order to improve the recognition rates of a speaker recognizer system. We study the effect of saturations on the test signals, trying to take into account real situations where the training material has been recorded in a controlled situation but the testing signals present some mismatch with the input signal level (saturations). The experimental results for speaker recognition show that a combination of several strategies can improve the recognition rates with saturated test sentences from 80% to 89.39%, while the result with clean speech (without saturation) is 87.76% for one microphone. For speaker identification, the combination can reduce the minimum detection cost function with saturated test sentences from 6.42% to 4.15%, while the results with clean speech (without saturation) are 5.74% for one microphone and 7.02% for the other one.
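To make the level mismatch concrete, the sketch below models saturation as hard clipping and measures how much of a signal is affected. This is an assumed distortion model for illustration only; it does not reproduce the inversion strategies proposed in the paper, and the function names and test tone are hypothetical.

```python
import math

def saturate(signal, limit):
    """Hard-clip every sample to the range [-limit, +limit]."""
    return [max(-limit, min(limit, x)) for x in signal]

def clipped_fraction(signal, limit, tol=1e-12):
    """Fraction of samples sitting at (or beyond) the clipping level."""
    return sum(1 for x in signal if abs(x) >= limit - tol) / len(signal)

# Hypothetical test tone: 5 cycles over 100 samples, then doubled in level
# so that it exceeds an input range of +/-1.0.
tone = [math.sin(2 * math.pi * 5 * n / 100) for n in range(100)]
loud = [2.0 * x for x in tone]
clipped = saturate(loud, 1.0)
fraction = clipped_fraction(clipped, 1.0)
```

A statistic such as `clipped_fraction` could be used to decide whether a test utterance needs compensation before it is passed to the recognizer.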
Abstract:
In this paper we propose an endpoint detection system based on the use of several features extracted from each speech frame, followed by a robust classifier (i.e., AdaBoost and bagging of decision trees, and a multilayer perceptron) and a finite state automaton (FSA). We compare the use of four different classifiers in this task. The FSA module consisted of a 4-state decision logic that filtered false alarms and false positives. The proposed method used a look-ahead of 7 frames, the number of frames that maximized the accuracy of the system. The system was tested with real signals recorded inside a car, with signal-to-noise ratios ranging from 6 dB to 30 dB. Finally, we present experimental results demonstrating that the system yields robust endpoint detection.
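The abstract does not describe the four states of the decision logic, so the following is only a sketch of how a 4-state automaton can smooth per-frame classifier outputs. The state names, the `min_speech` and `hangover` parameters and their default values are assumptions, and the 7-frame look-ahead is not modelled.

```python
def smooth_decisions(frames, min_speech=3, hangover=3):
    """Turn a list of 0/1 frame decisions into (start, end) speech segments.

    `end` is exclusive. Assumes min_speech >= 2 and hangover >= 2.
    """
    state = "SILENCE"
    segments = []
    start = 0
    count = 0
    for i, is_speech in enumerate(frames):
        if state == "SILENCE":
            if is_speech:
                state, start, count = "ONSET", i, 1
        elif state == "ONSET":
            if is_speech:
                count += 1
                if count >= min_speech:
                    state = "SPEECH"
            else:
                state = "SILENCE"  # burst too short: treated as a false alarm
        elif state == "SPEECH":
            if not is_speech:
                state, count = "OFFSET", 1
        elif state == "OFFSET":
            if is_speech:
                state = "SPEECH"  # brief dropout inside speech
            else:
                count += 1
                if count >= hangover:
                    segments.append((start, i - count + 1))
                    state = "SILENCE"
    # Close a segment that is still open when the input ends.
    if state == "SPEECH":
        segments.append((start, len(frames)))
    elif state == "OFFSET":
        segments.append((start, len(frames) - count))
    return segments

# Hypothetical classifier output: isolated false alarms around one utterance.
decisions = [0, 1, 0, 0, 1, 1, 1, 1, 0, 1, 1, 0, 0, 0, 0, 1, 0]
segments = smooth_decisions(decisions)
```

The two intermediate states are what filter the errors: a single spurious speech frame never leaves `ONSET`, and a single missed frame inside an utterance never leaves `OFFSET`.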
Abstract:
We describe a series of experiments in which we start with English to French and English to Japanese versions of an Open Source rule-based speech translation system for a medical domain, and bootstrap corresponding statistical systems. Comparative evaluation reveals that the rule-based systems are still significantly better than the statistical ones, despite the fact that considerable effort has been invested in tuning both the recognition and translation components; also, a hybrid system only marginally improved recall at the cost of a loss in precision. The result suggests that rule-based architectures may still be preferable to statistical ones for safety-critical speech translation tasks.
Abstract:
In this paper we propose the inversion of nonlinear distortions in order to improve the recognition rates of a speaker recognizer system. We study the effect of saturations on the test signals, trying to take into account real situations where the training material has been recorded in a controlled situation but the testing signals present some mismatch with the input signal level (saturations). The experimental results show that a combination of several strategies can improve the recognition rates with saturated test sentences from 80% to 89.39%, while the result with clean speech (without saturation) is 87.76% for one microphone.
Abstract:
The purpose of our project is to contribute to earlier diagnosis of AD and better estimates of its severity by using automatic analysis performed through new biomarkers extracted from non-invasive intelligent methods. The methods selected in this case are speech biomarkers oriented to Spontaneous Speech and Emotional Response Analysis. Thus the main goal of the present work is feature search in Spontaneous Speech oriented to pre-clinical evaluation for the definition of a test for AD diagnosis by a one-class classifier. The one-class classification problem differs from the multi-class one in one essential aspect: in one-class classification it is assumed that only information about one of the classes, the target class, is available. In this work we explore the problem of imbalanced datasets, which is particularly crucial in applications where the goal is to maximize recognition of the minority class, as in medical diagnosis. The use of information about outliers and Fractal Dimension features improves the system performance.
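The defining property of one-class classification, training on target samples only, can be shown with a deliberately simple stand-in. The centroid-and-radius rule below is an assumption chosen for brevity and is not the classifier used in this work; the class name, the `keep` parameter and the sample data are hypothetical.

```python
import math

class CentroidOneClass:
    """Accept samples that lie close to the centroid of the target class."""

    def fit(self, targets, keep=0.9):
        dims = len(targets[0])
        self.centroid = [
            sum(t[d] for t in targets) / len(targets) for d in range(dims)
        ]
        distances = sorted(math.dist(t, self.centroid) for t in targets)
        # Radius chosen so that roughly `keep` of the training targets fall inside.
        index = min(len(distances) - 1, int(keep * len(distances)))
        self.radius = distances[index]
        return self

    def is_target(self, sample):
        return math.dist(sample, self.centroid) <= self.radius

# Hypothetical 2-D feature vectors from the target class only.
targets = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0), (1.0, 1.0)]
model = CentroidOneClass().fit(targets)
```

No outlier samples are needed to fit the model, which is why this family of methods suits datasets where one class is scarce or missing.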
Abstract:
Top-down contextual influences play a major part in speech understanding, especially in hearing-impaired patients with deteriorated auditory input. Those influences are most obvious in difficult listening situations, such as listening to sentences in noise but can also be observed at the word level under more favorable conditions, as in one of the most commonly used tasks in audiology, i.e., repeating isolated words in silence. This study aimed to explore the role of top-down contextual influences and their dependence on lexical factors and patient-specific factors using standard clinical linguistic material. Spondaic word perception was tested in 160 hearing-impaired patients aged 23-88 years with a four-frequency average pure-tone threshold ranging from 21 to 88 dB HL. Sixty spondaic words were randomly presented at a level adjusted to correspond to a speech perception score ranging between 40 and 70% of the performance intensity function obtained using monosyllabic words. Phoneme and whole-word recognition scores were used to calculate two context-influence indices (the j factor and the ratio of word scores to phonemic scores) and were correlated with linguistic factors, such as the phonological neighborhood density and several indices of word occurrence frequencies. Contextual influence was greater for spondaic words than in similar studies using monosyllabic words, with an overall j factor of 2.07 (SD = 0.5). For both indices, context use decreased with increasing hearing loss once the average hearing loss exceeded 55 dB HL. In right-handed patients, significantly greater context influence was observed for words presented in the right ears than for words presented in the left, especially in patients with many years of education. 
The correlations between raw word scores (and context influence indices) and word occurrence frequencies showed a significant age-dependent effect, with a stronger correlation between perception scores and word occurrence frequencies when the occurrence frequencies were based on the years corresponding to the patients' youth, showing a "historic" word frequency effect. This effect was still observed for patients with few years of formal education, but recent occurrence frequencies based on current word exposure had a stronger influence for those patients, especially for younger ones.
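The two context-influence indices mentioned in this abstract can be computed as sketched below, assuming the usual definition of the j factor, in which the whole-word recognition probability equals the phoneme recognition probability raised to the power j. The function names and the example scores are hypothetical.

```python
import math

def j_factor(word_score, phoneme_score):
    """Solve word_score = phoneme_score ** j for j (scores as proportions)."""
    if not (0.0 < word_score < 1.0 and 0.0 < phoneme_score < 1.0):
        raise ValueError("scores must lie strictly between 0 and 1")
    return math.log(word_score) / math.log(phoneme_score)

def word_to_phoneme_ratio(word_score, phoneme_score):
    """Second index: ratio of the word score to the phonemic score."""
    return word_score / phoneme_score

# Hypothetical patient: 80% of phonemes and 64% of whole words repeated correctly.
j = j_factor(0.64, 0.80)
ratio = word_to_phoneme_ratio(0.64, 0.80)
```

Under this definition, j can be read as the effective number of independently perceived units per word, so a smaller j indicates a stronger use of context.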
Abstract:
Human activity recognition in everyday environments is a critical but challenging task in Ambient Intelligence applications to achieve proper Ambient Assisted Living, and key challenges still remain to be dealt with to realize robust methods. One of the major limitations of the Ambient Intelligence systems today is the lack of semantic models of those activities on the environment, so that the system can recognize the specific activity being performed by the user(s) and act accordingly. In this context, this thesis addresses the general problem of knowledge representation in Smart Spaces. The main objective is to develop knowledge-based models, equipped with semantics to learn, infer and monitor human behaviours in Smart Spaces. Moreover, it is easy to recognize that some aspects of this problem have a high degree of uncertainty, and therefore, the developed models must be equipped with mechanisms to manage this type of information. A fuzzy ontology and a semantic hybrid system are presented to allow modelling and recognition of a set of complex real-life scenarios where vagueness and uncertainty are inherent to the human nature of the users that perform it. The handling of uncertain, incomplete and vague data (i.e., missing sensor readings and activity execution variations, since human behaviour is non-deterministic) is approached for the first time through a fuzzy ontology validated on real-time settings within a hybrid data-driven and knowledge-based architecture. The semantics of activities, sub-activities and real-time object interaction are taken into consideration. The proposed framework consists of two main modules: the low-level sub-activity recognizer and the high-level activity recognizer. The first module detects sub-activities (i.e., actions or basic activities) that take input data directly from a depth sensor (Kinect).
The main contribution of this thesis tackles the second component of the hybrid system, which lies on top of the previous one, at a higher level of abstraction, acquires its input data from the first module's output, and executes ontological inference to provide users, activities and their influence in the environment with semantics. This component is thus knowledge-based, and a fuzzy ontology was designed to model the high-level activities. Since activity recognition requires context-awareness and the ability to discriminate among activities in different environments, the semantic framework allows for modelling common-sense knowledge in the form of a rule-based system that supports expressions close to natural language in the form of fuzzy linguistic labels. The framework advantages have been evaluated with a challenging and new public dataset, CAD-120, achieving an accuracy of 90.1% and 91.1% respectively for low and high-level activities. This entails an improvement over both entirely data-driven approaches and merely ontology-based approaches. As an added value, for the system to be sufficiently simple and flexible to be managed by non-expert users, and thus facilitate the transfer of research to industry, a development framework composed of a programming toolbox, a hybrid crisp and fuzzy architecture, and graphical models to represent and configure human behaviour in Smart Spaces was developed in order to provide the framework with more usability in the final application. As a result, human behaviour recognition can help assist people with special needs, such as in healthcare, independent elderly living, remote rehabilitation monitoring, industrial process guideline control, and many other cases. This thesis shows use cases in these areas.
Abstract:
This lexical decision study with eye tracking of Japanese two-kanji-character words investigated the order in which a whole two-character word and its morphographic constituents are activated in the course of lexical access, the relative contributions of the left and the right characters in lexical decision, the depth to which semantic radicals are processed, and how nonlinguistic factors affect lexical processes. Mixed-effects regression analyses of response times and subgaze durations (i.e., first-pass fixation time spent on each of the two characters) revealed joint contributions of morphographic units at all levels of the linguistic structure with the magnitude and the direction of the lexical effects modulated by readers’ locus of attention in a left-to-right preferred processing path. During the early time frame, character effects were larger in magnitude and more robust than radical and whole-word effects, regardless of the font size and the type of nonwords. Extending previous radical-based and character-based models, we propose a task/decision-sensitive character-driven processing model with a level-skipping assumption: Connections from the feature level bypass the lower radical level and link up directly to the higher character level.
Abstract:
A speech by Sean O'Sullivan, given in the House of Commons, "For the Recognition of the Beaver as a Symbol of the Sovereignty of the Dominion of Canada".