981 resultados para Challenging environments
Resumo:
Novel techniques have been developed for the automatic recognition of human behaviour in challenging environments using information from visual and infra-red camera feeds. The techniques have been applied to two interesting scenarios: Recognise drivers' speech using lip movements and recognising audience behaviour, while watching a movie, using facial features and body movements. Outcome of the research in these two areas will be useful in the improving the performance of voice recognition in automobiles for voice based control and for obtaining accurate movie interest ratings based on live audience response analysis.
Resumo:
Automatically recognizing faces captured under uncontrolled environments has always been a challenging topic in the past decades. In this work, we investigate cohort score normalization that has been widely used in biometric verification as means to improve the robustness of face recognition under challenging environments. In particular, we introduce cohort score normalization into undersampled face recognition problem. Further, we develop an effective cohort normalization method specifically for the unconstrained face pair matching problem. Extensive experiments conducted on several well known face databases demonstrate the effectiveness of cohort normalization on these challenging scenarios. In addition, to give a proper understanding of cohort behavior, we study the impact of the number and quality of cohort samples on the normalization performance. The experimental results show that bigger cohort set size gives more stable and often better results to a point before the performance saturates. And cohort samples with different quality indeed produce different cohort normalization performance. Recognizing faces gone after alterations is another challenging problem for current face recognition algorithms. Face image alterations can be roughly classified into two categories: unintentional (e.g., geometrics transformations introduced by the acquisition devide) and intentional alterations (e.g., plastic surgery). We study the impact of these alterations on face recognition accuracy. Our results show that state-of-the-art algorithms are able to overcome limited digital alterations but are sensitive to more relevant modifications. Further, we develop two useful descriptors for detecting those alterations which can significantly affect the recognition performance. In the end, we propose to use the Structural Similarity (SSIM) quality map to detect and model variations due to plastic surgeries. Extensive experiments conducted on a plastic surgery face database demonstrate the potential of SSIM map for matching face images after surgeries.
Resumo:
Modern robots are increasingly expected to function in uncertain and dynamically challenging environments, often in proximity with humans. In addition, wide scale adoption of robots requires on-the-fly adaptability of software for diverse application. These requirements strongly suggest the need to adopt formal representations of high level goals and safety specifications, especially as temporal logic formulas. This approach allows for the use of formal verification techniques for controller synthesis that can give guarantees for safety and performance. Robots operating in unstructured environments also face limited sensing capability. Correctly inferring a robot's progress toward high level goal can be challenging.
This thesis develops new algorithms for synthesizing discrete controllers in partially known environments under specifications represented as linear temporal logic (LTL) formulas. It is inspired by recent developments in finite abstraction techniques for hybrid systems and motion planning problems. The robot and its environment is assumed to have a finite abstraction as a Partially Observable Markov Decision Process (POMDP), which is a powerful model class capable of representing a wide variety of problems. However, synthesizing controllers that satisfy LTL goals over POMDPs is a challenging problem which has received only limited attention.
This thesis proposes tractable, approximate algorithms for the control synthesis problem using Finite State Controllers (FSCs). The use of FSCs to control finite POMDPs allows for the closed system to be analyzed as finite global Markov chain. The thesis explicitly shows how transient and steady state behavior of the global Markov chains can be related to two different criteria with respect to satisfaction of LTL formulas. First, the maximization of the probability of LTL satisfaction is related to an optimization problem over a parametrization of the FSC. Analytic computation of gradients are derived which allows the use of first order optimization techniques.
The second criterion encourages rapid and frequent visits to a restricted set of states over infinite executions. It is formulated as a constrained optimization problem with a discounted long term reward objective by the novel utilization of a fundamental equation for Markov chains - the Poisson equation. A new constrained policy iteration technique is proposed to solve the resulting dynamic program, which also provides a way to escape local maxima.
The algorithms proposed in the thesis are applied to the task planning and execution challenges faced during the DARPA Autonomous Robotic Manipulation - Software challenge.
Resumo:
The underground scenarios are one of the most challenging environments for accurate and precise 3d mapping where hostile conditions like absence of Global Positioning Systems, extreme lighting variations and geometrically smooth surfaces may be expected. So far, the state-of-the-art methods in underground modelling remain restricted to environments in which pronounced geometric features are abundant. This limitation is a consequence of the scan matching algorithms used to solve the localization and registration problems. This paper contributes to the expansion of the modelling capabilities to structures characterized by uniform geometry and smooth surfaces, as is the case of road and train tunnels. To achieve that, we combine some state of the art techniques from mobile robotics, and propose a method for 6DOF platform positioning in such scenarios, that is latter used for the environment modelling. A visual monocular Simultaneous Localization and Mapping (MonoSLAM) approach based on the Extended Kalman Filter (EKF), complemented by the introduction of inertial measurements in the prediction step, allows our system to localize himself over long distances, using exclusively sensors carried on board a mobile platform. By feeding the Extended Kalman Filter with inertial data we were able to overcome the major problem related with MonoSLAM implementations, known as scale factor ambiguity. Despite extreme lighting variations, reliable visual features were extracted through the SIFT algorithm, and inserted directly in the EKF mechanism according to the Inverse Depth Parametrization. Through the 1-Point RANSAC (Random Sample Consensus) wrong frame-to-frame feature matches were rejected. The developed method was tested based on a dataset acquired inside a road tunnel and the navigation results compared with a ground truth obtained by post-processing a high grade Inertial Navigation System and L1/L2 RTK-GPS measurements acquired outside the tunnel. Results from the localization strategy are presented and analyzed.
Resumo:
Speech perception routinely takes place in noisy or degraded listening environments, leading to ambiguity in the identity of the speech token. Here, I present one review paper and two experimental papers that highlight cognitive and visual speech contributions to the listening process, particularly in challenging listening environments. First, I survey the literature linking audiometric age-related hearing loss and cognitive decline and review the four proposed causal mechanisms underlying this link. I argue that future research in this area requires greater consideration of the functional overlap between hearing and cognition. I also present an alternative framework for understanding causal relationships between age-related declines in hearing and cognition, with emphasis on the interconnected nature of hearing and cognition and likely contributions from multiple causal mechanisms. I also provide a number of testable hypotheses to examine how impairments in one domain may affect the other. In my first experimental study, I examine the direct contribution of working memory (through a cognitive training manipulation) on speech in noise comprehension in older adults. My results challenge the efficacy of cognitive training more generally, and also provide support for the contribution of sentence context in reducing working memory load. My findings also challenge the ubiquitous use of the Reading Span test as a pure test of working memory. In a second experimental (fMRI) study, I examine the role of attention in audiovisual speech integration, particularly when the acoustic signal is degraded. I demonstrate that attentional processes support audiovisual speech integration in the middle and superior temporal gyri, as well as the fusiform gyrus. My results also suggest that the superior temporal sulcus is sensitive to intelligibility enhancement, regardless of how this benefit is obtained (i.e., whether it is obtained through visual speech information or speech clarity). In addition, I also demonstrate that both the cingulo-opercular network and motor speech areas are recruited in difficult listening conditions. Taken together, these findings augment our understanding of cognitive contributions to the listening process and demonstrate that memory, working memory, and executive control networks may flexibly be recruited in order to meet listening demands in challenging environments.
Resumo:
In recent years the air transport industry has experienced unprecedented growth, driven by strong local and global economies. Whether this growth can continue in the face of anticipated oil crises; international economic forecasts and recent influenza outbreaks is yet to be seen. One thing is certain, airport owners and operators will continue to be faced with challenging environments in which to do business. In response, many airports recognize the value in diversifying their revenue streams through a variety of landside property developments within the airport boundary. In Australia it is the type and intended market of this development that is a point of contention between private airport corporations and their surrounding municipalities. The aim of this preliminary research is to identify and categorize on-airport development occurring at the twenty-two privatized Australian airports which are administered under the Airports Act [1996]. This new knowledge will assist airport and municipal planners in understanding the current extent and category of on-airport land use, allowing them to make better decisions when proposing development both within airport master plans and beyond the airport boundary in local town and municipal plans.
Resumo:
In recent times, the improved levels of accuracy obtained by Automatic Speech Recognition (ASR) technology has made it viable for use in a number of commercial products. Unfortunately, these types of applications are limited to only a few of the world’s languages, primarily because ASR development is reliant on the availability of large amounts of language specific resources. This motivates the need for techniques which reduce this language-specific, resource dependency. Ideally, these approaches should generalise across languages, thereby providing scope for rapid creation of ASR capabilities for resource poor languages. Cross Lingual ASR emerges as a means for addressing this need. Underpinning this approach is the observation that sound production is largely influenced by the physiological construction of the vocal tract, and accordingly, is human, and not language specific. As a result, a common inventory of sounds exists across languages; a property which is exploitable, as sounds from a resource poor, target language can be recognised using models trained on resource rich, source languages. One of the initial impediments to the commercial uptake of ASR technology was its fragility in more challenging environments, such as conversational telephone speech. Subsequent improvements in these environments has gained consumer confidence. Pragmatically, if cross lingual techniques are to considered a viable alternative when resources are limited, they need to perform under the same types of conditions. Accordingly, this thesis evaluates cross lingual techniques using two speech environments; clean read speech and conversational telephone speech. Languages used in evaluations are German, Mandarin, Japanese and Spanish. Results highlight that previously proposed approaches provide respectable results for simpler environments such as read speech, but degrade significantly when in the more taxing conversational environment. Two separate approaches for addressing this degradation are proposed. The first is based on deriving better target language lexical representation, in terms of the source language model set. The second, and ultimately more successful approach, focuses on improving the classification accuracy of context-dependent (CD) models, by catering for the adverse influence of languages specific phonotactic properties. Whilst the primary research goal in this thesis is directed towards improving cross lingual techniques, the catalyst for investigating its use was based on expressed interest from several organisations for an Indonesian ASR capability. In Indonesia alone, there are over 200 million speakers of some Malay variant, provides further impetus and commercial justification for speech related research on this language. Unfortunately, at the beginning of the candidature, limited research had been conducted on the Indonesian language in the field of speech science, and virtually no resources existed. This thesis details the investigative and development work dedicated towards obtaining an ASR system with a 10000 word recognition vocabulary for the Indonesian language.
Resumo:
Tarrant argues that a solid risk management strategy is critical to building effective, transformational and adaptive organisations. Organisations are a fundamental part of our society and economic system whether they are private, public or not-for-profits. There are very few aspects of our society and economy that don’t rely wholly or in part on the performance of organisations. Disasters and crises are complex and very challenging environments for organisations. How can effective transformational and adaptive capacity become institutionalised and a core part of good governance of organisations? Effective risk management is a critical element in meeting organisational objectives in a turbulent and uncertain environment.
Resumo:
Abstract. In recent years, sparse representation based classification(SRC) has received much attention in face recognition with multipletraining samples of each subject. However, it cannot be easily applied toa recognition task with insufficient training samples under uncontrolledenvironments. On the other hand, cohort normalization, as a way of mea-suring the degradation effect under challenging environments in relationto a pool of cohort samples, has been widely used in the area of biometricauthentication. In this paper, for the first time, we introduce cohort nor-malization to SRC-based face recognition with insufficient training sam-ples. Specifically, a user-specific cohort set is selected to normalize theraw residual, which is obtained from comparing the test sample with itssparse representations corresponding to the gallery subject, using poly-nomial regression. Experimental results on AR and FERET databases show that cohort normalization can bring SRC much robustness against various forms of degradation factors for undersampled face recognition.
Resumo:
The trend of cultural diversity is increasing in all organizations, especially engineering ones, due to globalization, mergers, joint ventures and the movement of the workforce. The collaborative nature of projects in engineering industries requires long-term teamwork between local and international engineers. Research confirms a specific culture among engineering companies that isassumed to have a negative effect on collaboration and communication among co-workers. Multicultural workplaces have been reported as challenging environments in the engineering work culture, which calls for more research among engineering organizations. An everyday challenge for co-workers, especially in culturally diverse contexts, is handling interpersonal conflict. This perceived conflict among individuals can happen because of actual differences in tasks or relationships. Research demonstrates that task conflict at the group level has some positive effects on decision-making and innovation, while it has negative effects on employees’ work attitude and performance. However, relationship conflict at the individual level has only negative effects including frustration, tension, low job satisfaction, high employee turnover and low productivity. Outcomes of both task and relationship conflict at individual level can have long-term negative consequences like damaged organizational commitment. One of the most important sources of differences between individuals, which results in conflict, is their cultural backgrounds. First, this thesis suggests that in culturally diverse workplaces, people perceive more relationship conflict than task conflict. Second, this thesis examines interpersonal communication in culturally diverse work places. Communicating effectively in culturally diverse workplaces is crucial for today’s business. Culture has a large effect on the ways that people communicate with each other. Ineffective communication can escalate interpersonal conflict and cause frustration in the long term. Communication satisfaction, defined as enjoying the communication and feeling that the communication was appropriate and effective, has a positive effect on individuals’ psychological wellbeing. In a culturally diverse workplace, it is assumed that individuals feel less satisfied with their interpersonal communications because of their lack of knowledge about other cultures’ communication norms. To manage interpersonal interactions, many authors suggest that individuals need a specific capability, i.e., cultural intelligence (some studies use cultural competence, global intelligence or intercultural competence interchangeably). Some authors argue that cultures are synergic and convergent and the postmodernist definition of culture is just our dominant beliefs. However, other authors suggest that cultural intelligence is the strongest and most comprehensive competency for managing cross-cultural interactions, because various cultures differ so greatly at the micro level. This thesis argues that individuals with a high level of cultural intelligence perceive less interpersonal conflict and more satisfaction with their interpersonal communication. Third, this thesis also looks at individuals' perception of cultural diversity. It is suggested that level of cultural diversity plays a moderating role on all of the proposed relationships (effect of cultural intelligence on perception of relationship conflict/ communication satisfaction) This thesis examines the relationship among cultural diversity, cultural intelligence, interpersonal conflict and communication by surveying eleven companies in the oil and gas industry. The multicultural nature of companies within the oil and gas industry and the characteristics of engineering culture call for more in-depth research on interpersonal interactions. A total of 286 invitation emails were sent and 118 respondents replied to the survey, giving a 41.26 per cent response rate. All the respondents were engineers, engineering managers or practical technicians. The average age of the participants was 36.93 years and 58.82 per cent were male. Overall, 47.6 per cent of the respondents had at least a master’s degree. Totally, 42.85 per cent of the respondents were working in a country that was not their country of birth. The overall findings reveal that cultural diversity and cultural intelligence significantly influence interpersonal conflict and communication satisfaction. Further, this thesis also finds that cultural intelligence is an effective competency for dealing with the perception of interpersonal relationship conflict and communication satisfaction when the level of cultural diversity is moderate to high. This thesis suggests that cultural intelligence training is necessary to increase the level of this competency among employees in order to help them to have better understanding of other cultures. Human resource management can design these training courses with consideration for the level of cultural diversity within the organization.
Resumo:
This paper introduces an improved line tracker using IMU and vision data for visual servoing tasks. We utilize an Image Jacobian which describes motion of a line feature to corresponding camera movements. These camera motions are estimated using an IMU. We demonstrate impacts of the proposed method in challenging environments: maximum angular rate ~160 0/s, acceleration ~6m /s2 and in cluttered outdoor scenes. Simulation and quantitative tracking performance comparison with the Visual Servoing Platform (ViSP) are also presented.
Resumo:
This paper presents Sequence Matching Across Route Traversals (SMART); a generally applicable sequence-based place recognition algorithm. SMART provides invariance to changes in illumination and vehicle speed while also providing moderate pose invariance and robustness to environmental aliasing. We evaluate SMART on vehicles travelling at highly variable speeds in two challenging environments; firstly, on an all-terrain vehicle in an off-road, forest track and secondly, using a passenger car traversing an urban environment across day and night. We provide comparative results to the current state-of-the-art SeqSLAM algorithm and investigate the effects of altering SMART’s image matching parameters. Additionally, we conduct an extensive study of the relationship between image sequence length and SMART’s matching performance. Our results show viable place recognition performance in both environments with short 10-metre sequences, and up to 96% recall at 100% precision across extreme day-night cycles when longer image sequences are used.
Resumo:
The ability to build high-fidelity 3D representations of the environment from sensor data is critical for autonomous robots. Multi-sensor data fusion allows for more complete and accurate representations. Furthermore, using distinct sensing modalities (i.e. sensors using a different physical process and/or operating at different electromagnetic frequencies) usually leads to more reliable perception, especially in challenging environments, as modalities may complement each other. However, they may react differently to certain materials or environmental conditions, leading to catastrophic fusion. In this paper, we propose a new method to reliably fuse data from multiple sensing modalities, including in situations where they detect different targets. We first compute distinct continuous surface representations for each sensing modality, with uncertainty, using Gaussian Process Implicit Surfaces (GPIS). Second, we perform a local consistency test between these representations, to separate consistent data (i.e. data corresponding to the detection of the same target by the sensors) from inconsistent data. The consistent data can then be fused together, using another GPIS process, and the rest of the data can be combined as appropriate. The approach is first validated using synthetic data. We then demonstrate its benefit using a mobile robot, equipped with a laser scanner and a radar, which operates in an outdoor environment in the presence of large clouds of airborne dust and smoke.
Resumo:
A vital element to improve outcomes for disadvantaged students is outstanding teachers. A reality, however, is that teacher graduates in the top quartile of academic scores are far less likely to accept positions in tough urban, regional, rural and remote schools. Further, because high poverty schools can be challenging environments, these teachers are retained for much shorter periods of time. In response to this challenge, the National Exceptional Teachers for Disadvantaged Schools program (NETDS) creates a pathway for the highest quality pre-service teachers to be fully prepared, professionally and personally, for roles within high poverty schools. The program identifies the highest-achieving mainstream preservice teachers in university programs across the country and offers them a specialised curriculum and supported practicum experience in a network of disadvantaged partner schools. By working closely with government, philanthropy and partner schools, the program also works to channel these exceptional pre-service teachers into employment in schools where they will have the greatest impact. Its initial results have been exceptional: over 90% of graduates are now employed as teachers in high poverty schools. This paper will discuss their research on how they are working to build the infrastructure and capacity for research on innovations that prepare teachers for 21st century schools in the Australian context.
Resumo:
This poster presents the results of a critical review of the literature on the intersection between paramedic practice with Autism Spectrum Disorder (ASD) and previews the clinical and communication challenges likely to be experienced with these patients. Paramedics in Australia provide 24/7 out-of-hospital care to the community. Although their core business is to provide emergency care, paramedics also provide care for vulnerable people as a consequence of the social, economic or domestic milieu. Little is known about the frequency of use of emergency out-of-hospital services by children with ASD and their families. Similarly, little is known about the attitudes and perceptions of paramedics to children with ASD and their emergency health care. However, individuals with ASD are likely to require paramedic services at some point across the life span and may be more frequent users of health services as a consequence of the challenges they face. The high rate of co-morbidities of people diagnosed with ASD is reported and includes seizure disorders, gastro-intestinal disorders, metabolic disorders, hormonal dysfunction, ear, nose and throat infections, hearing impairment, hypertension, allergies/anaphylaxis, immune disorders, migraine and diabetes, gross/fine motor skill dysfunction, premature birth, birth defects, obesity and mental illness. Individuals with ASD may frequently experience concurrent communication, behaviour and sensory challenges. Consequently, Paramedics can encounter difficulties gathering important patient information which may compromise sensitive care. These interactions occur often in high pressure and emotionally challenging environments, which add to the difficulties in communicating the treatment and transport needs of this population.