500 results for Optical music recognition
Abstract:
Probabilistic robotics, most often applied to the problem of simultaneous localisation and mapping (SLAM), requires measures of uncertainty to accompany observations of the environment. This paper describes how uncertainty can be characterised for a vision system that locates coloured landmarks in a typical laboratory environment. The paper describes a model of the uncertainty in segmentation, the internal camera model and the mounting of the camera on the robot. It explains the implementation of the system on a laboratory robot, and provides experimental results that show the coherence of the uncertainty model.
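As a rough illustration of how pixel-level uncertainty can be carried through a camera model, the sketch below propagates the standard deviation of a landmark's image column to a bearing angle using a first-order (linearised) approximation. It assumes an ideal pinhole camera; the function name and the parameters `cx`, `focal_px` and `sigma_u` are hypothetical and are not taken from the paper, whose actual model also covers segmentation and camera mounting.

```python
import math

def bearing_and_sigma(u, cx, focal_px, sigma_u):
    """Bearing to a landmark and its first-order standard deviation.

    Assumes an ideal pinhole camera (an assumption of this sketch):
    u        -- image column of the landmark, in pixels
    cx       -- principal point column, in pixels
    focal_px -- focal length, in pixels
    sigma_u  -- standard deviation of u, in pixels
    """
    x = u - cx
    bearing = math.atan2(x, focal_px)
    # d(bearing)/du = focal_px / (focal_px^2 + x^2), so the pixel
    # uncertainty is scaled by that derivative.
    sigma_bearing = sigma_u * focal_px / (focal_px ** 2 + x ** 2)
    return bearing, sigma_bearing

bearing, sigma = bearing_and_sigma(u=420.0, cx=320.0, focal_px=500.0, sigma_u=2.0)
print(f"bearing = {bearing:.4f} rad, sigma = {sigma:.5f} rad")
```

The same pixel error produces a smaller angular error towards the image edges, because the derivative of the arctangent shrinks as the offset from the principal point grows.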
Abstract:
The relationship between the quality of parent-child interactions and positive child developmental trajectories is well established (Guralnick, 2006; Shonkoff & Meissels, 2000; Zubrick et al., 2008). However, a range of parental, family, and socio-economic factors can pose risks to parents’ capacity to participate in quality interactions with their children. In particular, families with a child with a disability have been found to have higher levels of parenting stress, and are more likely to experience economic disadvantage, as well as social isolation. The importance of early interventions to promote positive parenting and child development for these families is widely recognised (Shonkoff & Meissels, 2000). However, to date, there is a lack of evidence about the effectiveness of early parenting programs for families who have a young child with a disability. This thesis investigates the impact of a music therapy parenting program, Sing & Grow, on 201 parent-child dyads who attended programs specifically targeted to parents who had a young child with a disability. Sing & Grow is an Australian national early parenting intervention funded by the Australian Government Department of Families, Housing, Community Services and Indigenous Affairs and delivered by Playgroup Queensland. It is designed and delivered by Registered Music Therapists for families with children aged from birth to three years. It aims to improve parenting skills and confidence, improve family functioning (positive parent-child interactions), enhance child development, and provide social networking opportunities to socially isolated families. The intervention targets a range of families in circumstances that have the potential to impact negatively on family functioning. This thesis uses data from the National Evaluation Study of Sing & Grow from programs which were targeted at families who had a young child with a disability. Three studies were conducted to address the objectives of this thesis. 
Study 1 examines the effects of the Sing & Grow intervention on parent reported pre and post parent mental health, parenting confidence, parenting skills, and child development, and other parent reported outcomes including social support, use of intervention resources, satisfaction with the intervention and perceived benefits of and barriers to participation. Significant improvements from pre to post were found for parent mental health and parent reported child communication and social skills, along with evidence that parents were very satisfied with the program and that it brought social benefits to families. Study 2 explored the pre to post effects of the intervention on children’s developmental skills and parent-child interactions using observational ratings made by clinicians. Significant pre to post improvements were found for parenting sensitivity, parental engagement with child and acceptance of child as well as for child responsiveness to parent, interest, and participation in the intervention, and social skills. Study 3 examined the nature of child and family characteristics that predicted better outcomes for families while taking account of the level of participation in the program. An overall outcome index was calculated and served as the dependent variable in a logistic regression analysis. Families who attended six or more sessions and mothers who had not completed high school were more likely to have higher outcome scores at post intervention than those who attended fewer sessions and those with more educated mothers respectively. The findings of this research indicate that the intervention had a positive impact on participants’ mental health, parenting behaviours and child development and that level of attendance was associated with better outcomes. There was also evidence that the program reached its target of high risk families (i.e., families in which mothers had lower educational levels) and that for these families better outcomes were achieved. 
There were also indications that the program was accessible and highly regarded by families and that it promoted social connections for participants. A theoretical model of how the intervention is currently working for families is proposed to explain the connections between early parenting, child development and maternal wellbeing. However, more research is required to further elucidate the mechanisms by which the intervention creates change for families. This research presents promising evidence that a short term group music therapy program can elicit important therapeutic benefits for families who have a child with a disability.
Abstract:
In this paper we propose a new method for utilising phase information by complementing it with traditional magnitude-only spectral subtraction speech enhancement through Complex Spectrum Subtraction (CSS). The proposed approach has the following advantages over traditional magnitude-only spectral subtraction: (a) it introduces complementary information to the enhancement algorithm; (b) it reduces the total number of algorithmic parameters; and (c) it is designed for improving clean speech magnitude spectra and is therefore suitable for both automatic speech recognition (ASR) and speech perception applications. Oracle-based ASR experiments verify this approach, showing an average relative word accuracy improvement of 20% when accurate estimates of the phase spectrum are available. Based on sinusoidal analysis and assuming stationarity between observations (which is shown to be better approximated as the frame rate is increased), this paper also proposes a novel method for acquiring the phase information called Phase Estimation via Delay Projection (PEDEP). Further oracle ASR experiments validate the potential of the proposed PEDEP technique in ideal conditions. A realistic implementation of CSS with PEDEP shows performance comparable to state-of-the-art spectral subtraction techniques in signal-to-noise ratio environments of 15-20 dB. These results clearly demonstrate the potential for using phase spectra in spectral subtractive enhancement applications, and at the same time highlight the need for deriving more accurate phase estimates in a wider range of noise conditions.
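The difference between magnitude-only subtraction and subtraction in the complex domain can be seen on a single frequency bin. The sketch below is only an illustration under an oracle assumption (the noise magnitude and phase are known exactly); the function names are hypothetical, and it is not the authors' CSS or PEDEP implementation.

```python
import cmath

def magnitude_subtraction(noisy, noise_mag_est):
    """Magnitude-only subtraction: subtract magnitudes, reuse the noisy phase."""
    mag = max(abs(noisy) - noise_mag_est, 0.0)  # floor at zero
    return cmath.rect(mag, cmath.phase(noisy))

def complex_subtraction(noisy, noise_mag_est, noise_phase_est):
    """Subtract a complex noise estimate built from magnitude and phase."""
    return noisy - cmath.rect(noise_mag_est, noise_phase_est)

# One frequency bin: clean speech plus additive noise (illustrative values).
clean = cmath.rect(1.0, 0.3)
noise = cmath.rect(0.5, 2.0)
noisy = clean + noise

mag_only = magnitude_subtraction(noisy, abs(noise))
with_phase = complex_subtraction(noisy, abs(noise), cmath.phase(noise))

print("error, magnitude only:", abs(mag_only - clean))
print("error, with oracle phase:", abs(with_phase - clean))
```

With an exact noise phase the complex subtraction recovers the clean bin, whereas magnitude-only subtraction leaves an error because the clean and noise components are not in phase. In practice the phase must be estimated, which is where estimation error enters.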
Abstract:
This doctoral thesis comprises three distinct yet related projects which investigate interdisciplinary practice across music collaboration, mime performance, and corporate communication. Both the processes and the underpinning research of these projects explore, expose and exploit areas where disparate and apparently conflicting fields of professional practice successfully and effectively intersect, interact, and inform each other, rather than conflict, thereby enhancing each, both individually and collectively. Informed by three decades of professional practice across music, stage performance, television, corporate communication, design, and tertiary education, the three projects have produced innovative, creative, and commercially viable outcomes, manifest in a variety of media including music, written text, digital, audio/visual, and internet. In exploring new practice and creating new knowledge, these project outcomes clearly demonstrate the value and effectiveness of reconciling disparate fields of practice through the application of interdisciplinary creativity and innovation to professional practice.
Abstract:
Uncooperative iris identification systems at a distance and on the move often suffer from poor resolution and poor focus of the captured iris images. The lack of pixel resolution and well-focused images significantly degrades iris recognition performance. This paper proposes a new approach that incorporates the focus score into a reconstruction-based super-resolution process to generate a high-resolution iris image from a low-resolution and focus-inconsistent video sequence of an eye. A reconstruction-based technique, which can incorporate middle and high frequency components from multiple low-resolution frames into one desired super-resolved frame without introducing false high frequency components, is used. A new focus assessment approach is proposed for uncooperative iris at a distance and on the move to improve performance for variations in lighting, size and occlusion. A novel fusion scheme is then proposed to incorporate the proposed focus score into the super-resolution process. The experiments conducted on the Multiple Biometric Grand Challenge portal database show that our proposed approach achieves an EER of 2.1%, outperforming the existing state-of-the-art averaging signal-level fusion approach by 19.2% and the robust mean super-resolution approach by 8.7%.
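For readers unfamiliar with the equal error rate (EER) quoted above, the sketch below shows one simple way to estimate it from two lists of match scores. It assumes similarity scores (higher means a better match) and uses made-up values; iris systems often use distance scores instead, in which case the comparisons are reversed. The function name and data are hypothetical and unrelated to the paper's experiments.

```python
def equal_error_rate(genuine, impostor):
    """Estimate the EER by sweeping a threshold over all observed scores.

    A comparison is accepted when its score is >= threshold.
    FAR = fraction of impostor scores accepted,
    FRR = fraction of genuine scores rejected.
    """
    best_gap, eer = None, None
    for threshold in sorted(set(genuine) | set(impostor)):
        far = sum(s >= threshold for s in impostor) / len(impostor)
        frr = sum(s < threshold for s in genuine) / len(genuine)
        gap = abs(far - frr)
        if best_gap is None or gap < best_gap:
            best_gap, eer = gap, (far + frr) / 2
    return eer

# Illustrative scores only.
genuine_scores = [0.9, 0.8, 0.7, 0.4]
impostor_scores = [0.1, 0.2, 0.3, 0.6]
eer = equal_error_rate(genuine_scores, impostor_scores)
print(f"EER = {eer:.2%}")
```

The EER is the operating point at which false accepts and false rejects are equally frequent, so a lower value indicates better separation between genuine and impostor comparisons.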
Abstract:
Purpose: To investigate the short-term influence of imposed monocular defocus upon human optical axial length (the distance from anterior cornea to retinal pigment epithelium) and ocular biometrics. Methods: Twenty-eight young adult subjects (14 myopes and 14 emmetropes) had eye biometrics measured before and then 30 and 60 minutes after exposure to monocular (right eye) defocus. Four different monocular defocus conditions were tested, each on a separate day: control (no defocus), myopic (+3 D defocus), hyperopic (-3 D defocus) and diffuse (0.2 density Bangerter filter) defocus. The fellow eye was optimally corrected (no defocus). Results: Imposed defocus caused small but significant changes in optical axial length (p<0.0001). A significant increase in optical axial length (mean change +8 ± 14 μm, p=0.03) occurred following hyperopic defocus, and a significant reduction in optical axial length (mean change -13 ± 14 μm, p=0.0001) was found following myopic defocus. A small increase in optical axial length was observed following diffuse defocus (mean change +6 ± 13 μm, p=0.053). Choroidal thickness also exhibited some significant changes with certain defocus conditions. No significant difference was found between myopes and emmetropes in the changes in optical axial length or choroidal thickness with defocus. Conclusions: Significant changes in optical axial length occur in human subjects following 60 minutes of monocular defocus. The bi-directional optical axial length changes observed in response to defocus imply that the human visual system is capable of detecting the presence and sign of defocus and altering optical axial length to move the retina towards the image plane.
Abstract:
This paper explores models of teaching and learning music composition in higher education. It analyses the pedagogical approaches apparent in the literature on teaching and learning composition in schools and universities, and introduces a teaching model as: learning from the masters; mastery of techniques; exploring ideas; and developing voice. It then presents a learning model developed from a qualitative study into students’ experiences of learning composition at university as: craft, process and art. The relationship between the students’ experiences and the pedagogical model is examined. Finally, the implications for composition curricula in higher education are presented.
Abstract:
This paper raises some questions about teaching and teacher education in the social sciences in response to the decision to implement a national curriculum in Australia. In particular, it contends that the decision to focus on discipline-specific knowledge in the social sciences will not necessarily meet the hopes of the Melbourne Declaration and deliver a 21st century curriculum that prepares students for the future. In doing so, it suggests that social educators need to engage with the broader discourse and political context shaping the push for curriculum reform in Australia and makes reference to the marginalisation of civics and citizenship education in the latest draft of the Australian curriculum: History.
Abstract:
Voice recognition is one of the key enablers to reduce driver distraction as in-vehicle systems become more and more complex. With the integration of voice recognition in vehicles, safety and usability are improved as the driver’s eyes and hands are not required to operate system controls. Whilst speaker-independent voice recognition is well developed, performance in high-noise environments (e.g. vehicles) is still limited. La Trobe University and Queensland University of Technology have developed a low-cost hardware-based speech enhancement system for automotive environments based on spectral subtraction and delay–sum beamforming techniques. The enhancement algorithms have been optimised using authentic Australian English collected under typical driving conditions. Performance tests conducted using speech data collected under a variety of vehicle noise conditions demonstrate a word recognition rate improvement in the order of 10% or more under the noisiest conditions. The system is currently developed to a proof-of-concept stage, and there is potential for even greater performance improvement.
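The delay–sum idea mentioned above can be sketched in a few lines: each microphone channel is time-aligned to the talker and the aligned channels are averaged, so that speech adds coherently while uncorrelated noise is attenuated. The sketch assumes known integer sample delays and a two-microphone toy signal; the function name and values are hypothetical and do not describe the hardware system in the abstract.

```python
def delay_and_sum(channels, delays):
    """Align each channel by its integer sample delay and average them.

    channels -- list of equal-rate sample lists, one per microphone
    delays   -- arrival delay of the source at each microphone, in samples
    """
    length = min(len(ch) - d for ch, d in zip(channels, delays))
    return [
        sum(ch[d + n] for ch, d in zip(channels, delays)) / len(channels)
        for n in range(length)
    ]

# Toy example: the same pulse reaches microphone B two samples later.
pulse = [0.0, 1.0, 0.5, -0.5, 0.0]
mic_a = pulse + [0.0, 0.0]
mic_b = [0.0, 0.0] + pulse

aligned = delay_and_sum([mic_a, mic_b], delays=[0, 2])
print(aligned)
```

A practical system would estimate the delays from the array geometry or by cross-correlation and would usually handle fractional delays, which this sketch omits.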
Abstract:
We investigated influences of optics and surround area on color appearance of defocused, small narrow band photopic lights (1’ arc diameter, λmax 510 - 628 nm) centered within a black annulus and surrounded by a white field. Participants included seven normal trichromats with L- or M-cone biased ratios. We controlled chromatic aberration with elements of a Powell achromatizing lens and corrected higher-order aberrations with an adaptive-optics system. Longitudinal chromatic aberrations, but not monochromatic aberrations, are involved in changing appearance of small lights with defocus. Surround field structure is important because color changes were not observed when lights were presented on a uniform white surround.
Abstract:
For some time we have jokingly referred to our network jamming research with jam2jam as ‘Switched on Orff’ (Brown, Sorensen and Dillon 2002; Dillon 2003; Dillon 2006; Dillon 2006; Brown and Dillon 2007). The connection with electronic music and Wendy Carlos’ classic work ‘Switched on Bach’ was obvious; we were using electronic music in schools and with children. The deeper connection with Orff, however, was about recognising that electronic music and instruments could have cultural values and knowledge embedded in their design and practice in the same way as what has come to be known as the Orff method (Orff and Keetman 1958-66). However, whilst the Orff method focuses upon a Western art music perceptual framework, electronic instruments have the potential to offer more fluid musical environments and even to move to interdisciplinary study by including visual media. Whilst the Orff method focused on making sense of Western art music through experience, electronic environments can potentially make sense of the world of multimedia that pervades our lives.
Abstract:
Music making affects relationships with self and others by generating a sense of belonging to a culture or ideology (Bamford, 2006; Barovick, 2001; Dillon & Stewart, 2006; Fiske, 2000; Hallam, 2001). Whilst studies from arts education research present compelling examples of these relationships, others argue that they do not present sufficiently validated evidence of a causal link between music making experiences and cognitive or social change (Winner & Cooper, 2000; Winner & Hetland, 2000a, 2000b, 2001). I have suggested elsewhere that this disconnection between compelling evidence and observations of the effects of music making is in part due to the lack of rigor in research and the incapacity of many methods to capture these experiences in meaningful ways (Dillon, 2006). Part of the answer to these questions about rigor and causality lies in the creative use of new media technologies that capture the results of relationships in music artefacts. Crucially, it is the effective management of these artefacts within computer systems that allows researchers and practitioners to collect, organize, analyse and then theorise such music making experiences.
Abstract:
Within a surveillance video, occlusions are commonplace, and accurately resolving these occlusions is key when seeking to accurately track objects. The challenge of accurately segmenting objects is further complicated by the fact that within many real-world surveillance environments, the objects appear very similar. For example, footage of pedestrians in a city environment will consist of many people wearing dark suits. In this paper, we propose a novel technique to segment groups and resolve occlusions using optical flow discontinuities. We demonstrate that the ratio of continuous to discontinuous pixels within a region can be used to locate the overlapping edges, and incorporate this into an object tracking framework. Results on a portion of the ETISEO database show that the proposed algorithm results in improved tracking performance overall, and improved tracking within occlusions.
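The ratio of continuous to discontinuous flow pixels can be illustrated on a toy flow field. The sketch below marks a pixel as discontinuous when its flow vector differs from its right or lower neighbour by more than a threshold, then computes the ratio inside a rectangular region. The neighbourhood, the threshold and the function names are assumptions made for illustration; the abstract does not specify how the authors define discontinuity.

```python
def discontinuity_mask(flow, threshold=1.0):
    """Mark pixels whose flow differs sharply from the right or lower neighbour.

    flow -- 2D list of (u, v) flow vectors, one per pixel
    """
    rows, cols = len(flow), len(flow[0])
    mask = [[False] * cols for _ in range(rows)]
    for r in range(rows):
        for c in range(cols):
            u, v = flow[r][c]
            for dr, dc in ((0, 1), (1, 0)):
                rr, cc = r + dr, c + dc
                if rr < rows and cc < cols:
                    nu, nv = flow[rr][cc]
                    if ((u - nu) ** 2 + (v - nv) ** 2) ** 0.5 > threshold:
                        mask[r][c] = True
    return mask

def continuity_ratio(mask, top, left, height, width):
    """Ratio of continuous to discontinuous pixels inside a rectangle."""
    cells = [mask[r][c]
             for r in range(top, top + height)
             for c in range(left, left + width)]
    discontinuous = sum(cells)
    continuous = len(cells) - discontinuous
    return continuous / discontinuous if discontinuous else float("inf")

# Toy scene: the left half moves right, the right half moves left.
flow = [[(2.0, 0.0)] * 3 + [(-2.0, 0.0)] * 3 for _ in range(4)]
mask = discontinuity_mask(flow)
print(continuity_ratio(mask, 0, 0, 4, 3))  # region containing the motion boundary
print(continuity_ratio(mask, 0, 3, 4, 3))  # region with uniform motion
```

A region that straddles the boundary between two differently moving objects yields a low ratio, while a region of uniform motion yields a high one, which is the cue the abstract describes for locating overlapping edges.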
Abstract:
Aim: This article reports the results of a study evaluating a preferred music listening intervention for reducing anxiety in older adults with dementia in nursing homes. Background: Anxiety can have a significant negative impact on older adults’ functional status, quality of life and health care resources. However, anxiety is often under-diagnosed and inappropriately treated in those with dementia. Little is known about the use of a preferred music listening intervention for managing anxiety in those with dementia. Design: A quasi-experimental pretest and posttest design was used. Methods: This study aimed to evaluate the effectiveness of a preferred music listening intervention on anxiety in older adults with dementia in nursing homes. Twenty-nine participants in the experimental group received a 30-minute music listening intervention based on personal preferences delivered by trained nursing staff in mid-afternoon, twice a week for six weeks. Meanwhile, 23 participants in the control group received only usual standard care with no music. Anxiety was measured by Rating Anxiety in Dementia at baseline and week six. Analysis of covariance (ANCOVA) was used to determine the effectiveness of a preferred music listening intervention on anxiety at six weeks while controlling for pretest anxiety, age and marital status. Results: ANCOVA results indicated that older adults who received the preferred music listening had a significantly lower anxiety score at six weeks compared with those who received the usual standard care with no music (F = 12.15, p = 0.001). Conclusions: Preferred music listening had a positive impact by reducing the level of anxiety in older adults with dementia. Relevance to clinical practice: Nursing staff can learn how to implement preferred music intervention to provide appropriate care tailored to the individual needs of older adults with dementia. Preferred music listening is an inexpensive and viable intervention to promote mental health of those with dementia.