910 resultados para Audio input
Resumo:
Visual information in the form of lip movements of the speaker has been shown to improve the performance of speech recognition and search applications. In our previous work, we proposed cross database training of synchronous hidden Markov models (SHMMs) to make use of external large and publicly available audio databases in addition to the relatively small given audio visual database. In this work, the cross database training approach is improved by performing an additional audio adaptation step, which enables audio visual SHMMs to benefit from audio observations of the external audio models before adding visual modality to them. The proposed approach outperforms the baseline cross database training approach in clean and noisy environments in terms of phone recognition accuracy as well as spoken term detection (STD) accuracy.
Resumo:
Speech recognition can be improved by using visual information in the form of lip movements of the speaker in addition to audio information. To date, state-of-the-art techniques for audio-visual speech recognition continue to use audio and visual data of the same database for training their models. In this paper, we present a new approach to make use of one modality of an external dataset in addition to a given audio-visual dataset. By so doing, it is possible to create more powerful models from other extensive audio-only databases and adapt them on our comparatively smaller multi-stream databases. Results show that the presented approach outperforms the widely adopted synchronous hidden Markov models (HMM) trained jointly on audio and visual data of a given audio-visual database for phone recognition by 29% relative. It also outperforms the external audio models trained on extensive external audio datasets and also internal audio models by 5.5% and 46% relative respectively. We also show that the proposed approach is beneficial in noisy environments where the audio source is affected by the environmental noise.
Resumo:
Automated digital recordings are useful for large-scale temporal and spatial environmental monitoring. An important research effort has been the automated classification of calling bird species. In this paper we examine a related task, retrieval of birdcalls from a database of audio recordings, similar to a user supplied query call. Such a retrieval task can sometimes be more useful than an automated classifier. We compare three approaches to similarity-based birdcall retrieval using spectral ridge features and two kinds of gradient features, structure tensor and the histogram of oriented gradients. The retrieval accuracy of our spectral ridge method is 94% compared to 82% for the structure tensor method and 90% for the histogram of gradients method. Additionally, this approach potentially offers a more compact representation and is more computationally efficient.
Resumo:
The use of organophosphate esters (PFRs) as flame retardants and plasticizers has increased due to the ban of some brominated flame retardants. There is however some concern regarding the toxicity, particularly carcinogenicity and neurotoxicity, of some of the PFRs. In this study we applied wastewater analysis to assess use of PFRs by the Australian population. Influent samples were collected from eleven wastewater treatment plants (STPs) in Australia on Census day and analysed for PFRs using gas chromatography coupled with mass spectrometry (GC-MS). Per capita mass loads of PFRs were calculated using the accurate Census head counts. The results indicate that tris(2-butoxyethyl) phosphate (TBOEP) has the highest per capita input into wastewater followed by tris(2-chloroisopropyl) phosphate (TCIPP), tris(isobutyl) phosphate (TIBP), tris(2-chloroethyl) phosphate (TCEP) and tris(1,3-dichloroisopropyl) phosphate (TDCIPP). Similar PFR profiles were observed across the Australian STPs and a comparison with European and U.S. STPs indicated similar PFR concentrations. We estimate that approximately 2.1 mg person−1 day−1 of PFRs are input into Australian wastewater which equates to 16 tonnes per annum.
Resumo:
Analogue and digital techniques for linearization of non-linear input-output relationship of transducers are briefly reviewed. The condition required for linearizing a non-linear function y = f(x) using a non-linear analogue-to-digital converter, is explained. A simple technique to construct a non-linear digital-to-analogue converter, based on ' segments of equal digital interval ' is described. The technique was used to build an N-DAC which can be employed in a successive approximation or counter-ramp type ADC to linearize the non-linear transfer function of a thermistor-resistor combination. The possibility of achieving an order of magnitude higher accuracy in the measurement of temperature is shown.
Resumo:
This paper is concerned with the development of an algorithm for pole placement in multi-input dynamic systems. The algorithm which uses a series of elementary transformations is believed to be simpler, computationally more efficient and numerically stable when compared with earlier methods. In this paper two methods have been presented.
Resumo:
The potential of beef producers to profitably produce 500-kg steers at 2.5 years of age in northern Australia's dry tropics to meet specifications of high-value markets, using a high-input management (HIM) system was examined. HIM included targeted high levels of fortified molasses supplementation, short seasonal mating and the use of growth promotants. Using herds of 300-400 females plus steer progeny at three sites, HIM was compared at a business level to prevailing best-practice, strategic low-input management (SLIM) in which there is a relatively low usage of energy concentrates to supplement pasture intake. The data presented for each breeding-age cohort within management system at each site includes: annual pregnancy rates (range: 14-99%), time of conception, mortalities (range: 0-10%), progeny losses between confirmed pregnancy and weaning (range: 0-29%), and weaning rates (range: 14-92%) over the 2-year observation. Annual changes in weight and relative net worth were calculated for all breeding and non-breeding cohorts. Reasons for outcomes are discussed. Compared with SLIM herds, both weaning weights and annual growth were >= 30 kg higher, enabling 86-100% of HIM steers to exceed 500 kg at 2.5 years of age. Very few contemporary SLIM steers reached this target. HIM was most profitably applied to steers. Where HIM was able to achieve high pregnancy rates in yearlings, its application was recommended in females. Well managed, appropriate HIM systems increased profits by around $15/adult equivalent at prevailing beef and supplement prices. However, a 20% supplement price rise without a commensurate increase in values for young slaughter steers would generally eliminate this advantage. This study demonstrated the complexity of pro. table application of research outcomes to commercial business, even when component research suggests that specific strategies may increase growth and reproductive efficiency and/or be more pro. table. Because of the higher level of management required, higher costs and returns, and higher susceptibility to market changes and disease, HIM systems should only be applied after SLIM systems are well developed. To increase profitability, any strategy must ultimately either increase steer growth and sale values and/or enable a shift to high pregnancy rates in yearling heifers.
Resumo:
This paper is concerned with the development of an algorithm for pole placement in multi-input dynamic systems. The algorithm which uses a series of elementary transformations is believed to be simpler, computationally more efficient and numerically stable when compared with earlier methods. In this paper two methods have been presented.
Resumo:
Acoustic recordings play an increasingly important role in monitoring terrestrial and aquatic environments. However, rapid advances in technology make it possible to accumulate thousands of hours of recordings, more than ecologists can ever listen to. Our approach to this big-data challenge is to visualize the content of long-duration audio recordings on multiple scales, from minutes, hours, days to years. The visualization should facilitate navigation and yield ecologically meaningful information prior to listening to the audio. To construct images, we calculate acoustic indices, statistics that describe the distribution of acoustic energy and reflect content of ecological interest. We combine various indices to produce false-color spectrogram images that reveal acoustic content and facilitate navigation. The technical challenge we investigate in this work is how to navigate recordings that are days or even months in duration. We introduce a method of zooming through multiple temporal scales, analogous to Google Maps. However, the “landscape” to be navigated is not geographical and not therefore intrinsically visual, but rather a graphical representation of the underlying audio. We describe solutions to navigating spectrograms that range over three orders of magnitude of temporal scale. We make three sets of observations: 1. We determine that at least ten intermediate scale steps are required to zoom over three orders of magnitude of temporal scale; 2. We determine that three different visual representations are required to cover the range of temporal scales; 3. We present a solution to the problem of maintaining visual continuity when stepping between different visual representations. Finally, we demonstrate the utility of the approach with four case studies.
Enhancing economic input to the CQSS2 Project report. Commissioned by the Fitzroy Basin Association.
Resumo:
The Fitzroy Basin is the second largest catchment area in Australia covering 143,00 km² and is the largest catchment for the Great Barrier Reef lagoon (Karfs et al., 2009). The Great Barrier Reef is the largest reef system in the world; it covers an area of approximately 225,000 km² in the northern Queensland continental shelf. There are approximately 750 reefs that exist within 40 km of the Queensland Coast (Haynes et al., 2007). The prime determinant for the changes in water quality have been attributed to grazing, with beef production the largest single land use industry comprising 90% of the land area (Karfs et al., 2009). In response to the depletion of water quality in the reef, in 2003 a Reef Water Quality plan was developed by the Australian and Queensland governments. The plan targets as a priority sediment contributions from grazing cattle in high risk catchments (The State of Queensland and Commonwealth of Australia, 2003). The economic incentive strategy designed includes analysing the costs and benefits of best management practice that will lead to improved water quality (The State of Queensland and Commonwealth of Australia, 2003).
Resumo:
Background Australian policy mandates consumer and carer participation in mental health services at all levels including research. Inspired by a UK model - Service Users Group Advising on Research [SUGAR] - we conducted a scoping project in 2013 with a view to create a consumer and carer led research process that moves beyond stigma and tokenism, that values the unique knowledge of lived experience and leads to people being treated better when accessing services. This poster presents the initial findings. Aims The project’s purpose was to explore with consumers, consumer companions and carers at the Metro North Mental Health-RBWH their interest in and views about research partnerships with academic and clinical colleagues. Methods This poster overviews the initial findings from three audio-recorded focus groups conducted with a total of 14 consumers, carers and consumer companions at the Brisbane site. Analysis Our work was guided by framework analysis (Gale et al. 2013). It defines 5 steps for analysing narrative data: familiarising; development of categories; indexing; charting and interpretation. Eight main ideas were initially developed and were divided between the authors to further index. This process identified 37 related analytic ideas. The authors integrated these by combining, removing and redefining them by consensus though a mapping process. The final step is the return of the analysis to the participants for feedback and input into the interpretation of the focus group discussions. Results 1. Value & Respect: Feeling Valued & Respected, Tokenism, Stigma, Governance, Valuing prior knowledge / background 2. Pathways to Knowledge and Involvement in Research: ‘Where to begin’, Support, Unity & partnership, Communication, Co-ordination, Flexibility due to fluctuating capacity 3. Personal Context: Barriers regarding Commitments & the nature of mental illness, Wellbeing needs, Prior experience of research, Motivators, Attributes 4. What is research? Developing Knowledge, What to do research on, how and why? Conclusion and Discussion Initial analysis suggests that participants saw potential for ‘amazing things’ in mental health research such as reflecting their priorities and moving beyond stigma and tokenism. The main needs identified were education, mentoring, funding support and research processes that fitted consumers’ and carers’limitations and fluctuating capacities. They identified maintaining motivation and interest as an issue since research processes are often extended by ethics and funding applications. Participants felt that consumer and carer led research would value the unique knowledge that the lived experience of consumers and carers brings and lead to people being treated better when accessing services.
Resumo:
In this paper the response of a gyrostabilized platform subjected to a transient torque has been analyzed by deliberately introducing non-linearity into the command of the servomotor. The resulting third-order non-linear differential equation has been solved by using a transformation technique involving the displacement variable. The condition under which platform oscillations may grow with time or die with time are important from the point of view of platform stabilization. The effect of deliberate addition of non-linearity with a view to achieving the ideal response—that is, to bring the platform back to its equilibrium position with as few oscillations as possible—has been investigated. The conditions under which instability may set in on account of the small transient input and small non-linearity has also been discussed. The analysis is illustrated by means of a numerical example. The results of analysis are compared with numerical solutions obtained on a digital computer.
Resumo:
The aim of this paper is to present results of research investigating the effectiveness of audio feedback in a third year undergraduate unit. While there is a large and growing body of literature about providing assessment feedback, there is little focussing on the use of audio media. This study employs a mixed method approach, involving semi-structured interviews with academic staff and a survey of students. Analysis of the interview data suggests that there are a number of issues surrounding acceptance of using audio feedback by lecturers. The next stage of the study is to examine the extent to which lecturers change their perceptions as they use audio feedback and to analyse the perceptions of the students (n=120), including the perceived importance of feedback, the ways in which they used the audio feedback and the extent to which they believe they control events that affect them. Ultimately, this study seeks to provide recommendations appropriate to the implementation of audio feedback in higher education.
Resumo:
Providing audio feedback to assessment is relatively uncommon in higher education. However, published research suggests that it is preferred over written feedback by students but lecturers were less convinced. The aim of this paper is to examine further these findings in the context of a third year business ethics unit. Data was collected from two sources. The first is a series of in-depth, semi-structured interviews conducted with three lecturers providing audio feeback for the first time in Semester One 2011. The second source of data was drawn from the university student evaluation system. A total of 363 responses were used providing 'before' and 'after' perspectives about the effectiveness of audio feedback versus written feedback. Between 2005 and 2009 the survey data provided information about student attitudes to written assessment feedback (n=261). From 2010 onwards the data relates to audio (mp3) feedback (n=102). The analysis of he interview data indicated that introducing audio feedback should be done with care. The perception of the participating lecturers was mixed, ranging from sceptism to outright enthusiasm, but over time the overall approach became positive. It was found that particular attention needs to be paid to small (but important) technical details, and lecturers need to be convinced of its effectieness, especially that it is not necessarily more time consuming than providing written feedback. For students, the analysis revealed a clear preference for audio feedback. It is concluded that there is cause for concern and reason for optimism. It is a cause for concern because there is a possibility that scepticism on the part of academic staff seems to be based on assumptions about what students prefer and a concern about using the technology. There is reason for optimism because the evidence points towards students preferring audio feedback and as academic staff become more familiar with the technology the scepticism tends to evaporate. While this study is limited in scope, questions are raised about tackling negative staff perceptions of audio feedback that are worthy of further research.
Resumo:
Tutkimuksen tavoitteena on tuottaa uutta tietoa Suomen kansantalouden rakenteesta ja lyhyen aikavälin kehityksestä 1920- ja 1930-luvulla. Tutkimus toteutettiin laatimalla kansantaloutta kuvaava panos-tuotostaulu vuodelle 1928 sekä sen laajennus, panos-tuotosmalli. Aineiston avulla kuvataan kansantalouden rakenteellisia riippuvuuksia, tuotannon avaintoimialoja sekä näiden vaikutusta kansantalouteen. Lisäksi tutkimuksessa tarkastellaan kansantalouden tuontiriippuvuutta sekä tuontitullien vaikutusta hintoihin 1930-luvun laman aikana. Tutkimuksen perusteella voitiin identifioida Suomen kansantalouden avaintoimialat vuonna 1928: maatalous, metsätalous, elintarviketeollisuus, puuteollisuus, paperiteollisuus ja rakennustoiminta. Erityisesti elintarviketeollisuuden vahva rooli kansantaloudessa oli kenties yllättävää, erityisesti kun huomioidaan kuinka vähän toimiala on saanut huomiota osakseen taloushistorian tutkimuksessa. Tutkimus osoitti, että Suomen vienti oli pääomavaltaisempaa kuin tuonti. Vaikka tämän tuloksen tulkinta on varauksellinen, tutkimus pystyi osoittamaan ja kvantifioimaan toimialojen työ- ja pääomapanoksen osuuden tuotoksesta yksityiskohtaisesti. Panos-tuotosmallilla arvioitiin puuteollisuuden, paperiteollisuuden ja rakennustoiminnan ajanjaksona 1928-32 tapahtuneen loppukäytön muutoksen vaikutusta kansantalouteen. Merkittävä havainto on, että rakennustoiminnan loppukäytön muutoksella oli erittäin suuri kasvua vähentävä vaikutus koko kansantaloudessa. Talonrakennusinvestointien romahtaminen aiheutti lähes 13 prosentin tuotannon laskun kansantaloudessa. Vaikutus oli jopa suurempi kuin puuteollisuuden viennin romahtamisen. Tulokset osoittavat toisaalta, että yksityisen kulutuksen merkitys kansantaloudelle oli erittäin vahva. Esimerkiksi puuteollisuuden viennin romahtaminen aiheutti yli 4 % tuotannon vähenemisen mutta huomioitaessa mallissa myös yksityisen kulutuksen väheneminen, oli kokonaisvaikutus yli 10 %. Yksityisen kulutuksen huomioiminen mallissa siis yli kaksinkertaisti toimialojen vaikutukset kansantalouteen. Tulokset vahvistivat aiemmissa tutkimuksissa esitettyjä johtopäätöksiä tullipolitiikasta ja osoittivat maatalouteen läheisesti liittyvän elintarviketeollisuuden olleen eniten suojeltu toimiala kansantaloudessa. Muut kotimarkkinoiden toimialat eivät kuitenkaan hyötyneet tullipolitiikasta lamakauden aikana. Panos-tuotoshintamallilla osoitettiin, ettei tullipolitiikka ollut niin onnistunutta kuin aikalaistutkimuksissa väitettiin, vaan tullit korkeintaan pystyivät hidastamaan hintojen alenemista. Tutkimuksen liitteenä esitetään kaikki keskeiset Suomen kansantaloutta vuonna 1928 kuvaavat tilastolliset taulukot, mukaan lukien käyttö- ja tarjontataulukot, panos-tuotostaulukot, panoskertoimet, Leontiefin käänteismatriisi sekä työ- ja pääomapanoskertoimet.