374 resultados para Optical character recognition
Resumo:
We propose the use of optical flow information as a method for detecting and describing changes in the environment, from the perspective of a mobile camera. We analyze the characteristics of the optical flow signal and demonstrate how robust flow vectors can be generated and used for the detection of depth discontinuities and appearance changes at key locations. To successfully achieve this task, a full discussion on camera positioning, distortion compensation, noise filtering, and parameter estimation is presented. We then extract statistical attributes from the flow signal to describe the location of the scene changes. We also employ clustering and dominant shape of vectors to increase the descriptiveness. Once a database of nodes (where a node is a detected scene change) and their corresponding flow features is created, matching can be performed whenever nodes are encountered, such that topological localization can be achieved. We retrieve the most likely node according to the Mahalanobis and Chi-square distances between the current frame and the database. The results illustrate the applicability of the technique for detecting and describing scene changes in diverse lighting conditions, considering indoor and outdoor environments and different robot platforms.
Resumo:
This paper presents an approach to mobile robot localization, place recognition and loop closure using a monostatic ultra-wide band (UWB) radar system. The UWB radar is a time-of-flight based range measurement sensor that transmits short pulses and receives reflected waves from objects in the environment. The main idea of the poposed localization method is to treat the received waveform as a signature of place. The resulting echo waveform is very complex and highly depends on the position of the sensor with respect to surrounding objects. On the other hand, the sensor receives similar waveforms from the same positions.Moreover, the directional characteristics of dipole antenna is almost omnidirectional. Therefore, we can localize the sensor position to find similar waveform from waveform database. This paper proposes a place recognitionmethod based on waveform matching, presents a number of experiments that illustrate the high positon estimation accuracy of our UWB radar-based localization system, and shows the resulting loop detection performance in a typical indoor office environment and a forest.
Resumo:
Nanoporous Nb2O5 has been previously demonstrated to be a viable electrochromic material with strong intercalation characteristics. Despite showing such promising properties, its potential for optical gas sensing applications, which involves the production of ionic species such as H+, has yet to be explored. Nanoporous Nb2O5 can accommodate a large amount of H+ ions in a process that results in an energy bandgap change of the material, which induces an optical response. Here, we demonstrate the optical hydrogen gas (H¬2) sensing capability of nanoporous anodic Nb2O5 with a large surface-to-volume ratio prepared via a high temperature anodization method. The large active surface area of the film provides enhanced pathways for efficient hydrogen adsorption and dissociation, which are facilitated by a thin layer of Pt catalyst. We show that the process of H2 sensing causes optical modulations that are investigated in terms of response magnitudes and dynamics. The optical modulations induced by the intercalation process and sensing properties of nanoporous anodic Nb2O5 shown in this work can potentially be used for future optical gas sensing systems.
Resumo:
Background Serum lutein (L) and zeaxanthin (Z) positively correlate with macular pigment optical density (MPOD), hence the latter is a valuable indirect tool for measuring L and Z content in the macula. L and Z have been attributed antioxidant capacity and protection from certain retinal diseases but their uptake within the eye is thought to depend on genetic, age and environmental factors. In particular gene variants within beta-carotene monooxygenase (BCMO1) are thought to modulate MPOD in the macula. Objectives: To determine the effect of BCMO1 single nucleotide polymorphisms (SNPs) rs11645428, rs6420424 and rs6464851 on macular pigment optical density (MPOD) in a cohort of young healthy participants of Caucasian origin with normal ocular health. Design In this cohort study, MPOD was assessed in 46 healthy participants (22 male and 24 female) with a mean age of 24 ± 4.0 years (range 19-33). The three SNPs, rs11645428, rs6420424, rs6564851 that have established associations with MPOD were determined using MassEXTEND (hME) Sequenom assay. One-way analysis of variance (ANOVA) was performed on groups segregated into homozygous and heterozygous BCMO1 genotypes. Correlations between body mass index (BMI), iris colour, gender, central retinal thickness (CRT), diet and MPOD were investigated. Results MPOD did not significantly vary with BCMO1 rs11645428 (F2,41 = 0.700, p = 0.503), rs6420424 (F2,41 = 0.210, p = 0.801) nor rs6464851 homozygous or heterozygous genotypes (F2,41 = 0,13, p = 0.88), in this young healthy cohort. The combination of these three SNPs into triple genotypes based on plasma conversion efficiency did not affect MPOD (F2,41 = 0.07, p = 0.9). There was a significant negative correlation with MPOD and central retinal thickness (r = - 0.39, p = 0.01) but no significant correlation between BMI, iris colour, gender and MPOD. Conclusion Our results indicate that macular pigment deposition within the central retina is not dependent on BCMO1 gene variants in young healthy people. We propose that MPOD is saturated in younger persons and/or other gene variant combinations determine its deposition.
Resumo:
Robustness to variations in environmental conditions and camera viewpoint is essential for long-term place recognition, navigation and SLAM. Existing systems typically solve either of these problems, but invariance to both remains a challenge. This paper presents a training-free approach to lateral viewpoint- and condition-invariant, vision-based place recognition. Our successive frame patch-tracking technique infers average scene depth along traverses and automatically rescales views of the same place at different depths to increase their similarity. We combine our system with the condition-invariant SMART algorithm and demonstrate place recognition between day and night, across entire 4-lane-plus-median-strip roads, where current algorithms fail.
Resumo:
This paper combines experimental data with simple mathematical models to investigate the influence of spray formulation type and leaf character (wettability) on shatter, bounce and adhesion of droplets impacting with cotton, rice and wheat leaves. Impaction criteria that allow for different angles of the leaf surface and the droplet impact trajectory are presented; their predictions are based on whether combinations of droplet size and velocity lie above or below bounce and shatter boundaries. In the experimental component, real leaves are used, with all their inherent natural variability. Further, commercial agricultural spray nozzles are employed, resulting in a range of droplet characteristics. Given this natural variability, there is broad agreement between the data and predictions. As predicted, the shatter of droplets was found to increase as droplet size and velocity increased, and the surface became harder to wet. Bouncing of droplets occurred most frequently on hard to wet surfaces with high surface tension mixtures. On the other hand, a number of small droplets with low impact velocity were observed to bounce when predicted to lie well within the adhering regime. We believe this discrepancy between the predictions and experimental data could be due to air layer effects that were not taken into account in the current bounce equations. Other discrepancies between experiment and theory are thought to be due to the current assumption of a dry impact surface, whereas, in practice, the leaf surfaces became increasingly covered with fluid throughout the spray test runs.
A LIN inspired optical bus for signal isolation in multilevel or modular power electronic converters
Resumo:
Proposed in this paper is a low-cost, half-duplex optical communication bus for control signal isolation in modular or multilevel power electronic converters. The concept is inspired by the Local Interconnect Network (LIN) serial network protocol as used in the automotive industry. The proposed communications bus utilises readily available optical transceivers and is suitable for use with low-cost microcontrollers for distributed control of multilevel converters. As a signal isolation concept, the proposed optical bus enables very high cell count modular multilevel cascaded converters (MMCCs) for high-bandwidth, high-voltage and high-power applications. Prototype hardware is developed and the optical bus concept is validated experimentally in a 33-level MMCC converter operating at 120 Vrms and 60 Hz.
Resumo:
The benefits for university graduates in growing skills and capabilities through volunteering experiences are gaining increased attention. Building leadership self-efficacy supports students develop their capacity for understanding, articulating and evidencing their learning. Reward and recognition is fundamental in the student’s journey to build self-efficacy. Through this research, concepts of reward and recognition have been explored and articulated through the experiences and perceptions of actively engaged student peer leaders. The research methodology has enabled a collaborative, student-centred approach in shaping an innovative Rewards Framework, which supports, recognises and rewards the learning journey from beginning peer leader to competent and confident graduate.
Resumo:
Place recognition has long been an incompletely solved problem in that all approaches involve significant compromises. Current methods address many but never all of the critical challenges of place recognition – viewpoint-invariance, condition-invariance and minimizing training requirements. Here we present an approach that adapts state-of-the-art object proposal techniques to identify potential landmarks within an image for place recognition. We use the astonishing power of convolutional neural network features to identify matching landmark proposals between images to perform place recognition over extreme appearance and viewpoint variations. Our system does not require any form of training, all components are generic enough to be used off-the-shelf. We present a range of challenging experiments in varied viewpoint and environmental conditions. We demonstrate superior performance to current state-of-the- art techniques. Furthermore, by building on existing and widely used recognition frameworks, this approach provides a highly compatible place recognition system with the potential for easy integration of other techniques such as object detection and semantic scene interpretation.
Resumo:
2,4,6-trinitrotoluene (TNT) is one of the most commonly used nitro aromatic explosives in landmine, military and mining industry. This article demonstrates rapid and selective identification of TNT by surface-enhanced Raman spectroscopy (SERS) using 6-aminohexanethiol (AHT) as a new recognition molecule. First, Meisenheimer complex formation between AHT and TNT is confirmed by the development of pink colour and appearance of new band around 500 nm in UV-visible spectrum. Solution Raman spectroscopy study also supported the AHT:TNT complex formation by demonstrating changes in the vibrational stretching of AHT molecule between 2800-3000 cm−1. For surface enhanced Raman spectroscopy analysis, a self-assembled monolayer (SAM) of AHT is formed over the gold nanostructure (AuNS) SERS substrate in order to selectively capture TNT onto the surface. Electrochemical desorption and X-ray photoelectron studies are performed over AHT SAM modified surface to examine the presence of free amine groups with appropriate orientation for complex formation. Further, AHT and butanethiol (BT) mixed monolayer system is explored to improve the AHT:TNT complex formation efficiency. Using a 9:1 AHT:BT mixed monolayer, a very low detection limit (LOD) of 100 fM TNT was realized. The new method delivers high selectivity towards TNT over 2,4 DNT and picric acid. Finally, real sample analysis is demonstrated by the extraction and SERS detection of 302 pM of TNT from spiked.
Resumo:
The QUT-NOISE-SRE protocol is designed to mix the large QUT-NOISE database, consisting of over 10 hours of back- ground noise, collected across 10 unique locations covering 5 common noise scenarios, with commonly used speaker recognition datasets such as Switchboard, Mixer and the speaker recognition evaluation (SRE) datasets provided by NIST. By allowing common, clean, speech corpora to be mixed with a wide variety of noise conditions, environmental reverberant responses, and signal-to-noise ratios, this protocol provides a solid basis for the development, evaluation and benchmarking of robust speaker recognition algorithms, and is freely available to download alongside the QUT-NOISE database. In this work, we use the QUT-NOISE-SRE protocol to evaluate a state-of-the-art PLDA i-vector speaker recognition system, demonstrating the importance of designing voice-activity-detection front-ends specifically for speaker recognition, rather than aiming for perfect coherence with the true speech/non-speech boundaries.
Resumo:
We used event-related fMRI to investigate the neural correlates of encoding strength and word frequency effects in recognition memory. At test, participants made Old/New decisions to intermixed low (LF) and high frequency (HF) words that had been presented once or twice at study and to new, unstudied words. The Old/New effect for all hits vs. correctly rejected unstudied words was associated with differential activity in multiple cortical regions, including the anterior medial temporal lobe (MTL), hippocampus, left lateral parietal cortex and anterior left inferior prefrontal cortex (LIPC). Items repeated at study had superior hit rates (HR) compared to items presented once and were associated with reduced activity in the right anterior MTL. By contrast, other regions that had shown conventional Old/New effects did not demonstrate modulation according to memory strength. A mirror effect for word frequency was demonstrated, with the LF word HR advantage associated with increased activity in the left lateral temporal cortex. However, none of the regions that had demonstrated Old/New item retrieval effects showed modulation according to word frequency. These findings are interpreted as supporting single-process memory models proposing a unitary strength-like memory signal and models attributing the LF word HR advantage to the greater lexico-semantic context-noise associated with HF words due to their being experienced in many pre-experimental contexts.
Resumo:
In the present study, items pre-exposed in a familiarization series were included in a list discrimination task to manipulate memory strength. At test, participants were required to discriminate strong targets and strong lures from weak targets and new lures. This resulted in a concordant pattern of increased "old" responses to strong targets and lures. Model estimates attributed this pattern to either equivalent increases in memory strength across the two types of items (unequal variance signal detection model) or equivalent increases in both familiarity and recollection (dual process signal detection [DPSD] model). Hippocampal activity associated with strong targets and lures showed equivalent increases compared with missed items. This remained the case when analyses were restricted to high-confidence responses considered by the DPSD model to reflect predominantly recollection. A similar pattern of activity was observed in parahippocampal cortex for high-confidence responses. The present results are incompatible with "noncriterial" or "false" recollection being reflected solely in inflated DPSD familiarity estimates and support a positive correlation between hippocampal activity and memory strength irrespective of the accuracy of list discrimination, consistent with the unequal variance signal detection model account.
Resumo:
Speech recognition can be improved by using visual information in the form of lip movements of the speaker in addition to audio information. To date, state-of-the-art techniques for audio-visual speech recognition continue to use audio and visual data of the same database for training their models. In this paper, we present a new approach to make use of one modality of an external dataset in addition to a given audio-visual dataset. By so doing, it is possible to create more powerful models from other extensive audio-only databases and adapt them on our comparatively smaller multi-stream databases. Results show that the presented approach outperforms the widely adopted synchronous hidden Markov models (HMM) trained jointly on audio and visual data of a given audio-visual database for phone recognition by 29% relative. It also outperforms the external audio models trained on extensive external audio datasets and also internal audio models by 5.5% and 46% relative respectively. We also show that the proposed approach is beneficial in noisy environments where the audio source is affected by the environmental noise.
Resumo:
Graphitic like layered materials exhibit intriguing electronic structures and thus the search for new types of two-dimensional (2D) monolayer materials is of great interest for developing novel nano-devices. By using density functional theory (DFT) method, here we for the first time investigate the structure, stability, electronic and optical properties of monolayer lead iodide (PbI2). The stability of PbI2 monolayer is first confirmed by phonon dispersion calculation. Compared to the calculation using generalized gradient approximation, screened hybrid functional and spin–orbit coupling effects can not only predicts an accurate bandgap (2.63 eV), but also the correct position of valence and conduction band edges. The biaxial strain can tune its bandgap size in a wide range from 1 eV to 3 eV, which can be understood by the strain induced uniformly change of electric field between Pb and I atomic layer. The calculated imaginary part of the dielectric function of 2D graphene/PbI2 van der Waals type hetero-structure shows significant red shift of absorption edge compared to that of a pure monolayer PbI2. Our findings highlight a new interesting 2D material with potential applications in nanoelectronics and optoelectronics.