36 resultados para Combine harvester
Resumo:
Manually inspecting bridges is a time-consuming and costly task. There are over 600,000 bridges in the US, and not all of them can be inspected and maintained within the specified time frame as some state DOTs cannot afford the essential costs and manpower. This paper presents a novel method that can detect bridge concrete columns from visual data for the purpose of eventually creating an automated bridge condition assessment system. The method employs SIFT feature detection and matching to find overlapping areas among images. Affine transformation matrices are then calculated to combine images containing different segments of one column into a single image. Following that, the bridge columns are detected by identifying the boundaries in the stitched image and classifying the material within each boundary. Preliminary test results using real bridge images indicate that most columns in stitched images can be correctly detected and thus, the viability of the application of this research.
Resumo:
State-of-the-art large vocabulary continuous speech recognition (LVCSR) systems often combine outputs from multiple subsystems developed at different sites. Cross system adaptation can be used as an alternative to direct hypothesis level combination schemes such as ROVER. The standard approach involves only cross adapting acoustic models. To fully exploit the complimentary features among sub-systems, language model (LM) cross adaptation techniques can be used. Previous research on multi-level n-gram LM cross adaptation is extended to further include the cross adaptation of neural network LMs in this paper. Using this improved LM cross adaptation framework, significant error rate gains of 4.0%-7.1% relative were obtained over acoustic model only cross adaptation when combining a range of Chinese LVCSR sub-systems used in the 2010 and 2011 DARPA GALE evaluations. Copyright © 2011 ISCA.
Resumo:
Language models (LMs) are often constructed by building multiple individual component models that are combined using context independent interpolation weights. By tuning these weights, using either perplexity or discriminative approaches, it is possible to adapt LMs to a particular task. This paper investigates the use of context dependent weighting in both interpolation and test-time adaptation of language models. Depending on the previous word contexts, a discrete history weighting function is used to adjust the contribution from each component model. As this dramatically increases the number of parameters to estimate, robust weight estimation schemes are required. Several approaches are described in this paper. The first approach is based on MAP estimation where interpolation weights of lower order contexts are used as smoothing priors. The second approach uses training data to ensure robust estimation of LM interpolation weights. This can also serve as a smoothing prior for MAP adaptation. A normalized perplexity metric is proposed to handle the bias of the standard perplexity criterion to corpus size. A range of schemes to combine weight information obtained from training data and test data hypotheses are also proposed to improve robustness during context dependent LM adaptation. In addition, a minimum Bayes' risk (MBR) based discriminative training scheme is also proposed. An efficient weighted finite state transducer (WFST) decoding algorithm for context dependent interpolation is also presented. The proposed technique was evaluated using a state-of-the-art Mandarin Chinese broadcast speech transcription task. Character error rate (CER) reductions up to 7.3 relative were obtained as well as consistent perplexity improvements. © 2012 Elsevier Ltd. All rights reserved.
Resumo:
We describe an RFID tag reading system for reading one or more RFID Tags, the system comprising an RF transmitter and an RF receiver, a plurality of transmit/receive antennas coupled to said RF transmitter and to said RF receiver, to provide spatial transmit/receive signal diversity, and a tag signal decoder coupled to at least said RF receiver, wherein said system is configured to combine received RF signals from said antennas to provide a combined received RF signal, wherein said RF receiver has said combined received RF signal as an input; wherein said antennas are spaced apart from one another sufficiently for one said antenna not to be within the near field of another said antenna, wherein said system is configured to perform a tag inventory cycle comprising a plurality of tag read rounds to read said tags, a said tag read round comprising transmission of one or more RF tag interrogation signals simultaneously from said plurality of antennas and receiving a signal from one or more of said tags, a said tag read round having a set of time slots during which a said tag is able to transmit tag data including a tag ID for reception by said antenna, and wherein said system is configured to perform, during a said tag inventory cycle, one or both of: a change in a frequency of said tag interrogation signals transmitted simultaneously from said plurality of antennas, and a change in a relative phase of a said RF tag interrogation signals transmitted from one of said antennas with respect to another of said antennas.
Resumo:
Mandarin Chinese is based on characters which are syllabic in nature and morphological in meaning. All spoken languages have syllabiotactic rules which govern the construction of syllables and their allowed sequences. These constraints are not as restrictive as those learned from word sequences, but they can provide additional useful linguistic information. Hence, it is possible to improve speech recognition performance by appropriately combining these two types of constraints. For the Chinese language considered in this paper, character level language models (LMs) can be used as a first level approximation to allowed syllable sequences. To test this idea, word and character level n-gram LMs were trained on 2.8 billion words (equivalent to 4.3 billion characters) of texts from a wide collection of text sources. Both hypothesis and model based combination techniques were investigated to combine word and character level LMs. Significant character error rate reductions up to 7.3% relative were obtained on a state-of-the-art Mandarin Chinese broadcast audio recognition task using an adapted history dependent multi-level LM that performs a log-linearly combination of character and word level LMs. This supports the hypothesis that character or syllable sequence models are useful for improving Mandarin speech recognition performance.
Resumo:
While searching for objects, we combine information from multiple visual modalities. Classical theories of visual search assume that features are processed independently prior to an integration stage. Based on this, one would predict that features that are equally discriminable in single feature search should remain so in conjunction search. We test this hypothesis by examining whether search accuracy in feature search predicts accuracy in conjunction search. Subjects searched for objects combining color and orientation or size; eye movements were recorded. Prior to the main experiment, we matched feature discriminability, making sure that in feature search, 70% of saccades were likely to go to the correct target stimulus. In contrast to this symmetric single feature discrimination performance, the conjunction search task showed an asymmetry in feature discrimination performance: In conjunction search, a similar percentage of saccades went to the correct color as in feature search but much less often to correct orientation or size. Therefore, accuracy in feature search is a good predictor of accuracy in conjunction search for color but not for size and orientation. We propose two explanations for the presence of such asymmetries in conjunction search: the use of conjunctively tuned channels and differential crowding effects for different features.
Resumo:
We study unsupervised learning in a probabilistic generative model for occlusion. The model uses two types of latent variables: one indicates which objects are present in the image, and the other how they are ordered in depth. This depth order then determines how the positions and appearances of the objects present, specified in the model parameters, combine to form the image. We show that the object parameters can be learnt from an unlabelled set of images in which objects occlude one another. Exact maximum-likelihood learning is intractable. However, we show that tractable approximations to Expectation Maximization (EM) can be found if the training images each contain only a small number of objects on average. In numerical experiments it is shown that these approximations recover the correct set of object parameters. Experiments on a novel version of the bars test using colored bars, and experiments on more realistic data, show that the algorithm performs well in extracting the generating causes. Experiments based on the standard bars benchmark test for object learning show that the algorithm performs well in comparison to other recent component extraction approaches. The model and the learning algorithm thus connect research on occlusion with the research field of multiple-causes component extraction methods.
Resumo:
State-of-the-art large vocabulary continuous speech recognition (LVCSR) systems often combine outputs from multiple sub-systems that may even be developed at different sites. Cross system adaptation, in which model adaptation is performed using the outputs from another sub-system, can be used as an alternative to hypothesis level combination schemes such as ROVER. Normally cross adaptation is only performed on the acoustic models. However, there are many other levels in LVCSR systems' modelling hierarchy where complimentary features may be exploited, for example, the sub-word and the word level, to further improve cross adaptation based system combination. It is thus interesting to also cross adapt language models (LMs) to capture these additional useful features. In this paper cross adaptation is applied to three forms of language models, a multi-level LM that models both syllable and word sequences, a word level neural network LM, and the linear combination of the two. Significant error rate reductions of 4.0-7.1% relative were obtained over ROVER and acoustic model only cross adaptation when combining a range of Chinese LVCSR sub-systems used in the 2010 and 2011 DARPA GALE evaluations. © 2012 Elsevier Ltd. All rights reserved.
Resumo:
Widespread approaches to fabricate surfaces with robust micro- and nanostructured topographies have been stimulated by opportunities to enhance interface performance by combining physical and chemical effects. In particular, arrays of asymmetric surface features, such as arrays of grooves, inclined pillars, and helical protrusions, have been shown to impart unique anisotropy in properties including wetting, adhesion, thermal and/or electrical conductivity, optical activity, and capability to direct cell growth. These properties are of wide interest for applications including energy conversion, microelectronics, chemical and biological sensing, and bioengineering. However, fabrication of asymmetric surface features often pushes the limits of traditional etching and deposition techniques, making it challenging to produce the desired surfaces in a scalable and cost-effective manner. We review and classify approaches to fabricate arrays of asymmetric 2D and 3D surface features, in polymers, metals, and ceramics. Analytical and empirical relationships among geometries, materials, and surface properties are discussed, especially in the context of the applications mentioned above. Further, opportunities for new fabrication methods that combine lithography with principles of self-assembly are identified, aiming to establish design principles for fabrication of arbitrary 3D surface textures over large areas. Copyright © 2012 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Resumo:
This paper derives a new algorithm that performs independent component analysis (ICA) by optimizing the contrast function of the RADICAL algorithm. The core idea of the proposed optimization method is to combine the global search of a good initial condition with a gradient-descent algorithm. This new ICA algorithm performs faster than the RADICAL algorithm (based on Jacobi rotations) while still preserving, and even enhancing, the strong robustness properties that result from its contrast. © Springer-Verlag Berlin Heidelberg 2007.
Resumo:
The task of word-level confidence estimation (CE) for automatic speech recognition (ASR) systems stands to benefit from the combination of suitably defined input features from multiple information sources. However, the information sources of interest may not necessarily operate at the same level of granularity as the underlying ASR system. The research described here builds on previous work on confidence estimation for ASR systems using features extracted from word-level recognition lattices, by incorporating information at the sub-word level. Furthermore, the use of Conditional Random Fields (CRFs) with hidden states is investigated as a technique to combine information for word-level CE. Performance improvements are shown using the sub-word-level information in linear-chain CRFs with appropriately engineered feature functions, as well as when applying the hidden-state CRF model at the word level.
Resumo:
In this book several streams of nonlinear control theory are merged and di- rected towards a constructive solution of the feedback stabilization problem. Analytic, geometric and asymptotic concepts are assembled as design tools for a wide variety of nonlinear phenomena and structures. Di®erential-geometric concepts reveal important structural properties of nonlinear systems, but al- low no margin for modeling errors. To overcome this de¯ciency, we combine them with analytic concepts of passivity, optimality and Lyapunov stability. In this way geometry serves as a guide for construction of design procedures, while analysis provides robustness tools which geometry lacks.
Resumo:
Graphene is emerging as a viable alternative to conventional optoelectronic, plasmonic and nanophotonic materials. The interaction of light with charge carriers creates an out-of-equilibrium distribution, which relaxes on an ultrafast timescale to a hot Fermi-Dirac distribution, that subsequently cools emitting phonons. Although the slower relaxation mechanisms have been extensively investigated, the initial stages still pose a challenge. Experimentally, they defy the resolution of most pump-probe setups, due to the extremely fast sub-100 fs carrier dynamics. Theoretically, massless Dirac fermions represent a novel many-body problem, fundamentally different from Schrödinger fermions. Here we combine pump-probe spectroscopy with a microscopic theory to investigate electron-electron interactions during the early stages of relaxation. We identify the mechanisms controlling the ultrafast dynamics, in particular the role of collinear scattering. This gives rise to Auger processes, including charge multiplication, which is key in photovoltage generation and photodetectors.
Resumo:
A series of fluid-structure interaction simulations of an aerodynamic tension-cone supersonic decelerator prototype intended for large mass payload deployment in planetary explorations are discussed. The fluid-structure interaction computations combine large deformation analysis of thin shells with large-eddy simulation of compressible turbulent flows using a loosely coupled approach to enable quantification of the dynamics of the vehicle. The simulation results are compared with experiments carried out at the NASA Glenn Research Center. Reasonably good agreement between the simulations and the experiment is observed throughout a deflation cycle. The simulations help to illuminate the details of the dynamic progressive buckling of the tension-cone decelerator that ultimately results in the collapse of the structure as the inflation pressure is decreased. Furthermore, the tension-cone decelerator exhibits a transient oscillatory behavior under impulsive loading that ultimately dies out. The frequency of these oscillations was determined to be related to the acoustic time scale in the compressed subsonic region between the bow shock and the structure. As shown, when the natural frequency of the structure and the frequency of the compressed subsonic region approximately match, the decelerator exhibits relatively large nonaxisymetric oscillations. The observed response appears to be a fluid-structure interaction resonance resulting from an acoustic chamber (pistonlike) mode exciting the structure. Copyright © 2013 by Christopher Porter, R. Mark Rennie, Eric J. Jumper.
Resumo:
In the modern engineering design cycle the use of computational tools becomes a neces- sity. The complexity of the engineering systems under consideration for design increases dramatically as the demands for advanced and innovative design concepts and engineering products is expanding. At the same time the advancements in the available technology in terms of computational resources and power, as well as the intelligence of the design software, accommodate these demands and make them a viable approach towards the chal- lenge of real-world engineering problems. This class of design optimisation problems is by nature multi-disciplinary. In the present work we establish enhanced optimisation capabil- ities within the Nimrod/O tool for massively distributed execution of computational tasks through cluster and computational grid resources, and develop the potential to combine and benefit from all the possible available technological advancements, both software and hardware. We develop the interface between a Free Form Deformation geometry manage- ment in-house code with the 2D airfoil aerodynamic efficiency evaluation tool XFoil, and the well established multi-objective heuristic optimisation algorithm NSGA-II. A simple airfoil design problem has been defined to demonstrate the functionality of the design sys- tem, but also to accommodate a framework for future developments and testing with other state-of-the-art optimisation algorithms such as the Multi-Objective Genetic Algorithm (MOGA) and the Multi-Objective Tabu Search (MOTS) techniques. Ultimately, heav- ily computationally expensive industrial design cases can be realised within the presented framework that could not be investigated before. © 2012 by the authors. Published by the American Institute of Aeronautics and Astronautics, Inc.