Digital Library

19 results for optical character recognition system

Acceleration Of HMM-Based Speech Recognition System By Parallel FPGA Gaussian Calculation

Relevance: 100.00%

An Improved Palmprint Recognition System Using Iris Features

Relevance: 100.00%

A Generalized Fuzzy Linguistic Model for Predicting Component Concentrations in an Optical Gas Sensing System

Relevance: 100.00%

Abstract:

Motivated by environmental protection concerns, monitoring the flue gas of thermal power plants is now often mandatory due to the need to ensure that emission levels stay within safe limits. Optical-based gas sensing systems are increasingly employed for this purpose, with regression techniques used to relate gas optical absorption spectra to the concentrations of specific gas components of interest (NOx, SO2, etc.). Accurately predicting gas concentrations from absorption spectra remains a challenging problem due to the presence of nonlinearities in the relationships and the high-dimensional and correlated nature of the spectral data. This article proposes a generalized fuzzy linguistic model (GFLM) to address this challenge. The GFLM is made up of a series of “If-Then” fuzzy rules. The absorption spectra are input variables in the rule antecedent. The rule consequent is a general nonlinear polynomial function of the absorption spectra. Model parameters are estimated using least squares and gradient descent optimization algorithms. The performance of GFLM is compared with other traditional prediction models, such as partial least squares, support vector machines, multilayer perceptron neural networks and radial basis function networks, for two real flue gas spectral datasets: one from a coal-fired power plant and one from a gas-fired power plant. The experimental results show that the generalized fuzzy linguistic model has good predictive ability, and is competitive with alternative approaches, while having the added advantage of providing an interpretable model.
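The rule structure described in this abstract (spectra in the antecedent, a polynomial of the spectra in the consequent) can be illustrated with a minimal inference sketch. It assumes Gaussian membership functions, a second-order polynomial consequent without cross terms, and a firing-strength-weighted average of the rule outputs. None of these choices is stated in the abstract, and every name below (`firing_strength`, `quadratic`, `predict`, the rule dictionary keys) is hypothetical. Parameter estimation by least squares and gradient descent is not shown.

```python
import math


def firing_strength(x, centers, widths):
    """Antecedent: product of Gaussian memberships (an assumed form)."""
    exponent = sum(((xi - c) / w) ** 2 for xi, c, w in zip(x, centers, widths))
    return math.exp(-0.5 * exponent)


def quadratic(x, bias, linear, square):
    """Consequent: second-order polynomial of the inputs, no cross terms."""
    first = sum(a * xi for a, xi in zip(linear, x))
    second = sum(b * xi * xi for b, xi in zip(square, x))
    return bias + first + second


def predict(x, rules):
    """Combine the rule outputs, weighted by how strongly each rule fires."""
    weights = [firing_strength(x, r["centers"], r["widths"]) for r in rules]
    outputs = [quadratic(x, r["bias"], r["linear"], r["square"]) for r in rules]
    total = sum(weights)
    if total == 0.0:
        raise ValueError("no rule fires for this input")
    return sum(w * y for w, y in zip(weights, outputs)) / total


# Hypothetical two-rule model over a two-channel "spectrum".
rules = [
    {"centers": [0.2, 0.4], "widths": [0.1, 0.1],
     "bias": 1.0, "linear": [2.0, 0.0], "square": [0.0, 0.5]},
    {"centers": [0.8, 0.6], "widths": [0.1, 0.1],
     "bias": 0.0, "linear": [0.0, 3.0], "square": [1.0, 0.0]},
]
concentration = predict([0.25, 0.45], rules)
```

Each rule stays readable on its own (where it applies and what polynomial it uses), which is the kind of interpretability the abstract refers to.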

System oriented neural networks - Problem formulation, methodology and application

Relevance: 100.00%

Abstract:

A novel methodology is proposed for the development of neural network models for complex engineering systems exhibiting nonlinearity. This method performs neural network modeling by first establishing some fundamental nonlinear functions from a priori engineering knowledge, which are then constructed and coded into appropriate chromosome representations. Given a suitable fitness function, using evolutionary approaches such as genetic algorithms, a population of chromosomes evolves for a certain number of generations to finally produce a neural network model best fitting the system data. The objective is to improve the transparency of the neural networks, i.e. to produce physically meaningful

FPGA Implementation of a Pipelined Gaussian Calculation for HMM-Based Large Vocabulary Speech Recognition

Relevance: 100.00%

Abstract:

A scalable large vocabulary, speaker independent speech recognition system is being developed using Hidden Markov Models (HMMs) for acoustic modeling and a Weighted Finite State Transducer (WFST) to compile sentence, word, and phoneme models. The system comprises a software backend search and an FPGA-based Gaussian calculation which are covered here. In this paper, we present an efficient pipelined design implemented both as an embedded peripheral and as a scalable, parallel hardware accelerator. Both architectures have been implemented on an Alpha Data XRC-5T1 reconfigurable computer housing a Virtex 5 SX95T FPGA. The core has been tested and is capable of calculating a full set of Gaussian results from 3825 acoustic models in 9.03 ms, which, coupled with a backend search of 5000 words, has provided an accuracy of over 80%. Parallel implementations have been designed with up to 32 cores and have been successfully implemented with a clock frequency of 133 MHz.
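For readers unfamiliar with the "Gaussian calculation" step, the following is a minimal software sketch of scoring one feature frame against a set of acoustic models. It assumes single diagonal-covariance Gaussians and log-domain scores, which is a common formulation but is not confirmed by the abstract; the function names (`precompute`, `log_gaussian`, `score_frame`) are hypothetical and do not describe the authors' hardware design.

```python
import math


def precompute(mean, var):
    """Fold everything that does not depend on the frame into constants.

    Returns (const, mean, half_precision) so that the per-frame work is a
    subtract, a square, a multiply and an accumulate per dimension.
    """
    const = -0.5 * sum(math.log(2.0 * math.pi * v) for v in var)
    half_precision = [0.5 / v for v in var]
    return const, list(mean), half_precision


def log_gaussian(x, model):
    """Log-likelihood of frame x under one diagonal-covariance Gaussian."""
    const, mean, half_precision = model
    acc = const
    for xi, mi, pi in zip(x, mean, half_precision):
        d = xi - mi
        acc -= d * d * pi
    return acc


def score_frame(x, models):
    """Score one frame against every model; each score is independent."""
    return [log_gaussian(x, m) for m in models]


# Hypothetical two-dimensional models, for illustration only.
models = [precompute([0.0, 0.0], [1.0, 1.0]),
          precompute([1.0, -1.0], [0.5, 2.0])]
scores = score_frame([0.2, -0.1], models)
```

Because every model's score is independent of the others, the loop in `score_frame` is the part that lends itself to the pipelined and multi-core parallel implementations the abstract mentions.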

Pulse shape measurements using single shot-frequency resolved optical gating for high energy (80 J) short pulse (600 fs) laser

Relevance: 100.00%

Abstract:

Relevant to laser based electron/ion accelerations, a single shot second harmonic generation frequency resolved optical gating (FROG) system has been developed to characterize laser pulses (80 J, ∼600 fs) incident on and transmitted through nanofoil targets, employing relay imaging, spatial filter, and partially coated glass substrates to reduce spatial nonuniformity and B-integral. The device can be completely aligned without using a pulsed laser source. Variations of incident pulse shape were measured from durations of 613 fs (nearly symmetric shape) to 571 fs (asymmetric shape with pre- or postpulse). The FROG measurements are consistent with independent spectral and autocorrelation measurements. © 2010 American Institute of Physics.

3D Morphable Model Construction for Robust Ear and Face Recognition

Relevance: 100.00%

Abstract:

Recent work suggests that the human ear varies significantly between different subjects and can be used for identification. In principle, therefore, using ears in addition to the face within a recognition system could improve accuracy and robustness, particularly for non-frontal views. The paper describes work that investigates this hypothesis using an approach based on the construction of a 3D morphable model of the head and ear. One issue with creating a model that includes the ear is that existing training datasets contain noise and partial occlusion. Rather than exclude these regions manually, a classifier has been developed which automates this process. When combined with a robust registration algorithm the resulting system enables full head morphable models to be constructed efficiently using less constrained datasets. The algorithm has been evaluated using registration consistency, model coverage and minimalism metrics, which together demonstrate the accuracy of the approach. To make it easier to build on this work, the source code has been made available online.

Cascaded multimodal biometric recognition framework

Relevance: 100.00%

Abstract:

A practically viable multi-biometric recognition system should not only be stable, robust and accurate but should also adhere to real-time processing speed and memory constraints. This study proposes a cascaded classifier-based framework for use in biometric recognition systems. The proposed framework utilises a set of weak classifiers to reduce the enrolled users' dataset to a small list of candidate users. This list is then used by a strong classifier set as the final stage of the cascade to formulate the decision. At each stage, the candidate list is generated by a Mahalanobis distance-based match score quality measure. One of the key features of the authors' framework is that each classifier in the ensemble can be designed to use a different modality, thus providing the advantages of a truly multimodal biometric recognition system. In addition, it is one of the first truly multimodal cascaded classifier-based approaches for biometric recognition. The performance of the proposed system is evaluated both for single and multimodalities to demonstrate the effectiveness of the approach.
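The cascade idea in this abstract (cheap stages shrink the candidate list, a final stage decides) can be sketched as follows. This is an illustration under stated assumptions, not the authors' method: each stage simply ranks candidates by a diagonal-covariance Mahalanobis distance between the probe and an enrolled template, whereas the abstract's "match score quality measure" is not specified. The names `mahalanobis_diag` and `cascade_identify`, the data layout and the example users are all hypothetical.

```python
import math


def mahalanobis_diag(x, mean, var):
    """Mahalanobis distance under an assumed diagonal covariance."""
    return math.sqrt(sum((a - m) ** 2 / v for a, m, v in zip(x, mean, var)))


def cascade_identify(probe, enrolled, stages, keep):
    """Identify a user by pruning candidates stage by stage.

    probe:    dict modality -> feature vector
    enrolled: dict user -> dict modality -> (mean, var) template
    stages:   modality names, cheapest first, decisive stage last
    keep:     candidate-list size after each stage except the last
    """
    def distance(user, modality):
        mean, var = enrolled[user][modality]
        return mahalanobis_diag(probe[modality], mean, var)

    candidates = list(enrolled)
    for modality, k in zip(stages[:-1], keep):
        # Early stage: keep only the k closest candidates for this modality.
        candidates = sorted(candidates, key=lambda u: distance(u, modality))[:k]
    # Final stage: decide among the survivors only.
    return min(candidates, key=lambda u: distance(u, stages[-1]))


# Hypothetical enrolment data with two modalities.
enrolled = {
    "alice": {"face": ([0.0, 0.0], [1.0, 1.0]), "iris": ([0.0], [1.0])},
    "bob":   {"face": ([1.0, 1.0], [1.0, 1.0]), "iris": ([5.0], [1.0])},
    "carol": {"face": ([9.0, 9.0], [1.0, 1.0]), "iris": ([0.1], [1.0])},
}
probe = {"face": [0.6, 0.6], "iris": [0.2]}
winner = cascade_identify(probe, enrolled, stages=["face", "iris"], keep=[2])
```

In this toy data the face stage discards "carol" before the iris stage runs, so the final comparison touches two templates instead of three; with large enrolment sets that pruning is where the speed and memory savings come from.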

Comprehensive numerical modelling of the performance of a second harmonic generation stage coupled with a low-gain optical parametric amplifier

Relevance: 100.00%

Abstract:

We present a comprehensive model for predicting the full performance of a second harmonic generation-optical parametric amplification system that aims at enhancing the temporal contrast of laser pulses. The model simultaneously takes into account all the main parameters at play in the system, such as the group velocity mismatch, the beam divergence, the spectral content, the pump depletion, and the length of the nonlinear crystals. We monitor the influence of the initial parameters of the input pulse and the interdependence of the two related nonlinear processes on the performance of the system and show its optimum configuration. The influence of the initial beam divergence on the spectral and the temporal characteristics of the generated pulse is discussed. In addition, we show that using a crystal slightly longer than the optimum length and introducing a small delay between the seed and the pump ensures maximum efficiency and compensates for the spectral shift in the optical parametric amplification stage in the case of a chirped input pulse. As an example, calculations for bandwidth transform limited and chirped pulses of sub-picosecond duration in beta barium borate crystal are presented.

An iterative longest matching segment approach to speech enhancement with additive noise and channel distortion

Relevance: 100.00%

Abstract:

This paper presents a new approach to speech enhancement from single-channel measurements involving both noise and channel distortion (i.e., convolutional noise), and demonstrates its applications for robust speech recognition and for improving noisy speech quality. The approach is based on finding longest matching segments (LMS) from a corpus of clean, wideband speech. The approach adds three novel developments to our previous LMS research. First, we address the problem of channel distortion as well as additive noise. Second, we present an improved method for modeling noise for speech estimation. Third, we present an iterative algorithm which updates the noise and channel estimates of the corpus data model. In experiments using speech recognition as a test with the Aurora 4 database, the use of our enhancement approach as a preprocessor for feature extraction significantly improved the performance of a baseline recognition system. In another comparison against conventional enhancement algorithms, both the PESQ and the segmental SNR ratings of the LMS algorithm were superior to the other methods for noisy speech enhancement.

Speech Enhancement from Additive Noise and Channel Distortion - a Corpus-Based Approach

Relevance: 100.00%

Abstract:

This paper presents a new approach to single-channel speech enhancement involving both noise and channel distortion (i.e., convolutional noise). The approach is based on finding longest matching segments (LMS) from a corpus of clean, wideband speech. The approach adds three novel developments to our previous LMS research. First, we address the problem of channel distortion as well as additive noise. Second, we present an improved method for modeling noise. Third, we present an iterative algorithm for improved speech estimates. In experiments using speech recognition as a test with the Aurora 4 database, the use of our enhancement approach as a preprocessor for feature extraction significantly improved the performance of a baseline recognition system. In another comparison against conventional enhancement algorithms, both the PESQ and the segmental SNR ratings of the LMS algorithm were superior to the other methods for noisy speech enhancement. Index Terms: corpus-based speech model, longest matching segment, speech enhancement, speech recognition

An intelligent system for facial emotion recognition

Relevance: 40.00%

On emotion recognition of faces and of speech using neural networks, fuzzy logic and the ASSESS system

Relevance: 40.00%

Electrical, Thermal and Optical Diagnostics of an Atmospheric Plasma Jet System

Relevance: 40.00%

Abstract:

Plasma diagnostics of atmospheric plasmas is a key tool in helping to understand processing performance issues. This paper presents an electrical, optical and thermographic imaging study of the PlasmaStream atmospheric plasma jet system. The system was found to exhibit three operating modes: one constricted/localized plasma and two extended volume plasmas. At low power and helium flows, the plasma is localized at the electrodes and has the electrical properties of a corona/filamentary discharge with a chaotic electrical temporal structure. With increasing discharge power and helium flow, the plasma expands into the volume of the tube, becoming regular and homogeneous in appearance. Emission spectra show evidence of atomic oxygen, nitric oxide and hydroxyl radical production. The plasma-activated gas temperature, deduced from the rotational temperature of nitrogen molecules, was found to be of the order of 400 K, whereas thermographic imaging of the quartz tube yielded surface temperatures between 319 and 347 K.
