83 resultados para Semi-bilingual dictionary
Resumo:
Long-term (2009-2012) data from ground-based measurements of aerosol black carbon (BC) from a semi-urban site, Pantnagar (29.0 degrees N, 79.5 degrees E, 231 m amsl), in the Indo-Gangetic Plain (IGP) near the Himalayan foothills are analyzed to study the regional characterization. Large variations are seen in BC at both diurnal and seasonal scales, associated with the mesoscale and synoptic meteorological processes, and local/regional anthropogenic activities. BC diurnal variations show two peaks (morning and evening) arising from the combined effects of the atmospheric boundary layer (ABL) dynamics and local emissions. The diurnal amplitudes as well as the rates of diurnal evolution are the highest in winter season, followed by autumn, and the lowest in summer-monsoon. BC exhibits nearly an inverse relation with mixing layer depth in all seasons; being strongest in winter (R-2 = 0.89) and weakest (R-2 = 0.33) in monsoon (July-August). Unlike BC, co-located aerosol optical depths (AOD) and aerosol absorption are highest in spring over IGP, probably due to the presence of higher abundances of aerosols (including dust) above the ABL (in the free troposphere). AOD (500 nm) showed annual peak (>0.6) in May-June, dominated by coarse mode, while fine mode aerosols dominated in late autumn and early winter. Aerosols profiles from CALIPSO show highest values close to the surface in winter/autumn, similar to the feature seen in surface BC, whereas at altitudes > 2 km, the extinction is maximum in spring/summer. WRF-Chem model is used to simulate BC temporal variations and then compared with observed BC. The model captures most of the important features of the diurnal and seasonal variations but significantly underestimated the observed BC levels, suggesting improvements in diurnal and seasonal varying BC emissions apart from the boundary layer processes. (C) 2015 Elsevier Ltd. All rights reserved.
Resumo:
Oversmoothing of speech parameter trajectories is one of the causes for quality degradation of HMM-based speech synthesis. Various methods have been proposed to overcome this effect, the most recent ones being global variance (GV) and modulation-spectrum-based post-filter (MSPF). However, there is still a significant quality gap between natural and synthesized speech. In this paper, we propose a two-fold post-filtering technique to alleviate to a certain extent the oversmoothing of spectral and excitation parameter trajectories of HMM-based speech synthesis. For the spectral parameters, we propose a sparse coding-based post-filter to match the trajectories of synthetic speech to that of natural speech, and for the excitation trajectory, we introduce a perceptually motivated post-filter. Experimental evaluations show quality improvement compared with existing methods.
Resumo:
Cross domain and cross-modal matching has many applications in the field of computer vision and pattern recognition. A few examples are heterogeneous face recognition, cross view action recognition, etc. This is a very challenging task since the data in two domains can differ significantly. In this work, we propose a coupled dictionary and transformation learning approach that models the relationship between the data in both domains. The approach learns a pair of transformation matrices that map the data in the two domains in such a manner that they share common sparse representations with respect to their own dictionaries in the transformed space. The dictionaries for the two domains are learnt in a coupled manner with an additional discriminative term to ensure improved recognition performance. The dictionaries and the transformation matrices are jointly updated in an iterative manner. The applicability of the proposed approach is illustrated by evaluating its performance on different challenging tasks: face recognition across pose, illumination and resolution, heterogeneous face recognition and cross view action recognition. Extensive experiments on five datasets namely, CMU-PIE, Multi-PIE, ChokePoint, HFB and IXMAS datasets and comparisons with several state-of-the-art approaches show the effectiveness of the proposed approach. (C) 2015 Elsevier B.V. All rights reserved.
Resumo:
The present work explores the potential of semi-solid heat treatment technique by elucidating its effect on the plastic behavior of 304L SS in hot working domain. To accomplish this objective, hot isothermal compression tests on 304L SS specimens with semi-solid heat treatment and conventional annealing heat treatment have been carried out within a temperature range of 1273-1473 K and strain rates ranging from 0.01 to 1 s(-1). The dynamic flow behavior of this steel in its conventional heat-treated condition and semi-solid heat-treated condition has been characterized in terms of strain hardening, temperature softening, strain rate hardening, and dynamic flow softening. Extensive microstructural investigation has been carried out to corroborate the results obtained from the analysis of flow behavior. Detailed analysis of the results demonstrates that semi-solid heat treatment moderates work hardening, strain rate hardening, and temperature sensitivity of 304L SS, which is favorable for hot deformation. The post-deformation hardness values of semi-solid heat-treated steel and conventionally heat-treated steel were found to remain similar despite the pre-deformation heat treatment conditions. The results obtained demonstrate the potential of semi-solid heat treatment as a pre-deformation heat treatment step to effectively reduce the strength of the material to facilitate easier deformation without affecting the post-deformation properties of the steel.
Resumo:
Cross domain and cross-modal matching has many applications in the field of computer vision and pattern recognition. A few examples are heterogeneous face recognition, cross view action recognition, etc. This is a very challenging task since the data in two domains can differ significantly. In this work, we propose a coupled dictionary and transformation learning approach that models the relationship between the data in both domains. The approach learns a pair of transformation matrices that map the data in the two domains in such a manner that they share common sparse representations with respect to their own dictionaries in the transformed space. The dictionaries for the two domains are learnt in a coupled manner with an additional discriminative term to ensure improved recognition performance. The dictionaries and the transformation matrices are jointly updated in an iterative manner. The applicability of the proposed approach is illustrated by evaluating its performance on different challenging tasks: face recognition across pose, illumination and resolution, heterogeneous face recognition and cross view action recognition. Extensive experiments on five datasets namely, CMU-PIE, Multi-PIE, ChokePoint, HFB and IXMAS datasets and comparisons with several state-of-the-art approaches show the effectiveness of the proposed approach. (C) 2015 Elsevier B.V. All rights reserved.
Resumo:
Human detection is a complex problem owing to the variable pose that they can adopt. Here, we address this problem in sparse representation framework with an overcomplete scale-embedded dictionary. Histogram of oriented gradient features extracted from the candidate image patches are sparsely represented by the dictionary that contain positive bases along with negative and trivial bases. The object is detected based on the proposed likelihood measure obtained from the distribution of these sparse coefficients. The likelihood is obtained as the ratio of contribution of positive bases to negative and trivial bases. The positive bases of the dictionary represent the object (human) at various scales. This enables us to detect the object at any scale in one shot and avoids multiple scanning at different scales. This significantly reduces the computational complexity of detection task. In addition to human detection, it also finds the scale at which the human is detected due to the scale-embedded structure of the dictionary.
Resumo:
We develop a new dictionary learning algorithm called the l(1)-K-svp, by minimizing the l(1) distortion on the data term. The proposed formulation corresponds to maximum a posteriori estimation assuming a Laplacian prior on the coefficient matrix and additive noise, and is, in general, robust to non-Gaussian noise. The l(1) distortion is minimized by employing the iteratively reweighted least-squares algorithm. The dictionary atoms and the corresponding sparse coefficients are simultaneously estimated in the dictionary update step. Experimental results show that l(1)-K-SVD results in noise-robustness, faster convergence, and higher atom recovery rate than the method of optimal directions, K-SVD, and the robust dictionary learning algorithm (RDL), in Gaussian as well as non-Gaussian noise. For a fixed value of sparsity, number of dictionary atoms, and data dimension, l(1)-K-SVD outperforms K-SVD and RDL on small training sets. We also consider the generalized l(p), 0 < p < 1, data metric to tackle heavy-tailed/impulsive noise. In an image denoising application, l(1)-K-SVD was found to result in higher peak signal-to-noise ratio (PSNR) over K-SVD for Laplacian noise. The structural similarity index increases by 0.1 for low input PSNR, which is significant and demonstrates the efficacy of the proposed method. (C) 2015 Elsevier B.V. All rights reserved.
Resumo:
Fingerprints are used for identification in forensics and are classified into Manual and Automatic. Automatic fingerprint identification system is classified into Latent and Exemplar. A novel Exemplar technique of Fingerprint Image Verification using Dictionary Learning (FIVDL) is proposed to improve the performance of low quality fingerprints, where Dictionary learning method reduces the time complexity by using block processing instead of pixel processing. The dynamic range of an image is adjusted by using Successive Mean Quantization Transform (SMQT) technique and the frequency domain noise is reduced using spectral frequency Histogram Equalization. Then, an adaptive nonlinear dynamic range adjustment technique is utilized to determine the local spectral features on corresponding fingerprint ridge frequency and orientation. The dictionary is constructed using spatial fundamental frequency that is determined from the spectral features. These dictionaries help in removing the spurious noise present in fingerprints and reduce the time complexity by using block processing instead of pixel processing. Further, dictionaries are used to reconstruct the image for matching. The proposed FIVDL is verified on FVC database sets and Experimental result shows an improvement over the state-of-the-art techniques. (C) 2015 The Authors. Published by Elsevier B.V.