46 resultados para Mel-Frequency Cepstral Coefficients
em Cochin University of Science
Resumo:
Presently different audio watermarking methods are available; most of them inclined towards copyright protection and copy protection. This is the key motive for the notion to develop a speaker verification scheme that guar- antees non-repudiation services and the thesis is its outcome. The research presented in this thesis scrutinizes the field of audio water- marking and the outcome is a speaker verification scheme that is proficient in addressing issues allied to non-repudiation to a great extent. This work aimed in developing novel audio watermarking schemes utilizing the fun- damental ideas of Fast-Fourier Transform (FFT) or Fast Walsh-Hadamard Transform (FWHT). The Mel-Frequency Cepstral Coefficients (MFCC) the best parametric representation of the acoustic signals along with few other key acoustic characteristics is employed in crafting of new schemes. The au- dio watermark created is entirely dependent to the acoustic features, hence named as FeatureMark and is crucial in this work. In any watermarking scheme, the quality of the extracted watermark de- pends exclusively on the pre-processing action and in this work framing and windowing techniques are involved. The theme non-repudiation provides immense significance in the audio watermarking schemes proposed in this work. Modification of the signal spectrum is achieved in a variety of ways by selecting appropriate FFT/FWHT coefficients and the watermarking schemes were evaluated for imperceptibility, robustness and capacity char- acteristics. The proposed schemes are unequivocally effective in terms of maintaining the sound quality, retrieving the embedded FeatureMark and in terms of the capacity to hold the mark bits. Robust nature of these marking schemes is achieved with the help of syn- chronization codes such as Barker Code with FFT based FeatureMarking scheme and Walsh Code with FWHT based FeatureMarking scheme. An- other important feature associated with this scheme is the employment of an encryption scheme towards the preparation of its FeatureMark that scrambles the signal features that helps to keep the signal features unreve- laed. A comparative study with the existing watermarking schemes and the ex- periments to evaluate imperceptibility, robustness and capacity tests guar- antee that the proposed schemes can be baselined as efficient audio water- marking schemes. The four new digital audio watermarking algorithms in terms of their performance are remarkable thereby opening more opportu- nities for further research.
Resumo:
Malayalam is one of the 22 scheduled languages in India with more than 130 million speakers. This paper presents a report on the development of a speaker independent, continuous transcription system for Malayalam. The system employs Hidden Markov Model (HMM) for acoustic modeling and Mel Frequency Cepstral Coefficient (MFCC) for feature extraction. It is trained with 21 male and female speakers in the age group ranging from 20 to 40 years. The system obtained a word recognition accuracy of 87.4% and a sentence recognition accuracy of 84%, when tested with a set of continuous speech data.
Resumo:
Digit speech recognition is important in many applications such as automatic data entry, PIN entry, voice dialing telephone, automated banking system, etc. This paper presents speaker independent speech recognition system for Malayalam digits. The system employs Mel frequency cepstrum coefficient (MFCC) as feature for signal processing and Hidden Markov model (HMM) for recognition. The system is trained with 21 male and female voices in the age group of 20 to 40 years and there was 98.5% word recognition accuracy (94.8% sentence recognition accuracy) on a test set of continuous digit recognition task.
Resumo:
A primary medium for the human beings to communicate through language is Speech. Automatic Speech Recognition is wide spread today. Recognizing single digits is vital to a number of applications such as voice dialling of telephone numbers, automatic data entry, credit card entry, PIN (personal identification number) entry, entry of access codes for transactions, etc. In this paper we present a comparative study of SVM (Support Vector Machine) and HMM (Hidden Markov Model) to recognize and identify the digits used in Malayalam speech.
Resumo:
The increasing interest in the interaction of light with electricity and electronically active materials made the materials and techniques for producing semitransparent electrically conducting films particularly attractive. Transparent conductors have found major applications in a number of electronic and optoelectronic devices including resistors, transparent heating elements, antistatic and electromagnetic shield coatings, transparent electrode for solar cells, antireflection coatings, heat reflecting mirrors in glass windows and many other. Tin doped indium oxide (indium tin oxide or ITO) is one of the most commonly used transparent conducting oxides. At present and likely well into the future this material offers best available performance in terms of conductivity and transmittivity combined with excellent environmental stability, reproducibility and good surface morphology. Although partial transparency, with a reduction in conductivity, can be obtained for very thin metallic films, high transparency and simultaneously high conductivity cannot be attained in intrinsic stoichiometric materials. The only way this can be achieved is by creating electron degeneracy in a wide bandgap (Eg > 3eV or more for visible radiation) material by controllably introducing non-stoichiometry and/or appropriate dopants. These conditions can be conveniently met for ITO as well as a number of other materials like Zinc oxide, Cadmium oxide etc. ITO shows interesting and technologically important combination of properties viz high luminous transmittance, high IR reflectance, good electrical conductivity, excellent substrate adherence and chemical inertness. ITO is a key part of solar cells, window coatings, energy efficient buildings, and flat panel displays. In solar cells, ITO can be the transparent, conducting top layer that lets light into the cell to shine the junction and lets electricity flow out. Improving the ITO layer can help improve the solar cell efficiency. A transparent ii conducting oxide is a material with high transparency in a derived part of the spectrum and high electrical conductivity. Beyond these key properties of transparent conducting oxides (TCOs), ITO has a number of other key characteristics. The structure of ITO can be amorphous, crystalline, or mixed, depending on the deposition temperature and atmosphere. The electro-optical properties are a function of the crystallinity of the material. In general, ITO deposited at room temperature is amorphous, and ITO deposited at higher temperatures is crystalline. Depositing at high temperatures is more expensive than at room temperature, and this method may not be compatible with the underlying devices. The main objective of this thesis work is to optimise the growth conditions of Indium tin oxide thin films at low processing temperatures. The films are prepared by radio frequency magnetron sputtering under various deposition conditions. The films are also deposited on to flexible substrates by employing bias sputtering technique. The films thus grown were characterised using different tools. A powder x-ray diffractometer was used to analyse the crystalline nature of the films. The energy dispersive x-ray analysis (EDX) and scanning electron microscopy (SEM) were used for evaluating the composition and morphology of the films. Optical properties were investigated using the UVVIS- NIR spectrophotometer by recording the transmission/absorption spectra. The electrical properties were studied using vander Pauw four probe technique. The plasma generated during the sputtering of the ITO target was analysed using Langmuir probe and optical emission spectral studies.
Resumo:
Electromagnetic scattering behaviour of a superstrate loaded metallo– dielectric structure based on Sierpinski carpet fractal geometry is reported. The results indicate that the frequency at which backscattering is minimum can be tuned by varying the thickness of the superstrate. A reduction in backscattered power of 44 dB is obtained simultaneously for both TE and TM polarisations of the incident field.
Resumo:
The radio frequency plasma generated during the sputtering of Indium Tin Oxide target using Argon was analyzed by Langmuir probe and optical-emission spectroscopy. The basic plasma parameters such as electron temperature and ion density were evaluated. These studies were carried out by varying the RF power from 20 to 50 W. A linear increase in ion density and an exponential decrease in electron temperature with rf power were observed. The measured plasma parameters were then correlated with the properties of ITO thin films deposited under similar plasma conditions.
Resumo:
A new dual port microstrip antenna geometry for dual frequency operation is presented. The structure consists of the intersection of two circles of the same radius with their centres displaced by a small fraction of the wavelength . This antenna provides wide impedance bandwidth and excellent isolation between its ports. The gain of the antenna is comparable to that of a standard circular microstrip antenna operating at the same resonant frequency. A theoretical analysis for calculating the resonant frequencies of the two ports is also presented
Resumo:
A novel dual frequency dual-polarized square microstrip patch antenna embedded with a slot is presented. The proposed antenna offers tunability of the frequency ratio between the two frequencies by adjusting the slot dimensions. This configuration also provides a size reduction up to -51 and 35% for the two modes as compared to a square micro strip patch antenna
Resumo:
Dual frequency operation is achieved from a compact microstrip antenna by loading a pair of narrow slots close to its radiating edges. The two frequencies have parallel polarization planes and similar radiation characteristics. The ratio between the two operating frequencies can be tuned in the range (1.14-1. 24), which is much smaller than that of similar designs. The above excellent radiation characteristics are achieved along with an area reduction of - 75% compared to the standard rectangular patch
Resumo:
Design equations are presented for calculating the resonance frequencies for a compact dual frequency arrow-shaped microstrip antenna. This provides a fast and simple way to predict the resonant frequencies of the antenna. The antenna is also analyzed using the IE3D simulation package. The theoretical predictions are found to be very close to the IE3D results and thus establish the validity of the design formulae
Resumo:
A novel dual frequency design of a square microstrip antenna coaxially fed along the diagonal with a chip capacitor mounted at the bottom of the stub is introduced. This design provides enhanced area reduction and good cross-polarization levels. The antenna design can be used as a compact antenna system where limited size is a major requirement. The details of the antenna design and experimental results are presented
Resumo:
The mathematical formulation of empirically developed formulas Jirr the calculation of the resonant frequency of a thick-substrate (h s 0.08151 A,,) microstrip antenna has been analyzed. With the use qt' tunnel-based artificial neural networks (ANNs), the resonant frequency of antennas with h satisfying the thick-substrate condition are calculated and compared with the existing experimental results and also with the simulation results obtained with the use of an IE3D software package. The artificial neural network results are in very good agreement with the experimental results
Resumo:
A new design of' a dual-frequency dual-polarized square microsh'ip antenna fed along the diagonal, embedded with a square slot having three extended stubs for frequency tuning, is introduced. The proposed antenna was fabricated using a standard photolithographic method and the antenna was tested using the HP 3510(:; Vector Network Analyser. The antenna is capable of generating dual resonant frequencies with mutually perpendicular polarizations and broad radiation pattern characteristics. Such dual-frequency designs find wide applications in personal mobile handsets combining GSM and CDS 1800 modes, and applications in which different frequencies are used for emission and reception such as personal satellite communications and cellular network systems.