955 resultados para Voiced or unvoiced classification
Resumo:
This thesis investigates the potential use of zerocrossing information for speech sample estimation. It provides 21 new method tn) estimate speech samples using composite zerocrossings. A simple linear interpolation technique is developed for this purpose. By using this method the A/D converter can be avoided in a speech coder. The newly proposed zerocrossing sampling theory is supported with results of computer simulations using real speech data. The thesis also presents two methods for voiced/ unvoiced classification. One of these methods is based on a distance measure which is a function of short time zerocrossing rate and short time energy of the signal. The other one is based on the attractor dimension and entropy of the signal. Among these two methods the first one is simple and reguires only very few computations compared to the other. This method is used imtea later chapter to design an enhanced Adaptive Transform Coder. The later part of the thesis addresses a few problems in Adaptive Transform Coding and presents an improved ATC. Transform coefficient with maximum amplitude is considered as ‘side information’. This. enables more accurate tfiiz assignment enui step—size computation. A new bit reassignment scheme is also introduced in this work. Finally, sum ATC which applies switching between luiscrete Cosine Transform and Discrete Walsh-Hadamard Transform for voiced and unvoiced speech segments respectively is presented. Simulation results are provided to show the improved performance of the coder
Resumo:
This thesis investigated the potential use of Linear Predictive Coding in speech communication applications. A Modified Block Adaptive Predictive Coder is developed, which reduces the computational burden and complexity without sacrificing the speech quality, as compared to the conventional adaptive predictive coding (APC) system. For this, changes in the evaluation methods have been evolved. This method is as different from the usual APC system in that the difference between the true and the predicted value is not transmitted. This allows the replacement of the high order predictor in the transmitter section of a predictive coding system, by a simple delay unit, which makes the transmitter quite simple. Also, the block length used in the processing of the speech signal is adjusted relative to the pitch period of the signal being processed rather than choosing a constant length as hitherto done by other researchers. The efficiency of the newly proposed coder has been supported with results of computer simulation using real speech data. Three methods for voiced/unvoiced/silent/transition classification have been presented. The first one is based on energy, zerocrossing rate and the periodicity of the waveform. The second method uses normalised correlation coefficient as the main parameter, while the third method utilizes a pitch-dependent correlation factor. The third algorithm which gives the minimum error probability has been chosen in a later chapter to design the modified coder The thesis also presents a comparazive study beh-cm the autocorrelation and the covariance methods used in the evaluaiicn of the predictor parameters. It has been proved that the azztocorrelation method is superior to the covariance method with respect to the filter stabf-it)‘ and also in an SNR sense, though the increase in gain is only small. The Modified Block Adaptive Coder applies a switching from pitch precitzion to spectrum prediction when the speech segment changes from a voiced or transition region to an unvoiced region. The experiments cont;-:ted in coding, transmission and simulation, used speech samples from .\£=_‘ajr2_1a:r1 and English phrases. Proposal for a speaker reecgnifion syste: and a phoneme identification system has also been outlized towards the end of the thesis.
Resumo:
The aim of this thesis is to investigate computerized voice assessment methods to classify between the normal and Dysarthric speech signals. In this proposed system, computerized assessment methods equipped with signal processing and artificial intelligence techniques have been introduced. The sentences used for the measurement of inter-stress intervals (ISI) were read by each subject. These sentences were computed for comparisons between normal and impaired voice. Band pass filter has been used for the preprocessing of speech samples. Speech segmentation is performed using signal energy and spectral centroid to separate voiced and unvoiced areas in speech signal. Acoustic features are extracted from the LPC model and speech segments from each audio signal to find the anomalies. The speech features which have been assessed for classification are Energy Entropy, Zero crossing rate (ZCR), Spectral-Centroid, Mean Fundamental-Frequency (Meanf0), Jitter (RAP), Jitter (PPQ), and Shimmer (APQ). Naïve Bayes (NB) has been used for speech classification. For speech test-1 and test-2, 72% and 80% accuracies of classification between healthy and impaired speech samples have been achieved respectively using the NB. For speech test-3, 64% correct classification is achieved using the NB. The results direct the possibility of speech impairment classification in PD patients based on the clinical rating scale.
Resumo:
The canonical representation of speech constitutes a perfect reconstruction (PR) analysis-synthesis system. Its parameters are the autoregressive (AR) model coefficients, the pitch period and the voiced and unvoiced components of the excitation represented as transform coefficients. Each set of parameters may be operated on independently. A time-frequency unvoiced excitation (TFUNEX) model is proposed that has high time resolution and selective frequency resolution. Improved time-frequency fit is obtained by using for antialiasing cancellation the clustering of pitch-synchronous transform tracks defined in the modulation transform domain. The TFUNEX model delivers high-quality speech while compressing the unvoiced excitation representation about 13 times over its raw transform coefficient representation for wideband speech.
Resumo:
Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)
Resumo:
3rd SMTDA Conference Proceedings, 11-14 June 2014, Lisbon Portugal.
Resumo:
Este artigo analisa as características específicas e os processos de indexação e classificação realizados em bibliotecas escolares para tratar e recuperar as informações de suas coleções. Também se analisam as linguagens como ferramentas documentais específicas utilizadas em bibliotecas escolares portuguesas espanholas, portuguesas e brasileiras. Para atingir este objetivo, o modelo de biblioteca escolar é estudado de forma crítica, se analisa o conceito de biblioteca escolar de forma crítica, se estudam suas funções e se examinam as técnicas e os instrumentos que permitem organizar a informação. Entre outras ferramentas, estudam-se listas de cabeçalhos de assuntos como os Cabeçalhos de assuntos para livros infantis e juvenis e a Lista de Cabeçalhos de assuntos para as bibliotecas; sistemas de classificação, como a Classificação Decimal Universal (edição de bolso) ou a classificação por centros de interesse e tesauros especializados como o Tesauro da Educação UNESCO-OIE e o Tesauro Europeu da Educação, entre outros.
Resumo:
La biblioteca escolar es un servicio de información básico para todos los miembros de una comunidad educativa, que forma parte de los espacios docentes de los centros y de los procesos pedagógicos que tienen lugar en ellos. Las bibliotecas escolares funcionan como centros de recursos para las actividades de enseñanza-aprendizaje, están constituidas por un conjunto sistematizado y dinámico de servicios y fondos documentales que permiten a los usuarios desarrollar hábitos lectores y buscar y valorar las fuentes de información, entre otras relevantes funciones. Los recursos de información que albergan son uno de sus principales activos, pero si colección documental no está organizada, las tareas de búsqueda y localización de la información resultarán complicadas y la calidad de los recursos obtenidos, cuestionable. Los bibliotecarios deben conocer en profundidad las características específicas del fondo documental y las fuentes disponibles; las técnicas y herramientas adecuadas para procesar y tratar el fondo bibliográfico, así como los métodos de recuperación de la información más convenientes. En este contexto, el objetivo de este trabajo es analizar de forma pormenorizada los procesos de indización y clasificación que se realizan en las bibliotecas escolares para procesar y recuperar la información que albergan su colecciones, así como describir las características más relevantes de las herramientas específicas que se usan en las bibliotecas escolares españolas, brasileñas y portuguesas, adaptadas a las características de los usuarios que utilizan sus servicios y acuden a ellas para resolver necesidades de información. Para lograr este propósito, se analiza el concepto de biblioteca escolar de forma crítica, se estudian sus funciones y se examinan las técnicas y los instrumentos que permiten organizar la información. Entre otras herramientas, se estudian listas de encabezamientos de materia como los Encabezamientos de materia para libros infantiles y juveniles y la Lista de Encabezamientos de materia para las bibliotecas públicas; sistemas de clasificación, como la Clasificación Decimal Universal (edición de bolsillo) o la clasificación por centros de interés y tesauros especializados como el Tesauro de la Educación UNESCO-OIE y el Tesauro Europeo de la Educación, entre otros.
Resumo:
Background Surgery of radiation-induced cataracts in children with retinoblastoma (RB) is a challenge as early intervention is weighted against the need to delay surgery until complete tumour control is obtained. This study analyses the safety and functional results of such surgery. Methods In a retrospective, non-comparative, consecutive case series, we reviewed medical records of RB patients </=14 y of age who underwent either external beam radiotherapy or plaque treatment and were operated for radiation-induced cataract between 1985 and 2008. Results In total, 21 eyes of 20 RB patients were included and 18 out of the 21 eyes had Reese-Ellsworth stage V or ABC classification group D/E RB. Median interval between last treatment for RB and cataract surgery was 21.5 months, range 3-164 months. Phacoaspiration was performed in 13 eyes (61%), extra-capsular cataract extraction in 8 (39%) and intraocular lens implantation in 19 eyes (90%). The majority of cases, 11/21 (52%), underwent posterior capsulorhexis or capsulotomy and 6/21 (28%) an anterior vitrectomy. Postoperative visual acuity was >/=20/200 in 13 eyes and <20/200 in 5 eyes. Intraocular tumour recurrence was noted in three eyes. Mean postoperative follow up was 90 months+/-69 months. Conclusions Modern cataract surgery, including clear cornea approach, lens aspiration with posterior capsulotomy, anterior vitrectomy and IOL implantation is a safe procedure for radiation-induced cataract as long as RB is controlled. The visual prognosis is limited by initial tumour involvement of the macula and by corneal complications of radiotherapy. We recommend a minimal interval of 9 months between completion of treatment of retinoblastoma and cataract surgery.
Resumo:
In this paper we explore the use of non-linear transformations in order to improve the performance of an entropy based voice activity detector (VAD). The idea of using a non-linear transformation comes from some previous work done in speech linear prediction (LPC) field based in source separation techniques, where the score function was added into the classical equations in order to take into account the real distribution of the signal. We explore the possibility of estimating the entropy of frames after calculating its score function, instead of using original frames. We observe that if signal is clean, estimated entropy is essentially the same; but if signal is noisy transformed frames (with score function) are able to give different entropy if the frame is voiced against unvoiced ones. Experimental results show that this fact permits to detect voice activity under high noise, where simple entropy method fails.
Resumo:
Esta investigación se desarrolló con el fin de conocer de conocer detalladamente el comportamiento del sector automotriz de Colombia en los últimos años, haciendo énfasis en el segmento de vehículos de gama media del mercado, los cuales son los más comprados en volumen por los colombianos. El estudio se realizó en un periodo comprendido entre 2.006 y 2.011 para tener identificación y valoración real del sector hasta la situación actual, donde se evaluaron las ensambladoras presentes en el país que pertenecen a la división o clasificación arancelaria dada por Código Industrial Internacional Uniforme - CIIU. El trabajo se fundamenta en la metodología del Análisis Estructural de Sectores Estratégicos por medio del cual es necesario desarrollar un examen histórico del comportamiento del sector en los últimos años con el cual se presenta el desarrollo en términos económicos, ventas, estado de hacinamiento, manchas blancas, principales participantes del mercado, TLC´s y demás rubros que caracterizan el segmento. De igual manera, el trabajo de investigación refleja una vista general de las estrategias utilizadas por las principales marcas del estudio, evaluando como estas son imitadas o no por sus competidores, y de igual manera, determinar que hace a una empresa ser líder en el mercado no tan solo tomando el ámbito económico como pilar.
Resumo:
A fissura de palato, em associação à Sequência de Pierre Robin, pode favorecer o desenvolvimento de produções atípicas (compensatórias), na fala da criança, como é o caso da oclusiva glotal (golpe de glote) comumente observada em substituição aos sons oclusivos (vozeados ou não). No presente estudo, foi realizada a análise dos parâmetros fonético-acústicos da oclusiva glotal produzidas em /k/ e /g/ por uma criança do gênero feminino, com 5 anos, que apresentava fissura de palato reparada, associada à Sequência de Pierre Robin. Para isso, foram selecionadas seis palavras em que a oclusiva velar encontrava-se na posição inicial da palavra e combinada com as vogais /a/, /i/ e /u/ na posição acentuada. Foi ainda realizado julgamento perceptivo-auditivo por três fonoaudiólogos, que apresentou concordância quanto à presença da oclusiva glotal de 100% para ambas as relações (intra e inter-juízes). Na inspeção dos dados via espectrograma foi observada variabilidade dos parâmetros espectrais (burst e transição formântica) e essas variações também puderam ser computadas considerando as vogais separadamente. A análise estatística revelou diferença estatisticamente significante entre as duas consoantes velares (/k/ e /g/) nos parâmetros espectral (burst), temporal (VOT e duração relativa da oclusiva na palavra) e os relativos às características acústicas das vogais adjacentes às oclusivas (período estacionário de F3). Por fim, as características acústicas da oclusiva glotal sugeriram que a criança pode ter utilizado de estratégias para marcar contrastes fônicos na língua, ainda que os mesmos não tenham magnitude suficiente para serem resgatados auditivamente pelo ouvinte.
Resumo:
Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES)
Resumo:
Pós-graduação em Agronomia (Energia na Agricultura) - FCA
Resumo:
This article analyzes the specific features and processes of indexing and classification performed in school libraries to process and retrieve information from their collections. Subject languages used in Spanish, Portuguese and Brazilian Portuguese school libraries are also analyzed. To achieve this goal, the concept of school library was analyzed, its function was studied and the techniques and tools that allow the information organization were examined. Among the tools, we studied the Subject Headings Lists for children and juveniles’ books and the Subject Headings List for public libraries, the Universal Decimal Classification System (paperback edition) or the classification by fields of interest and specialized thesauri like the Tesauro de la Educación UNESCO-OIE and the TesauroEuropeo de la Educación.