962 resultados para Classification Tree Pruning
Resumo:
This paper investigates several approaches to bootstrapping a new spoken language understanding (SLU) component in a target language given a large dataset of semantically-annotated utterances in some other source language. The aim is to reduce the cost associated with porting a spoken dialogue system from one language to another by minimising the amount of data required in the target language. Since word-level semantic annotations are costly, Semantic Tuple Classifiers (STCs) are used in conjunction with statistical machine translation models both of which are trained from unaligned data to further reduce development time. The paper presents experiments in which a French SLU component in the tourist information domain is bootstrapped from English data. Results show that training STCs on automatically translated data produced the best performance for predicting the utterance's dialogue act type, however individual slot/value pairs are best predicted by training STCs on the source language and using them to decode translated utterances. © 2010 ISCA.
Resumo:
Most HMM-based TTS systems use a hard voiced/unvoiced classification to produce a discontinuous F0 signal which is used for the generation of the source-excitation. When a mixed source excitation is used, this decision can be based on two different sources of information: the state-specific MSD-prior of the F0 models, and/or the frame-specific features generated by the aperiodicity model. This paper examines the meaning of these variables in the synthesis process, their interaction, and how they affect the perceived quality of the generated speech The results of several perceptual experiments show that when using mixed excitation, subjects consistently prefer samples with very few or no false unvoiced errors, whereas a reduction in the rate of false voiced errors does not produce any perceptual improvement. This suggests that rather than using any form of hard voiced/unvoiced classification, e.g., the MSD-prior, it is better for synthesis to use a continuous F0 signal and rely on the frame-level soft voiced/unvoiced decision of the aperiodicity model. © 2011 IEEE.
Resumo:
The standard, ad-hoc stopping criteria used in decision tree-based context clustering are known to be sub-optimal and require parameters to be tuned. This paper proposes a new approach for decision tree-based context clustering based on cross validation and hierarchical priors. Combination of cross validation and hierarchical priors within decision tree-based context clustering offers better model selection and more robust parameter estimation than conventional approaches, with no tuning parameters. Experimental results on HMM-based speech synthesis show that the proposed approach achieved significant improvements in naturalness of synthesized speech over the conventional approaches. © 2011 IEEE.
Resumo:
We present a novel, implementation friendly and occlusion aware semi-supervised video segmentation algorithm using tree structured graphical models, which delivers pixel labels alongwith their uncertainty estimates. Our motivation to employ supervision is to tackle a task-specific segmentation problem where the semantic objects are pre-defined by the user. The video model we propose for this problem is based on a tree structured approximation of a patch based undirected mixture model, which includes a novel time-series and a soft label Random Forest classifier participating in a feedback mechanism. We demonstrate the efficacy of our model in cutting out foreground objects and multi-class segmentation problems in lengthy and complex road scene sequences. Our results have wide applicability, including harvesting labelled video data for training discriminative models, shape/pose/articulation learning and large scale statistical analysis to develop priors for video segmentation. © 2011 IEEE.
Resumo:
Hundreds of tropical plant species house ant colonies in specialized chambers called domatia. When, in 1873, Richard Spruce likened plant-ants to fleas and asserted that domatia are ant-created galls, he incited a debate that lasted almost a century. Alth
Resumo:
We examined protein polymorphism of 20 native pig breeds in China and 3 introduced pig breeds. Thirty loci have been investigated, among which six loci were found to be polymorphic. Especially, the polymorphism of malate dehydrogenase (MDH), adenylate kinase (AK), and two new alleles of adenosine deaminase (ADA) had not been reported in domestic pigs and wild pigs. The percentage of polymorphic loci (P), the mean heterozygosity (H), and the mean number of alleles (A) are 0.200, 0.065, and 1.300, respectively. The degree of genetic variability of Chinese pigs as a whole was higher than that of goats, lower than that of cattle and horses, and similar to that of sheep. Using the gene frequencies of the 30 loci, Nei's genetic distance among the 20 native breeds in China and 3 introduced pig breeds was calculated by the formula of Nei. The program NEIGHBOR in PHYLIP 3.5c was chosen to construct an UPGMA tree and a NJ tree. Our results show that, of the total genetic variation found in the native pig breeds in China, 31% (0.31) is ascribable to genetic differences among breeds. About 69% of the total genetic variation is found within breeds. Most breeds are in linkage disequilibrium. The patterns of genetic similarities between the Chinese native pig breeds were not in agreement with the proposed pig type classification.
Resumo:
A brief description is given of a program to carry out analysis of variance two-way classification on MICRO 2200, for use in fishery data processing.
Resumo:
P>The non-classical major histocompatibility complex (MHC) class I molecule CD1d presents lipid antigens to invariant natural killer T (iNKT) cells, which are an important part of the innate immune system. CD1d/iNKT systems are highly conserved in evoluti
Semantic Discriminant mapping for classification and browsing of remote sensing textures and objects
Resumo:
We present a new approach based on Discriminant Analysis to map a high dimensional image feature space onto a subspace which has the following advantages: 1. each dimension corresponds to a semantic likelihood, 2. an efficient and simple multiclass classifier is proposed and 3. it is low dimensional. This mapping is learnt from a given set of labeled images with a class groundtruth. In the new space a classifier is naturally derived which performs as well as a linear SVM. We will show that projecting images in this new space provides a database browsing tool which is meaningful to the user. Results are presented on a remote sensing database with eight classes, made available online. The output semantic space is a low dimensional feature space which opens perspectives for other recognition tasks. © 2005 IEEE.
Resumo:
Life is full of difficult choices. Everyone has their own way of dealing with these, some effective, some not. The problem is particularly acute in engineering design because of the vast amount of information designers have to process. This paper deals with a subset of this set of problems: the subset of selecting materials and processes, and their links to the design of products. Even these, though, present many of the generic problems of choice, and the challenges in creating tools to assist the designer in making them. The key elements are those of classification, of indexing, of reaching decisions using incomplete data in many different formats, and of devising effective strategies for selection. This final element - that of selection strategies - poses particular challenges. Product design, as an example, is an intricate blend of the technical and (for want of a better word) the aesthetic. To meet these needs, a tool that allows selection by analysis, by analogy, by association and simply by 'browsing' is necessary. An example of such a tool, its successes and remaining challenges, will be described.
Resumo:
Fea's tree rat (Chiromyscus chiropus) is a very rare species which there are only a few specimens in the world. The chromosomes of two male specimens, collected from Xishuanbanna, Yunnan, are analysed by several banding technique (G-, C-bands, as well as Ag-staining). The diploid chromosome number is 22, and autosomes comprise 5 pairs of metacentrics, 2 pairs of subacrocentrics, and 3 pairs of acrocentrics. The X chromosome is a acrocentric, and Y is a micro-chromosome, almost a point, which could be a marker chromosome of the species and the genus. The centromeric C-bands are very faint, and C-bands of Nos. 1, 2, 9 and Y chromosome are negative. Only one pair Ag-NORs was found on No. 10 in the silver-stained karyotype. The relationship between morphologic and chromosomal features was discussed, and C-banded karyotype evolutionary trend has also been discussed. Moreover, the conventional karyotype of Niviventer confucianus was described.