917 results for Bregman divergence
Abstract:
Clustering ensemble methods produce a consensus partition of a set of data points by combining the results of a collection of base clustering algorithms. In the evidence accumulation clustering (EAC) paradigm, the clustering ensemble is transformed into a pairwise co-association matrix, thus avoiding the label correspondence problem, which is intrinsic to other clustering ensemble schemes. In this paper, we propose a consensus clustering approach based on the EAC paradigm, which is not limited to crisp partitions and fully exploits the nature of the co-association matrix. Our solution determines probabilistic assignments of data points to clusters by minimizing a Bregman divergence between the observed co-association frequencies and the corresponding co-occurrence probabilities expressed as functions of the unknown assignments. We additionally propose an optimization algorithm to find a solution under any double-convex Bregman divergence. Experiments on both synthetic and real benchmark data show the effectiveness of the proposed approach.
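The objective described in the abstract above can be sketched as follows. This is a minimal illustration, not the authors' algorithm: it assumes that the co-occurrence probability of points i and j is the inner product of their assignment vectors, and the helper names (`co_association`, `objective`) and the toy ensemble are hypothetical.

```python
import math

def co_association(partitions):
    """Fraction of base partitions in which points i and j share a cluster."""
    n = len(partitions[0])
    m = len(partitions)
    return [[sum(p[i] == p[j] for p in partitions) / m for j in range(n)]
            for i in range(n)]

def squared_error(x, y):
    # Bregman divergence generated by phi(t) = t^2.
    return (x - y) ** 2

def generalized_kl(x, y, eps=1e-12):
    # Bregman divergence generated by phi(t) = t*log(t) - t.
    y = max(y, eps)  # guard against log of zero
    if x == 0:
        return y
    return x * math.log(x / y) - x + y

def objective(C, Y, divergence):
    """Sum over pairs i < j of the divergence between the observed
    co-association C[i][j] and the model co-occurrence probability,
    assumed here to be sum_k Y[i][k] * Y[j][k]."""
    total = 0.0
    n = len(C)
    for i in range(n):
        for j in range(i + 1, n):
            p = sum(a * b for a, b in zip(Y[i], Y[j]))
            total += divergence(C[i][j], p)
    return total

# Hypothetical ensemble: three base partitions of four points.
partitions = [[0, 0, 1, 1], [0, 0, 0, 1], [0, 1, 1, 1]]
C = co_association(partitions)

# A crisp candidate assignment: points 0,1 in cluster 0 and points 2,3 in cluster 1.
Y = [[1.0, 0.0], [1.0, 0.0], [0.0, 1.0], [0.0, 1.0]]
print(objective(C, Y, squared_error))
print(objective(C, Y, generalized_kl))
```

Replacing the rows of `Y` with non-crisp probability vectors gives the probabilistic assignments the abstract refers to; an optimizer would search over such rows to reduce the objective.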
Abstract:
Thesis (Ph.D.)--University of Washington, 2016-06
Abstract:
Content-based image retrieval is important for various purposes, such as disease diagnosis from computed tomography. The social and economic relevance of image retrieval systems has created a need to improve them. Content-based image retrieval systems are composed of two stages: feature extraction and similarity measurement. The similarity stage remains a challenge because of the wide variety of similarity measurement functions, which can be combined with the different techniques used in the retrieval process and do not always return satisfactory results. The most common similarity functions are the Euclidean and cosine measures, but some researchers have noted limitations of these conventional proximity functions in the similarity search step. For that reason, Bregman divergences (Kullback-Leibler and generalized I-divergence) have attracted the attention of researchers because of their flexibility in similarity analysis. The aim of this research was therefore to conduct a comparative study of Bregman divergences against the Euclidean and cosine functions in the similarity step of content-based image retrieval, examining the advantages and disadvantages of each function. To this end, a content-based image retrieval system was built in two stages, offline and online, using the BSM, FISM, BoVW and BoVW-SPM approaches. With this system, three groups of experiments were run on the Caltech101, Oxford and UK-bench databases. The performance of the system under the different similarity functions was evaluated with the following measures: Mean Average Precision, normalized Discounted Cumulative Gain, precision at k, and precision versus recall.
Finally, this study shows that Bregman divergences (Kullback-Leibler and generalized I-divergence) obtain better results than the Euclidean and cosine measures, with significant gains for content-based image retrieval.
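The four similarity functions compared in the abstract above can be sketched on feature histograms as follows. This is an illustrative sketch only: the smoothing constant `eps`, the normalization inside the Kullback-Leibler function, the `rank` helper and the toy histograms are assumptions, and nothing here reproduces the study's pipeline or results.

```python
import math

def euclidean(p, q):
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(p, q)))

def cosine_distance(p, q):
    dot = sum(a * b for a, b in zip(p, q))
    norm = math.sqrt(sum(a * a for a in p)) * math.sqrt(sum(b * b for b in q))
    return 1.0 - dot / norm

def _normalize(v, eps):
    # Smooth with eps so no bin is exactly zero, then scale to sum to one.
    smoothed = [x + eps for x in v]
    total = sum(smoothed)
    return [x / total for x in smoothed]

def kullback_leibler(p, q, eps=1e-10):
    # KL divergence between the two histograms viewed as distributions.
    p, q = _normalize(p, eps), _normalize(q, eps)
    return sum(a * math.log(a / b) for a, b in zip(p, q))

def generalized_i(p, q, eps=1e-10):
    # Generalized I-divergence on unnormalized, non-negative histograms.
    return sum((a + eps) * math.log((a + eps) / (b + eps)) - a + b
               for a, b in zip(p, q))

def rank(query, database, dist):
    """Indices of database entries sorted from most to least similar."""
    return sorted(range(len(database)), key=lambda i: dist(query, database[i]))

# Hypothetical 3-bin histograms.
query = [4, 1, 0]
database = [[0, 1, 4], [8, 2, 0], [4, 1, 1]]
for dist in (euclidean, cosine_distance, kullback_leibler, generalized_i):
    print(dist.__name__, rank(query, database, dist))
```

On this toy data the rankings differ between functions because the second database entry is a scaled copy of the query: scale-invariant measures place it first, while the Euclidean distance does not. Both divergences are asymmetric, so the argument order (query first) is itself a design choice.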
Abstract:
Background: The cattle tick, Rhipicephalus (Boophilus) microplus, has an economic impact on the cattle industry in tropical and subtropical regions of the world. The morphological and genetic differences among R. microplus strains have been documented in the literature, suggesting that biogeographical and ecological separation may have resulted in boophilid ticks from America/Africa and those from Australia being different species. To test the hypothesis of the presence of different boophilid species, herein we performed a series of experiments to characterize the reproductive performance of crosses between R. microplus from Australia, Africa and America and the genetic diversity of strains from Australia, Asia, Africa and America. Results: The results showed that the crosses between Australian and Argentinean or Mozambican strains of boophilid ticks are infertile while crosses between Argentinean and Mozambican strains are fertile. These results showed that tick strains from Africa (Mozambique) and America (Argentina) are the same species, while ticks from Australia may actually represent a separate species. The genetic analyses of mitochondrial 12S and 16S rDNA and microsatellite loci were not conclusive when taken separately, but provided evidence that Australian tick strains were genetically different from Asian, African and American strains. Conclusion: The results reported herein support the hypothesis that at least two different species share the name R. microplus. These species could be redefined as R. microplus (Canestrini, 1887) (for American and African strains) and probably the old R. australis Fuller, 1899 (for Australian strains), which needs to be redescribed. However, experiments with a larger number of tick strains from different geographic locations are needed to corroborate these results.
Abstract:
Background: Cryptic species complexes are common among anophelines. Previous phylogenetic analysis based on the complete mtDNA COI gene sequences detected paraphyly in the Neotropical malaria vector Anopheles marajoara. The "Folmer region" detects a single taxon using a 3% divergence threshold. Methods: To test the paraphyletic hypothesis and examine the utility of the Folmer region, genealogical trees based on a concatenated (white + 3' COI sequences) dataset and pairwise differentiation of COI fragments were examined. The population structure and demographic history were based on partial COI sequences for 294 individuals from 14 localities in Amazonian Brazil. 109 individuals from 12 localities were sequenced for the nDNA white gene, and 57 individuals from 11 localities were sequenced for the ribosomal DNA (rDNA) internal transcribed spacer 2 (ITS2). Results: Distinct A. marajoara lineages were detected by combined genealogical analysis and were also supported among COI haplotypes using a median joining network and AMOVA, with time since divergence during the Pleistocene (< 100,000 ya). COI sequences at the 3' end were more variable, demonstrating significant pairwise differentiation (3.82%) compared to the more moderate 2.92% detected by the Folmer region. Lineage 1 was present in all localities, whereas lineage 2 was restricted mainly to the west. Mismatch distributions for both lineages were bimodal, likely due to multiple colonization events and spatial expansion (approximately 798 to 81,045 ya). There appears to be gene flow within, not between lineages, and a partial barrier was detected near Rio Jari in Amapá state, separating western and eastern populations. In contrast, both nDNA data sets (white gene sequences with or without the retention of the 4th intron, and ITS2 sequences and length) detected a single A. marajoara lineage.
Conclusions: Strong support for combined data, with significant differentiation detected in the COI and absent in the nDNA, suggests that the divergence is recent and detectable only by the faster evolving mtDNA. A within-subgenus threshold of >2% may be more appropriate among sister taxa in cryptic anopheline complexes than the standard 3%. Differences in demographic history and climatic changes may have contributed to mtDNA lineage divergence in A. marajoara.
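The threshold comparison in the abstract above can be illustrated with a short sketch. The abstract does not state which distance model was used, so the uncorrected p-distance below is an assumption, and the function names and toy sequences are hypothetical; only the 2.92%, 3.82%, 3% and 2% figures come from the abstract.

```python
def p_distance(seq_a, seq_b):
    """Proportion of differing sites among aligned positions where both
    sequences carry an unambiguous base (gaps and N are skipped)."""
    compared = 0
    differing = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a in "ACGT" and b in "ACGT":
            compared += 1
            if a != b:
                differing += 1
    if compared == 0:
        raise ValueError("no comparable sites")
    return differing / compared

def exceeds_threshold(divergence, threshold):
    """True when the pairwise divergence is above the species threshold."""
    return divergence > threshold

# Toy aligned fragments (hypothetical), differing at one of ten sites.
print(p_distance("ACGTACGTAC", "ACGTACGTAA"))

# Divergences reported in the abstract, as fractions.
folmer, three_prime = 0.0292, 0.0382
print(exceeds_threshold(folmer, 0.03))       # Folmer region vs the standard 3%
print(exceeds_threshold(three_prime, 0.03))  # 3' end vs the standard 3%
print(exceeds_threshold(folmer, 0.02))       # Folmer region vs the suggested 2%
```

Under the standard 3% threshold the Folmer-region value falls below the cutoff while the 3' value exceeds it; under a 2% threshold both exceed it, which is the contrast the conclusions draw on.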
Abstract:
Background: The malaria parasite Plasmodium falciparum exhibits abundant genetic diversity, and this diversity is key to its success as a pathogen. Previous efforts to study genetic diversity in P. falciparum have begun to elucidate the demographic history of the species, as well as patterns of population structure and patterns of linkage disequilibrium within its genome. Such studies will be greatly enhanced by new genomic tools and recent large-scale efforts to map genomic variation. To that end, we have developed a high throughput single nucleotide polymorphism (SNP) genotyping platform for P. falciparum. Results: Using an Affymetrix 3,000 SNP assay array, we found roughly half the assays (1,638) yielded high quality, 100% accurate genotyping calls for both major and minor SNP alleles. Genotype data from 76 global isolates confirm significant genetic differentiation among continental populations and varying levels of SNP diversity and linkage disequilibrium according to geographic location and local epidemiological factors. We further discovered that nonsynonymous and silent (synonymous or noncoding) SNPs differ with respect to within-population diversity, interpopulation differentiation, and the degree to which allele frequencies are correlated between populations. Conclusions: The distinct population profile of nonsynonymous variants indicates that natural selection has a significant influence on genomic diversity in P. falciparum, and that many of these changes may reflect functional variants deserving of follow-up study. Our analysis demonstrates the potential for new high-throughput genotyping technologies to enhance studies of population structure, natural selection, and ultimately enable genome-wide association studies in P. falciparum to find genes underlying key phenotypic traits.
Abstract:
This paper studies semistability of the recursive Kalman filter in the context of linear time-varying (LTV), possibly nondetectable systems with incorrect noise information. Semistability is a key property, as it ensures that the actual estimation error does not diverge exponentially. We explore structural properties of the filter to obtain a necessary and sufficient condition for the filter to be semistable. The condition does not involve limiting gains nor the solution of Riccati equations, as they can be difficult to obtain numerically and may not exist. We also compare semistability with the notions of stability and stability w.r.t. the initial error covariance, and we show that semistability in a sense makes no distinction between persistent and nonpersistent incorrect noise models, as opposed to stability. In the linear time invariant scenario we obtain algebraic, easy to test conditions for semistability and stability, which complement results available in the context of detectable systems. Illustrative examples are included.
Abstract:
The most popular algorithms for blind equalization are the constant-modulus algorithm (CMA) and the Shalvi-Weinstein algorithm (SWA). It is well known that SWA presents a higher convergence rate than CMA, at the expense of higher computational complexity. If the forgetting factor is not sufficiently close to one, if the initialization is distant from the optimal solution, or if the signal-to-noise ratio is low, SWA can converge to undesirable local minima or even diverge. In this paper, we show that divergence can be caused by an inconsistency in the nonlinear estimate of the transmitted signal, or (when the algorithm is implemented in finite precision) by the loss of positiveness of the estimate of the autocorrelation matrix, or by a combination of both. In order to avoid the first cause of divergence, we propose a dual-mode SWA. In the first mode of operation, the new algorithm works as SWA; in the second mode, it rejects inconsistent estimates of the transmitted signal. Assuming the persistence of excitation condition, we present a deterministic stability analysis of the new algorithm. To avoid the second cause of divergence, we propose a dual-mode lattice SWA, which is stable even in finite-precision arithmetic, and has a computational complexity that increases linearly with the number of adjustable equalizer coefficients. The good performance of the proposed algorithms is confirmed through numerical simulations.
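For orientation, the baseline CMA mentioned in the abstract above can be sketched as a single stochastic-gradient step. This is the common textbook CMA(2,2) form, not the dual-mode SWA the paper proposes (the abstract gives no equations for it); the function name, the output convention y = sum of w_k u_k, and the step size are assumptions.

```python
def cma_update(weights, regressor, mu, r2=1.0):
    """One constant-modulus (CMA 2-2) stochastic-gradient step.

    weights, regressor: equal-length lists of complex numbers.
    mu: step size. r2: dispersion constant of the transmitted constellation.
    Returns the updated weights and the equalizer output."""
    # Equalizer output for the current regressor.
    y = sum(w * u for w, u in zip(weights, regressor))
    # Error term: zero when the output already has the desired modulus.
    e = (r2 - abs(y) ** 2) * y
    new_weights = [w + mu * e * u.conjugate()
                   for w, u in zip(weights, regressor)]
    return new_weights, y

# Output modulus too large: the step shrinks the tap.
w, y = cma_update([1 + 0j], [2 + 0j], mu=0.01)
print(w, y)

# Output already on the unit circle: the tap is left unchanged.
w, y = cma_update([1 + 0j], [1 + 0j], mu=0.01)
print(w, y)
```

The step follows the negative gradient of (|y|^2 - r2)^2 with respect to the conjugate weights. SWA replaces this simple step with a recursion that involves an estimate of the autocorrelation matrix, which is where the finite-precision issue described in the abstract arises.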
Abstract:
Far too often, phenotypic divergence has been misinterpreted as genetic divergence, and based on phenotypic divergence, genetic divergence has been indicated. We have attempted to disprove this statement and call for the differentiation of phenotypic and genotypic variation.
Abstract:
Apiomorpha Rubsaamen (Hemiptera: Coccoidea: Eriococcidae) is one of the most chromosomally diverse of all animal genera. There is extensive karyotypic variation within many of the morphologically defined species, including A. munita (Schrader) which is here reported to have diploid chromosome counts ranging from 6 to more than 100. Each of the three morphologically defined subspecies of A. munita also displays considerable chromosomal variation: A. m. tereticornuta Gullan (2n =6, 8, 20, 22 or 24), A. m. malleensis Gullan (2n =6, 20, 22, 24 or 26), and A. m. munita (Schrader) (2n=54 or >100). Apiomorpha munita appears to occur only on eucalypts of the informal subgenus Symphyomyrtus, with each of the subspecies of A. munita restricted to discrete symphyomyrt sections. Several different karyotypic forms within each subspecies of A. munita appear to be restricted to only one or a few eucalypt species or series. The association between apparent host specificity and chromosomal rearrangements in A. munita suggests that both may be playing an active role in taxon divergence in Apiomorpha. (C) 2001 The Linnean Society of London.
Abstract:
The increase in women's purchasing power has led some companies to adopt product differentiation strategies and to produce products aimed specifically at women. The auto industry is not immune to this phenomenon, since women account for approximately half of automobile sales in the country. Considering the differences in consumption and behavior between women and men, the following question was posed: are there differences between the choices that men and women associate with the automobile? Participants were presented with items found in people's daily lives and valued by them, and were asked to choose items and associate them with the automobile. Analysis of the results revealed more similarities than differences between the choices men and women associate with the automobile. The similarity between the choices suggests that the representations, meanings and values assigned to the car by men and women are similar, and thus that the strategy of product differentiation does not apply to the automotive industry.
Abstract:
We have measured nucleotide variation in the CLOCK/CYCLE heterodimer inhibition domain (CCID) of the X-linked clock gene period in seven species belonging to the Drosophila buzzatii cluster, namely D. buzzatii, Drosophila koepferae, Drosophila antonietae, Drosophila serido, Drosophila gouveai, Drosophila seriema and Drosophila borborema. We found that purifying selection is the main force driving sequence evolution in period, in agreement with the important role of the CCID in the clock machinery. Our survey revealed that period provides valuable phylogenetic information, which allowed us to resolve the phylogenetic relationships among D. gouveai, D. borborema and D. seriema, which formed a polytomic clade in preliminary studies. The analysis of patterns of intraspecific variation revealed two different lineages of period in D. koepferae, probably reflecting introgressive hybridization from D. buzzatii, in concordance with previous molecular data.
Abstract:
Drosophila antonietae and Drosophila gouveai are allopatric, cactophilic, cryptic species endemic to South America, whose aedeagus morphology is considered the main diagnostic character. In this work, single close populations from the edge of each species' distribution, located in an "introgressive corridor", were analyzed for temporal isoenzymatic genetic variability. Isocitrate dehydrogenase (Idh) appeared as a diagnostic locus between D. antonietae and D. gouveai because each population was fixed for different alleles. Moreover, several polymorphic loci showed accentuated divergence in allele frequency, as evidenced by Nei's I (0.3188) and D (1.1432), and also by Reynolds' genetic distance and identity (1.3207 and 0.7331, respectively). Our results showed that, in spite of the very similar external morphology, related evolutionary histories, close distributions, and events of introgression in the studied area, these cryptic species have high allozymatic differentiation, and this is discussed here. (C) 2010 Elsevier Ltd. All rights reserved.
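The two Nei statistics quoted in the abstract above are linked by Nei's standard relation D = -ln(I), which the reported values satisfy. A minimal sketch, assuming Nei's standard genetic identity over loci; the allele frequencies in the usage example are hypothetical and are not data from the study.

```python
import math

def nei_identity(pop_x, pop_y):
    """Nei's genetic identity I over loci.

    pop_x, pop_y: one list of allele frequencies per locus, with alleles
    in the same order in both populations."""
    loci = len(pop_x)
    j_x = sum(sum(f * f for f in locus) for locus in pop_x) / loci
    j_y = sum(sum(f * f for f in locus) for locus in pop_y) / loci
    j_xy = sum(sum(a * b for a, b in zip(lx, ly))
               for lx, ly in zip(pop_x, pop_y)) / loci
    return j_xy / math.sqrt(j_x * j_y)

def nei_distance(identity):
    """Nei's standard genetic distance D = -ln(I)."""
    return -math.log(identity)

# Consistency check on the values reported in the abstract.
print(round(nei_distance(0.3188), 4))  # 1.1432

# Hypothetical two-locus example: locus 1 is fixed for different alleles
# (a diagnostic locus), locus 2 shares alleles at different frequencies.
x = [[1.0, 0.0], [0.7, 0.3]]
y = [[0.0, 1.0], [0.4, 0.6]]
identity = nei_identity(x, y)
print(identity, nei_distance(identity))
```

A locus fixed for different alleles contributes nothing to the cross term, so diagnostic loci such as Idh pull I down and D up.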
Abstract:
Aim: The aim of this study was to assess the causal mechanisms underlying populational subdivision in Drosophila gouveai, a cactophilic species associated with xeric vegetation enclaves in eastern Brazil. A secondary aim was to investigate the genetic effects of Pleistocene climatic fluctuations on these environments. Location: Dry vegetation enclaves within the limits of the Cerrado domain in eastern Brazil. Methods: We determined the mitochondrial DNA haplotypes of 55 individuals (representing 12 populations) based on sequence data of a 483-bp fragment from the cytochrome c oxidase subunit II (COII) gene. Phylogenetic and coalescent analyses were used to test for the occurrence of demographic events and to infer the time of divergence amongst genetically independent groups. Results: Our analyses revealed the existence of two divergent subclades (G1 and G2) plus an introgressed clade restricted to the southernmost range of D. gouveai. Subclades G1 and G2 displayed genetic footprints of range expansion and segregated geographical distributions in south-eastern and some central highland regions, east and west of the Paraná River valley. Molecular dating indicated that the main demographic and diversification events occurred in the late to middle Pleistocene. Main conclusions: The phylogeographical and genetic patterns observed for D. gouveai in this study are consistent with changes in the distribution of dry vegetation in eastern Brazil. All of the estimates obtained by molecular dating indicate that range expansion and isolation pre-dated the Last Glacial Maximum, occurring during the late to middle Pleistocene, and were probably triggered by climatic changes during the Pleistocene. The current patchy geographical distribution and population subdivision in D. gouveai is apparently closely linked to these past events.
Abstract:
Theory predicts that in small isolated populations random genetic drift can lead to phenotypic divergence; however this prediction has rarely been tested quantitatively in natural populations. Here we utilize natural repeated island colonization events by members of the avian species complex, Zosterops lateralis, to assess whether or not genetic drift alone is an adequate explanation for the observed patterns of microevolutionary divergence in morphology. Morphological and molecular genetic characteristics of island and mainland populations are compared to test three predictions of drift theory: (1) that the pattern of morphological change is idiosyncratic to each island; (2) that there is concordance between morphological and neutral genetic shifts across island populations; and (3) for populations whose time of colonization is known, that the rate of morphological change is sufficiently slow to be accounted for solely by genetic drift. Our results are not consistent with these predictions. First, the direction of size shifts was consistently towards larger size, suggesting the action of a nonrandom process. Second, patterns of morphological divergence among recently colonized populations showed little concordance with divergence in neutral genetic characters. Third, rate tests of morphological change showed that effective population sizes were not small enough for random processes alone to account for the magnitude of microevolutionary change. Altogether, these three lines of evidence suggest that drift alone is not an adequate explanation of morphological differentiation in recently colonized island Zosterops and therefore we suggest that the observed microevolutionary changes are largely a result of directional natural selection.