21 results for Distances between Predicates

in Consorci de Serveis Universitaris de Catalunya (CSUC), Spain


Relevance:

100.00% 100.00%

Publisher:

Abstract:

We give a 5-approximation algorithm for the rooted Subtree-Prune-and-Regraft (rSPR) distance between two phylogenies, a problem recently shown to be NP-complete by Bordewich and Semple [5]. This paper presents the first approximation result for this important tree distance. The algorithm follows a standard format for tree distances, such as Rodrigues et al. [24] and Hein et al. [13]. The novel ideas are in the analysis, where the cost of the algorithm is bounded with a "cascading" scheme that accounts for possible wrong moves. This accounting is missing from previous analyses of tree distance approximation algorithms. Further, we show how all algorithms of this type can be implemented in linear time and give experimental results.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

The measurement of social polarization has received little attention in the literature. The only social polarization index that has been used to measure religious or ethnic polarization (the RQ index) has several shortcomings that are critically discussed in the paper. In particular, that index does not take into account the existing distances between and within different groups. A couple of axiomatically characterized social polarization indices that overcome these limitations are presented. In the empirical section we show that the rankings of countries according to their levels of polarization change to a great extent when we replace the RQ index with the indices presented in this paper.
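For orientation, the RQ index mentioned in this abstract is usually written as RQ = 4 Σ π_i² (1 − π_i), where π_i is the population share of group i. The sketch below assumes that standard form (the abstract itself does not state the formula) and uses hypothetical shares; it makes the criticized property visible, since only group sizes enter and no inter-group distance does.

```python
def rq_index(shares):
    """RQ polarization index, assuming the usual form 4 * sum(p^2 * (1 - p)).

    `shares` are population shares that sum to 1. No distance between
    groups appears anywhere in the formula.
    """
    return 4.0 * sum(p * p * (1.0 - p) for p in shares)


# Hypothetical examples: two equal groups maximize the index.
two_equal_groups = rq_index([0.5, 0.5])
single_group = rq_index([1.0])
three_groups = rq_index([0.5, 0.3, 0.2])
```

Because the index depends on shares alone, two societies with the same shares but very different cultural distances between groups receive the same value.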

Relevance:

90.00% 90.00%

Publisher:

Abstract:

Rotation distance quantifies the difference in shape between two rooted binary trees of the same size by counting the minimum number of elementary changes needed to transform one tree to the other. We describe several types of rotation distance, and provide upper bounds on distances between trees with a fixed number of nodes with respect to each type. These bounds are obtained by relating each restricted rotation distance to the word length of elements of Thompson's group F with respect to different generating sets, including both finite and infinite generating sets.
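As a small illustration of the definition above, the sketch below computes the ordinary (unrestricted) rotation distance by breadth-first search over all single rotations. It is a brute-force illustration for very small trees only, not the method of the paper, and the tuple encoding of trees is an assumption made here: a leaf is `None` and an internal node is a pair `(left, right)`.

```python
from collections import deque


def neighbors(tree):
    """Yield every tree obtained by one rotation at any internal node."""
    if tree is None:
        return
    left, right = tree
    if left is not None:
        # Right rotation at this node: ((a, b), c) -> (a, (b, c)).
        a, b = left
        yield (a, (b, right))
    if right is not None:
        # Left rotation at this node: (a, (b, c)) -> ((a, b), c).
        b, c = right
        yield ((left, b), c)
    # Rotations performed deeper inside either subtree.
    for new_left in neighbors(left):
        yield (new_left, right)
    for new_right in neighbors(right):
        yield (left, new_right)


def rotation_distance(source, target):
    """Minimum number of rotations turning `source` into `target`."""
    seen = {source}
    queue = deque([(source, 0)])
    while queue:
        tree, dist = queue.popleft()
        if tree == target:
            return dist
        for nxt in neighbors(tree):
            if nxt not in seen:
                seen.add(nxt)
                queue.append((nxt, dist + 1))
    raise ValueError("trees do not have the same number of nodes")


# Two trees with three internal nodes each: a left comb and a right comb.
left_comb = (((None, None), None), None)
right_comb = (None, (None, (None, None)))
comb_distance = rotation_distance(left_comb, right_comb)
```

The search space grows with the Catalan numbers, so this approach is only practical for a handful of nodes; the restricted variants discussed in the abstract would allow rotations at fewer positions.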

Relevance:

80.00% 80.00%

Publisher:

Abstract:

Research project carried out by a secondary school student and awarded a CIRIT Prize for fostering the scientific spirit of young people in 2009. The project is based on experimentation and on the subsequent collection and analysis of results from the experiment that produces Liesegang rings. This experiment, in which a compound precipitates in a gel medium and forms rings spaced logarithmically from one another, has been investigated by a great many scientists for more than a century, none of whom has found a logical and reasonable explanation for this unusual behaviour. The author set out to recreate these curious rings, trying to form them with inhibitors and compounds different from those found in the literature. After more than thirty experiments, an exhaustive analysis of the results was carried out. This part was one of the most rewarding, since it produced surprising comparisons and very curious findings, such as the similarity between Liesegang rings and Turing structures, which seek to explain the patterns present in the eyespots of living beings, and the appearance of Liesegang rings depending on the visual optics, an effect absent from the extensive literature consulted. In addition, a series of studies was carried out: one that confirms the logarithmic distances between the rings and compares the empirical data with the mathematical pattern, and another that examines the behaviour of the rings when the factors governing the reaction rate are varied.

Relevance:

80.00% 80.00%

Publisher:

Abstract:

In this paper, we analyse the asymptotic behavior of solutions of the continuous kinetic version of the flocking model of Cucker and Smale [16], which describes the collective behavior of an ensemble of organisms, animals or devices. This kinetic version, introduced in [24], is obtained here starting from a Boltzmann-type equation. The large-time behavior of the distribution in phase space is subsequently studied by means of particle approximations and a stability property in distances between measures. A continuous analogue of the theorems of [16] is shown to hold for the solutions of the kinetic model. More precisely, the velocities of the solutions concentrate exponentially fast around their mean, while in space the solutions converge towards a translational flocking solution.
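The flocking behaviour described above can be seen in a small particle simulation. The sketch below is an illustration only, not the kinetic analysis of the paper: it assumes the commonly cited Cucker-Smale communication rate 1 / (1 + r²)^β, works in one space dimension, and uses explicit Euler steps with hypothetical initial data and parameters.

```python
def cucker_smale_step(x, v, dt, beta=0.25):
    """One explicit Euler step of a 1-D Cucker-Smale particle system.

    Each velocity relaxes towards the others with weight
    1 / (1 + distance^2)^beta (an assumed, commonly used kernel).
    """
    n = len(x)
    new_v = []
    for i in range(n):
        pull = sum(
            (v[j] - v[i]) / (1.0 + (x[j] - x[i]) ** 2) ** beta
            for j in range(n)
        ) / n
        new_v.append(v[i] + dt * pull)
    new_x = [xi + dt * vi for xi, vi in zip(x, v)]
    return new_x, new_v


def velocity_spread(v):
    """Largest deviation of any velocity from the mean velocity."""
    mean = sum(v) / len(v)
    return max(abs(vi - mean) for vi in v)


# Hypothetical initial positions and velocities for five particles.
x = [0.0, 1.0, 2.0, 3.0, 4.0]
v = [1.0, -1.0, 0.5, -0.5, 2.0]
initial_spread = velocity_spread(v)
initial_mean = sum(v) / len(v)

for _ in range(2000):
    x, v = cucker_smale_step(x, v, dt=0.05)

final_spread = velocity_spread(v)
final_mean = sum(v) / len(v)
```

Because the kernel is symmetric, the mean velocity is conserved while the spread around it shrinks, which is the discrete counterpart of the concentration of velocities stated in the abstract.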

Relevance:

80.00% 80.00%

Publisher:

Abstract:

This paper examines the impact of ethnic divisions on conflict. The analysis relies on a theoretical model of conflict (Esteban and Ray, 2010) in which equilibrium conflict is shown to be accurately described by a linear function of just three distributional indices of ethnic diversity: the Gini coefficient, the Hirschman-Herfindahl fractionalization index, and a measure of polarization. Based on a dataset constructed by James Fearon and data from Ethnologue on ethno-linguistic groups and the "linguistic distances" between them, we compute the three distribution indices. Our results show that ethnic polarization is a highly significant correlate of conflict. Fractionalization is also significant in some of the statistical exercises, but the Gini coefficient never is. In particular, inter-group distances computed from language and embodied in polarization measures turn out to be extremely important correlates of ethnic conflict.
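The three distributional indices named above can be computed from group shares and a matrix of inter-group distances. The sketch below assumes the usual forms F = Σ n_i (1 − n_i), G = Σ_i Σ_j n_i n_j d_ij and P = Σ_i Σ_j n_i² n_j d_ij; the abstract does not spell these formulas out, and the shares and distances used here are hypothetical.

```python
def diversity_indices(shares, dist):
    """Return (fractionalization, Gini, polarization) for group shares.

    `shares[i]` is the population share of group i and `dist[i][j]` the
    distance between groups i and j (zero on the diagonal). The formulas
    are the assumed standard forms named in the text above.
    """
    n = len(shares)
    frac = sum(s * (1.0 - s) for s in shares)
    gini = sum(shares[i] * shares[j] * dist[i][j]
               for i in range(n) for j in range(n))
    polar = sum(shares[i] ** 2 * shares[j] * dist[i][j]
                for i in range(n) for j in range(n))
    return frac, gini, polar


# Hypothetical example: three groups, every pair at distance 1.
shares = [0.5, 0.3, 0.2]
binary_dist = [[0.0 if i == j else 1.0 for j in range(3)] for i in range(3)]
frac, gini, polar = diversity_indices(shares, binary_dist)
```

With all distances equal to 1 the Gini coefficient collapses to fractionalization, so it is the linguistic distances that let the three indices carry different information.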

Relevance:

80.00% 80.00%

Publisher:

Abstract:

The study analyses and compares the environmental perceptions and knowledge that primary school students in the Maya municipality of Felipe Carrillo Puerto have of the neighbouring Sian Ka'an Biosphere Reserve (RBSK) in terms of biodiversity, in order to evaluate and design Environmental Education (EA) programmes aimed at promoting the protection of this natural area. Drawings, questionnaires and surveys collected in three Maya communities that differ in their proximity to the RBSK and in their level of urbanization are analysed. The results indicate that the children are generally unfamiliar with the RBSK and with the biodiversity of the area. To increase the students' environmental knowledge, the study proposes strengthening experiential knowledge, maintaining the Maya language and culture and their contact with nature, and fostering teachers' affinity for environmental topics and for the RBSK.

Relevance:

80.00% 80.00%

Publisher:

Abstract:

This paper introduces local distance-based generalized linear models. These models extend (weighted) distance-based linear models, first through the generalized linear model concept and then by localizing. Distances between individuals are the only predictor information needed to fit these models. They are therefore applicable to mixed (qualitative and quantitative) explanatory variables, or when the regressor is of functional type. Models can be fitted and analysed with the R package dbstats, which implements several distance-based prediction methods.

Relevance:

80.00% 80.00%

Publisher:

Abstract:

This article defines new three-dimensional indices for describing molecules, based on parameters derived from Molecular Similarity Theory and on the Euclidean distances between atoms and the effective atomic charges. These indices, called 3D, were applied to the study of structure-property relationships in a family of hydrocarbons, and they described three properties of the family (boiling point, melting point and density) much more accurately than the classical 2D indices.

Relevance:

80.00% 80.00%

Publisher:

Abstract:

The statistical analysis of literary style is the part of stylometry that compares measurable characteristics in a text that are rarely controlled by the author with those in other texts. When the goal is to settle authorship questions, these characteristics should relate to the author's style and not to the genre, epoch or editor, and they should be such that their variation between authors is larger than the variation within comparable texts from the same author. For an overview of the literature on stylometry and some of the techniques involved, see for example Mosteller and Wallace (1964, 82), Herdan (1964), Morton (1978), Holmes (1985), Oakes (1998) or Lebart, Salem and Berry (1998). Tirant lo Blanc, a chivalry book, is the main work in Catalan literature, and it was hailed as "the best book of its kind in the world" by Cervantes in Don Quixote. Considered by writers like Vargas Llosa or Damaso Alonso to be the first modern novel in Europe, it has been translated several times into Spanish, Italian and French, with modern English translations by Rosenthal (1996) and La Fontaine (1993). The main body of this book was written between 1460 and 1465, but it was not printed until 1490. There has been an intense and long-lasting debate around its authorship, sprouting from its first edition, whose introduction states that the whole book is the work of Martorell (1413?-1468), while at the end it is stated that the last quarter of the book is by Galba (?-1490), written after the death of Martorell. Some of the authors that support the theory of single authorship are Riquer (1990), Chiner (1993) and Badia (1993), while some of those supporting double authorship are Riquer (1947), Coromines (1956) and Ferrando (1995). For an overview of this debate, see Riquer (1990). Neither of the two candidate authors left any text comparable to the one under study, and therefore discriminant analysis cannot be used to help classify chapters by author. By using sample texts encompassing about ten percent of the book, and looking at word length and at the use of 44 conjunctions, prepositions and articles, Ginebra and Cabos (1998) detect heterogeneities that might indicate the existence of two authors. By analyzing the diversity of the vocabulary, Riba and Ginebra (2000) estimate that stylistic boundary to be near chapter 383. Following the lead of the extensive literature, this paper looks into word length, the use of the most frequent words and the use of vowels in each chapter of the book. Given that the features selected are categorical, this leads to three contingency tables of ordered rows and therefore to three sequences of multinomial observations. Section 2 explores these sequences graphically, observing a clear shift in their distribution. Section 3 describes the problem of estimating a sudden change-point in those sequences. In the following sections we propose various ways to estimate change-points in multinomial sequences: the method in Section 4 involves fitting models for polytomous data; the one in Section 5 fits gamma models onto the sequence of chi-square distances between each row profile and the average profile; the one in Section 6 fits models onto the sequence of values taken by the first component of the correspondence analysis, as well as onto sequences of other summary measures like the average word length. In Section 7 we fit models onto the marginal binomial sequences to identify the features that distinguish the chapters before and after that boundary. Most methods rely heavily on the use of generalized linear models.
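The chi-square distance between a row profile and the average profile, as used in correspondence analysis, can be sketched as follows. The counts below are hypothetical and stand in for a chapters-by-category contingency table; the gamma models that the paper fits to the resulting sequence are not reproduced here.

```python
def chi_square_distances(table):
    """Chi-square distance from each row profile to the average profile.

    `table` is a contingency table given as a list of rows of counts.
    A row profile is the row divided by its total; the average profile
    is the vector of column totals divided by the grand total.
    """
    total = sum(sum(row) for row in table)
    n_cols = len(table[0])
    average = [sum(row[j] for row in table) / total for j in range(n_cols)]
    distances = []
    for row in table:
        row_total = sum(row)
        profile = [count / row_total for count in row]
        squared = sum((p - a) ** 2 / a for p, a in zip(profile, average))
        distances.append(squared ** 0.5)
    return distances


# Hypothetical counts: two chapters, two word-length categories.
shifted = chi_square_distances([[30, 10], [10, 30]])
identical = chi_square_distances([[10, 20], [10, 20]])
```

A sudden change in style would show up as a shift in the level of this sequence of distances along the ordered chapters.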

Relevance:

80.00% 80.00%

Publisher:

Abstract:

Graphical displays which show inter-sample distances are important for the interpretation and presentation of multivariate data. Except when the displays are two-dimensional, however, they are often difficult to visualize as a whole. A device, based on multidimensional unfolding, is described for presenting some intrinsically high-dimensional displays in fewer, usually two, dimensions. This goal is achieved by representing each sample by a pair of points, say $R_i$ and $r_i$, so that a theoretical distance between the $i$-th and $j$-th samples is represented twice, once by the distance between $R_i$ and $r_j$ and once by the distance between $R_j$ and $r_i$. Self-distances between $R_i$ and $r_i$ need not be zero. The mathematical conditions for unfolding to exhibit symmetry are established. Algorithms for finding approximate fits, not constrained to be symmetric, are discussed and some examples are given.

Relevance:

80.00% 80.00%

Publisher:

Abstract:

When the behaviour of a specific hypothesis test statistic is studied by a Monte Carlo experiment, the usual way to describe its quality is by giving the empirical level of the test. As an alternative to this procedure, we use the empirical distribution of the obtained p-values and exploit its information both graphically and numerically.
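The contrast between the two summaries can be sketched with a small simulation. The example below is an illustration under assumptions made here, not the procedure of the paper: it uses a two-sided z-test with known variance, data simulated under the null hypothesis, and a Kolmogorov-type gap between the empirical distribution of the p-values and the uniform distribution as one possible numerical summary.

```python
import math
import random


def z_test_p_value(sample, mu0=0.0, sigma=1.0):
    """Two-sided p-value of a z-test for the mean with known sigma."""
    n = len(sample)
    z = (sum(sample) / n - mu0) / (sigma / math.sqrt(n))
    # 2 * (1 - Phi(|z|)) equals erfc(|z| / sqrt(2)).
    return math.erfc(abs(z) / math.sqrt(2.0))


def simulate_p_values(n_reps=2000, n_obs=20, seed=0):
    """Monte Carlo p-values with data drawn under the null hypothesis."""
    rng = random.Random(seed)
    return [
        z_test_p_value([rng.gauss(0.0, 1.0) for _ in range(n_obs)])
        for _ in range(n_reps)
    ]


def empirical_level(p_values, alpha=0.05):
    """Share of replications rejecting at level alpha."""
    return sum(p <= alpha for p in p_values) / len(p_values)


def max_ecdf_gap(p_values):
    """Largest gap between the ECDF of the p-values and the uniform CDF."""
    ps = sorted(p_values)
    n = len(ps)
    return max(max((i + 1) / n - p, p - i / n) for i, p in enumerate(ps))


p_values = simulate_p_values()
level = empirical_level(p_values)
gap = max_ecdf_gap(p_values)
```

The empirical level looks at a single point of the distribution, whereas the gap uses all p-values at once, so it can reveal distortions away from the nominal level that the single number hides.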

Relevance:

80.00% 80.00%

Publisher:

Abstract:

In this paper we examine whether access to markets had a significant influence on the migration choices of Spanish internal migrants in the inter-war years. We perform a structural contrast of a New Economic Geography model that focuses on the forward linkage between workers' location choice and the geography of industrial production, one of the centripetal forces that drive agglomeration in NEG models. The results highlight the presence of this forward linkage in the Spanish economy of the inter-war period. That is, we prove the existence of a direct relation between workers' localization decisions and the market potential of the host regions. In addition, the direct estimation of the values associated with key parameters in the NEG model allows us to simulate the migratory flows derived from different scenarios of the relative size of regions and the distances between them. We show that in Spain the power of attraction of the agglomerations grew as they increased in size, but the high elasticity estimated for the migration costs reduced the intensity of the migratory flows. This could help to explain the apparently low intensity of internal migrations in Spain until its upsurge during the 1920s. This also explains the geography of migrations in Spain during this period, which hardly affected the regions furthest from the large industrial agglomerations (i.e., regions such as Andalusia, Estremadura and Castile-La Mancha) but had an intense effect on the provinces nearest to the principal centres of industrial development.

Relevance:

80.00% 80.00%

Publisher:

Abstract:

Distance-based regression is a prediction method that consists of two steps: from the distances between observations we obtain latent variables, which then become the regressors in an ordinary least squares linear model. The distances are computed from the original predictors using a suitable dissimilarity function. Since, in general, the regressors are nonlinearly related to the response, selecting them with the usual F test is not possible. In this work we propose a solution to this predictor selection problem by defining generalized statistical tests and adapting a nonparametric bootstrap method for estimating the p-values. We include a numerical example with automobile insurance data.
