Biblioteca Digital

59 resultados para Subtractive clustering

Tuning clustering in random networks with arbitrary degree distributions

Relevância:

20.00% 20.00%

Publicador:

Resumo:

We present a generator of random networks where both the degree-dependent clustering coefficient and the degree distribution are tunable. Following the same philosophy as in the configuration model, the degree distribution and the clustering coefficient for each class of nodes of degree k are fixed ad hoc and a priori. The algorithm generates corresponding topologies by applying first a closure of triangles and second the classical closure of remaining free stubs. The procedure unveils an universal relation among clustering and degree-degree correlations for all networks, where the level of assortativity establishes an upper limit to the level of clustering. Maximum assortativity ensures no restriction on the decay of the clustering coefficient whereas disassortativity sets a stronger constraint on its behavior. Correlation measures in real networks are seen to observe this structural bound.

Conserved chromosomal clustering of genes governed by chromatin regulators in Drosophila

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Background: The trithorax group (trxG) and Polycomb group (PcG) proteins are responsible for the maintenance of stable transcriptional patterns of many developmental regulators. They bind to specific regions of DNA and direct the post-translational modifications of histones, playing a role in the dynamics of chromatin structure.Results: We have performed genome-wide expression studies of trx and ash2 mutants in Drosophila melanogaster. Using computational analysis of our microarray data, we have identified 25 clusters of genes potentially regulated by TRX. Most of these clusters consist of genes that encode structural proteins involved in cuticle formation. This organization appears to be a distinctive feature of the regulatory networks of TRX and other chromatin regulators, since we have observed the same arrangement in clusters after experiments performed with ASH2, as well as in experiments performed by others with NURF, dMyc, and ASH1. We have also found many of these clusters to be significantly conserved in D. simulans, D. yakuba, D. pseudoobscura and partially in Anopheles gambiae.Conclusion: The analysis of genes governed by chromatin regulators has led to the identification of clusters of functionally related genes conserved in other insect species, suggesting this chromosomal organization is biologically important. Moreover, our results indicate that TRX and other chromatin regulators may act globally on chromatin domains that contain transcriptionally co-regulated genes.

Deciphering the global organization of clustering in real complex networks

Relevância:

20.00% 20.00%

Publicador:

Resumo:

We uncover the global organization of clustering in real complex networks. To this end, we ask whether triangles in real networks organize as in maximally random graphs with given degree and clustering distributions, or as in maximally ordered graph models where triangles are forced into modules. The answer comes by way of exploring m-core landscapes, where the m-core is defined, akin to the k-core, as the maximal subgraph with edges participating in at least m triangles. This property defines a set of nested subgraphs that, contrarily to k-cores, is able to distinguish between hierarchical and modular architectures. We find that the clustering organization in real networks is neither completely random nor ordered although, surprisingly, it is more random than modular. This supports the idea that the structure of real networks may in fact be the outcome of self-organized processes based on local optimization rules, in contrast to global optimization principles.

Clustering of grape yield maps to delineate site-specific management zones

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Zonal management in vineyards requires the prior delineation of stable yield zones within the parcel. Among the different methodologies used for zone delineation, cluster analysis of yield data from several years is one of the possibilities cited in scientific literature. However, there exist reasonable doubts concerning the cluster algorithm to be used and the number of zones that have to be delineated within a field. In this paper two different cluster algorithms have been compared (k-means and fuzzy c-means) using the grape yield data corresponding to three successive years (2002, 2003 and 2004), for a ‘Pinot Noir’ vineyard parcel. Final choice of the most recommendable algorithm has been linked to obtaining a stable pattern of spatial yield distribution and to allowing for the delineation of compact and average sized areas. The general recommendation is to use reclassified maps of two clusters or yield classes (low yield zone and high yield zone) and, consequently, the site-specific vineyard management should be based on the prior delineation of just two different zones or sub-parcels. The two tested algorithms are good options for this purpose. However, the fuzzy c-means algorithm allows for a better zoning of the parcel, forming more compact areas and with more equilibrated zonal differences over time.

Anonymizing Graphs: Measuring Quality for Clustering

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Peer-reviewed

A new approach to segmentation based on fusing circumscribed contours, region growing and clustering

Relevância:

20.00% 20.00%

Publicador:

Resumo:

One of the major problems in machine vision is the segmentation of images of natural scenes. This paper presents a new proposal for the image segmentation problem which has been based on the integration of edge and region information. The main contours of the scene are detected and used to guide the posterior region growing process. The algorithm places a number of seeds at both sides of a contour allowing stating a set of concurrent growing processes. A previous analysis of the seeds permits to adjust the homogeneity criterion to the regions's characteristics. A new homogeneity criterion based on clustering analysis and convex hull construction is proposed

Are one factor logarithmic volatility models useful to fit the features of financial data? An application to microsoft data.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

This paper provides empirical evidence that continuous time models with one factor of volatility, in some conditions, are able to fit the main characteristics of financial data. It also reports the importance of the feedback factor in capturing the strong volatility clustering of data, caused by a possible change in the pattern of volatility in the last part of the sample. We use the Efficient Method of Moments (EMM) by Gallant and Tauchen (1996) to estimate logarithmic models with one and two stochastic volatility factors (with and without feedback) and to select among them.

Repeated games played in a network

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Delayed perfect monitoring in an infinitely repeated discounted game is modelled by letting the players form a connected and undirected network. Players observe their immediate neighbors' behavior only, but communicate over time the repeated game's history truthfully throughout the network. The Folk Theorem extends to this setup, although for a range of discount factors strictly below 1, the set of sequential equilibria and the corresponding payoff set may be reduced. A general class of games is analyzed without imposing restrictions on the dimensionality of the payoff space. This and the bilateral communication structure allow for limited results under strategic communication only. As a by-product this model produces a network result; namely, the level of cooperation in this setup depends on the network's diameter, and not on its clustering coefficient as in other models.

A nonlinear threshold model for the dependence of extremes of stationary sequences

Relevância:

10.00% 10.00%

Publicador:

Resumo:

One of the main implications of the efficient market hypothesis (EMH) is that expected future returns on financial assets are not predictable if investors are risk neutral. In this paper we argue that financial time series offer more information than that this hypothesis seems to supply. In particular we postulate that runs of very large returns can be predictable for small time periods. In order to prove this we propose a TAR(3,1)-GARCH(1,1) model that is able to describe two different types of extreme events: a first type generated by large uncertainty regimes where runs of extremes are not predictable and a second type where extremes come from isolated dread/joy events. This model is new in the literature in nonlinear processes. Its novelty resides on two features of the model that make it different from previous TAR methodologies. The regimes are motivated by the occurrence of extreme values and the threshold variable is defined by the shock affecting the process in the preceding period. In this way this model is able to uncover dependence and clustering of extremes in high as well as in low volatility periods. This model is tested with data from General Motors stocks prices corresponding to two crises that had a substantial impact in financial markets worldwide; the Black Monday of October 1987 and September 11th, 2001. By analyzing the periods around these crises we find evidence of statistical significance of our model and thereby of predictability of extremes for September 11th but not for Black Monday. These findings support the hypotheses of a big negative event producing runs of negative returns in the first case, and of the burst of a worldwide stock market bubble in the second example. JEL classification: C12; C15; C22; C51 Keywords and Phrases: asymmetries, crises, extreme values, hypothesis testing, leverage effect, nonlinearities, threshold models

Progress Towards to Equity Market Integration in Eastern Europe

Relevância:

10.00% 10.00%

Publicador:

Resumo:

The advent of the European Union has decreased the diversification benefits available from country based equity market indices in the region. This paper measures the increase in stock integration between the three largest new EU members (Hungary, the Czech Republic and Poland who joined in May 2004) and the Euro-zone. A potentially gradual transition in correlations is accommodated in a single VAR model by embedding smooth transition conditional correlation models with fat tails, spillovers, volatility clustering, and asymmetric volatility effects. At the country market index level all three Eastern European markets show a considerable increase in correlations in 2006. At the industry level the dates and transition periods for the correlations differ, and the correlations are lower although also increasing. The results show that sectoral indices in Eastern European markets may provide larger diversification opportunities than the aggregate market. JEL classifications: C32; C51; F36; G15 Keywords: Multivariate GARCH; Smooth Transition Conditional Correlation; Stock Return Comovement; Sectoral correlations; New EU Members

Una aproximació d'aprenentatge automàtic per a extracció d'informació adaptativa

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Les tècniques de clustering poden ajudar a reduir la supervisió en processos d'obtenció de patrons per a Extracció d'Informació. En aquest treball, que abarca un període de 4 anys de recerca, es comença per estudiar la representació de documents més adequada per a la tasca de clustering. Per tal d'evitar els biaixos dels mètodes individuals de clustering, es consideren mètodes de clustering conjunt. S'exploren diversos mètodes de combinació supervisada, i s'hi afegeixen estratègies automàtiques per a determinar el nombre de clusters de la combinació. També es consideren mecanismes per a obtenir clusterings conjunts ponderats, així com estratègies de combinació no supervisada. Finalment, els resultats del clustering s'utilitzen en un sistema d'adquisició de patrons per a substituir els elements de supervisió humana. Totes aquestes estratègies i mètodes s'avaluen en tasques de clustering de documents i adquisició de patrons usant dades reals. Es comprova que els mots com representació de documents superen altres models per a la tasca de clustering, així com que el clustering conjunt supera les limitacions dels clusterings individuals, i que les estratègies no supervisades d'adquisició de patrons obtenen resultats competitius respecte a les estratègies supervisades.

Clúster creatius en entorns urbans: 22@Bcn i Shoreditch East London

Relevância:

10.00% 10.00%

Publicador:

Resumo:

La localització de les empreses de nova economia en zones urbanes, a pesar que el factor distància no sigui important, no deixa de ser considerable pels seus avantatges que els suposa estar situades conjuntament en relació amb les infraestructures, consum, beneficis socioculturals, i facilitat en les transaccions cara a cara. És inevitable que el primer quart del segle vint-i-un estigui lligat a l’economia creativa de forma similar amb que el començament del segle vint estava íntimament lligat a l’economia industrial i la invenció del sistema de producció en massa. La ciutat també va jugar un dels papers més importants per al desenvolupament de “la nova economia industrial” a les albors del segle vint, com ho és la ciutat del coneixement que acull “la nova economia creativa” al segle vint-i-un. És evident que els resultats morfològics, socials, econòmics i urbans són ben diferents en ambdós fenòmens, però l’impacte a les ciutats és molt gran. L’objectiu d’aquest estudi és analitzar els mecanismes d’aglomeració (clustering) d’activitats competitives basades en creació de coneixement i de serveis avançats que estan al darrera de desenvolupaments punters a ciutats com Barcelona, el projecte 22@bcn, i East London, el projecte Shoreditch. L’esforç que han posat les autoritats locals en crear l’entorn apropiat per atreure i crear empreses innovadores, com a motor de desenvolupament d’algunes ciutats modernes europees ha resultat en el sorgiment de nuclis o centres urbans molt dinàmics que suposadament estan preparats i acullen punts de creació de coneixement (“Urban Knowledge Hubs”), amb una demanda i llocs de treball altament qualificats. Aquest és el cas dels projectes de Barcelona (22@bcn) i East London (Shoreditch).

Alineamiento múltiple de secuencias con T-Coffee: una aproximación paralela

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Las aplicaciones de alineamiento múltiple de secuencias son prototipos de aplicaciones que requieren elevada potencia de cómputo y memoria. Se destacan por la relevancia científica que tienen los resultados que brindan a investigaciones científicas en el campo de la biomedicina, genética y farmacología. Las aplicaciones de alineamiento múltiple tienen la limitante de que no son capaces de procesar miles de secuencias, por lo que se hace necesario crear un modelo para resolver la problemática. Analizando el volumen de datos que se manipulan en el área de las ciencias biológica y la complejidad de los algoritmos de alineamiento de secuencias, la única vía de solución del problema es a través de la utilización de entornos de cómputo paralelos y la computación de altas prestaciones. La investigación realizada por nosotros tiene como objetivo la creación de un modelo paralelo que le permita a los algoritmos de alineamiento múltiple aumentar el número de secuencias a procesar, tratando de mantener la calidad en los resultados para garantizar la precisión científica. El modelo que proponemos emplea como base la clusterización de las secuencias de entrada utilizando criterios biológicos que permiten mantener la calidad de los resultados. Además, el modelo se enfoca en la disminución del tiempo de cómputo y consumo de memoria. Para presentar y validar el modelo utilizamos T-Coffee, como plataforma de desarrollo e investigación. El modelo propuesto pudiera ser aplicado a cualquier otro algoritmo de alineamiento múltiple de secuencias.

Soporte geoespacial para modelos de distribución de especies (I): generalización del mapa de vegetación del Montseny apoyada en información de imágenes de satélite

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Los mapas de vegetación son a menudo utilizados como proxis de una estratificación de hábitats para generar distribuciones geográficas contínuas de organismos a partir de datos discretos mediante modelos multi-variantes. Sin embargo, los mapas de vegetación suelen ser poco apropiados para ser directamente aplicados a este fin, pues sus categorías no se concibieron con la intención de corresponder a tipos de hábitat. En este artículo presentamos y aplicamos el método de Agrupamiento por Doble Criterio para generalizar un mapa de vegetación extraordinariamente detallado (350 clases) del Parque Natural del Montseny (Cataluña) en categorías que mantienen la coherencia tanto desde el punto de vista estructural (a través de una matriz de disimilaridad espectral calculada mediante una imágen del satélite SPOT-5) como en términos de vegetación (gracias a una matriz de disimilaridad calculada mediante propiedades de vegetación deducidas de la leyenda jerárquica del mapa). El método simplifica de 114 a 18 clases el 67% del área de estudio. Añadiendo otras agregaciones más triviales basadas exclusivamente en criterios de cubierta de suelo, el 73% del área de estudio pasa de 167 a 25 categorías. Como valor añadido, el método identifica el 10% de los polígonos originales como anómalos (a partir de comparar las propiedades espectrales de cada polígono con el resto de los de su clases), lo que implica cambios en la cubierta entre las fechas del soporte utilizado para generar el mapa original y la imagen de satélite, o errores en la producción de éste.

Industrial district effects and innovation in the Tuscan shipbuilding industry

Relevância:

10.00% 10.00%

Publicador:

Resumo:

The aim of the present work is to investigate innovative processes within a geographical cluster, and thus contribute to the debate on the effects of industrial clusters on innovation capacity. In particular, we would like to ascertain whether the advantages of industrial districts in promoting innovation, as already revealed by literature (diffusion of knowledge, social capital and trust, efficient networking), are also keys to success in the Tuscan shipbuilding industry of pleasure and sporting boats. First, we verify the existence of clusters of shipbuilding in Tuscany, using a specific methodology. Next, in the identified clusters, we analyse three innovative networks financed in a policy to support innovation, and examine whether the typical features of a cluster for promoting innovation are at work, using a questionnaire administered to 71 actors. Finally, we develop a performance analysis of the cluster firms and ascertain whether their different behaviours also lead to different performances. The analysis results show that our case records effects of industrial clustering on innovation capacity, such as the important role given to trust and social capital, the significant worth put in interfirm relations and in each partner’s specific competencies, or even the distinctive performance of firms belonging to a cluster.

«
1
2
3
4
»