935 resultados para Complex data


Relevância:

30.00% 30.00%

Publicador:

Resumo:

The coverage and volume of geo-referenced datasets are extensive and incessantly¦growing. The systematic capture of geo-referenced information generates large volumes¦of spatio-temporal data to be analyzed. Clustering and visualization play a key¦role in the exploratory data analysis and the extraction of knowledge embedded in¦these data. However, new challenges in visualization and clustering are posed when¦dealing with the special characteristics of this data. For instance, its complex structures,¦large quantity of samples, variables involved in a temporal context, high dimensionality¦and large variability in cluster shapes.¦The central aim of my thesis is to propose new algorithms and methodologies for¦clustering and visualization, in order to assist the knowledge extraction from spatiotemporal¦geo-referenced data, thus improving making decision processes.¦I present two original algorithms, one for clustering: the Fuzzy Growing Hierarchical¦Self-Organizing Networks (FGHSON), and the second for exploratory visual data analysis:¦the Tree-structured Self-organizing Maps Component Planes. In addition, I present¦methodologies that combined with FGHSON and the Tree-structured SOM Component¦Planes allow the integration of space and time seamlessly and simultaneously in¦order to extract knowledge embedded in a temporal context.¦The originality of the FGHSON lies in its capability to reflect the underlying structure¦of a dataset in a hierarchical fuzzy way. A hierarchical fuzzy representation of¦clusters is crucial when data include complex structures with large variability of cluster¦shapes, variances, densities and number of clusters. The most important characteristics¦of the FGHSON include: (1) It does not require an a-priori setup of the number¦of clusters. (2) The algorithm executes several self-organizing processes in parallel.¦Hence, when dealing with large datasets the processes can be distributed reducing the¦computational cost. (3) Only three parameters are necessary to set up the algorithm.¦In the case of the Tree-structured SOM Component Planes, the novelty of this algorithm¦lies in its ability to create a structure that allows the visual exploratory data analysis¦of large high-dimensional datasets. This algorithm creates a hierarchical structure¦of Self-Organizing Map Component Planes, arranging similar variables' projections in¦the same branches of the tree. Hence, similarities on variables' behavior can be easily¦detected (e.g. local correlations, maximal and minimal values and outliers).¦Both FGHSON and the Tree-structured SOM Component Planes were applied in¦several agroecological problems proving to be very efficient in the exploratory analysis¦and clustering of spatio-temporal datasets.¦In this thesis I also tested three soft competitive learning algorithms. Two of them¦well-known non supervised soft competitive algorithms, namely the Self-Organizing¦Maps (SOMs) and the Growing Hierarchical Self-Organizing Maps (GHSOMs); and the¦third was our original contribution, the FGHSON. Although the algorithms presented¦here have been used in several areas, to my knowledge there is not any work applying¦and comparing the performance of those techniques when dealing with spatiotemporal¦geospatial data, as it is presented in this thesis.¦I propose original methodologies to explore spatio-temporal geo-referenced datasets¦through time. Our approach uses time windows to capture temporal similarities and¦variations by using the FGHSON clustering algorithm. The developed methodologies¦are used in two case studies. In the first, the objective was to find similar agroecozones¦through time and in the second one it was to find similar environmental patterns¦shifted in time.¦Several results presented in this thesis have led to new contributions to agroecological¦knowledge, for instance, in sugar cane, and blackberry production.¦Finally, in the framework of this thesis we developed several software tools: (1)¦a Matlab toolbox that implements the FGHSON algorithm, and (2) a program called¦BIS (Bio-inspired Identification of Similar agroecozones) an interactive graphical user¦interface tool which integrates the FGHSON algorithm with Google Earth in order to¦show zones with similar agroecological characteristics.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Alloreactive T cells are thought to be a potentially rich source of high-avidity T cells with therapeutic potential since tolerance to self-Ags is restricted to self-MHC recognition. Given the particularly high frequency of alloreactive T cells in the peripheral immune system, we used numerous MHC class I multimers to directly visualize and isolate viral and tumor Ag-specific alloreactive CD8 T cells. In fact, all but one specificities screened were undetectable in ex vivo labeling. In this study, we report the occurrence of CD8 T cells specifically labeled with allo-HLA-A*0201/Melan-A/MART-1(26-35) multimers at frequencies that are in the range of 10(-4) CD8 T cells and are thus detectable ex vivo by flow cytometry. We report the thymic generation and shaping of tumor Ag-specific, alloreactive T cells as well as their fate once seeded in the periphery. We show that these cells resemble their counterparts in HLA-A*0201-positive individuals, based on their structural and functional attributes.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

With the trend in molecular epidemiology towards both genome-wide association studies and complex modelling, the need for large sample sizes to detect small effects and to allow for the estimation of many parameters within a model continues to increase. Unfortunately, most methods of association analysis have been restricted to either a family-based or a case-control design, resulting in the lack of synthesis of data from multiple studies. Transmission disequilibrium-type methods for detecting linkage disequilibrium from family data were developed as an effective way of preventing the detection of association due to population stratification. Because these methods condition on parental genotype, however, they have precluded the joint analysis of family and case-control data, although methods for case-control data may not protect against population stratification and do not allow for familial correlations. We present here an extension of a family-based association analysis method for continuous traits that will simultaneously test for, and if necessary control for, population stratification. We further extend this method to analyse binary traits (and therefore family and case-control data together) and accurately to estimate genetic effects in the population, even when using an ascertained family sample. Finally, we present the power of this binary extension for both family-only and joint family and case-control data, and demonstrate the accuracy of the association parameter and variance components in an ascertained family sample.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The proportion of population living in or around cites is more important than ever. Urban sprawl and car dependence have taken over the pedestrian-friendly compact city. Environmental problems like air pollution, land waste or noise, and health problems are the result of this still continuing process. The urban planners have to find solutions to these complex problems, and at the same time insure the economic performance of the city and its surroundings. At the same time, an increasing quantity of socio-economic and environmental data is acquired. In order to get a better understanding of the processes and phenomena taking place in the complex urban environment, these data should be analysed. Numerous methods for modelling and simulating such a system exist and are still under development and can be exploited by the urban geographers for improving our understanding of the urban metabolism. Modern and innovative visualisation techniques help in communicating the results of such models and simulations. This thesis covers several methods for analysis, modelling, simulation and visualisation of problems related to urban geography. The analysis of high dimensional socio-economic data using artificial neural network techniques, especially self-organising maps, is showed using two examples at different scales. The problem of spatiotemporal modelling and data representation is treated and some possible solutions are shown. The simulation of urban dynamics and more specifically the traffic due to commuting to work is illustrated using multi-agent micro-simulation techniques. A section on visualisation methods presents cartograms for transforming the geographic space into a feature space, and the distance circle map, a centre-based map representation particularly useful for urban agglomerations. Some issues on the importance of scale in urban analysis and clustering of urban phenomena are exposed. A new approach on how to define urban areas at different scales is developed, and the link with percolation theory established. Fractal statistics, especially the lacunarity measure, and scale laws are used for characterising urban clusters. In a last section, the population evolution is modelled using a model close to the well-established gravity model. The work covers quite a wide range of methods useful in urban geography. Methods should still be developed further and at the same time find their way into the daily work and decision process of urban planners. La part de personnes vivant dans une région urbaine est plus élevé que jamais et continue à croître. L'étalement urbain et la dépendance automobile ont supplanté la ville compacte adaptée aux piétons. La pollution de l'air, le gaspillage du sol, le bruit, et des problèmes de santé pour les habitants en sont la conséquence. Les urbanistes doivent trouver, ensemble avec toute la société, des solutions à ces problèmes complexes. En même temps, il faut assurer la performance économique de la ville et de sa région. Actuellement, une quantité grandissante de données socio-économiques et environnementales est récoltée. Pour mieux comprendre les processus et phénomènes du système complexe "ville", ces données doivent être traitées et analysées. Des nombreuses méthodes pour modéliser et simuler un tel système existent et sont continuellement en développement. Elles peuvent être exploitées par le géographe urbain pour améliorer sa connaissance du métabolisme urbain. Des techniques modernes et innovatrices de visualisation aident dans la communication des résultats de tels modèles et simulations. Cette thèse décrit plusieurs méthodes permettant d'analyser, de modéliser, de simuler et de visualiser des phénomènes urbains. L'analyse de données socio-économiques à très haute dimension à l'aide de réseaux de neurones artificiels, notamment des cartes auto-organisatrices, est montré à travers deux exemples aux échelles différentes. Le problème de modélisation spatio-temporelle et de représentation des données est discuté et quelques ébauches de solutions esquissées. La simulation de la dynamique urbaine, et plus spécifiquement du trafic automobile engendré par les pendulaires est illustrée à l'aide d'une simulation multi-agents. Une section sur les méthodes de visualisation montre des cartes en anamorphoses permettant de transformer l'espace géographique en espace fonctionnel. Un autre type de carte, les cartes circulaires, est présenté. Ce type de carte est particulièrement utile pour les agglomérations urbaines. Quelques questions liées à l'importance de l'échelle dans l'analyse urbaine sont également discutées. Une nouvelle approche pour définir des clusters urbains à des échelles différentes est développée, et le lien avec la théorie de la percolation est établi. Des statistiques fractales, notamment la lacunarité, sont utilisées pour caractériser ces clusters urbains. L'évolution de la population est modélisée à l'aide d'un modèle proche du modèle gravitaire bien connu. Le travail couvre une large panoplie de méthodes utiles en géographie urbaine. Toutefois, il est toujours nécessaire de développer plus loin ces méthodes et en même temps, elles doivent trouver leur chemin dans la vie quotidienne des urbanistes et planificateurs.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The recently measured inclusive electron-proton cross section in the nucleon resonance region, performed with the CLAS detector at the Thomas Jefferson Laboratory, has provided new data for the nucleon structure function F2 with previously unavailable precision. In this paper we propose a description of these experimental data based on a Regge-dual model for F2. The basic inputs in the model are nonlinear complex Regge trajectories producing both isobar resonances and a smooth background. The model is tested against the experimental data, and the Q2 dependence of the moments is calculated. The fitted model for the structure function (inclusive cross section) is a limiting case of the more general scattering amplitude equally applicable to deeply virtual Compton scattering. The connection between the two is discussed.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The region of greatest variability on soil maps is along the edge of their polygons, causing disagreement among pedologists about the appropriate description of soil classes at these locations. The objective of this work was to propose a strategy for data pre-processing applied to digital soil mapping (DSM). Soil polygons on a training map were shrunk by 100 and 160 m. This strategy prevented the use of covariates located near the edge of the soil classes for the Decision Tree (DT) models. Three DT models derived from eight predictive covariates, related to relief and organism factors sampled on the original polygons of a soil map and on polygons shrunk by 100 and 160 m were used to predict soil classes. The DT model derived from observations 160 m away from the edge of the polygons on the original map is less complex and has a better predictive performance.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Introduction.- Pain and beliefs have an influence on the patient's course in rehabilitation and their relationships are complex. The aim of this study was to understand the relationships between pain at admission and the evolution of beliefs during rehabilitation as well as the relationships between pain and beliefs one year after rehabilitation.Patients and methods.- Six hundred and thirty-one consecutive patients admitted in rehabilitation after musculoskeletal trauma, were included and assessed at admission, at discharge and one year after discharge. Pain was measured by VAS (Visual Analogical Scale) and beliefs by judgement on Lickert scales. Four kinds of beliefs were evaluated: fear of a severe origin of pain, fear of movement, fear of pain and feeling of distress (loss of control). The association between pain and beliefs was assessed by logistic regressions, adjusted for gender, age, native language, education and bio-psycho-social complexity.Results.- At discharge, 44% of patients felt less distressed by pain, 34% are reinsured with regard to their fear of a severe origin of pain, 38% have less fear of pain and 33% have less fear of movement. The higher the pain at admission, the higher the probability that the distress diminished, this being true up to a threshold (70 mm/100) beyond which there was a plateau. At one year, the higher the pain, the more dysfunctional the fears.Discussion.- The relationships between pain and beliefs are complex and may change all along rehabilitation. During hospitalization, one could hope that the patient would be reinsured and would gain self-control again, if pain does not exceed a certain threshold. After one year, high pain increases the risk of dysfunctional beliefs. For clinical practice, these data suggest to think in terms of the more accessible "entrance door", act against pain and/or against beliefs, adpated to each patient.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Since different pedologists will draw different soil maps of a same area, it is important to compare the differences between mapping by specialists and mapping techniques, as for example currently intensively discussed Digital Soil Mapping. Four detailed soil maps (scale 1:10.000) of a 182-ha sugarcane farm in the county of Rafard, São Paulo State, Brazil, were compared. The area has a large variation of soil formation factors. The maps were drawn independently by four soil scientists and compared with a fifth map obtained by a digital soil mapping technique. All pedologists were given the same set of information. As many field expeditions and soil pits as required by each surveyor were provided to define the mapping units (MUs). For the Digital Soil Map (DSM), spectral data were extracted from Landsat 5 Thematic Mapper (TM) imagery as well as six terrain attributes from the topographic map of the area. These data were summarized by principal component analysis to generate the map designs of groups through Fuzzy K-means clustering. Field observations were made to identify the soils in the MUs and classify them according to the Brazilian Soil Classification System (BSCS). To compare the conventional and digital (DSM) soil maps, they were crossed pairwise to generate confusion matrices that were mapped. The categorical analysis at each classification level of the BSCS showed that the agreement between the maps decreased towards the lower levels of classification and the great influence of the surveyor on both the mapping and definition of MUs in the soil map. The average correspondence between the conventional and DSM maps was similar. Therefore, the method used to obtain the DSM yielded similar results to those obtained by the conventional technique, while providing additional information about the landscape of each soil, useful for applications in future surveys of similar areas.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

New stratigraphic data along a profile from the Helvetic Gotthard Massif to the remnants of the North Penninic Basin in eastern Ticino and Graubunden are presented. The stratigraphic record together with existing geochemical and structural data, motivate a new interpretation of the fossil European distal margin. We introduce a new group of Triassic facies, the North-Penninic-Triassic (NPT), which is characterised by the Ladinian "dolomie bicolori". The NPT was located in-between the Briançonnais carbonate platform and the Helvetic lands. The observed horizontal transition, coupled with the stratigraphic superposition of an Helvetic Liassic on a Briaçonnais Triassic in the Luzzone-Terri nappe, links, prior to Jurassic rifting, the Briançonnais paleogeographic domain at the Helvetic Margin, south of the Gotthard. Our observations suggest that the Jurassic rifting separated the Briançonnais domain from the Helvetic margin by complex and protracted extension. The syn-rift stratigraphic record in the Adula nappe and surroundings suggests the presence of a diffuse rising area with only moderately subsiding basins above a thinned continental and proto-oceanic crust. Strong subsidence occurred in a second phase following protracted extension and the resulting delamination of the rising area. The stratigraphic coherency in the Adula's Mesozoic questions the idea of a lithospheric mélange in the eclogitic Adula nappe, which is more likely to be a coherent alpine tectonic unit. The structural and stratigraphic observations in the Piz Terri-Lunschania zone suggest the activity of syn-rift detachments. During the alpine collision these faults are reactivated (and inverted) and played a major role in allowing the Adula subduction, the "Penninic Thrust" above it and in creating the structural complexity of the Central Alps.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Malondialdehyde (MDA) is a small, ubiquitous, and potentially toxic aldehyde that is produced in vivo by lipid oxidation and that is able to affect gene expression. Tocopherol deficiency in the vitamin E2 mutant vte2-1 of Arabidopsis thaliana leads to massive lipid oxidation and MDA accumulation shortly after germination. MDA accumulation correlates with a strong visual phenotype (growth reduction, cotyledon bleaching) and aberrant GST1 (glutathione S-transferase 1) expression. We suppressed MDA accumulation in the vte2-1 background by genetically removing tri-unsaturated fatty acids. The resulting quadruple mutant, fad3-2 fad7-2 fad8 vte2-1, did not display the visual phenotype or the aberrant GST1 expression observed in vte2-1. Moreover, cotyledon bleaching in vte2-1 was chemically phenocopied by treatment of wild-type plants with MDA. These data suggest that products of tri-unsaturated fatty acid oxidation underlie the vte2-1 seedling phenotype, including cellular toxicity and gene regulation properties. Generation of the quadruple mutant facilitated the development of an in situ fluorescence assay based on the formation of adducts of MDA with 2-thiobarbituric acid at 37 degrees C. Specificity was verified by measuring pentafluorophenylhydrazine derivatives of MDA and by liquid chromatography analysis of MDA-2-thiobarbituric acid adducts. Potentially applicable to other organisms, this method allowed the localization of MDA pools throughout the body of Arabidopsis and revealed an undiscovered pool of the compound unlikely to be derived from trienoic fatty acids in the vicinity of the root tip quiescent center.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The complex regional pain syndrome (CRPS) is a rare but debilitating pain disorder that mostly occurs after injuries to the upper limb. A number of studies indicated altered brain function in CRPS, whereas possible influences on brain structure remain poorly investigated. We acquired structural magnetic resonance imaging data from CRPS type I patients and applied voxel-by-voxel statistics to compare white and gray matter brain segments of CRPS patients with matched controls. Patients and controls were statistically compared in two different ways: First, we applied a 2-sample ttest to compare whole brain white and gray matter structure between patients and controls. Second, we aimed to assess structural alterations specifically of the primary somatosensory (S1) and motor cortex (M1) contralateral to the CRPS affected side. To this end, MRI scans of patients with left-sided CRPS (and matched controls) were horizontally flipped before preprocessing and region-of-interest-based group comparison. The unpaired ttest of the "non-flipped" data revealed that CRPS patients presented increased gray matter density in the dorsomedial prefrontal cortex. The same test applied to the "flipped" data showed further increases in gray matter density, not in the S1, but in the M1 contralateral to the CRPS-affected limb which were inversely related to decreased white matter density of the internal capsule within the ipsilateral brain hemisphere. The gray-white matter interaction between motor cortex and internal capsule suggests compensatory mechanisms within the central motor system possibly due to motor dysfunction. Altered gray matter structure in dorsomedial prefrontal cortex may occur in response to emotional processes such as pain-related suffering or elevated analgesic top-down control.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The graphical representation of spatial soil properties in a digital environment is complex because it requires a conversion of data collected in a discrete form onto a continuous surface. The objective of this study was to apply three-dimension techniques of interpolation and visualization on soil texture and fertility properties and establish relationships with pedogenetic factors and processes in a slope area. The GRASS Geographic Information System was used to generate three-dimensional models and ParaView software to visualize soil volumes. Samples of the A, AB, BA, and B horizons were collected in a regular 122-point grid in an area of 13 ha, in Pinhais, PR, in southern Brazil. Geoprocessing and graphic computing techniques were effective in identifying and delimiting soil volumes of distinct ranges of fertility properties confined within the soil matrix. Both three-dimensional interpolation and the visualization tool facilitated interpretation in a continuous space (volumes) of the cause-effect relationships between soil texture and fertility properties and pedological factors and processes, such as higher clay contents following the drainage lines of the area. The flattest part with more weathered soils (Oxisols) had the highest pH values and lower Al3+ concentrations. These techniques of data interpolation and visualization have great potential for use in diverse areas of soil science, such as identification of soil volumes occurring side-by-side but that exhibit different physical, chemical, and mineralogical conditions for plant root growth, and monitoring of plumes of organic and inorganic pollutants in soils and sediments, among other applications. The methodological details for interpolation and a three-dimensional view of soil data are presented here.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The in vitro adenovirus (Ad) DNA replication system provides an assay to study the interaction of viral and host replication proteins with the DNA template in the formation of the preinitiation complex. This initiation system requires in addition to the origin DNA sequences 1) Ad DNA polymerase (Pol), 2) Ad preterminal protein (pTP), the covalent acceptor for protein-primed DNA replication, and 3) nuclear factor I (NFI), a host cell protein identical to the CCAAT box-binding transcription factor. The interactions of these proteins were studied by coimmunoprecipitation and Ad origin DNA binding assays. The Ad Pol can bind to origin sequences only in the presence of another protein which can be either pTP or NFI. While NFI alone can bind to its origin recognition sequence, pTP does not specifically recognize DNA unless Ad Pol is present. Thus, protein-protein interactions are necessary for the targetting of either Ad Pol or pTP to the preinitiation complex. DNA footprinting demonstrated that the Ad DNA site recognized by the pTP.Pol complex was within the first 18 bases at the end of the template which constitutes the minimal origin of replication. Mutagenesis studies have defined the Ad Pol interaction site on NFI between amino acids 68-150, which overlaps the DNA binding and replication activation domain of this factor. A putative zinc finger on the Ad Pol has been mutated to a product that fails to bind the Ad origin sequences but still interacts with pTP. These results indicate that both protein-protein and protein-DNA interactions mediate specific recognition of the replication origin by Ad DNA polymerase.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

We compared specimens of Tripterygion tripteronotus from 52 localities of the Mediterranean Sea and adjacent waters, using four gene sequences (12S rRNA, tRNA-valine, 16S rRNA and COI) and morphological characters. Two well-differentiated clades with a mean genetic divergence of 6.89±0.73% were found with molecular data, indicating the existence of two different species. These two species have disjunctive geographic distribution areas without any molecular hybrid populations. Subtle but diagnostic morphological differences were also present between the two species. T. tripteronotus is restricted to the northern Mediterranean basin, from the NE coast of Spain to Greece and Turkey, including the islands of Malta and Cyprus. T. tartessicum n. sp. is geographically distributed along the southern coast of Spain, from Cape of La Nao to the Gulf of Cadiz, the Balearic Islands and northern Africa, from Morocco to Tunisia. According to molecular data, these two species could have diverged during the Pliocene glaciations 2.7-3.6 Mya.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Numerous sources of evidence point to the fact that heterogeneity within the Earth's deep crystalline crust is complex and hence may be best described through stochastic rather than deterministic approaches. As seismic reflection imaging arguably offers the best means of sampling deep crustal rocks in situ, much interest has been expressed in using such data to characterize the stochastic nature of crustal heterogeneity. Previous work on this problem has shown that the spatial statistics of seismic reflection data are indeed related to those of the underlying heterogeneous seismic velocity distribution. As of yet, however, the nature of this relationship has remained elusive due to the fact that most of the work was either strictly empirical or based on incorrect methodological approaches. Here, we introduce a conceptual model, based on the assumption of weak scattering, that allows us to quantitatively link the second-order statistics of a 2-D seismic velocity distribution with those of the corresponding processed and depth-migrated seismic reflection image. We then perform a sensitivity study in order to investigate what information regarding the stochastic model parameters describing crustal velocity heterogeneity might potentially be recovered from the statistics of a seismic reflection image using this model. Finally, we present a Monte Carlo inversion strategy to estimate these parameters and we show examples of its application at two different source frequencies and using two different sets of prior information. Our results indicate that the inverse problem is inherently non-unique and that many different combinations of the vertical and lateral correlation lengths describing the velocity heterogeneity can yield seismic images with the same 2-D autocorrelation structure. The ratio of all of these possible combinations of vertical and lateral correlation lengths, however, remains roughly constant which indicates that, without additional prior information, the aspect ratio is the only parameter describing the stochastic seismic velocity structure that can be reliably recovered.