902 resultados para Large Data Sets
Resumo:
Tephras are important for the NZ-INTIMATE project because they link all three records comprising the composite inter-regional stratotype developed for the New Zealand climate event stratigraphy (NZ-CES). Here we firstly report new calendar ages for 24 widespread marker tephras erupted since 30,000 calendar (cal.) years ago in New Zealand to help facilitate their use as chronostratigraphic dating tools for the NZ-CES and for other palaeoenvironmental and geological applications. The selected tephras comprise 12 rhyolitic tephras from Taupo, nine rhyolitic tephras from Okataina, one peralkaline rhyolitic tephra from Tuhua, and one andesitic tephra each from Tongariro and Egmont/Taranaki volcanic centres. Age models for the tephras were obtained using three methods: (i) C-based wiggle-match dating of wood from trees killed by volcanic eruptions (these dates published previously); (ii) flexible depositional modelling of a high-resolution C-dated age-depth sequence at Kaipo bog using two Bayesian-based modelling programs, Bacon and OxCal's P_Sequence function, and the IntCal09 data set (with SH offset correction-44±17yr); and (iii) calibration of C ages using OxCal's Tau_Boundary function and the SHCal04 and IntCal09 data sets. Our preferred dates or calibrated ages for the 24 tephras are as follows (youngest to oldest, all mid-point or mean ages of 95% probability ranges): Kaharoa AD 1314±12; Taupo (Unit Y) AD 232±10; Mapara (Unit X) 2059±118cal.yrBP; Whakaipo (Unit V) 2800±60cal.yrBP; Waimihia (Unit S) 3401±108cal.yrBP; Stent (Unit Q) 4322±112cal.yrBP; Unit K 5111±210cal.yrBP; Whakatane 5526±145cal.yrBP; Tuhua 6577±547cal.yrBP; Mamaku 7940±257cal.yrBP; Rotoma 9423±120cal.yrBP; Opepe (Unit E) 9991±160cal.yrBP; Poronui (Unit C) 11,170±115cal.yrBP; Karapiti (Unit B) 11,460±172cal.yrBP; Okupata 11,767±192cal.yrBP; Konini (bed b) 11,880±183cal.yrBP; Waiohau 14,009±155cal.yrBP; Rotorua 15,635±412cal.yrBP; Rerewhakaaitu 17,496±462cal.yrBP; Okareka 21,858±290cal.yrBP; Te Rere 25,171±964cal.yrBP; Kawakawa/Oruanui 25,358±162cal.yrBP; Poihipi 28,446±670cal.yrBP; and Okaia 28,621±1428cal.yrBP.Secondly, we have re-dated the start and end of the Lateglacial cool episode (climate event NZce-3 in theNZ-CES), previously referred to as the Lateglacial climate reversal, as defined at Kaipo bog in eastern North Island, New Zealand, using both Bacon and OxCal P_Sequence modelling with the IntCal09 data set. The ca1200-yr-long cool episode, indicated by a lithostratigraphic change in the Kaipo peat sequence to grey mudwith lowered carbon content, and a high-resolution pollen-derived cooling signal, began 13,739±125cal.yrBP and ended 12,550±140cal.yrBP (mid-point ages of the 95% highest posterior density regions, Bacon modelling). The OxCal modelling, generating almost identical ages, confirmed these ages. The Lateglacial cool episode (ca 13.8-12.6cal.kaBP) thus overlaps a large part of the entire Antarctic Cold Reversal chronozone (ca 14.1-12.4cal.kaBP or ca 14.6-12.8cal.kaBP), and an early part of the Greenland Stadial-1 (Younger Dryas) chronozone (ca 12.9-11.7cal.kaBP). The timing of the Lateglacial cool episode at Kaipo is broadly consistent with the latitudinal patterns in the Antarctic Cold Reversal signal suggested for the New Zealand archipelago from marine and terrestrial records, and with records from southern South America. © 2012 Elsevier Ltd.
Resumo:
Accurate conceptual models of groundwater systems are essential for correct interpretation of monitoring data in catchment studies. In surface-water dominated hard rock regions, modern ground and surface water monitoring programmes often have very high resolution chemical, meteorological and hydrological observations but lack an equivalent emphasis on the subsurface environment, the properties of which exert a strong control on flow pathways and interactions with surface waters. The reasons for this disparity are the complexity of the system and the difficulty in accurately characterising the subsurface, except locally at outcrops or in boreholes. This is particularly the case in maritime north-western Europe, where a legacy of glacial activity, combined with large areas underlain by heterogeneous igneous and metamorphic bedrock, make the structure and weathering of bedrock difficult to map or model. Traditional approaches which seek to extrapolate information from borehole to field-scale are of limited application in these environments due to the high degree of spatial heterogeneity. Here we apply an integrative and multi-scale approach, optimising and combining standard geophysical techniques to generate a three-dimensional geological conceptual model of the subsurface in a catchment in NE Ireland. Available airborne LiDAR, electromagnetic and magnetic data sets were analysed for the region. At field-scale surface geophysical methods, including electrical resistivity tomography, seismic refraction, ground penetrating radar and magnetic surveys, were used and combined with field mapping of outcrops and borehole testing. The study demonstrates how combined interpretation of multiple methods at a range of scales produces robust three-dimensional conceptual models and a stronger basis for interpreting groundwater and surface water monitoring data.
Resumo:
Aim: To critically appraise recent research into associations between periodontal disease and systemic diseases and conditions specifically respiratory disease, chronic kidney disease, rheumatoid arthritis, cognitive impairment, obesity, metabolic syndrome and cancer.
Methods: A MEDLINE literature search of papers published between 2002 and April 2012 was conducted. Studies that included periodontitis as an exposure were identified. Cross-sectional epidemiological investigations on large samples, prospective studies and systematic reviews formed the basis of the narrative review. A threshold set for the identification of periodontitis was used to identify those studies that contributed to the conclusions of the review.
Results: Many of the investigations were cross-sectional secondary analyses of existing data sets in particular the NHANES studies. There were a small number of systematic reviews and prospective studies. There was substantial variability in the definitions of exposure to periodontitis. A small number of studies met the threshold set for periodontitis and supported associations; however, in some of the chronic diseases there were no such studies. There was strong evidence from randomized controlled trials that interventions, which improve oral hygiene have positive effects on the prevention of nosocomial pneumonias.
Conclusions: There was substantial heterogeneity in the definitions used to identify periodontitis and very few studies met a stringent threshold for periodontitis. Published evidence supports modest associations between periodontitis and some, although not all, of the diseases and conditions reviewed. There is a need to reach a consensus on what constitutes periodontitis for future studies of putative associations with systemic diseases.
Resumo:
We present Roche tomograms of the K4V secondary star in the cataclysmic variable AE Aqr, reconstructed from two data sets taken 9 d apart, and measure the differential rotation of the stellar surface. The tomograms show many large, cool starspots, including a large high-latitude spot and a prominent appendage down the trailing hemisphere. We find two distinct bands of spots around 22° and 43° latitude, and estimate a spot coverage of 15.4-17 per cent on the Northern hemisphere. Assuming a solar-like differential rotation law, the differential rotation of AE Aqr was measured using two different techniques. The first method yields an equator-pole lap time of 269 d and the second yields a lap time of 262 d. This shows that the star is not fully tidally locked, as was previously assumed for CVs, but has a co-rotation latitude of ˜40°. We discuss the implications that these observations have on stellar dynamo theory, as well as the impact that spot traversal across the L1 point may have on accretion rates in CVs as well as some of their other observed properties. The entropy landscape technique was applied to determine the system parameters of AE Aqr. For the two independent data sets, we find M1 = 1.20 and 1.17 M⊙, M2 = 0.81 and 0.78 M⊙, and orbital inclinations of 50° to 51° at optimal systemic velocities of γ = -64.7 and -62.9 km s-1.
Resumo:
The quantity and quality of spatial data are increasing rapidly. This is particularly evident in the case of movement data. Devices capable of accurately recording the position of moving entities have become ubiquitous and created an abundance of movement data. Valuable knowledge concerning processes occurring in the physical world can be extracted from these large movement data sets. Geovisual analytics offers powerful techniques to achieve this. This article describes a new geovisual analytics tool specifically designed for movement data. The tool features the classic space-time cube augmented with a novel clustering approach to identify common behaviour. These techniques were used to analyse pedestrian movement in a city environment which revealed the effectiveness of the tool for identifying spatiotemporal patterns. © 2014 Taylor & Francis.
Resumo:
Anorexia nervosa (AN) is a complex and heritable eating disorder characterized by dangerously low body weight. Neither candidate gene studies nor an initial genome-wide association study (GWAS) have yielded significant and replicated results. We performed a GWAS in 2907 cases with AN from 14 countries (15 sites) and 14 860 ancestrally matched controls as part of the Genetic Consortium for AN (GCAN) and the Wellcome Trust Case Control Consortium 3 (WTCCC3). Individual association analyses were conducted in each stratum and meta-analyzed across all 15 discovery data sets. Seventy-six (72 independent) single nucleotide polymorphisms were taken forward for in silico (two data sets) or de novo (13 data sets) replication genotyping in 2677 independent AN cases and 8629 European ancestry controls along with 458 AN cases and 421 controls from Japan. The final global meta-analysis across discovery and replication data sets comprised 5551 AN cases and 21 080 controls. AN subtype analyses (1606 AN restricting; 1445 AN binge-purge) were performed. No findings reached genome-wide significance. Two intronic variants were suggestively associated: rs9839776 (P=3.01 × 10(-7)) in SOX2OT and rs17030795 (P=5.84 × 10(-6)) in PPP3CA. Two additional signals were specific to Europeans: rs1523921 (P=5.76 × 10(-)(6)) between CUL3 and FAM124B and rs1886797 (P=8.05 × 10(-)(6)) near SPATA13. Comparing discovery with replication results, 76% of the effects were in the same direction, an observation highly unlikely to be due to chance (P=4 × 10(-6)), strongly suggesting that true findings exist but our sample, the largest yet reported, was underpowered for their detection. The accrual of large genotyped AN case-control samples should be an immediate priority for the field.
Resumo:
In the study of complex genetic diseases, the identification of subgroups of patients sharing similar genetic characteristics represents a challenging task, for example, to improve treatment decision. One type of genetic lesion, frequently investigated in such disorders, is the change of the DNA copy number (CN) at specific genomic traits. Non-negative Matrix Factorization (NMF) is a standard technique to reduce the dimensionality of a data set and to cluster data samples, while keeping its most relevant information in meaningful components. Thus, it can be used to discover subgroups of patients from CN profiles. It is however computationally impractical for very high dimensional data, such as CN microarray data. Deciding the most suitable number of subgroups is also a challenging problem. The aim of this work is to derive a procedure to compact high dimensional data, in order to improve NMF applicability without compromising the quality of the clustering. This is particularly important for analyzing high-resolution microarray data. Many commonly used quality measures, as well as our own measures, are employed to decide the number of subgroups and to assess the quality of the results. Our measures are based on the idea of identifying robust subgroups, inspired by biologically/clinically relevance instead of simply aiming at well-separated clusters. We evaluate our procedure using four real independent data sets. In these data sets, our method was able to find accurate subgroups with individual molecular and clinical features and outperformed the standard NMF in terms of accuracy in the factorization fitness function. Hence, it can be useful for the discovery of subgroups of patients with similar CN profiles in the study of heterogeneous diseases.
Resumo:
The Copney Stone Circle Complex, Co. Tyrone, N. Ireland, is an important Bronze Age site forming part of the Mid-Ulster Stone Circle Complex. The Environment Service: Historic Monuments and Buildings (ESHMB) initiated a program of bog-clearance in August 1994 to excavate the stone circles. This work was completed by October 1994 and the excavated site was surveyed in August 1995. Almost immediately, the rate at which the stones forming the circles were breaking down was noted and a program of study initiated to make recommendations upon the conservation of this important site. Digital photogrammetric techniques were applied to aerial images of the stone circles and digital terrain models created from the images at a range of scales. These provide base data sets for comparison with identical surveys to be completed in successive years and will allow the rate of deterioration, and the areas most affected, of the circles to be determined. In addition, a 2D analysis of the stones provides an accurate analysis of the absolute 2D dimensions of the stones for rapid desktop computer analysis by researchers remote from the digital photogrammetric workstation used in the survey.
The products of this work are readily incorporated into web sites, educational packages and databases. The technique provides a rapid and user friendly method of presentation of a large body of information and measurements, and a reliable method of storage of the information from Copney should it become necessary to re-cover the site.
Resumo:
Kumphawapi, which is Thailand’s largest natural freshwater lake, contains a >10,000-year-long climatic and environmental archive. New data sets (stratigraphy, chronology, hydrogen isotopes, plant macrofossil and charcoal records) for two sedimentary sequences are here combined with earlier multi-proxy studies to provide a comprehensive reconstruction of past climatic and environmental changes for Northeast Thailand. Gradually higher moisture availability due to a strengthening of the summer monsoon led to the formation of a large shallow lake in the Kumphawapi basin between >10,700 and c. 7000 cal. BP. The marked increase in moisture availability and lower evaporation between c. 7000 and 6400 cal. BP favoured the growth and expansion of vegetation in and around the shallow lake. The increase in biomass led to gradual overgrowing and infilling, to an apparent lake level lowering and to the development of a wetland. Multiple hiatuses are apparent in all investigated sequences between c. 6500 and 1400 cal. BP and are explained by periodic desiccation events of the wetland and erosion due to the subsequent lake level rise. The rise in lake level, which started c. 2000 cal. BP and reached shallower parts c. 1400 cal. BP, is attributed to an increase in effective moisture availability. The timing of hydroclimatic conditions during the past 2000 years cannot be resolved because of chronological limitations.
Resumo:
The algorithm developed uses an octree pyramid in which noise is reduced at the expense of the spatial resolution. At a certain level an unsupervised clustering without spatial connectivity constraints is applied. After the classification, isolated voxels and insignificant regions are removed by assigning them to their neighbours. The spatial resolution is then increased by the downprojection of the regions, level by level. At each level the uncertainty of the boundary voxels is minimised by a dynamic selection and classification of these, using an adaptive 3D filtering. The algorithm is tested using different data sets, including NMR data.
Resumo:
La mise en registre 3D (opération parfois appelée alignement) est un processus de transformation d’ensembles de données 3D dans un même système de coordonnées afin d’en aligner les éléments communs. Deux ensembles de données alignés ensemble peuvent être les scans partiels des deux vues différentes d’un même objet. Ils peuvent aussi être deux modèles complets, générés à des moments différents, d’un même objet ou de deux objets distincts. En fonction des ensembles de données à traiter, les méthodes d’alignement sont classées en mise en registre rigide ou non-rigide. Dans le cas de la mise en registre rigide, les données sont généralement acquises à partir d’objets rigides. Le processus de mise en registre peut être accompli en trouvant une seule transformation rigide globale (rotation, translation) pour aligner l’ensemble de données source avec l’ensemble de données cible. Toutefois, dans le cas non-rigide, où les données sont acquises à partir d’objets déformables, le processus de mise en registre est plus difficile parce qu’il est important de trouver à la fois une transformation globale et des déformations locales. Dans cette thèse, trois méthodes sont proposées pour résoudre le problème de mise en registre non-rigide entre deux ensembles de données (représentées par des maillages triangulaires) acquises à partir d’objets déformables. La première méthode permet de mettre en registre deux surfaces se chevauchant partiellement. La méthode surmonte les limitations des méthodes antérieures pour trouver une grande déformation globale entre deux surfaces. Cependant, cette méthode est limitée aux petites déformations locales sur la surface afin de valider le descripteur utilisé. La seconde méthode est s’appuie sur le cadre de la première et est appliquée à des données pour lesquelles la déformation entre les deux surfaces est composée à la fois d’une grande déformation globale et de petites déformations locales. La troisième méthode, qui se base sur les deux autres méthodes, est proposée pour la mise en registre d’ensembles de données qui sont plus complexes. Bien que la qualité que elle fournit n’est pas aussi bonne que la seconde méthode, son temps de calcul est accéléré d’environ quatre fois parce que le nombre de paramètres optimisés est réduit de moitié. L’efficacité des trois méthodes repose sur des stratégies via lesquelles les correspondances sont déterminées correctement et le modèle de déformation est exploité judicieusement. Ces méthodes sont mises en oeuvre et comparées avec d’autres méthodes sur diverses données afin d’évaluer leur robustesse pour résoudre le problème de mise en registre non-rigide. Les méthodes proposées sont des solutions prometteuses qui peuvent être appliquées dans des applications telles que la mise en registre non-rigide de vues multiples, la reconstruction 3D dynamique, l’animation 3D ou la recherche de modèles 3D dans des banques de données.
Resumo:
Tese de doutoramento, Ciências Geofísicas e da Geoinformação (Geofisíca), Universidade de Lisboa, Faculdade de Ciências, 2014
Resumo:
In the age of E-Business many companies faced with massive data sets that must be analysed for gaining a competitive edge. these data sets are in many instances incomplete and quite often not of very high quality. Although statistical analysis can be used to pre-process these data sets, this technique has its own limitations. In this paper we are presenting a system - and its underlying model - that can be used to test the integrity of existing data and pre-process the data into clearer data sets to be mined. LH5 is a rule-based system, capable of self-learning and is illustrated using a medical data set.
Resumo:
UW access only. Questions about spatial data can be directed to uwlib-gis [at] uw [dot] edu, include the URI address below and any information you have.
Resumo:
Trabalho de Projeto para obtenção do grau de Mestre em Engenharia Informática e de Computadores