895 results for Data Linkage
Abstract:
Background: Spatial analysis is increasingly important for identifying modifiable geographic risk factors for disease. However, spatial health data from surveys are often incomplete, with missing data affecting anything from a few variables to many. For spatial analyses of health outcomes, selecting an appropriate imputation method is critical to producing the most accurate inferences. Methods: We present a cross-validation approach to select between three imputation methods for health survey data with correlated lifestyle covariates, using type II diabetes mellitus (DM II) risk across 71 Queensland Local Government Areas (LGAs) as a case study. We compare the accuracy of mean imputation to imputation using multivariate normal and conditional autoregressive prior distributions. Results: The best choice of imputation method depends upon the application and is not necessarily the most complex method. Mean imputation was selected as the most accurate method in this application. Conclusions: Selecting an appropriate imputation method for health survey data, after accounting for spatial correlation and correlation between covariates, allows more complete analysis of geographic risk factors for disease with more confidence in the results to inform public policy decision-making.
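The cross-validation idea in this abstract can be illustrated with a minimal sketch: hide each observed value in turn, predict it with each candidate imputation method, and keep the method with the lowest error. Everything below is an assumption made for illustration. The toy values and adjacency list are invented, and the neighbour-average imputer is only a simple stand-in for a spatially informed method; it is not the multivariate normal or conditional autoregressive model the authors compare.

```python
from statistics import mean

def mean_impute(values, neighbours, idx):
    """Predict area idx as the mean of all other observed areas."""
    return mean(v for i, v in enumerate(values) if i != idx and v is not None)

def neighbour_impute(values, neighbours, idx):
    """Predict area idx as the mean of its observed neighbours.

    Falls back to the global mean when no neighbour is observed.
    This is a hypothetical stand-in for a spatial model, not a CAR prior.
    """
    observed = [values[j] for j in neighbours[idx] if values[j] is not None]
    return mean(observed) if observed else mean_impute(values, neighbours, idx)

def loo_rmse(values, neighbours, impute):
    """Leave-one-out root mean squared error over the observed areas."""
    squared_errors = [
        (impute(values, neighbours, i) - v) ** 2
        for i, v in enumerate(values)
        if v is not None
    ]
    return (sum(squared_errors) / len(squared_errors)) ** 0.5

METHODS = {"mean": mean_impute, "neighbour": neighbour_impute}

def select_method(values, neighbours):
    """Return the name of the lowest-error method and all scores."""
    scores = {name: loo_rmse(values, neighbours, fn) for name, fn in METHODS.items()}
    return min(scores, key=scores.get), scores

# Invented toy data: six areas in a chain, one of them missing (None).
toy_values = [1.0, 2.0, None, 4.0, 5.0, 6.0]
toy_neighbours = {0: [1], 1: [0, 2], 2: [1, 3], 3: [2, 4], 4: [3, 5], 5: [4]}

best, scores = select_method(toy_values, toy_neighbours)
```

On this smooth toy trend the neighbour-based imputer scores better, which is the opposite of what the abstract reports for its real data. That contrast is the point of running the comparison per application rather than assuming one method is best.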
Abstract:
BACKGROUND: Many koala populations around Australia are in serious decline, with a substantial component of this decline in some Southeast Queensland populations attributed to the impact of Chlamydia. A Chlamydia vaccine for koalas is in development and has shown promise in early trials. This study contributes to implementation preparedness by simulating vaccination strategies designed to reverse population decline and by identifying which age and sex category it would be most effective to target. METHODS: We used field data to inform the development and parameterisation of an individual-based stochastic simulation model of a koala population endemic with Chlamydia. The model took into account transmission, morbidity and mortality caused by Chlamydia infections. We calibrated the model to characteristics of typical Southeast Queensland koala populations. As there is uncertainty about the effectiveness of the vaccine in real-world settings, a variety of potential vaccine efficacies, half-lives and dosing schedules were simulated. RESULTS: Assuming other threats remain constant, it is expected that current population declines could be reversed in around 5-6 years if female koalas aged 1-2 years are targeted, average vaccine protective efficacy is 75%, and vaccine coverage is around 10% per year. At lower vaccine efficacies the immunological effects of boosting become important: at 45% vaccine efficacy population decline is predicted to reverse in 6 years under optimistic boosting assumptions but in 9 years under pessimistic boosting assumptions. Terminating a successful vaccination programme at 5 years would lead to a rise in Chlamydia prevalence towards pre-vaccination levels. CONCLUSION: For a range of vaccine efficacy levels it is projected that population decline due to endemic Chlamydia can be reversed under realistic dosing schedules, potentially in just 5 years. However, a vaccination programme might need to continue indefinitely in order to maintain Chlamydia prevalence at a sufficiently low level for population growth to continue.
Abstract:
In this paper, we show implementation results of various algorithms that sort data encrypted with a Fully Homomorphic Encryption scheme based on integers. We analyze the complexities of sorting algorithms over encrypted data by considering Bubble Sort, Insertion Sort, Bitonic Sort and Odd-Even Merge Sort. Our complexity analysis, together with the implementation results, shows that Odd-Even Merge Sort performs better than the other sorting techniques. We observe that sorting in the homomorphic domain always incurs the worst-case complexity, independent of the nature of the input. In addition, we show that combining different sorting algorithms to sort encrypted data does not give any performance gain when compared to the application of sorting algorithms individually.
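The observation that encrypted sorting always pays the worst-case cost follows from the fact that a program cannot branch on a value it cannot read, so every compare-exchange must be executed regardless of the input. The sketch below illustrates this with Batcher's odd-even merge sort expressed as a fixed sorting network. It is an assumption-laden illustration, not the paper's implementation: plaintext integers stand in for ciphertexts, and the comparison bit `b` is computed in the clear where a real system would evaluate a homomorphic comparison circuit. The input length is assumed to be a power of two.

```python
def oddeven_merge_sort_network(n):
    """Yield the compare-exchange index pairs of Batcher's odd-even merge sort.

    The sequence of pairs depends only on n, never on the data.
    """
    p = 1
    while p < n:
        k = p
        while k >= 1:
            for j in range(k % p, n - k, 2 * k):
                for i in range(min(k, n - j - k)):
                    if (i + j) // (2 * p) == (i + j + k) // (2 * p):
                        yield (i + j, i + j + k)
            k //= 2
        p *= 2

def compare_exchange(x, y):
    """Branch-free (min, max) of x and y.

    The bit b stands in for the output of a homomorphic comparison;
    the selection uses only additions and multiplications.
    """
    b = int(x > y)
    low = b * y + (1 - b) * x
    high = b * x + (1 - b) * y
    return low, high

def oblivious_sort(data):
    """Sort data with the fixed network; return the result and the operation count."""
    out = list(data)
    operations = 0
    for a, b in oddeven_merge_sort_network(len(out)):
        out[a], out[b] = compare_exchange(out[a], out[b])
        operations += 1
    return out, operations

sorted_random, ops_random = oblivious_sort([3, 1, 4, 1, 5, 9, 2, 6])
sorted_presorted, ops_presorted = oblivious_sort([1, 2, 3, 4, 5, 6, 7, 8])
```

An already sorted input costs exactly as many compare-exchange operations as a shuffled one, because the network is fixed in advance. This is why adaptive algorithms such as Insertion Sort lose their best-case advantage in this setting.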
Abstract:
The recent trend for journals to require open access to primary data included in publications has been embraced by many biologists, but has caused apprehension amongst researchers engaged in long-term ecological and evolutionary studies. A worldwide survey of 73 principal investigators (PIs) with long-term studies revealed positive attitudes towards sharing data with the agreement or involvement of the PI, and 93% of PIs have historically shared data. Only 8% were in favor of uncontrolled, open access to primary data while 63% expressed serious concern. We present here their viewpoint on an issue that can have non-trivial scientific consequences. We discuss potential costs of public data archiving and provide possible solutions to meet the needs of journals and researchers.
Abstract:
Developing innovative library services requires a real-world understanding of faculty members' desired curricular goals. This study aimed to develop a comprehensive and deeper understanding of Purdue's nutrition science and political science faculties' expectations for student learning related to information and data information literacies. Course syllabi were examined using grounded theory techniques, which allowed us to identify how faculty were addressing information and data information literacies in their courses and to understand how these literacies connect to other departmental intentions for student learning, such as developing a professional identity or learning to conduct original research. The holistic understanding developed through this research provides the necessary information for designing and suggesting information literacy and data information literacy services to departmental faculty in ways supportive of curricular learning outcomes.
Abstract:
Rapid advances in sequencing technologies (Next Generation Sequencing or NGS) have led to a vast increase in the quantity of bioinformatics data available, with this increasing scale presenting enormous challenges to researchers seeking to identify complex interactions. This paper is concerned with the domain of transcriptional regulation, and the use of visualisation to identify relationships between specific regulatory proteins (the transcription factors or TFs) and their associated target genes (TGs). We present preliminary work from an ongoing study which aims to determine the effectiveness of different visual representations and large scale displays in supporting discovery. Following an iterative process of implementation and evaluation, representations were tested by potential users in the bioinformatics domain to determine their efficacy, and to understand better the range of ad hoc practices among bioinformatics literate users. Results from two rounds of small scale user studies are considered with initial findings suggesting that bioinformaticians require richly detailed views of TF data, features to compare TF layouts between organisms quickly, and ways to keep track of interesting data points.
Abstract:
Multiple sclerosis (MS) is an autoimmune disease with a genetic component, caused at least in part by aberrant lymphocyte activity. The whole blood mRNA transcriptome was measured for 99 untreated MS patients: 43 primary progressive MS, 20 secondary progressive MS, 36 relapsing remitting MS and 45 age-matched healthy controls. The ANZgene Multiple Sclerosis Genetics Consortium genotyped more than 300 000 SNPs for 115 of these samples. Transcription from genes on translational regulation, oxidative phosphorylation, immune synapse and antigen presentation pathways was markedly increased in all forms of MS. Expression of genes tagging T cells was also upregulated (P < 10⁻¹²) in MS. A T cell gene signature predicts disease state with a concordance index of 0.79 with age and gender as co-variables, but the signature is not associated with clinical course or disability. The ANZgene genome wide association screen identified two novel regions with genome wide significance: one encoding the T cell co-stimulatory molecule, CD40; the other a region on chromosome 12q13-14. The CD40 haplotype associated with increased MS susceptibility has decreased gene expression in MS (P < 0.0007). The second MS susceptibility region includes 17 genes on 12q13-14 in tight linkage disequilibrium. Of these, only 13 are expressed in leukocytes, and of these the expression of one, FAM119B, is much lower in the susceptibility haplotype (P < 10⁻¹⁴). Overall, these data indicate that dysregulation of T cells can be detected in the whole blood of untreated MS patients, and support targeting of activated T cells in therapy for all forms of MS.
Abstract:
This thesis has investigated how to cluster a large number of faces within a multi-media corpus in the presence of large session variation. Quality metrics are used to select the best faces to represent a sequence of faces; and session variation modelling improves clustering performance in the presence of wide variations across videos. Findings from this thesis contribute to improving the performance of both face verification systems and the fully automated clustering of faces from a large video corpus.
Abstract:
A strong association between ERAP1 and ankylosing spondylitis (AS) was recently identified by the Wellcome Trust Case Control Consortium and the Australo-Anglo-American Spondylitis Consortium (WTCCC-TASC) study. ERAP1 is highly polymorphic with strong linkage disequilibrium evident across the gene. We therefore conducted a series of experiments to try to identify the primary genetic association(s) with ERAP1. We replicated the original associations in an independent set of 730 patients and 1021 controls, resequenced ERAP1 to define the full extent of coding polymorphisms and tested all variants in additional association studies. The genetic association with ERAP1 was independently confirmed; the strongest association was with rs30187 in the replication set (P = 3.4 × 10⁻³). When the data were combined with the original WTCCC-TASC study the strongest association was with rs27044 (P = 1.1 × 10⁻⁹). We identified 33 sequence polymorphisms in ERAP1, including three novel and eight known non-synonymous polymorphisms. We report several new associations between AS and polymorphisms distributed across ERAP1 from the extended case-control study, the most significant of which was with rs27434 (P = 4.7 × 10⁻⁷). Regression analysis failed to identify a primary association clearly; we therefore used data from HapMap to impute genotypes for an additional 205 non-coding SNPs located within and adjacent to ERAP1. A number of highly significant associations (P < 5 × 10⁻⁹) were identified in regulatory sequences which are good candidates for causing susceptibility to AS, possibly by regulating ERAP1 expression. © 2009 The Author(s).
Abstract:
PURPOSE: The restricted genetic diversity and homogeneous molecular basis of Mendelian disorders in isolated founder populations have rarely been explored in epilepsy research. Our long-term goal is to explore the genetic basis of epilepsies in one such population, the Gypsies. The aim of this report is the clinical and genetic characterization of a Gypsy family with a partial epilepsy syndrome. METHODS: Clinical information was collected using semistructured interviews with affected subjects and informants. At least one interictal electroencephalography (EEG) recording was performed for each patient and previous data obtained from records. Neuroimaging included structural magnetic resonance imaging (MRI). Linkage and haplotype analysis was performed using the Illumina IVb Linkage Panel, supplemented with highly informative microsatellites in linked regions and Affymetrix SNP 5.0 array data. RESULTS: We observed an early-onset partial epilepsy syndrome with seizure semiology strongly suggestive of temporal lobe epilepsy (TLE), with mild intellectual deficit co-occurring in a large proportion of the patients. Psychiatric morbidity was common in the extended pedigree but did not cosegregate with epilepsy. Linkage analysis definitively excluded previously reported loci, and identified a novel locus on 5q31.3-q32 with a logarithm of the odds (LOD) score of 3, corresponding to the expected maximum in this family. DISCUSSION: The syndrome can be classified as familial temporal lobe epilepsy (FTLE) or possibly a new syndrome with mild intellectual deficit. The linked 5q region does not contain any ion channel-encoding genes and is thus likely to contribute new knowledge about epilepsy pathogenesis. Identification of the mutation in this family and in additional patients will define the full phenotypic spectrum.
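For readers unfamiliar with the statistic, the standard textbook definition of the two-point LOD score is given here; it is general background and not a formula taken from this abstract. The symbol $\theta$ denotes the recombination fraction between the marker and the disease locus, and $L(\cdot)$ the likelihood of the pedigree data.

```latex
Z(\theta) = \log_{10} \frac{L(\theta)}{L\left(\theta = \tfrac{1}{2}\right)}
```

Under this definition, a LOD score of 3 means the observed data are $10^{3} = 1000$ times more likely under linkage at the given $\theta$ than under free recombination ($\theta = 1/2$).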
Abstract:
Sensor networks for environmental monitoring present enormous benefits to the community and society as a whole. Currently there is a need for low-cost, compact, solar-powered sensors suitable for deployment in rural areas. The purpose of this research is to develop both a ground-based wireless sensor network and data collection using unmanned aerial vehicles (UAVs). The ground-based sensor system is capable of measuring environmental data such as temperature or air quality using cost-effective, low-power sensors. The sensor will be configured such that its data are stored on an ATMega16 microcontroller, which will be able to communicate with a UAV flying overhead using UAV communication protocols. The data are then either sent to the ground in real time or stored on the UAV using a microcontroller until it lands or is close enough to transmit the data to the ground station.
Abstract:
This technical report describes a Light Detection and Ranging (LiDAR) augmented methodology for optimal path planning in low-level flight, intended for remote sensing and sampling Unmanned Aerial Vehicles (UAVs). The UAV is used to perform remote air sampling and data acquisition from a network of sensors on the ground. The terrain data, in the form of 3D point cloud maps, are processed by the algorithms to find an optimal path. The results show that the method and algorithm are able to use the LiDAR data to avoid obstacles when planning a path from a start point to a target point. The report compares the performance of the method as the resolution of the LiDAR map is increased and when a Digital Elevation Model (DEM) is included. From a practical point of view, the optimal path plan is loaded into, and works seamlessly with, the UAV ground station; the report also shows the UAV ground station software augmented with more accurate LiDAR data.
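The abstract does not name the planning algorithm, so the following is only a generic sketch of obstacle-aware path planning over terrain data, using Dijkstra's algorithm on a grid as a stand-in. The height grid, the `altitude` and `clearance` parameters, and the function name `plan_path` are all hypothetical; a real pipeline would first rasterise the LiDAR point cloud into such a grid.

```python
import heapq

def plan_path(height, start, goal, altitude, clearance=1.0):
    """Shortest 4-connected path over cells the UAV can safely overfly.

    A cell is traversable when terrain height plus clearance does not
    exceed the flight altitude. Returns a list of (row, col) cells or
    None when no path exists. All parameters are illustrative assumptions.
    """
    rows, cols = len(height), len(height[0])

    def is_free(r, c):
        return height[r][c] + clearance <= altitude

    if not (is_free(*start) and is_free(*goal)):
        return None

    dist = {start: 0}
    prev = {}
    heap = [(0, start)]
    while heap:
        d, cell = heapq.heappop(heap)
        if cell == goal:
            # Walk the predecessor links back to the start.
            path = [cell]
            while cell in prev:
                cell = prev[cell]
                path.append(cell)
            return path[::-1]
        if d > dist[cell]:
            continue  # stale heap entry
        r, c = cell
        for nr, nc in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            if 0 <= nr < rows and 0 <= nc < cols and is_free(nr, nc):
                nd = d + 1
                if nd < dist.get((nr, nc), float("inf")):
                    dist[(nr, nc)] = nd
                    prev[(nr, nc)] = cell
                    heapq.heappush(heap, (nd, (nr, nc)))
    return None

# Invented terrain: a ridge of height 9 separates the start from the goal.
terrain = [
    [0, 9, 0],
    [0, 9, 0],
    [0, 0, 0],
]
low_path = plan_path(terrain, (0, 0), (0, 2), altitude=5)
high_path = plan_path(terrain, (0, 0), (0, 2), altitude=12)
```

At the lower altitude the planner routes around the ridge; at the higher altitude it flies straight over it. A finer grid exposes narrower gaps and smaller obstacles at the cost of a larger search space, which is the kind of resolution trade-off the report says it evaluates.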
Abstract:
We have genotyped 14,436 nonsynonymous SNPs (nsSNPs) and 897 major histocompatibility complex (MHC) tag SNPs from 1,000 independent cases of ankylosing spondylitis (AS), autoimmune thyroid disease (AITD), multiple sclerosis (MS) and breast cancer (BC). Comparing these data against a common control dataset derived from 1,500 randomly selected healthy British individuals, we report initial association and independent replication in a North American sample of two new loci related to ankylosing spondylitis, ARTS1 and IL23R, and confirmation of the previously reported association of AITD with TSHR and FCRL3. These findings, enabled in part by increased statistical power resulting from the expansion of the control reference group to include individuals from the other disease groups, highlight notable new possibilities for autoimmune regulation and suggest that IL23R may be a common susceptibility factor for the major 'seronegative' diseases.
Abstract:
Fibrodysplasia ossificans progressiva (FOP) is a rare autosomal dominant disorder of skeletal malformations and progressive extraskeletal ossification. We mapped FOP to chromosome 2q23-24 by linkage analysis and identified an identical heterozygous mutation (617G→A; R206H) in the glycine-serine (GS) activation domain of ACVR1, a BMP type I receptor, in all affected individuals examined. Protein modeling predicts destabilization of the GS domain, consistent with constitutive activation of ACVR1 as the underlying cause of the ectopic chondrogenesis, osteogenesis and joint fusions seen in FOP.