960 resultados para data sets


Relevância:

60.00% 60.00%

Publicador:

Resumo:

Purpose Improved survival for men with prostate cancer has led to increased attention to factors influencing quality of life (QOL). As protein levels of vascular endothelial growth factor (VEGF) and insulin-like growth factor 1 (IGF-1) have been reported to be associated with QOL in people with cancer, we sought to identify whether single-nucleotide polymorphisms (SNPs) of these genes were associated with QOL in men with prostate cancer. Methods Multiple linear regression of two data sets (including approximately 750 men newly diagnosed with prostate cancer and 550 men from the general population) was used to investigate SNPs of VEGF and IGF-1 (10 SNPs in total) for associations with QOL (measured by the SF-36v2 health survey). Results Men with prostate cancer who carried the minor ‘T’ allele for IGF-1 SNP rs35767 had higher mean Role-Physical scale scores (≥0.3 SD) compared to non-carriers (p < 0.05). While this association was not identified in men from the general population, one IGF-1 SNP rs7965399 was associated with higher mean Bodily Pain scale scores in men from the general population that was not found in men with prostate cancer. Men from the general population who carried the rare ‘C’ allele had higher mean Bodily Pain scale scores (≥0.3 SD) than non-carriers (p < 0.05). Conclusions Through identifying SNPs that are associated with QOL in men with prostate cancer and men from the general population, this study adds to the mapping of complex interrelationships that influence QOL and suggests a role for IGF-I in physical QOL outcomes. Future research may identify biomarkers associated with increased risk of poor QOL that could assist in the provision of pre-emptive support for those identified at risk.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

This project constructed virtual plant leaf surfaces from digitised data sets for use in droplet spray models. Digitisation techniques for obtaining data sets for cotton, chenopodium and wheat leaves are discussed and novel algorithms for the reconstruction of the leaves from these three plant species are developed. The reconstructed leaf surfaces are included into agricultural droplet spray models to investigate the effect of the nozzle and spray formulation combination on the proportion of spray retained by the plant. A numerical study of the post-impaction motion of large droplets that have formed on the leaf surface is also considered.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Substation Automation Systems have undergone many transformational changes triggered by improvements in technologies. Prior to the digital era, it made sense to confirm that the physical wiring matched the schematic design by meticulous and laborious point to point testing. In this way, human errors in either the design or the construction could be identified and fixed prior to entry into service. However, even though modern secondary systems today are largely computerised, we are still undertaking commissioning testing using the same philosophy as if each signal were hard wired. This is slow and tedious and doesn’t do justice to modern computer systems and software automation. One of the major architectural advantages of the IEC 61850 standard is that it “abstracts” the definition of data and services independently of any protocol allowing the mapping of them to any protocol that can meet the modelling and performance requirements. On this basis, any substation element can be defined using these common building blocks and are made available at the design, configuration and operational stages of the system. The primary advantage of accessing data using this methodology rather than the traditional position method (such as DNP 3.0) is that generic tools can be created to manipulate data. Self-describing data contains the information that these tools need to manipulate different data types correctly. More importantly, self-describing data makes the interface between programs robust and flexible. This paper proposes that the improved data definitions and methods for dealing with this data within a tightly bound and compliant IEC 61850 Substation Automation System could completely revolutionise the need to test systems when compared to traditional point to point methods. Using the outcomes of an undergraduate thesis project, we can demonstrate with some certainty that it is possible to automatically test the configuration of a protection relay by comparing the IEC 61850 configuration extracted from the relay against its SCL file for multiple relay vendors. The software tool provides a quick and automatic check that the data sets on a particular relay are correct according to its CID file, thus ensuring that no unexpected modifications are made at any stage of the commissioning process. This tool has been implemented in a Java programming environment using an open source IEC 61850 library to facilitate the server-client association with the relay.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

The discovery of several genes that affect the risk for Alzheimer's disease ignited a worldwide search for single-nucleotide polymorphisms (SNPs), common genetic variants that affect the brain. Genome-wide search of all possible SNP-SNP interactions is challenging and rarely attempted because of the complexity of conducting approximately 1011 pairwise statistical tests. However, recent advances in machine learning, for example, iterative sure independence screening, make it possible to analyze data sets with vastly more predictors than observations. Using an implementation of the sure independence screening algorithm (called EPISIS), we performed a genome-wide interaction analysis testing all possible SNP-SNP interactions affecting regional brain volumes measured on magnetic resonance imaging and mapped using tensor-based morphometry. We identified a significant SNP-SNP interaction between rs1345203 and rs1213205 that explains 1.9% of the variance in temporal lobe volume. We mapped the whole brain, voxelwise effects of the interaction in the Alzheimer's Disease Neuroimaging Initiative data set and separately in an independent replication data set of healthy twins (Queensland Twin Imaging). Each additional loading in the interaction effect was associated with approximately 5% greater brain regional brain volume (a protective effect) in both Alzheimer's Disease Neuroimaging Initiative and Queensland Twin Imaging samples.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

An automated method for extracting brain volumes from three commonly acquired three-dimensional (3D) MR images (proton density, T1 weighted, and T2-weighted) of the human head is described. The procedure is divided into four levels: preprocessing, segmentation, scalp removal, and postprocessing. A user-provided reference point is the sole operator-dependent input required. The method's parameters were first optimized and then fixed and applied to 30 repeat data sets from 15 normal older adult subjects to investigate its reproducibility. Percent differences between total brain volumes (TBVs) for the subjects' repeated data sets ranged from .5% to 2.2%. We conclude that the method is both robust and reproducible and has the potential for wide application.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

This research investigates how to obtain accurate and reliable positioning results with global navigation satellite systems (GNSS). The work provides a theoretical framework for reliability control in GNSS carrier phase ambiguity resolution, which is the key technique for precise GNSS positioning in centimetre levels. The proposed approach includes identification and exclusion procedures of unreliable solutions and hypothesis tests, allowing the reliability of solutions to be controlled in the aspects of mathematical models, integer estimation and ambiguity acceptance tests. Extensive experimental results with both simulation and observed data sets effectively demonstrate the reliability performance characteristics based on the proposed theoretical framework and procedures.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Objective: To replicate and refine the reported association of ankylosing spondylitis (AS) with two nonsynonymous single nucleotide polymorphisms (nsSNPs) on chromosome 16q22.1. Methods: Firstly, 730 independent UK patients with AS were genotyped for rs9939768 and rs6979 and allele frequencies were compared with 2879 previously typed historic disease controls. Secondly, the two data sets were combined in meta-analyses. Finally, 5 tagging SNPs, located between rs9939768 and rs6979, were analysed in 1604 cases and 1020 controls. Results: The association of rs6979 with AS was replicated, p=0.03, OR=1.14 (95% CI 1.01 to 1.28), and a trend for association with rs9939768 detected, p=0.06, OR=1.25 (95% CI 0.99 to 1.57). Meta-analyses revealed association of both SNPs with AS, p=0.0008, OR=1.31 (95% CI 1.12 to 1.54) and p=0.0009, OR=1.15 (95% CI 1.06 to 1.23) for rs9939768 and rs6979, respectively. New associations with rs9033 and rs868213 (p=0.00002, OR=1.23 (95% CI 1.12 to 1.36) and p=0.00002 OR=1.45 (95% CI 1.22 to 1.72), respectively, were identified. Conclusions: The region on chromosome 16 that has been replicated in the present work is interesting as the highly plausible candidate gene, tumour necrosis factor receptor type 1 (TNFR1)-associated death domain (TRADD), is located between rs9033 and rs868213. It will require additional work to identify the primary genetic association(s) with AS.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

This review is focused on the impact of chemometrics for resolving data sets collected from investigations of the interactions of small molecules with biopolymers. These samples have been analyzed with various instrumental techniques, such as fluorescence, ultraviolet–visible spectroscopy, and voltammetry. The impact of two powerful and demonstrably useful multivariate methods for resolution of complex data—multivariate curve resolution–alternating least squares (MCR–ALS) and parallel factor analysis (PARAFAC)—is highlighted through analysis of applications involving the interactions of small molecules with the biopolymers, serum albumin, and deoxyribonucleic acid. The outcomes illustrated that significant information extracted by the chemometric methods was unattainable by simple, univariate data analysis. In addition, although the techniques used to collect data were confined to ultraviolet–visible spectroscopy, fluorescence spectroscopy, circular dichroism, and voltammetry, data profiles produced by other techniques may also be processed. Topics considered including binding sites and modes, cooperative and competitive small molecule binding, kinetics, and thermodynamics of ligand binding, and the folding and unfolding of biopolymers. Applications of the MCR–ALS and PARAFAC methods reviewed were primarily published between 2008 and 2013.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

The climate in the Arctic is changing faster than anywhere else on earth. Poorly understood feedback processes relating to Arctic clouds and aerosol–cloud interactions contribute to a poor understanding of the present changes in the Arctic climate system, and also to a large spread in projections of future climate in the Arctic. The problem is exacerbated by the paucity of research-quality observations in the central Arctic. Improved formulations in climate models require such observations, which can only come from measurements in situ in this difficult-to-reach region with logistically demanding environmental conditions. The Arctic Summer Cloud Ocean Study (ASCOS) was the most extensive central Arctic Ocean expedition with an atmospheric focus during the International Polar Year (IPY) 2007–2008. ASCOS focused on the study of the formation and life cycle of low-level Arctic clouds. ASCOS departed from Longyearbyen on Svalbard on 2 August and returned on 9 September 2008. In transit into and out of the pack ice, four short research stations were undertaken in the Fram Strait: two in open water and two in the marginal ice zone. After traversing the pack ice northward, an ice camp was set up on 12 August at 87°21' N, 01°29' W and remained in operation through 1 September, drifting with the ice. During this time, extensive measurements were taken of atmospheric gas and particle chemistry and physics, mesoscale and boundary-layer meteorology, marine biology and chemistry, and upper ocean physics. ASCOS provides a unique interdisciplinary data set for development and testing of new hypotheses on cloud processes, their interactions with the sea ice and ocean and associated physical, chemical, and biological processes and interactions. For example, the first-ever quantitative observation of bubbles in Arctic leads, combined with the unique discovery of marine organic material, polymer gels with an origin in the ocean, inside cloud droplets suggests the possibility of primary marine organically derived cloud condensation nuclei in Arctic stratocumulus clouds. Direct observations of surface fluxes of aerosols could, however, not explain observed variability in aerosol concentrations, and the balance between local and remote aerosols sources remains open. Lack of cloud condensation nuclei (CCN) was at times a controlling factor in low-level cloud formation, and hence for the impact of clouds on the surface energy budget. ASCOS provided detailed measurements of the surface energy balance from late summer melt into the initial autumn freeze-up, and documented the effects of clouds and storms on the surface energy balance during this transition. In addition to such process-level studies, the unique, independent ASCOS data set can and is being used for validation of satellite retrievals, operational models, and reanalysis data sets.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

BACKGROUND Many patients presenting to the emergency department (ED) for assessment of possible acute coronary syndrome (ACS) have low cardiac troponin concentrations that change very little on repeat blood draw. It is unclear if a lack of change in cardiac troponin concentration can be used to identify acutely presenting patients at low risk of ACS. METHODS We used the hs-cTnI assay from Abbott Diagnostics, which can detect cTnI in the blood of nearly all people. We identified a population of ED patients being assessed for ACS with repeat cTnI measurement who ultimately were proven to have no acute cardiac disease at the time of presentation. We used data from the repeat sampling to calculate total within-person CV (CV(T)) and, knowing the assay analytical CV (CV(A)), we could calculate within-person biological variation (CV(i)), reference change values (RCVs), and absolute RCV delta cTnI concentrations. RESULTS We had data sets on 283 patients. Men and women had similar CV(i) values of approximately 14%, which was similar at all concentrations <40 ng/L. The biological variation was not dependent on the time interval between sample collections (t = 1.5-17 h). The absolute delta critical reference change value was similar no matter what the initial cTnI concentration was. More than 90% of subjects had a critical reference change value <5 ng/L, and 97% had values of <10 ng/L. CONCLUSIONS With this hs-cTnI assay, delta cTnI seems to be a useful tool for rapidly identifying ED patients at low risk for possible ACS.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Species identification based on short sequences of DNA markers, that is, DNA barcoding, has emerged as an integral part of modern taxonomy. However, software for the analysis of large and multilocus barcoding data sets is scarce. The Basic Local Alignment Search Tool (BLAST) is currently the fastest tool capable of handling large databases (e.g. >5000 sequences), but its accuracy is a concern and has been criticized for its local optimization. However, current more accurate software requires sequence alignment or complex calculations, which are time-consuming when dealing with large data sets during data preprocessing or during the search stage. Therefore, it is imperative to develop a practical program for both accurate and scalable species identification for DNA barcoding. In this context, we present VIP Barcoding: a user-friendly software in graphical user interface for rapid DNA barcoding. It adopts a hybrid, two-stage algorithm. First, an alignment-free composition vector (CV) method is utilized to reduce searching space by screening a reference database. The alignment-based K2P distance nearest-neighbour method is then employed to analyse the smaller data set generated in the first stage. In comparison with other software, we demonstrate that VIP Barcoding has (i) higher accuracy than Blastn and several alignment-free methods and (ii) higher scalability than alignment-based distance methods and character-based methods. These results suggest that this platform is able to deal with both large-scale and multilocus barcoding data with accuracy and can contribute to DNA barcoding for modern taxonomy. VIP Barcoding is free and available at http://msl.sls.cuhk.edu.hk/vipbarcoding/.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Structural equation modeling (SEM) is a powerful statistical approach for the testing of networks of direct and indirect theoretical causal relationships in complex data sets with intercorrelated dependent and independent variables. SEM is commonly applied in ecology, but the spatial information commonly found in ecological data remains difficult to model in a SEM framework. Here we propose a simple method for spatially explicit SEM (SE-SEM) based on the analysis of variance/covariance matrices calculated across a range of lag distances. This method provides readily interpretable plots of the change in path coefficients across scale and can be implemented using any standard SEM software package. We demonstrate the application of this method using three studies examining the relationships between environmental factors, plant community structure, nitrogen fixation, and plant competition. By design, these data sets had a spatial component, but were previously analyzed using standard SEM models. Using these data sets, we demonstrate the application of SE-SEM to regularly spaced, irregularly spaced, and ad hoc spatial sampling designs and discuss the increased inferential capability of this approach compared with standard SEM. We provide an R package, sesem, to easily implement spatial structural equation modeling.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Recent advances in optical and fluorescent protein technology have rapidly raised expectations in cell biology, allowing quantitative insights into dynamic intracellular processes like never before. However, quantitative live-cell imaging comes with many challenges including how best to translate dynamic microscopy data into numerical outputs that can be used to make meaningful comparisons rather than relying on representative data sets. Here, we use analysis of focal adhesion turnover dynamics as a straightforward specific example on how to image, measure, and analyze intracellular protein dynamics, but we believe this outlines a thought process and can provide guidance on how to understand dynamic microcopy data of other intracellular structures.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

In this paper, we aim at predicting protein structural classes for low-homology data sets based on predicted secondary structures. We propose a new and simple kernel method, named as SSEAKSVM, to predict protein structural classes. The secondary structures of all protein sequences are obtained by using the tool PSIPRED and then a linear kernel on the basis of secondary structure element alignment scores is constructed for training a support vector machine classifier without parameter adjusting. Our method SSEAKSVM was evaluated on two low-homology datasets 25PDB and 1189 with sequence homology being 25% and 40%, respectively. The jackknife test is used to test and compare our method with other existing methods. The overall accuracies on these two data sets are 86.3% and 84.5%, respectively, which are higher than those obtained by other existing methods. Especially, our method achieves higher accuracies (88.1% and 88.5%) for differentiating the α + β class and the α/β class compared to other methods. This suggests that our method is valuable to predict protein structural classes particularly for low-homology protein sequences. The source code of the method in this paper can be downloaded at http://math.xtu.edu.cn/myphp/math/research/source/SSEAK_source_code.rar.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

A major challenge in human genetics is to devise a systematic strategy to integrate disease-associated variants with diverse genomic and biological data sets to provide insight into disease pathogenesis and guide drug discovery for complex traits such as rheumatoid arthritis (RA)1. Here we performed a genome-wide association study meta-analysis in a total of >100,000 subjects of European and Asian ancestries (29,880 RA cases and 73,758 controls), by evaluating ~10 million single-nucleotide polymorphisms. We discovered 42 novel RA risk loci at a genome-wide level of significance, bringing the total to 101 (refs 2, 3, 4). We devised an in silico pipeline using established bioinformatics methods based on functional annotation5, cis-acting expression quantitative trait loci6 and pathway analyses7, 8, 9—as well as novel methods based on genetic overlap with human primary immunodeficiency, haematological cancer somatic mutations and knockout mouse phenotypes—to identify 98 biological candidate genes at these 101 risk loci. We demonstrate that these genes are the targets of approved therapies for RA, and further suggest that drugs approved for other indications may be repurposed for the treatment of RA. Together, this comprehensive genetic study sheds light on fundamental genes, pathways and cell types that contribute to RA pathogenesis, and provides empirical evidence that the genetics of RA can provide important information for drug discovery.