952 resultados para Data sets storage


Relevância:

80.00% 80.00%

Publicador:

Resumo:

BACKGROUND: Available methods to simulate nucleotide or amino acid data typically use Markov models to simulate each position independently. These approaches are not appropriate to assess the performance of combinatorial and probabilistic methods that look for coevolving positions in nucleotide or amino acid sequences. RESULTS: We have developed a web-based platform that gives a user-friendly access to two phylogenetic-based methods implementing the Coev model: the evaluation of coevolving scores and the simulation of coevolving positions. We have also extended the capabilities of the Coev model to allow for the generalization of the alphabet used in the Markov model, which can now analyse both nucleotide and amino acid data sets. The simulation of coevolving positions is novel and builds upon the developments of the Coev model. It allows user to simulate pairs of dependent nucleotide or amino acid positions. CONCLUSIONS: The main focus of our paper is the new simulation method we present for coevolving positions. The implementation of this method is embedded within the web platform Coev-web that is freely accessible at http://coev.vital-it.ch/, and was tested in most modern web browsers.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

BACKGROUND: Several distributions of country-specific blood pressure (BP) percentiles by sex, age, and height for children and adolescents have been established worldwide. However, there are no globally unified BP references for defining elevated BP in children and adolescents, which limits international comparisons of the prevalence of pediatric elevated BP. We aimed to establish international BP references for children and adolescents by using 7 nationally representative data sets (China, India, Iran, Korea, Poland, Tunisia, and the United States). METHODS AND RESULTS: Data on BP for 52 636 nonoverweight children and adolescents aged 6 to 19 years were obtained from 7 large nationally representative cross-sectional surveys in China, India, Iran, Korea, Poland, Tunisia, and the United States. BP values were obtained with certified mercury sphygmomanometers in all 7 countries by using standard procedures for BP measurement. Smoothed BP percentiles (50th, 90th, 95th, and 99th) by age and height were estimated by using the Generalized Additive Model for Location Scale and Shape model. BP values were similar between males and females until the age of 13 years and were higher in males than females thereafter. In comparison with the BP levels of the 90th and 95th percentiles of the US Fourth Report at median height, systolic BP of the corresponding percentiles of these international references was lower, whereas diastolic BP was similar. CONCLUSIONS: These international BP references will be a useful tool for international comparison of the prevalence of elevated BP in children and adolescents and may help to identify hypertensive youths in diverse populations.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

As increasingly large molecular data sets are collected for phylogenomics, the conflicting phylogenetic signal among gene trees poses challenges to resolve some difficult nodes of the Tree of Life. Among these nodes, the phylogenetic position of the honey bees (Apini) within the corbiculate bee group remains controversial, despite its considerable importance for understanding the emergence and maintenance of eusociality. Here, we show that this controversy stems in part from pervasive phylogenetic conflicts among GC-rich gene trees. GC-rich genes typically have a high nucleotidic heterogeneity among species, which can induce topological conflicts among gene trees. When retaining only the most GC-homogeneous genes or using a nonhomogeneous model of sequence evolution, our analyses reveal a monophyletic group of the three lineages with a eusocial lifestyle (honey bees, bumble bees, and stingless bees). These phylogenetic relationships strongly suggest a single origin of eusociality in the corbiculate bees, with no reversal to solitary living in this group. To accurately reconstruct other important evolutionary steps across the Tree of Life, we suggest removing GC-rich and GC-heterogeneous genes from large phylogenomic data sets. Interpreted as a consequence of genome-wide variations in recombination rates, this GC effect can affect all taxa featuring GC-biased gene conversion, which is common in eukaryotes.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Mammalian physiology and behavior follow daily rhythms that are orchestrated by endogenous timekeepers known as circadian clocks. Rhythms in transcription are considered the main mechanism to engender rhythmic gene expression, but important roles for posttranscriptional mechanisms have recently emerged as well (reviewed in Lim and Allada (2013) [1]). We have recently reported on the use of ribosome profiling (RPF-seq), a method based on the high-throughput sequencing of ribosome protected mRNA fragments, to explore the temporal regulation of translation efficiency (Janich et al., 2015 [2]). Through the comparison of around-the-clock RPF-seq and matching RNA-seq data we were able to identify 150 genes, involved in ribosome biogenesis, iron metabolism and other pathways, whose rhythmicity is generated entirely at the level of protein synthesis. The temporal transcriptome and translatome data sets from this study have been deposited in NCBI's Gene Expression Omnibus under the accession number GSE67305. Here we provide additional information on the experimental setup and on important optimization steps pertaining to the ribosome profiling technique in mouse liver and to data analysis.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Aim: Emerging polyploids may depend on environmental niche shifts for successful establishment. Using the alpine plant Ranunculus kuepferi as a model system, we explore the niche shift hypothesis at different spatial resolutions and in contrasting parts of the species range. Location: European Alps. Methods: We sampled 12 individuals from each of 102 populations of R. kuepferi across the Alps, determined their ploidy levels, derived coarse-grain (100x100m) environmental descriptors for all sampling sites by downscaling WorldClim maps, and calculated fine-scale environmental descriptors (2x2m) from indicator values of the vegetation accompanying the sampled individuals. Both coarse and fine-scale variables were further computed for 8239 vegetation plots from across the Alps. Subsequently, we compared niche optima and breadths of diploid and tetraploid cytotypes by combining principal components analysis and kernel smoothing procedures. Comparisons were done separately for coarse and fine-grain data sets and for sympatric, allopatric and the total set of populations. Results: All comparisons indicate that the niches of the two cytotypes differ in optima and/or breadths, but results vary in important details. The whole-range analysis suggests differentiation along the temperature gradient to be most important. However, sympatric comparisons indicate that this climatic shift was not a direct response to competition with diploid ancestors. Moreover, fine-grained analyses demonstrate niche contraction of tetraploids, especially in the sympatric range, that goes undetected with coarse-grained data. Main conclusions: Although the niche optima of the two cytotypes differ, separation along ecological gradients was probably less decisive for polyploid establishment than a shift towards facultative apomixis, a particularly effective strategy to avoid minority cytotype exclusion. In addition, our results suggest that coarse-grained analyses overestimate niche breadths of widely distributed taxa. Niche comparison analyses should hence be conducted at environmental data resolutions appropriate for the organism and question under study.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Longline fisheries, oil spills, and offshore wind farms are some of the major threats increasing seabird mortality at sea, but the impact of these threats on specific populations has been difficult to determine so far. We tested the use of molecular markers, morphometric measures, and stable isotope (δ15N and δ13C) and trace element concentrations in the first primary feather (grown at the end of the breeding period) to assign the geographic origin of Calonectris shearwaters. Overall, we sampled birds from three taxa: 13 Mediterranean Cory's Shearwater (Calonectris diomedea diomedea) breeding sites, 10 Atlantic Cory's Shearwater (Calonectris diomedea borealis) breeding sites, and one Cape Verde Shearwater (C. edwardsii) breeding site. Assignment rates were investigated at three spatial scales: breeding colony, breeding archipelago, and taxa levels. Genetic analyses based on the mitochondrial control region (198 birds from 21 breeding colonies) correctly assigned 100% of birds to the three main taxa but failed in detecting geographic structuring at lower scales. Discriminant analyses based on trace elements composition achieved the best rate of correct assignment to colony (77.5%). Body measurements or stable isotopes mainly succeeded in assigning individuals among taxa (87.9% and 89.9%, respectively) but failed at the colony level (27.1% and 38.0%, respectively). Combining all three approaches (morphometrics, isotopes, and trace elements on 186 birds from 15 breeding colonies) substantially improved correct classifications (86.0%, 90.7%, and 100% among colonies, archipelagos, and taxa, respectively). Validations using two independent data sets and jackknife cross-validation confirmed the robustness of the combined approach in the colony assignment (62.5%, 58.8%, and 69.8% for each validation test, respectively). A preliminary application of the discriminant model based on stable isotope δ15N and δ13C values and trace elements (219 birds from 17 breeding sites) showed that 41 Cory's Shearwaters caught by western Mediterranean long-liners came mainly from breeding colonies in Menorca (48.8%), Ibiza (14.6%), and Crete (31.7%). Our findings show that combining analyses of trace elements and stable isotopes on feathers can achieve high rates of correct geographic assignment of birds in the marine environment, opening new prospects for the study of seabird mortality at sea.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Coastal birds are an integral part of coastal ecosystems, which nowadays are subject to severe environmental pressures. Effective measures for the management and conservation of seabirds and their habitats call for insight into their population processes and the factors affecting their distribution and abundance. Central to national and international management and conservation measures is the availability of accurate data and information on bird populations, as well as on environmental trends and on measures taken to solve environmental problems. In this thesis I address different aspects of the occurrence, abundance, population trends and breeding success of waterbirds breeding on the Finnish coast of the Baltic Sea, and discuss the implications of the results for seabird monitoring, management and conservation. In addition, I assess the position and prospects of coastal bird monitoring data, in the processing and dissemination of biodiversity data and information in accordance with the Convention on Biological Diversity (CBD) and other national and international commitments. I show that important factors for seabird habitat selection are island area and elevation, water depth, shore openness, and the composition of island cover habitats. Habitat preferences are species-specific, with certain similarities within species groups. The occurrence of the colonial Arctic Tern (Sterna paradisaea) is partly affected by different habitat characteristics than its abundance. Using long-term bird monitoring data, I show that eutrophication and winter severity have reduced the populations of several Finnish seabird species. A major demographic factor through which environmental changes influence bird populations is breeding success. Breeding success can function as a more rapid indicator of sublethal environmental impacts than population trends, particularly for long-lived and slowbreeding species, and should therefore be included in coastal bird monitoring schemes. Among my target species, local breeding success can be shown to affect the populations of the Mallard (Anas platyrhynchos), the Eider (Somateria mollissima) and the Goosander (Mergus merganser) after a time lag corresponding to their species-specific recruitment age. For some of the target species, the number of individuals in late summer can be used as an easier and more cost-effective indicator of breeding success than brood counts. My results highlight that the interpretation and application of habitat and population studies require solid background knowledge of the ecology of the target species. In addition, the special characteristics of coastal birds, their habitats, and coastal bird monitoring data have to be considered in the assessment of their distribution and population trends. According to the results, the relationships between the occurrence, abundance and population trends of coastal birds and environmental factors can be quantitatively assessed using multivariate modelling and model selection. Spatial data sets widely available in Finland can be utilised in the calculation of several variables that are relevant to the habitat selection of Finnish coastal species. Concerning some habitat characteristics field work is still required, due to a lack of remotely sensed data or the low resolution of readily available data in relation to the fine scale of the habitat patches in the archipelago. While long-term data sets exist for water quality and weather, the lack of data concerning for instance the food resources of birds hampers more detailed studies of environmental effects on bird populations. Intensive studies of coastal bird species in different archipelago areas should be encouraged. The provision and free delivery of high-quality coastal data concerning bird populations and their habitats would greatly increase the capability of ecological modelling, as well as the management and conservation of coastal environments and communities. International initiatives that promote open spatial data infrastructures and sharing are therefore highly regarded. To function effectively, international information networks, such as the biodiversity Clearing House Mechanism (CHM) under the CBD, need to be rooted at regional and local levels. Attention should also be paid to the processing of data for higher levels of the information hierarchy, so that data are synthesized and developed into high-quality knowledge applicable to management and conservation.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Despite recent advances, early diagnosis of Alzheimer’s disease (AD) from electroencephalography (EEG) remains a difficult task. In this paper, we offer an added measure through which such early diagnoses can potentially be improved. One feature that has been used for discriminative classification is changes in EEG synchrony. So far, only the decrease of synchrony in the higher frequencies has been deeply analyzed. In this paper, we investigate the increase of synchrony found in narrow frequency ranges within the θ band. This particular increase of synchrony is used with the well-known decrease of synchrony in the band to enhance detectable differences between AD patients and healthy subjects. We propose a new synchrony ratio that maximizes the differences between two populations. The ratio is tested using two different data sets, one of them containing mild cognitive impairment patients and healthy subjects, and another one, containing mild AD patients and healthy subjects. The results presented in this paper show that classification rate is improved, and the statistical difference between AD patients and healthy subjects is increased using the proposed ratio.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Objective. Recently, significant advances have been made in the early diagnosis of Alzheimer’s disease from EEG. However, choosing suitable measures is a challenging task. Among other measures, frequency Relative Power and loss of complexity have been used with promising results. In the present study we investigate the early diagnosis of AD using synchrony measures and frequency Relative Power on EEG signals, examining the changes found in different frequency ranges. Approach. We first explore the use of a single feature for computing the classification rate, looking for the best frequency range. Then, we present a multiple feature classification system that outperforms all previous results using a feature selection strategy. These two approaches are tested in two different databases, one containing MCI and healthy subjects (patients age: 71.9 ± 10.2, healthy subjects age: 71.7 ± 8.3), and the other containing Mild AD and healthy subjects (patients age: 77.6 ± 10.0; healthy subjects age: 69.4± 11.5). Main Results. Using a single feature to compute classification rates we achieve a performance of 78.33% for the MCI data set and of 97.56 % for Mild AD. Results are clearly improved using the multiple feature classification, where a classification rate of 95% is found for the MCI data set using 11 features, and 100% for the Mild AD data set using 4 features. Significance. The new features selection method described in this work may be a reliable tool that could help to design a realistic system that does not require prior knowledge of a patient's status. With that aim, we explore the standardization of features for MCI and Mild AD data sets with promising results.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

The present study builds on a previous proposal for assigning probabilities to the outcomes computed using different primary indicators in single-case studies. These probabilities are obtained comparing the outcome to previously tabulated reference values and reflect the likelihood of the results in case there was no intervention effect. The current study explores how well different metrics are translated into p values in the context of simulation data. Furthermore, two published multiple baseline data sets are used to illustrate how well the probabilities could reflect the intervention effectiveness as assessed by the original authors. Finally, the importance of which primary indicator is used in each data set to be integrated is explored; two ways of combining probabilities are used: a weighted average and a binomial test. The results indicate that the translation into p values works well for the two nonoverlap procedures, with the results for the regression-based procedure diverging due to some undesirable features of its performance. These p values, both when taken individually and when combined, were well-aligned with the effectiveness for the real-life data. The results suggest that assigning probabilities can be useful for translating the primary measure into the same metric, using these probabilities as additional evidence on the importance of behavioral change, complementing visual analysis and professional's judgments.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

The uncertainty of any analytical determination depends on analysis and sampling. Uncertainty arising from sampling is usually not controlled and methods for its evaluation are still little known. Pierre Gy’s sampling theory is currently the most complete theory about samplingwhich also takes the design of the sampling equipment into account. Guides dealing with the practical issues of sampling also exist, published by international organizations such as EURACHEM, IUPAC (International Union of Pure and Applied Chemistry) and ISO (International Organization for Standardization). In this work Gy’s sampling theory was applied to several cases, including the analysis of chromite concentration estimated on SEM (Scanning Electron Microscope) images and estimation of the total uncertainty of a drug dissolution procedure. The results clearly show that Gy’s sampling theory can be utilized in both of the above-mentioned cases and that the uncertainties achieved are reliable. Variographic experiments introduced in Gy’s sampling theory are beneficially applied in analyzing the uncertainty of auto-correlated data sets such as industrial process data and environmental discharges. The periodic behaviour of these kinds of processes can be observed by variographic analysis as well as with fast Fourier transformation and auto-correlation functions. With variographic analysis, the uncertainties are estimated as a function of the sampling interval. This is advantageous when environmental data or process data are analyzed as it can be easily estimated how the sampling interval is affecting the overall uncertainty. If the sampling frequency is too high, unnecessary resources will be used. On the other hand, if a frequency is too low, the uncertainty of the determination may be unacceptably high. Variographic methods can also be utilized to estimate the uncertainty of spectral data produced by modern instruments. Since spectral data are multivariate, methods such as Principal Component Analysis (PCA) are needed when the data are analyzed. Optimization of a sampling plan increases the reliability of the analytical process which might at the end have beneficial effects on the economics of chemical analysis,

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Imide compounds have shown biological activity. These compounds can be easily synthesized with good yields. The objective of this paper was the rational planning of imides and sulfonamides with antinociceptive activity using the 3D-QSAR/CoMFA approach. The studies were performed using two data sets. The first set consisted of 39 cyclic imides while the second set consisted of 39 imides and 15 sulfonamides. The 3D- QSAR/CoMFA models have shown that the steric effect is important for the antinociceptive activity of imide and sulphonamide compounds. Ten new compounds with improved potential antinociceptive activity have been proposed by de novo design leapfrog simulations.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Most metazoans rely on aerobic energy production, which is dependent on adequate oxygen supply. In the case of reduced oxygen supply (hypoxia), the most profound changes in gene expression are mediated by transcription factors named hypoxia-inducible factors (HIF alpha). These proteins are post-translationally regulated by prolyl-4-hydroxylase (PHD) enzymes that are direct “sensors” of cellular oxygen levels. This thesis examines the molecular evolution of metazoan HIF systems. In early metazoans the HIF system emerged from pre-existing PHD oxygen sensors and early bHLH-PAS transcription factors. In invertebrates our analysis revealed an unexpected diversity of PHD genes and HIF alpha sequence characteristics. An early branching vertebrate, the epaulette shark (Hemiscyllium ocellatum) was chosen for sequencing and hypoxia preconditioning studies of HIF alpha and PHD genes. As no quantitative PCR reference genes were available, this thesis includes the first study of reference genes in cartilaginous fish species. Applying multiple statistical analysis we also discoveredthat commonly used reference gene software may perform poorly with some data sets. Novel reference genes allowed accurate measurements of the mRNAlevels of the studied target genes. Cartilaginous fishes have three genomic duplicates of both HIF alpha and PHD genes like mammals and teleost fishes. Combining functional divergence and selection analyses it was possible to describe how sequence changes in both HIF alpha and PHD duplicates may have contributed to the differential oxygen sensitivityof HIF alphas. Additionally, novel teleost HIF-1 alpha sequences were produced and used to reveal the molecular evolution of HIF-1 alpha in this lineage rich with hypoxia tolerant species.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

App Engine on lyhenne englanninkielisistä termeistä application, sovellus ja engine, moottori. Kyseessä on Google, Inc. -konsernin toteuttama kaupallinen palvelu, joka noudattaa pilvimallin tietojenkäsittelyn periaatteita ja mahdollistaa asiakkaan oman sovelluskehityksen. Järjestelmään on mahdollista ohjelmoida itse ideoitu palvelu Internet - verkon välityksellä käytettäväksi, joko yksityisesti tai julkisesti. Kyse on siis hajautetusta palvelinjärjestelmästä, jonka tarjoaa dynaamisesti kuormitukseen sopeutuvan sovellusalustan, jossa asiakas ei vuokraa virtuaalikoneita. Myös järjestelmän tarjoama tallennuskapasiteetti on saatavilla joustavasti. Itse kandidaatintyössä syvennytään yksityiskohtaisemmin sovelluksen toteuttamiseen palvelussa, rajoitteisiin ja soveltuvuuteen. Alussa käydään läpi pilvikäsite, joista monilla tietokoneiden käyttäjillä on epäselvä käsitys. Erilaisia kokonaisuuksia voidaan luoda erittäin monella tavalla, joista rajaamme käsittelyn kohteeksi toteuttamiskelpoiset yleiset ratkaisut.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Although social capital and health have been extensively studied during the last decade, there are still open issues in current empirical research. These concern for instance the measurement of the concept in different contexts, as well as the association between different types of social capital and different dimensions of health. The present thesis addressed these questions. The general aim was to promote the understanding of social capital and health by investigating the oldest old and the two major language groups in Finland, Swedish- and Finnish-speakers. Another aim was to contribute to the discussion on methodological issues in social capital and health research. The present thesis investigated two empirical data sets, Umeå 85+ and Health 2000. The Umeå 85+ study was a cross-sectional study of 163 individuals aged 85, 90, and 95 or older, living in the municipality of Umeå, Sweden, in the year of 2000. The Health 2000 survey was a national study of 8,028 persons aged 30 or above carried out in Finland in 2000-2001. Different indicators of structural (e.g. social contacts) and cognitive (e.g. trust) social capital, as well as health indicators were used as variables in the analyses. The Umeå 85+ data set was analyzed with factor analysis, as well as univariate and multivariate analysis of variance. The Health 2000 data was analyzed with logistic regression techniques. The results showed that the Swedish-speakers in the Finnish data set Health 2000 had consistently higher prevalence of social capital compared to the Finnish-speakers even after controlling for central sociodemographic variables. The results further showed that even if the language group differences in health were small, the Swedishspeakers experienced in general better self-reported health compared with the Finnish-speakers. Common sociodemographic variables could not explain these observed differences in health. The results imply that social capital is often, but not always, associated with health. This was clearly seen in the Umeå 85+ data set where only one health indicator (depressive symptoms) was associated with structural social capital among the oldest old. The results based on the analysis of the Health 2000 survey demonstrated that the cognitive component of social capital was associated with self-rated health and psychological health rather than with participation in social activities and social contacts. In addition, social capital statistically reduced the health advantage especially for Swedish-speaking men, indicating that high prevalence of social capital may promote health. Finally, the present thesis also discussed the issue of methodological challenges faced with when analyzing social capital and health. It was suggested that certain components of social capital such as bonding and bridging social capital may be more relevant than structural and cognitive components when investigating social capital among the two language groups in Finland. The results concerning the oldest old indicated that the structural aspects of social capital probably reflect current living conditions, whereas cognitive social capital reflects attitudes and traits often acquired decades earlier. This is interpreted as an indication of the fact that structural and cognitive social capital are closely related yet empirically two distinctive concepts. Taken together, some components of social capital may be more relevant to study than others depending on which population group and age group is under study. The results also implied that the choice of cut-off point of dichotomization of selfrated health has an impact on the estimated effects of the explanatory variables. When the whole age interval, 35-64 years, was analyzed with logistic regression techniques the choice of cut-off point did not matter for the estimated effects of marital status and educational level. The results changed, however, when the age interval was divided into three shorter intervals. If self-rated health is explored using wide age intervals that do not account for age-dependent covariates there is a risk of drawing misleading conclusions. In conclusion, the results presented in the thesis suggest that the uneven distribution of social capital observed between the two language groups in Finland are of importance when trying to further understand health inequalities that exist between Swedish- and Finnish-speakers in Finland. Although social capital seemed to be relevant to the understanding of health among the oldest old, the meaning of social capital is probably different compared to a less vulnerable age group. This should be noticed in future empirical research. In the present thesis, it was shown that the relationship between social capital and health is complex and multidimensional. Different aspects of social capital seem to be important for different aspects of health. This reduces the possibility to generalize the results and to recommend general policy implementations in this area. An increased methodological awareness regarding social capital as well as health are called for in order to further understand the cfomplex association between them. However, based on the present data and findings social capital is associated with health. To understand individual health one must also consider social aspects of the individuals’ environment such as social capital.