970 results for Datasets
Abstract:
• PREMISE OF THE STUDY: Understanding fern (monilophyte) phylogeny and its evolutionary timescale is critical for broad investigations of the evolution of land plants, and for providing the point of comparison necessary for studying the evolution of the fern sister group, seed plants. Molecular phylogenetic investigations have revolutionized our understanding of fern phylogeny; however, to date, these studies have relied almost exclusively on plastid data. • METHODS: Here we take a curated phylogenomics approach to infer the first broad fern phylogeny from multiple nuclear loci, by combining broad taxon sampling (73 ferns and 12 outgroup species) with focused character sampling (25 loci comprising 35,877 bp), along with rigorous alignment, orthology inference and model selection. • KEY RESULTS: Our phylogeny corroborates some earlier inferences and provides novel insights; in particular, we find strong support for Equisetales as sister to the rest of ferns, Marattiales as sister to leptosporangiate ferns, and Dennstaedtiaceae as sister to the eupolypods. Our divergence-time analyses reveal that divergences among the extant fern orders all occurred prior to ∼200 MYA. Finally, our species-tree inferences are congruent with analyses of concatenated data, but generally with lower support. Those cases where species-tree support values are higher than expected involve relationships that have been supported by smaller plastid datasets, suggesting that deep coalescence may be reducing support from the concatenated nuclear data. • CONCLUSIONS: Our study demonstrates the utility of a curated phylogenomics approach to inferring fern phylogeny, and highlights the need to consider underlying data characteristics, along with data quantity, in phylogenetic studies.
Abstract:
With increasing recognition of the roles RNA molecules and RNA/protein complexes play in an unexpected variety of biological processes, understanding of RNA structure-function relationships is of high current importance. To make clean biological interpretations from three-dimensional structures, it is imperative to have high-quality, accurate RNA crystal structures available, and the community has thoroughly embraced that goal. However, due to the many degrees of freedom inherent in RNA structure (especially for the backbone), it is a significant challenge to succeed in building accurate experimental models for RNA structures. This chapter describes the tools and techniques our research group and our collaborators have developed over the years to help RNA structural biologists both evaluate and achieve better accuracy. Expert analysis of large, high-resolution, quality-conscious RNA datasets provides the fundamental information that enables automated methods for robust and efficient error diagnosis in validating RNA structures at all resolutions. The even more crucial goal of correcting the diagnosed outliers has steadily developed toward highly effective, computationally based techniques. Automation enables solving complex issues in large RNA structures, but cannot circumvent the need for thoughtful examination of local details, and so we also provide some guidance for interpreting and acting on the results of current structure validation for RNA.
Abstract:
BACKGROUND: Determining the evolutionary relationships among the major lineages of extant birds has been one of the biggest challenges in systematic biology. To address this challenge, we assembled or collected the genomes of 48 avian species spanning most orders of birds, including all Neognathae and two of the five Palaeognathae orders. We used these genomes to construct a genome-scale avian phylogenetic tree and perform comparative genomic analyses. FINDINGS: Here we present the datasets associated with the phylogenomic analyses, which include sequence alignment files consisting of nucleotides, amino acids, indels, and transposable elements, as well as tree files containing gene trees and species trees. Inferring an accurate phylogeny required generating: 1) a well-annotated data set across species based on genome synteny; 2) alignments with unaligned or incorrectly overaligned sequences filtered out; and 3) diverse data sets, including genes and their inferred trees, indels, and transposable elements. Our total evidence nucleotide tree (TENT) data set (consisting of exons, introns, and UCEs) gave what we consider our most reliable species tree when using the concatenation-based ExaML algorithm or when using statistical binning with the coalescence-based MP-EST algorithm (which we refer to as MP-EST*). Other data sets, such as the coding sequence of some exons, revealed other properties of genome evolution, namely convergence. CONCLUSIONS: To our knowledge, the Avian Phylogenomics Project is the largest vertebrate phylogenomics project to date. The sequence, alignment, and tree data are expected to accelerate analyses in phylogenomics and other related areas.
Abstract:
Transcriptional regulation has been studied intensively in recent decades. One important aspect of this regulation is the interaction between regulatory proteins, such as transcription factors (TFs) and nucleosomes, and the genome. Different high-throughput techniques have been invented to map these interactions genome-wide, including ChIP-based methods (ChIP-chip, ChIP-seq, etc.), nuclease digestion methods (DNase-seq, MNase-seq, etc.), and others. However, a single experimental technique often provides only partial and noisy information about the whole picture of protein-DNA interactions. Therefore, the overarching goal of this dissertation is to provide computational developments for jointly modeling different experimental datasets to achieve a holistic inference of the protein-DNA interaction landscape.
We first present a computational framework that can incorporate the protein binding information in MNase-seq data into a thermodynamic model of protein-DNA interaction. We use a correlation-based objective function to model the MNase-seq data and a Markov chain Monte Carlo method to maximize the function. Our results show that the inferred protein-DNA interaction landscape is concordant with the MNase-seq data and provides a mechanistic explanation for the experimentally collected MNase-seq fragments. Our framework is flexible and can easily incorporate other data sources. To demonstrate this flexibility, we use prior distributions to integrate experimentally measured protein concentrations.
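To make the general approach concrete, the following minimal Python sketch (not the dissertation's code; the toy occupancy model, site positions and all parameter values are illustrative assumptions) uses a Metropolis-Hastings random walk over binding energies and scores each proposal by the Pearson correlation between a simple thermodynamic occupancy profile and an observed MNase-seq-like coverage track.

```python
# Minimal sketch (illustrative only): Metropolis-Hastings over binding energies,
# scored by correlation between a toy occupancy profile and an MNase-seq-like signal.
import numpy as np

rng = np.random.default_rng(0)

def occupancy(energies, positions, genome_len=500, footprint=20):
    """Toy Boltzmann occupancy: each site contributes its binding probability over a footprint."""
    occ = np.zeros(genome_len)
    weights = np.exp(-np.asarray(energies))
    probs = weights / (1.0 + weights)            # independent-site approximation
    for p, pr in zip(positions, probs):
        occ[p:p + footprint] += pr
    return occ

def correlation_score(energies, positions, observed):
    pred = occupancy(energies, positions, genome_len=observed.size)
    return np.corrcoef(pred, observed)[0, 1]

# Synthetic "observed" MNase-seq coverage and candidate binding sites (assumptions).
observed = np.convolve(rng.poisson(2.0, 500), np.ones(20) / 20, mode="same")
sites = [50, 180, 320, 440]

current = np.zeros(len(sites))                   # initial binding energies
current_score = correlation_score(current, sites, observed)
best, best_score = current.copy(), current_score
for _ in range(2000):                            # Metropolis-Hastings updates
    proposal = current + rng.normal(0.0, 0.3, size=current.size)
    prop_score = correlation_score(proposal, sites, observed)
    delta = prop_score - current_score
    if delta > 0 or rng.random() < np.exp(delta / 0.05):   # temperature 0.05
        current, current_score = proposal, prop_score
        if current_score > best_score:
            best, best_score = current.copy(), current_score

print("best correlation:", round(best_score, 3), "energies:", np.round(best, 2))
```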
We also study the ability of DNase-seq data to position nucleosomes. Traditionally, DNase-seq has only been widely used to identify DNase hypersensitive sites, which tend to be open chromatin regulatory regions devoid of nucleosomes. We reveal for the first time that DNase-seq datasets also contain substantial information about nucleosome translational positioning, and that existing DNase-seq data can be used to infer nucleosome positions with high accuracy. We develop a Bayes-factor-based nucleosome scoring method to position nucleosomes using DNase-seq data. Our approach utilizes several effective strategies to extract nucleosome positioning signals from the noisy DNase-seq data, including jointly modeling data points across the nucleosome body and explicitly modeling the quadratic and oscillatory DNase I digestion pattern on nucleosomes. We show that our DNase-seq-based nucleosome map is highly consistent with previous high-resolution maps. We also show that the oscillatory DNase I digestion pattern is useful in revealing the nucleosome rotational context around TF binding sites.
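A rough illustration of scoring a candidate nucleosome position follows; it is not the dissertation's method, and the profile shape, ~10 bp periodicity and Poisson likelihoods are assumptions. It contrasts the likelihood of DNase-like cut counts under a quadratic, oscillatory cut-rate profile with a flat background, as a crude stand-in for a Bayes-factor score.

```python
# Minimal sketch (illustrative only): Poisson log-likelihood ratio between a
# quadratic, ~10.3 bp-periodic nucleosome cut profile and a flat background.
import numpy as np
from scipy.stats import poisson

def nucleosome_profile(length=147, period=10.3, depth=0.6):
    """Assumed relative DNase I cut rate across a nucleosome body."""
    x = np.linspace(-1.0, 1.0, length)
    quadratic = 1.0 - depth * (1.0 - x**2)                # protected in the middle
    oscillation = 1.0 + 0.3 * np.cos(2 * np.pi * np.arange(length) / period)
    profile = quadratic * oscillation
    return profile / profile.mean()

def log_score(cuts, profile):
    """Log-likelihood ratio, standing in for a Bayes factor with point priors."""
    rate = cuts.mean()
    ll_nuc = poisson.logpmf(cuts, rate * profile).sum()
    ll_bg = poisson.logpmf(cuts, rate).sum()
    return ll_nuc - ll_bg

rng = np.random.default_rng(1)
profile = nucleosome_profile()
cuts_nuc = rng.poisson(3.0 * profile)      # window drawn from the nucleosome model
cuts_bg = rng.poisson(3.0, size=147)       # window drawn from flat background

print("nucleosome-like window score:", round(log_score(cuts_nuc, profile), 1))
print("background window score:     ", round(log_score(cuts_bg, profile), 1))
```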
Finally, we present a state-space model (SSM) for jointly modeling different kinds of genomic data to provide an accurate view of the protein-DNA interaction landscape. We also provide an efficient expectation-maximization algorithm to learn model parameters from data. We first show in simulation studies that the SSM can effectively recover the underlying true protein binding configurations. We then apply the SSM to model real genomic data (both DNase-seq and MNase-seq data). By incrementally increasing the types of genomic data in the SSM, we show that different data types can contribute complementary information for the inference of the protein binding landscape and that the most accurate inference comes from modeling all available datasets.
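The sketch below conveys the flavour of such an EM procedure on joint DNase-seq/MNase-seq signals; it is a simplified stand-in (an independent-position, two-state mixture rather than the dissertation's state-space model), and all distributions, signal levels and parameters are assumptions.

```python
# Minimal sketch (illustrative only): EM for a two-state ("bound"/"unbound") mixture
# over positions, with each state emitting Gaussian DNase-seq and MNase-seq signals.
import numpy as np
from scipy.stats import norm

rng = np.random.default_rng(2)

# Simulate 1000 positions: 30% bound, with distinct DNase/MNase signal levels.
truth = rng.random(1000) < 0.3
dnase = np.where(truth, rng.normal(4.0, 1.0, 1000), rng.normal(1.0, 1.0, 1000))
mnase = np.where(truth, rng.normal(0.5, 1.0, 1000), rng.normal(3.0, 1.0, 1000))
data = np.column_stack([dnase, mnase])

pi, mu_bound, mu_unbound = 0.5, np.array([3.0, 1.0]), np.array([2.0, 2.0])
for _ in range(50):
    # E-step: posterior probability that each position is bound.
    ll_bound = norm.logpdf(data, mu_bound, 1.0).sum(axis=1) + np.log(pi)
    ll_unbound = norm.logpdf(data, mu_unbound, 1.0).sum(axis=1) + np.log(1 - pi)
    resp = 1.0 / (1.0 + np.exp(ll_unbound - ll_bound))
    # M-step: update mixing weight and state means.
    pi = resp.mean()
    mu_bound = (resp[:, None] * data).sum(axis=0) / resp.sum()
    mu_unbound = ((1 - resp)[:, None] * data).sum(axis=0) / (1 - resp).sum()

print("estimated bound fraction:", round(pi, 2))
print("bound-state means (DNase, MNase):  ", np.round(mu_bound, 2))
print("unbound-state means (DNase, MNase):", np.round(mu_unbound, 2))
```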
This dissertation provides a foundation for future research by taking a step toward genome-wide inference of the protein-DNA interaction landscape through data integration.
Abstract:
The second round of the community-wide initiative Critical Assessment of automated Structure Determination of Proteins by NMR (CASD-NMR-2013) comprised ten blind target datasets, consisting of unprocessed spectral data, assigned chemical shift lists and unassigned NOESY peak and RDC lists, that were made available in either curated (i.e. manually refined) or un-curated (i.e. automatically generated) form. Ten structure calculation programs, using fully automated protocols only, generated a total of 164 three-dimensional structures (entries) for the ten targets, sometimes using both curated and un-curated lists to generate multiple entries for a single target. The accuracy of the entries could be established by comparing them to the corresponding manually solved structure of each target, which was not available at the time the data were provided. Across the entire data set, 71 % of all entries submitted achieved an accuracy better than 1.5 Å relative to the reference NMR structure. Methods based on NOESY peak lists achieved even better results, with up to 100 % of the entries within the 1.5 Å threshold for some programs. However, some methods did not converge for some targets using un-curated NOESY peak lists. Over 90 % of the entries achieved an accuracy better than the more relaxed threshold of 2.5 Å that was used in the previous CASD-NMR-2010 round. Comparisons between entries generated with un-curated versus curated peaks show only marginal improvements for the latter in those cases where both calculations converged.
Abstract:
There is concern in the Cross-Channel region of Nord-Pas-de-Calais (France) and Kent (Great Britain) regarding the extent of atmospheric pollution detected in the area from emitted gaseous (VOC, NOx, SO2) and particulate substances. In particular, the air quality of the Cross-Channel or "Trans-Manche" region is highly affected by the heavily industrialised area of Dunkerque, in addition to transportation sources linked to cross-channel traffic in Kent and Calais, posing threats to the environment and human health. In the framework of the cross-border EU Interreg IIIA activity, the joint Anglo-French project ATTMA has been commissioned to study Aerosol Transport in the Trans-Manche Atmosphere. Using ground monitoring data from UK and French networks, and with the assistance of satellite images, the project aims to determine dispersion patterns and identify the sources responsible for the pollutants. The findings of this study will increase awareness and have a bearing on future air quality policy in the region. Public interest is evident from the presence of local authorities on both sides of the English Channel as collaborators. The research is based on pollution transport simulations using (a) Lagrangian Particle Dispersion (LPD) models and (b) an Eulerian receptor-based model. This paper is concerned with part (a), the LPD models. LPD models are often used to numerically simulate the dispersion of a passive tracer in the planetary boundary layer by calculating the Lagrangian trajectories of thousands of notional particles. In this contribution, the project investigated the use of two widely used particle dispersion models: the Hybrid Single Particle Lagrangian Integrated Trajectory (HYSPLIT) model and the model FLEXPART. In both models, forward tracking and inverse (or receptor-based) modes are possible. Certain distinct pollution episodes have been selected from the monitoring database EXPER/PF and from UK monitoring stations, and their likely trajectories predicted using prevailing weather data. Global meteorological datasets were downloaded from the ECMWF MARS archive. Part of the difficulty in identifying pollution sources arises from the fact that much of the pollution originates outside the monitoring area. For example, heightened particulate concentrations are thought to originate from sand storms in the Sahara or volcanic activity in Iceland or the Caribbean; this work identifies such long-range influences. The output of the simulations shows that there are notable differences between the formulations of FLEXPART and HYSPLIT, although both models used the same meteorological data and source input, suggesting that the identification of the primary emissions during air pollution episodes may be rather uncertain.
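To make the LPD idea concrete, the following toy Python sketch (not HYSPLIT or FLEXPART; the wind, turbulence scale and release setup are arbitrary assumptions) advects a cloud of notional particles with a mean wind plus a random-walk perturbation standing in for boundary-layer turbulence, i.e. the forward-tracking mode described above; an inverse run would simply reverse wind and time.

```python
# Minimal sketch (toy illustration only): forward Lagrangian dispersion of
# notional particles in 2-D with mean advection and random-walk "turbulence".
import numpy as np

rng = np.random.default_rng(3)

n_particles, n_steps, dt = 5000, 240, 60.0   # 5000 particles, 4 h at 60 s steps
wind = np.array([5.0, 1.5])                  # mean wind, m/s (assumed)
sigma_turb = 0.8                             # turbulent velocity scale, m/s (assumed)

pos = np.zeros((n_particles, 2))             # all particles released at the source
for _ in range(n_steps):
    turb = rng.normal(0.0, sigma_turb, size=(n_particles, 2))
    pos += (wind + turb) * dt                # advection plus stochastic spread

centre = pos.mean(axis=0) / 1000.0           # km downwind of the source
spread = pos.std(axis=0) / 1000.0
print(f"plume centre after 4 h: {centre.round(1)} km, spread (1 sigma): {spread.round(1)} km")
```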
Abstract:
Serial Analysis of Gene Expression (SAGE) is a relatively new method for monitoring gene expression levels and is expected to contribute significantly to progress in cancer treatment by enabling precise and early diagnosis. A promising application of SAGE gene expression data is the classification of tumors. In this paper, we build three event models (the multivariate Bernoulli model, the multinomial model and the normalized multinomial model) for SAGE data classification. Both binary classification and multicategory classification are investigated. Experiments on two SAGE datasets show that the multivariate Bernoulli model performs well with small feature sizes, the multinomial model performs better at large feature sizes, and the normalized multinomial model performs well at medium feature sizes. Overall, the multinomial model achieves the highest accuracy.
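For readers unfamiliar with the event models, the sketch below (illustrative only, on synthetic SAGE-like tag counts; it is not the paper's implementation or data) applies the standard multivariate Bernoulli and multinomial naive Bayes classifiers corresponding to the first two models.

```python
# Minimal sketch (illustrative only): Bernoulli vs multinomial naive Bayes event
# models applied to synthetic SAGE-like tag-count data for two tumour classes.
import numpy as np
from sklearn.naive_bayes import BernoulliNB, MultinomialNB
from sklearn.model_selection import cross_val_score

rng = np.random.default_rng(4)

n_samples, n_tags = 60, 200
labels = rng.integers(0, 2, n_samples)                   # two tumour classes
base = rng.gamma(2.0, 2.0, n_tags)                       # baseline tag abundances
effect = np.where(rng.random(n_tags) < 0.1, 3.0, 1.0)    # 10% differential tags
rates = np.outer(np.ones(n_samples), base) * np.where(labels[:, None] == 1, effect, 1.0)
counts = rng.poisson(rates)                              # SAGE-like count matrix

for name, model in [("multivariate Bernoulli", BernoulliNB()),
                    ("multinomial", MultinomialNB())]:
    acc = cross_val_score(model, counts, labels, cv=5).mean()
    print(f"{name:>22s} naive Bayes accuracy: {acc:.2f}")
```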
Abstract:
In 2000 a Review of Current Marine Observations in relation to present and future needs was undertaken by the Inter-Agency Committee for Marine Science and Technology (IACMST). The Marine Environmental Change Network (MECN) was initiated in 2002 as a direct response to the recommendations of the report. A key part of the current phase of the MECN is to ensure that information from the network is provided to policy makers and other end-users to enable them to produce more accurate assessments of ecosystem state and gain a clearer understanding of factors influencing change in marine ecosystems. The MECN holds workshops on an annual basis, bringing together partners maintaining time-series and long-term datasets as well as end-users interested in outputs from the network. It was decided that the first workshop of the MECN continuation phase should consist of an evaluation of the time series and data sets maintained by partners in the MECN with regard to their fitness for purpose for answering key science questions and informing policy development. This report is based on the outcomes of the workshop. Section one of the report contains a brief introduction to monitoring, time series and long-term datasets. The various terms are defined and the need for MECN-type data to complement compliance monitoring programmes is discussed. Outlines are also given of initiatives such as the United Kingdom Marine Monitoring and Assessment Strategy (UKMMAS) and Oceans 2025. Section two contains detailed information for each of the MECN time series and long-term datasets, including information on scientific outputs and current objectives. This information is mainly based on the presentations given at the workshop and therefore follows a format in which the following headings are addressed: origin of the time series, including original objectives; current objectives; policy relevance; and products (advice, publications, science and society). Section three consists of comments made by the review panel concerning all the time series and the network. Needs or issues highlighted by the panel with regard to the future of long-term datasets and time series in the UK are shown, along with advice and potential solutions where offered. The recommendations are divided into four categories: ‘The MECN and end-user requirements’; ‘Procedures & protocols’; ‘Securing data series’; and ‘Future developments’. Ever since marine environmental protection issues came to the fore in the 1960s, it has been recognised that a suitable evidence base on environmental change is required to support policy and management for UK waters. Section four gives a brief summary of the development of marine policy in the UK, along with comments on the availability and necessity of long-term marine observations for the implementation of this policy. Policy relating to three main areas is discussed: marine conservation (protecting biodiversity and marine ecosystems), marine pollution, and fisheries. The conclusion of this section is that there has always been a specific requirement for information on long-term change in marine ecosystems around the UK in order to address concerns over pollution, fishing and general conservation. It is now imperative that this need is addressed in order for the UK to be able to fulfil its policy commitments and manage marine ecosystems in the light of climate change and other factors.
Abstract:
The contract work has demonstrated that older data can be assessed and entered into the MR format. Older data has associated problems but is retrievable. The contract successfully imported all datasets as required. MNCR survey sheets fit well into the MR format. The data validation and verification process can be improved. A number of computerised short cuts can be suggested and the process made more intuitive. Such a move is vital if MR is to be adopted as a standard by the recording community both on a voluntary level and potentially by consultancies.
Abstract:
This work demonstrates an example of the importance of an adequate method for sub-sampling model results when comparing with in situ measurements. A test of model skill was performed by employing a point-to-point method to compare a multi-decadal hindcast against a sparse, unevenly distributed historic in situ dataset. The point-to-point method masked out all hindcast cells that did not have a corresponding in situ measurement, in order to match each in situ measurement against its most similar cell from the model. The application of the point-to-point method showed that the model was successful at reproducing the inter-annual variability of the in situ datasets. However, this success was not apparent when the measurements were aggregated to regional averages. Time series, data density and target diagrams were employed to illustrate the impact of switching from the regional average method to the point-to-point method. The comparison based on regional averages gave significantly different and sometimes contradictory results that could lead to erroneous conclusions about the model performance. Furthermore, the point-to-point technique is a more appropriate way to exploit sparse, uneven in situ data while compensating for the variability of its sampling. We therefore recommend that researchers take into account the limitations of the in situ datasets and process the model output to resemble the data as much as possible.
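The following short sketch (illustrative of the general idea, not the paper's code; the grid, field and observations are synthetic assumptions) contrasts point-to-point matching, where the model is sampled only at the cell nearest each observation, with a plain regional average.

```python
# Minimal sketch (illustrative only): compare a model field to sparse observations
# by nearest-cell (point-to-point) matching versus a simple regional average.
import numpy as np

rng = np.random.default_rng(5)

# Toy model field on a 1-degree grid and a sparse, uneven set of observations.
lats = np.arange(50.0, 60.0, 1.0)
lons = np.arange(-10.0, 2.0, 1.0)
model = 10.0 + 0.5 * lats[:, None] + 0.1 * lons[None, :] + rng.normal(0, 0.2, (lats.size, lons.size))

obs_lat = rng.uniform(50.0, 59.0, 40)
obs_lon = rng.uniform(-10.0, 1.0, 40)
obs_val = 10.0 + 0.5 * obs_lat + 0.1 * obs_lon + rng.normal(0, 0.5, 40)

# Point-to-point: sample the model only where there is an observation.
i = np.abs(lats[:, None] - obs_lat[None, :]).argmin(axis=0)
j = np.abs(lons[:, None] - obs_lon[None, :]).argmin(axis=0)
matched = model[i, j]

print("point-to-point bias:  ", round(float((matched - obs_val).mean()), 3))
print("regional-average bias:", round(float(model.mean() - obs_val.mean()), 3))
```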
Abstract:
The role of the ocean in the cycling of oxygenated volatile organic compounds (OVOCs) remains largely unresolved due to a paucity of datasets. We describe the development of a membrane inlet-proton transfer reaction/mass spectrometry (MI-PTR/MS) method for the efficient analysis of methanol, acetaldehyde and acetone in seawater. Validation of the technique with water standards shows that the optimised responses are linear and reproducible. Limits of detection are 27 nM for methanol, 0.7 nM for acetaldehyde and 0.3 nM for acetone. Acetone and acetaldehyde concentrations generated by MI-PTR/MS are compared to those from a second, independent method based on purge and trap-gas chromatography/flame ionisation detection (P&T-GC/FID) and show excellent agreement. Chromatographic separation of the isomeric species acetone and propanal permits correction of the mass 59 signal generated by the PTR/MS and overcomes a known uncertainty in reporting acetone concentrations via mass spectrometry. A third, bioassay-based technique using radiolabelled acetone further supports the results generated by this method. We present the development and optimisation of the MI-PTR/MS technique as a reliable and convenient tool for analysing seawater samples for these trace gases. We compare this method with other analytical techniques and discuss its potential use in improving the current understanding of the cycling of oceanic OVOCs.
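As background on how such detection limits are commonly derived, the sketch below (with entirely hypothetical calibration numbers, not the paper's data) fits a linear calibration of signal against standard concentration and applies the common 3 × (residual standard deviation) / slope convention.

```python
# Minimal sketch (hypothetical numbers): linear calibration fit and a limit of
# detection estimated as 3 * residual standard deviation / slope.
import numpy as np

rng = np.random.default_rng(6)

conc_nM = np.array([0, 5, 10, 25, 50, 100], dtype=float)           # assumed standards
signal = 40.0 + 2.5 * conc_nM + rng.normal(0, 4.0, conc_nM.size)   # assumed response

slope, intercept = np.polyfit(conc_nM, signal, 1)
residual_sd = np.std(signal - (slope * conc_nM + intercept), ddof=2)
lod = 3.0 * residual_sd / slope

print(f"slope = {slope:.2f} counts/nM, LOD ~ {lod:.1f} nM")
```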
Abstract:
Observations of Earth from space have been made for over 40 years and have contributed to advances in many aspects of climate science. However, attempts to exploit this wealth of data are often hampered by a lack of homogeneity and continuity and by insufficient understanding of the products and their uncertainties. There is, therefore, a need to reassess and reprocess satellite datasets to maximize their usefulness for climate science. The European Space Agency has responded to this need by establishing the Climate Change Initiative (CCI). The CCI will create new climate data records for (currently) 13 essential climate variables (ECVs) and make these open and easily accessible to all. Each ECV project works closely with users to produce time series from the available satellite observations relevant to users' needs. A climate modeling users' group provides a climate system perspective and a forum to bring the data and modeling communities together. This paper presents the CCI program. It outlines its benefits and presents approaches and challenges for each ECV project, covering clouds, aerosols, ozone, greenhouse gases, sea surface temperature, ocean color, sea level, sea ice, land cover, fire, glaciers, soil moisture, and ice sheets. It also discusses how the CCI approach may contribute to defining and shaping future developments in Earth observation for climate science.
Abstract:
Differential phenological responses to climate among species are predicted to disrupt trophic interactions, but datasets to evaluate this are scarce. We compared phenological trends for species from 4 levels of a North Sea food web over 24 yr when sea surface temperature (SST) increased significantly. We found little consistency in phenological trends between adjacent trophic levels, no significant relationships with SST, and no significant pairwise correlations between predator and prey phenologies, suggesting that trophic mismatching is occurring. Finer resolution data on timing of peak energy demand (mid-chick-rearing) for 5 seabird species at a major North Sea colony were compared to modelled daily changes in length of 0-group (young of the year) lesser sandeels Ammodytes marinus. The date at which sandeels reached a given threshold length became significantly later during the study. Although the phenology of all the species except shags also became later, these changes were insufficient to keep pace with sandeel length, and thus mean length (and energy value) of 0-group sandeels at mid-chick-rearing showed net declines. The magnitude of declines in energy value varied among the seabirds, being more marked in species showing no phenological response (shag, 4.80 kJ) and in later breeding species feeding on larger sandeels (kittiwake, 2.46 kJ) where, because the relationship between sandeel length and energy value is non-linear, small reductions in length result in relatively large reductions in energy. However, despite the decline in energy value of 0-group sandeels during chick-rearing, there was no evidence of any adverse effect on breeding success for any of the seabird species. Trophic mismatch appears to be prevalent within the North Sea pelagic food web, suggesting that ecosystem functioning may be disrupted.
Abstract:
Following the publication of our paper (Attrill et al. 2007), we quickly became aware of a couple of errors. We have subsequently been collaborating with Dr. Chris Lynam (Lynam et al. 2004, 2005) to bring together our two datasets, explore the common patterns within our data, and attempt to provide a consensus on how climate is affecting gelatinous plankton in the North Sea. During this reanalysis, two errors within the data were discovered, one involving a transcription error of a column of residuals during de-trended analysis, the other a major data entry error deep in the Continuous Plankton Recorder (CPR) database for sector B2. Here we present a revised version of table 1 from Attrill et al. (2007) to incorporate corrections to these transcription and data entry errors. These corrections alter some of the results in our original data table, mainly by increasing the number and strength of the significant relations we found (e.g., for sector B2 and the whole sea area); all previous main results remain robustly significant. Following discussions with Dr. Lynam, two clarifications of statements made in Attrill et al. (2007) are also required. Page 482, Results, last line of first column: "There were no...robust, consistent relations between jellyfish frequency and any environmental variables for B and D… contrary to the findings of previous shorter time series (Lynam et al. 2005)." The Lynam et al. (2004, 2005) papers presented no data for the D sector and found no link in the B sector, contrary to our revised results. Page 482, Discussion, paragraph 1, last sentence: "… positive association … North of Scotland (Lynam et al. 2005) … does not appear to be maintained." Our paper did not report on any data that covered Lynam et al.'s (2005) North of Scotland area, so the statement is not directly supported, although their positive relation North of Scotland, when considered in conjunction with inflow, may agree with the C2 and B2 results of Attrill et al. (2007).
Abstract:
Agglomerative cluster analyses encompass many techniques, which have been widely used in various fields of science. In biology, and specifically ecology, datasets are generally highly variable and may contain outliers, which makes it more difficult to identify the number of clusters. Here we present a new criterion to determine statistically the optimal level of partition in a classification tree. The robustness of the criterion is tested against perturbed data (outliers) using an observation or variable with randomly generated values. The technique, called the Random Simulation Test (RST), is tested on (1) the well-known Iris dataset [Fisher, R.A., 1936. The use of multiple measurements in taxonomic problems. Ann. Eugenic. 7, 179–188], (2) simulated data with predetermined numbers of clusters following Milligan and Cooper [Milligan, G.W., Cooper, M.C., 1985. An examination of procedures for determining the number of clusters in a data set. Psychometrika 50, 159–179], and finally (3) real copepod community data previously analyzed in Beaugrand et al. [Beaugrand, G., Ibanez, F., Lindley, J.A., Reid, P.C., 2002. Diversity of calanoid copepods in the North Atlantic and adjacent seas: species associations and biogeography. Mar. Ecol. Prog. Ser. 232, 179–195]. The technique is compared to several standard techniques. RST generally performed better than existing algorithms on simulated data and proved to be especially efficient with highly variable datasets.
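The sketch below gives a loose illustration of the underlying idea of testing a partition's robustness against a randomly generated variable; it is not the RST algorithm itself, and the clustering method, stability measure (adjusted Rand index) and candidate cut levels are assumptions. The Iris data are used simply because they are cited above.

```python
# Minimal sketch (illustrative only, not RST): for each candidate number of clusters,
# add a randomly generated variable, re-cluster, and measure partition stability.
import numpy as np
from scipy.cluster.hierarchy import linkage, fcluster
from sklearn.datasets import load_iris
from sklearn.metrics import adjusted_rand_score

rng = np.random.default_rng(7)
X = load_iris().data

def partition(data, k):
    """Cut a Ward hierarchical clustering tree into k clusters."""
    return fcluster(linkage(data, method="ward"), t=k, criterion="maxclust")

for k in range(2, 7):
    base = partition(X, k)
    scores = []
    for _ in range(20):                      # 20 random perturbations
        noise = rng.normal(X.mean(), X.std(), size=(X.shape[0], 1))
        scores.append(adjusted_rand_score(base, partition(np.hstack([X, noise]), k)))
    print(f"k = {k}: mean stability (ARI) = {np.mean(scores):.2f}")
```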