40 resultados para Datasets


Relevância:

10.00% 10.00%

Publicador:

Resumo:

Hepatitis B is a worldwide health problem affecting about 2 billion people and more than 350 million are chronic carriers of the virus. Nine HBV genotypes (A to I) have been described. The geographical distribution of HBV genotypes is not completely understood due to the limited number of samples from some parts of the world. One such example is Colombia, in which few studies have described the HBV genotypes. In this study, we characterized HBV genotypes in 143 HBsAg-positive volunteer blood donors from Colombia. A fragment of 1306 bp partially comprising HBsAg and the DNA polymerase coding regions (S/POL) was amplified and sequenced. Bayesian phylogenetic analyses were conducted using the Markov Chain Monte Carlo (MCMC) approach to obtain the maximum clade credibility (MCC) tree using BEAST v.1.5.3. Of all samples, 68 were positive and 52 were successfully sequenced. Genotype F was the most prevalent in this population (77%) - subgenotypes F3 (75%) and Fib (2%). Genotype G (7.7%) and subgenotype A2 (15.3%) were also found. Genotype G sequence analysis suggests distinct introductions of this genotype in the country. Furthermore, we estimated the time of the most recent common ancestor (TMRCA) for each HBV/F subgenotype and also for Colombian F3 sequences using two different datasets: (i) 77 sequences comprising 1306 bp of S/POL region and (ii) 283 sequences comprising 681 bp of S/POL region. We also used two other previously estimated evolutionary rates: (i) 2.60 x 10(-4) s/s/y and (ii) 1.5 x 10(-5) s/s/y. Here we report the HBV genotypes circulating in Colombia and estimated the TMRCA for the four different subgenotypes of genotype F. (C) 2010 Elsevier B.V. All rights reserved.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Mitochondrial DNA (mtDNA) population data for forensic purposes are still scarce for some populations, which may limit the evaluation of forensic evidence especially when the rarity of a haplotype needs to be determined in a database search. In order to improve the collection of mtDNA lineages from the Iberian and South American subcontinents, we here report the results of a collaborative study involving nine laboratories from the Spanish and Portuguese Speaking Working Group of the International Society for Forensic Genetics (GHEP-ISFG) and EMPOP. The individual laboratories contributed population data that were generated throughout the past 10 years, but in the majority of cases have not been made available to the scientific community. A total of 1019 haplotypes from Iberia (Basque Country, 2 general Spanish populations, 2 North and 1 Central Portugal populations), and Latin America (3 populations from Sao Paulo) were collected, reviewed and harmonized according to defined EMPOP criteria. The majority of data ambiguities that were found during the reviewing process (41 in total) were transcription errors confirming that the documentation process is still the most error-prone stage in reporting mtDNA population data, especially when performed manually. This GHEP-EMPOP collaboration has significantly improved the quality of the individual mtDNA datasets and adds mtDNA population data as valuable resource to the EMPOP database (www.empop.org). (C) 2010 Elsevier Ireland Ltd. All rights reserved.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

In this paper, we propose a method based on association rule-mining to enhance the diagnosis of medical images (mammograms). It combines low-level features automatically extracted from images and high-level knowledge from specialists to search for patterns. Our method analyzes medical images and automatically generates suggestions of diagnoses employing mining of association rules. The suggestions of diagnosis are used to accelerate the image analysis performed by specialists as well as to provide them an alternative to work on. The proposed method uses two new algorithms, PreSAGe and HiCARe. The PreSAGe algorithm combines, in a single step, feature selection and discretization, and reduces the mining complexity. Experiments performed on PreSAGe show that this algorithm is highly suitable to perform feature selection and discretization in medical images. HiCARe is a new associative classifier. The HiCARe algorithm has an important property that makes it unique: it assigns multiple keywords per image to suggest a diagnosis with high values of accuracy. Our method was applied to real datasets, and the results show high sensitivity (up to 95%) and accuracy (up to 92%), allowing us to claim that the use of association rules is a powerful means to assist in the diagnosing task.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Human leukocyte antigen (HLA) haplotypes are frequently evaluated for population history inferences and association studies. However, the available typing techniques for the main HLA loci usually do not allow the determination of the allele phase and the constitution of a haplotype, which may be obtained by a very time-consuming and expensive family-based segregation study. Without the family-based study, computational inference by probabilistic models is necessary to obtain haplotypes. Several authors have used the expectation-maximization (EM) algorithm to determine HLA haplotypes, but high levels of erroneous inferences are expected because of the genetic distance among the main HLA loci and the presence of several recombination hotspots. In order to evaluate the efficiency of computational inference methods, 763 unrelated individuals stratified into three different datasets had their haplotypes manually defined in a family-based study of HLA-A, -B, -DRB1 and -DQB1 segregation, and these haplotypes were compared with the data obtained by the following three methods: the Expectation-Maximization (EM) and Excoffier-Laval-Balding (ELB) algorithms using the arlequin 3.11 software, and the PHASE method. When comparing the methods, we observed that all algorithms showed a poor performance for haplotype reconstruction with distant loci, estimating incorrect haplotypes for 38%-57% of the samples considering all algorithms and datasets. We suggest that computational haplotype inferences involving low-resolution HLA-A, HLA-B, HLA-DRB1 and HLA-DQB1 haplotypes should be considered with caution.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Objectives To evaluate the presence of false flow three-dimensional (3D) power Doppler signals in `flow-free` models. Methods 3D power Doppler datasets were acquired from three different flow-free phantoms (muscle, air and water) with two different transducers and Virtual Organ Computer-aided AnaLysis was used to generate a sphere that was serially applied through the 3D dataset. The vascularization flow index was used to compare artifactual signals at different depths (from 0 to 6 cm) within the different phantoms and at different gain and pulse repetition frequency (PR F) settings. Results Artifactual Doppler signals were seen in all phantoms despite these being flow-free. The pattern was very similar and the degree of artifact appeared to be dependent on the gain and distance from the transducer. False signals were more evident in the far field and increased as the gain was increased, with false signals first appearing with a gain of 1 dB in the air and muscle phantoms. False signals were seen at a lower gain with the water phantom (-15 dB) and these were associated with vertical lines of Doppler artifact that were related to PRF, and disappeared when reflections were attenuated. Conclusions Artifactual Doppler signals are seen in flow-free phantoms and are related to the gain settings and the distance from the transducer. In the in-vivo situation, the lowest gain settings that allow the detection of blood flow and adequate definition of vessel architecture should be used, which invariably means using a setting near or below the middle of the range available. Additionally, observers should be aware of vertical lines when evaluating cystic or liquid-containing structures. Copyright (C) 2010 ISUOC. Published by John Wiley & Sons, Ltd.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Functional MRI (fMRI) data often have low signal-to-noise-ratio (SNR) and are contaminated by strong interference from other physiological sources. A promising tool for extracting signals, even under low SNR conditions, is blind source separation (BSS), or independent component analysis (ICA). BSS is based on the assumption that the detected signals are a mixture of a number of independent source signals that are linearly combined via an unknown mixing matrix. BSS seeks to determine the mixing matrix to recover the source signals based on principles of statistical independence. In most cases, extraction of all sources is unnecessary; instead, a priori information can be applied to extract only the signal of interest. Herein we propose an algorithm based on a variation of ICA, called Dependent Component Analysis (DCA), where the signal of interest is extracted using a time delay obtained from an autocorrelation analysis. We applied such method to inspect functional Magnetic Resonance Imaging (fMRI) data, aiming to find the hemodynamic response that follows neuronal activation from an auditory stimulation, in human subjects. The method localized a significant signal modulation in cortical regions corresponding to the primary auditory cortex. The results obtained by DCA were also compared to those of the General Linear Model (GLM), which is the most widely used method to analyze fMRI datasets.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Tick-borne zoonoses (TBZ) are emerging diseases worldwide. A large amount of information (e.g. case reports, results of epidemiological surveillance, etc.) is dispersed through various reference sources (ISI and non-ISI journals, conference proceedings, technical reports, etc.). An integrated database-derived from the ICTTD-3 project (http://www.icttd.nl)-was developed in order to gather TBZ records in the (sub-)tropics, collected both by the authors and collaborators worldwide. A dedicated website (http://www.tickbornezoonoses.org) was created to promote collaboration and circulate information. Data collected are made freely available to researchers for analysis by spatial methods, integrating mapped ecological factors for predicting TBZ risk. The authors present the assembly process of the TBZ database: the compilation of an updated list of TBZ relevant for (sub-)tropics, the database design and its structure, the method of bibliographic search, the assessment of spatial precision of geo-referenced records. At the time of writing, 725 records extracted from 337 publications related to 59 countries in the (sub-)tropics, have been entered in the database. TBZ distribution maps were also produced. Imported cases have been also accounted for. The most important datasets with geo-referenced records were those on Spotted Fever Group rickettsiosis in Latin-America and Crimean-Congo Haemorrhagic Fever in Africa. The authors stress the need for international collaboration in data collection to update and improve the database. Supervision of data entered remains always necessary. Means to foster collaboration are discussed. The paper is also intended to describe the challenges encountered to assemble spatial data from various sources and to help develop similar data collections.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

A Regional Climate Model (RegCM3) 10-year (1990-1999) simulation over southwestern South Atlantic Ocean (SAO) is evaluated to assess the mean climatology and the simulation errors of turbulent fluxes over the sea. Moreover, the relationship between these fluxes and the rainfall over some cyclogenetic areas is also analyzed. The RegCM3 results are validated using some reanalyses datasets (ERA40, R2, GPCP and WHOI). The summer and winter spatial patterns of latent and sensible heat fluxes simulated by the RegCM3 are in agreement with the reanalyses (WHOI, R2 and ERA40). They show large latent heat fluxes exchange in the subtropical SAO and at higher latitudes in the warm waters of Brazil Current. In particular, the magnitude of RegCM3 latent heat fluxes is similar to the WHOI, which is probably related to two factors: (a) small specific humidity bias, and (b) the RegCM3 flux algorithm. In contrast, the RegCM3 presents large overestimation of sensible heat flux, though it simulates well their spatial pattern. This simulation error is associated with the RegCM3 underestimation of the 2-m air temperature. In southwestern SAO, in three known cyclogenetic areas, the reanalyses and the RegCM3 show the existence of different physical mechanisms that control the annual cycles of latent/sensible heating and rainfall. It is shown that over the eastern coast of Uruguay (35A degrees-43A degrees S) and the southeastern coast of Argentina (44A degrees-52A degrees S) the sea-air moisture and heat exchange play an important role to control the annual cycle of precipitation. This does not happen on the south/southeastern coast of Brazil.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Flickering is a phenomenon related to mass accretion observed among many classes of astrophysical objects. In this paper we present a study of flickering emission lines and the continuum of the cataclysmic variable V3885 Sgr. The flickering behavior was first analyzed through statistical analysis and the power spectra of lightcurves. Autocorrelation techniques were then employed to estimate the flickering timescale of flares. A cross-correlation study between the line and its underlying continuum variability is presented. The cross-correlation between the photometric and spectroscopic data is also discussed. Periodograms, calculated using emission-line data, show a behavior that is similar to those obtained from photometric datasets found in the literature, with a plateau at lower frequencies and a power-law at higher frequencies. The power-law index is consistent with stochastic events. The cross-correlation study indicates the presence of a correlation between the variability on Ha and its underlying continuum. Flickering timescales derived from the photometric data were estimated to be 25 min for two lightcurves and 10 min for one of them. The average timescales of the line flickering is 40 min, while for its underlying continuum it drops to 20 min.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Small local earthquakes from two aftershock sequences in Porto dos GaA(0)chos, Amazon craton-Brazil, were used to estimate the coda wave attenuation in the frequency band of 1 to 24 Hz. The time-domain coda-decay method of a single backscattering model is employed to estimate frequency dependence of the quality factor (Q (c)) of coda waves modeled usingwhere Q (0) is the coda quality factor at frequency of 1 Hz and eta is the frequency parameter. We also used the independent frequency model approach (Morozov, Geophys J Int, 175:239-252, 2008), based in the temporal attenuation coefficient, chi(f) instead of Q(f), given by the equation for the calculation of the geometrical attenuation (gamma) and effective attenuation Q (c) values have been computed at central frequencies (and band) of 1.5 (1-2), 3.0 (2-4), 6.0 (4-8), 9.0 (6-12), 12 (8-16), and 18 (12-24) Hz for five different datasets selected according to the geotectonic environment as well as the ability to sample shallow or deeper structures, particularly the sediments of the Parecis basin and the crystalline basement of the Amazon craton. For the Parecis basin for the surrounding shield and for the whole region of Porto dos GaA(0)chos Using the independent frequency model, we found: for the cratonic zone, gamma = 0.014 s (-aEuro parts per thousand 1), nu a parts per thousand 1.12; for the basin zone with sediments of similar to 500 m, gamma = 0.031 s (-aEuro parts per thousand 1), nu a parts per thousand 1.27; and for the Parecis basin with sediments of similar to 1,000 m, gamma = 0.047 s (-aEuro parts per thousand 1), nu a parts per thousand 1.42. Analysis of the attenuation factor (Q (c)) for different values of the geometrical spreading parameter (nu) indicated that an increase of nu generally causes an increase in Q (c), both in the basin as well as in the craton. But the differences in the attenuation between different geological environments are maintained for different models of geometrical spreading. It was shown that the energy of coda waves is attenuated more strongly in the sediments, (in the deepest part of the basin), than in the basement, (in the craton). Thus, the coda wave analysis can contribute to studies of geological structures in the upper crust, as the average coda quality factor is dependent on the thickness of sedimentary layer.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Changes in species composition is an important process in many ecosystems but rarely considered in systematic reserve site selection. To test the influence of temporal variability in species composition on the establishment of a reserve network, we compared network configurations based on species data of small mammals and frogs sampled during two consecutive years in a fragmented Atlantic Forest landscape (SE Brazil). Site selection with simulated annealing was carried out with the datasets of each single year and after merging the datasets of both years. Site selection resulted in remarkably divergent network configurations. Differences are reflected in both the identity of the selected fragments and in the amount of flexibility and irreplaceability in network configuration. Networks selected when data for both years were merged did not include all sites that were irreplaceable in one of the 2 years. Results of species number estimation revealed that significant changes in the composition of the species community occurred. Hence, temporal variability of community composition should be routinely tested and considered in systematic reserve site selection in dynamic systems.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

In networks of plant-animal mutualisms, different animal groups interact preferentially with different plants, thus forming distinct modules responsible for different parts of the service. However, what we currently know about seed dispersal networks is based only on birds. Therefore, we wished to fill this gap by studying bat-fruit networks and testing how they differ from bird-fruit networks. As dietary overlap of Neotropical bats and birds is low, they should form distinct mutualistic modules within local networks. Furthermore, since frugivory evolved only once among Neotropical bats, but several times independently among Neotropical birds, greater dietary overlap is expected among bats, and thus connectance and nestedness should be higher in bat-fruit networks. If bat-fruit networks have higher nestedness and connectance, they should be more robust to extinctions. We analyzed 1 mixed network of both bats and birds and 20 networks that consisted exclusively of either bats (11) or birds (9). As expected, the structure of the mixed network was both modular (M = 0.45) and nested (NODF = 0.31); one module contained only birds and two only bats. In 20 datasets with only one disperser group, bat-fruit networks (NODF = 0.53 +/- A 0.09, C = 0.30 +/- A 0.11) were more nested and had a higher connectance than bird-fruit networks (NODF = 0.42 +/- A 0.07, C = 0.22 +/- A 0.09). Unexpectedly, robustness to extinction of animal species was higher in bird-fruit networks (R = 0.60 +/- A 0.13) than in bat-fruit networks (R = 0.54 +/- A 0.09), and differences were explained mainly by species richness. These findings suggest that a modular structure also occurs in seed dispersal networks, similar to pollination networks. The higher nestedness and connectance observed in bat-fruit networks compared with bird-fruit networks may be explained by the monophyletic evolution of frugivory in Neotropical bats, among which the diets of specialists seem to have evolved from the pool of fruits consumed by generalists.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Homology-driven proteomics is a major tool to characterize proteomes of organisms with unsequenced genomes. This paper addresses practical aspects of automated homology-driven protein identifications by LC-MS/MS on a hybrid LTQ orbitrap mass spectrometer. All essential software elements supporting the presented pipeline are either hosted at the publicly accessible web server, or are available for free download. (C) 2008 Elsevier B.V. All rights reserved.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Broad-scale phylogenetic analyses of the angiosperms and of the Asteridae have failed to confidently resolve relationships among the major lineages of the campanulid Asteridae (i.e., the euasterid II of APG II, 2003). To address this problem we assembled presently available sequences for a core set of 50 taxa, representing the diversity of the four largest lineages (Apiales, Aquifoliales, Asterales, Dipsacales) as well as the smaller ""unplaced"" groups (e.g., Bruniaceae, Paracryphiaceae, Columelliaceae). We constructed four data matrices for phylogenetic analysis: a chloroplast coding matrix (atpB, matK, ndhF, rbcL), a chloroplast non-coding matrix (rps16 intron, trnT-F region, trnV-atpE IGS), a combined chloroplast dataset (all seven chloroplast regions), and a combined genome matrix (seven chloroplast regions plus 18S and 26S rDNA). Bayesian analyses of these datasets using mixed substitution models produced often well-resolved and supported trees. Consistent with more weakly supported results from previous studies, our analyses support the monophyly of the four major clades and the relationships among them. Most importantly, Asterales are inferred to be sister to a clade containing Apiales and Dipsacales. Paracryphiaceae is consistently placed sister to the Dipsacales. However, the exact relationships of Bruniaceae, Columelliaceae, and an Escallonia clade depended upon the dataset. Areas of poor resolution in combined analyses may be partly explained by conflict between the coding and non-coding data partitions. We discuss the implications of these results for our understanding of campanulid phylogeny and evolution, paying special attention to how our findings bear on character evolution and biogeography in Dipsacales.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

The Plasmodium falciparum var gene family encodes large variant antigens, which are important virulence factors, and also targets of the humoral host response. The frequently observed mild outcomes of falciparum malaria in many places of the Amazon area prompted us to ask whether a globally restricted variant (var) gene repertoire is present in currently circulating and older isolates of this area. By exhaustive analysis of var gene tags from 89 isolates and clones taken during many years from all over the Brazilian Amazon, we estimate that there are probably no more than 350-430 distinct sequence types, less than for any similar sized area studied so far. Detailed analysis of the var tags from genetically distinct clones obtained from single isolates revealed restricted and redundant repertoires suggesting either a low incidence of infective bites or restricted variant gene diversity in inoculated parasites. Additionally, we found a structuring of var gene repertoires observed as a higher pairwise typing sharing in isolates from the same microregion compared to isolates from different regions. Fine analysis of translated var tags revealed that certain Distinct Sequence Identifiers (DSIDs) were differently represented in Brazilian/South American isolates when compared to datasets from other continents. By global alignment of worldwide var DBL alpha sequences and sorting in groups with more than 76% identity, 125 clusters were formed and more than half of all genes were found in nine clusters with 50 or more sequences. While Brazilian/South American sequences were represented only in 64 groups, African sequences were found in the majority of clusters. DSID type 1 related sequences accumulated almost completely in one single cluster, indicating that limited recombination occurs in these specific var gene types. These data demonstrate the so far highest pairwise type sharing values for the var gene family in isolates from all over an entire subcontinent. The apparent lack of specific sequences types suggests that the P. falciparum transmission dynamics in the whole Amazon are probably different from any other endemic region studied and possibly interfere with the parasite`s ability to efficiently diversify its variant gene repertoires. (C) 2010 Elsevier B.V. All rights reserved.