8 resultados para comonomer sequence distribution
em Biblioteca Digital da Produção Intelectual da Universidade de São Paulo
Resumo:
Abstract Background A large number of probabilistic models used in sequence analysis assign non-zero probability values to most input sequences. To decide when a given probability is sufficient the most common way is bayesian binary classification, where the probability of the model characterizing the sequence family of interest is compared to that of an alternative probability model. We can use as alternative model a null model. This is the scoring technique used by sequence analysis tools such as HMMER, SAM and INFERNAL. The most prevalent null models are position-independent residue distributions that include: the uniform distribution, genomic distribution, family-specific distribution and the target sequence distribution. This paper presents a study to evaluate the impact of the choice of a null model in the final result of classifications. In particular, we are interested in minimizing the number of false predictions in a classification. This is a crucial issue to reduce costs of biological validation. Results For all the tests, the target null model presented the lowest number of false positives, when using random sequences as a test. The study was performed in DNA sequences using GC content as the measure of content bias, but the results should be valid also for protein sequences. To broaden the application of the results, the study was performed using randomly generated sequences. Previous studies were performed on aminoacid sequences, using only one probabilistic model (HMM) and on a specific benchmark, and lack more general conclusions about the performance of null models. Finally, a benchmark test with P. falciparum confirmed these results. Conclusions Of the evaluated models the best suited for classification are the uniform model and the target model. However, the use of the uniform model presents a GC bias that can cause more false positives for candidate sequences with extreme compositional bias, a characteristic not described in previous studies. In these cases the target model is more dependable for biological validation due to its higher specificity.
Resumo:
On the basis of thin-section studies of cuttings and a core from two wells in the Amapa Formation of the Foz do Amazonas Basin, five main microfacies have been recognized within three stratigraphic sequences deposited during the Late Paleocene to Early Eocene. The facies are: 1) Ranikothalia grainstone to packstone facies; 2) ooidal grainstone to packstone facies; 3) larger foraminiferal and red algal grainstone to packstone facies; 4) Amphistegina and Helicostegina packstone facies; and 5) green algal and small benthic foraminiferal grainstone to packstone facies, divisible locally into a green algal and the miliolid foraminiferal subfacies and a green algal and small rotaliine foraminiferal subfacies. The lowermost sequence (Si) was deposited in the Late Paleocene-Early Eocene (biozone LF1, equivalent to P3-P6?) and includes rudaceous grainstones and packstones with large specimens of Ranikothalia bermudezi representative of the mid- and inner ramp. The intermediate and uppermost sequences (S2 and S3) display well-developed lowstand deposits formed at the end of the Late Paleocene (upper biozone LF1) and beginning of the Early Eocene (biozone LF2) on the inner ramp (larger foraminiferal and red algal grainstone to packstone facies), in lagoons (green algal and small benthic foraminiferal facies) and as shoals (ooidal facies) or banks (Amphistegina and Helicostegina facies). Depth and oceanic influence were the main controls on the distribution of these microfacies. Stratal stacking patterns evident within these sequences may well have been related to sea level changes postulated for the Late Paleocene and Early Eocene. During this time, the Amapa Formation was dominated by cyclic sedimentation on a gently sloping ramp. Environmental and ecological stress brought about by sea level change at the end of the biozone LF1 led to the extinction of the larger foraminifera (Ranikothalia bermudezi). (c) 2009 Elsevier B.V. All rights reserved.
Resumo:
Hepatitis C virus (HCV) is a public health problem throughout the world and 3% of the world population is infected with this virus. It is estimated that 3-4 millions individuals are being infected every year. It has been estimated that around 1.5% of Brazilian population is anti-HCV positive and the Northeast region showed the highest prevalence in Brazil. The aim of this study was to characterize HCV genotypes circulating in Pernambuco State (PE), Brazil, located in the Northeast region of the country. This study included 85 anti-HCV positive patients followed up between 2004 and 2011. For genotyping, a 380bp fragment of HCV RNA in the NS5B region was amplified by nested PCR. Phylogenetic analysis was conducted using Bayesian Markov chain Monte Carlo simulation (MCMC) using BEAST v.1.5.3. From 85 samples, 63 (74.1%) positive to NS5B fragment were successfully sequenced. Subtype 1b was the most prevalent in this population (42-66.7%), followed by 3a (16-25.4%), 1a (4-6.3%) and 2b (1-1.6%). Twelve (63.1%) and seven (36.9%) patients with HCV and schistosomiasis were infected with subtypes 1b and 3a, respectively. Brazil is a large country with many different population backgrounds; a large variation in the frequencies of HCV genotypes is predictable throughout its territory. This study reports HCV genotypes from Pernambuco State where subtype 1b was found to be the most prevalent. Phylogenetic analysis suggests the presence of the different HCV strains circulating within this population. (C) 2012 Elsevier B.V. All rights reserved.
Resumo:
Cellular membranes have relevant roles in processes related to proteases like human kallikreins and cathepsins. As enzyme and substrate may interact with cell membranes and associated co-factors, it is important to take into account the behavior of peptide substrates in the lipid environment. In this paper we report an study based on energy transfer in two bradykinin derived peptides labeled with the donor-acceptor pair Abz/Eddnp (ortho-aminobenzoic acid/N-[2,4-dinitrophenyl]-ethylenediamine). Time-resolved fluorescence experiments were performed in phosphate buffer and in the presence of large unilamelar vesicles of phospholipids, and of micelles of sodium dodecyl sulphate (SDS). The decay kinetics were analyzed using the program CONTIN to obtain end-to-end distance distribution functions f(r). Despite of the large difference in the number of residues the end-to-end distance of the longer peptide (9 amino acid residues) is only 20 % larger than the values obtained for the shorter peptide (5 amino acid residues). The proline residue, in position 4 of the bradykinin sequence promotes a turn in the longer peptide chain, shortening its end-to-end distance. The surfactant SDS has a strong disorganizing effect, substantially broadening the distance distributions, while temperature increase has mild effects in the flexibility of the chains, causing small increase in the distribution width. The interaction with phospholipid vesicles stabilizes more compact conformations, decreasing end-to-end distances in the peptides. Anisotropy experiments showed that rotational diffusion was not severely affected by the interaction with the vesicles, suggesting a location for the peptides in the surface region of the bilayer, a result consistent with small effect of lipid phase transition on the peptides conformations.
Resumo:
Aims. We studied four young star clusters to characterise their anomalous extinction or variable reddening and asses whether they could be due to contamination by either dense clouds or circumstellar effects. Methods. We evaluated the extinction law (R-V) by adopting two methods: (i) the use of theoretical expressions based on the colour-excess of stars with known spectral type; and (ii) the analysis of two-colour diagrams, where the slope of the observed colour distribution was compared to the normal distribution. An algorithm to reproduce the zero-age main-sequence (ZAMS) reddened colours was developed to derive the average visual extinction (A(V)) that provides the closest fit to the observational data. The structure of the clouds was evaluated by means of a statistical fractal analysis, designed to compare their geometric structure with the spatial distribution of the cluster members. Results. The cluster NGC 6530 is the only object of our sample affected by anomalous extinction. On average, the other clusters suffer normal extinction, but several of their members, mainly in NGC 2264, seem to have high R-V, probably because of circumstellar effects. The ZAMS fitting provides A(V) values that are in good agreement with those found in the literature. The fractal analysis shows that NGC 6530 has a centrally concentrated distribution of stars that differs from the substructures found in the density distribution of the cloud projected in the A(V) map, suggesting that the original cloud was changed by the cluster formation. However, the fractal dimension and statistical parameters of Berkeley 86, NGC 2244, and NGC 2264 indicate that there is a good cloud-cluster correlation, when compared to other works based on an artificial distribution of points.
Resumo:
Abstract Background One of the least common types of alternative splicing is the complete retention of an intron in a mature transcript. Intron retention (IR) is believed to be the result of intron, rather than exon, definition associated with failure of the recognition of weak splice sites flanking short introns. Although studies on individual retained introns have been published, few systematic surveys of large amounts of data have been conducted on the mechanisms that lead to IR. Results TTo understand how sequence features are associated with or control IR, and to produce a generalized model that could reveal previously unknown signals that regulate this type of alternative splicing, we partitioned intron retention events observed in human cDNAs into two groups based on the relative abundance of both isoforms and compared relevant features. We found that a higher frequency of IR in human is associated with individual introns that have weaker splice sites, genes with shorter intron lengths, higher expression levels and lower density of both a set of exon splicing silencers (ESSs) and the intronic splicing enhancer GGG. Both groups of retained introns presented events conserved in mouse, in which the retained introns were also short and presented weaker splice sites. Conclusion Although our results confirmed that weaker splice sites are associated with IR, they showed that this feature alone cannot explain a non-negligible fraction of events. Our analysis suggests that cis-regulatory elements are likely to play a crucial role in regulating IR and also reveals previously unknown features that seem to influence its occurrence. These results highlight the importance of considering the interplay among these features in the regulation of the relative frequency of IR.
Resumo:
Abstract Background Identification of nontuberculous mycobacteria (NTM) based on phenotypic tests is time-consuming, labor-intensive, expensive and often provides erroneous or inconclusive results. In the molecular method referred to as PRA-hsp65, a fragment of the hsp65 gene is amplified by PCR and then analyzed by restriction digest; this rapid approach offers the promise of accurate, cost-effective species identification. The aim of this study was to determine whether species identification of NTM using PRA-hsp65 is sufficiently reliable to serve as the routine methodology in a reference laboratory. Results A total of 434 NTM isolates were obtained from 5019 cultures submitted to the Institute Adolpho Lutz, Sao Paulo Brazil, between January 2000 and January 2001. Species identification was performed for all isolates using conventional phenotypic methods and PRA-hsp65. For isolates for which these methods gave discordant results, definitive species identification was obtained by sequencing a 441 bp fragment of hsp65. Phenotypic evaluation and PRA-hsp65 were concordant for 321 (74%) isolates. These assignments were presumed to be correct. For the remaining 113 discordant isolates, definitive identification was based on sequencing a 441 bp fragment of hsp65. PRA-hsp65 identified 30 isolates with hsp65 alleles representing 13 previously unreported PRA-hsp65 patterns. Overall, species identification by PRA-hsp65 was significantly more accurate than by phenotype methods (392 (90.3%) vs. 338 (77.9%), respectively; p < .0001, Fisher's test). Among the 333 isolates representing the most common pathogenic species, PRA-hsp65 provided an incorrect result for only 1.2%. Conclusion PRA-hsp65 is a rapid and highly reliable method and deserves consideration by any clinical microbiology laboratory charged with performing species identification of NTM.
Resumo:
Abstract Background Xanthomonads are plant-associated bacteria responsible for diseases on economically important crops. Xanthomonas fuscans subsp. fuscans (Xff) is one of the causal agents of common bacterial blight of bean. In this study, the complete genome sequence of strain Xff 4834-R was determined and compared to other Xanthomonas genome sequences. Results Comparative genomics analyses revealed core characteristics shared between Xff 4834-R and other xanthomonads including chemotaxis elements, two-component systems, TonB-dependent transporters, secretion systems (from T1SS to T6SS) and multiple effectors. For instance a repertoire of 29 Type 3 Effectors (T3Es) with two Transcription Activator-Like Effectors was predicted. Mobile elements were associated with major modifications in the genome structure and gene content in comparison to other Xanthomonas genomes. Notably, a deletion of 33 kbp affects flagellum biosynthesis in Xff 4834-R. The presence of a complete flagellar cluster was assessed in a collection of more than 300 strains representing different species and pathovars of Xanthomonas. Five percent of the tested strains presented a deletion in the flagellar cluster and were non-motile. Moreover, half of the Xff strains isolated from the same epidemic than 4834-R was non-motile and this ratio was conserved in the strains colonizing the next bean seed generations. Conclusions This work describes the first genome of a Xanthomonas strain pathogenic on bean and reports the existence of non-motile xanthomonads belonging to different species and pathovars. Isolation of such Xff variants from a natural epidemic may suggest that flagellar motility is not a key function for in planta fitness.