996 results for Base Composition
Abstract:
The isolation of the four Xenopus laevis vitellogenin genes has been completed by the purification from a DNA library of the B2 gene together with its flanking sequences. The overlapping DNA fragments analyzed cover 34 kilobases. The B2 gene which has a length of 17.5 kilobases was characterized by heteroduplex and R-loop mapping in the electron microscope and by in vitro transcription in a HeLa whole-cell extract. Its structural organization is compared with that of the closely related B1 gene. The mRNA-coding sequence of about 6 kilobases is interrupted 34 times in the B1 gene and 33 times in the B2 gene. Sequence homology between the two genes was not only found in exons. In addition, 54% of the intron sequences as well as 63% and 48.5% respectively of the 5' and 3' flanking sequences, show enough homology to form stable duplexes. These findings are compared with earlier results obtained with the two other closely related members of the vitellogenin gene family, the A1 and the A2 genes.
Abstract:
We have analyzed middle repetitive DNA in the albumin and vitellogenin gene families of Xenopus laevis. Mapping specific repetitive DNA sequences derived from introns of the A1 vitellogenin gene reveals that these sequences are scattered within and around the four vitellogenin genes (A1, A2, B1 and B2) and the two albumin genes (74 kd and 68 kd). Three repetitive DNA elements present in the A1 vitellogenin transcriptional unit are also located in introns of the 74 kd albumin gene. This apparently random distribution of middle repetitive DNA in the two gene families suggests that the analyzed sequences are not involved in gene regulation, but rather that they might represent unstable genetic elements. This hypothesis is further supported by the finding that size polymorphism in the A1 vitellogenin gene and in the 74 kd albumin gene is correlated with the presence or absence of repetitive DNA.
Abstract:
Arbuscular mycorrhizal fungi (AMF) are an ecologically important group of fungi. Previous studies showed the presence of divergent copies of beta-tubulin and V-type vacuolar H+-ATPase genes in AMF genomes and suggested horizontal gene transfer from host plants or mycoparasites to AMF. We sequenced these genes from DNA isolated from an in vitro cultured isolate of Glomus intraradices that was free of any obvious contaminants. We found two highly variable beta-tubulin sequences and variable H+-ATPase sequences. Despite this high variation, comparison of the sequences with those in gene banks supported a glomeromycotan origin of G. intraradices beta-tubulin and H+-ATPase sequences. Thus, our results are in sharp contrast with the previously reported polyphyletic origin of those genes. We present evidence that some highly divergent sequences of beta-tubulin and H+-ATPase deposited in the databases are likely to be contaminants. We therefore reject the prediction of horizontal transfer to AMF genomes. Large differences in GC content between glomeromycotan sequences and sequences grouping in other lineages are shown, and we suggest that they can be used as an indicator to detect such contaminants. H+-ATPase phylogeny gave unexpected results and failed to resolve fungi as a natural group. beta-Tubulin phylogeny supported Glomeromycota as sister group of the Chytridiomycota. Contrasts between our results and trees previously generated using rDNA sequences are discussed.
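The idea of using GC content as a contaminant indicator can be illustrated with a minimal Python sketch. The function names, the reference GC value, and the tolerance below are hypothetical and chosen for illustration only; the abstract does not state which thresholds or tools the authors used.

```python
def gc_content(seq: str) -> float:
    """Return the fraction of G and C among unambiguous bases (A, C, G, T)."""
    seq = seq.upper()
    gc = seq.count("G") + seq.count("C")
    at = seq.count("A") + seq.count("T")
    total = gc + at
    return gc / total if total else 0.0


def flag_outliers(seqs: dict[str, str], reference_gc: float,
                  tolerance: float = 0.10) -> list[str]:
    """Return names of sequences whose GC content differs from the
    reference value by more than `tolerance` (an assumed cutoff)."""
    return [name for name, s in seqs.items()
            if abs(gc_content(s) - reference_gc) > tolerance]


if __name__ == "__main__":
    # Toy sequences, not real data.
    candidates = {"seq_a": "ATATATGC", "seq_b": "GGGCCCAT"}
    print(flag_outliers(candidates, reference_gc=0.30))
```

A flagged sequence is only a candidate for closer inspection; a GC deviation on its own does not establish contamination.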
Abstract:
BACKGROUND: Cleavage of messenger RNA (mRNA) precursors is an essential step in mRNA maturation. The signal recognized by the cleavage enzyme complex has been characterized as an A rich region upstream of the cleavage site containing a motif with consensus AAUAAA, followed by a U or UG rich region downstream of the cleavage site. RESULTS: We studied these signals using exhaustive databases of cleavage sites obtained from aligning raw expressed sequence tags (EST) sequences to genomic sequences in Homo sapiens and Drosophila melanogaster. These data show that the polyadenylation signal is highly conserved in human and fly. In addition, de novo motif searches generated a refined description of the U-rich downstream sequence (DSE) element, which shows more divergence between the two species. These refined motifs are applied, within a Hidden Markov Model (HMM) framework, to predict mRNA cleavage sites. CONCLUSION: We demonstrate that the DSE is a specific motif in both human and Drosophila. These findings shed light on the sequence correlates of a highly conserved biological process, and improve in silico prediction of 3' mRNA cleavage and polyadenylation sites.
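As a rough illustration of the upstream signal described in this abstract, the sketch below scans a window upstream of a known cleavage site for the consensus hexamer (AAUAAA, written AATAAA in DNA-derived EST data). It is a plain string search, not the HMM framework the authors apply; the function name and the 40-nt window are assumptions made for this example.

```python
def find_pas(seq: str, cleavage_site: int, motif: str = "AATAAA",
             upstream: int = 40) -> list[int]:
    """Return 0-based start positions of `motif` lying entirely within the
    `upstream` nucleotides that precede `cleavage_site`.

    RNA input is accepted: U is converted to T before searching.
    """
    seq = seq.upper().replace("U", "T")
    start = max(0, cleavage_site - upstream)
    region = seq[start:cleavage_site]
    hits = []
    pos = region.find(motif)
    while pos != -1:
        hits.append(start + pos)
        pos = region.find(motif, pos + 1)  # allow overlapping matches
    return hits


if __name__ == "__main__":
    # Toy sequence: hexamer at position 4, cleavage site at position 20.
    toy = "CCCCAATAAACCCCCCCCCCGTGTGT"
    print(find_pas(toy, cleavage_site=20))
```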
Abstract:
BACKGROUND: The expansion of amino acid repeats is determined by a high mutation rate and can be increased or limited by selection. It has been suggested that recent expansions could be associated with the potential of adaptation to new environments. In this work, we quantify the strength of this association, as well as the contribution of potential confounding factors. RESULTS: Mammalian positively selected genes have accumulated more recent amino acid repeats than other mammalian genes. However, we found little support for an accelerated evolutionary rate as the main driver for the expansion of amino acid repeats. The most significant predictors of amino acid repeats are gene function and GC content. There is no correlation with expression level. CONCLUSIONS: Our analyses show that amino acid repeat expansions are causally independent from protein adaptive evolution in mammalian genomes. Relaxed purifying selection or positive selection do not associate with more or more recent amino acid repeats. Their occurrence is slightly favoured by the sequence context but mainly determined by the molecular function of the gene.
Abstract:
BACKGROUND: The genome of Protochlamydia amoebophila UWE25, a Parachlamydia-related endosymbiont of free-living amoebae, was recently published, providing the opportunity to search for genomic islands (GIs). RESULTS: On the residual cumulative G+C content curve, a G+C-rich 19-kb region was observed. This sequence is part of a 100-kb chromosome region, containing 100 highly co-oriented ORFs, flanked by two 17-bp direct repeats. Two identical gly-tRNA genes in tandem are present at the proximal end of this genetic element. Several mobility genes encoding transposases and bacteriophage-related proteins are located within this chromosome region. Thus, this region largely fulfills the criteria of GIs. The G+C content analysis shows that several modules compose this GI. Surprisingly, one of them encodes all genes essential for F-like conjugative DNA transfer (traF, traG, traH, traN, traU, traW, and trbC), involved in sex pilus retraction and mating pair stabilization, strongly suggesting that, similarly to the other F-like operons, the parachlamydial tra unit is devoted to DNA transfer. A close relatedness of this tra unit to F-like tra operons involved in conjugative transfer is confirmed by phylogenetic analyses performed on concatenated genes and gene order conservation. These analyses and that of gly-tRNA distribution in 140 GIs suggest a proteobacterial origin of the parachlamydial tra unit. CONCLUSIONS: A GI of the UWE25 chromosome encodes a potentially functional F-like DNA conjugative system. This is the first hint of a putative conjugative system in chlamydiae. Conjugation most probably occurs within free-living amoebae, that may contain hundreds of Parachlamydia bacteria tightly packed in vacuoles. Such a conjugative system might be involved in DNA transfer between internalized bacteria. 
Since this system is absent from the sequenced genomes of Chlamydiaceae, we hypothesize that it was acquired after the divergence between Parachlamydiaceae and Chlamydiaceae, when the Parachlamydia-related symbiont was an intracellular bacterium. This suggests that the heterologous DNA was acquired from a phylogenetically distant bacterium sharing an amoebal vacuole. Since Parachlamydiaceae are emerging agents of pneumonia, this GI might be involved in pathogenicity. In the future, conjugative systems might be developed as genetic tools for Chlamydiales.
Abstract:
Background: The ratio of the rates of non-synonymous and synonymous substitution (dN/dS) is commonly used to estimate selection in coding sequences. It is often suggested that, all else being equal, dN/dS should be lower in populations with large effective size (Ne) due to increased efficacy of purifying selection. As Ne is difficult to measure directly, life history traits such as body mass, which is typically negatively associated with population size, have commonly been used as proxies in empirical tests of this hypothesis. However, evidence of whether the expected positive correlation between body mass and dN/dS is consistently observed is conflicting. Results: Employing whole genome sequence data from 48 avian species, we assess the relationship between rates of molecular evolution and life history in birds. We find a negative correlation between dN/dS and body mass, contrary to nearly neutral expectation. This raises the question whether the correlation might be a method artefact. We therefore in turn consider non-stationary base composition, divergence time and saturation as possible explanations, but find no clear patterns. However, in striking contrast to dN/dS, the ratio of radical to conservative amino acid substitutions (Kr/Kc) correlates positively with body mass. Conclusions: Our results in principle accord with the notion that non-synonymous substitutions causing radical amino acid changes are more efficiently removed by selection in large populations, consistent with nearly neutral theory. These findings have implications for the use of dN/dS and suggest that caution is warranted when drawing conclusions about lineage-specific modes of protein evolution using this metric.
Abstract:
BACKGROUND: The increasing number of completely sequenced bacterial genomes allows comparing their architecture and genetic makeup. Such new information highlights the crucial role of lateral genetic exchanges in bacterial evolution and speciation. RESULTS: Here we analyzed the twelve sequenced genomes of Streptococcus pyogenes by a naïve approach that examines the preferential nucleotide usage along the chromosome, namely the usage of G versus C (GC-skew) and T versus A (TA-skew). The cumulative GC-skew plot presented an inverted V-shape composed of two symmetrical linear segments, where the minimum and maximum corresponded to the origin and terminus of DNA replication. In contrast, the cumulative TA-skew presented a V-shape, whose segments were interrupted by several steep-slope regions (SSRs), indicative of a different nucleotide composition bias. Each S. pyogenes genome contained up to nine individual SSRs, encompassing all described strain-specific prophages. In addition, each genome contained a similar unique non-phage SSR, the core of which consisted of 31 highly homologous genes. This core includes the M-protein, other mga-related factors and other virulence genes, totaling ten intrinsic virulence genes. In addition to a high content of virulence-related genes and a peculiar nucleotide bias, this SSR, which is 47 kb long in an M1GAS strain, harbors direct repeats and a tRNA gene, suggesting a mobile element. Moreover, its complete absence in an M-protein-negative group A Streptococcus natural isolate demonstrates that it could be spontaneously lost, but in vitro deletion experiments indicate that its excision occurred at a very low rate. The stability of this SSR, combined with its presence in all sequenced S. pyogenes genomes, suggests that it results from an ancient acquisition. CONCLUSION: Thus, this non-phagic SSR is compatible with a pathogenicity island, acquired before S. pyogenes speciation.
Its potential excision might bear relevance for vaccine development, because vaccines targeting M-protein might select for M-protein-negative variants that still carry other virulence determinants.
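The cumulative skew plots described in this abstract can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions: skew is computed per non-overlapping window as (X − Y)/(X + Y), a common convention that the abstract itself does not spell out, and the function names and default window size are hypothetical.

```python
from itertools import accumulate


def window_skew(seq: str, first: str, second: str, window: int) -> list[float]:
    """Per-window skew (first - second) / (first + second) over
    non-overlapping windows; a trailing partial window is ignored."""
    seq = seq.upper()
    skews = []
    for start in range(0, len(seq) - window + 1, window):
        chunk = seq[start:start + window]
        a, b = chunk.count(first), chunk.count(second)
        skews.append((a - b) / (a + b) if a + b else 0.0)
    return skews


def cumulative_skew(seq: str, first: str = "G", second: str = "C",
                    window: int = 1000) -> list[float]:
    """Running sum of the per-window skew along the sequence."""
    return list(accumulate(window_skew(seq, first, second, window)))


if __name__ == "__main__":
    # Toy sequence: G-rich first half, C-rich second half.
    toy = "GGGG" * 2 + "CCCC" * 2
    print(cumulative_skew(toy, "G", "C", window=4))   # cumulative GC-skew
    print(cumulative_skew("TTAATTTT", "T", "A", window=4))  # cumulative TA-skew
```

On a real chromosome, the extrema of the cumulative GC-skew curve and abrupt slope changes in the cumulative TA-skew curve are the features of interest; the window size trades resolution against noise and would need tuning.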
Abstract:
Positive selection is widely estimated from protein coding sequence alignments by the nonsynonymous-to-synonymous ratio omega. Increasingly elaborate codon models are used in a likelihood framework for this estimation. Although there is widespread concern about the robustness of the estimation of the omega ratio, more efforts are needed to estimate this robustness, especially in the context of complex models. Here, we focused on the branch-site codon model. We investigated its robustness on a large set of simulated data. First, we investigated the impact of sequence divergence. We found evidence of underestimation of the synonymous substitution rate for values as small as 0.5, with a slight increase in false positives for the branch-site test. When dS increases further, underestimation of dS is worse, but false positives decrease. Interestingly, the detection of true positives follows a similar distribution, with a maximum for intermediary values of dS. Thus, high dS is more of a concern for a loss of power (false negatives) than for false positives of the test. Second, we investigated the impact of GC content. We showed that there is no significant difference of false positives between high GC (up to ~80%) and low GC (~30%) genes. Moreover, neither shifts of GC content on a specific branch nor major shifts in GC along the gene sequence generate many false positives. Our results confirm that the branch-site is a very conservative test.
Abstract:
The high-affinity siderophore salicylate is an intermediate in the biosynthetic pathway of pyochelin, another siderophore and chelator of transition metal ions, in Pseudomonas aeruginosa. The 2.5-kb region upstream of the salicylate biosynthetic genes pchBA was sequenced and found to contain two additional, contiguous genes, pchD and pchC, having the same orientation. The deduced amino acid sequence of the 60-kDa PchD protein was similar to those of the EntE protein (2,3-dihydroxybenzoate-AMP ligase) of Escherichia coli and other adenylate-forming enzymes, suggesting that salicylate might be adenylated at the carboxyl group by PchD. The 28-kDa PchC protein showed similarities to thioesterases of prokaryotic and eukaryotic origin and might participate in the release of the product(s) formed from activated salicylate. One potential product, dihydroaeruginoate (Dha), was identified in culture supernatants of iron-limited P. aeruginosa cells. The antifungal antibiotic Dha is thought to arise from the reaction of salicylate with cysteine, followed by cyclization of cysteine. Inactivation of the chromosomal pchD gene by insertion of the transcription and translation stop element omega Sm/Sp abolished the production of Dha and pyochelin, implying that PchD-mediated activation of salicylate may be a common first step in the synthesis of both metabolites. Furthermore, the pchD::omega Sm/Sp mutation had a strong polar effect on the expression of the pchBA genes, i.e., on salicylate synthesis, indicating that the pchDCBA genes constitute a transcriptional unit. A full-length pchDCBA transcript of ca. 4.4 kb could be detected in iron-deprived, growing cells of P. aeruginosa. Transcription of pchD started at tandemly arranged promoters, which overlapped with two Fur boxes (binding sites for the ferric uptake regulator) and the promoter of the divergently transcribed pchR gene encoding an activator of pyochelin biosynthesis. 
This promoter arrangement allows tight iron-mediated repression of the pchDCBA operon.
Abstract:
While there is evidence that the two ubiquitously expressed thyroid hormone (T3) receptors, TRalpha1 and TRbeta1, have distinct functional specificities, the mechanism by which they discriminate potential target genes remains largely unexplained. In this study, we demonstrate that the thyroid hormone response elements (TRE) from the malic enzyme and myelin basic protein genes (METRE and MBPTRE, respectively) are not functionally equivalent. The METRE, which is a direct repeat motif with a 4-base pair gap between the two half-site hexamers, binds thyroid hormone receptor as a heterodimer with 9-cis-retinoic acid receptor (RXR) and mediates a high T3-dependent activation in response to TRalpha1 or TRbeta1 in NIH3T3 cells. In contrast, the MBPTRE, which consists of an inverted palindrome formed by two hexamers spaced by 6 base pairs, confers an efficient transactivation by TRbeta1 but a poor transactivation by TRalpha1. While both receptors form heterodimers with RXR on MBPTRE, the poor transactivation by TRalpha1 correlates also with its ability to bind efficiently as a monomer. This monomer, which is only observed with TRalpha1 bound to MBPTRE, interacts neither with N-CoR nor with SRC-1, explaining its functional inefficacy. However, in Xenopus oocytes, in which RXR proteins are not detectable, the transactivation mediated by TRalpha1 and TRbeta1 is equivalent and independent of an RXR supply, raising the question of the identity of the thyroid hormone receptor partner in these cells. Thus, in mammalian cells, the binding characteristics of TRalpha1 to MBPTRE (i.e. high monomer binding efficiency and low transactivation activity) might explain the particular pattern of T3 responsiveness of MBP gene expression during central nervous system development.
Abstract:
BACKGROUND: Root-colonizing fluorescent pseudomonads are known for their excellent abilities to protect plants against soil-borne fungal pathogens. Some of these bacteria produce an insecticidal toxin (Fit) suggesting that they may exploit insect hosts as a secondary niche. However, the ecological relevance of insect toxicity and the mechanisms driving the evolution of toxin production remain puzzling. RESULTS: Screening a large collection of plant-associated pseudomonads for insecticidal activity and presence of the Fit toxin revealed that Fit is highly indicative of insecticidal activity and predicts that Pseudomonas protegens and P. chlororaphis are exclusive Fit producers. A comparative evolutionary analysis of Fit toxin-producing Pseudomonas including the insect-pathogenic bacteria Photorhabdus and Xenorhabdus, which produce the Fit-related Mcf toxin, showed that fit genes are part of a dynamic genomic region with substantial presence/absence polymorphism and local variation in GC base composition. The patchy distribution and phylogenetic incongruence of fit genes indicate that the Fit cluster evolved via horizontal transfer, followed by functional integration of vertically transmitted genes, generating a unique Pseudomonas-specific insect toxin cluster. CONCLUSIONS: Our findings suggest that multiple independent evolutionary events led to formation of at least three versions of the Mcf/Fit toxin, highlighting the dynamic nature of insect toxin evolution.
Abstract:
AC microsatellites have proved particularly useful as genetic markers. For some purposes, such as in population biology, the inferences drawn depend on the quantitative values of their mutation rates. This, together with intrinsic biological interest, has led to widespread study of microsatellite mutational mechanisms. Now, however, inconsistencies are appearing in the results of marker-based versus non-marker-based studies of mutational mechanisms. The reasons for this have not been investigated, but one possibility, pursued here, is that the differences result from structural differences between markers and genomic microsatellites. Here we report a comparison between the CEPH AC marker microsatellites and the global population of AC microsatellites in the human genome. AC marker microsatellites are longer than the global average. Controlling for length, marker microsatellites contain on average fewer interruptions, and have longer segments, than their genomic counterparts. Related to this, marker microsatellites show a greater tendency to concentrate the majority of their repeats into one segment. These differences plausibly result from scientists selecting markers for their high polymorphism. In addition to the structural differences, there are differences in the base composition of flanking sequences, marker flanking regions being richer in C and G and poorer in A and T. Our results indicate that there are profound differences between marker and genomic microsatellites that almost certainly affect their mutation rates. There is a need for a unified model of mutational mechanisms that accounts for both marker-derived and genomic observations. A suggestion is made as to how this might be done.
Abstract:
Five Gram-negative, motile, aerobic to microaerophilic spirilla were isolated from various depths of the hypersaline, heliothermal and meromictic Ekho Lake (East Antarctica). The strains are oxidase- and catalase-positive, metabolize a variety of sugars and carboxylic acids and have an absolute requirement for sodium ions. The predominant fatty acids of the organisms are C16:1ω7c, C16:0 and C18:1ω7c, with C10:1 3-OH, C10:0 3-OH, C12:0 3-OH, C14:1 3-OH, C14:0 3-OH and C19:1 present in smaller amounts. The main polar lipids are diphosphatidylglycerol, phosphatidylethanolamine, phosphatidylglycerol and phosphatidylmonomethylamine. The DNA base composition of the strains is 54-55 mol% G+C. 16S rRNA gene sequence comparisons show that the isolates are related to the genera Oceanospirillum, Pseudospirillum, Marinospirillum, Halomonas and Chromohalobacter in the gamma-Proteobacteria. Morphological, physiological and genotypic differences from these previously described genera support the description of a novel genus and species, Saccharospirillum impatiens gen. nov., sp. nov. The type strain is EL-105T (= DSM 12546T = CECT 5721T).