122 resultados para coding sequence
Resumo:
Monomer-sequence information in synthetic copolyimides can be recognised by tweezer-type molecules binding to adjacent triplet-sequences on the polymer chains. In the present paper different tweezer-molecules are found to have different sequence-selectivities, as demonstrated in solution by 1H NMR spectroscopy and in the solid state by single crystal X-ray analyses of tweezer-complexes with linear and macrocyclic oligo-imides. This work provides clear-cut confirmation of polyimide chain-folding and adjacent-tweezer-binding. It also reveals a new and entirely unexpected mechanism for sequence-recognition which, by analogy with a related process in biomolecular information processing, may be termed "frameshift-reading". The ability of one particular tweezer-molecule to detect, with exceptionally high sensitivity, long-range sequence-information in chain-folding aromatic copolyimides, is readily explained by this novel process.
Resumo:
The full lengths of three genome segments of Iranian wheat stripe virus (IWSV) were amplified by reverse transcription (RT) followed by polymerase chain reaction (PCR) using a primer complementary to tenuivirus conserved terminal sequences. The segments were sequenced and found to comprise 3469, 2337, and 1831 nt, respectively. The gene organization of these segments is similar to that of other known tenuiviruses, each displaying an ambisense coding strategy. IWSV segments, however, are different from those of other viruses with respect to the number of nucleotides and deduced amino acid sequence for each ORF. Depending on the segment, the first 16-22 nt at the 5' end and the first 16 nt at the 3' end are highly conserved among IWSV and rice hoja blanca virus (RHBV), rice stripe virus (RSV) and maize stripe virus ( MStV). In addition, the first 15-18 nt at the 5' end are complementary to the first 16-18 nt at the 3' end. Phylogenetic analyses showed close similarity and a common ancestor for IWSV, RHBV, and Echinochloa hoja blanca virus (EHBV). These findings confirm the position of IWSV as a distinct species in the genus Tenuivirus.
A hierarchical Bayesian model for predicting the functional consequences of amino-acid polymorphisms
Resumo:
Genetic polymorphisms in deoxyribonucleic acid coding regions may have a phenotypic effect on the carrier, e.g. by influencing susceptibility to disease. Detection of deleterious mutations via association studies is hampered by the large number of candidate sites; therefore methods are needed to narrow down the search to the most promising sites. For this, a possible approach is to use structural and sequence-based information of the encoded protein to predict whether a mutation at a particular site is likely to disrupt the functionality of the protein itself. We propose a hierarchical Bayesian multivariate adaptive regression spline (BMARS) model for supervised learning in this context and assess its predictive performance by using data from mutagenesis experiments on lac repressor and lysozyme proteins. In these experiments, about 12 amino-acid substitutions were performed at each native amino-acid position and the effect on protein functionality was assessed. The training data thus consist of repeated observations at each position, which the hierarchical framework is needed to account for. The model is trained on the lac repressor data and tested on the lysozyme mutations and vice versa. In particular, we show that the hierarchical BMARS model, by allowing for the clustered nature of the data, yields lower out-of-sample misclassification rates compared with both a BMARS and a frequen-tist MARS model, a support vector machine classifier and an optimally pruned classification tree.
Resumo:
The order Fabales, including Leguminosae, Polygalaceae, Quillajaceae and Surianaceae, represents a novel hypothesis emerging from angiosperm molecular phylogenies. Despite good support for the order, molecular studies to date have suggested contradictory, poorly supported interfamilial relationships. Our reappraisal of relationships within Fabales addresses past taxon sampling deficiencies, and employs parsimony and Bayesian approaches using sequences from the plastid regions rbcL (166 spp.) and matK (78 spp.). Five alternative hypotheses for interfamilial relationships within Fabales were recovered. The Shimodaira-Hasegawa test found the likelihood of a resolved topology significantly higher than the one calculated for a polytomy, but did not favour any of the alternative hypotheses of relationship within Fabales. In the light of the morphological evidence available and the comparative behavior of rbcL and matK, the topology recovering Polygalaceae as sister to the rest of the order Fabales with Leguminosae more closely related to Quillajaceae + Surianaceae, is considered the most likely hypothesis of interfamilial relationships of the order. Dating of selected crown clades in the Fabales phylogeny using penalized likelihood suggests rapid radiation of the Leguminosae, Polygalaceae, and (Quillajaceae + Surianaceae) crown clades.
Resumo:
Diversity in the chloroplast genome of 171 accessions representing the Brassica 'C' (n = 9) genome, including domesticated and wild B. oleracea and nine inter-fertile related wild species, was investigated using six chloroplast SSR (microsatellite) markers. The lack of diversity detected among 105 cultivated and wild accessions of B. oleracea contrasted starkly with that found within its wild relatives. The vast majority of B. oleracea accessions shared a single haplotype, whereas as many as six haplotypes were detected in two wild species, B. villosa Biv. and B. cretica Lam.. The SSRs proved to be highly polymorphic across haplotypes, with calculated genetic diversity values (H) of 0.23-0.87. In total, 23 different haplotypes were detected in C genome species, with an additional five haplotypes detected in B. rapa L. (A genome n = 10) and another in B. nigra L. (B genome, n = 8). The low chloroplast diversity of B. oleracea is not suggestive of multiple domestication events. The predominant B. oleracea haplotype was also common in B. incana Ten. and present in low frequencies in B. villosa, B. macrocarpa Guss, B. rupestris Raf. and B. cretica. The chloroplast SSRs reveal a wealth of diversity within wild Brassica species that will facilitate further evolutionary and phylogeographic studies of this important crop genus.
Resumo:
Investigations were conducted during the 2003, 2004 and 2005 growing seasons in northern Greece to evaluate effects of tillage regime (mouldboard plough, chisel plough and rotary tiller), cropping sequence (continuous cotton, cotton-sugar beet rotation and continuous tobacco) and herbicide treatment on weed seedbank dynamics. Amaranthus spp. and Portulaca oleracea were the most abundant species, ranging from 76% to 89% of total weed seeds found in 0-15 and 15-30 cm soil depths during the 3 years. With the mouldboard plough, 48% and 52% of the weed seedbank was found in the 0-15 and 15-30 cm soil horizons, while approximately 60% was concentrated in the upper 15 cm soil horizon for chisel plough and rotary tillage. Mouldboard ploughing significantly buried more Echinochloa crus-galli seeds in the 15-30 cm soil horizon compared with the other tillage regimes. Total seedbank (0-30 cm) of P. oleracea was significantly reduced in cotton-sugar beet rotation compared with cotton and tobacco monocultures, while the opposite occurred for E. crus-galli. Total seed densities of most annual broad-leaved weed species (Amaranthus spp., P. oleracea, Solanum nigrum) and E. crus-galli were lower in herbicide treated than in untreated plots. The results suggest that in light textured soils, conventional tillage with herbicide use gradually reduces seed density of small seeded weed species in the top 15 cm over several years. In contrast, crop rotation with the early established sugar beet favours spring-germinating grass weed species, but also prevents establishment of summer-germinating weed species by the early developing crop canopy.
Resumo:
The full lengths of three genome segments of Iranian wheat stripe virus (IWSV) were amplified by reverse transcription (RT) followed by polymerase chain reaction (PCR) using a primer complementary to tenuivirus conserved terminal sequences. The segments were sequenced and found to comprise 3469, 2337, and 1831 nt, respectively. The gene organization of these segments is similar to that of other known tenuiviruses, each displaying an ambisense coding strategy. IWSV segments, however, are different from those of other viruses with respect to the number of nucleotides and deduced amino acid sequence for each ORF. Depending on the segment, the first 16-22 nt at the 5' end and the first 16 nt at the 3' end are highly conserved among IWSV and rice hoja blanca virus (RHBV), rice stripe virus (RSV) and maize stripe virus ( MStV). In addition, the first 15-18 nt at the 5' end are complementary to the first 16-18 nt at the 3' end. Phylogenetic analyses showed close similarity and a common ancestor for IWSV, RHBV, and Echinochloa hoja blanca virus (EHBV). These findings confirm the position of IWSV as a distinct species in the genus Tenuivirus.
Resumo:
The monophyly of the Peltophorum group, one of nine informal groups recognized by Polhill in the Caesalpinieae, was tested using sequence data from the trnL-F, rbcL, and rps16 regions of the chloroplast genome. Exemplars were included from all 16 genera of the Peltophorum group, and from 15 genera representing seven of the other eight informal groups in the tribe. The data were analyzed separately and in combined analyses using parsimony and Bayesian methods. The analysis method had little effect on the topology of well-supported relationships. The molecular data recovered a generally well-supported phylogeny with many intergeneric relationships resolved. Results show that the Peltophorum group as currently delimited is polyphyletic, but that eight genera plus one undescribed genus form a core Peltophorum group, which is referred to here as the Peltophorum group sensu stricto. These genera are Bussea, Conzattia, Colvillea, Delonix, Heteroflorum (inedit.), Lemuropisum, Parkinsonia, Peltophorum, and Schizolobium. The remaining eight genera of the Peltophorum group s.l. are distributed across the Caesalpinieae. Morphological support for the redelimited Peltophorum group and the other recovered clades was assessed, and no unique synapomorphy was found for the Peltophorum group s.s. A proposal for the reclassification of the Peltophorum group s.l. is presented.
Resumo:
The 5' terminus of picornavirus genomic RNA is covalently linked to the virus-encoded peptide 313 (VTg). Foot-and-mouth disease virus (FMDV) is unique in encoding and using 3 distinct forms of this peptide. These peptides each act as primers for RNA synthesis by the virus-encoded RNA polymerase 3D(pol). To act as the primer for positive-strand RNA synthesis, the 3B peptides have to be uridylylated to form VPgpU(pU). For certain picornaviruses, it has been shown that this reaction is achieved by the 3D(pol) in the presence of the 3CD precursor plus an internal RNA sequence termed a cis-acting replication element (cre). The FMDV ere has been identified previously to be within the 5' untranslated region, whereas all other picornavirus cre structures are within the viral coding region. The requirements for the in vitro uridylylation of each of the FMDV 313 peptides has now been determined, and the role of the FMDV ere (also known as the 3B-uridylylation site, or bus) in this reaction has been analyzed. The poly(A) tail does not act as a significant template for FMDV 3B uridylylation.
Resumo:
The elaC gene of Escherichia coli encodes a binuclear zinc phosphodiesterase (ZiPD). ZiPD homologs from various species act as 3' tRNA processing endoribonucleases, and although the homologous gene in Bacillus subtilis is essential for viability [EMBO J. 22 (2003) 4534], the physiological function of E. coli ZiPD has remained enigmatic. In order to investigate the function of E. coli ZiPD we generated and characterized an E. coli elaC deletion mutant. Surprisingly, the E. coli elaC deletion mutant was viable and had wild-type like growth properties. Micro array-based transcriptional analysis indicated expression of the E. coli elaC gene at basal levels during aerobic growth. The elaC gene deletion had no effect on the expression of genes coding for RNases or amino-acyl tRNA synthetases or any other gene among a total of > 1300 genes probed. 2D-PAGE analysis showed that the elaC mutation, likewise, had no effect on the proteome. These results strengthen doubts about the involvement of E. coli ZiPD in tRNA maturation and suggest functional diversity within the ZiPD/ElaCl protein family. In addition to these unexpected features of the E. coli elaC deletion mutant, a sequence comparison of ZiPD (ElaCl) proteins revealed specific regions for either enterobacterial or mammalian ZiPD (ElaCl) proteins. (C) 2004 Elsevier Inc. All rights reserved.
Resumo:
It has long been suggested that the overall shape of the antigen combining site (ACS) of antibodies is correlated with the nature of the antigen. For example, deep pockets are characteristic of antibodies that bind haptens, grooves indicate peptide binders, while antibodies that bind to proteins have relatively flat combining sites. In. 1996, MacCallum, Martin and Thornton used a fractal shape descriptor and showed a strong correlation of the shape of the binding region with the general nature of the antigen. However, the shape of the ACS is determined primarily by the lengths of the six complementarity-determining regions (CDRs). Here, we make a direct correlation between the lengths of the CDRs and the nature of the antigen. In addition, we show significant differences in the residue composition of the CDRs of antibodies that bind to different antigen classes. As well as helping us to understand the process of antigen recognition, autoimmune disease and cross-reactivity these results are of direct application in the design of antibody phage libraries and modification of affinity. (C) 2003 Elsevier Science Ltd. All rights reserved.
Resumo:
This article presents a statistical method for detecting recombination in DNA sequence alignments, which is based on combining two probabilistic graphical models: (1) a taxon graph (phylogenetic tree) representing the relationship between the taxa, and (2) a site graph (hidden Markov model) representing interactions between different sites in the DNA sequence alignments. We adopt a Bayesian approach and sample the parameters of the model from the posterior distribution with Markov chain Monte Carlo, using a Metropolis-Hastings and Gibbs-within-Gibbs scheme. The proposed method is tested on various synthetic and real-world DNA sequence alignments, and we compare its performance with the established detection methods RECPARS, PLATO, and TOPAL, as well as with two alternative parameter estimation schemes.
Resumo:
Inter-simple sequence repeat (ISSR) analysis and aggressiveness assays were used to investigate genetic variability within a global collection of Fusarium culmorum isolates. A set of four ISSR primers were tested, of which three primers amplified a total of 37 bands out of which 30 (81%) were polymorphic. The intraspecific diversity was high, ranging from four to 28 different ISSR genotypes for F. culmorum depending on the primer. The combined analysis of ISSR data revealed 59 different genotypes clustered into seven distinct clades amongst 75 isolates of F. culmorum examined. All the isolates were assayed to test their aggressiveness on a winter wheat cv. 'Armada'. A significant quantitative variation for aggressiveness was found among the isolates. The ISSR and aggressiveness variation existed on a macro- as well as micro-geographical scale. The data suggested a long-range dispersal of F. culmorum and indicated that this fungus may have been introduced into Canada from Europe. In addition to the high level of intraspecific diversity observed in F. culmorum, the index of multilocus association calculated using ISSR data indicated that reproduction in F. culmorum cannot be exclusively clonal and recombination is likely to occur.
Resumo:
Population subdivision complicates analysis of molecular variation. Even if neutrality is assumed, three evolutionary forces need to be considered: migration, mutation, and drift. Simplification can be achieved by assuming that the process of migration among and drift within subpopulations is occurring fast compared to Mutation and drift in the entire population. This allows a two-step approach in the analysis: (i) analysis of population subdivision and (ii) analysis of molecular variation in the migrant pool. We model population subdivision using an infinite island model, where we allow the migration/drift parameter Theta to vary among populations. Thus, central and peripheral populations can be differentiated. For inference of Theta, we use a coalescence approach, implemented via a Markov chain Monte Carlo (MCMC) integration method that allows estimation of allele frequencies in the migrant pool. The second step of this approach (analysis of molecular variation in the migrant pool) uses the estimated allele frequencies in the migrant pool for the study of molecular variation. We apply this method to a Drosophila ananassae sequence data set. We find little indication of isolation by distance, but large differences in the migration parameter among populations. The population as a whole seems to be expanding. A population from Bogor (Java, Indonesia) shows the highest variation and seems closest to the species center.