957 resultados para Repetitive DNA sequences


Relevância:

100.00% 100.00%

Publicador:

Resumo:

Transcription activator-like effectors (TALEs) are virulence factors, produced by the bacterial plant-pathogen Xanthomonas, that function as gene activators inside plant cells. Although the contribution of individual TALEs to infectivity has been shown, the specific roles of most TALEs, and the overall TALE diversity in Xanthomonas spp. is not known. TALEs possess a highly repetitive DNA-binding domain, which is notoriously difficult to sequence. Here, we describe an improved method for characterizing TALE genes by the use of PacBio sequencing. We present 'AnnoTALE', a suite of applications for the analysis and annotation of TALE genes from Xanthomonas genomes, and for grouping similar TALEs into classes. Based on these classes, we propose a unified nomenclature for Xanthomonas TALEs that reveals similarities pointing to related functionalities. This new classification enables us to compare related TALEs and to identify base substitutions responsible for the evolution of TALE specificities. © 2016, Nature Publishing Group. All rights reserved.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Retrotransposons are a class of transposable elements that represent a major fraction of the repetitive DNA of most eukaryotes. Their abundance stems from their expansive replication strategies. We screened and isolated sequence fragments of long terminal repeat (LTR), gypsy-like reverse transcriptase (rt) and gypsy-like envelope (env) domains, and two partial sequences of non-LTR retrotransposons, long interspersed element (LINE), in the clonally propagated allohexaploid sweet potato (Ipomoea batatas (L.) Lam.) genome. Using dot-blot hybridization, these elements were found to be present in the ~1597 Mb haploid sweet potato genome with copy numbers ranging from ~50 to ~4100 as observed in the partial LTR (IbLtr-1) and LINE (IbLi-1) sequences, respectively. The continuous clonal propagation of sweet potato may have contributed to such a multitude of copies of some of these genomic elements. Interestingly, the isolated gypsy-like env and gypsy-like rt sequence fragments, IbGy-1 (~2100 copies) and IbGy-2 (~540 copies), respectively, were found to be homologous to the Bagy-2 cDNA sequences of barley (Hordeum vulgare L.). Although the isolated partial sequences were found to be homologous to other transcriptionally active elements, future studies are required to determine whether they represent elements that are transcriptionally active under normal and (or) stressful conditions.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Studies continue to report ancient DNA sequences and viable microbial cells that are many millions of years old. In this paper we evaluate some of the most extravagant claims of geologically ancient DNA. We conclude that although exciting, the reports suffer from inadequate experimental setup and insufficient authentication of results. Consequently, it remains doubtful whether amplifiable DNA sequences and viable bacteria can survive over geological timescales. To enhance the credibility of future studies and assist in discarding false-positive results, we propose a rigorous set of authentication criteria for work with geologically ancient DNA.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

The effect of two different DNA minor groove binding molecules, Hoechst 33258 and distamycin A, on the binding kinetics of NF-κB p50 to three different specific DNA sequences was studied at various salt concentrations. Distamycin A was shown to significantly increase the dissociation rate constant of p50 from the sequences PRDII (5′-GGGAAATTCC-3′) and Ig-κ B (5′-GGGACTTTCC-3′) but had a negligible effect on the dissociation from the palindromic target-κB binding site (5′-GGGAATTCCC-3′). By comparison, the effect of Hoechst 33258 on binding of p50 to each sequence was found to be minimal. The dissociation rates for the protein–DNA complexes increased at higher potassium chloride concentrations for the PRDII and Ig-κB binding motifs and this effect was magnified by distamycin A. In contrast, p50 bound to the palindromic target-κB site with a much higher intrinsic affinity and exhibited a significantly reduced salt dependence of binding over the ionic strength range studied, retaining a KD of less than 10 pM at 150 mM KCl. Our results demonstrate that the DNA binding kinetics of p50 and their salt dependence is strongly sequence-dependent and, in addition, that the binding of p50 to DNA can be influenced by the addition of minor groove-binding drugs in a sequence-dependent manner.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Redundant DNA can buffer sequence dependent structural deviations from an ideal double helix. Buffering serves a mechanistic function by reducing extraneous conformational effects which could interfere with readout or which would impose energetic constraints on evolution. It also serves an evolutionary function by allowing for gradual variations in conformation-dependent regulation of gene expression. Such gradualism is critical for the rate of evolution. The buffer structure concept provides a new interpretation for repetitive DNA and for exons and introns.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Certain recent models of sex determination in mammals, Drosophila melanogaster, Caenorhabditis elegans, and snakes are examined in the light of the hypothesis that the relevant genetic regulatory mechanisms are similar and interrelated. The proposed key element in each of these instances is a noncoding DNA sequence, which serves as a high-affinity binding site for a repressor-like molecule regulating the activity of a major "sex-determining" gene. On this basis it is argued that, in several eukaryotes, (i) certain DNA sequences that are sex-determining are noncoding, in the sense that they are not the structural genes of a sex-determining protein; (ii) in some species these noncoding sequences are present in one sex and absent in the other, while in others their copy number or accessibility to regulatory molecules is significantly unequal between the two sexes; and (iii) this inequality determines whether the embryo develops into a male or a female.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

This thesis presents methods for locating and analyzing cis-regulatory DNA elements involved with the regulation of gene expression in multicellular organisms. The regulation of gene expression is carried out by the combined effort of several transcription factor proteins collectively binding the DNA on the cis-regulatory elements. Only sparse knowledge of the 'genetic code' of these elements exists today. An automatic tool for discovery of putative cis-regulatory elements could help their experimental analysis, which would result in a more detailed view of the cis-regulatory element structure and function. We have developed a computational model for the evolutionary conservation of cis-regulatory elements. The elements are modeled as evolutionarily conserved clusters of sequence-specific transcription factor binding sites. We give an efficient dynamic programming algorithm that locates the putative cis-regulatory elements and scores them according to the conservation model. A notable proportion of the high-scoring DNA sequences show transcriptional enhancer activity in transgenic mouse embryos. The conservation model includes four parameters whose optimal values are estimated with simulated annealing. With good parameter values the model discriminates well between the DNA sequences with evolutionarily conserved cis-regulatory elements and the DNA sequences that have evolved neutrally. In further inquiry, the set of highest scoring putative cis-regulatory elements were found to be sensitive to small variations in the parameter values. The statistical significance of the putative cis-regulatory elements is estimated with the Two Component Extreme Value Distribution. The p-values grade the conservation of the cis-regulatory elements above the neutral expectation. The parameter values for the distribution are estimated by simulating the neutral DNA evolution. The conservation of the transcription factor binding sites can be used in the upstream analysis of regulatory interactions. This approach may provide mechanistic insight to the transcription level data from, e.g., microarray experiments. Here we give a method to predict shared transcriptional regulators for a set of co-expressed genes. The EEL (Enhancer Element Locator) software implements the method for locating putative cis-regulatory elements. The software facilitates both interactive use and distributed batch processing. We have used it to analyze the non-coding regions around all human genes with respect to the orthologous regions in various other species including mouse. The data from these genome-wide analyzes is stored in a relational database which is used in the publicly available web services for upstream analysis and visualization of the putative cis-regulatory elements in the human genome.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Extraintestinal pathogenic Escherichia coli (ExPEC) represent a diverse group of strains of E. coli, which infect extraintestinal sites, such as the urinary tract, the bloodstream, the meninges, the peritoneal cavity, and the lungs. Urinary tract infections (UTIs) caused by uropathogenic E. coli (UPEC), the major subgroup of ExPEC, are among the most prevalent microbial diseases world wide and a substantial burden for public health care systems. UTIs are responsible for serious morbidity and mortality in the elderly, in young children, and in immune-compromised and hospitalized patients. ExPEC strains are different, both from genetic and clinical perspectives, from commensal E. coli strains belonging to the normal intestinal flora and from intestinal pathogenic E. coli strains causing diarrhea. ExPEC strains are characterized by a broad range of alternate virulence factors, such as adhesins, toxins, and iron accumulation systems. Unlike diarrheagenic E. coli, whose distinctive virulence determinants evoke characteristic diarrheagenic symptoms and signs, ExPEC strains are exceedingly heterogeneous and are known to possess no specific virulence factors or a set of factors, which are obligatory for the infection of a certain extraintestinal site (e. g. the urinary tract). The ExPEC genomes are highly diverse mosaic structures in permanent flux. These strains have obtained a significant amount of DNA (predictably up to 25% of the genomes) through acquisition of foreign DNA from diverse related or non-related donor species by lateral transfer of mobile genetic elements, including pathogenicity islands (PAIs), plasmids, phages, transposons, and insertion elements. The ability of ExPEC strains to cause disease is mainly derived from this horizontally acquired gene pool; the extragenous DNA facilitates rapid adaptation of the pathogen to changing conditions and hence the extent of the spectrum of sites that can be infected. However, neither the amount of unique DNA in different ExPEC strains (or UPEC strains) nor the mechanisms lying behind the observed genomic mobility are known. Due to this extreme heterogeneity of the UPEC and ExPEC populations in general, the routine surveillance of ExPEC is exceedingly difficult. In this project, we presented a novel virulence gene algorithm (VGA) for the estimation of the extraintestinal virulence potential (VP, pathogenicity risk) of clinically relevant ExPECs and fecal E. coli isolates. The VGA was based on a DNA microarray specific for the ExPEC phenotype (ExPEC pathoarray). This array contained 77 DNA probes homologous with known (e.g. adhesion factors, iron accumulation systems, and toxins) and putative (e.g. genes predictably involved in adhesion, iron uptake, or in metabolic functions) ExPEC virulence determinants. In total, 25 of DNA probes homologous with known virulence factors and 36 of DNA probes representing putative extraintestinal virulence determinants were found at significantly higher frequency in virulent ExPEC isolates than in commensal E. coli strains. We showed that the ExPEC pathoarray and the VGA could be readily used for the differentiation of highly virulent ExPECs both from less virulent ExPEC clones and from commensal E. coli strains as well. Implementing the VGA in a group of unknown ExPECs (n=53) and fecal E. coli isolates (n=37), 83% of strains were correctly identified as extraintestinal virulent or commensal E. coli. Conversely, 15% of clinical ExPECs and 19% of fecal E. coli strains failed to raster into their respective pathogenic and non-pathogenic groups. Clinical data and virulence gene profiles of these strains warranted the estimated VPs; UPEC strains with atypically low risk-ratios were largely isolated from patients with certain medical history, including diabetes mellitus or catheterization, or from elderly patients. In addition, fecal E. coli strains with VPs characteristic for ExPEC were shown to represent the diagnostically important fraction of resident strains of the gut flora with a high potential of causing extraintestinal infections. Interestingly, a large fraction of DNA probes associated with the ExPEC phenotype corresponded to novel DNA sequences without any known function in UTIs and thus represented new genetic markers for the extraintestinal virulence. These DNA probes included unknown DNA sequences originating from the genomic subtractions of four clinical ExPEC isolates as well as from five novel cosmid sequences identified in the UPEC strains HE300 and JS299. The characterized cosmid sequences (pJS332, pJS448, pJS666, pJS700, and pJS706) revealed complex modular DNA structures with known and unknown DNA fragments arranged in a puzzle-like manner and integrated into the common E. coli genomic backbone. Furthermore, cosmid pJS332 of the UPEC strain HE300, which carried a chromosomal virulence gene cluster (iroBCDEN) encoding the salmochelin siderophore system, was shown to be part of a transmissible plasmid of Salmonella enterica. Taken together, the results of this project pointed towards the assumptions that first, (i) homologous recombination, even within coding genes, contributes to the observed mosaicism of ExPEC genomes and secondly, (ii) besides en block transfer of large DNA regions (e.g. chromosomal PAIs) also rearrangements of small DNA modules provide a means of genomic plasticity. The data presented in this project supplemented previous whole genome sequencing projects of E. coli and indicated that each E. coli genome displays a unique assemblage of individual mosaic structures, which enable these strains to successfully colonize and infect different anatomical sites.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Mycobacterium leprae recA harbors an in-frame insertion sequence that encodes an intein homing endonuclease (PI-MleI). Most inteins (intein endonucleases) possess two conserved LAGLIDADG (DOD) motifs at their ctive center. A common feature of LAGLIDADG-type homing endonucleases is that they recognize and cleave the same or very similar DNA sequences. However, PI-MleI is distinctive from other members of the family of LAGLIDADG-type HEases for its modular structure with functionally separable domains for DNA-binding and cleavage, each with distinct sequence preferences. Sequence alignment analyses of PI-MleI revealed three putative LAGLIDADG motifs; however, there is conflicting bioinformatics data in regard to their identity and specific location within the intein polypeptide. To resolve this conflict and to determine the active-site residues essential for DNA target site recognition and double-stranded DNA cleavage, we performed site-directed mutagenesis of presumptive catalytic residues in the LAGLIDADG motifs. Analysis of target DNA recognition and kinetic parameters of the wild-type PI-MleI and its variants disclosed that the two amino acid residues, Asp(122) (in Block C) and Asp(193) (in functional Block E), are crucial to the double-stranded DNA endonuclease activity, whereas Asp(218) (in pseudo-Block E) is not. However, despite the reduced catalytic activity, the PI-MleI variants, like the wild-type PI-MleI, generated a footprint of the same length around the insertion site. The D122T variant showed significantly reduced catalytic activity, and D122A and D193A mutations although failed to affect their DNA-binding affinities, but abolished the double-stranded DNA cleavage activity. On the other hand, D122C variant showed approximately twofold higher double-stranded DNA cleavage activity, compared with the wild-type PI-MleI. These results provide compelling evidence that Asp(122) and Asp(193) in DOD motif I and II, respectively, are bona fide active-site residues essential for DNA cleavage activity. The implications of these results are discussed in this report.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Study of the evolution of species or organisms is essential for various biological applications. Evolution is typically studied at the molecular level by analyzing the mutations of DNA sequences of organisms. Techniques have been developed for building phylogenetic or evolutionary trees for a set of sequences. Though phylogenetic trees capture the overall evolutionary relationships among the sequences, they do not reveal fine-level details of the evolution. In this work, we attempt to resolve various fine-level sequence transformation details associated with a phylogenetic tree using cellular automata. In particular, our work tries to determine the cellular automata rules for neighbor-dependent mutations of segments of DNA sequences. We also determine the number of time steps needed for evolution of a progeny from an ancestor and the unknown segments of the intermediate sequences in the phylogenetic tree. Due to the existence of vast number of cellular automata rules, we have developed a grid system that performs parallel guided explorations of the rules on grid resources. We demonstrate our techniques by conducting experiments on a grid comprising machines in three countries and obtaining potentially useful statistics regarding evolutions in three HIV sequences. In particular, our work is able to verify the phenomenon of neighbor-dependent mutations and find that certain combinations of neighbor-dependent mutations, defined by a cellular automata rule, occur with greater than 90% probability. We also find the average number of time steps for mutations for some branches of phylogenetic tree over a large number of possible transformations with standard deviations less than 2.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Sequence motifs occurring in a particular order in proteins or DNA have been proved to be of biological interest. In this paper, a new method to locate the occurrences of up to five user-defined motifs in a specified order in large proteins and in nucleotide sequence databases is proposed. It has been designed using the concept of quantifiers in regular expressions and linked lists for data storage. The application of this method includes the extraction of relevant consensus regions from biological sequences. This might be useful in clustering of protein families as well as to study the correlation between positions of motifs and their functional sites in DNA sequences.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

DNA sequences containing a stretch of several A:T basepairs without a 5'-TA-3' step are known as A-tracts and have been the subject of extensive investigation because of their unique structural features such as a narrow minor groove and their crucial role in several biological processes. One of the aspects under investigation has been the influence of the 5-methyl group of thymine on the properties of A-tracts. Detailed molecular dynamics simulation studies of the sequences d(CGCAAAUUUGCG) and d(CGCAAATTTGCG) indicate that the presence of the 5-methyl group in thymine increases the frequency of a narrow minor groove conformation, which could facilitate its specific recognition by proteins, and reduce its susceptibility to cleavage by DNase I. The bias toward a wider minor groove in the absence of the thymine 5-methyl group is a static structural feature. Our results also indicate that the presence of the thymine 5-methyl group is necessary for calibrating the backbone conformation and the basepair and dinucleotide step geometry of the core A-tract as well as the flanking CA/TG and the neighboring GC/GC steps, as observed in free and protein-bound DNA. As a consequence, it also fine-tunes the curvature of the longer DNA fragment in which the A-tract is embedded.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

The crystal structure of a hexamer duplex d(CACGTG)(2) has been determined and refined to an R-factor of 18.3% using X-ray data up to 1.2 angstrom resolution. The sequence crystallizes as a left-handed Z-form double helix with Watson-Crick base pairing. There is one hexamer duplex, a spermine molecule, 71 water molecules, and an unexpected diamine (Z-5, 1,3-propanediamine, C3H10N2)) in the asymmetric unit. This is the high-resolution non-disordered structure of a Z-DNA hexamer containing two AT base pairs in the interior of a duplex with no modifications such as bromination or methylation on cytosine bases. This structure does not possess multivalent cations such as cobalt hexaammine that are known to stabilize Z-DNA. The overall duplex structure and its crystal interactions are similar to those of the pure-spermine form of the d(CGCGCG)(2) structure. The spine of hydration in the minor groove is intact except in the vicinity of the T5A8 base pair. The binding of the Z-5 molecule in the minor grove of the d(CACGTG)(2) duplex appears to have a profound effect in conferring stability to a Z-DNA conformation via electrostatic complementarity and hydrogen bonding interactions. The successive base stacking geometry in d(CACGTG)(2) is similar to the corresponding steps in d(CG)(3). These results suggest that specific polyamines such as Z-5 could serve as powerful inducers of Z-type conformation in unmodified DNA sequences with AT base pairs. This structure provides a molecular basis for stabilizing AT base pairs incorporated into an alternating d(CG) sequence.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Taking advantage of the degeneracy of the genetic code we have developed a novel approach to introduce, within a gene, DNA sequences capable of adopting unusual structures and to investigate the role of such sequences in regulation of gene expression in vivo. We used a computer program that generates alternative codon sequences for the same amino-acid sequence to convert a stretch of nucleotides into an inverted-repeat sequence with the potential to adopt cruciform structure. This approach was used to replace a 51-base-pair EcoRI-HindIII segment in the N-terminal region of the beta-galactosidase gene in plasmid pUC19 with a 51-bp synthetic oligonucleotide sequence with the potential to adopt a cruciform structure with 18 bp in the stem region. In selecting the 51-bp sequence, care was taken to include those codons that are preferred in E. coli. E. coli DH5-alpha cells harbouring the plasmid containing the redesigned sequence showed drastic reduction in expression of the beta-galactosidase gene compared to cells harbouring the plasmid with the native sequence. This approach demonstrates the possibility of introducing DNA secondary-structure elements to alter regulation of gene expression in vivo.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

The ability of DNA sequences to adopt unusual structures under the superhelical torsional stress has been studied. Sequences that are forced to adopt unusual conformation in topologically constrained pBR322 form V DNA (Lk=0) were mapped using restriction enzymes as probes. Restriction enzymes such as BamHI, Pstl, Aval and HindIII could not cleave their recognition sequences. The removal of topological constraint relieved this inhibition. The influence of neighbouring sequences on the ability of a given sequence to adopt unusual DNA structure, presumably left handed Z conformation, was studied through single hit analysis. Using multiple cut restriction enzymes such as Narl and Fspl, it could be shown that under identical topological strain, the extent of structural alteration is greatly influenced by the neighbouring sequences. In the light of the variety of sequences and locations that could be mapped to adopt non-6 conformation in pBR322 form V DNA, restriction enzymes appear as potential structural probes for natural DNA sequences.