927 resultados para Mitochondrial-dna Sequences


Relevância:

90.00% 90.00%

Publicador:

Resumo:

Certain recent models of sex determination in mammals, Drosophila melanogaster, Caenorhabditis elegans, and snakes are examined in the light of the hypothesis that the relevant genetic regulatory mechanisms are similar and interrelated. The proposed key element in each of these instances is a noncoding DNA sequence, which serves as a high-affinity binding site for a repressor-like molecule regulating the activity of a major "sex-determining" gene. On this basis it is argued that, in several eukaryotes, (i) certain DNA sequences that are sex-determining are noncoding, in the sense that they are not the structural genes of a sex-determining protein; (ii) in some species these noncoding sequences are present in one sex and absent in the other, while in others their copy number or accessibility to regulatory molecules is significantly unequal between the two sexes; and (iii) this inequality determines whether the embryo develops into a male or a female.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

This thesis presents methods for locating and analyzing cis-regulatory DNA elements involved with the regulation of gene expression in multicellular organisms. The regulation of gene expression is carried out by the combined effort of several transcription factor proteins collectively binding the DNA on the cis-regulatory elements. Only sparse knowledge of the 'genetic code' of these elements exists today. An automatic tool for discovery of putative cis-regulatory elements could help their experimental analysis, which would result in a more detailed view of the cis-regulatory element structure and function. We have developed a computational model for the evolutionary conservation of cis-regulatory elements. The elements are modeled as evolutionarily conserved clusters of sequence-specific transcription factor binding sites. We give an efficient dynamic programming algorithm that locates the putative cis-regulatory elements and scores them according to the conservation model. A notable proportion of the high-scoring DNA sequences show transcriptional enhancer activity in transgenic mouse embryos. The conservation model includes four parameters whose optimal values are estimated with simulated annealing. With good parameter values the model discriminates well between the DNA sequences with evolutionarily conserved cis-regulatory elements and the DNA sequences that have evolved neutrally. In further inquiry, the set of highest scoring putative cis-regulatory elements were found to be sensitive to small variations in the parameter values. The statistical significance of the putative cis-regulatory elements is estimated with the Two Component Extreme Value Distribution. The p-values grade the conservation of the cis-regulatory elements above the neutral expectation. The parameter values for the distribution are estimated by simulating the neutral DNA evolution. The conservation of the transcription factor binding sites can be used in the upstream analysis of regulatory interactions. This approach may provide mechanistic insight to the transcription level data from, e.g., microarray experiments. Here we give a method to predict shared transcriptional regulators for a set of co-expressed genes. The EEL (Enhancer Element Locator) software implements the method for locating putative cis-regulatory elements. The software facilitates both interactive use and distributed batch processing. We have used it to analyze the non-coding regions around all human genes with respect to the orthologous regions in various other species including mouse. The data from these genome-wide analyzes is stored in a relational database which is used in the publicly available web services for upstream analysis and visualization of the putative cis-regulatory elements in the human genome.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Two complete mitochondrial genomes of the black marlin Istiompax indica were assembled from approximately 3.5 and 2.5 million reads produced by Ion Torrent next generation sequencing. The complete genomes were 16,531 bp and 16,532 bp in length consisting of 2 rRNA, 13 protein-coding genes, 22tRNA and 2 coding regions. They demonstrated a similar A + T base (52.6%) to other teleosts. Intraspecific sequence variation was 99.5% for three I. indica mitogenomes and 99.7% for X. gladius. A lower value (85%) was found for the I. platypterus mitogenomes from genbank and accredited to inadvertent inclusion of gene regions from a con-familial species in one record, highlighting the need for cautious downstream use of genbank data. © 2014 Informa UK Ltd.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Genome sequence information has generated increasing evidence for the claim that repetitive DNA sequences present within and around genes could play a important role in the regulation of gene expression. Polypurine/polypyrimidine sequences [poly(Pu/Py)] have been observed in the vicinity of promoters and within the transcribed regions of many genes. To understand whether such sequences influence the level of gene expression, we constructed several prokaryotic and eukaryotic expression vectors incorporating poly(Pu/Py) repeats both within and upstream of a reporter gene, lacZ (encoding β-galactosidase), and studied its expression in vivo. We find that, in contrast to the situation in Escherichia coli, the presence of poly(Pu/Py) sequences within the gene does not significantly inhibit gene expression in mammalian cells. On the other hand, the presence of such sequences upstream of lacZ leads to a several-fold reduction of gene expression in mammalian cells. Similar down-regulation was observed when a structural cassette containing poly(Pu/Py) sequences upstream of lacZ was integrated into yeast chromosome V. Sequence analysis of the nine totally sequenced yeast chromosomes shows that a large number of such sequences occur upstream of ORFs. On the basis of our experimental results and DNA sequence analysis, we propose that these sequences can function as cis-acting transcriptional regulators.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Extraintestinal pathogenic Escherichia coli (ExPEC) represent a diverse group of strains of E. coli, which infect extraintestinal sites, such as the urinary tract, the bloodstream, the meninges, the peritoneal cavity, and the lungs. Urinary tract infections (UTIs) caused by uropathogenic E. coli (UPEC), the major subgroup of ExPEC, are among the most prevalent microbial diseases world wide and a substantial burden for public health care systems. UTIs are responsible for serious morbidity and mortality in the elderly, in young children, and in immune-compromised and hospitalized patients. ExPEC strains are different, both from genetic and clinical perspectives, from commensal E. coli strains belonging to the normal intestinal flora and from intestinal pathogenic E. coli strains causing diarrhea. ExPEC strains are characterized by a broad range of alternate virulence factors, such as adhesins, toxins, and iron accumulation systems. Unlike diarrheagenic E. coli, whose distinctive virulence determinants evoke characteristic diarrheagenic symptoms and signs, ExPEC strains are exceedingly heterogeneous and are known to possess no specific virulence factors or a set of factors, which are obligatory for the infection of a certain extraintestinal site (e. g. the urinary tract). The ExPEC genomes are highly diverse mosaic structures in permanent flux. These strains have obtained a significant amount of DNA (predictably up to 25% of the genomes) through acquisition of foreign DNA from diverse related or non-related donor species by lateral transfer of mobile genetic elements, including pathogenicity islands (PAIs), plasmids, phages, transposons, and insertion elements. The ability of ExPEC strains to cause disease is mainly derived from this horizontally acquired gene pool; the extragenous DNA facilitates rapid adaptation of the pathogen to changing conditions and hence the extent of the spectrum of sites that can be infected. However, neither the amount of unique DNA in different ExPEC strains (or UPEC strains) nor the mechanisms lying behind the observed genomic mobility are known. Due to this extreme heterogeneity of the UPEC and ExPEC populations in general, the routine surveillance of ExPEC is exceedingly difficult. In this project, we presented a novel virulence gene algorithm (VGA) for the estimation of the extraintestinal virulence potential (VP, pathogenicity risk) of clinically relevant ExPECs and fecal E. coli isolates. The VGA was based on a DNA microarray specific for the ExPEC phenotype (ExPEC pathoarray). This array contained 77 DNA probes homologous with known (e.g. adhesion factors, iron accumulation systems, and toxins) and putative (e.g. genes predictably involved in adhesion, iron uptake, or in metabolic functions) ExPEC virulence determinants. In total, 25 of DNA probes homologous with known virulence factors and 36 of DNA probes representing putative extraintestinal virulence determinants were found at significantly higher frequency in virulent ExPEC isolates than in commensal E. coli strains. We showed that the ExPEC pathoarray and the VGA could be readily used for the differentiation of highly virulent ExPECs both from less virulent ExPEC clones and from commensal E. coli strains as well. Implementing the VGA in a group of unknown ExPECs (n=53) and fecal E. coli isolates (n=37), 83% of strains were correctly identified as extraintestinal virulent or commensal E. coli. Conversely, 15% of clinical ExPECs and 19% of fecal E. coli strains failed to raster into their respective pathogenic and non-pathogenic groups. Clinical data and virulence gene profiles of these strains warranted the estimated VPs; UPEC strains with atypically low risk-ratios were largely isolated from patients with certain medical history, including diabetes mellitus or catheterization, or from elderly patients. In addition, fecal E. coli strains with VPs characteristic for ExPEC were shown to represent the diagnostically important fraction of resident strains of the gut flora with a high potential of causing extraintestinal infections. Interestingly, a large fraction of DNA probes associated with the ExPEC phenotype corresponded to novel DNA sequences without any known function in UTIs and thus represented new genetic markers for the extraintestinal virulence. These DNA probes included unknown DNA sequences originating from the genomic subtractions of four clinical ExPEC isolates as well as from five novel cosmid sequences identified in the UPEC strains HE300 and JS299. The characterized cosmid sequences (pJS332, pJS448, pJS666, pJS700, and pJS706) revealed complex modular DNA structures with known and unknown DNA fragments arranged in a puzzle-like manner and integrated into the common E. coli genomic backbone. Furthermore, cosmid pJS332 of the UPEC strain HE300, which carried a chromosomal virulence gene cluster (iroBCDEN) encoding the salmochelin siderophore system, was shown to be part of a transmissible plasmid of Salmonella enterica. Taken together, the results of this project pointed towards the assumptions that first, (i) homologous recombination, even within coding genes, contributes to the observed mosaicism of ExPEC genomes and secondly, (ii) besides en block transfer of large DNA regions (e.g. chromosomal PAIs) also rearrangements of small DNA modules provide a means of genomic plasticity. The data presented in this project supplemented previous whole genome sequencing projects of E. coli and indicated that each E. coli genome displays a unique assemblage of individual mosaic structures, which enable these strains to successfully colonize and infect different anatomical sites.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

The current explosion of DNA sequence information has generated increasing evidence for the claim that noncoding repetitive DNA sequences present within and around different genes could play an important role in genetic control processes, although the precise role and mechanism by which these sequences function are poorly understood. Several of the simple repetitive sequences which occur in a large number of loci throughout the human and other eukaryotic genomes satisfy the sequence criteria for forming non-B DNA structures in vitro. We have summarized some of the features of three different types of simple repeats that highlight the importance of repetitive DNA in the control of gene expression and chromatin organization. (i) (TG/CA)n repeats are widespread and conserved in many loci. These sequences are associated with nucleosomes of varying linker length and may play a role in chromatin organization. These Z-potential sequences can help absorb superhelical stress during transcription and aid in recombination. (ii) Human telomeric repeat (TTAGGG)n adopts a novel quadruplex structure and exhibits unusual chromatin organization. This unusual structural motif could explain chromosome pairing and stability. (iii) Intragenic amplification of (CTG)n/(CAG)n trinucleotide repeat, which is now known to be associated with several genetic disorders, could down-regulate gene expression in vivo. The overall implications of these findings vis-à-vis repetitive sequences in the genome are summarized.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Mycobacterium leprae recA harbors an in-frame insertion sequence that encodes an intein homing endonuclease (PI-MleI). Most inteins (intein endonucleases) possess two conserved LAGLIDADG (DOD) motifs at their ctive center. A common feature of LAGLIDADG-type homing endonucleases is that they recognize and cleave the same or very similar DNA sequences. However, PI-MleI is distinctive from other members of the family of LAGLIDADG-type HEases for its modular structure with functionally separable domains for DNA-binding and cleavage, each with distinct sequence preferences. Sequence alignment analyses of PI-MleI revealed three putative LAGLIDADG motifs; however, there is conflicting bioinformatics data in regard to their identity and specific location within the intein polypeptide. To resolve this conflict and to determine the active-site residues essential for DNA target site recognition and double-stranded DNA cleavage, we performed site-directed mutagenesis of presumptive catalytic residues in the LAGLIDADG motifs. Analysis of target DNA recognition and kinetic parameters of the wild-type PI-MleI and its variants disclosed that the two amino acid residues, Asp(122) (in Block C) and Asp(193) (in functional Block E), are crucial to the double-stranded DNA endonuclease activity, whereas Asp(218) (in pseudo-Block E) is not. However, despite the reduced catalytic activity, the PI-MleI variants, like the wild-type PI-MleI, generated a footprint of the same length around the insertion site. The D122T variant showed significantly reduced catalytic activity, and D122A and D193A mutations although failed to affect their DNA-binding affinities, but abolished the double-stranded DNA cleavage activity. On the other hand, D122C variant showed approximately twofold higher double-stranded DNA cleavage activity, compared with the wild-type PI-MleI. These results provide compelling evidence that Asp(122) and Asp(193) in DOD motif I and II, respectively, are bona fide active-site residues essential for DNA cleavage activity. The implications of these results are discussed in this report.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Childhood-onset mitochondrial diseases comprise a heterogeneous group of disorders, which may manifest with almost any symptom and affect any tissue or organ. Due to challenging diagnostics, most children still lack a specific aetiological diagnosis. The aim of this thesis was to find molecular causes for childhood-onset mitochondrial disorders in Finland. We identified the underlying cause for 25 children, and found three new diseases, which had not been diagnosed in Finland before. These diseases caused severe progressive infantile-onset encephalomyopathies, and were due to defects in mitochondrial DNA (mtDNA) maintenance. Furthermore, the thesis provides the molecular background of Finnish patients with ‘leukoencephalopathy with brain stem and spinal cord involvement and elevated brain lactate’ (LBSL). A new phenotype was identified to be due to mutations in Twinkle, resembling ‘infantile onset spinocerebellar ataxia’ (IOSCA). These mutations caused mtDNA depletion in the liver, thus confirming the essential role of Twinkle in mtDNA maintenance, and expanding the molecular background of mtDNA depletion syndromes. The major aetiology for infantile mitochondrial myopathy in Finland was discovered to be due to mutations in thymidine kinase 2 (TK2). A novel mutation with Finnish ancestry was identified, and a genotype-phenotype correlation with mutation-specific distribution of tissue involvement was found, thus proving that deficient TK2 may cause multi-tissue depletion and impair neuronal function. This work established the molecular diagnosis and advanced the knowledge of phenotypes among paediatric patients with polymerase gamma (POLG) mutations. The patients showed severe early-onset encephalopathy with intractable epilepsy. POLG mutations are not a prevalent cause of children’s ataxias, although ataxia is a major presenting symptom among adults. Our findings indicate that POLG mutations should be investigated even if typical MRI, histochemical or biochemical abnormalities are lacking. LBSL patients showed considerable variation in phenotype despite identical mutations. A common, most likely European, ancestry, and a relative high carrier frequency of these mutations in Finland were discovered; suggesting that LBSL may be a quite common leukoencephalopathy in other populations as well. The results suggest that MRI findings are so unique that the diagnosis of LBSL is possible to make without genetic studies. This thesis work has resulted in identification of new mitochondrial disorders in Finland, enhancing the understanding of the clinical variability and the importance of tissue-specificity of these disorders. In addition to providing specific diagnosis to the patients, these findings give light to the underlying pathogenetic mechanisms of childhood-onset mitochondrial disorders.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Study of the evolution of species or organisms is essential for various biological applications. Evolution is typically studied at the molecular level by analyzing the mutations of DNA sequences of organisms. Techniques have been developed for building phylogenetic or evolutionary trees for a set of sequences. Though phylogenetic trees capture the overall evolutionary relationships among the sequences, they do not reveal fine-level details of the evolution. In this work, we attempt to resolve various fine-level sequence transformation details associated with a phylogenetic tree using cellular automata. In particular, our work tries to determine the cellular automata rules for neighbor-dependent mutations of segments of DNA sequences. We also determine the number of time steps needed for evolution of a progeny from an ancestor and the unknown segments of the intermediate sequences in the phylogenetic tree. Due to the existence of vast number of cellular automata rules, we have developed a grid system that performs parallel guided explorations of the rules on grid resources. We demonstrate our techniques by conducting experiments on a grid comprising machines in three countries and obtaining potentially useful statistics regarding evolutions in three HIV sequences. In particular, our work is able to verify the phenomenon of neighbor-dependent mutations and find that certain combinations of neighbor-dependent mutations, defined by a cellular automata rule, occur with greater than 90% probability. We also find the average number of time steps for mutations for some branches of phylogenetic tree over a large number of possible transformations with standard deviations less than 2.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Sequence motifs occurring in a particular order in proteins or DNA have been proved to be of biological interest. In this paper, a new method to locate the occurrences of up to five user-defined motifs in a specified order in large proteins and in nucleotide sequence databases is proposed. It has been designed using the concept of quantifiers in regular expressions and linked lists for data storage. The application of this method includes the extraction of relevant consensus regions from biological sequences. This might be useful in clustering of protein families as well as to study the correlation between positions of motifs and their functional sites in DNA sequences.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

The complete mitochondrial genome of the tarnished plant bug, Lygus lineolaris, comprised 17,027 bp. The genome contained 13 protein coding regions, 22 tRNA genes and 2 ribosomal RNA genes. The gene arrangement corresponded to the common order found among insect mtDNAs which was considered to be the ancestral arrangement. The protein coding genes started with ATN and stopped with TAA or TAG. The nucleotide distribution was 76.0% A + T. The control region contained two repeat regions, one was 24 bp and the other was 161 bp. The Genbank accession for the complete L. lineolaris mt genome is EU401991.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

DNA sequences containing a stretch of several A:T basepairs without a 5'-TA-3' step are known as A-tracts and have been the subject of extensive investigation because of their unique structural features such as a narrow minor groove and their crucial role in several biological processes. One of the aspects under investigation has been the influence of the 5-methyl group of thymine on the properties of A-tracts. Detailed molecular dynamics simulation studies of the sequences d(CGCAAAUUUGCG) and d(CGCAAATTTGCG) indicate that the presence of the 5-methyl group in thymine increases the frequency of a narrow minor groove conformation, which could facilitate its specific recognition by proteins, and reduce its susceptibility to cleavage by DNase I. The bias toward a wider minor groove in the absence of the thymine 5-methyl group is a static structural feature. Our results also indicate that the presence of the thymine 5-methyl group is necessary for calibrating the backbone conformation and the basepair and dinucleotide step geometry of the core A-tract as well as the flanking CA/TG and the neighboring GC/GC steps, as observed in free and protein-bound DNA. As a consequence, it also fine-tunes the curvature of the longer DNA fragment in which the A-tract is embedded.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

The crystal structure of a hexamer duplex d(CACGTG)(2) has been determined and refined to an R-factor of 18.3% using X-ray data up to 1.2 angstrom resolution. The sequence crystallizes as a left-handed Z-form double helix with Watson-Crick base pairing. There is one hexamer duplex, a spermine molecule, 71 water molecules, and an unexpected diamine (Z-5, 1,3-propanediamine, C3H10N2)) in the asymmetric unit. This is the high-resolution non-disordered structure of a Z-DNA hexamer containing two AT base pairs in the interior of a duplex with no modifications such as bromination or methylation on cytosine bases. This structure does not possess multivalent cations such as cobalt hexaammine that are known to stabilize Z-DNA. The overall duplex structure and its crystal interactions are similar to those of the pure-spermine form of the d(CGCGCG)(2) structure. The spine of hydration in the minor groove is intact except in the vicinity of the T5A8 base pair. The binding of the Z-5 molecule in the minor grove of the d(CACGTG)(2) duplex appears to have a profound effect in conferring stability to a Z-DNA conformation via electrostatic complementarity and hydrogen bonding interactions. The successive base stacking geometry in d(CACGTG)(2) is similar to the corresponding steps in d(CG)(3). These results suggest that specific polyamines such as Z-5 could serve as powerful inducers of Z-type conformation in unmodified DNA sequences with AT base pairs. This structure provides a molecular basis for stabilizing AT base pairs incorporated into an alternating d(CG) sequence.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Taking advantage of the degeneracy of the genetic code we have developed a novel approach to introduce, within a gene, DNA sequences capable of adopting unusual structures and to investigate the role of such sequences in regulation of gene expression in vivo. We used a computer program that generates alternative codon sequences for the same amino-acid sequence to convert a stretch of nucleotides into an inverted-repeat sequence with the potential to adopt cruciform structure. This approach was used to replace a 51-base-pair EcoRI-HindIII segment in the N-terminal region of the beta-galactosidase gene in plasmid pUC19 with a 51-bp synthetic oligonucleotide sequence with the potential to adopt a cruciform structure with 18 bp in the stem region. In selecting the 51-bp sequence, care was taken to include those codons that are preferred in E. coli. E. coli DH5-alpha cells harbouring the plasmid containing the redesigned sequence showed drastic reduction in expression of the beta-galactosidase gene compared to cells harbouring the plasmid with the native sequence. This approach demonstrates the possibility of introducing DNA secondary-structure elements to alter regulation of gene expression in vivo.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

The ability of DNA sequences to adopt unusual structures under the superhelical torsional stress has been studied. Sequences that are forced to adopt unusual conformation in topologically constrained pBR322 form V DNA (Lk=0) were mapped using restriction enzymes as probes. Restriction enzymes such as BamHI, Pstl, Aval and HindIII could not cleave their recognition sequences. The removal of topological constraint relieved this inhibition. The influence of neighbouring sequences on the ability of a given sequence to adopt unusual DNA structure, presumably left handed Z conformation, was studied through single hit analysis. Using multiple cut restriction enzymes such as Narl and Fspl, it could be shown that under identical topological strain, the extent of structural alteration is greatly influenced by the neighbouring sequences. In the light of the variety of sequences and locations that could be mapped to adopt non-6 conformation in pBR322 form V DNA, restriction enzymes appear as potential structural probes for natural DNA sequences.