952 results for Dna-sequence
Abstract:
This paper analyzes DNA information using entropy and phase plane concepts. First, the DNA code is converted into a numerical format by means of histograms that capture DNA sequence lengths ranging from one up to ten bases. This strategy measures dynamical evolutions from 4 up to 4^10 signal states. The resulting histograms are analyzed using three distinct entropy formulations, namely the Shannon, Rényi and Tsallis definitions. Charts of entropy versus sequence length are applied to a set of twenty-four species, characterizing 486 chromosomes. The information is synthesized and visualized by adapting phase plane concepts, leading to a categorical representation of chromosomes and species.
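The histogram and entropy steps described in this abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the function names are hypothetical, and the use of overlapping windows, natural logarithms and the skipping of non-ACGT symbols are assumptions the abstract does not confirm.

```python
import math
from collections import Counter

def kmer_histogram(seq, k):
    """Relative frequencies of overlapping k-mers (assumption: overlapping
    windows; k-mers containing symbols other than A, C, G, T are skipped)."""
    seq = seq.upper()
    counts = Counter()
    for i in range(len(seq) - k + 1):
        kmer = seq[i:i + k]
        if set(kmer) <= set("ACGT"):
            counts[kmer] += 1
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no valid k-mers of length %d in sequence" % k)
    # At most 4**k distinct states can appear for a given k.
    return {kmer: n / total for kmer, n in counts.items()}

def shannon_entropy(p):
    """Shannon entropy S = -sum(p_i * ln p_i)."""
    return -sum(x * math.log(x) for x in p if x > 0)

def renyi_entropy(p, q):
    """Renyi entropy of order q (q != 1): ln(sum(p_i**q)) / (1 - q)."""
    if q == 1:
        raise ValueError("q = 1 is the Shannon limit; use shannon_entropy")
    return math.log(sum(x ** q for x in p if x > 0)) / (1 - q)

def tsallis_entropy(p, q):
    """Tsallis entropy of order q (q != 1): (1 - sum(p_i**q)) / (q - 1)."""
    if q == 1:
        raise ValueError("q = 1 is the Shannon limit; use shannon_entropy")
    return (1 - sum(x ** q for x in p if x > 0)) / (q - 1)

# Entropy versus sequence length for a toy sequence (k = 1..3 here).
toy = "ACGTACGTTTGACCA"
curve = [(k, shannon_entropy(kmer_histogram(toy, k).values())) for k in (1, 2, 3)]
```

Evaluating the three entropies for k from one to ten on a real chromosome would give the kind of entropy-versus-length chart the abstract mentions; the phase plane step is not covered by this sketch.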
Abstract:
We report the sequence of a 9000 bp fragment from the right arm of Saccharomyces cerevisiae chromosome VII. Analysis of the sequence revealed four complete previously unknown open reading frames, which were named G7587, G7589, G7591 and G7594 following standard rules for provisional nomenclature. Outstanding features of some of these proteins were the homology of the putative protein coded by G7589 with proteins involved in transcription regulation and the transmembrane domains predicted in the putative protein coded by G7591.
Abstract:
Our purpose was to compare the genetic polymorphism of six samples of P. brasiliensis (113, 339, BAT, T1F1, T3B6, T5LN1) with four samples of P. cerebriformis (735, 741, 750, 361) from the Mycological Laboratory of the Instituto de Medicina Tropical de São Paulo, using Random Amplified Polymorphic DNA (RAPD) analysis. RAPD profiles clearly segregated P. brasiliensis and P. cerebriformis isolates. However, the variation in band patterns among P. cerebriformis isolates was high. Sequencing of the 28S rDNA gene showed nucleotide conservation among P. cerebriformis isolates, providing a basis for taxonomic grouping, and disclosed high divergence from P. brasiliensis, supporting the view that they are in fact two distinct species. Moreover, the DNA sequence suggests that P. cerebriformis in fact belongs to the genus Aspergillus.
Abstract:
The restriction fragment length polymorphism of the 195 bp repeated DNA sequence of Trypanosoma cruzi was analyzed among 23 T. cruzi stocks giving a reliable picture of the whole phylogenetic variability of the species. The profiles observed with the enzymes Hinf I and Hae III were linked together and supported the existence of two groups. Group 1 shows a 195 bp repeated unit (Hinf I) and high molecular weight DNA (Hae III), while group 2 presents a ladder profile for each enzyme, which is a characteristic of tandemly repeated DNA. The two groups, respectively, clustered stocks pertaining to the two principal lineages evidenced by isoenzyme and RAPD markers. The congruence among these three independent genomic markers corroborates the existence of two real phylogenetic lineages in T. cruzi. The specific monomorphic profiles for each major phylogenetic lineage suggest the existence of ancient sexuality and cryptic biological speciation.
Abstract:
DNA sequence comparison of 412 base-pair fragments of the mitochondrial cytochrome B gene was used to infer the genetic structure of nine geographical Triatoma infestans populations and their phylogenetic relationship with T. melanosoma and T. brasiliensis. T. infestans and T. melanosoma were compared by morphometry, allozyme and cytogenetic analyses, and were also subjected to reciprocal crosses, in order to clarify the taxonomic status of the latter. No differences were found to distinguish the two species, and the crosses between them yielded progeny. T. infestans populations presented four haplotypes that could be separated into two clusters: one formed by the samples from Bolivia (Andes and Chaco) and the other formed by samples from Argentina and Brazil. Silvatic and domestic T. infestans populations from Bolivia (Andes) were genetically identical.
Abstract:
In the last decade microsatellites have become one of the most useful genetic markers in a large number of organisms, owing to their abundance and high level of polymorphism. Microsatellites have been used for individual identification, paternity tests, forensic studies and population genetics. Data on microsatellite abundance come preferentially from microsatellite-enriched libraries and DNA sequence databases. We have conducted a search in GenBank of more than 16,000 Schistosoma mansoni ESTs and 42,000 BAC sequences. In addition, we obtained 300 sequences from CA and AT microsatellite-enriched genomic libraries. The sequences were searched for simple repeats using the RepeatMasker software. Of 16,022 ESTs, we detected 481 (3%) sequences that contained 622 microsatellites (434 perfect, 164 imperfect and 24 compound). Of the 481 ESTs, 194 were grouped in 63 clusters containing 2 to 15 ESTs per cluster. Polymorphisms were observed in 16 clusters. The 287 remaining ESTs were orphan sequences. Of the 42,017 BAC end sequences, 1,598 (3.8%) contained microsatellites (2,335 perfect, 287 imperfect and 79 compound). Of the 1,598 BAC end sequences, 80 were grouped into 17 clusters containing 3 to 17 BAC end sequences per cluster. Microsatellites were present in 67 out of 300 sequences from microsatellite-enriched libraries (55 perfect, 38 imperfect and 15 compound). Of all the observed loci, 55 were selected for having the longest perfect repeats and flanking regions that allowed the design of primers for PCR amplification. Additionally, we describe two new polymorphic microsatellite loci.
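To make the notion of a "perfect" microsatellite concrete, here is a small regular-expression sketch. It is a simplified stand-in and not RepeatMasker, which the authors actually used; the function name and the thresholds (unit length 1 to 6 bases, at least 5 repeats) are assumptions chosen for illustration, and imperfect or compound repeats are not detected.

```python
import re

def find_perfect_microsatellites(seq, min_unit=1, max_unit=6, min_repeats=5):
    """Return (start, unit, repeat_count) for each perfect tandem repeat.

    Hypothetical helper: thresholds are illustrative defaults, not values
    taken from the abstract.
    """
    # Lazy unit match so the shortest repeating unit is reported;
    # \1{n,} demands at least min_repeats - 1 further copies of that unit.
    pattern = re.compile(
        r"([ACGT]{%d,%d}?)\1{%d,}" % (min_unit, max_unit, min_repeats - 1)
    )
    hits = []
    for match in pattern.finditer(seq.upper()):
        unit = match.group(1)
        hits.append((match.start(), unit, len(match.group(0)) // len(unit)))
    return hits

# A (CA)6 repeat flanked by two bases on each side.
hits = find_perfect_microsatellites("TTCACACACACACAGG")
```

The flanking sequence on either side of each hit is what would be inspected when designing PCR primers, as the abstract describes for the 55 selected loci.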
Abstract:
Mycobacterium tuberculosis strains resistant to streptomycin (SM), isoniazid (INH), and/or rifampin (RIF), as determined by the conventional Löwenstein-Jensen proportion method (LJPM), were compared with the E test, a minimum inhibitory concentration susceptibility method. Discrepant isolates were further evaluated by BACTEC and by DNA sequence analyses for mutations in genes most often associated with resistance to these drugs (rpsL, katG, inhA, and rpoB). Preliminary discordant E test results were seen in 75% of isolates resistant to SM and in 11% of those resistant to INH. Discordance improved for these two drugs (63% for SM and none for INH) when isolates were re-tested but worsened for RIF (30%). Despite good agreement between phenotypic results and sequencing analyses, wild-type profiles were detected in resistant strains, mainly for SM and INH. It should be noted that isolates classified as susceptible by molecular methods might contain other mechanisms of resistance. Although reproducibility of the LJPM susceptibility method has been established, variable E test results for some M. tuberculosis isolates raise questions regarding its reproducibility, particularly the impact of E test performance, which may vary among laboratories despite adherence to recommended protocols. Further studies must be done to enlarge the set of evaluated samples and to look for possible mutations outside the sequenced hot-spot gene regions among discrepant strains.
Abstract:
Axial deflection of DNA molecules in solution results from thermal motion and intrinsic curvature related to the DNA sequence. In order to measure directly the contribution of thermal motion, we constructed intrinsically straight DNA molecules and measured their persistence length by cryo-electron microscopy. The persistence length of such intrinsically straight DNA molecules suspended in thin layers of cryo-vitrified solutions is about 80 nm. In order to test our experimental approach, we measured the apparent persistence length of DNA molecules with natural "random" sequences. The result of about 45 nm is consistent with the generally accepted value of the apparent persistence length of natural DNA sequences. By comparing the apparent persistence length of intrinsically straight DNA with that of natural DNA, it is possible to determine both the dynamic and the static contributions to the apparent persistence length.
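One way to see how the two measurements separate the contributions is the reciprocal relation 1/P_apparent = 1/P_dynamic + 1/P_static. The sketch below assumes that relation, which the abstract does not state; the function name is hypothetical, and any value it returns from the rounded figures above is illustrative only and not a result reported by the authors.

```python
def static_persistence_length(apparent_nm, dynamic_nm):
    """Solve 1/P_apparent = 1/P_dynamic + 1/P_static for P_static.

    Assumption: the static (intrinsic curvature) and dynamic (thermal)
    contributions add reciprocally. Inputs and output are in nanometres.
    """
    if not 0 < apparent_nm < dynamic_nm:
        raise ValueError("expected 0 < apparent < dynamic persistence length")
    return 1.0 / (1.0 / apparent_nm - 1.0 / dynamic_nm)

# Rounded values quoted in the abstract: about 45 nm apparent (natural DNA)
# and about 80 nm dynamic (intrinsically straight DNA).
p_static = static_persistence_length(apparent_nm=45.0, dynamic_nm=80.0)
```

Under this assumption the static term is always larger than the apparent persistence length, since the apparent value combines both sources of bending.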
Abstract:
We have determined the sequence of the first 1371 nucleotides at the 5' end of the genome of mouse mammary tumor virus using molecularly cloned proviral DNA of the GR virus strain. The most likely initiation codon used for the gag gene of mouse mammary tumor virus is the first one, located 312 nucleotides from the 5' end of the viral RNA. The 5' splicing site for the subgenomic mRNAs is located approximately 288 nucleotides downstream from the 5' end of the viral RNA. From the DNA sequence, the amino acid sequence of the N-terminal half of the gag precursor protein, including p10 and p21, was deduced (353 amino acids).
Abstract:
Conventional methods of gene prediction rely on the recognition of DNA-sequence signals, the coding potential or the comparison of a genomic sequence with a cDNA, EST, or protein database. Reasons for limited accuracy in many circumstances are species-specific training and the incompleteness of reference databases. Lately, comparative genome analysis has attracted increasing attention. Several analysis tools that are based on human/mouse comparisons are already available. Here, we present a program for the prediction of protein-coding genes, termed SGP-1 (Syntenic Gene Prediction), which is based on the similarity of homologous genomic sequences. In contrast to most existing tools, the accuracy of SGP-1 depends little on species-specific properties such as codon usage or the nucleotide distribution. SGP-1 may therefore be applied to nonstandard model organisms in vertebrates as well as in plants, without the need for extensive parameter training. In addition to predicting genes in large-scale genomic sequences, the program may be useful to validate gene structure annotations from databases. To this end, SGP-1 output also contains comparisons between predicted and annotated gene structures in HTML format. The program can be accessed via a Web server at http://soft.ice.mpg.de/sgp-1. The source code, written in ANSI C, is available on request from the authors.
Abstract:
Partial DNA sequences from two mitochondrial (mt) and one nuclear gene (cytochrome b, 12S rRNA, and C-mos) were used to estimate the phylogenetic relationships among the six extant species of skinks endemic to the Cape Verde Archipelago. The species form a monophyletic unit, indicating a single colonization of the islands, probably from West Africa. Mabuya vaillanti and M. delalandii are sister taxa, as indicated by morphological characters. Mabuya fogoensis and M. stangeri are closely related, but the former is probably paraphyletic. Mabuya spinalis and M. salensis are also probably paraphyletic. Within species, samples from separate islands always form monophyletic groups. Some colonization events can be hypothesized, which are in line with the age of the islands. C-mos variation is concordant with the topology derived from mtDNA.
Abstract:
The ecdysone-responsive DNA sequence of the Drosophila hsp27 gene promoter contains four direct and inverted repeats reminiscent of those that compose the vertebrate palindromic estrogen response element (ERE) and the thyroid hormone/retinoic acid response element (TRE/RRE). Interestingly, a 3 bp substitution in the wild-type hsp27 ecdysone response element (EcdRE) increases both its similarity with the vertebrate ERE and TRE/RRE and its capacity to confer ecdysone responsiveness to a heterologous promoter. Remarkably, increasing the spacing between the inverted repeats of this strong EcdRE by two nucleotides converts it into an ERE. Conversely, decreasing the spacing between the two inverted repeats of the vertebrate consensus palindromic ERE from three nucleotides to one converts it into a functional EcdRE. Thus, the only difference between an invertebrate EcdRE and a vertebrate palindromic ERE or TRE/RRE is in the spacing between the conserved inverted repeated motifs forming these palindromic hormone response elements (HREs). The finding that the sequence motif 5'-GGTCA-3' present in the vertebrate ERE and TRE/RRE is also a functionally important characteristic of an invertebrate HRE suggests that a common ancestral regulatory DNA sequence gave rise to all HREs known so far. We discuss the possibility that this progenitor motif is the GGTCA sequence.
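The role of spacing between inverted GGTCA repeats can be illustrated with a short sketch. The function names and the example sequences are hypothetical; the code only measures the spacing between a half-site and its reverse complement and makes no claim about which elements are functional beyond what the abstract states (a spacing of three for the consensus palindromic ERE, one for an EcdRE).

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq):
    """Reverse complement of an upper-case DNA string."""
    return seq.translate(COMPLEMENT)[::-1]

def palindromic_element(half_site, spacer):
    """Build half_site + spacer + reverse complement of half_site."""
    return half_site + spacer + reverse_complement(half_site)

def inverted_repeat_spacings(seq, half_site="GGTCA", max_spacing=10):
    """Return (start, spacing) for each half_site that is followed, within
    max_spacing bases, by its reverse complement (hypothetical helper)."""
    seq = seq.upper()
    partner = reverse_complement(half_site)  # TGACC for GGTCA
    hits = []
    start = seq.find(half_site)
    while start != -1:
        after = start + len(half_site)
        for spacing in range(max_spacing + 1):
            if seq.startswith(partner, after + spacing):
                hits.append((start, spacing))
                break
        start = seq.find(half_site, start + 1)
    return hits

# Same half-sites, different spacer lengths (spacer bases are arbitrary).
three_spaced = palindromic_element("GGTCA", "CTG")  # spacing of three
one_spaced = palindromic_element("GGTCA", "G")      # spacing of one
```

Calling `inverted_repeat_spacings` on each constructed element recovers the spacer length, which is the single property the abstract identifies as distinguishing the two classes of response element.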
Abstract:
BACKGROUND: DNA sequence polymorphism analysis can provide valuable information on the evolutionary forces shaping nucleotide variation, and provides an insight into the functional significance of genomic regions. The ongoing genome projects will radically improve our capabilities to detect specific genomic regions shaped by natural selection. Currently available methods and software, however, are unsatisfactory for such genome-wide analysis. RESULTS: We have developed methods for the analysis of DNA sequence polymorphisms at the genome-wide scale. These methods, which have been tested on coalescent-simulated and actual data files from mouse and human, have been implemented in the VariScan software package version 2.0. Additionally, we have incorporated a graphical user interface. The main features of this software are: i) exhaustive population-genetic analyses, including those based on coalescent theory; ii) analysis adapted to the shallow data generated by high-throughput genome projects; iii) use of genome annotations to conduct comprehensive analyses separately for different functional regions; iv) identification of relevant genomic regions by the sliding-window and wavelet-multiresolution approaches; v) visualization of the results integrated with current genome annotations in commonly available genome browsers. CONCLUSION: VariScan is a powerful and flexible suite of software for the analysis of DNA polymorphisms. The current version implements new algorithms, methods, and capabilities, providing an important tool for an exhaustive exploratory analysis of genome-wide DNA polymorphism data.
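As a rough illustration of the sliding-window approach mentioned in feature iv), the sketch below computes nucleotide diversity (average pairwise differences per site) in windows along an alignment. It is not VariScan code and does not use its interface: the function names, the choice of statistic, and the simplification that gaps and missing data are treated as ordinary characters are all assumptions.

```python
from itertools import combinations

def nucleotide_diversity(alignment):
    """Average number of pairwise differences per site for a list of
    equal-length aligned sequences (hypothetical helper)."""
    if len(alignment) < 2:
        raise ValueError("need at least two sequences")
    length = len(alignment[0])
    if length == 0 or any(len(s) != length for s in alignment):
        raise ValueError("sequences must be non-empty and of equal length")
    pairs = list(combinations(alignment, 2))
    diffs = sum(sum(a != b for a, b in zip(s1, s2)) for s1, s2 in pairs)
    return diffs / (len(pairs) * length)

def sliding_window_pi(alignment, window, step):
    """Return (window_start, diversity) for each full window."""
    length = len(alignment[0])
    return [
        (start, nucleotide_diversity([s[start:start + window] for s in alignment]))
        for start in range(0, length - window + 1, step)
    ]

# Toy alignment of three sequences; variation is confined to the right half.
toy_alignment = ["AAAA", "AAAT", "AATT"]
profile = sliding_window_pi(toy_alignment, window=2, step=2)
```

Windows whose diversity departs strongly from the rest of the profile are the kind of region such an exploratory scan would flag for closer analysis.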
Abstract:
Background: It has been suggested that chromosomal rearrangements harbor the molecular footprint of the biological phenomena which they induce, in the form, for instance, of changes in the sequence divergence rates of linked genes. So far, all the studies of these potential associations have focused on the relationship between structural changes and the rates of evolution of single-copy DNA and have tried to exclude segmental duplications (SDs). This is paradoxical, since SDs are one of the primary forces driving the evolution of structure and function in our genomes and have been linked not only with novel genes acquiring new functions, but also with overall higher DNA sequence divergence and major chromosomal rearrangements. Results: Here we take the opposite view and focus on SDs. We analyze several of the features of SDs, including the rates of intraspecific divergence between paralogous copies of human SDs and of interspecific divergence between human SDs and chimpanzee DNA. We study how divergence measures relate to chromosomal rearrangements, while considering other factors that affect evolutionary rates in single-copy DNA. Conclusion: We find that interspecific SD divergence behaves similarly to divergence of single-copy DNA. In contrast, old and recent paralogous copies of SDs do present different patterns of intraspecific divergence. Also, we show that some relatively recent SDs accumulate in regions that carry inversions in sister lineages.