980 resultados para nucleotide-sequence
Resumo:
The EMBL Nucleotide Sequence Database (http://www.ebi.ac.uk/embl/) is maintained at the European Bioinformatics Institute (EBI) in an international collaboration with the DNA Data Bank of Japan (DDBJ) and GenBank at the NCBI (USA). Data is exchanged amongst the collaborating databases on a daily basis. The major contributors to the EMBL database are individual authors and genome project groups. Webin is the preferred web-based submission system for individual submitters, whilst automatic procedures allow incorporation of sequence data from large-scale genome sequencing centres and from the European Patent Office (EPO). Database releases are produced quarterly. Network services allow free access to the most up-to-date data collection via ftp, email and World Wide Web interfaces. EBI’s Sequence Retrieval System (SRS), a network browser for databanks in molecular biology, integrates and links the main nucleotide and protein databases plus many specialized databases. For sequence similarity searching a variety of tools (e.g. Blitz, Fasta, BLAST) are available which allow external users to compare their own sequences against the latest data in the EMBL Nucleotide Sequence Database and SWISS-PROT.
Resumo:
Clones encoding pro-phenol oxidase [pro-PO; zymogen of phenol oxidase (monophenol, L-dopa:oxygen oxidoreductase, EC 1.14.18.1)] A1 were isolated from a lambda gt10 library that originated from Drosophila melanogaster strain Oregon-R male adults. The 2294 bp of the cDNA included a 13-bp 5'-noncoding region, a 2070-bp encoding open reading frame of 690 amino acids, and a 211-bp 3'-noncoding region. A hydrophobic NH2-terminal sequence for a signal peptide is absent in the protein. Furthermore, there are six potential N-glycosylation sites in the sequence, but no amino sugar was detected in the purified protein by amino acid analysis, indicating the lack of an N-linked sugar chain. The potential copper-binding sites, amino acids 200-248 and 359-414, are highly homologous to the corresponding sites of hemocyanin of the tarantula Eurypelma californicum, the horseshoe crab Limulus polyphemus, and the spiny lobster Panulirus interruptus. On the basis of the phylogenetic tree constructed by the neighbor-joining method, vertebrate tyrosinases and molluscan hemocyanins constitute one family, whereas pro-POs and arthropod hemocyanins group with another family. It seems, therefore, likely that pro-PO originates from a common ancestor with arthropod hemocyanins, independently to the vertebrate and microbial tyrosinases.
Resumo:
Chromosome I from the yeast Saccharomyces cerevisiae contains a DNA molecule of approximately 231 kbp and is the smallest naturally occurring functional eukaryotic nuclear chromosome so far characterized. The nucleotide sequence of this chromosome has been determined as part of an international collaboration to sequence the entire yeast genome. The chromosome contains 89 open reading frames and 4 tRNA genes. The central 165 kbp of the chromosome resembles other large sequenced regions of the yeast genome in both its high density and distribution of genes. In contrast, the remaining sequences flanking this DNA that comprise the two ends of the chromosome and make up more than 25% of the DNA molecule have a much lower gene density, are largely not transcribed, contain no genes essential for vegetative growth, and contain several apparent pseudogenes and a 15-kbp redundant sequence. These terminally repetitive regions consist of a telomeric repeat called W', flanked by DNA closely related to the yeast FLO1 gene. The low gene density, presence of pseudogenes, and lack of expression are consistent with the idea that these terminal regions represent the yeast equivalent of heterochromatin. The occurrence of such a high proportion of DNA with so little information suggests that its presence gives this chromosome the critical length required for proper function.
Resumo:
Xylella fastidiosa is a fastidious, xylem-limited bacterium that causes a range of economically important plant diseases. Here we report the complete genome sequence of X. fastidiosa clone 9a5c, which causes citrus variegated chlorosis - a serious disease of orange trees. The genome comprises a 52.7% GC-rich 2,679,305-base-pair (bp) circular chromosome and 'two plasmids of 51,158 bp and 1,285 bp. We can assign putative functions to47% of the 2,904 predicted coding regions. Efficient metabolic functions are predicted, with sugars as the principal energy and carbon source, supporting existence in the nutrient-poor xylem sap. The mechanisms associated with pathogenicity and virulence involve toxins, antibiotics and ion sequestration systems, as well as bacterium-bacterium and bacterium-host interactions mediated by a range of proteins. Orthologues of some of these proteins have only been identified in animal and human pathogens; their presence in X. fastidiosa indicates that the molecular basis for bacterial pathogenicity is both conserved and independent of host. At least 83 genes are bacteriophage-derived and include virulence-associated genes from other bacteria, providing direct evidence of phage-mediated horizontal gene transfer.
Resumo:
Intergenic spacers of chloroplast DNA (cpDNA) are very useful in phylogenetic and population genetic studies of plant species, to study their potential integration in phylogenetic analysis. The non-coding trnE-trnT intergenic spacer of cpDNA was analyzed to assess the nucleotide sequence polymorphism of 16 Solanaceae species and to estimate its ability to contribute to the resolution of phylogenetic studies of this group. Multiple alignments of DNA sequences of trnE-trnT intergenic spacer made the identification of nucleotide variability in this region possible and the phylogeny was estimated by maximum parsimony and rooted with Convolvulaceae Ipomoea batalas, the most closely related family. Besides, this intergenic spacer was tested for the phylogenetic ability to differentiate taxonomic levels. For this purpose, species from four other families were analyzed and compared with Solanaceae species. Results confirmed polymorphism in the trnE-trnT region at different taxonomic levels.
Resumo:
In Brazil, human T-lymphotropic virus type 2 (HTLV-2) is endemic in Amerindians and epidemic in intravenous drug users (IDUs). The long terminal repeat (LTR) is the most divergent genomic region of HTLV-2, therefore useful to characterize subtypes. Nucleotide sequence and restriction fragment length polymorphism (RFLP) analysis of LTR genomic segments of fourteen HTLV-2 strains isolated from HIV-infected patients of Londrina, Southern Brazil, were carried out. Molecular analysis disclosed that all HTLV-2 strains belonged to 2a subtype, and RFLP detected the presence of the a4, a5, and a6 subgroups according to Switzer's nomenclature. RFLP correlated with nucleotide sequence, and phylogenetic analysis clustered HTLV-2 sequences of IDUs into subgroups a5 and a6. HTLV-2 sequences from individuals of sexual risk factor clustered into the a4 subgroup. These results extend the knowledge of the genetic diversity of HTLV-2 circulating in Brazil and provide insights into HTLV-2 transmission and virus movement in this geographic area.
Resumo:
The complete genome sequences of two Brazilian wild-type rabies viruses (RABV), a BR-DR1 isolate from a haematophagous bat (Desmodus rotundus) and a BR-AL1 isolate from a frugivorous bat (Artibeus lituratus), were determined. The genomes of the BR-DR1 and RR-AL1 had 11,923 and 11,922 nt, respectively, and both encoded the five standard genes of rhabdoviruses. The complete nucleotide sequence identity between the BR-DR1 and BR-AL1 isolates was 97%. The BR-DR1 and BR-AL1 isolates had some conserved functional sites revealed by the fixed isolates, whereas both isolates had unique amino acid substitutions in the antigenic region IV of the nucleocapsid gene. Therefore, it is speculated that both isolates were nearly identical in virologic character. According to our phylogenetic analysis based on the complete genomes, both isolates belonged to genotype 1, and to the previously defined ""vampire bat-related RABV lineage"" which consisted of mainly D. rotundus- and A. lituratus- isolates; however, a branch pattern with high bootstrap values suggested that BR-DR1 was more closely related to the 9001FRA isolate, which was collected from a dog bitten by a bat in French Guiana, than to BR-AL1. This result suggests that the vampire bat-related RABV lineage includes Brazilian vampire bat and Brazilian frugivorous bat RABV and is further divided into Brazilian vampire bat and Brazilian frugivorous bat RABV sub-lineages. The phylogenetic analysis based on the complete genomes was valuable in discriminating among very closely related isolates.
Resumo:
Genetic recombination can produce heterogeneous phylogenetic histories within a set of homologous genes. Delineating recombination events is important in the study of molecular evolution, as inference of such events provides a clearer picture of the phylogenetic relationships among different gene sequences or genomes. Nevertheless, detecting recombination events can be a daunting task, as the performance of different recombination-detecting approaches can vary, depending on evolutionary events that take place after recombination. We recently evaluated the effects of post-recombination events on the prediction accuracy of recombination-detecting approaches using simulated nucleotide sequence data. The main conclusion, supported by other studies, is that one should not depend on a single method when searching for recombination events. In this paper, we introduce a two-phase strategy, applying three statistical measures to detect the occurrence of recombination events, and a Bayesian phylogenetic approach in delineating breakpoints of such events in nucleotide sequences. We evaluate the performance of these approaches using simulated data, and demonstrate the applicability of this strategy to empirical data. The two-phase strategy proves to be time-efficient when applied to large datasets, and yields high-confidence results.
Resumo:
Genetic recombination can produce heterogeneous phylogenetic histories within a set of homologous genes. Delineating recombination events is important in the study of molecular evolution, as inference of such events provides a clearer picture of the phylogenetic relationships among different gene sequences or genomes. Nevertheless, detecting recombination events can be a daunting task, as the performance of different recombination-detecting approaches can vary, depending on evolutionary events that take place after recombination. We previously evaluated the effects of post-recombination events on the prediction accuracy of recombination-detecting approaches using simulated nucleotide sequence data. The main conclusion, supported by other studies, is that one should not depend on a single method when searching for recombination events. In this paper, we introduce a two-phase strategy, applying three statistical measures to detect the occurrence of recombination events, and a Bayesian phylogenetic approach to delineate breakpoints of such events in nucleotide sequences. We evaluate the performance of these approaches using simulated data, and demonstrate the applicability of this strategy to empirical data. The two-phase strategy proves to be time-efficient when applied to large datasets, and yields high-confidence results.
Resumo:
Bellerophon is a program for detecting chimeric sequences in multiple sequence datasets by an adaption of partial treeing analysis. Bellerophon was specifically developed to detect 16S rRNA gene chimeras in PCR-clone libraries of environmental samples but can be applied to other nucleotide sequence alignments.
Resumo:
Liver samples from rabbits killed by RHDV, collected from five States in Australia in 1996 and 1997 were analysed by RT-PCR. A 398 bp fragment of the capsid protein (VP60) gene was amplified by PCR and directly sequenced. The alignment of the nucleotide and amino acid sequences and their comparison with the original strain of the virus released in Australia indicated genetic changes after two years have been small with 98.2% to 100% identity. The constructed phylogenetic tree suggests slight differences in nucleotide substitutions in various States but there is no clear evidence of clustering of sequences according to their geographic origin. In practical terms, sequencing of viral RNA provides a means of testing the efficacy of further releases and subsequent spread of the virus if such a strategy is employed as a means of enhancing RHD as a biological control of the wild rabbit in Australia.
Resumo:
There have been no reports of DNA sequences of hepatitis B virus (HBV) strains from Australian Aborigines, although the hepatitis B surface antigen (HBsAg) was discovered among them. To investigate the characteristics of DNA sequences of HBV strains from Australian Aborigines, the complete nucleotide sequences of HBV strains were determined and subjected to molecular evolutionary analysis. Serum samples positive for HBsAg were collected from five Australian Aborigines. Phylogenetic analysis of the five complete nucleotide sequences compared with DNA sequences of 54 global HBV isolates from international databases revealed that three of the five were classified into genotype D and were most closely related in terms of evolutionary distance to a strain isolated from a healthy blood donor in Papua New Guinea. Two of the five were classified into a novel variant genotype C, which has not been reported previously, and were closely related to a strain isolated from Polynesians, particularly in the X and Core genes. These two strains of variant genotype C differed from known genotype C strains by 5.9-7.4% over the complete nucleotide sequence and 4.0-5.6 % in the small-S gene, and had residues Arg(122), Thr(127) and Lys(160) characteristic of serotype ayw3, which have not been reported previously in genotype C. In conclusion, this is the first report of the characteristics of complete nucleotide sequences of HBV from Australian Aborigines. These results contribute to the investigation of the worldwide spread of HBV, the relationship between serotype and genotype and the ancient common origin of Australian Aborigines.
Resumo:
The complete nucleotide sequence of the mitochondrial (mt) DNA molecule of the liverfluke, Fasciola hepatica (phylum Platyhelminthes, class Trematoda, family Fasciolidae), was determined, It comprises 14462 bp, contains 12 protein-encoding, 2 ribosomal and 22 transfer RNA genes, and is the second complete flatworm (and the first trematode) mitochondrial sequence to be described in detail. All of the genes are transcribed from the same strand. Of the genes typically found in mitochondrial genomes of eumetazoans, only atp8 is absent. The nad4L and nad4 genes overlap by 40 nt. Most intergenic sequences are very short. Two larger non-coding regions are present. The longer one (817 nt) is located between trnG and cox3 and consists of 8 identical tandem repeats of 85 nt, rich in G and C, followed by 1 imperfect repeat. The shorter non-coding region (187 nt) exhibits no special features and is separated from the longer region by trnG. The gene arrangement resembles that of some other trematodes including the eastern Asian Schistosoma species (and cyclophyllidean cestode species) but it is strikingly different from that of the African schistosomes, represented by Schistosoma mansoni. The genetic code is as inferred previously for flatworms. Transfer RNA genes range in length from 58 to 70 nt, their products producing characteristic 'clover leaf' structures, except for tRNA(S-VON) and tRNA(S-AGN) lacking the DHU arm.
Resumo:
Our previous studies have shown that two distinct genotypes of Sindbis (SIN) virus occur in Australia. One of these, the Oriental/Australian type, circulates throughout most of the Australian continent, whereas the recently identified south-west (SW) genetic type appears to be restricted to a distinct geographic region located in the temperate south-west of Australia. We have now determined the complete nucleotide and translated amino acid sequences of a SW isolate of SIN virus (SW6562) and performed comparative analyses with other SIN viruses at the genomic level. The genome of SW6562 is 11,569 nucleotides in length, excluding the cap nucleotide and poly (A) tail. Overall this virus differs from the prototype SIN virus (strain AR339) by 23% in nucleotide sequence and 12.5% in amino acid sequence. Partial sequences of four regions of the genome of four SW isolates were determined and compared with the corresponding sequences from a number of SIN isolates from different regions of the World. These regions are the non-structural protein (nsP3), the E2 gene, the capsid gene, and the repeated sequence elements (RSE) of the 3'UTR. These comparisons revealed that the SW SIN viruses were more closely related to South African and European strains than to other Australian isolates of SIN virus. Thus the SW genotype of SIN virus may have been introduced into this region of Australia by viremic humans or migratory birds and subsequently evolved independently in the region. The sequence data also revealed that the SW genotype contains a unique deletion in the RSE of the 3'UTR region of the genome. Previous studies have shown that deletions in this region of the SIN genome can have significant effects on virus replication in mosquito and avian cells, which may explain the restricted distribution of this genotype of SIN virus.
Resumo:
Nucleotide sequence analyses of the Pvs48/45 and Pvs47 genes were conducted in 46 malaria patients from the Republic of Korea (ROK) (n = 40) and returning travellers from India (n = 3) and Indonesia (n = 3). The domain structures, which were based on cysteine residue position and secondary protein structure, were similar between Plasmodium vivax (Pvs48/45 and Pvs47) and Plasmodium falciparum (Pfs48/45 and Pfs47). In comparison to the Sal-1 reference strain (Pvs48/45, PVX_083235 and Pvs47, PVX_083240), Korean isolates revealed seven polymorphisms (E35K, H211N, K250N, D335Y, A376T, I380T and K418R) in Pvs48/45. These isolates could be divided into five haplotypes with the two major types having frequencies of 47.5% and 20%, respectivelfy. In Pvs47, 10 polymorphisms (F22L, F24L, K27E, D31N, V230I, M233I, E240D, I262T, I273M and A373V) were found and they could be divided into four haplotypes with one major type having a frequency of 75%. The Pvs48/45 isolates from India showed a unique amino acid substitution site (K26R). Compared to the Sal-1 and ROK isolates, the Pvs47 isolates from travellers returning from India and Indonesia had amino acid substitutions (S57T and I262K). The current data may contribute to the development of the malaria transmission-blocking vaccine in future clinical trials.