934 resultados para Sequence homology, Nucleic acid
Resumo:
This article introduces a new interface for T-Coffee, a consistency-based multiple sequence alignment program. This interface provides an easy and intuitive access to the most popular functionality of the package. These include the default T-Coffee mode for protein and nucleic acid sequences, the M-Coffee mode that allows combining the output of any other aligners, and template-based modes of T-Coffee that deliver high accuracy alignments while using structural or homology derived templates. These three available template modes are Expresso for the alignment of protein with a known 3D-Structure, R-Coffee to align RNA sequences with conserved secondary structures and PSI-Coffee to accurately align distantly related sequences using homology extension. The new server benefits from recent improvements of the T-Coffee algorithm and can align up to 150 sequences as long as 10 000 residues and is available from both http://www.tcoffee.org and its main mirror http://tcoffee.crg.cat.
Resumo:
The elucidation of the domain content of a given protein sequence in the absence of determined structure or significant sequence homology to known domains is an important problem in structural biology. Here we address how successfully the delineation of continuous domains can be accomplished in the absence of sequence homology using simple baseline methods, an existing prediction algorithm (Domain Guess by Size), and a newly developed method (DomSSEA). The study was undertaken with a view to measuring the usefulness of these prediction methods in terms of their application to fully automatic domain assignment. Thus, the sensitivity of each domain assignment method was measured by calculating the number of correctly assigned top scoring predictions. We have implemented a new continuous domain identification method using the alignment of predicted secondary structures of target sequences against observed secondary structures of chains with known domain boundaries as assigned by Class Architecture Topology Homology (CATH). Taking top predictions only, the success rate of the method in correctly assigning domain number to the representative chain set is 73.3%. The top prediction for domain number and location of domain boundaries was correct for 24% of the multidomain set (±20 residues). These results have been put into context in relation to the results obtained from the other prediction methods assessed
Resumo:
The complete amino acid sequence of myotoxin II (godMT-II), a myotoxic phospholipase A( 2 )(PLA(2)) homologue from the venom of the Central American crotaline snake Cerrophidion (Bothrops) godmani, was determined by direct protein sequencing methods. GodMT-II is a class II PLA, showing a Lys instead of Asp at position 49. An additional substitution in the calcium binding loop region (Asn instead of Tyr at position 28) suggests the lack of enzymatic activity observed in this toxin is due to loss of its ability to bind the co-factor Ca2+, since the residues involved in forming the catalytic network of PLA(2)s (His-48, Tyr-52 and Asp-99) an conserved in godMT-II. This myotoxin shows highest sequence homology with other Lys-49 PLA(2)s from Bothrops, Agkistrodon and Trimeresurus species, suggesting that they constitute a conserved family of proteins, yet in contrast presents lower homology with Bothrops asper myotoxin III, a catalytically-active PLA(2). The C-terminal region of godMT-II, which is rich in cationic and hydrophobic residues, shares high sequence homology to the corresponding region in the myotoxin II from B. asper, which has been proposed to play an important role in the Ca2+-independent membrane damaging activity. (C) 1998 Elsevier B.V. B.V. All rights reserved.
Resumo:
In the last decade, two tools, one drawn from information theory and the other from artificial neural networks, have proven particularly useful in many different areas of sequence analysis. The work presented herein indicates that these two approaches can be joined in a general fashion to produce a very powerful search engine that is capable of locating members of a given nucleic acid sequence family in either local or global sequence searches. This program can, in turn, be queried for its definition of the motif under investigation, ranking each base in context for its contribution to membership in the motif family. In principle, the method used can be applied to any binding motif, including both DNA and RNA sequence families, given sufficient family size.
Resumo:
In vitro selection of nucleic acid binding species (aptamers) is superficially similar to the immune response. Both processes produce biopolymers that can recognize targets with high affinity and specificity. While antibodies are known to recognize the sequence and conformation of protein surface features (epitopes), very little is known about the precise interactions between aptamers and their epitopes. Therefore, aptamers that could recognize a particular epitope, a peptide fragment of human immunodeficiency virus type I Rev, were selected from a random sequence RNA pool. Several of the selected RNAs could bind the free peptide more tightly than a natural RNA ligand, the Rev-binding element. In accord with the hypothesis that protein and nucleic acid binding cusps are functionally similar, interactions between aptamers and the peptide target could be disrupted by sequence substitutions. Moreover, the aptamers appeared to be able to bind peptides with different solution conformations, implying an induced fit mechanism for binding. Just as anti-peptide antibodies can sometimes recognize the corresponding epitope when presented in a protein, the anti-peptide aptamers were found to specifically bind to Rev.
Resumo:
Analysis of the 16S rDNA sequence of Conglomeromonas largomobilis subsp. largomobilis supports a phylogenetic relationship with the species of the genus Azospirillum. This confirms results of previous nucleic acid hybridization studies (FALK, E. C., J. L. JOHNSON, V. D. L. BALDANI, J. DOBEREINER, and N. R. KRIEG. 1986. Int. J. Syst. Bacteriol. 36: 80-85). Conglomeromonas largomobilis subsp. largomobilis was most closely related to the species Azospirillim lipoferum and Azospirillum brasilense but sufficiently distant to warrant separate species status. Conglomeromonas largomobilis subsp. parooensis was more distantly related to the existing species of Azospirillum and represents an isolated subline of descent. On the basis of the phylogenetic evidence a prosposal is made to transfer the subspecies Conglom-eromonas largomobilis subsp. largomobilis to the genus Azospirillum as Azospirillum largomobile comb. nov. and to retain the genus Conglomeromonas by elevating the subspecies C. largomobilis subsp. parooensis to the type species of Conglomeromonas as Conglomeromonas parooensis sp. nov.
Resumo:
Dissertação apresentada para obtenção do grau de Doutor em Bioquímica - especialidade Biotecnologia, pela Universidade Nova de Lisboa,Faculdade de Ciências e Tecnologia
Resumo:
Microarray transcript profiling and RNA interference are two new technologies crucial for large-scale gene function studies in multicellular eukaryotes. Both rely on sequence-specific hybridization between complementary nucleic acid strands, inciting us to create a collection of gene-specific sequence tags (GSTs) representing at least 21,500 Arabidopsis genes and which are compatible with both approaches. The GSTs were carefully selected to ensure that each of them shared no significant similarity with any other region in the Arabidopsis genome. They were synthesized by PCR amplification from genomic DNA. Spotted microarrays fabricated from the GSTs show good dynamic range, specificity, and sensitivity in transcript profiling experiments. The GSTs have also been transferred to bacterial plasmid vectors via recombinational cloning protocols. These cloned GSTs constitute the ideal starting point for a variety of functional approaches, including reverse genetics. We have subcloned GSTs on a large scale into vectors designed for gene silencing in plant cells. We show that in planta expression of GST hairpin RNA results in the expected phenotypes in silenced Arabidopsis lines. These versatile GST resources provide novel and powerful tools for functional genomics.
Resumo:
We have determined the sequence of the first 1371 nucleotides at the 5' end of the genome of mouse mammary tumor virus using molecularly cloned proviral DNA of the GR virus strain. The most likely initiation codon used for the gag gene of mouse mammary tumor virus is the first one, located 312 nucleotides from the 5' end of the viral RNA. The 5' splicing site for the subgenomic mRNA's is located approximately 288 nucleotides downstream from the 5' end of the viral RNA. From the DNA sequence the amino acid sequence of the N-terminal half of the gag precursor protein, including p10 and p21, was deduced (353 amino acids).
Resumo:
Starting from a biologically active recombinant DNA clone of exogenous unintegrated GR mouse mammary tumor virus, we have generated three subclones of PstI fragments of 1.45, 1.1, and 2.0 kb in the plasmid vector PBR322. The nucleotide sequence has been determined for the clone of 1.45 kb which includes almost the complete region of the long terminal repeat (LTR) plus an adjacent stretch of unique sequence DNA. A short region of the 2.0 kb clone, containing the beginning of the LTR, has also been sequenced. Starting with the A of an initiation codon outside the LTR, we detected an open reading frame of 960 nucleotides, potentially coding for a protein of 320 amino acids (36K). Two hundred nucleotides downstream from the termination codon, and approximately 25 nucleotides upstream from the presumptive initiation site of viral RNA synthesis, we found a promoter-like sequence. The sequence AGTAAA was detected approximately 15-20 nucleotides upstream from the 3' end of virion RNA and probably serves as a polyadenylation signal. The 1.45 kb PstI fragment has been transfected into Ltk- cells together with a plasmid containing the thymidine kinase gene of herpes simplex virus. The virus-specific RNA synthesis detected in a Tk+ cell clone was strongly stimulated by the addition of dexamethasone.
Resumo:
The peroxisome proliferator-activated receptor alpha is a ligand-activated transcription factor that plays an important role in the regulation of lipid homeostasis. PPARalpha mediates the effects of fibrates, which are potent hypolipidemic drugs, on gene expression. To better understand the biological effects of fibrates and PPARalpha, we searched for genes regulated by PPARalpha using oligonucleotide microarray and subtractive hybridization. By comparing liver RNA from wild-type and PPARalpha null mice, it was found that PPARalpha decreases the mRNA expression of enzymes involved in the metabolism of amino acids. Further analysis by Northern blot revealed that PPARalpha influences the expression of several genes involved in trans- and deamination of amino acids, and urea synthesis. Direct activation of PPARalpha using the synthetic PPARalpha ligand WY14643 decreased mRNA levels of these genes, suggesting that PPARalpha is directly implicated in the regulation of their expression. Consistent with these data, plasma urea concentrations are modulated by PPARalpha in vivo. It is concluded that in addition to oxidation of fatty acids, PPARalpha also regulates metabolism of amino acids in liver, indicating that PPARalpha is a key controller of intermediary metabolism during fasting.
Resumo:
The sequence profile method (Gribskov M, McLachlan AD, Eisenberg D, 1987, Proc Natl Acad Sci USA 84:4355-4358) is a powerful tool to detect distant relationships between amino acid sequences. A profile is a table of position-specific scores and gap penalties, providing a generalized description of a protein motif, which can be used for sequence alignments and database searches instead of an individual sequence. A sequence profile is derived from a multiple sequence alignment. We have found 2 ways to improve the sensitivity of sequence profiles: (1) Sequence weights: Usage of individual weights for each sequence avoids bias toward closely related sequences. These weights are automatically assigned based on the distance of the sequences using a published procedure (Sibbald PR, Argos P, 1990, J Mol Biol 216:813-818). (2) Amino acid substitution table: In addition to the alignment, the construction of a profile also needs an amino acid substitution table. We have found that in some cases a new table, the BLOSUM45 table (Henikoff S, Henikoff JG, 1992, Proc Natl Acad Sci USA 89:10915-10919), is more sensitive than the original Dayhoff table or the modified Dayhoff table used in the current implementation. Profiles derived by the improved method are more sensitive and selective in a number of cases where previous methods have failed to completely separate true members from false positives.
Resumo:
The isolation of the four Xenopus laevis vitellogenin genes has been completed by the purification from a DNA library of the B2 gene together with its flanking sequences. The overlapping DNA fragments analyzed cover 34 kilobases. The B2 gene which has a length of 17.5 kilobases was characterized by heteroduplex and R-loop mapping in the electron microscope and by in vitro transcription in a HeLa whole-cell extract. Its structural organization is compared with that of the closely related B1 gene. The mRNA-coding sequence of about 6 kilobases is interrupted 34 times in the B1 gene and 33 times in the B2 gene. Sequence homology between the two genes was not only found in exons. In addition, 54% of the intron sequences as well as 63% and 48.5% respectively of the 5' and 3' flanking sequences, show enough homology to form stable duplexes. These findings are compared with earlier results obtained with the two other closely related members of the vitellogenin gene family, the A1 and the A2 genes.
Resumo:
Vitellogenin is synthesized under estrogen control in the liver, extensively modified, transported to the ovary, and there processed to the yolk proteins lipovitellin and phosvitin. In the frog Xenopus laevis there are at least four distinct but related vitellogenin genes. The two genes A1 and A2 have a 95 percent sequence homology in their messenger RNA coding regions, and contain 33 introns that interrupt the coding region (exons) at homologous positions. Sequences and lengths of analogous introns differ, and many introns contain repetitive DNA elements. The introns in these two genes that have apparently arisen by duplication have diverged extensively by events that include deletions, insertions, and probably duplications. Rapid evolutionary change involving rearrangements and the presence of repeated DNA suggests that the bulk of the sequences within introns may not have any specific function.