106 resultados para CONVERGENT SEQUENCES
em National Center for Biotechnology Information - NCBI
Resumo:
Apolipoprotein(a) [apo(a)] is the distinguishing protein component of lipoprotein(a), a major inherited risk factor for atherosclerosis. Human apo(a) is homologous to plasminogen. It contains from 15 to 50 repeated domains closely related to plasminogen kringle four, plus single kringle five-like and inactive protease-like domains. This expressed gene is confined to a subset of primates. Although most mammals lack apo(a), hedgehogs produce an apo(a)-like protein composed of highly repeated copies of a plasminogen kringle three-like domain, with complete absence of protease domain sequences. Both human and hedgehog apo(a)-like proteins form covalently linked lipoprotein particles that can bind to fibrin and other substrates shared with plasminogen. DNA sequence comparisons and phylogenetic analysis indicate that the human type of apo(a) evolved from a duplicated plasminogen gene during recent primate evolution. In contrast, the kringle three-based type of apo(a) evolved from an independent duplication of the plasminogen gene approximately 80 million years ago. In a type of convergent evolution, the plasminogen gene has been independently remodeled twice during mammalian evolution to produce similar forms of apo(a) in two widely divergent groups of species.
Resumo:
Lactate dehydrogenase (LDH) is present in the amitochondriate parasitic protist Trichomonas vaginalis and some but not all other trichomonad species. The derived amino acid sequence of T. vaginalis LDH (TvLDH) was found to be more closely related to the cytosolic malate dehydrogenase (MDH) of the same species than to any other LDH. A key difference between the two T. vaginalis sequences was that Arg91 of MDH, known to be important in coordinating the C-4 carboxyl of oxalacetate/malate, was replaced by Leu91 in LDH. The change Leu91Arg by site-directed mutagenesis converted TvLDH into an MDH. The reverse single amino acid change Arg91Leu in TvMDH, however, gave a product with no measurable LDH activity. Phylogenetic reconstructions indicate that TvLDH arose from an MDH relatively recently.
Resumo:
The genome sequence of the extremely thermophilic archaeon Methanococcus jannaschii provides a wealth of data on proteins from a thermophile. In this paper, sequences of 115 proteins from M. jannaschii are compared with their homologs from mesophilic Methanococcus species. Although the growth temperatures of the mesophiles are about 50°C below that of M. jannaschii, their genomic G+C contents are nearly identical. The properties most correlated with the proteins of the thermophile include higher residue volume, higher residue hydrophobicity, more charged amino acids (especially Glu, Arg, and Lys), and fewer uncharged polar residues (Ser, Thr, Asn, and Gln). These are recurring themes, with all trends applying to 83–92% of the proteins for which complete sequences were available. Nearly all of the amino acid replacements most significantly correlated with the temperature change are the same relatively conservative changes observed in all proteins, but in the case of the mesophile/thermophile comparison there is a directional bias. We identify 26 specific pairs of amino acids with a statistically significant (P < 0.01) preferred direction of replacement.
Resumo:
The polymerase (PB2) and nucleocapsid (NP) genes encoded by the genome of influenza virus are essential for replication of the virus. When synthetic genes that express RNAs for external guide sequences targeted to the mRNAs of the PB2 and NP genes are stably incorporated into mouse cells in tissue culture, infection of these cells with influenza virus is nonproductive. Endogenous RNase P cleaves the targeted influenza virus mRNAs when they are in a complex with the external guide sequences. Targeting two different mRNAs simultaneously inhibits viral particle production more efficiently than does targeting only one mRNA.
Resumo:
Although infection by primary HIV type 1 (HIV-1) isolates normally requires the functional interaction of the viral envelope protein with both CD4 and the CCR-5 coreceptor, a subset of such isolates also are able to use the distinct CCR-3 receptor. By analyzing the ability of a series of wild-type and chimeric HIV-1 envelope proteins to mediate CCR-3-dependent infection, we have determined that CCR-3 tropism maps to the V1 and V2 variable region of envelope. Although substitution of the V1/V2 region of a CCR-3 tropic envelope into the context of a CCR-5 tropic envelope is both necessary and sufficient to confer CCR-3 tropism, this same substitution has no phenotypic effect when inserted into a CXCR-4 tropic HIV-1 envelope context. However, this latter chimera acquires both CCR-3 and CCR-5 tropism when a CCR-5 tropic V3 loop sequence also is introduced. These data demonstrate that the V1/2 region of envelope can, like the V3 loop region, encode a particular coreceptor requirement and suggest that a functional envelope:CCR-3 interaction may depend on the cooperative interaction of CCR-3 with both the V1/V2 and the V3 region of envelope.
Resumo:
The identification of cDNA clones from genomic regions known to contain human genes is usually the rate-limiting factor in positional cloning strategies. We demonstrate here that human genes present on yeast artificial chromosomes (YACs) are transcribed in yeast host cells. We have used the arbitrarily primed RNA (RAP) fingerprinting method to identify human-specific, transcribed sequences from YACs located in the 13q12 chromosome region. By comparing the RAP fingerprints generated using defined, arbitrary primers from various fragmented YACs, megaYACs, and host yeast, we were able to identify and map 20 products transcribed from the human YAC inserts. This method, therefore, permits the simultaneous isolation and mapping of novel expressed sequences directly from whole YACs.
Resumo:
Caveolae form the terminus for a major pathway of intracellular free cholesterol (FC) transport. Caveolin mRNA levels in confluent human skin fibroblasts were up-regulated following increased uptake of low density lipoprotein (LDL) FC. The increase induced by FC was not associated with detectable change in mRNA stability, indicating that caveolin mRNA levels were mediated at the level of gene transcription. A total of 924 bp of 5′ flanking region of the caveolin gene were cloned and sequenced. The promoter sequence included three G+C-rich potential sterol regulatory elements (SREs), a CAAT sequence and a Sp1 consensus sequence. Deletional mutagenesis of individual SRE-like sequences indicated that of these two (at −646 and −395 bp) were essential for the increased transcription rates mediated by LDL-FC, whereas the third was inconsequential. Gel shift analysis of protein binding from nuclear extracts to these caveolin promoter DNA sequences, together with DNase I footprinting, confirmed nucleoprotein binding to the SRE-like elements as part of the transcriptional response to LDL-FC. A supershift obtained with antibody to SRE-binding protein 1 (SPEBP-1) indicated that this protein binds at −395 bp. There was no reaction at −395 bp with anti-Sp1 antibody nor with either antibody at −646 bp. The cysteine protease inhibitor N-acetyl-leu-leu-norleucinal (ALLN), which inhibits SREBP catabolism, superinhibited caveolin mRNA levels regardless of LDL-FC. This finding suggests that SREBP inhibits caveolin gene transcription in contrast to its stimulating effect on other promoters. The findings of this study are consistent with the postulated role for caveolin as a regulator of cellular FC homeostasis in quiescent peripheral cells, and the coordinate regulation by SREBP of FC influx and efflux.
Resumo:
Certain peptides derived from the α1 domain of the major histocompatibility class I antigen complex (MHC-I) inhibit receptor internalization, increasing the steady-state number of active receptors on the cell surface and thereby enhancing the sensitivity to hormones and other agonists. These peptides self-assemble, and they also bind to MHC-I at the same site from which they are derived, suggesting that they could bind to receptor sites with significant sequence similarity. Receptors affected by MHC-I peptides do, indeed, have such sequence similarity, as illustrated here by insulin receptor (IR) and insulin-like growth factor-1 receptor. A synthetic peptide with sequence identical to a certain extracellular receptor domain binds to that receptor in a ligand-dependent manner and inhibits receptor internalization. Moreover, each such peptide is selective for its cognate receptor. An antibody to the IR peptide not only binds to IR and competes with the peptide but also inhibits insulin-dependent internalization of IR. These observations, and binding studies with deletion mutants of IR, indicate that the sequence QILKELEESSF encoded by exon 10 plays a key role in IR internalization. Our results illustrate a principle for identifying receptor-specific sites of importance for receptor internalization, and for enhancing sensitivity to hormones and other agonists.
Resumo:
We examine the occurrence of the ≈300 known protein folds in different groups of organisms. To do this, we characterize a large fraction of the currently known protein sequences (≈140,000) in structural terms, by matching them to known structures via sequence comparison (or by secondary-structure class prediction for those without structural homologues). Overall, we find that an appreciable fraction of the known folds are present in each of the major groups of organisms (e.g., bacteria and eukaryotes share 156 of 275 folds), and most of the common folds are associated with many families of nonhomologous sequences (i.e., >10 sequence families for each common fold). However, different groups of organisms have characteristically distinct distributions of folds. So, for instance, some of the most common folds in vertebrates, such as globins or zinc fingers, are rare or absent in bacteria. Many of these differences in fold usage are biologically reasonable, such as the folds of metabolic enzymes being common in bacteria and those associated with extracellular transport and communication being common in animals. They also have important implications for database-based methods for fold recognition, suggesting that an unknown sequence from a plant is more likely to have a certain fold (e.g., a TIM barrel) than an unknown sequence from an animal.
Resumo:
Homobasidiomycete fungi display many complex fruiting body morphologies, including mushrooms and puffballs, but their anatomical simplicity has confounded efforts to understand the evolution of these forms. We performed a comprehensive phylogenetic analysis of homobasidiomycetes, using sequences from nuclear and mitochondrial ribosomal DNA, with an emphasis on understanding evolutionary relationships of gilled mushrooms and puffballs. Parsimony-based optimization of character states on our phylogenetic trees suggested that strikingly similar gilled mushrooms evolved at least six times, from morphologically diverse precursors. Approximately 87% of gilled mushrooms are in a single lineage, which we call the “euagarics.” Recently discovered 90 million-year-old fossil mushrooms are probably euagarics, suggesting that (i) the origin of this clade must have occurred no later than the mid-Cretaceous and (ii) the gilled mushroom morphology has been maintained in certain lineages for tens of millions of years. Puffballs and other forms with enclosed spore-bearing structures (Gasteromycetes) evolved at least four times. Derivation of Gasteromycetes from forms with exposed spore-bearing structures (Hymenomycetes) is correlated with repeated loss of forcible spore discharge (ballistospory). Diverse fruiting body forms and spore dispersal mechanisms have evolved among Gasteromycetes. Nevertheless, it appears that Hymenomycetes have never been secondarily derived from Gasteromycetes, which suggests that the loss of ballistospory has constrained evolution in these lineages.
Resumo:
Although integration of viral DNA into host chromosomes occurs regularly in bacteria and animals, there are few reported cases in plants, and these involve insertion at only one or a few sites. Here, we report that pararetrovirus-like sequences have integrated repeatedly into tobacco chromosomes, attaining a copy number of ≈103. Insertion apparently occurred by illegitimate recombination. From the sequences of 22 independent insertions recovered from a healthy plant, an 8-kilobase genome encoding a previously uncharacterized pararetrovirus that does not contain an integrase function could be assembled. Preferred boundaries of the viral inserts may correspond to recombinogenic gaps in open circular viral DNA. An unusual feature of the integrated viral sequences is a variable tandem repeat cluster, which might reflect defective genomes that preferentially recombine into plant DNA. The recurrent invasion of pararetroviral DNA into tobacco chromosomes demonstrates that viral sequences can contribute significantly to plant genome evolution.
Resumo:
Lipochitooligosaccharides (LCOs) are a novel class of plant growth regulators that activate in tobacco protoplasts the expression of AXI1, a gene implicated in auxin signaling. Transient assays with a chimeric PAXI-GUS expression plasmid revealed that the N-octadecenoylated monosaccharide GlcN has all structural requirements for a biological active glycolipid, whereas the inactive N-acylated GalN epimer inhibits LCO action. Specific inhibition of LCO and auxin action shows that both signals are transduced within the tobacco cell via separate pathways that converge at or before AXI1 transcription. Cytokinin is suggested to be a common effector of LCO and auxin signaling. We also show that activation of AXI1 correlates with growth factor-induced cell division.
Resumo:
In this work, we report the posttranscriptional addition of poly(A)-rich sequences to mRNA in chloroplasts of higher plants. Several sites in the coding region and the mature end of spinach chloroplast psbA mRNA, which encodes the D1 protein of photosystem II, are detected as polyadenylylated sites. In eukaryotic cells, the addition of multiple adenosine residues to the 3′ end of nuclear RNA plays a key role in generating functional mRNAs and in regulating mRNA degradation. In bacteria, the adenylation of several RNAs greatly accelerates their decay. The poly(A) moiety in the chloroplast, in contrast to that in eukaryotic nuclear encoded and bacterial RNAs, is not a ribohomopolymer of adenosine residues, but clusters of adenosines bounded mostly by guanosines and rarely by cytidines and uridines; it may be as long as several hundred nucleotides. Further analysis of the initial steps of chloroplast psbA mRNA decay revealed specific endonuclease cleavage sites that perfectly matched the sites where poly(A)-rich sequences were added. Our results suggest a mechanism for the degradation of psbA mRNA in which endonucleolytic cleavages are followed by the addition of poly(A)-rich sequences to the upstream cleavage products, which target these RNAs for rapid decay.
Resumo:
We have investigated mRNA 3′-end-processing signals in each of six eukaryotic species (yeast, rice, arabidopsis, fruitfly, mouse, and human) through the analysis of more than 20,000 3′-expressed sequence tags. The use and conservation of the canonical AAUAAA element vary widely among the six species and are especially weak in plants and yeast. Even in the animal species, the AAUAAA signal does not appear to be as universal as indicated by previous studies. The abundance of single-base variants of AAUAAA correlates with their measured processing efficiencies. As found previously, the plant polyadenylation signals are more similar to those of yeast than to those of animals, with both common content and arrangement of the signal elements. In all species examined, the complete polyadenylation signal appears to consist of an aggregate of multiple elements. In light of these and previous results, we present a broadened concept of 3′-end-processing signals in which no single exact sequence element is universally required for processing. Rather, the total efficiency is a function of all elements and, importantly, an inefficient word in one element can be compensated for by strong words in other elements. These complex patterns indicate that effective tools to identify 3′-end-processing signals will require more than consensus sequence identification.
Resumo:
The chi63 promoter directs glucose-sensitive, chitin-dependent transcription of a gene involved in the utilization of chitin as carbon source. Analysis of 5′ and 3′ deletions of the promoter region revealed that a 350-bp segment is sufficient for wild-type levels of expression and regulation. The analysis of single base changes throughout the promoter region, introduced by random and site-directed mutagenesis, identified several sequences to be important for activity and regulation. Single base changes at −10, −12, −32, −33, −35, and −37 upstream of the transcription start site resulted in loss of activity from the promoter, suggesting that bases in these positions are important for RNA polymerase interaction. The sequences centered around −10 (TATTCT) and −35 (TTGACC) in this promoter are, in fact, prototypical of eubacterial promoters. Overlapping the RNA polymerase binding site is a perfect 12-bp direct repeat sequence. Some base changes within this direct repeat resulted in constitutive expression, suggesting that this sequence is an operator for negative regulation. Other base changes resulted in loss of glucose repression while retaining the requirement for chitin induction, suggesting that this sequence is also involved in glucose repression. The fact that cis-acting mutations resulted in glucose resistance but not inducer independence rules out the possibility that glucose repression acts exclusively by inducer exclusion. The fact that mutations that affect glucose repression and chitin induction fall within the same direct repeat sequence module suggests that the direct repeat sequence facilitates both chitin induction and glucose repression.