56 resultados para Genome-specific Sequence
em National Center for Biotechnology Information - NCBI
Resumo:
The SfiI endonuclease cleaves DNA at the sequence GGCCNNNN↓NGGCC, where N is any base and ↓ is the point of cleavage. Proteins that recognise discontinuous sequences in DNA can be affected by the unspecified sequence between the specified base pairs of the target site. To examine whether this applies to SfiI, a series of DNA duplexes were made with identical sequences apart from discrete variations in the 5 bp spacer. The rates at which SfiI cleaved each duplex were measured under steady-state conditions: the steady-state rates were determined by the DNA cleavage step in the reaction pathway. SfiI cleaved some of these substrates at faster rates than other substrates. For example, the change in spacer sequence from AACAA to AAACA caused a 70-fold increase in reaction rate. In general, the extrapolated values for kcat and Km were both higher on substrates with inflexible spacers than those with flexible structures. The dinucleotide at the site of cleavage was largely immaterial. SfiI activity is thus highly dependent on conformational variations in the spacer DNA.
Resumo:
A satellite DNA sequence, As120a, specific to the A-genome chromosomes in the hexaploid oat, Avena sativa L., was isolated by subcloning a fragment with internal tandem repeats from a plasmid, pAs120, that had been obtained from an Avena strigosa (As genome) genomic library. Southern and in situ hybridization showed that sequences with homology to sequences within pAs120 were dispersed throughout the genome of diploid (A and C genomes), tetraploid (AC genomes), and hexaploid (ACD genomes) Avena species. In contrast, sequences homologous to As120a were found in two A-genome species (A. strigosa and Avena longiglumis) and in the hexaploid A. sativa whereas this sequence was little amplified in the tetraploid Avena murphyi and was absent in the remaining A- and C-genome diploid species. In situ hybridization of pAs120a to hexaploid oat species revealed the distribution of elements of the As120a repeated family over both arms of 14 of 42 chromosomes of this species. By using double in situ hybridization with pAs120a and a C genome-specific probe, three sets of 14 chromosomes were revealed corresponding to the A, C, and D genomes of the hexaploid species. Simultaneous in situ hybridizations with pAs120a and ribosomal probes were used to assign the SAT chromosomes of hexaploid species to their correct genomes. This work reports a sequence able to distinguish between the closely related A and D genomes of hexaploid oats. This sequence offers new opportunities to analyze the relationships of Avena species and to explore the possible evolution of various polyploid oat species.
Resumo:
Multiple isoforms of type 1 hexokinase (HK1) are transcribed during spermatogenesis in the mouse, including at least three that are presumably germ cell specific: HK1-sa, HK1-sb, and HK1-sc. Each of these predicted proteins contains a common, germ cell-specific sequence that replaces the porin-binding domain found in somatic HK1. Although HK1 protein is present in mature sperm and is tyrosine phosphorylated, it is not known whether the various potential isoforms are differentially translated and localized within the developing germ cells and mature sperm. Using antipeptide antisera against unique regions of HK1-sa and HK1-sb, it was demonstrated that these isoforms were not found in pachytene spermatocytes, round spermatids, condensing spermatids, or sperm, suggesting that HK1-sa and HK1-sb are not translated during spermatogenesis. Immunoreactivity was detected in protein from round spermatids, condensing spermatids, and mature sperm using an antipeptide antiserum against the common, germ cell-specific region, suggesting that HK1-sc was the only germ cell-specific isoform present in these cells. Two-dimensional SDS-PAGE suggested that all of the sperm HK1-sc was tyrosine phosphorylated, and that the somatic HK1 isoform was not present. Immunoelectron microscopy revealed that HK1-sc was associated with the mitochondria and with the fibrous sheath of the flagellum and was found in discrete clusters in the region of the membranes of the sperm head. The unusual distribution of HK1-sc in sperm suggests novel functions, such as extramitochondrial energy production, and also demonstrates that a hexokinase without a classical porin-binding domain can localize to mitochondria.
Resumo:
The Escherichia coli MG1655 genome has been completely sequenced. The annotated sequence, biochemical information, and other information were used to reconstruct the E. coli metabolic map. The stoichiometric coefficients for each metabolic enzyme in the E. coli metabolic map were assembled to construct a genome-specific stoichiometric matrix. The E. coli stoichiometric matrix was used to define the system's characteristics and the capabilities of E. coli metabolism. The effects of gene deletions in the central metabolic pathways on the ability of the in silico metabolic network to support growth were assessed, and the in silico predictions were compared with experimental observations. It was shown that based on stoichiometric and capacity constraints the in silico analysis was able to qualitatively predict the growth potential of mutant strains in 86% of the cases examined. Herein, it is demonstrated that the synthesis of in silico metabolic genotypes based on genomic, biochemical, and strain-specific information is possible, and that systems analysis methods are available to analyze and interpret the metabolic phenotype.
Resumo:
Chlorarachniophyte algae contain a complex, multi-membraned chloroplast derived from the endosymbiosis of a eukaryotic alga. The vestigial nucleus of the endosymbiont, called the nucleomorph, contains only three small linear chromosomes with a haploid genome size of 380 kb and is the smallest known eukaryotic genome. Nucleotide sequence data from a subtelomeric fragment of chromosome III were analyzed as a preliminary investigation of the coding capacity of this vestigial genome. Several housekeeping genes including U6 small nuclear RNA (snRNA), ribosomal proteins S4 and S13, a core protein of the spliceosome [small nuclear ribonucleoprotein (snRNP) E], and a cip-like protease (clpP) were identified. Expression of these genes was confirmed by combinations of Northern blot analysis, in situ hybridization, immunocytochemistry, and cDNA analysis. The protein-encoding genes are typically eukaryotic in overall structure and their messenger RNAs are polyadenylylated. A novel feature is the abundance of 18-, 19-, or 20-nucleotide introns; the smallest spliceosomal introns known. Two of the genes, U6 and S13, overlap while another two genes, snRNP E and clpP, are cotranscribed in a single mRNA. The overall gene organization is extraordinarily compact, making the nucleomorph a unique model for eukaryotic genomics.
Resumo:
A polymorphic C-->T transition located on the human Y chromosome was found by the systematic comparative sequencing of Y-specific sequence-tagged sites by denaturing high-performance liquid chromatography. The results of genotyping representative global indigenous populations indicate that the locus is polymorphic exclusively within the Western Hemisphere. The pre-Columbian T allele occurs at > 90% frequency within the native South and Central American populations examined, while its occurrence in North America is approximately 50%. Concomitant genotyping at the polymorphic tetranucleotide microsatellite DYS19 locus revealed that the C-->T mutation displayed significant linkage disequilibrium with the 186-bp allele. The data suggest a single origin of linguistically diverse native Americans with subsequent haplotype differentiation within radiating indigenous populations as well as post-Columbian European and African gene flow. The mutation may have originated either in North America at a very early time during the expansion or before it, in the ancestral population(s) from which all Americans may have originated. The analysis of linkage of the DYS199 and the DYS19 tetranucleotide loci suggests that the C-->T mutation may have occurred around 30,000 years ago. We estimate the nucleotide diversity over 4.2 kb of the nonrecombining portion of the Y chromosome to be 0.00014. compared to autosomes, the majority of variation is due to the smaller effective population size of the Y chromosome rather than selective sweeps. There begins to emerge a pattern of pronounced geographical localization of Y-specific nucleotide substitution polymorphisms.
Resumo:
Many flowering plants possess self-incompatibility (SI) systems that prevent inbreeding. In Brassica, SI is controlled by a single polymorphic locus, the S locus. Two highly polymorphic S locus genes, SLG (S locus glycoprotein) and SRK (S receptor kinase), have been identified, both of which are expressed predominantly in the stigmatic papillar cell. We have shown recently that SRK is the determinant of the S haplotype specificity of the stigma. SRK is thought to serve as a receptor for a pollen ligand, which presumably is encoded by another polymorphic gene at the S locus. We previously have identified an S locus gene, SP11 (S locus protein 11), of the S9 haplotype of Brassica campestris and proposed that it potentially encodes the pollen ligand. SP11 is a novel member of the PCP (pollen coat protein) family of proteins, some members of which have been shown to interact with SLG. In this work, we identified the SP11 gene from three additional S haplotypes and further characterized the gene. We found that (i) SP11 showed an S haplotype-specific sequence polymorphism; (ii) SP11 was located in the immediate flanking region of the SRK gene of the four S haplotypes examined; (iii) SP11 was expressed in the tapetum of the anther, a site consistent with sporophytic control of Brassica SI; and (iv) recombinant SP11 of the S9 haplotype applied to papillar cells of S9 stigmas, but not of S8 stigmas, elicited SI response, resulting in inhibition of hydration of cross-pollen. All these results taken together strongly suggest that SP11 is the pollen S determinant in SI.
Resumo:
This study identified and purified specific isoamylase- and pullulanase-type starch-debranching enzymes (DBEs) present in developing maize (Zea mays L.) endosperm. The cDNA clone Zpu1 was isolated based on its homology with a rice (Oryza sativa L.) cDNA coding for a pullulanase-type DBE. Comparison of the protein product, ZPU1, with 18 other DBEs identified motifs common to both isoamylase- and pullulanase-type enzymes, as well as class-specific sequence blocks. Hybridization of Zpu1 to genomic DNA defined a single-copy gene, zpu1, located on chromosome 2. Zpu1 mRNA was abundant in endosperm throughout starch biosynthesis, but was not detected in the leaf or the root. Anti-ZPU1 antiserum specifically recognized the approximately 100-kD ZPU1 protein in developing endosperm, but not in leaves. Pullulanase- and isoamylase-type DBEs were purified from extracts of developing maize kernels. The pullulanase-type activity was identified as ZPU1 and the isoamylase-type activity as SU1. Mutations of the sugary1 (su1) gene are known to cause deficiencies of SU1 isoamylase and a pullulanase-type DBE. ZPU1 activity, protein level, and electrophoretic mobility were altered in su1-mutant kernels, indicating that it is the affected pullulanase-type DBE. The Zpu1 transcript levels were equivalent in nonmutant and su1-mutant kernels, suggesting that coordinated regulation of ZPU1 and SU1 occurs posttranscriptionally.
Resumo:
Many resident membrane proteins of the endoplasmic reticulum (ER) do not have known retrieval sequences. Among these are the so-called tail-anchored proteins, which are bound to membranes by a hydrophobic tail close to the C terminus and have most of their sequence as a cytosolically exposed N-terminal domain. Because ER tail-anchored proteins generally have short (< or = 17 residues) hydrophobic domains, we tested whether this feature is important for localization, using cytochrome b5 as a model. The hydrophobic domain of cytochrome b5 was lengthened by insertion of five amino acids (ILAAV), and the localization of the mutant was analyzed by immunofluorescence in transiently transfected mammalian cells. While the wild-type cytochrome was localized to the ER, the mutant was relocated to the surface. This relocation was not due to the specific sequence introduced, as demonstrated by the ER localization of a second mutant, in which the original length of the membrane anchor was restored, while maintaining the inserted ILAAV sequence. Experiments with brefeldin A and with cycloheximide demonstrated that the extended anchor mutant reached the plasma membrane by transport along the secretory pathway. We conclude that the short membrane anchor of cytochrome b5 is important for its ER residency, and we discuss the relevance of this finding for other ER tail-anchored proteins.
Resumo:
Rev-erb alpha belongs to the nuclear receptor superfamily, which contains receptors for steroids, thyroid hormones, retinoic acid, and vitamin D, as well as "orphan" receptors. No ligand has been found for Rev-erb alpha to date, making it one of these orphan receptors. Similar to some other orphan receptors, Rev-erb alpha has been shown to bind DNA as a monomer on a specific sequence called a Rev-erb alpah responsive element (RevRE), but its transcriptional activity remains unclear. In this paper, we characterize a functional RevRE located in the human Rev-erb alpha promoter itself. We also present evidence that (i) Rev-erb alpha mediates transcriptional repression of its own promoter in vitro, (ii) this repressing effect strictly depends on the binding of Rev-erb alpha to its responsive element and is transferable to a heterologous promoter; and (iii) Rev-erb alpha binds to this responsive sequence as a homodimer.
Resumo:
Chimeric genomes of poliovirus (PV) have been constructed in which the cognate internal ribosomal entry site (IRES) element was replaced by genetic elements of hepatitis C virus (HCV). Replacement of PV IRES with nt 9-332 of the genotype Ib HCV genome, a sequence comprising all but the first eight residues of the 5' nontranslated region (5'NTR) of HCV, resulted in a lethal phenotype. Addition of 366 nt of the HCV core-encoding sequence downstream of the HCV 5'NTR yielded a viable PV/HCV chimera, which expressed a stable, small-plaque phenotype. This chimeric genome encoded a truncated HCV core protein that was fused to the N terminus of the PV polyprotein via an engineered cleavage site for PV proteinase 3CPpro. Manipulation of the HCV core-encoding sequence of this viable chimera by deletion and frameshift yielded results suggesting that the 5'-proximal sequences of the HCV open reading frame were essential for viability of the chimera and that the N-terminal basic region of the HCV core protein is required for efficient replication of the chimeric virus. These data suggest that the bona fide HCV IRES includes genetic information mapping to the 5'NTR and sequences of the HCV open reading frame. PV chimeras replicating under translational control of genetic elements of HCV can serve to study HCV IRES function in vivo and to search for anti-HCV chemotherapeutic agents.
Resumo:
NACP, a 140-amino acid presynaptic protein, is the precursor of NAC [the non-amyloid beta/A4 protein (A beta) component of Alzheimer disease (AD) amyloid], a peptide isolated from and immunologically localized to brain amyloid of patients afflicted with AD. NACP produced in Escherichia coli bound to A beta peptides, the major component of AD amyloid. NACP bound to A beta 1-38 and A beta 25-35 immobilized on nitrocellulose but did not bind to A beta 1-28 on the filter under the same conditions. NACP binding to A beta 1-38 was abolished by addition of A beta 25-35 but not by A beta 1-28, suggesting that the hydrophobic region of the A beta peptide is critical to this binding. NACP-112, a shorter splice variant of NACP containing the NAC sequence, bound to A beta, but NACP delta, a deletion mutant of NACP lacking the NAC domain, did not bind A beta 1-38. Furthermore, binding between NACP-112 and A beta 1-38 was decreased by addition of peptide Y, a peptide that covers the last 15 residues of NAC. In an aqueous solution, A beta 1-38 aggregation was observed when NACP was also present in an incubation mixture at a ratio of 1:125 (NACP/A beta), whereas A beta 1-38 alone or NACP alone did not aggregate under the same conditions, suggesting that the formation of a complex between A beta and NACP may promote aggregation of A beta. Thus, NACP can bind A beta peptides through the specific sequence and can promote A beta aggregation, raising the possibility that NACP may play a role in the development of AD amyloid.
Resumo:
The existence of a code relating the set of possible sequences at a given position in a protein backbone to the local structure at that location is investigated. It is shown that only 73% of 4-C alpha structure fragments in a sample of 114 protein structures exhibit a preference for a particular set of sequences. The remaining structures can accommodate essentially any sequence. The structures that encode specific sequence distributions include the classical "secondary" structures, with the notable exception of planar (beta) bends. It is suggested that this has implications as to the mechanism of folding in proteins with extensive sheet/barrel structure. The possible role of structures that do not encode specific sequences as mutation hot spots is noted.
Resumo:
A 5.2-kb mRNA band that contains estrogen receptor (ER) sequence and exhibits sex- and tissue-specific expression has been identified in rat pituitary via Northern analysis; this band is composed of at least two distinctive ER mRNA isoforms. This mRNA is expressed in high levels in female pituitary but is absent in male pituitary and uterus, whereas the mRNA encoding the full-length receptor (6.2 kb) is expressed in all the aforementioned tissues. Estradiol treatment potently induces the expression of the 5.2-kb band in the male pituitary. Oligonucleotide hybridization and ribonuclease-protection experiments indicate that the pituitary ER variant is missing exons 1-4. Two corresponding cDNA clones, truncated estrogen receptor product 1 and 2 (TERP-1 and TERP-2), were isolated by using the anchored PCR. Both sequences contain a 31-bp segment of specific sequence upstream of exon 5; TERP-2, however, contains an additional 66 bp of specific sequence between the 31-bp segment and exon 5. On Northern analysis, probes complementary to the 31-bp segment of specific sequence hybridize only to the 5.2-kb band. Immunoblotting identified several proteins in rat pituitary that could represent the translation products of these or related transcripts. In summary, several ER isoforms have been identified that exhibit both tissue-specific expression and marked estrogen regulation and differ from full-length receptor by virtue of sequence upstream of the exon 4/5 boundary. Physiologically, the putative proteins encoded by these or similar isoforms might be important modulators of the tissue- and promoter-specific effects of estradiol.
Resumo:
We present a method for discovering conserved sequence motifs from families of aligned protein sequences. The method has been implemented as a computer program called emotif (http://motif.stanford.edu/emotif). Given an aligned set of protein sequences, emotif generates a set of motifs with a wide range of specificities and sensitivities. emotif also can generate motifs that describe possible subfamilies of a protein superfamily. A disjunction of such motifs often can represent the entire superfamily with high specificity and sensitivity. We have used emotif to generate sets of motifs from all 7,000 protein alignments in the blocks and prints databases. The resulting database, called identify (http://motif.stanford.edu/identify), contains more than 50,000 motifs. For each alignment, the database contains several motifs having a probability of matching a false positive that range from 10−10 to 10−5. Highly specific motifs are well suited for searching entire proteomes, while generating very few false predictions. identify assigns biological functions to 25–30% of all proteins encoded by the Saccharomyces cerevisiae genome and by several bacterial genomes. In particular, identify assigned functions to 172 of proteins of unknown function in the yeast genome.