99 resultados para multiple locus sequence typing
em National Center for Biotechnology Information - NCBI
Resumo:
Multiple lipoxygenase sequence alignments and structural modeling of the enzyme/substrate interaction of the cucumber lipid body lipoxygenase suggested histidine 608 as the primary determinant of positional specificity. Replacement of this amino acid by a less-space-filling valine altered the positional specificity of this linoleate 13-lipoxygenase in favor of 9-lipoxygenation. These alterations may be explained by the fact that H608V mutation may demask the positively charged guanidino group of R758, which, in turn, may force an inverse head-to-tail orientation of the fatty acid substrate. The R758L+H608V double mutant exhibited a strongly reduced reaction rate and a random positional specificity. Trilinolein, which lacks free carboxylic groups, was oxygenated to the corresponding (13S)-hydro(pero)xy derivatives by both the wild-type enzyme and the linoleate 9-lipoxygenating H608V mutant. These data indicate the complete conversion of a linoleate 13-lipoxygenase to a 9-lipoxygenating species by a single point mutation. It is hypothesized that H608V exchange may alter the orientation of the substrate at the active site and/or its steric configuration in such a way that a stereospecific dioxygen insertion at C-9 may exclusively take place.
Resumo:
Multiple copies of the hexamer TGCATG have been shown to regulate fibronectin pre-mRNA alternative splicing. GCATG repeats also are clustered near the regulated calcitonin-specific 3′ splice site in the rat calcitonin/CGRP gene. Specific mutagenesis of these repeats in calcitonin/CGRP pre-mRNA resulted in the loss of calcitonin-specific splicing, suggesting that the native repeats act to enhance alternative exon inclusion. Mutation of subsets of these elements implies that alternative splicing requires a minimum of two repeats, and that the combination of one intronic and one exonic repeat is necessary for optimal cell-specific splicing. However, multimerized intronic repeats inhibited calcitonin-specific splicing in both the wild-type context and in a transcript lacking endogenous repeats. These results suggest that both the number and distribution of repeats may be important features for the regulation of tissue-specific alternative splicing. Further, RNA containing a single repeat bound cell-specific protein complexes, but tissue-specific differences in protein binding were not detected by using multimerized repeats. Together, these data support a novel model for alternative splicing regulation that requires the cell-specific recognition of multiple, distributed sequence elements.
Resumo:
The function of many of the uncharacterized open reading frames discovered by genomic sequencing can be determined at the level of expressed gene products, the proteome. However, identifying the cognate gene from minute amounts of protein has been one of the major problems in molecular biology. Using yeast as an example, we demonstrate here that mass spectrometric protein identification is a general solution to this problem given a completely sequenced genome. As a first screen, our strategy uses automated laser desorption ionization mass spectrometry of the peptide mixtures produced by in-gel tryptic digestion of a protein. Up to 90% of proteins are identified by searching sequence data bases by lists of peptide masses obtained with high accuracy. The remaining proteins are identified by partially sequencing several peptides of the unseparated mixture by nanoelectrospray tandem mass spectrometry followed by data base searching with multiple peptide sequence tags. In blind trials, the method led to unambiguous identification in all cases. In the largest individual protein identification project to date, a total of 150 gel spots—many of them at subpicomole amounts—were successfully analyzed, greatly enlarging a yeast two-dimensional gel data base. More than 32 proteins were novel and matched to previously uncharacterized open reading frames in the yeast genome. This study establishes that mass spectrometry provides the required throughput, the certainty of identification, and the general applicability to serve as the method of choice to connect genome and proteome.
Resumo:
A multiple protein–DNA complex formed at a human α-globin locus-specific regulatory element, HS-40, confers appropriate developmental expression pattern on human embryonic ζ-globin promoter activity in humans and transgenic mice. We show here that introduction of a 1-bp mutation in an NF-E2/AP1 sequence motif converts HS-40 into an erythroid-specific locus-control region. Cis-linkage with this locus-control region, in contrast to the wild-type HS-40, allows erythroid lineage-specific derepression of the silenced human ζ-globin promoter in fetal and adult transgenic mice. Furthermore, ζ-globin promoter activities in adult mice increase in proportion to the number of integrated DNA fragments even at 19 copies/genome. The mutant HS-40 in conjunction with human ζ-globin promoter thus can be used to direct position-independent and copy number-dependent expression of transgenes in adult erythroid cells. The data also supports a model in which competitive DNA binding of different members of the NF-E2/AP1 transcription factor family modulates the developmental stage specificity of an erythroid enhancer. Feasibility to reswitch on embryonic/fetal globin genes through the manipulation of nuclear factor binding at a single regulatory DNA motif is discussed.
Resumo:
Multiple-complete-digest mapping is a DNA mapping technique based on complete-restriction-digest fingerprints of a set of clones that provides highly redundant coverage of the mapping target. The maps assembled from these fingerprints order both the clones and the restriction fragments. Maps are coordinated across three enzymes in the examples presented. Starting with yeast artificial chromosome contigs from the 7q31.3 and 7p14 regions of the human genome, we have produced cosmid-based maps spanning more than one million base pairs. Each yeast artificial chromosome is first subcloned into cosmids at a redundancy of ×15–30. Complete-digest fragments are electrophoresed on agarose gels, poststained, and imaged on a fluorescent scanner. Aberrant clones that are not representative of the underlying genome are rejected in the map construction process. Almost every restriction fragment is ordered, allowing selection of minimal tiling paths with clone-to-clone overlaps of only a few thousand base pairs. These maps demonstrate the practicality of applying the experimental and software-based steps in multiple-complete-digest mapping to a target of significant size and complexity. We present evidence that the maps are sufficiently accurate to validate both the clones selected for sequencing and the sequence assemblies obtained once these clones have been sequenced by a “shotgun” method.
Resumo:
Translocations involving c-myc and an Ig locus have been reported rarely in human multiple myeloma (MM). Using specific fluorescence in situ hybridization probes, we show complex karyotypic abnormalities of the c-myc or L-myc locus in 19 of 20 MM cell lines and approximately 50% of advanced primary MM tumors. These abnormalities include unusual and complex translocations and insertions that often juxtapose myc with an IgH or IgL locus. For two advanced primary MM tumors, some tumor cells contain a karyotypic abnormality of the c-myc locus, whereas other tumor cells do not, indicating that this karyotypic abnormality of c-myc occurs as a late event. All informative MM cell lines show monoallelic expression of c-myc. For Burkitt's lymphoma and mouse plasmacytoma tumors, balanced translocation that juxtaposes c-myc with one of the Ig loci is an early, invariant event that is mediated by B cell-specific DNA modification mechanisms. By contrast, for MM, dysregulation of c-myc apparently is caused principally by complex genomic rearrangements that occur during late stages of MM progression and do not involve B cell-specific DNA modification mechanisms.
Resumo:
dinP is an Escherichia coli gene recently identified at 5.5 min of the genetic map, whose product shows a similarity in amino acid sequence to the E. coli UmuC protein involved in DNA damage-induced mutagenesis. In this paper we show that the gene is identical to dinB, an SOS gene previously localized near the lac locus at 8 min, the function of which was shown to be required for mutagenesis of nonirradiated λ phage infecting UV-preirradiated bacterial cells (termed λUTM for λ untargeted mutagenesis). A newly constructed dinP null mutant exhibited the same defect for λUTM as observed previously with a dinB::Mu mutant, and the defect was complemented by plasmids carrying dinP as the only intact bacterial gene. Furthermore, merely increasing the dinP gene expression, without UV irradiation or any other DNA-damaging treatment, resulted in a strong enhancement of mutagenesis in F′lac plasmids; at most, 800-fold increase in the G6-to-G5 change. The enhanced mutagenesis did not depend on recA, uvrA, or umuDC. Thus, our results establish that E. coli has at least two distinct pathways for SOS-induced mutagenesis: one dependent on umuDC and the other on dinB/P.
Resumo:
The cell cycle-dependent, ordered assembly of protein prereplicative complexes suggests that eukaryotic replication origins determine when genomic replication initiates. By comparison, the factors that determine where replication initiates relative to the sites of prereplicative complex formation are not known. In the human globin gene locus previous work showed that replication initiates at a single site 5′ to the β-globin gene when protein synthesis is inhibited by emetine. The present study has examined the pattern of initiation around the genetically defined β-globin replicator in logarithmically growing HeLa cells, using two PCR-based nascent strand assays. In contrast to the pattern of initiation detected in emetine-treated cells, analysis of the short nascent strands at five positions spanning a 40 kb globin gene region shows that replication initiates at more than one site in non-drug-treated cells. Quantitation of nascent DNA chains confirmed that replication begins at several locations in this domain, including one near the initiation region (IR) identified in emetine-treated cells. However, the abundance of short nascent strands at another initiation site ∼20 kb upstream is ∼4-fold as great as that at the IR. The latter site abuts an early S phase replicating fragment previously defined at low resolution in logarithmically dividing cells.
Resumo:
By detailed NMR analysis of a human telomere repeating unit, d(CCCTAA), we have found that three distinct tetramers, each of which consists of four symmetric single-strands, slowly exchange in a slightly acidic solution. Our new finding is a novel i-motif topology (T-form) where T4 is intercalated between C1 and C2 of the other duplex. The other two tetramers have a topology where C1 is intercalated between C2 and C3 of the other parallel duplex, resulting in the non-stacking T4 residues (R-form), and a topology where C1 is stacked between C3 and T4 of the other duplex (S-form). From the NMR denaturation profile, the R-form is the most stable of the three structures in the temperature range of 15–50°C, the S-form the second and the T-form the least stable. The thermodynamic parameters indicate that the T-form is the most enthalpically driven and entropically opposed, and its population is increased with decreasing temperature. The T-form structure determined by restrained molecular dynamics calculation suggests that inter-strand van der Waals contacts in the narrow grooves should contribute to the enthalpic stabilization of the T-form.
Resumo:
In this paper, a new way to think about, and to construct, pairwise as well as multiple alignments of DNA and protein sequences is proposed. Rather than forcing alignments to either align single residues or to introduce gaps by defining an alignment as a path running right from the source up to the sink in the associated dot-matrix diagram, we propose to consider alignments as consistent equivalence relations defined on the set of all positions occurring in all sequences under consideration. We also propose constructing alignments from whole segments exhibiting highly significant overall similarity rather than by aligning individual residues. Consequently, we present an alignment algorithm that (i) is based on segment-to-segment comparison instead of the commonly used residue-to-residue comparison and which (ii) avoids the well-known difficulties concerning the choice of appropriate gap penalties: gaps are not treated explicity, but remain as those parts of the sequences that do not belong to any of the aligned segments. Finally, we discuss the application of our algorithm to two test examples and compare it with commonly used alignment methods. As a first example, we aligned a set of 11 DNA sequences coding for functional helix-loop-helix proteins. Though the sequences show only low overall similarity, our program correctly aligned all of the 11 functional sites, which was a unique result among the methods tested. As a by-product, the reading frames of the sequences were identified. Next, we aligned a set of ribonuclease H proteins and compared our results with alignments produced by other programs as reported by McClure et al. [McClure, M. A., Vasi, T. K. & Fitch, W. M. (1994) Mol. Biol. Evol. 11, 571-592]. Our program was one of the best scoring programs. However, in contrast to other methods, our protein alignments are independent of user-defined parameters.
Resumo:
Integration of viral DNA into the host nuclear genome, although not unusual in bacterial and animal systems, has surprisingly not been reported for plants. We have discovered geminvirus-related DNA (GRD) sequences, in the form of distinct sets of multiple direct repeats comprising three related repeat classes, situated in a unique locus in the Nicotiana tabacum (tobacco) nuclear genome. The organization of these sequences is similar or identical in eight different tobacco cultivars we have examined. DNA sequence analysis reveals that each repeat has sequences most resembling those of the New World geminiviral DNA replication origin plus the adjacent AL1 gene, encoding the viral replication protein. We believe these GRD sequences originated quite recently in Nicotiana evolution through integration of geminiviral DNA by some combination of the processes of illegitimate recombination, amplification, deletions, and rearrangements. These events must have occurred in plant tissue that was subsequently able to contribute to meristematic tissue yielding gametes. GRD may have been retained in tobacco by selection or by random fixation in a small evolving population. Although we cannot detect transcription of these sequences, this does not exclude the possibility that they may originally have been expressed.
Resumo:
Transmission of human immunodeficiency virus 1 (HIV-1) from an infected women to her offspring during gestation and delivery was found to be influenced by the infant's major histocompatibility complex class II DRB1 alleles. Forty-six HIV-infected infants and 63 seroreverting infants, born with passively acquired anti-HIV antibodies but not becoming detectably infected, were typed by an automated nucleotide-sequence-based technique that uses low-resolution PCR to select either the simpler Taq or the more demanding T7 sequencing chemistry. One or more DR13 alleles, including DRB1*1301, 1302, and 1303, were found in 31.7% of seroreverting infants and 15.2% of those becoming HIV-infected [OR (odds ratio) = 2.6 (95% confidence interval 1.0-6.8); P = 0.048]. This association was influenced by ethnicity, being seen more strongly among the 80 Black and Hispanic children [OR = 4.3 (1.2-16.4); P = 0.023], with the most pronounced effect among Black infants where 7 of 24 seroreverters inherited these alleles with none among 12 HIV-infected infants (Haldane OR = 12.3; P = 0.037). The previously recognized association of DR13 alleles with some situations of long-term nonprogression of HIV suggests that similar mechanisms may regulate both the occurrence of infection and disease progression after infection. Upon examining for residual associations, only only the DR2 allele DRB1*1501 was associated with seroreversion in Caucasoid infants (OR = 24; P = 0.004). Among Caucasoids the DRB1*03011 allele was positively associated with the occurrence of HIV infection (P = 0.03).
Resumo:
Cancer/testis (CT) antigens—immunogenic protein antigens that are expressed in testis and a proportion of diverse human cancer types—are promising targets for cancer vaccines. To identify new CT antigens, we constructed an expression cDNA library from a melanoma cell line that expresses a wide range of CT antigens and screened the library with an allogeneic melanoma patient serum known to contain antibodies against two CT antigens, MAGE-1 and NY-ESO-1. cDNA clones isolated from this library identified four CT antigen genes: MAGE-4a, NY-ESO-1, LAGE-1, and CT7. Of these four, only MAGE-4a and NY-ESO-1 proteins had been shown to be immunogenic. LAGE-1 is a member of the NY-ESO-1 gene family, and CT7 is a newly defined gene with partial sequence homology to the MAGE family at its carboxyl terminus. The predicted CT7 protein, however, contains a distinct repetitive sequence at the 5′ end and is much larger than MAGE proteins. Our findings document the immunogenicity of LAGE-1 and CT7 and emphasize the power of serological analysis of cDNA expression libraries in identifying new human tumor antigens.
Resumo:
In filamentous fungi, het loci (for heterokaryon incompatibility) are believed to regulate self/nonself-recognition during vegetative growth. As filamentous fungi grow, hyphal fusion occurs within an individual colony to form a network. Hyphal fusion can occur also between different individuals to form a heterokaryon, in which genetically distinct nuclei occupy a common cytoplasm. However, heterokaryotic cells are viable only if the individuals involved have identical alleles at all het loci. One het locus, het-c, has been characterized at the molecular level in Neurospora crassa and encodes a glycine-rich protein. In an effort to understand the role of this locus in filamentous fungi, we chose to study its evolution by analyzing het-c sequence variability in species within Neurospora and related genera. We determined that the het-c locus was polymorphic in a field population of N. crassa with close to equal frequency of each of the three allelic types. Different species and even genera within the Sordariaceae shared het-c polymorphisms, indicating that these polymorphisms originated in an ancestral species. Finally, an analysis of the het-c specificity region shows a high occurrence of nonsynonymous substitution. The persistence of allelic lineages, the nearly equal allelic distribution within populations, and the high frequency of nonsynonymous substitutions in the het-c specificity region suggest that balancing selection has operated to maintain allelic diversity at het-c. Het-c shares this particular evolutionary characteristic of departing from neutrality with other self/nonself-recognition systems such as major histocompatibility complex loci in mammals and the S (self-incompatibility) locus in angiosperms.