916 resultados para Genome-specific Sequence
Resumo:
The product of the gene (ATM) mutated in the human genetic disorder ataxia-telangiectasia (A-T) is a high molecular weight, protein (similar to350 kDa) containing a C-terminal protein kinase domain and a number of other putative domains not yet functionally defined. The majority of ATM gene mutations in A-T patients are truncating, resulting in prematurely terminated products that are highly unstable. Missense mutations within the kinase domain and elsewhere in the molecule alter the stability of the protein and lead to loss of protein kinase activity. Only rarely are patients observed with two missense mutations and this gives rise to a milder disease phenotype. Evidence for a dominant interfering effect on normal ATM kinase activity has been reported in cell lines transfected with missense mutant ATM and in cell lines from some A-T heterozygotes. The dominant negative effect of mutant ATM is manifested by an enhancement of cellular radiosensitivity and may be responsible for the cancer predisposition observed in carriers of ATM missense mutations. In this review, we explore the domain structure of the ATM molecule, sites of interaction with other proteins and the consequences of specific amino acid changes on function. (C) 2003 Elsevier B.V. All rights reserved.
Resumo:
There are eight genotypes and nine subtypes of HBV. Small differences in geographical origin are associated with sequence changes in the surface gene. Here, we compared core gene sequences from different genotypes and geographical regions. Specific combinations of 24 amino acid substitutions at nine residues allowed allocation of a sequence to a subtype. Six of these nine residues were located in different T cell epitopes depending on HBV geographical area and/or genotype. Thirty-seven nucleotide changes were associated uniquely with specific genotypes and subtypes. Unique amino acid and nucleotide variants were found in a majority of sequences from specific countries as well as within subtype ayw2 and adr. Specific nucleotide motifs were defined for Korean, Indian, Chinese, Italian and Pacific region isolates. Finally, we observed amino acid motifs that were common to either South-east Asian or Western populations, irrespective of subtype. We believe that HBV strains spread within constrained ethnic groups, result in selection pressures that define sequence variability within each subtype. It suggests that particular T cell epitopes are specific for geographical regions, and thus ethnic groups; this may affect the design of immunomodulatory therapies.
Resumo:
Sequence diversity in the coat protein coding region of Australian strains of Johnsongrass mosaic virus (JGMV) was investigated. Field isolates were sampled during a seven year period from Johnsongrass, sorghum and corn across the northern grain growing region. The 23 isolates were found to have greater than 94% nucleotide and amino acid sequence identity. The Australian isolates and two strains from the U.S.A. had about 90% nucleotide sequence identity and were between 19 and 30% different in the N-terminus of the coat protein. Two amino acid residues were found in the core region of the coat protein in isolates obtained from sorghum having the Krish gene for JGMV resistance that differed from those found in isolates from other hosts which did not have this single dominant resistance gene. These amino acid changes may have been responsible for overcoming the resistance conferred by the Krish gene for JGMV resistance in sorghum. The identification of these variable regions was essential for the development of durable pathogen-derived resistance to JGMV in sorghum.
Resumo:
We completed the genome sequence of Lettuce necrotic yellows virus (LNYV) by determining the nucleotide sequences of the 4a (putative phosphoprotein), 4b, M (matrix protein), G (glycoprotein) and L (polymerase) genes. The genome consists of 12,807 nucleotides and encodes six genes in the order 3' leader-N-4a(P)-4b-M-G-L-5' trailer. Sequences were derived from clones of a cDNA library from LNYV genomic RNA and from fragments amplified using reverse transcription-polymerase chain reaction. The 4a protein has a low isoelectric point characteristic for rhabdovirus phosphoproteins. The 4b protein has significant sequence similarities with the movement proteins of capillo- and trichoviruses and may be involved in cell-to-cell movement. The putative G protein sequence contains a predicted 25 amino acids signal peptide and endopeptidase cleavage site, three predicted glycosylation sites and a putative transmembrane domain. The deduced L protein sequence shows similarities with the L proteins of other plant rhabdoviruses and contains polymerase module motifs characteristic for RNA-dependent RNA polymerases of negative-strand RNA viruses. Phylogenetic analysis of this motif among rhabdoviruses placed LNYV in a group with other sequenced cytorhabdoviruses, most closely related to Strawberry crinkle virus. (c) 2005 Elsevier B.V. All rights reserved.
Resumo:
This Article Right arrow Full Text Right arrow Full Text (PDF) Right arrow Supplemental material Right arrow Alert me when this article is cited Right arrow Alert me if a correction is posted Services Right arrow Similar articles in this journal Right arrow Similar articles in PubMed Right arrow Alert me to new issues of the journal Right arrow Download to citation manager Right arrow Reprints and Permissions Right arrow Copyright Information Right arrow Books from ASM Press Right arrow MicrobeWorld Citing Articles Right arrow Citing Articles via HighWire Right arrow Citing Articles via Google Scholar Google Scholar Right arrow Articles by Lee, N. Right arrow Articles by McCarthy, J. Right arrow Search for Related Content PubMed Right arrow PubMed Citation Right arrow Articles by Lee, N. Right arrow Articles by McCarthy, J. Right arrow Pubmed/NCBI databases * Substance via MeSH Previous Article | Next Article Journal of Clinical Microbiology, August 2006, p. 2773-2778, Vol. 44, No. 8 0095-1137/06/$08.00+0 doi:10.1128/JCM.02557-05 Copyright © 2006, American Society for Microbiology. All Rights Reserved. Effect of Sequence Variation in Plasmodium falciparum Histidine- Rich Protein 2 on Binding of Specific Monoclonal Antibodies: Implications for Rapid Diagnostic Tests for Malaria{dagger} Nelson Lee,1,2 Joanne Baker,2 Kathy T. Andrews,1 Michelle L. Gatton,1,3 David Bell,4 Qin Cheng,2,3 and James McCarthy1* Australian Centre for International and Tropical Health and Nutrition, Queensland Institute of Medical Research and School of Population Health, University of Queensland, Queensland, Australia,1 Department of Drug Resistance and Diagnostics, Australian Army Malaria Institute, Brisbane, Australia,2 Malaria Drug Resistance and Chemotherapy, Queensland Institute of Medical Research, Queensland, Australia,3 World Health Organization, Regional Office for the Western Pacific, Manila, Philippines4 Received 8 December 2005/ Returned for modification 23 February 2006/ Accepted 26 May 2006 The ability to accurately diagnose malaria infections, particularly in settings where laboratory facilities are not well developed, is of key importance in the control of this disease. Rapid diagnostic tests (RDTs) offer great potential to address this need. Reports of significant variation in the field performance of RDTs based on the detection of Plasmodium falciparum histidine-rich protein 2 (HRP2) (PfHRP2) and of significant sequence polymorphism in PfHRP2 led us to evaluate the binding of four HRP2-specific monoclonal antibodies (MABs) to parasite proteins from geographically distinct P. falciparum isolates, define the epitopes recognized by these MABs, and relate the copy number of the epitopes to MAB reactivity. We observed a significant difference in the reactivity of the same MAB to different isolates and between different MABs tested with single isolates. When the target epitopes of three of the MABs were determined and mapped onto the peptide sequences of the field isolates, significant variability in the frequency of these epitopes was observed. These findings support the role of sequence variation as an explanation for variations in the performance of HRP2-based RDTs and point toward possible approaches to improve their diagnostic sensitivities
Resumo:
Recent large-scale analyses of mainly full-length cDNA libraries generated from a variety of mouse tissues indicated that almost half of all representative cloned sequences did flat contain ail apparent protein-coding sequence, and were putatively derived from non-protein-coding RNA (ncRNA) genes. However, many of these clones were singletons and the majority were unspliced, raising the possibility that they may be derived from genomic DNA or unprocessed pre-rnRNA contamination during library construction, or alternatively represent nonspecific transcriptional noise. Here we Show, using reverse transcriptase-dependent PCR, microarray, and Northern blot analyses, that many of these clones were derived from genuine transcripts Of unknown function whose expression appears to be regulated. The ncRNA transcripts have larger exons and fewer introns than protein-coding transcripts. Analysis of the genomic landscape around these sequences indicates that some cDNA clones were produced not from terminal poly(A) tracts but internal priming sites within longer transcripts, only a minority of which is encompassed by known genes. A significant proportion of these transcripts exhibit tissue-specific expression patterns, as well as dynamic changes in their expression in macrophages following lipopolysaccharide Stimulation. Taken together, the data provide strong support for the conclusion that ncRNAs are an important, regulated component of the mammalian transcriptome.
Resumo:
High-quality data about protein structures and their gene sequences are essential to the understanding of the relationship between protein folding and protein coding sequences. Firstly we constructed the EcoPDB database, which is a high-quality database of Escherichia coli genes and their corresponding PDB structures. Based on EcoPDB, we presented a novel approach based on information theory to investigate the correlation between cysteine synonymous codon usages and local amino acids flanking cysteines, the correlation between cysteine synonymous codon usages and synonymous codon usages of local amino acids flanking cysteines, as well as the correlation between cysteine synonymous codon usages and the disulfide bonding states of cysteines in the E. coli genome. The results indicate that the nearest neighboring residues and their synonymous codons of the C-terminus have the greatest influence on the usages of the synonymous codons of cysteines and the usage of the synonymous codons has a specific correlation with the disulfide bond formation of cysteines in proteins. The correlations may result from the regulation mechanism of protein structures at gene sequence level and reflect the biological function restriction that cysteines pair to form disulfide bonds. The results may also be helpful in identifying residues that are important for synonymous codon selection of cysteines to introduce disulfide bridges in protein engineering and molecular biology. The approach presented in this paper can also be utilized as a complementary computational method and be applicable to analyse the synonymous codon usages in other model organisms. (c) 2005 Elsevier Ltd. All rights reserved.
Resumo:
Mammalian promoters can be separated into two classes, conserved TATA box-enriched promoters, which initiate at a welldefined site, and more plastic, broad and evolvable CpG-rich promoters. We have sequenced tags corresponding to several hundred thousand transcription start sites (TSSs) in the mouse and human genomes, allowing precise analysis of the sequence architecture and evolution of distinct promoter classes. Different tissues and families of genes differentially use distinct types of promoters. Our tagging methods allow quantitative analysis of promoter usage in different tissues and show that differentially regulated alternative TSSs are a common feature in protein-coding genes and commonly generate alternative N termini. Among the TSSs, we identified new start sites associated with the majority of exons and with 3' UTRs. These data permit genome-scale identification of tissue-specific promoters and analysis of the cis-acting elements associated with them.
Resumo:
Full-length genome sequences of five virulent and five avirulent strains of Newcastle disease virus isolated between 1998 and 2002 in Victoria and New South Wales, Australia were determined. Comparisons between these strains revealed that coding sequence variability in the haemagglutinin-neuraminidase (HN), matrix (M) and phosphoprotein (P) gene sequences appeared to be more variable than in the fusion (F), nucleocapsid (N) and RNA dependent-RNA replicase (L) genes. Sequence analysis of a number of other isolates made during the recent virulent NDV outbreaks, also identified the presence of a number of variants with altered F gene cleavage sites, which resulted in altered biological properties of those viruses. Quasispecies analysis of a number of field isolates indicated the presence of virulent virus in one particular isolate. Gene sequence analysis of the progenitor virus isolated in 1998 showed very little sequence variation when compared to that of a progenitor-like virus isolated in 2001 demonstrating that in the field. viral genome sequence variation appears to be biologically restricted to that of a consensus sequence. (c) 2005 Elsevier B.V. All rights reserved.
Resumo:
The southern cattle tick, Boophilus microplus (Canestrini), causes annual economic losses in the hundreds of millions of dollars to cattle producers throughout the world, and ranks as the most economically important tick from a global perspective. Control failures attributable to the development of pesticide resistance have become commonplace, and novel control technologies are needed. The availability of the genome sequence will facilitate the development of these new technologies, and we are proposing sequencing to a 4-6X draft coverage. Many existing biological resources are available to facilitate a genome sequencing project, including several inbred laboratory tick strains, a database of approximate to 45,000 expressed sequence tags compiled into a B. microplus Gene Index, a bacterial artificial chromosome (BAC) library, an established B. microplus cell line, and genomic DNA suitable for library synthesis. Collaborative projects are underway to map BACs and cDNAs to specific chromosomes and to sequence selected BAC clones. When completed, the genome sequences from the cow, B. microphis, and the B. microphis-borne pathogens Babesia bovis and Anaplasma marginale will enhance studies of host-vector-pathogen systems. Genes involved in the regeneration of amputated tick limbs and transitions through developmental stages are largely unknown. Studies of these and other interesting biological questions will be advanced by tick genome sequence data. Comparative genomics offers the prospect of new insight into many, perhaps all, aspects of the biology of ticks and the pathogens they transmit to farm animals and people. The B. microplus genome sequence will fill a major gap in comparative genomics: a sequence from the Metastriata lineage of ticks. The purpose of the article is to synergize interest in and provide rationales for sequencing the genome of B. microplus and for publicizing currently available genomic resources for this tick.
Resumo:
Sequence specificity of antibodies to UV-damaged DNA has not been described previously. The antisera investigated here were specific for UV-modified DNA and were absolutely dependent upon the presence of thymine residues. Using a series of oligonucleotides in competition ELISA, increased inhibition was observed with increasing chain length of UV-polythymidylate. A minimum of three adjacent thymines was required for effective inhibition; alone, dimers of thymine were poor antigens. Although UV-irradiated poly(dC) was not antigenic, cytosines could partially replace thymines within the smallest effective epitope (T-T-T) with a high degree of sequence specificity, not previously described. The main epitope induced by UV was formed from adjacent thymines and either a 3' or a 5' pyrimidine.
Resumo:
Affinity purification of plasmid DNA is an attractive option for the biomanufacture of therapeutic plasmids, which are strictly controlled for levels of host protein, DNA, RNA, and endotoxin. Plasmid vectors are considered to be a safer alternative than viruses for gene therapy, but milligram quantities of DNA are required per dose. Previous affinity approaches have involved triplex DNA formation and a sequence-specific zinc finger protein. We present a more generically applicable protein-based approach, which exploits the lac operator, present in a wide diversity of plasmids, as a target sequence. We used a GFP/His-tagged Lacl protein, which is precomplexed with the plasmid, and the resulting complex was immobilized on a solid support (TALON resin). Ensuing elution gives plasmid DNA, in good yield (>80% based on recovered starting material, 35-50% overall process), free from detectable RNA and protein and with minimal genomic DNA contamination. Such an affinity-based process should enhance plasmid purity and ultimately, after appropriate development, may simplify the biomanufacturing process of therapeutic plasmids.
Resumo:
Bacteriophage T7 DNA primase recognizes 5'-GTC-3' in single-stranded DNA. The primase contains a single Cys4 zinc-binding motif that is essential for recognition. Biochemical and mutagenic analyses suggest that the Cys4 motif contacts cytosine of 5'-GTC-3' and may also contribute to thymine recognition. Residues His33 and Asp31 are critical for these interactions. Biochemical analysis also reveals that T7 primase selectively binds CTP in the absence of DNA. We propose that bound CTP selects the remaining base G, of 5'-GTC-3', by base pairing. Our deduced mechanism for recognition of ssDNA by Cys4 motifs bears little resemblance to the recognition of trinucleotides of double-stranded DNA by Cys2His2 zinc fingers.
Resumo:
The mammalian high mobility group protein AT-hook 2 (HMGA2) is a small transcriptional factor involved in cell development and oncogenesis. It contains three "AT-hook" DNA binding domains, which specifically recognize the minor groove of AT-rich DNA sequences. It also has an acidic C-terminal motif. Previous studies showed that HMGA2 mediates all its biological effects through interactions with AT-rich DNA sequences in the promoter regions. In this dissertation, I used a variety of biochemical and biophysical methods to examine the physical properties of HMGA2 and to further investigate HMGA2's interactions with AT-rich DNA sequences. The following are three avenues perused in this study: (1) due to the asymmetrical charge distribution of HMGA2, I have developed a rapid procedure to purify HMGA2 in the milligram range. Preparation of large amounts of HMGA2 makes biophysical studies possible; (2) Since HMGA2 binds to different AT-rich sequences in the promoter regions, I used a combination of isothermal titration calorimetry (ITC) and DNA UV melting experiment to characterize interactions of HMGA2 with poly(dA-dT) 2 and poly(dA)poly(dT). My results demonstrated that (i) each HMGA2 molecule binds to 15 AT bp; (ii) HMGA2 binds to both AT DNAs with very high affinity. However, the binding reaction of HMGA2 to poly(dA-dT) 2 is enthalpy-driven and the binding reaction of HMGA2 with poly(dA)poly(dT) is entropy-driven; (iii) the binding reactions are strongly depended on salt concentrations; (3) Previous studies showed that HMGA2 may have sequence specificity. In this study, I used a PCR-based SELEX procedure to examine the DNA binding specificity of HMGA2. Two consensus sequences for HMGA2 have been identified: 5'-ATATTCGCGAWWATT-3' and 5'-ATATTGCGCAWWATT-3', where W represents A or T. These consensus sequences have a unique feature: the first five base pairs are AT-rich, the middle four to five base pairs are GC-rich, and the last five to six base pairs are AT-rich. All three segments are critical for high affinity binding. Replacing either one of the AT-rich sequences to a non-AT-rich sequence causes at least 100-fold decrease in the binding affinity. Intriguingly, if the GC-segment is substituted by an AT-rich segment, the binding affinity of HMGA2 is reduced approximately 5-fold. Identification of the consensus sequences for HMGA2 represents an important step towards finding its binding sites within the genome.