888 resultados para Genome Sequence
Resumo:
Mango is an important industry for Queensland, Australia, with an annual value exceeding $80 million. The Kensington Pride cultivar, prized by consumers for desirable taste and colour characteristics, commands 60% of the domestic market though this market share has declined in recent years as new varieties, such as Calypso™, get established with consumers. In 2005, the Queensland Government's Department of Agriculture and Fisheries commenced the Mango Genomics Initiative. This project brought together multidisciplinary teams of breeders, pathologists, sensory scientists, flavour chemists and molecular biologists to develop a suite of tools and inter-related data sets to support the accelerated development of new commercial mango varieties. An overview of the Mango Genomics Initiative will be presented here culminating in the generation of a draft Kensington Pride mango genome sequence.
Resumo:
Genome sequence information has generated increasing evidence for the claim that repetitive DNA sequences present within and around genes could play a important role in the regulation of gene expression. Polypurine/polypyrimidine sequences [poly(Pu/Py)] have been observed in the vicinity of promoters and within the transcribed regions of many genes. To understand whether such sequences influence the level of gene expression, we constructed several prokaryotic and eukaryotic expression vectors incorporating poly(Pu/Py) repeats both within and upstream of a reporter gene, lacZ (encoding β-galactosidase), and studied its expression in vivo. We find that, in contrast to the situation in Escherichia coli, the presence of poly(Pu/Py) sequences within the gene does not significantly inhibit gene expression in mammalian cells. On the other hand, the presence of such sequences upstream of lacZ leads to a several-fold reduction of gene expression in mammalian cells. Similar down-regulation was observed when a structural cassette containing poly(Pu/Py) sequences upstream of lacZ was integrated into yeast chromosome V. Sequence analysis of the nine totally sequenced yeast chromosomes shows that a large number of such sequences occur upstream of ORFs. On the basis of our experimental results and DNA sequence analysis, we propose that these sequences can function as cis-acting transcriptional regulators.
Resumo:
Background: Trypanosoma evansi infections, commonly called 'surra', cause significant economic losses to livestock industry. While this infection is mainly restricted to large animals such as camels, donkeys and equines, recent reports indicate their ability to infect humans. There are no World Animal Health Organization (WAHO) prescribed diagnostic tests or vaccines available against this disease and the available drugs show significant toxicity. There is an urgent need to develop improved methods of diagnosis and control measures for this disease. Unlike its related human parasites T. brucei and T. cruzi whose genomes have been fully sequenced T. evansi genome sequence remains unavailable and very little efforts are being made to develop improved methods of prevention, diagnosis and treatment. With a view to identify potential diagnostic markers and drug targets we have studied the clinical proteome of T. evansi infection using mass spectrometry (MS).Methodology/Principal Findings: Using shot-gun proteomic approach involving nano-lc Quadrupole Time Of Flight (QTOF) mass spectrometry we have identified over 160 proteins expressed by T. evansi in mice infected with camel isolate. Homology driven searches for protein identification from MS/MS data led to most of the matches arising from related Trypanosoma species. Proteins identified belonged to various functional categories including metabolic enzymes; DNA metabolism; transcription; translation as well as cell-cell communication and signal transduction. TCA cycle enzymes were strikingly missing, possibly suggesting their low abundances. The clinical proteome revealed the presence of known and potential drug targets such as oligopeptidases, kinases, cysteine proteases and more.Conclusions/Significance: Previous proteomic studies on Trypanosomal infections, including human parasites T. brucei and T. cruzi, have been carried out from lab grown cultures. For T. evansi infection this is indeed the first ever proteomic study reported thus far. In addition to providing a glimpse into the biology of this neglected disease, our study is the first step towards identification of diagnostic biomarkers, novel drug targets as well as potential vaccine candidates to fight against T. evansi infections.
Resumo:
Lactobacillus rhamnosus GG is a probiotic bacterium that is known worldwide. Since its discovery in 1985, the health effects and biology of this health-promoting strain have been researched at an increasing rate. However, knowledge of the molecular biology responsible for these health effects is limited, even though research in this area has continued to grow since the publication of the whole genome sequence of L. rhamnosus GG in 2009. In this thesis, the molecular biology of L. rhamnosus GG was explored by mapping the changes in protein levels in response to diverse stress factors and environmental conditions. The proteomics data were supplemented with transcriptome level mapping of gene expression. The harsh conditions of the gastro-intestinal tract, which involve acidic conditions and detergent-like bile acids, are a notable challenge to the survival of probiotic bacteria. To simulate these conditions, L. rhamnosus GG was exposed to a sudden bile stress, and several stress response mechanisms were revealed, among others various changes in the cell envelope properties. L. rhamnosus GG also responded in various ways to mild acid stress, which probiotic bacteria may face in dairy fermentations and product formulations. The acid stress response of L. rhamnosus GG included changes in central metabolism and specific responses related to the control of intracellular pH. Altogether, L. rhamnosus GG was shown to possess a large repertoire of mechanisms for responding to stress conditions, which is a beneficial character of a probiotic organism. Adaptation to different growth conditions was studied by comparing the proteome level responses of L. rhamnosus GG to divergent growth media and to different phases of growth. Comparing different growth phases revealed that the metabolism of L. rhamnosus GG is modified markedly during shift from the exponential to the stationary phase of growth. These changes were seen both at proteome and transcriptome levels and in various different cellular functions. When the growth of L. rhamnosus GG in a rich laboratory medium and in an industrial whey-based medium was compared, various differences in metabolism and in factors affecting the cell surface properties could be seen. These results led us to recommend that the industrial-type media should be used in laboratory studies of L. rhamnosus GG and other probiotic bacteria to achieve a similar physiological state for the bacteria as that found in industrial products, which would thus yield more relevant information about the bacteria. In addition, an interesting phenomenon of protein phosphorylation was observed in L. rhamnosus GG. Phosphorylation of several proteins of L. rhamnosus GG was detected, and there were hints that the degree of phosphorylation may be dependent on the growth pH.
Resumo:
Triplex forming oligonucleotides (TFOs) have the potential to modulate gene expression. While most of the experiments are directed towards triplex mediated inhibition of gene expression the strategy potentially could be used for gene specific activation. In an attempt to design a strategy for gene specific activation in vivo applicable to a large number of genes we have designed a TFO based activator-target system which may be utilized in Saccharomyces cerevisiae or any other system where Gal4 protein is ectopically expressed. The total genome sequence of Saccharomyces cerevisiae and expression profiles were used to select the target genes with upstream poly (pu/py) sequences. We have utilized the paradigm of Gal4 protein and its binding site. We describe here the selection of target genes and design of hairpin-TFO including the targeting sequences containing polypurine stretch found in the upstream promoter regions of weakly expressed genes. We demonstrate, the formation of hairpin-TFO, its binding to Gal4 protein, its ability to form triplex with the target duplex in vitro, the effect of polyethylenimine on complex formation and discuss the implication on in vivo transcription activation.
Resumo:
The ability to metabolize aromatic beta-glucosides such as salicin and arbutin varies among members of the Enterobacteriaceae. The ability of Escherichia coli to degrade salicin and arbutin appears to be cryptic, subject to activation of the bgl genes, whereas many members of the Klebsiella genus can metabolize these sugars. We have examined the genetic basis for beta-glucoside utilization in Klebsiella aerogenes. The Klebsiella equivalents of bglG, bglB and bglR have been cloned using the genome sequence database of Klebsiella pneumoniae. Nucleotide sequencing shows that the K. aerogenes bgl genes show substantial similarities to the E. coli counterparts. The K. aerogenes bgl genes in multiple copies can also complement E. coli mutants deficient in bglG encoding the antiterminator and bglB encoding the phospho-beta-glucosidase, suggesting that they are functional homologues. The regulatory region bglR of K aerogenes shows a high degree of similarity of the sequences involved in BglG-mediated regulation. Interestingly, the regions corresponding to the negative elements present in the E. coli regulatory region show substantial divergence in K aerogenes. The possible evolutionary implications of the results are discussed. (C) 2003 Federation of European Microbiological Societies. Published by Elsevier Science B.v. All rights reserved.
Resumo:
About a third of the human population is estimated to be infected with Mycobacterium tuberculosis. The bacterium displays an excellent adaptability to survive within the host macrophages. As the reactive environment of macrophages is capable of inducing DNA damage, the ability of the pathogen to safeguard its DNA against the damage is of paramount significance for its survival within the host. Analysis of the genome sequence has provided important insights into the DNA repair machinery of the pathogen, and the studies on DNA repair in mycobacteria have gained momentum in the past few years. The studies have revealed considerable differences in the mycobacterial DNA repair machinery when compared with those of the other bacteria. This review article focuses especially on the aspects of base excision, and nucleotide excision repair pathways in mycobacteria. (C) 2011 Elsevier Ltd. All rights reserved.
Resumo:
Of the similar to 4000 ORFs identified through the genome sequence of Mycobacterium tuberculosis (TB) H37Rv, experimentally determined structures are available for 312. Since knowledge of protein structures is essential to obtain a high-resolution understanding of the underlying biology, we seek to obtain a structural annotation for the genome, using computational methods. Structural models were obtained and validated for similar to 2877 ORFs, covering similar to 70% of the genome. Functional annotation of each protein was based on fold-based functional assignments and a novel binding site based ligand association. New algorithms for binding site detection and genome scale binding site comparison at the structural level, recently reported from the laboratory, were utilized. Besides these, the annotation covers detection of various sequence and sub-structural motifs and quaternary structure predictions based on the corresponding templates. The study provides an opportunity to obtain a global perspective of the fold distribution in the genome. The annotation indicates that cellular metabolism can be achieved with only 219 folds. New insights about the folds that predominate in the genome, as well as the fold-combinations that make up multi-domain proteins are also obtained. 1728 binding pockets have been associated with ligands through binding site identification and sub-structure similarity analyses. The resource (http://proline.physics.iisc.ernet.in/Tbstructuralannotation), being one of the first to be based on structure-derived functional annotations at a genome scale, is expected to be useful for better understanding of TB and for application in drug discovery. The reported annotation pipeline is fairly generic and can be applied to other genomes as well.
Resumo:
About a third of the human population is estimated to be infected with Mycobacterium tuberculosis. Emergence of drug resistant strains and the protracted treatment strategies have compelled the scientific community to identify newer drug targets, and to develop newer vaccines. In the host macrophages, the bacterium survives within an environment rich in reactive nitrogen and oxygen species capable of damaging its genome. Therefore, for its successful persistence in the host, the pathogen must need robust DNA repair mechanisms. Analysis of M. tuberculosis genome sequence revealed that it lacks mismatch repair pathway suggesting a greater role for other DNA repair pathways such as the nucleotide excision repair, and base excision repair pathways. In this article, we summarize the outcome of research involving these two repair pathways in mycobacteria focusing primarily on our own efforts. Our findings, using Mycobacterium smegmatis model, suggest that deficiency of various DNA repair functions in single or in combinations severely compromises their DNA repair capacity and attenuates their growth under conditions typically encountered in macrophages. (C) 2011 Elsevier Ireland Ltd. All rights reserved.
Resumo:
A decade since the availability of Mycobacterium tuberculosis (Mtb) genome sequence, no promising drug has seen the light of the day. This not only indicates the challenges in discovering new drugs but also suggests a gap in our current understanding of Mtb biology. We attempt to bridge this gap by carrying out extensive re-annotation and constructing a systems level protein interaction map of Mtb with an objective of finding novel drug target candidates. Towards this, we synergized crowd sourcing and social networking methods through an initiative `Connect to Decode' (C2D) to generate the first and largest manually curated interactome of Mtb termed `3interactome pathway' (IPW), encompassing a total of 1434 proteins connected through 2575 functional relationships. Interactions leading to gene regulation, signal transduction, metabolism, structural complex formation have been catalogued. In the process, we have functionally annotated 87% of the Mtb genome in context of gene products. We further combine IPW with STRING based network to report central proteins, which may be assessed as potential drug targets for development of drugs with least possible side effects. The fact that five of the 17 predicted drug targets are already experimentally validated either genetically or biochemically lends credence to our unique approach.
Resumo:
Luteal insufficiency affects fertility and hence study of mechanisms that regulate corpus luteum (CL) function is of prime importance to overcome infertility problems. Exploration of human genome sequence has helped to study the frequency of single nucleotide polymorphisms (SNPs). Clinical benefits of screening SNPs in infertility are being recognized well in recent times. Examining SNPs in genes associated with maintenance and regression of CL may help to understand unexplained luteal insufficiency and related infertility. Publicly available microarray gene expression databases reveal the global gene expression patterns in primate CL during the different functional state. We intend to explore computationally the deleterious SNPs of human genes reported to be common targets of luteolysin and luteotropin in primate CL Different computational algorithms were used to dissect out the functional significance of SNPs in the luteinizing hormone sensitive genes. The results raise the possibility that screening for SNPs might be integrated to evaluate luteal insufficiency associated with human female infertility for future studies. (C) 2012 Elsevier B.V. All rights reserved,
Resumo:
Host cell remodelling is a hallmark of malaria pathogenesis. It involves protein folding, unfolding and trafficking events and thus participation of chaperones such as Hsp70s and Hsp40s is well speculated. Until recently, only Hsp40s were thought to be the sole representative of the parasite chaperones in the exportome. However, based on the re-annotated Plasmodium falciparum genome sequence, a putative candidate for exported Hsp70 has been reported, which otherwise was known to be a pseudogene. We raised a specific antiserum against a C-terminal peptide uniquely present in PfHsp70-x. Immunoblotting and immunofluorescence-based approaches in combination with sub-cellular fractionation by saponin and streptolysin-O have been taken to determine the expression and localization of PfHsp70-x in infected erythrocyte. The re-annotated sequence of PfHsp70-x reveals it to be a functional protein with an endoplasmic reticulum signal peptide. It gets maximally expressed at the schizont stage of intra-erythrocytic life cycle. Majority of the protein localizes to the parasitophorous vacuole and some of it gets exported to the erythrocyte compartment where it associates with Maurer's clefts. The identification of an exported parasite Hsp70 chaperone presents us with the fact that the parasite has evolved customized chaperones which might be playing crucial roles in aspects of trafficking and host cell remodelling.
Resumo:
Genomic sequences are far from being random but are made up of systematically ordered and information rich patterns. These repeated sequence patterns have been vastly utilized for their fundamental importance in understanding the genome function and organization. To this end, a comprehensive toolkit, RepEx, has been developed which extracts repeat (inverted, everted and mirror) patterns from the given genome sequence(s) without any constraints. The toolkit can also be used to fetch the inverted repeats present in the protein sequence (s). Further, it is capable of extracting exact and degenerate repeats with a user defined spacer intervals. It is remarkably more precise and sensitive when compared to the existing tools. An example with comprehensive case studies and a performance evaluation of the proposed toolkit has been presented to authenticate its efficiency and accuracy. (C) 2013 Elsevier Inc. All rights reserved.
Resumo:
Background: mIHF belongs to a subfamily of proteins, distinct from E. coli IHF. Results: Functionally important amino acids of mIHF and the mechanism(s) underlying DNA binding, DNA bending, and site-specific recombination are distinct from that of E. coli IHF. Conclusion: mIHF functions could contribute beyond nucleoid compaction. Significance: Because mIHF is essential for growth, the molecular mechanisms identified here can be exploited in drug screening efforts. The annotated whole-genome sequence of Mycobacterium tuberculosis revealed that Rv1388 (Mtihf) is likely to encode for a putative 20-kDa integration host factor (mIHF). However, very little is known about the functional properties of mIHF or the organization of the mycobacterial nucleoid. Molecular modeling of the mIHF three-dimensional structure, based on the cocrystal structure of Streptomyces coelicolor IHF duplex DNA, a bona fide relative of mIHF, revealed the presence of Arg-170, Arg-171, and Arg-173, which might be involved in DNA binding, and a conserved proline (Pro-150) in the tight turn. The phenotypic sensitivity of Escherichia coli ihfA and ihfB strains to UV and methyl methanesulfonate could be complemented with the wild-type Mtihf but not its alleles bearing mutations in the DNA-binding residues. Protein-DNA interaction assays revealed that wild-type mIHF, but not its DNA-binding variants, binds with high affinity to fragments containing attB and attP sites and curved DNA. Strikingly, the functionally important amino acid residues of mIHF and the mechanism(s) underlying its binding to DNA, DNA bending, and site-specific recombination are fundamentally different from that of E. coli IHF. Furthermore, we reveal novel insights into IHF-mediated DNA compaction depending on the placement of its preferred binding sites; mIHF promotes DNA compaction into nucleoid-like or higher order filamentous structures. We therefore propose that mIHF is a distinct member of a subfamily of proteins that serve as essential cofactors in site-specific recombination and nucleoid organization and that these findings represent a significant advance in our understanding of the role(s) of nucleoid-associated proteins.
Resumo:
The availability of the genome sequence of Mycobacterium tuberculosis H37Rv has encouraged determination of large numbers of protein structures and detailed definition of the biological information encoded therein; yet, the functions of many proteins in M. tuberculosis remain unknown. The emergence of multidrug resistant strains makes it a priority to exploit recent advances in homology recognition and structure prediction to re-analyse its gene products. Here we report the structural and functional characterization of gene products encoded in the M. tuberculosis genome, with the help of sensitive profile-based remote homology search and fold recognition algorithms resulting in an enhanced annotation of the proteome where 95% of the M. tuberculosis proteins were identified wholly or partly with information on structure or function. New information includes association of 244 proteins with 205 domain families and a separate set of new association of folds to 64 proteins. Extending structural information across uncharacterized protein families represented in the M. tuberculosis proteome, by determining superfamily relationships between families of known and unknown structures, has contributed to an enhancement in the knowledge of structural content. In retrospect, such superfamily relationships have facilitated recognition of probable structure and/or function for several uncharacterized protein families, eventually aiding recognition of probable functions for homologous proteins corresponding to such families. Gene products unique to mycobacteria for which no functions could be identified are 183. Of these 18 were determined to be M. tuberculosis specific. Such pathogen-specific proteins are speculated to harbour virulence factors required for pathogenesis. A re-annotated proteome of M. tuberculosis, with greater completeness of annotated proteins and domain assigned regions, provides a valuable basis for experimental endeavours designed to obtain a better understanding of pathogenesis and to accelerate the process of drug target discovery. (C) 2014 Elsevier Ltd. All rights reserved.