39 resultados para Whole Genome Sequences


Relevância:

80.00% 80.00%

Publicador:

Resumo:

Motivation: The number of bacterial genomes being sequenced is increasing very rapidly and hence, it is crucial to have procedures for rapid and reliable annotation of their functional elements such as promoter regions, which control the expression of each gene or each transcription unit of the genome. The present work addresses this requirement and presents a generic method applicable across organisms. Results: Relative stability of the DNA double helical sequences has been used to discriminate promoter regions from non-promoter regions. Based on the difference in stability between neighboring regions, an algorithm has been implemented to predict promoter regions on a large scale over 913 microbial genome sequences. The average free energy values for the promoter regions as well as their downstream regions are found to differ, depending on their GC content. Threshold values to identify promoter regions have been derived using sequences flanking a subset of translation start sites from all microbial genomes and then used to predict promoters over the complete genome sequences. An average recall value of 72% (which indicates the percentage of protein and RNA coding genes with predicted promoter regions assigned to them) and precision of 56% is achieved over the 913 microbial genome dataset.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Molecular understanding of disease processes can be accelerated if all interactions between the host and pathogen are known. The unavailability of experimental methods for large-scale detection of interactions across host and pathogen organisms hinders this process. Here we apply a simple method to predict protein-protein interactions across a host and pathogen organisms. We use homology detection approaches against the protein-protein interaction databases. DIP and iPfam in order to predict interacting proteins in a host-pathogen pair. In the present work, we first applied this approach to the test cases involving the pairs phage T4 - Escherichia coli and phage lambda - E. coli and show that previously known interactions could be recognized using our approach. We further apply this approach to predict interactions between human and three pathogens E. coli, Salmonella enterica typhimurium and Yersinia pestis. We identified several novel interactions involving proteins of host or pathogen that could be thought of as highly relevant to the disease process. Serendipitously, many interactions involve hypothetical proteins of yet unknown function. Hypothetical proteins are predicted from computational analysis of genome sequences with no laboratory analysis on their functions yet available. The predicted interactions involving such proteins could provide hints to their functions. (C) 2011 Elsevier B.V. All rights reserved.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Background: Insulin like growth factor binding proteins modulate the mitogenic and pro survival effects of IGF. Elevated expression of IGFBP2 is associated with progression of tumors that include prostate, ovarian, glioma among others. Though implicated in the progression of breast cancer, the molecular mechanisms involved in IGFBP2 actions are not well defined. This study investigates the molecular targets and biological pathways targeted by IGFBP2 in breast cancer. Methods: Transcriptome analysis of breast tumor cells (BT474) with stable knockdown of IGFBP2 and breast tumors having differential expression of IGFBP2 by immunohistochemistry was performed using microarray. Differential gene expression was established using R-Bioconductor package. For validation, gene expression was determined by qPCR. Inhibitors of IGF1R and integrin pathway were utilized to study the mechanism of regulation of beta-catenin. Immunohistochemical and immunocytochemical staining was performed on breast tumors and experimental cells, respectively for beta-catenin and IGFBP2 expression. Results: Knockdown of IGFBP2 resulted in differential expression of 2067 up regulated and 2002 down regulated genes in breast cancer cells. Down regulated genes principally belong to cell cycle, DNA replication, repair, p53 signaling, oxidative phosphorylation, Wnt signaling. Whole genome expression analysis of breast tumors with or without IGFBP2 expression indicated changes in genes belonging to Focal adhesion, Map kinase and Wnt signaling pathways. Interestingly, IGFBP2 knockdown clones showed reduced expression of beta-catenin compared to control cells which was restored upon IGFBP2 re-expression. The regulation of beta-catenin by IGFBP2 was found to be IGF1R and integrin pathway dependent. Furthermore, IGFBP2 and beta-catenin are co-ordinately overexpressed in breast tumors and correlate with lymph node metastasis. Conclusion: This study highlights regulation of beta-catenin by IGFBP2 in breast cancer cells and most importantly, combined expression of IGFBP2 and beta-catenin is associated with lymph node metastasis of breast tumors.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Staphylococcus aureus is a commensal gram positive bacteria which causes severe and non severe infections in humans and livestock. In India, ST772 is a dominant and ST672 is an emerging clone of Staphylococcus aureus. Both cause serious human diseases, and carry type V SCCmec elements. The objective of this study was to characterize SCCmec type V elements of ST772 and ST672 because the usual PCR methods did not amplify all primers specific to the type. Whole genome sequencing analysis of seven ST772 and one ST672 S. aureus isolates revealed that the SCCmec elements of six of the ST772 isolates were the smallest of the extant type V elements and in addition have several other novel features. Only one ST772 isolate and the ST672 isolate carried bigger SCCmec cassettes which were composites carrying multiple ccrC genes. These cassettes had some similarities to type V SCCmec element from M013 isolate (ST59) from Taiwan in certain aspects. SCCmec elements of all Indian isolates had an inversion of the mec complex, similar to the bovine SCCmec type X. This study reveals that six out of seven ST772 S. aureus isolates have a novel type V (5C2) SCCmec element while one each of ST772 and ST672 isolates have a composite SCCmec type V element (5C2&5) formed by the integration of type V SCCmec into a MSSA carrying a SCC element, in addition to the mec gene complex inversions and extensive recombinations.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

The TSC2 gene, mutated in patients with tuberous sclerosis complex (TSC), encodes a 200 kDa protein TSC2 (tuberin). The importance of TSC2 in the regulation of cell growth and proliferation is irrefutable. TSC2 in complex with TSC1 negatively regulates the mTOR complex 1 (mTORC1) via RHEB in the PI3K-AKT-mTOR pathway and in turn regulates cell proliferation. It shows nuclear as well as cytoplasmic localization. However, its nuclear function remains elusive. In order to identify the nuclear function of TSC2, a whole-genome expression profiling of TSC2 overexpressing cells was performed, and the results showed differential regulation of 266 genes. Interestingly, transcription was found to be the most populated functional category. EREG (Epiregulin), a member of the epidermal growth factor family, was found to be the most downregulated gene in the microarray analysis. Previous reports have documented elevated levels of EREG in TSC lesions, making its regulatory aspects intriguing. Using the luciferase reporter, ChIP and EMSA techniques, we show that TSC2 binds to the EREG promoter between -352 bp and -303 bp and negatively regulates its expression. This is the first evidence for the role of TSC2 as a transcription factor and of TSC2 binding to the promoter of any gene.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Background: mIHF belongs to a subfamily of proteins, distinct from E. coli IHF. Results: Functionally important amino acids of mIHF and the mechanism(s) underlying DNA binding, DNA bending, and site-specific recombination are distinct from that of E. coli IHF. Conclusion: mIHF functions could contribute beyond nucleoid compaction. Significance: Because mIHF is essential for growth, the molecular mechanisms identified here can be exploited in drug screening efforts. The annotated whole-genome sequence of Mycobacterium tuberculosis revealed that Rv1388 (Mtihf) is likely to encode for a putative 20-kDa integration host factor (mIHF). However, very little is known about the functional properties of mIHF or the organization of the mycobacterial nucleoid. Molecular modeling of the mIHF three-dimensional structure, based on the cocrystal structure of Streptomyces coelicolor IHF duplex DNA, a bona fide relative of mIHF, revealed the presence of Arg-170, Arg-171, and Arg-173, which might be involved in DNA binding, and a conserved proline (Pro-150) in the tight turn. The phenotypic sensitivity of Escherichia coli ihfA and ihfB strains to UV and methyl methanesulfonate could be complemented with the wild-type Mtihf but not its alleles bearing mutations in the DNA-binding residues. Protein-DNA interaction assays revealed that wild-type mIHF, but not its DNA-binding variants, binds with high affinity to fragments containing attB and attP sites and curved DNA. Strikingly, the functionally important amino acid residues of mIHF and the mechanism(s) underlying its binding to DNA, DNA bending, and site-specific recombination are fundamentally different from that of E. coli IHF. Furthermore, we reveal novel insights into IHF-mediated DNA compaction depending on the placement of its preferred binding sites; mIHF promotes DNA compaction into nucleoid-like or higher order filamentous structures. We therefore propose that mIHF is a distinct member of a subfamily of proteins that serve as essential cofactors in site-specific recombination and nucleoid organization and that these findings represent a significant advance in our understanding of the role(s) of nucleoid-associated proteins.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Background: The heterotrimeric M. tuberculosis RecBCD complex, or each of its individual subunits, remains uncharacterized. Results: MtRecD exists as a homodimer in solution, catalyzes ssDNA-dependent ATP hydrolysis, unwinding of DNA replication/recombination intermediates, and interacts with RecA. Conclusion: MtRecD possesses strong 5 3- and weak 3 5-helicase activities. Significance: These findings provide insights into the mechanism underlying DSB repair and homologous recombination in mycobacteria. The annotated whole-genome sequence of Mycobacterium tuberculosis revealed the presence of a putative recD gene; however, the biochemical characteristics of its encoded protein product (MtRecD) remain largely unknown. Here, we show that MtRecD exists in solution as a stable homodimer. Protein-DNA binding assays revealed that MtRecD binds efficiently to single-stranded DNA and linear duplexes containing 5 overhangs relative to the 3 overhangs but not to blunt-ended duplex. Furthermore, MtRecD bound more robustly to a variety of Y-shaped DNA structures having 18-nucleotide overhangs but not to a similar substrate containing 5-nucleotide overhangs. MtRecD formed more salt-tolerant complexes with Y-shaped structures compared with linear duplex having 3 overhangs. The intrinsic ATPase activity of MtRecD was stimulated by single-stranded DNA. Site-specific mutagenesis of Lys-179 in motif I abolished the ATPase activity of MtRecD. Interestingly, although MtRecD-catalyzed unwinding showed a markedly higher preference for duplex substrates with 5 overhangs, it could also catalyze significant unwinding of substrates containing 3 overhangs. These results support the notion that MtRecD is a bipolar helicase with strong 5 3 and weak 3 5 unwinding activities. The extent of unwinding of Y-shaped DNA structures was approximate to 3-fold lower compared with duplexes with 5 overhangs. Notably, direct interaction between MtRecD and its cognate RecA led to inhibition of DNA strand exchange promoted by RecA. Altogether, these studies provide the first detailed characterization of MtRecD and present important insights into the type of DNA structure the enzyme is likely to act upon during the processes of DNA repair or homologous recombination.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

The annotated whole-genome sequence of Mycobacterium tuberculosis indicated that Rv1388 (Mtihf) likely encodes a putative 20 kDa integration host factor (mIHF). However, very little is known about the functional properties of mIHF or organization of mycobacterial nucleoid. Molecular modeling of the mIHF three-dimensional structure, based on the cocrystal structure of Streptomyces coelicolor IHF-duplex DNA, a bona fide relative of mIHF, revealed the presence of Arg170, Arg171, and Arg173, which might be involved in DNA binding, and a conserved proline (P150) in the tight turn. The phenotypic sensitivity of Escherichia coli Delta ihfA and Delta ihfB strains to UV and methylmethanesulfonate could be complemented with the wild-type Mtihf, but not its alleles bearing mutations in the DNA-binding residues. Protein DNA interaction assays revealed that wild-type mIHF, but not its DNA-binding variants, bind with high affinity to fragments containing attB and attP sites and curved DNA. Strikingly, the functionally important amino acid residues of mIHF and the mechanism(s) underlying its binding to DNA, DNA bending, and site-specific recombination are fundamentally different from that of E. coli IHF alpha beta. Furthermore, we reveal novel insights into IHF-mediated DNA compaction depending on the placement of its preferred binding sites; mIHF promotes compaction of DNA into nucleoid-like or higher-order filamentous structures. We hence propose that mIHF is a distinct member of a subfamily of proteins that serve as essential cofactors in site-specific recombination and nucleoid organization and that these findings represent a significant advance in our understanding of the role(s) of nucleoid-associated proteins.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Many bacterial transcription factors do not behave as per the textbook operon model. We draw on whole genome work, as well as reported diversity across different bacteria, to argue that transcription factors may have evolved from nucleoid-associated proteins. This view would explain a large amount of recent data gleaned from high-throughput sequencing and bioinformatic analyses.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

The current explosion of DNA sequence information has generated increasing evidence for the claim that noncoding repetitive DNA sequences present within and around different genes could play an important role in genetic control processes, although the precise role and mechanism by which these sequences function are poorly understood. Several of the simple repetitive sequences which occur in a large number of loci throughout the human and other eukaryotic genomes satisfy the sequence criteria for forming non-B DNA structures in vitro. We have summarized some of the features of three different types of simple repeats that highlight the importance of repetitive DNA in the control of gene expression and chromatin organization. (i) (TG/CA)n repeats are widespread and conserved in many loci. These sequences are associated with nucleosomes of varying linker length and may play a role in chromatin organization. These Z-potential sequences can help absorb superhelical stress during transcription and aid in recombination. (ii) Human telomeric repeat (TTAGGG)n adopts a novel quadruplex structure and exhibits unusual chromatin organization. This unusual structural motif could explain chromosome pairing and stability. (iii) Intragenic amplification of (CTG)n/(CAG)n trinucleotide repeat, which is now known to be associated with several genetic disorders, could down-regulate gene expression in vivo. The overall implications of these findings vis-à-vis repetitive sequences in the genome are summarized.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

A stretch of 71 nucleotides in a 1.2 kilobase pair Pst I fragment of rice DNA was identified as tRNA~ gene by hybridization and nucleotide sequence analyses. The hybridization of genomic DNA with the tRNA gene showed that there are about 10 glycine tRNA genes per diploid rice genome. The 3' and 5' internal control regions, where RNA polymerase III and transcription factors bind, were found to be present in the coding sequence. The gene was transcribed into a 4S product in an yeast cell-free extract. The substitution of 5' internal control region with analogous sequences from either M13mpl9 or M13mpl8 DNA did not affect the transcription of the gene in vitro. The changes in three highly conserved nucleotides in the consensus 5' internal control region (RGYNNARYGG; R = purine, Y = pyrimidine, N = any nucleotide) did not affect transcription showing that these nucleotides are not essential for promotion of transcription. There were two 16 base pair repeats, 'TGTTTGTTTCAGCTTA' at - 130 and - 375 positions upstream from the start of the gene. Deletion of 5' flanking sequences including the 16 base pair repeat at - 375 showed increased transcription indicating that these sequences negatively modulate the expression of the gene.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Genome sequence information has generated increasing evidence for the claim that repetitive DNA sequences present within and around genes could play a important role in the regulation of gene expression. Polypurine/polypyrimidine sequences [poly(Pu/Py)] have been observed in the vicinity of promoters and within the transcribed regions of many genes. To understand whether such sequences influence the level of gene expression, we constructed several prokaryotic and eukaryotic expression vectors incorporating poly(Pu/Py) repeats both within and upstream of a reporter gene, lacZ (encoding β-galactosidase), and studied its expression in vivo. We find that, in contrast to the situation in Escherichia coli, the presence of poly(Pu/Py) sequences within the gene does not significantly inhibit gene expression in mammalian cells. On the other hand, the presence of such sequences upstream of lacZ leads to a several-fold reduction of gene expression in mammalian cells. Similar down-regulation was observed when a structural cassette containing poly(Pu/Py) sequences upstream of lacZ was integrated into yeast chromosome V. Sequence analysis of the nine totally sequenced yeast chromosomes shows that a large number of such sequences occur upstream of ORFs. On the basis of our experimental results and DNA sequence analysis, we propose that these sequences can function as cis-acting transcriptional regulators.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The complete genome of the baker's yeast S. cerevisiae was analyzed for the presence of polypurine/polypyrimidine (poly[pu/py]) repeats and their occurrences were classified on the basis of their location within and outside open reading frames (ORFs). The analysis reveals that such sequence motifs are present abundantly both in coding as well as noncoding regions. Clear positional preferences are seen when these tracts occur in noncoding regions. These motifs appear to occur predominantly at a unit nucleosomal length both upstream and downstream of ORFs. Moreover, there is a biased distribution of polypurines in the coding strands when these motifs occur within open reading frames. The significance of the biased distribution is discussed with reference to the occurrence of these motifs in other known mRNA sequences and expressed sequence tags. A model for cis regulation of gene expression is proposed based on the ability of these motifs to form an intermolecular triple helix structure when present within the coding region and/or to modulate nucleosome positioning via enhanced histone affinity when present outside coding regions.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

DNA methyltransferases (MTases) are a group of enzymes that catalyze the methyl group transfer from S-adenosyl-L-methionine in a sequence-specific manner. Orthodox Type II DNA MTases usually recognize palindromic DNA sequences and add a methyl group to the target base (either adenine or cytosine) on both strands. However, there are a number of MTases that recognize asymmetric target sequences and differ in their subunit organization. In a bacterial cell, after each round of replication, the substrate for any MTase is hemimethylated DNA, and it therefore needs only a single methylation event to restore the fully methylated state. This is in consistent with the fact that most of the DNA MTases studied exist as monomers in solution. Multiple lines of evidence suggest that some DNA MTases function as dimers. Further, functional analysis of many restriction-modification systems showed the presence of more than one or fused MTase genes. It was proposed that presence of two MTases responsible for the recognition and methylation of asymmetric sequences would protect the nascent strands generated during DNA replication from cognate restriction endonuclease. In this review, MTases recognizing asymmetric sequences have been grouped into different subgroups based on their unique properties. Detailed characterization of these unusual MTases would help in better understanding of their specific biological roles and mechanisms of action. The rapid progress made by the genome sequencing of bacteria and archaea may accelerate the identification and study of species- and strain-specific MTases of host-adapted bacteria and their roles in pathogenic mechanisms.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

DNA methyltransferases (MTases) are a group of enzymes that catalyze the methyl group transfer from S-adenosyl-L-methionine in a sequence-specific manner. Orthodox Type II DNA MTases usually recognize palindromic DNA sequences and add a methyl group to the target base (either adenine or cytosine) on both strands. However, there are a number of MTases that recognize asymmetric target sequences and differ in their subunit organization. In a bacterial cell, after each round of replication, the substrate for any MTase is hemimethylated DNA, and it therefore needs only a single methylation event to restore the fully methylated state. This is in consistent with the fact that most of the DNA MTases studied exist as monomers in solution. Multiple lines of evidence suggest that some DNA MTases function as dimers. Further, functional analysis of many restriction-modification systems showed the presence of more than one or fused MTase genes. It was proposed that presence of two MTases responsible for the recognition and methylation of asymmetric sequences would protect the nascent strands generated during DNA replication from cognate restriction endonuclease. In this review, MTases recognizing asymmetric sequences have been grouped into different subgroups based on their unique properties. Detailed characterization of these unusual MTases would help in better understanding of their specific biological roles and mechanisms of action. The rapid progress made by the genome sequencing of bacteria and archaea may accelerate the identification and study of species- and strain-specific MTases of host-adapted bacteria and their roles in pathogenic mechanisms.