154 results for Gene annotation

at Indian Institute of Science - Bangalore - India


Relevance:

30.00% 30.00%

Publisher:

Abstract:

The rapid increase in genome sequence information has necessitated the annotation of their functional elements, particularly those occurring in the non-coding regions, in the genomic context. The promoter region is the key regulatory region, which enables the gene to be transcribed or repressed, but it is difficult to determine experimentally. Hence, in silico identification of promoters is crucial in order to guide experimental work and to pinpoint the key region that controls the transcription initiation of a gene. In this analysis, we demonstrate that while the promoter regions are in general less stable than the flanking regions, their average free energy varies depending on the GC composition of the flanking genomic sequence. We have therefore obtained a set of free energy threshold values for genomic DNA with varying GC content and used them as generic criteria for predicting promoter regions in several microbial genomes, using an in-house developed tool, 'PromPredict'. On applying it to predict promoter regions corresponding to the 1144 and 612 experimentally validated TSSs in E. coli (50.8% GC) and B. subtilis (43.5% GC), sensitivities of 99% and 95% and precision values of 58% and 60%, respectively, were achieved. For the limited data set of 81 TSSs available for M. tuberculosis (65.6% GC), a sensitivity of 100% and a precision of 49% were obtained.

Relevance:

30.00% 30.00%

Publisher:

Abstract:

Motivation: The number of bacterial genomes being sequenced is increasing very rapidly and hence, it is crucial to have procedures for rapid and reliable annotation of their functional elements such as promoter regions, which control the expression of each gene or each transcription unit of the genome. The present work addresses this requirement and presents a generic method applicable across organisms. Results: Relative stability of the DNA double helical sequences has been used to discriminate promoter regions from non-promoter regions. Based on the difference in stability between neighboring regions, an algorithm has been implemented to predict promoter regions on a large scale over 913 microbial genome sequences. The average free energy values for the promoter regions as well as their downstream regions are found to differ, depending on their GC content. Threshold values to identify promoter regions have been derived using sequences flanking a subset of translation start sites from all microbial genomes and then used to predict promoters over the complete genome sequences. An average recall value of 72% (which indicates the percentage of protein and RNA coding genes with predicted promoter regions assigned to them) and precision of 56% is achieved over the 913 microbial genome dataset.
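The idea of comparing the stability of neighboring regions can be sketched roughly as follows. This is a minimal illustration and not the authors' implementation: the dinucleotide free energy table, the function names, the window and gap sizes, and both cutoffs are assumptions chosen for the example, since the abstract does not give the parameters actually used.

```python
# Illustrative nearest-neighbour free energies (kcal/mol) per dinucleotide step.
# ASSUMPTION: these values are for demonstration only; the abstract does not
# state which energy parameters were used.
NN_DG = {
    "AA": -1.00, "TT": -1.00,
    "AT": -0.88,
    "TA": -0.58,
    "CA": -1.45, "TG": -1.45,
    "GT": -1.44, "AC": -1.44,
    "CT": -1.28, "AG": -1.28,
    "GA": -1.30, "TC": -1.30,
    "CG": -2.17,
    "GC": -2.24,
    "GG": -1.84, "CC": -1.84,
}


def average_free_energy(seq):
    """Mean free energy per dinucleotide step (input must contain only A/C/G/T)."""
    seq = seq.upper()
    steps = [seq[i:i + 2] for i in range(len(seq) - 1)]
    return sum(NN_DG[step] for step in steps) / len(steps)


def less_stable_regions(seq, window=50, gap=50,
                        energy_cutoff=-1.2, difference_cutoff=0.3):
    """Return start positions whose window is less stable than a downstream window.

    ASSUMPTION: window, gap and both cutoffs are hypothetical placeholders.
    A position is reported when its window has an average free energy above
    energy_cutoff (less negative, so less stable) and exceeds the downstream
    window's average by more than difference_cutoff.
    """
    hits = []
    last_start = len(seq) - (2 * window + gap)
    for start in range(last_start + 1):
        upstream = average_free_energy(seq[start:start + window])
        down_start = start + window + gap
        downstream = average_free_energy(seq[down_start:down_start + window])
        if upstream > energy_cutoff and upstream - downstream > difference_cutoff:
            hits.append(start)
    return hits
```

In practice the cutoffs would have to depend on the GC content of the surrounding sequence, as the abstract describes; the fixed defaults here only keep the sketch short.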

Relevance:

30.00% 30.00%

Publisher:

Abstract:

Background: The number of genome-wide association studies (GWAS) has increased rapidly in the past couple of years, resulting in the identification of genes associated with different diseases. The next step in translating these findings into biomedically useful information is to find out the mechanism of action of these genes. However, GWAS often implicate genes whose functions are currently unknown; for example, MYEOV, ANKLE1, TMEM45B and ORAOV1 are found to be associated with breast cancer, but their molecular function is unknown. Results: We carried out Bayesian inference of Gene Ontology (GO) term annotations of genes by employing the directed acyclic graph structure of GO and the network of protein-protein interactions (PPIs). The approach is based on the premise that two proteins that interact biophysically would be in physical proximity to each other, would possess complementary molecular functions, and would play roles in related biological processes. Predicted GO terms were ranked according to their relative association scores, and the approach was evaluated quantitatively by plotting the precision versus recall values and F-scores (the harmonic mean of precision and recall) versus varying thresholds. Precisions of ~58% and ~40% for localization and function of proteins, respectively, were determined at a threshold of ~30 (top 30 GO terms in the ranked list). Comparison with function prediction based on semantic similarity among nodes in an ontology, and incorporation of those similarities in a k-nearest-neighbor classifier, confirmed that our results compared favorably. Conclusions: This approach was applied to predict the cellular component and molecular function GO terms of all human proteins that have interacting partners possessing at least one known GO annotation. The list of predictions is available at http://severus.dbmi.pitt.edu/engo/GOPRED.html. We present the algorithm, evaluations and the results of the computational predictions, especially for genes identified in GWAS to be associated with diseases, which are of translational interest.
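The evaluation described above (precision, recall and F-score at a rank threshold) can be sketched as follows. The function name and the term labels in the test are hypothetical, and this is only one plausible reading of "threshold" as a top-k cutoff on the ranked list.

```python
def precision_recall_f(ranked_terms, true_terms, top_k):
    """Precision, recall and F-score for the top_k entries of a ranked prediction list.

    ranked_terms: predicted terms, best first (hypothetical input format).
    true_terms:   set of known annotations used as the reference.
    """
    top = ranked_terms[:top_k]
    true_positives = sum(1 for term in top if term in true_terms)
    precision = true_positives / len(top) if top else 0.0
    recall = true_positives / len(true_terms) if true_terms else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    # F-score is the harmonic mean of precision and recall.
    f_score = 2 * precision * recall / (precision + recall)
    return precision, recall, f_score
```

Sweeping top_k from 1 up to the list length gives the precision-versus-recall and F-score-versus-threshold curves mentioned in the abstract.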

Relevance:

30.00% 30.00%

Publisher:

Abstract:

Cis-peptide embedded segments are rare in proteins but often highlight their important role in molecular function when they do occur. The high evolutionary conservation of these segments illustrates this observation almost universally, although no attempt has been made to systematically use this information for the purpose of function annotation. In the present study, we demonstrate how geometric clustering and level-specific Gene Ontology molecular-function terms (also known as annotations) can be used in a statistically significant manner to identify cis-embedded segments in a protein linked to its molecular function. The present study identifies novel cis-peptide fragments, which are subsequently used for fragment-based function annotation. Annotation recall benchmarks interpreted using the receiver-operator characteristic plot returned an area-under-curve >0.9, corroborating the utility of the annotation method. In addition, we identified cis-peptide fragments occurring in conjunction with functionally important trans-peptide fragments, providing additional insights into molecular function. We further illustrate the applicability of our method in function annotation where homology-based annotation transfer is not possible. The findings of the present study add to the repertoire of function annotation approaches and also facilitate engineering, design and allied studies around the cis-peptide neighborhood of proteins.

Relevance:

30.00% 30.00%

Publisher:

Abstract:

The availability of the genome sequence of Mycobacterium tuberculosis H37Rv has encouraged determination of large numbers of protein structures and detailed definition of the biological information encoded therein; yet, the functions of many proteins in M. tuberculosis remain unknown. The emergence of multidrug resistant strains makes it a priority to exploit recent advances in homology recognition and structure prediction to re-analyse its gene products. Here we report the structural and functional characterization of gene products encoded in the M. tuberculosis genome, with the help of sensitive profile-based remote homology search and fold recognition algorithms, resulting in an enhanced annotation of the proteome where 95% of the M. tuberculosis proteins were identified wholly or partly with information on structure or function. New information includes association of 244 proteins with 205 domain families and a separate set of new associations of folds to 64 proteins. Extending structural information across uncharacterized protein families represented in the M. tuberculosis proteome, by determining superfamily relationships between families of known and unknown structures, has contributed to an enhancement in the knowledge of structural content. In retrospect, such superfamily relationships have facilitated recognition of probable structure and/or function for several uncharacterized protein families, eventually aiding recognition of probable functions for homologous proteins corresponding to such families. There are 183 gene products unique to mycobacteria for which no function could be identified. Of these, 18 were determined to be M. tuberculosis specific. Such pathogen-specific proteins are speculated to harbour virulence factors required for pathogenesis. A re-annotated proteome of M. tuberculosis, with greater completeness of annotated proteins and domain assigned regions, provides a valuable basis for experimental endeavours designed to obtain a better understanding of pathogenesis and to accelerate the process of drug target discovery. (C) 2014 Elsevier Ltd. All rights reserved.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

Several late gene expression factors (Lefs) have been implicated in fostering high levels of transcription from the very late gene promoters of polyhedrin and p10 from baculoviruses. We cloned and characterized from Bombyx mori nuclear polyhedrosis virus a late gene expression factor (Bmlef2) that encodes a 209-amino-acid protein harboring a Cys-rich C-terminal domain. The temporal transcription profiles of lef2 revealed a 1.2-kb transcript in both delayed early and late periods after virus infection. Transcription start site mapping identified the presence of an aphidicolin-sensitive late transcript arising from a TAAG motif located at -352 nucleotides and an aphidicolin-insensitive early transcript originating from a TTGT motif located 35 nucleotides downstream of a TATA box at -312 nucleotides, with respect to the +1 ATG of lef2. BmLef2 trans-activated very late gene expression from both polyhedrin and p10 promoters in transient expression assays. Internal deletion of the Cys-rich domain from the C-terminal region abolished the transcriptional activation. Inactivation of Lef2 synthesis by antisense lef2 transcripts drastically reduced the very late gene transcription but showed little effect on expression from the immediate early promoter. A decrease in viral DNA synthesis and a reduction in virus titer were observed only when antisense lef2 was expressed under the immediate early (ie-1) promoter. Furthermore, the antisense experiments suggested that lef2 plays a direct role in very late gene transcription.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

Transcription of tRNA genes by RNA polymerase III is controlled by the internal conserved sequences within the coding region and the immediate upstream flanking sequences. A highly transcribed copy of the glycyl tRNA gene tRNA1(Gly)-1 from Bombyx mori is downregulated by sequences located much farther upstream, in the region -150 to -300 nucleotides (nt) with respect to the +1 nt of the tRNA. The negative regulatory effect has been narrowed down to a sequence motif, 'TATATAA', a perfect consensus recognised by the TATA binding protein, TBP. When brought closer to the transcription start point, on the other hand, this sequence element exerts a positive effect by promoting transcription of the gene devoid of other cis regulatory elements. The identity of the nuclear protein interacting with this 'TATATAA' element as TBP has been established by antibody and mutagenesis studies. The 'TATATAA' element thus influences the transcription of tRNA genes positively or negatively in a position-dependent manner, either by recruitment or by sequestration of TBP from the transcription machinery.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

Although LH is essential for survival and function of the corpus luteum (CL) in higher primates, luteolysis occurs during nonfertile cycles without a discernible decrease in circulating LH levels. Using genome-wide expression analysis, several experiments were performed to examine the processes of luteolysis and rescue of luteal function in monkeys. Induced luteolysis with GnRH receptor antagonist (Cetrorelix) resulted in differential regulation of 3949 genes, whereas replacement with exogenous LH (Cetrorelix plus LH) led to regulation of 4434 genes (1563 down-regulated and 2871 up-regulated). A model system for prostaglandin (PG) F2α-induced luteolysis in the monkey was standardized and demonstrated that PGF2α regulated expression of 2290 genes in the CL. Analysis of the LH-regulated luteal transcriptome revealed that 120 genes were regulated in an antagonistic fashion by PGF2α. Based on the microarray data, 25 genes were selected for validation by real-time RT-PCR analysis, and expression of these genes was also examined in the CL throughout the luteal phase and from monkeys treated with human chorionic gonadotropin (hCG) to mimic early pregnancy. The results indicated changes in expression of genes favorable to PGF2α action during the late to very late luteal phase, and expression of many of these genes was regulated in an opposite manner by exogenous hCG treatment. Collectively, the findings suggest that curtailment of expression of downstream LH-target genes, possibly through PGF2α action on the CL, is among the mechanisms underlying cross talk between the luteotropic and luteolytic signaling pathways that result in the cessation of luteal function, but hCG is likely to abrogate the PGF2α-responsive gene expression changes, resulting in luteal rescue crucial for the maintenance of early pregnancy. (Endocrinology 150: 1473-1484, 2009)

Relevance:

20.00% 20.00%

Publisher:

Abstract:

The rapid development of recombinant DNA technology has brought forth a revolution in biology; it aids us to have a closer look at the way genes are organized, especially in the complex eucaryotic genomes. Although many animal and yeast genes have been studied in detail using recombinant DNA technology, plant genes have seldom been targets for such studies. Germination is an ideal process to study gene expression because it effects a shift in the metabolic status of seeds from a state of dormancy to an active one. An understanding of gene organization and regulation during germination can be accomplished by molecular cloning of DNA from seeds like rice. To study the status of histone, rRNA, tRNA and other genes in the rice genome, a general method was developed to clone eucaryotic DNA in a plasmid vector, pBR322. This essentially involves the following steps. The rice embryo and plasmid pBR322 DNAs were cut with restriction endonuclease BamHI to generate sticky ends. The plasmid DNA was phosphatased, and the DNAs were annealed and joined by T4 phage DNA ligase. The recombinant DNA molecules thus produced were transferred into E. coli, and colonies containing them were selected by their sensitivity to tetracycline and resistance to ampicillin. Two clones were identified as having tRNA genes by hybridization of the DNA in the clones with 32P-labelled rice tRNAs.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

In bovines, characterization of biochemical and molecular determinants of the dominant follicle before and during different time intervals after the gonadotrophin surge requires precise identification of the dominant follicle from a follicular wave. The objectives of the present study were to standardize an experimental model in buffalo cows for accurately identifying the dominant follicle of the first wave of follicular growth and to characterize changes in follicular fluid hormone concentrations as well as expression patterns of various genes associated with the process of ovulation. From the day of estrus (day 0), animals were subjected to blood sampling and ultrasonography for monitoring circulating progesterone levels and follicular growth. On day 7 of the cycle, animals were administered a PGF2α analogue (Tiaprost Trometamol, 750 μg i.m.) followed by an injection of hCG (2000 IU i.m.) 36 h later. Circulating progesterone levels progressively increased from day 1 of the cycle to 2.26 ± 0.17 ng/ml on day 7 of the cycle, but declined significantly after PGF2α injection. A progressive increase in the size of the dominant follicle was observed by ultrasonography. The follicular fluid estradiol and progesterone concentrations in the dominant follicle were 600 ± 16.7 and 38 ± 7.6 ng/ml, respectively, before hCG injection; 24 h post-hCG injection, the concentration of estradiol had decreased to 125.8 ± 25.26 ng/ml, whereas the concentration of progesterone had increased to 195 ± 24.6 ng/ml. Inh-α and Cyp19A1 expression in granulosa cells was maximal in the dominant follicle and declined in response to hCG treatment. Expression of progesterone receptor, oxytocin and cyclooxygenase-2 in granulosa cells, regarded as markers of ovulation, was maximal at 24 h post-hCG. The expression of genes belonging to the superfamily of proteases was also examined; Cathepsin L expression decreased, while ADAMTS 3 and 5 expression increased 24 h post-hCG treatment. The results of the current study indicate that sequential treatments with PGF2α and hCG during the early estrous cycle in the buffalo cow lead to follicular growth that culminates in ovulation. The model system reported in the present study would be valuable for examining temporo-spatial changes in the periovulatory follicle immediately before and after the onset of the gonadotrophin surge.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

Redundant DNA can buffer sequence dependent structural deviations from an ideal double helix. Buffering serves a mechanistic function by reducing extraneous conformational effects which could interfere with readout or which would impose energetic constraints on evolution. It also serves an evolutionary function by allowing for gradual variations in conformation-dependent regulation of gene expression. Such gradualism is critical for the rate of evolution. The buffer structure concept provides a new interpretation for repetitive DNA and for exons and introns.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

A stretch of 71 nucleotides in a 1.2 kilobase pair Pst I fragment of rice DNA was identified as a tRNA(Gly) gene by hybridization and nucleotide sequence analyses. The hybridization of genomic DNA with the tRNA gene showed that there are about 10 glycine tRNA genes per diploid rice genome. The 3' and 5' internal control regions, where RNA polymerase III and transcription factors bind, were found to be present in the coding sequence. The gene was transcribed into a 4S product in a yeast cell-free extract. The substitution of the 5' internal control region with analogous sequences from either M13mp19 or M13mp18 DNA did not affect the transcription of the gene in vitro. Changes in three highly conserved nucleotides in the consensus 5' internal control region (RGYNNARYGG; R = purine, Y = pyrimidine, N = any nucleotide) did not affect transcription, showing that these nucleotides are not essential for promotion of transcription. There were two 16 base pair repeats, 'TGTTTGTTTCAGCTTA', at positions -130 and -375 upstream from the start of the gene. Deletion of 5' flanking sequences including the 16 base pair repeat at -375 increased transcription, indicating that these sequences negatively modulate the expression of the gene.
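A degenerate consensus such as RGYNNARYGG can be located in a sequence by expanding its ambiguity codes into a regular expression. The following is a minimal sketch; the function names are hypothetical, only the codes used in the abstract (R, Y, N) plus the four bases are handled, and the test sequences are made up for illustration.

```python
import re

# Expansion of the ambiguity codes that appear in the abstract.
# R = purine (A or G), Y = pyrimidine (C or T), N = any nucleotide.
AMBIGUITY = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "[AG]", "Y": "[CT]", "N": "[ACGT]",
}


def consensus_to_pattern(consensus):
    """Turn a consensus string into a regular-expression source string."""
    return "".join(AMBIGUITY[symbol] for symbol in consensus.upper())


def find_consensus(consensus, sequence):
    """Return 0-based start positions of every match, including overlapping ones."""
    pattern = consensus_to_pattern(consensus)
    # A lookahead matches without consuming characters, so overlapping hits are kept.
    return [m.start() for m in re.finditer("(?=" + pattern + ")", sequence.upper())]
```

The same helper finds an exact motif such as the 16 base pair repeat, since a string made only of A, C, G and T expands to itself.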

Relevance:

20.00% 20.00%

Publisher:

Abstract:

A cDNA clone for cytochrome P-450e, a phenobarbitone-inducible species in rat liver, has been isolated and characterized. With the use of this cloned DNA, an attempt has been initiated to elucidate the factors regulating the cytochrome P-450 gene expression. Inhibitors of heme synthesis such as cobalt chloride and 3-amino-1,2,4-triazole block the induction of cytochrome P-450e by phenobarbitone at the level of transcription. This is evident from the decrease in the rate of synthesis of cytochrome P-450e, a decrease in the levels of specific translatable messenger RNA, a decrease in the specific cytoplasmic and nuclear messenger RNA contents, and nuclear transcription of cytochrome P-450e gene, as revealed by hybridization to the cloned probe, under these conditions. It is proposed that heme is a general regulator of cytochrome P-450 gene expression at the level of transcription, whereas the drug or its metabolite would impart the specificity needed for the induction of a particular species.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

A cDNA clone for the Ya subunit of glutathione transferase from rat liver was constructed in E. coli. The clone hybridized to Ya and Yc subunit messenger RNAs. On the basis of experiments involving cell-free translation and hybridization to the cloned probe, it was shown that prototype inducers of cytochrome P-450 such as phenobarbitone and 3-methylcholanthrene, as well as inhibitors such as CoCl2 and 3-amino-1,2,4-triazole, enhanced the glutathione transferase (Ya+Yc) messenger RNA contents in rat liver. A comparative study with the induction of cytochrome P-450 (b+e) by phenobarbitone revealed that the drug manifested a striking increase in the nuclear pre-messenger RNAs for the cytochrome at 12 hr, but did not significantly affect the same in the case of glutathione transferase (Ya+Yc). 3-Amino-1,2,4-triazole and CoCl2 blocked the phenobarbitone-mediated increase in cytochrome P-450 (b+e) nuclear pre-messenger RNAs. These compounds did not significantly affect the glutathione transferase (Ya+Yc) nuclear pre-messenger RNA levels. The polysomal, poly(A)-containing messenger RNAs for cytochrome P-450 (b+e) increased by 12–15 fold after phenobarbitone administration, reached a maximum around 16 hr and then decreased sharply. In comparison, the increase in the case of glutathione transferase (Ya+Yc) messenger RNAs was sluggish and steady, and a value of 3–4 fold was reached around 24 hr. Run-off transcription rates for cytochrome P-450 (b+e) increased by nearly 15 fold in 4 hr after phenobarbitone administration, whereas the increase for glutathione transferase (Ya+Yc) was only 2.0 fold. At 12 hr after the drug administration, the glutathione transferase (Ya+Yc) transcription rates were near normal. Administration of 3-amino-1,2,4-triazole and CoCl2 blocked the phenobarbitone-mediated increase in the transcription of cytochrome P-450 (b+e) messenger RNAs. These compounds at best had only marginal effects on the transcription of glutathione transferase (Ya+Yc) messenger RNAs. The half-life of cytochrome P-450 (b+e) messenger RNA was estimated to be 3–4 hr, whereas that for glutathione transferase (Ya+Yc) was found to be 8–9 hr. Administration of phenobarbitone enhanced the half-life of glutathione transferase (Ya+Yc) messenger RNA by nearly two fold. It is suggested that while transcription activation may play a primary role in the induction of cytochrome P-450 (b+e), the induction of glutathione transferase (Ya+Yc) may essentially involve stabilization of the messenger RNAs.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

This feature article describes the recent developments in the design of cationic lipids and their applications in gene delivery. Various structure-activity investigations explaining the variations in gene transfection efficacies with respect to different molecular structures of the cationic lipids have been discussed. Gene transfer abilities are presented in relation to aggregation properties of different aqueous formulations such as cationic liposomes and surfactant aggregates from various amphiphiles and cationic lipids, as a function of their hydrophobic parts, linkers and head groups.