977 results for Termes Gene Ontology


Relevance: 80.00%

Abstract:

Recently, we identified a large number of ultraconserved (uc) sequences in noncoding regions of human, mouse, and rat genomes that appear to be essential for vertebrate and amniote ontogeny. Here, we used similar methods to identify ultraconserved genomic regions between the insect species Drosophila melanogaster and Drosophila pseudoobscura, as well as the more distantly related Anopheles gambiae. As with vertebrates, ultraconserved sequences in insects appear to occur primarily in intergenic and intronic sequences, and at intron-exon junctions. The sequences are significantly associated with genes encoding developmental regulators and transcription factors, but are less frequent and are smaller in size than in vertebrates. The longest identical, nongapped orthologous match between the three genomes was found within the homothorax (hth) gene. This sequence spans an internal exon-intron junction, with the majority located within the intron, and is predicted to form a highly stable stem-loop RNA structure. Real-time quantitative PCR analysis of different hth splice isoforms and Northern blotting showed that the conserved element is associated with a high incidence of intron retention in hth pre-mRNA, suggesting that the conserved intronic element is critically important in the post-transcriptional regulation of hth expression in Diptera.

Relevance: 80.00%

Abstract:

The gastrointestinal tracts of multi-cellular blood-feeding parasites are targets for vaccines and drugs. Recently, recombinant vaccines that interrupt the digestion of blood in the hookworm gut have shown efficacy, so we explored the intestinal transcriptomes of the human and canine hookworms, Necator americanus and Ancylostoma caninum, respectively. We used Laser Microdissection Microscopy to dissect gut tissue from the parasites, extracted the RNA and generated cDNA libraries. A total of 480 expressed sequence tags were sequenced from each library and assembled into contigs, accounting for 268 N. americanus genes and 276 A. caninum genes. Only 17% of N. americanus and 36% of A. caninum contigs were assigned Gene Ontology classifications. Twenty-six (9.8%) N. americanus and 18 (6.5%) A. caninum contigs did not have homologues in any databases, including dbEST; of these novel clones, seven N. americanus and three A. caninum contigs had Open Reading Frames with predicted secretory signal peptides. The most abundant transcripts corresponded to mRNAs encoding cholesterol- and fatty acid-binding proteins, C-type lectins, Activation-Associated Secretory Proteins, and proteases of different mechanistic classes, particularly astacin-like metallopeptidases. Expressed sequence tags corresponding to known and potential recombinant vaccines were identified and these included homologues of proteases, anti-clotting factors, defensins and integral membrane proteins involved in cell adhesion. (c) 2006 Australian Society for Parasitology Inc. Published by Elsevier Ltd. All rights reserved.

Relevance: 80.00%

Abstract:

Background: Current methods to find significantly under- and over-represented gene ontology (GO) terms in a set of genes consider the genes as equally probable balls in a bag, as may be appropriate for transcripts in micro-array data. However, due to the varying length of genes and intergenic regions, that approach is inappropriate for deciding if any GO terms are correlated with a set of genomic positions. Results: We present an algorithm, GONOME, that can determine which GO terms are significantly associated with a set of genomic positions given a genome annotated with (at least) the starts and ends of genes. We show that certain GO terms may appear to be significantly associated with a set of randomly chosen positions in the human genome if gene lengths are not considered, and that these same terms have been reported as significantly over-represented in a number of recent papers. This apparent over-representation disappears when gene lengths are considered, as GONOME does. For example, we show that, when gene length is taken into account, the term development is not significantly enriched in genes associated with human CpG islands, in contradiction to a previous report. We further demonstrate the efficacy of GONOME by showing that occurrences of the proteasome-associated control element (PACE) upstream activating sequence in the S. cerevisiae genome associate significantly with appropriate GO terms. An extension of this approach yields a whole-genome motif discovery algorithm that allows identification of many other promoter sequences linked to different types of genes, including a large group of previously unknown motifs significantly associated with the terms 'translation' and 'translational elongation'. Conclusion: GONOME is an algorithm that correctly extracts over-represented GO terms from a set of genomic positions. By explicitly considering gene size, GONOME avoids a systematic bias toward GO terms linked to large genes. Inappropriate use of existing algorithms that do not take gene size into account has led to erroneous or suspect conclusions. Conversely, GONOME may be used to identify new features in genomes that are significantly associated with particular categories of genes.
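
The length bias described in this abstract can be illustrated with a toy calculation. The sketch below is not the GONOME algorithm itself; it is a minimal illustration under stated assumptions: every position falls inside some gene (intergenic regions are ignored), both null models use a one-sided binomial tail, and the gene names, lengths, and annotations are invented for the example.

```python
from math import comb

def binomial_tail(k, n, p):
    """Return P(X >= k) for X ~ Binomial(n, p)."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k, n + 1))

def term_enrichment(genes, term, n_positions, n_hits):
    """Compare two null models for one GO term.

    genes: dict mapping gene name -> (length_bp, set_of_GO_terms)
    n_positions: number of genomic positions that fall inside any gene
    n_hits: how many of those fall inside a gene annotated with `term`
    """
    with_term = [length for length, terms in genes.values() if term in terms]
    total_bp = sum(length for length, _ in genes.values())
    # Null 1: every gene is equally likely to be hit ("balls in a bag").
    p_count = len(with_term) / len(genes)
    # Null 2: a gene is hit in proportion to its length in base pairs.
    p_length = sum(with_term) / total_bp
    return {
        "p_value_equal_genes": binomial_tail(n_hits, n_positions, p_count),
        "p_value_length_aware": binomial_tail(n_hits, n_positions, p_length),
    }

# Hypothetical toy genome: eight short genes and two long "development" genes.
toy_genes = {f"short{i}": (10_000, {"metabolism"}) for i in range(8)}
toy_genes["longA"] = (100_000, {"development"})
toy_genes["longB"] = (100_000, {"development"})

result = term_enrichment(toy_genes, "development", n_positions=20, n_hits=14)
print(result)
```

In this toy genome, 14 of 20 positions landing in "development" genes looks highly significant when genes are treated as equally probable, but is unremarkable once the two long genes are weighted by the share of sequence they cover.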

Relevance: 80.00%

Abstract:

Using the two largest collections of Mus musculus and Homo sapiens transcription start sites (TSSs) determined based on CAGE tags, ditags, full-length cDNAs, and other transcript data, we describe the compositional landscape surrounding TSSs with the aim of gaining better insight into the properties of mammalian promoters. We classified TSSs into four types based on compositional properties of regions immediately surrounding them. These properties highlighted distinctive features in the extended core promoters that helped us delineate boundaries of the transcription initiation domain space for both species. The TSS types were analyzed for associations with initiating dinucleotides, CpG islands, TATA boxes, and an extensive collection of statistically significant cis-elements in mouse and human. We found that different TSS types show preferences for different sets of initiating dinucleotides and cis-elements. Through Gene Ontology and eVOC categories and tissue expression libraries we linked TSS characteristics to expression. Moreover, we show a link of TSS characteristics to very specific genomic organization in an example of immune-response-related genes (GO:0006955). Our results shed light on the global properties of the two transcriptomes not revealed before and therefore provide the framework for better understanding of the transcriptional mechanisms in the two species, as well as a framework for development of new and more efficient promoter- and gene-finding tools.

Relevance: 80.00%

Abstract:

Application of a computational membrane organization prediction pipeline, MemO, identified putative type II membrane proteins as proteins predicted to encode a single alpha-helical transmembrane domain (TMD) and no signal peptides. MemO was applied to RIKEN's mouse isoform protein set to identify 1436 non-overlapping genomic regions or transcriptional units (TUs), which encode exclusively type II membrane proteins. Proteins with overlapping predicted InterPro and TMDs were reviewed to discard false positive predictions, resulting in a dataset comprising 1831 transcripts in 1408 TUs. This dataset was used to develop a systematic protocol to document subcellular localization of type II membrane proteins. This approach combines mining of published literature to identify subcellular localization data and a high-throughput, polymerase chain reaction (PCR)-based approach to experimentally characterize subcellular localization. These approaches have provided localization data for 244 and 169 proteins, respectively. Type II membrane proteins are localized to all major organelle compartments; however, some biases were observed towards the early secretory pathway and punctate structures. Collectively, this study reports the subcellular localization of 26% of the defined dataset. All reported localization data are presented in the LOCATE database (http://www.locate.imb.uq.edu.au).

Relevance: 80.00%

Abstract:

Large-scale gene discovery has been performed for the grass fungal endophytes Neotyphodium coenophialum, Neotyphodium lolii, and Epichloe festucae. The resulting sequences have been annotated by comparison with public DNA and protein sequence databases and using intermediate gene ontology annotation tools. Endophyte sequences have also been analysed for the presence of simple sequence repeat and single nucleotide polymorphism molecular genetic markers. Sequences and annotation are maintained within a MySQL database that may be queried using a custom web interface. Two cDNA-based microarrays have been generated from this genome resource. They permit the interrogation of 3806 Neotyphodium genes (Nchip(TM) microarray), and 4195 Neotyphodium and 920 Epichloe genes (EndoChip(TM) microarray), respectively. These microarrays provide tools for high-throughput transcriptome analysis, including genome-specific gene expression studies, profiling of novel endophyte genes, and investigation of the host grass-symbiont interaction. Comparative transcriptome analysis in Neotyphodium and Epichloe was performed. (c) 2006 Elsevier

Relevance: 80.00%

Abstract:

Despite the presence of over 3 million transposons separated on average by ~500 bp, the human and mouse genomes each contain almost 1000 transposon-free regions (TFRs) over 10 kb in length. The majority of human TFRs correlate with orthologous TFRs in the mouse, despite the fact that most transposons are lineage specific. Many human TFRs also overlap with orthologous TFRs in the marsupial opossum, indicating that these regions have remained refractory to transposon insertion for long evolutionary periods. Over 90% of the bases covered by TFRs are noncoding, much of which is not highly conserved. Most TFRs are not associated with unusual nucleotide composition, but are significantly associated with genes encoding developmental regulators, suggesting that they represent extended regions of regulatory information that are largely unable to tolerate insertions, a conclusion difficult to reconcile with current conceptions of gene regulation.

Relevance: 80.00%

Abstract:

The application of mechanical insults to the spinal cord results in profound cellular and molecular changes, including the induction of neuronal cell death and altered gene expression profiles. Previous studies have described alterations in gene expression following spinal cord injury, but the specificity of this response to mechanical stimuli is difficult to investigate in vivo. Therefore, we have investigated the effect of cyclic tensile stresses on cultured spinal cord cells from E15 Sprague-Dawley rats, using the FX3000 Flexercell Strain Unit. We examined cell morphology and viability over a 72-hour time course. Microarray analysis of gene expression was performed using the Affymetrix GeneChip System, and the identified genes were categorized using the Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) systems. Changes in expression of 12 genes were validated with quantitative real-time reverse transcription polymerase chain reaction (RT-PCR).

Relevance: 80.00%

Abstract:

To capture the genomic profiles for histone modification, chromatin immunoprecipitation (ChIP) is combined with next generation sequencing, which is called ChIP-seq. However, enriched regions generated from the ChIP-seq data are only evaluated on the limited knowledge acquired from manually examining the relevant biological literature. This paper proposes a novel framework, which integrates multiple knowledge sources such as biological literature, Gene Ontology, and microarray data. In order to precisely analyze ChIP-seq data for histone modification, knowledge integration is based on a unified probabilistic model. The model is employed to re-rank the enriched regions generated from peak-finding algorithms. By filtering the re-ranked enriched regions using a predefined threshold, more reliable and precise results can be generated. The combination of the multiple knowledge sources with the peak-finding algorithm produces a new paradigm for ChIP-seq data analysis. © (2012) Trans Tech Publications, Switzerland.
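
The re-rank-then-filter step described in this abstract can be sketched in a few lines. The abstract does not specify the probabilistic model, so the combination rule here is purely an assumption for illustration: each knowledge source is treated as an independent probability that a region is genuine, and the sources are combined by summing log-odds. The region names and numbers are hypothetical.

```python
from math import exp, log

def logit(p):
    """Log-odds of a probability strictly between 0 and 1."""
    return log(p / (1 - p))

def sigmoid(x):
    """Inverse of logit."""
    return 1 / (1 + exp(-x))

def rerank(regions, threshold):
    """Re-rank enriched regions by combined support, then filter.

    regions: list of dicts with keys
        'name'      - region identifier
        'peak_prob' - confidence from the peak finder, in (0, 1)
        'evidence'  - list of probabilities in (0, 1), one per knowledge source
    Assumption (not from the paper): sources are independent, so their
    log-odds simply add up.
    """
    scored = []
    for region in regions:
        log_odds = logit(region["peak_prob"]) + sum(logit(p) for p in region["evidence"])
        scored.append((region["name"], sigmoid(log_odds)))
    scored.sort(key=lambda item: item[1], reverse=True)
    return [(name, score) for name, score in scored if score >= threshold]

# Hypothetical regions: A has a strong peak but weak external support,
# B has a modest peak backed by literature, GO, and microarray evidence.
regions = [
    {"name": "A", "peak_prob": 0.9, "evidence": [0.2, 0.3, 0.4]},
    {"name": "B", "peak_prob": 0.6, "evidence": [0.8, 0.7, 0.9]},
    {"name": "C", "peak_prob": 0.3, "evidence": [0.5, 0.5, 0.5]},
]

for name, score in rerank(regions, threshold=0.5):
    print(name, round(score, 3))
```

With these made-up inputs, region B overtakes region A after re-ranking even though A had the stronger raw peak, which is the kind of reordering the abstract attributes to knowledge integration.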

Relevance: 80.00%

Abstract:

Previous studies have described alterations in gene expression following spinal cord injury, but this response to mechanical stimuli is difficult to investigate in vivo. Therefore, we have investigated the effect of cyclic tensile strain on cultured spinal cord cells from E15 Sprague-Dawley rats. Microarray analysis of gene expression and categorization of identified genes were performed using Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) systems. The application of cyclic tensile strain reduced the viability of cultured spinal cord cells significantly in a dose- and time-dependent manner. GO analysis identified candidate genes related to apoptosis (44) and to response to stimulus (17). KEGG analysis identified changes in the expression levels of 12 genes of the mitogen-activated protein kinase (MAPK) signaling pathway, whose upregulation was validated by RT-PCR analysis. Spinal cord cells undergo cell death in response to cyclic tensile strain in a dose- and time-dependent manner, with upregulation of various genes, in particular those of the MAPK pathway.

Relevance: 80.00%

Abstract:

Abiotic environmental stresses trigger molecular, physiological, and morphological responses in plants, depending on their intensity and duration. Some species tolerate stressful conditions and are at the same time natural sources of raw material for industry. One such species is the castor bean (Ricinus communis L.), the main source of castor oil, which is valued for its pharmaceutical and, above all, industrial applications. It is grown as a crop in regions where water availability is limited and is a source of income for agriculture in northeastern Brazil. Little is known about the molecular responses that allow this plant to tolerate dry regions, or about how the seeds, the main focus of interest, respond to water scarcity. In this work, two cDNA libraries were therefore constructed using a subtractive approach: one containing RNAs differentially expressed in seeds of castor bean plants subjected to water stress for 5 days (library L7), and the other containing RNAs differentially expressed in control seeds (library L5). Library L7 showed the greater variety of transcripts, with a total of 182. Most of the functions assigned by the Gene Ontology (GO) system related to "Metabolic processes" (526), followed by "Response to stimulus" (57) and "Development" (26). In library L5, 91 transcripts were found, with most functions related to "Metabolic processes" (413), followed by "Response to stimulus" (8) and "Regulation" (6). Some transcripts from library L7 were selected for analysis because they occurred more than three times and did not appear in library L5, which indicates possible upregulation under stress.
The analyses of metallothionein (4x) showed that the protein sequence contained the conserved domains that characterize it as type II, which has two cysteine-rich functional domains with highly conserved positions. These domains bind heavy metals and are therefore correlated with reactive oxygen species (ROS) scavenging activity and defense against oxidative stress. The sequence also showed homology with that of Bruguiera gymnorhiza, a mangrove plant adapted to saline environments. We also analyzed the transcripts for the protein AUXIN-REPRESSED 12.5 KDA (3x), which is reported to be repressed by the hormone auxin and associated with seed dormancy; it belongs to a gene family in which several members take part in stress-response pathways. Finally, we analyzed the protein GLUTELIN TYPE-A 3 (5x), an important storage protein of hydrophilic character, possibly targeted to the vacuole. In our work, an increase in transcripts relative to the control subtraction was observed, possibly reflecting increased seed metabolism, both for the defensive response to water stress and for rapid seed maturation; the transcripts observed related to oxidative response, hormonal control, storage proteins, and oil production.

Relevance: 80.00%

Abstract:

Background
It is generally acknowledged that a functional understanding of a biological system can only be obtained by understanding the collective of molecular interactions in the form of biological networks. Protein networks are one network type of special importance, because proteins form the functional base units of every biological cell. At the mesoscopic level of protein networks, modules are of significant importance because these building blocks may be the next elementary functional level above individual proteins, offering insight into fundamental organizational principles of biological cells.
Results
In this paper, we provide a comparative analysis of five popular and four novel module detection algorithms. We study these module prediction methods for simulated benchmark networks as well as 10 biological protein interaction networks (PINs). A particular focus of our analysis is placed on the biological meaning of the predicted modules, using the Gene Ontology (GO) database as a gold standard for the definition of biological processes. Furthermore, we investigate the robustness of the results by perturbing the PINs, thereby simulating our incomplete knowledge of protein networks.
Conclusions
Overall, our study reveals that there is a large heterogeneity among the different module prediction algorithms when one zooms in to the level of biological processes in the form of GO terms, and that all methods are severely affected by a slight perturbation of the networks. However, we also find pathways that are enriched in multiple modules, which could provide important information about the hierarchical organization of the system.

Relevance: 80.00%

Abstract:

Fertilization is a complex, multistep process culminating in the merging of gamete membranes, cytoplasmic unity, and fusion of the genomes. CD81 is a tetraspanin protein that participates in sperm-oocyte interaction and is present at the oocyte surface. CD81 has also been implicated in other biological processes; however, its specific function and molecular mechanisms of action remain to be elucidated. The interaction between CD81 and its binding partner proteins may underlie the involvement of CD81 in a variety of cellular processes and modulate the specific functions of CD81 and its interactors. Interestingly, in a yeast two-hybrid screen previously performed in our lab, CD81 emerged as a putative interactor of the Amyloid Precursor Protein (APP). In the work described here, bioinformatics analyses of CD81-interacting proteins were performed and the retrieved information was used to construct a protein-protein interaction network, as well as to perform Gene Ontology enrichment analyses. CD81 expression was further evaluated in CHO, GC-1 and SH-SY5Y cell lines, and in human sperm cells. Additionally, its subcellular localization was analyzed in sperm cells and in the neuronal-like SH-SY5Y cell line. Subsequently, coimmunoprecipitation assays were performed in CHO and SH-SY5Y cells to attempt to prove the physical interaction between CD81 and APP. A functional interaction between these two proteins was assessed through analysis of the effects of CD81 overexpression on APP levels. A co-localization analysis of CD81 and some interactor proteins retrieved from the bioinformatics analyses, such as APP, AKT1 and cytoskeleton-related proteins, was also performed in sperm cells and in SH-SY5Y cells. The effect of CD81 on cytoskeleton remodeling was evaluated in SH-SY5Y cells by monitoring the effects of CD81 overexpression on actin and tubulin levels, and by analyzing the co-localization between overexpressed CD81 and F-actin.
Our results showed that CD81 is expressed in all cell lines tested, and also provided the first evidence of the presence of CD81 in human sperm cells. CD81 immunoreactivity was predominantly detected in the sperm head, including the acrosome membrane, and in the midpiece, where it co-localized with APP, as well as in the post-acrosomal region. Furthermore, CD81 co-localizes with APP in the plasma membrane and in cellular projections in SH-SY5Y cells, where CD81 overexpression influences APP levels, an effect also visible in CHO cells. The analysis of CD81-interacting proteins such as AKT1 and cytoskeleton-related proteins showed that CD81 is involved in a variety of pathways that may underlie cytoskeleton remodeling events related to processes such as sperm motility, cell migration and neuritogenesis. These results deepen our understanding of the functions of CD81 and some of its interactors in sperm and neuronal cells.

Relevance: 80.00%

Abstract:

The exocarp, or skin, of fleshy fruit is a specialized tissue that protects the fruit, attracts seed-dispersing fruit eaters, and has great economic relevance for fruit quality. Development of the exocarp involves regulated activities of many genes. This research analyzed global gene expression in the exocarp of developing sweet cherry (Prunus avium L., 'Regina'), a fruit crop species with few public genomic resources. A catalog of transcript models (contigs) representing expressed genes was constructed from de novo assembled short complementary DNA (cDNA) sequences generated from developing fruit between flowering and maturity at 14 time points. Expression levels in each sample were estimated for 34 695 contigs from numbers of reads mapping to each contig. Contigs were annotated functionally based on BLAST, gene ontology and InterProScan analyses. Coregulated genes were detected using partitional clustering of expression patterns. The results are discussed with emphasis on genes putatively involved in cuticle deposition, cell wall metabolism and sugar transport. The high temporal resolution of the expression patterns presented here reveals finely tuned developmental specialization of individual members of gene families. Moreover, the de novo assembled sweet cherry fruit transcriptome, with 7760 full-length protein coding sequences and over 20 000 other annotated cDNA sequences together with their developmental expression patterns, is expected to accelerate molecular research on this important tree fruit crop.

Relevance: 80.00%

Abstract:

Human mesenchymal stem cells (MSC) are powerful sources for cell therapy in regenerative medicine. Long-term cultivation can result in replicative senescence or can be related to the emergence of chromosomal alterations responsible for the acquisition of tumorigenic features in vitro. In this study, for the first time, the expression profile of MSC with a paracentric chromosomal inversion (MSC/inv) was compared with that of MSC with a normal karyotype (MSC/n) in early and late passages. Furthermore, we compared the transcriptome of each MSC in early passages with that in late passages. The MSC used in this study were obtained from the umbilical vein of three donors, two MSC/n and one MSC/inv. After cryopreservation, they were expanded in vitro until they reached senescence. Total RNA was extracted using the RNeasy mini kit (Qiagen) and labeled with the GeneChip® 3' IVT Express Kit (Affymetrix Inc.). Subsequently, the fragmented aRNA was hybridized on Affymetrix Human Genome U133 Plus 2.0 arrays (Affymetrix Inc.). The statistical analysis of differential gene expression between MSC groups was performed with Partek Genomics Suite software, version 6.4 (Partek Inc.). Differences in expression were considered statistically significant at a Bonferroni-corrected p-value < 0.01. Only signals with fold change > 3.0 were included in the list of differentially expressed genes. Differences in gene expression obtained from the microarrays were confirmed by real-time RT-PCR. For the biological interpretation of the expression data, we used IPA (Ingenuity Systems) for functional enrichment analysis, STRING 9.0 to construct interaction networks, and Cytoscape 2.8 for network visualization and bottleneck analysis, with the aid of GraphPad Prism 5.0 software. The BiNGO Cytoscape plugin was used to assess overrepresentation of Gene Ontology categories in biological networks.
Comparison of senescent and young cells within each MSC group showed a difference in expression pattern, which was greater in the senescent MSC/inv group. The results also showed a difference in expression profiles between MSC/inv and MSC/n, which was greater when the cells were senescent. New networks were identified for genes related to the response of the two MSC types over cultivation time. We also identified genes that can coordinate the functional categories overrepresented in the networks, such as CXCL12, SFRP1, EGF, SPP1, MMP1 and THBS1. The biological interpretation of these data suggests that the MSC/inv population has different constitutional characteristics, related to its potential for differentiation, proliferation and response to stimuli, which are responsible for a distinct process of replicative senescence in MSC/inv compared with MSC/n. The genes identified in this study are candidate biomarkers of cellular senescence in MSC, but their functional relevance in this process should be evaluated in additional in vitro and/or in vivo assays.
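
The two selection criteria stated in this abstract (a Bonferroni-corrected p-value below 0.01 and a fold change above 3.0) can be expressed as a simple filter. This is a minimal sketch, not the Partek workflow used by the authors: the probe identifiers and values are invented, and applying the fold-change cutoff to the absolute value, so that strongly downregulated probes also pass, is an assumption.

```python
def select_differential(results, alpha=0.01, min_fold=3.0):
    """Keep probes passing a Bonferroni-corrected p-value and fold-change cutoff.

    results: list of (probe_id, raw_p_value, fold_change) tuples, where a
    negative fold change denotes downregulation (e.g. -4.0 means 4x lower).
    """
    n_tests = len(results)
    selected = []
    for probe_id, raw_p, fold_change in results:
        # Bonferroni: multiply each raw p-value by the number of tests, cap at 1.
        adjusted_p = min(1.0, raw_p * n_tests)
        # Assumption: the cutoff applies to the magnitude of the fold change.
        if adjusted_p < alpha and abs(fold_change) > min_fold:
            selected.append((probe_id, adjusted_p, fold_change))
    return selected

# Hypothetical probe-level results: (probe, raw p-value, fold change).
example = [
    ("probe_a", 1e-6, 4.2),    # passes both cutoffs
    ("probe_b", 1e-6, 2.0),    # fold change too small
    ("probe_c", 0.004, 5.0),   # adjusted p = 0.016, not significant
    ("probe_d", 1e-5, -3.5),   # downregulated, passes both cutoffs
]

for probe_id, adjusted_p, fold_change in select_differential(example):
    print(probe_id, adjusted_p, fold_change)
```

On an array with tens of thousands of probes, the Bonferroni multiplier is correspondingly large, so only very small raw p-values survive the correction.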