3 results for protein domain

in Digital Commons - Michigan Tech


Relevance:

60.00%

Publisher:

Abstract:

Nitrogen and water are essential for plant growth and development. In this study, we designed experiments to produce gene expression data from poplar roots under nitrogen starvation and water deprivation. We found that a low nitrogen concentration led first to increased root elongation, followed by lateral root proliferation and eventually increased root biomass. To identify genes regulating root growth and development under nitrogen starvation and water deprivation, we designed a series of data analysis procedures through which we identified biologically important genes. Differentially expressed gene (DEG) analysis identified the genes that are differentially expressed under nitrogen starvation or drought. Protein domain enrichment analysis identified enriched themes (within the same domains) that are highly interactive during the treatment. Gene Ontology (GO) enrichment analysis allowed us to identify the biological processes that changed during nitrogen starvation. Based on the above analyses, we examined the local Gene Regulatory Network (GRN) and identified a number of transcription factors. Testing showed that one of them is a hierarchically high-ranking transcription factor that affects root growth under nitrogen starvation. Analyzing gene expression data manually is tedious and time-consuming. To avoid manual analysis, we set out to automate a computational pipeline, which can now be used to identify DEGs and perform protein domain analysis in a single run. It is implemented as Perl and R scripts.
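The abstract does not say which statistic the protein domain enrichment step uses. A common choice for this kind of analysis is a one-sided hypergeometric test, and the sketch below illustrates that idea only. It is written in Python rather than the Perl and R of the actual pipeline, and the function name, parameter names, and example counts are all hypothetical.

```python
from math import comb


def enrichment_p_value(population, domain_in_population, sample, domain_in_sample):
    """Upper-tail hypergeometric probability P(X >= domain_in_sample).

    population           -- number of background genes (assumed input)
    domain_in_population -- background genes carrying the domain
    sample               -- number of DEGs
    domain_in_sample     -- DEGs carrying the domain
    """
    upper = min(sample, domain_in_population)
    total = comb(population, sample)
    # Sum the probability of seeing i domain-carrying DEGs for every i >= observed.
    hits = sum(
        comb(domain_in_population, i)
        * comb(population - domain_in_population, sample - i)
        for i in range(domain_in_sample, upper + 1)
    )
    return hits / total


# Hypothetical counts, for illustration only (not taken from the study).
p = enrichment_p_value(population=1000, domain_in_population=40,
                       sample=50, domain_in_sample=8)
print(f"enrichment p-value: {p:.3g}")
```

A small p-value suggests the domain occurs among the DEGs more often than chance would predict. When many domains are tested, a multiple-testing correction would normally follow.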

Relevance:

60.00%

Publisher:

Abstract:

Analyzing large-scale gene expression data is a labor-intensive and time-consuming process. To make data analysis easier, we developed a set of pipelines for rapid processing and analysis of poplar gene expression data for knowledge discovery. Of the pipelines developed, the differentially expressed genes (DEGs) pipeline is designed to identify biologically important genes that are differentially expressed at one or more time points or conditions. The pathway analysis pipeline was designed to identify differentially expressed metabolic pathways. The protein domain enrichment pipeline identifies the enriched protein domains present in the DEGs. Finally, the Gene Ontology (GO) enrichment analysis pipeline was developed to identify the enriched GO terms in the DEGs. Our pipeline tools can analyze both microarray and high-throughput sequencing gene expression data. These two types of data are obtained with two different technologies. Microarray technology measures gene expression levels via microarray chips, collections of microscopic DNA spots attached to a solid (glass) surface, whereas high-throughput sequencing, also called next-generation sequencing, is a newer technology that measures gene expression levels by directly sequencing mRNAs and obtaining each mRNA's copy number in cells or tissues. We also developed a web portal (http://sys.bio.mtu.edu/) that makes all pipelines publicly available so that users can analyze their own gene expression data. In addition to the analyses mentioned above, the portal can also perform GO hierarchy analysis, i.e., construct GO trees from a list of GO terms given as input.
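The GO hierarchy analysis mentioned above can be pictured as walking from each input term up through its parents and recording the parent-to-child links found on the way. The following Python sketch shows that idea under stated assumptions: the term identifiers and the `parents` map are hypothetical toy data rather than real GO terms, and this is not the portal's implementation.

```python
def build_go_tree(terms, parents):
    """Return {term: sorted children} for the input terms and all their ancestors."""
    children = {}
    seen = set()
    stack = list(terms)
    while stack:
        term = stack.pop()
        if term in seen:
            continue
        seen.add(term)
        children.setdefault(term, set())
        # Link the term to each parent, then keep climbing toward the root.
        for parent in parents.get(term, ()):
            children.setdefault(parent, set()).add(term)
            stack.append(parent)
    return {term: sorted(kids) for term, kids in children.items()}


def render(tree, root, depth=0):
    """Return the subtree under `root` as a list of indented lines."""
    lines = ["  " * depth + root]
    for child in tree[root]:
        lines.extend(render(tree, child, depth + 1))
    return lines


# Hypothetical toy ontology: child term -> list of parent terms.
toy_parents = {
    "TERM:a": ["TERM:root"],
    "TERM:b": ["TERM:a"],
    "TERM:c": ["TERM:a"],
    "TERM:d": ["TERM:b"],
}

tree = build_go_tree(["TERM:d", "TERM:c"], toy_parents)
print("\n".join(render(tree, "TERM:root")))
```

Because GO is a directed acyclic graph rather than a strict tree, a term with several parents would appear once under each parent in this rendering.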

Relevance:

30.00%

Publisher:

Abstract:

Wood formation is an economically and environmentally important process and has played a significant role in the evolution of terrestrial plants. Despite its significance, the molecular underpinnings of the process are still poorly understood. We have previously shown that four Lateral Boundary Domain (LBD) transcription factors have important roles in the regulation of wood formation with two (LBD1 and LBD4) involved in secondary phloem and ray cell development and two (LBD15 and LBD18) in secondary xylem formation. Here, we used comparative phylogenetic analyses to test potential roles of the four LBD genes in the evolution of woodiness. We studied the copy number and variation in DNA and amino acid sequences of the four LBDs in a wide range of woody and herbaceous plant taxa with fully sequenced and annotated genomes. LBD1 showed the highest gene copy number across the studied species, and LBD1 gene copy number was strongly and significantly correlated with the level of ray seriation. The lianas, cucumber and grape, with multiseriate ray cells showed the highest gene copy number (12 and 11, respectively). Because lianas’ growth habit requires significant twisting and bending, the less lignified ray parenchyma cells likely facilitate stem flexibility and maintenance of xylem conductivity. We further demonstrate conservation of amino acids in the LBD18 protein sequences that are specific to woody taxa. Neutrality tests showed evidence for strong purifying selection on these gene regions across various orders, indicating adaptive convergent evolution of LBD18. Structural modeling demonstrates that the conserved amino acids have a significant impact on the tertiary protein structure and thus are likely of significant functional importance.