978 resultados para Gene Set Enrichment
Resumo:
To understand the biology and evolution of ruminants, the cattle genome was sequenced to about sevenfold coverage. The cattle genome contains a minimum of 22,000 genes, with a core set of 14,345 orthologs shared among seven mammalian species of which 1217 are absent or undetected in noneutherian (marsupial or monotreme) genomes. Cattle-specific evolutionary breakpoint regions in chromosomes have a higher density of segmental duplications, enrichment of repetitive elements, and species-specific variations in genes associated with lactation and immune responsiveness. Genes involved in metabolism are generally highly conserved, although five metabolic genes are deleted or extensively diverged from their human orthologs. The cattle genome sequence thus provides a resource for understanding mammalian evolution and accelerating livestock genetic improvement for milk and meat production.
Resumo:
Surrogate methods for detecting lateral gene transfer are those that do not require inference of phylogenetic trees. Herein I apply four such methods to identify open reading frames (ORFs) in the genome of Escherichia coli K12 that may have arisen by lateral gene transfer. Only two of these methods detect the same ORFs more frequently than expected by chance, whereas several intersections contain many fewer ORFs than expected. Each of the four methods detects a different non-random set of ORFs. The methods may detect lateral ORFs of different relative ages; testing this hypothesis will require rigorous inference of trees. (C) 2001 Federation of European Microbiological Societies. Published by Elsevier Science BN. All rights reserved.
Resumo:
Understanding the genetic architecture of quantitative traits can greatly assist the design of strategies for their manipulation in plant-breeding programs. For a number of traits, genetic variation can be the result of segregation of a few major genes and many polygenes (minor genes). The joint segregation analysis (JSA) is a maximum-likelihood approach for fitting segregation models through the simultaneous use of phenotypic information from multiple generations. Our objective in this paper was to use computer simulation to quantify the power of the JSA method for testing the mixed-inheritance model for quantitative traits when it was applied to the six basic generations: both parents (P-1 and P-2), F-1, F-2, and both backcross generations (B-1 and B-2) derived from crossing the F-1 to each parent. A total of 1968 genetic model-experiment scenarios were considered in the simulation study to quantify the power of the method. Factors that interacted to influence the power of the JSA method to correctly detect genetic models were: (1) whether there were one or two major genes in combination with polygenes, (2) the heritability of the major genes and polygenes, (3) the level of dispersion of the major genes and polygenes between the two parents, and (4) the number of individuals examined in each generation (population size). The greatest levels of power were observed for the genetic models defined with simple inheritance; e.g., the power was greater than 90% for the one major gene model, regardless of the population size and major-gene heritability. Lower levels of power were observed for the genetic models with complex inheritance (major genes and polygenes), low heritability, small population sizes and a large dispersion of favourable genes among the two parents; e.g., the power was less than 5% for the two major-gene model with a heritability value of 0.3 and population sizes of 100 individuals. The JSA methodology was then applied to a previously studied sorghum data-set to investigate the genetic control of the putative drought resistance-trait osmotic adjustment in three crosses. The previous study concluded that there were two major genes segregating for osmotic adjustment in the three crosses. Application of the JSA method resulted in a change in the proposed genetic model. The presence of the two major genes was confirmed with the addition of an unspecified number of polygenes.
Resumo:
The haploid NK model developed by Kauffman can be extended to diploid genomes and to incorporate gene-by-environment interaction effects in combination with epistasis. To provide the flexibility to include a wide range of forms of gene-by-environment interactions, a target population of environment types (TPE) is defined. The TPE consists of a set of E different environment types, each with their own frequency of occurrence. Each environment type conditions a different NK gene network structure or series of gene effects for a given network structure, providing the framework for defining gene-by-environment interactions. Thus, different NK models can be partially or completely nested within the E environment types of a TPE, giving rise to the E(NK) model for a biological system. With this model it is possible to examine how populations of genotypes evolve in context with properties of the environment that influence the contributions of genes to the fitness values of genotypes. We are using the E(NK) model to investigate how both epistasis and gene-by-environment interactions influence the genetic improvement of quantitative traits by plant breeding strategies applied to agricultural systems. © 2002 Wiley Periodicals, Inc.
Resumo:
In this paper we refer to the gene-to-phenotype modeling challenge as the GP problem. Integrating information across levels of organization within a genotype-environment system is a major challenge in computational biology. However, resolving the GP problem is a fundamental requirement if we are to understand and predict phenotypes given knowledge of the genome and model dynamic properties of biological systems. Organisms are consequences of this integration, and it is a major property of biological systems that underlies the responses we observe. We discuss the E(NK) model as a framework for investigation of the GP problem and the prediction of system properties at different levels of organization. We apply this quantitative framework to an investigation of the processes involved in genetic improvement of plants for agriculture. In our analysis, N genes determine the genetic variation for a set of traits that are responsible for plant adaptation to E environment-types within a target population of environments. The N genes can interact in epistatic NK gene-networks through the way that they influence plant growth and development processes within a dynamic crop growth model. We use a sorghum crop growth model, available within the APSIM agricultural production systems simulation model, to integrate the gene-environment interactions that occur during growth and development and to predict genotype-to-phenotype relationships for a given E(NK) model. Directional selection is then applied to the population of genotypes, based on their predicted phenotypes, to simulate the dynamic aspects of genetic improvement by a plant-breeding program. The outcomes of the simulated breeding are evaluated across cycles of selection in terms of the changes in allele frequencies for the N genes and the genotypic and phenotypic values of the populations of genotypes.
Resumo:
In microarray studies, the application of clustering techniques is often used to derive meaningful insights into the data. In the past, hierarchical methods have been the primary clustering tool employed to perform this task. The hierarchical algorithms have been mainly applied heuristically to these cluster analysis problems. Further, a major limitation of these methods is their inability to determine the number of clusters. Thus there is a need for a model-based approach to these. clustering problems. To this end, McLachlan et al. [7] developed a mixture model-based algorithm (EMMIX-GENE) for the clustering of tissue samples. To further investigate the EMMIX-GENE procedure as a model-based -approach, we present a case study involving the application of EMMIX-GENE to the breast cancer data as studied recently in van 't Veer et al. [10]. Our analysis considers the problem of clustering the tissue samples on the basis of the genes which is a non-standard problem because the number of genes greatly exceed the number of tissue samples. We demonstrate how EMMIX-GENE can be useful in reducing the initial set of genes down to a more computationally manageable size. The results from this analysis also emphasise the difficulty associated with the task of separating two tissue groups on the basis of a particular subset of genes. These results also shed light on why supervised methods have such a high misallocation error rate for the breast cancer data.
Resumo:
Dissertation presented to obtain the Master Degree in Molecular, Genetics and Biomedicine
Resumo:
Tese de Doutoramento em Biologia de Plantas.
Resumo:
BACKGROUND: Early detection and treatment of colorectal adenomatous polyps (AP) and colorectal cancer (CRC) is associated with decreased mortality for CRC. However, accurate, non-invasive and compliant tests to screen for AP and early stages of CRC are not yet available. A blood-based screening test is highly attractive due to limited invasiveness and high acceptance rate among patients. AIM: To demonstrate whether gene expression signatures in the peripheral blood mononuclear cells (PBMC) were able to detect the presence of AP and early stages CRC. METHODS: A total of 85 PBMC samples derived from colonoscopy-verified subjects without lesion (controls) (n = 41), with AP (n = 21) or with CRC (n = 23) were used as training sets. A 42-gene panel for CRC and AP discrimination, including genes identified by Digital Gene Expression-tag profiling of PBMC, and genes previously characterised and reported in the literature, was validated on the training set by qPCR. Logistic regression analysis followed by bootstrap validation determined CRC- and AP-specific classifiers, which discriminate patients with CRC and AP from controls. RESULTS: The CRC and AP classifiers were able to detect CRC with a sensitivity of 78% and AP with a sensitivity of 46% respectively. Both classifiers had a specificity of 92% with very low false-positive detection when applied on subjects with inflammatory bowel disease (n = 23) or tumours other than CRC (n = 14). CONCLUSION: This pilot study demonstrates the potential of developing a minimally invasive, accurate test to screen patients at average risk for colorectal cancer, based on gene expression analysis of peripheral blood mononuclear cells obtained from a simple blood sample.
Resumo:
We have used massively parallel signature sequencing (MPSS) to sample the transcriptomes of 32 normal human tissues to an unprecedented depth, thus documenting the patterns of expression of almost 20,000 genes with high sensitivity and specificity. The data confirm the widely held belief that differences in gene expression between cell and tissue types are largely determined by transcripts derived from a limited number of tissue-specific genes, rather than by combinations of more promiscuously expressed genes. Expression of a little more than half of all known human genes seems to account for both the common requirements and the specific functions of the tissues sampled. A classification of tissues based on patterns of gene expression largely reproduces classifications based on anatomical and biochemical properties. The unbiased sampling of the human transcriptome achieved by MPSS supports the idea that most human genes have been mapped, if not functionally characterized. This data set should prove useful for the identification of tissue-specific genes, for the study of global changes induced by pathological conditions, and for the definition of a minimal set of genes necessary for basic cell maintenance. The data are available on the Web at http://mpss.licr.org and http://sgb.lynxgen.com.
Resumo:
The fire ant Solenopsis invicta and its close relatives display an important social polymorphism involving differences in colony queen number. Colonies are headed by either a single reproductive queen (monogyne form) or multiple queens (polygyne form). This variation in social organization is associated with variation at the gene Gp-9, with monogyne colonies harboring only B-like allelic variants and polygyne colonies always containing b-like variants as well. We describe naturally occurring variation at Gp-9 in fire ants based on 185 full-length sequences, 136 of which were obtained from S. invicta collected over much of its native range. While there is little overall differentiation between most of the numerous alleles observed, a surprising amount is found in the coding regions of the gene, with such substitutions usually causing amino acid replacements. This elevated coding-region variation may result from a lack of negative selection acting to constrain amino acid replacements over much of the protein, different mutation rates or biases in coding and non-coding sequences, negative selection acting with greater strength on non-coding than coding regions, and/or positive selection acting on the protein. Formal selection analyses provide evidence that the latter force played an important role in the basal b-like lineages coincident with the emergence of polygyny. While our data set reveals considerable paraphyly and polyphyly of S. invicta sequences with respect to those of other fire ant species, the b-like alleles of the socially polymorphic species are monophyletic. An expanded analysis of colonies containing alleles of this clade confirmed the invariant link between their presence and expression of polygyny. Finally, our discovery of several unique alleles bearing various combinations of b-like and B-like codons allows us to conclude that no single b-like residue is completely predictive of polygyne behavior and, thus, potentially causally involved in its expression. Rather, all three typical b-like residues appear to be necessary.
Resumo:
BACKGROUND: Experimental evidences show that glutathione and its rate-limiting synthesizing enzyme, the glutamate-cysteine ligase (GCL), are involved in the pathogenesis of schizophrenia. Furthermore, genetic association has been previously reported between two single nucleotide polymorphisms lying in noncoding regions of glutamate cysteine ligase modifier (GCLM) gene, which specifies for the modifier subunit of GCL and schizophrenia. OBJECTIVE: We wanted to investigate the presence of GCLM true functional mutations, likely in linkage disequilibrium with the previously identified single nucleotide polymorphism alleles, in the same set of cases that allowed the detection of the original association signal. METHODS: We screened all the coding regions of GCLM and their intronic flanking vicinities in 353 patients with schizophrenia by direct DNA sequencing. RESULTS: Ten sequence variations were identified, five of which were not previously described. None of these DNA changes was within the GCLM coding sequence and in-silico analysis failed to indicate functional impairment induced by these variations. Furthermore, screening of normal controls and downstream statistical analyses revealed no significant relationship of any of these DNA variants with schizophrenia. CONCLUSION: It is unlikely that functional mutations in the GCLM gene could play a major role in genetic predisposition to schizophrenia and further studies will be required to assess its etiological function in the disease.
Resumo:
AbstractBACKGROUND: KRAB-ZFPs (Krüppel-associated box domain-zinc finger proteins) are vertebrate-restricted transcriptional repressors encoded in the hundreds by the mouse and human genomes. They act via an essential cofactor, KAP1, which recruits effectors responsible for the formation of facultative heterochromatin. We have recently shown that KRAB/KAP1 can mediate long-range transcriptional repression through heterochromatin spreading, but also demonstrated that this process is at times countered by endogenous influences.METHOD: To investigate this issue further we used an ectopic KRAB-based repressor. This system allowed us to tether KRAB/KAP1 to hundreds of euchromatic sites within genes, and to record its impact on gene expression. We then correlated this KRAB/KAP1-mediated transcriptional effect to pre-existing genomic and chromatin structures to identify specific characteristics making a gene susceptible to repression.RESULTS: We found that genes that were susceptible to KRAB/KAP1-mediated silencing carried higher levels of repressive histone marks both at the promoter and over the transcribed region than genes that were insensitive. In parallel, we found a high enrichment in euchromatic marks within both the close and more distant environment of these genes.CONCLUSION: Together, these data indicate that high levels of gene activity in the genomic environment and the pre-deposition of repressive histone marks within a gene increase its susceptibility to KRAB/KAP1-mediated repression.
Resumo:
Little is known about the role of the transcription factor peroxisome proliferator-activated receptor (PPAR) beta/delta in liver. Here we set out to better elucidate the function of PPARbeta/delta in liver by comparing the effect of PPARalpha and PPARbeta/delta deletion using whole genome transcriptional profiling and analysis of plasma and liver metabolites. In fed state, the number of genes altered by PPARalpha and PPARbeta/delta deletion was similar, whereas in fasted state the effect of PPARalpha deletion was much more pronounced, consistent with the pattern of gene expression of PPARalpha and PPARbeta/delta. Minor overlap was found between PPARalpha- and PPARbeta/delta-dependent gene regulation in liver. Pathways upregulated by PPARbeta/delta deletion were connected to innate immunity and inflammation. Pathways downregulated by PPARbeta/delta deletion included lipoprotein metabolism and various pathways related to glucose utilization, which correlated with elevated plasma glucose and triglycerides and reduced plasma cholesterol in PPARbeta/delta-/- mice. Downregulated genes that may underlie these metabolic alterations included Pklr, Fbp1, Apoa4, Vldlr, Lipg, and Pcsk9, which may represent novel PPARbeta/delta target genes. In contrast to PPARalpha-/- mice, no changes in plasma free fatty acid, plasma beta-hydroxybutyrate, liver triglycerides, and liver glycogen were observed in PPARbeta/delta-/- mice. Our data indicate that PPARbeta/delta governs glucose utilization and lipoprotein metabolism and has an important anti-inflammatory role in liver. Overall, our analysis reveals divergent roles of PPARalpha and PPARbeta/delta in regulation of gene expression in mouse liver.
Resumo:
Plant membrane compartments and trafficking pathways are highly complex, and are often distinct from those of animals and fungi. Progress has been made in defining trafficking in plants using transient expression systems. However, many processes require a precise understanding of plant membrane trafficking in a developmental context, and in diverse, specialized cell types. These include defense responses to pathogens, regulation of transporter accumulation in plant nutrition or polar auxin transport in development. In all of these cases a central role is played by the endosomal membrane system, which, however, is the most divergent and ill-defined aspect of plant cell compartmentation. We have designed a new vector series, and have generated a large number of stably transformed plants expressing membrane protein fusions to spectrally distinct, fluorescent tags. We selected lines with distinct subcellular localization patterns, and stable, non-toxic expression. We demonstrate the power of this multicolor 'Wave' marker set for rapid, combinatorial analysis of plant cell membrane compartments, both in live-imaging and immunoelectron microscopy. Among other findings, our systematic co-localization analysis revealed that a class of plant Rab1-homologs has a much more extended localization than was previously assumed, and also localizes to trans-Golgi/endosomal compartments. Constructs that can be transformed into any genetic background or species, as well as seeds from transgenic Arabidopsis plants, will be freely available, and will promote rapid progress in diverse areas of plant cell biology.