51 resultados para Gene Set Enrichment
Resumo:
We set out to define patterns of gene expression during kidney organogenesis by using high-density DNA array technology. Expression analysis of 8,740 rat genes revealed five discrete patterns or groups of gene expression during nephrogenesis. Group 1 consisted of genes with very high expression in the early embryonic kidney, many with roles in protein translation and DNA replication. Group 2 consisted of genes that peaked in midembryogenesis and contained many transcripts specifying proteins of the extracellular matrix. Many additional transcripts allied with groups 1 and 2 had known or proposed roles in kidney development and included LIM1, POD1, GFRA1, WT1, BCL2, Homeobox protein A11, timeless, pleiotrophin, HGF, HNF3, BMP4, TGF-α, TGF-β2, IGF-II, met, FGF7, BMP4, and ganglioside-GD3. Group 3 consisted of transcripts that peaked in the neonatal period and contained a number of retrotransposon RNAs. Group 4 contained genes that steadily increased in relative expression levels throughout development, including many genes involved in energy metabolism and transport. Group 5 consisted of genes with relatively low levels of expression throughout embryogenesis but with markedly higher levels in the adult kidney; this group included a heterogeneous mix of transporters, detoxification enzymes, and oxidative stress genes. The data suggest that the embryonic kidney is committed to cellular proliferation and morphogenesis early on, followed sequentially by extracellular matrix deposition and acquisition of markers of terminal differentiation. The neonatal burst of retrotransposon mRNA was unexpected and may play a role in a stress response associated with birth. Custom analytical tools were developed including “The Equalizer” and “eBlot,” which contain improved methods for data normalization, significance testing, and data mining.
Resumo:
The transcriptional effects of deregulated myc gene overexpression are implicated in tumorigenesis in a spectrum of experimental and naturally occurring neoplasms. In follicles of the chicken bursa of Fabricius, myc induction of B-cell neoplasia requires a target cell population present during early bursal development and progresses through preneoplastic transformed follicles to metastatic lymphomas. We developed a chicken immune system cDNA microarray to analyze broad changes in gene expression that occur during normal embryonic B-cell development and during myc-induced neoplastic transformation in the bursa. The number of mRNAs showing at least 3-fold change was greater during myc-induced lymphomagenesis than during normal development, and hierarchical cluster analysis of expression patterns revealed that levels of several hundred mRNAs varied in concert with levels of myc overexpression. A set of 41 mRNAs were most consistently elevated in myc-overexpressing preneoplastic and neoplastic cells, most involved in processes thought to be subject to regulation by Myc. The mRNAs for another cluster of genes were overexpressed in neoplasia independent of myc expression level, including a small subset with the expression signature of embryonic bursal lymphocytes. Overexpression of myc, and some of the genes overexpressed with myc, may be important for generation of preneoplastic transformed follicles. However, expression profiles of late metastatic tumors showed a large variation in concert with myc expression levels, and some showed minimal myc overexpression. Therefore, high-level myc overexpression may be more important in the early induction of these lymphomas than in maintenance of late-stage metastases.
Resumo:
Precise classification of tumors is critically important for cancer diagnosis and treatment. It is also a scientifically challenging task. Recently, efforts have been made to use gene expression profiles to improve the precision of classification, with limited success. Using a published data set for purposes of comparison, we introduce a methodology based on classification trees and demonstrate that it is significantly more accurate for discriminating among distinct colon cancer tissues than other statistical approaches used heretofore. In addition, competing classification trees are displayed, which suggest that different genes may coregulate colon cancers.
Resumo:
Current evidence on the long-term evolutionary effect of insertion of sequence elements into gene regions is reviewed, restricted to cases where a sequence derived from a past insertion participates in the regulation of expression of a useful gene. Ten such examples in eukaryotes demonstrate that segments of repetitive DNA or mobile elements have been inserted in the past in gene regions, have been preserved, sometimes modified by selection, and now affect control of transcription of the adjacent gene. Included are only examples in which transcription control was modified by the insert. Several cases in which merely transcription initiation occurred in the insert were set aside. Two of the examples involved the long terminal repeats of mammalian endogenous retroviruses. Another two examples were control of transcription by repeated sequence inserts in sea urchin genomes. There are now six published examples in which Alu sequences were inserted long ago into human gene regions, were modified, and now are central in control/enhancement of transcription. The number of published examples of Alu sequences affecting gene control has grown threefold in the last year and is likely to continue growing. Taken together, all of these examples show that the insertion of sequence elements in the genome has been a significant source of regulatory variation in evolution.
Resumo:
The c-myc oncogene has been shown to play a role in cell proliferation and apoptosis. The realization that myc oncogenes may control the level of expression of other genes has opened the field to search for genetic targets for Myc regulation. Recently, using a subtraction/coexpression strategy, a murine genetic target for Myc regulation, called EC439, was isolated. To further characterize the ECA39 gene, we set out to determine the evolutionary conservation of its regulatory and coding sequences. We describe the human, nematode, and budding yeast homologs of the mouse ECA39 gene. Identities between the mouse ECA39 protein and the human, nematode, or yeast proteins are 79%, 52%, and 49%, respectively. Interestingly, the recognition site for Myc binding, located 3' to the start site of transcription in the mouse gene, is also conserved in the human homolog. This regulatory element is missing in the ECA39 homologs from nematode or yeast, which also lack the regulator c-myc. To understand the function of ECA39, we deleted the gene from the yeast genome. Disruption of ECA39 which is a recessive mutation that leads to a marked alteration in the cell cycle. Mutant haploids and homozygous diploids have a faster growth rate than isogenic wild-type strains. Fluorescence-activated cell sorter analyses indicate that the mutation shortens the G1 stage in the cell cycle. Moreover, mutant strains show higher rates of UV-induced mutations. The results suggest that the product of ECA39 is involved in the regulation of G1 to S transition.
Resumo:
The origin of land vertebrates was one of the major transitions in the history of vertebrates. Yet, despite many studies that are based on either morphology or molecules, the phylogenetic relationships among tetrapods and the other two living groups of lobe-finned fishes, the coelacanth and the lungfishes, are still unresolved and debated. Knowledge of the relationships among these lineages, which originated back in the Devonian, has profound implications for the reconstruction of the evolutionary scenario of the conquest of land. We collected the largest molecular data set on this issue so far, about 3,500 base pairs from seven species of the large 28S nuclear ribosomal gene. All phylogenetic analyses (maximum parsimony, neighbor-joining, and maximum likelihood) point toward the hypothesis that lungfishes and coelacanths form a monophyletic group and are equally closely related to land vertebrates. This evolutionary hypothesis complicates the identification of morphological or physiological preadaptations that might have permitted the common ancestor of tetrapods to colonize land. This is because the reconstruction of its ancestral conditions would be hindered by the difficulty to separate uniquely derived characters from shared derived characters in the coelacanth/lungfish and tetrapod lineages. This molecular phylogeny aids in the reconstruction of morphological evolutionary steps by providing a framework; however, only paleontological evidence can determine the sequence of morphological acquisitions that allowed lobe-finned fishes to colonize land.
Resumo:
Addition of a saturated fatty acid (SFA) induced a strong increase in heat shock (HS) mRNA transcription when cells were heat-shocked at 37 degrees C, whereas treatment with an unsaturated fatty acid (UFA) reduced or eliminated the level of HS gene transcription at 37 degrees C. Transcription of the delta 9-desaturase gene (Ole1) of Histoplasma capsulatum, whose gene product is responsible for the synthesis of UFA, is up-regulated in a temperature-sensitive strain. We show that when the L8-14C mutant of Saccharomyces cerevisiae, which has a disrupted Ole1 gene, is complemented with its own Ole1 coding region under control of its own promoter or Ole1 promoters of H. capsulatum, the level of HS gene transcription depends on the activity of the promoters. Fluorescence anisotropy of mitochondrial membranes of completed strains corresponded to the different activity of the Ole1 promoter used. We propose that the SFA/UFA ratio and perturbation of membrane lipoprotein complexes are involved in the perception of rapid temperature changes and under HS conditions disturbance of the preexisting membrane physical state causes transduction of a signal that induces transcription of HS genes.
Resumo:
The three members of the Brn-3 family of POU domain transcription factors are found in highly restricted sets of central nervous system neurons. Within the retina, these factors are present only within subsets of ganglion cells. We show here that in the developing mouse retina, Brn-3b protein is first observed in presumptive ganglion cell precursors as they begin to migrate from the zone of dividing neuroblasts to the future ganglion cell layer, and that targeted disruption of the Brn-3b gene leads in the homozygous state to a selective loss of 70% of retinal ganglion cells. In Brn-3b (-/-) mice other neurons within the retina and brain are minimally or not at all affected. These experiments indicate that Brn-3b plays an essential role in the development of specific ganglion cell types.
Resumo:
Plectin, a 500-kDa intermediate filament binding protein, has been proposed to provide mechanical strength to cells and tissues by acting as a cross-linking element of the cytoskeleton. To set the basis for future studies on gene regulation, tissue-specific expression, and pathological conditions involving this protein, we have cloned the human plectin gene, determined its coding sequence, and established its genomic organization. The coding sequence contains 32 exons that extend over 32 kb of the human genome. Most of the introns reside within a region encoding the globular N-terminal domain of the molecule, whereas the entire central rod domain and the entire C-terminal globular domain were found to be encoded by single exons of remarkable length, >3 kb and >6 kb, respectively. Overall, the organization of the human plectin gene was strikingly similar to that of human bullous pemphigoid antigen 1 (BPAG1), confirming that both proteins belong to the same gene family. Comparison of the deduced protein sequences for human and rat plectin revealed that they were 93% identical. By using fluorescence in situ hybridization, we have mapped the plectin gene to the long arm of chromosome 8 within the telomeric region. This gene locus (8q24) has previously been implicated in the human blistering skin disease epidermolysis bullosa simplex Ogna. Detailed knowledge of the structure of the plectin gene and its chromosome localization will aid in the elucidation of whether this or any other pathological conditions are linked to alterations in the plectin gene.
Resumo:
The expression of at least 24 distinct genes of Pseudomonas aeruginosa PAO1 is under direct control of the "ferric uptake regulator" (Fur). Novel targets of the Fur protein were isolated in a powerful SELEX (systematic evolution of ligands by exponential enrichment)-like cycle selection consisting of in vitro DNA-Fur interaction, binding to anti-Fur antibody, purification on protein G, and PCR amplification. DNA fragments obtained after at least three exponential enrichment cycles were cloned and subjected to DNA mobility-shift assays and DNase I footprint analyses to verify the specific interaction with the Fur protein in vitro. Iron-dependent expression of the corresponding genes in vivo was monitored by RNase protection analysis. In total, 20 different DNA fragments were identified which represent actual Pseudomonas iron-regulated genes (PIGs). While four PIGs are identical to already known genes (pfeR, pvdS, tonB, and fumC, respectively), 16 PIGs represent previously unknown genes. Homology studies of the putative proteins encoded by the PIGs allowed us to speculate about their possible function. Two PIG products were highly similar to siderophore receptors from various species, and three PIG products were significantly homologous to alternative sigma factors. Furthermore, homologs of the Escherichia coli ORF1-tolQ, nuoA, stringent starvation protein Ssp, and of a two-component regulatory system similar to the Pseudomonas syringae LemA sensor kinase were identified. The putative gene products of seven additional PIGs did not show significant homologies to any known proteins. The PIGs were mapped on the P.aeruginosa chromosome. Their possible role in iron metabolism and virulence of P. aeruginosa is discussed.
Resumo:
With global heavy metal contamination increasing, plants that can process heavy metals might provide efficient and ecologically sound approaches to sequestration and removal. Mercuric ion reductase, MerA, converts toxic Hg2+ to the less toxic, relatively inert metallic mercury (Hg0) The bacterial merA sequence is rich in CpG dinucleotides and has a highly skewed codon usage, both of which are particularly unfavorable to efficient expression in plants. We constructed a mutagenized merA sequence, merApe9, modifying the flanking region and 9% of the coding region and placing this sequence under control of plant regulatory elements. Transgenic Arabidopsis thaliana seeds expressing merApe9 germinated, and these seedlings grew, flowered, and set seed on medium containing HgCl2 concentrations of 25-100 microM (5-20 ppm), levels toxic to several controls. Transgenic merApe9 seedlings evolved considerable amounts of Hg0 relative to control plants. The rate of mercury evolution and the level of resistance were proportional to the steady-state mRNA level, confirming that resistance was due to expression of the MerApe9 enzyme. Plants and bacteria expressing merApe9 were also resistant to toxic levels of Au3+. These and other data suggest that there are potentially viable molecular genetic approaches to the phytoremediation of metal ion pollution.
Resumo:
PCR amplification of template DNAs extracted from mixed, naturally occurring microbial populations, using oligonucleotide primers complementary to highly conserved sequences, was used to obtain a large collection of diverse RNase P RNA-encoding genes. An alignment of these sequences was used in a comparative analysis of RNase P RNA secondary and tertiary structure. The new sequences confirm the secondary structure model based on sequences from cultivated organisms (with minor alterations in helices P12 and P18), providing additional support for nearly every base pair. Analysis of sequence covariation using the entire RNase P RNA data set reveals elements of tertiary structure in the RNA; the third nucleotides (underlined) of the GNRA tetraloops L14 and L18 are seen to interact with adjacent Watson-Crick base pairs in helix P8, forming A:G/C or G:A/U base triples. These experiments demonstrate one way in which the enormous diversity of natural microbial populations can be used to elucidate molecular structure through comparative analysis.
Resumo:
An entire gene encoding wheat (var. Hard Red Winter Tam 107) acetyl-CoA carboxylase [ACCase; acetyl-CoA:carbon-dioxide ligase (ADP-forming), EC 6.4.1.2] has been cloned and sequenced. Comparison of the 12-kb genomic sequence with the 7.4-kb cDNA sequence reported previously revealed 29 introns. Within the coding region, the exon sequence is 98% identical to the known wheat cDNA sequence. A second ACCase gene was identified by sequencing fragments of genomic clones that include the first two exons and the first intron. Additional transcripts were detected by 5' and 3' RACE analysis (rapid amplification of cDNA ends). One set of transcripts had a 5' end sequence identical to the cDNA found previously and another set was identical to the gene reported here. The 3' RACE clones fall into four distinguishable sequence sets, bringing the number of ACCase sequences to six. None of these cDNA or genomic clones encodes a chloroplast targeting signal. Identification of six different sequences suggests that either the cytosolic ACCase genes are duplicated in the three chromosome sets in hexaploid wheat or that each of the six alleles of the cytosolic ACCase gene has a readily distinguishable DNA sequence.
Resumo:
The structure of the small hepatitis B virus surface antigen (HBsAg) was investigated by epitope mapping of four anti-HBsAg monoclonal antibodies (mAbs). Amino acid sequences of epitopes were derived from affinity-enrichment experiments (biopanning) using a filamentous phage peptide library. The library consists of 10(9) different clones bearing a 30-residue peptide fused to gene III. Sequence homologies between peptides obtained from panning the library against the antibodies and the native HBsAg sequence allowed for precise description of the binding regions. Three of four mAbs were found to bind to distinct discontinuous epitopes between amino acid residues 101 and 207 of HBsAg. The fourth mAb was demonstrated to bind to residues 121-124. The sequence data are supported by ELISA assays demonstrating the binding of the HBsAg-specific peptides on filamentous phage to mAbs. The sequence data were used to map the surface of HBsAg and to derive a topological model for the alpha-carbon trace of the 101-207 region of HBsAg. The approach should be useful for other proteins for which the crystal structure is not available but a representative set of mAbs can be obtained.
Resumo:
Pathogenic yersiniae secrete a set of antihost proteins, called Yops, by a type III secretion mechanism. Upon infection of cultured epithelial cells, extracellular Yersinia pseudotuberculosis and Yersinia enterocolitica translocate cytotoxin YopE across the host cell plasma membrane. Several lines of evidence suggest that tyrosine phosphatase YopH follows the same pathway. We analyzed internalization of YopE and YopH into murine PU5-1.8 macrophages by using recombinant Y. enterocolitica producing truncated YopE and YopH proteins fused to a calmodulin-dependent adenylate cyclase. The YopE-cyclase and YopH-cyclase hybrids were readily secreted by Y. enterocolitica. The N-terminal domain required for secretion was not longer than 15 residues of YopE and 17 residues of YopH. Internalization into eukaryotic cells, revealed by cAMP production, only required the N-terminal 50 amino acid residues of YopE and the N-terminal 71 amino acid residues of YopH. YopE and YopH are thus modular proteins composed of a secretion domain, a translocation domain, and an effector domain. Translocation of YopE and YopH across host cell's membranes was also dependent on the secretion of YopB and YopD by the same bacterium. The cyclase fusion approach could be readily extended to study the fate of other proteins secreted by invasive bacterial pathogens.