69 results for Molecular biology|Microbiology|Oceanography
Abstract:
We have determined the three-dimensional structure of the protein complex between latexin and carboxypeptidase A using a combination of chemical cross-linking, mass spectrometry and molecular docking. The locations of three intermolecular cross-links were identified using mass spectrometry and these constraints were used in combination with a speed-optimised docking algorithm allowing us to evaluate more than 3 × 10^11 possible conformations. While cross-links represent only limited structural constraints, the combination of only three experimental cross-links with very basic molecular docking was sufficient to determine the complex structure. The crystal structure of the complex between latexin and carboxypeptidase A4 determined recently allowed us to assess the success of this structure determination approach. Our structure was shown to be within 4 Å r.m.s. deviation of Cα atoms of the crystal structure. The study demonstrates that cross-linking in combination with mass spectrometry can lead to efficient and accurate structural modelling of protein complexes.
Abstract:
Molecular modelling of human CYP1B1 based on homology with the mammalian P450, CYP2C5, of known three-dimensional structure is reported. The enzyme model has been used to investigate the likely mode of binding for selected CYP1B1 substrates, particularly with regard to the possible effects of allelic variants of CYP1B1 on metabolism. In general, it appears that the CYP1B1 model is consistent with known substrate selectivity for the enzyme, and the sites of metabolism can be rationalized in terms of specific contacts with key amino acid residues within the CYP1B1 heme locus. Furthermore, a mode of binding interaction for the inhibitor, α-naphthoflavone, is presented which accords with currently available information. The current paper shows that a combination of molecular modelling and experimental determinations on the substrate metabolism for CYP1B1 allelic variants can aid in the understanding of structure-function relationships within P450 enzymes. (C) 2003 Elsevier Science Ireland Ltd. All rights reserved.
Abstract:
The cell cycle is one of the most fundamental processes within a cell. Phase-dependent expression and cell-cycle checkpoints require a high level of control. A large number of genes with varying functions and modes of action are responsible for this biology. In a targeted exploration of the FANTOM2-Variable Protein Set, a number of mouse homologs to known cell-cycle regulators as well as novel members of cell-cycle families were identified. Focusing on two prototype cell-cycle families, the cyclins and the NIMA-related kinases (NEKs), we believe we have identified all of the mouse members of these families, 24 cyclins and 10 NEKs, and mapped them to ENSEMBL transcripts. To attempt to globally identify all potential cell cycle-related genes within mouse, the MGI (Mouse Genome Database) assignments for the RIKEN Representative Set (RPS) and the results from two homology-based queries were merged. We identified 1415 genes with possible cell-cycle roles, and 1758 potential paralogs. We comment on the genes identified in this screen and evaluate the merits of each approach.
Abstract:
This article investigates the expression patterns of 160 genes that are expressed during early mouse development. The cDNAs were isolated from 7.5 d postcoitum (dpc) endoderm, a region that comprises visceral endoderm (VE), definitive endoderm, and the node, tissues that are required for the initial steps of axial specification and tissue patterning in the mouse. To avoid examining the same gene more than once, and to exclude potentially ubiquitously expressed housekeeping genes, cDNA sequence was derived from 1978 clones of the Endoderm library. These yielded 1440 distinct cDNAs, of which 123 proved to be novel in the mouse. In situ hybridization analysis was carried out on 160 of the cDNAs, and of these, 29 (18%) proved to have restricted expression patterns.
Abstract:
The number of known mRNA transcripts in the mouse has been greatly expanded by the RIKEN Mouse Gene Encyclopedia project. Validation of their reproducible expression in a tissue is an important contribution to the study of functional genomics. In this report, we determine the expression profile of 57,931 clones on 20 mouse tissues using cDNA microarrays. Of these 57,931 clones, 22,928 clones correspond to the FANTOM2 clone set. The set represents 20,234 transcriptional units (TUs) out of 33,409 TUs in the FANTOM2 set. We identified 7206 separate clones that satisfied stringent criteria for tissue-specific expression. Gene Ontology terms were assigned for these 7206 clones, and the proportion of 'molecular function' ontology for each tissue-specific clone was examined. These data will provide insights into the function of each tissue. Tissue-specific gene expression profiles obtained using our cDNA microarrays were also compared with the data extracted from the GNF Expression Atlas based on Affymetrix microarrays. One major outcome of the RIKEN transcriptome analysis is the identification of numerous nonprotein-coding mRNAs. The expression profile was also used to obtain evidence of expression for putative noncoding RNAs. In addition, 1926 clones (70%) of 2768 clones that were categorized as unknown EST, and 1969 (58%) clones of 3388 clones that were categorized as unclassifiable were also shown to be reproducibly expressed.
Abstract:
We report the construction of the mouse full-length cDNA encyclopedia, the most extensive view of a complex transcriptome, on the basis of preparing and sequencing 246 libraries. Before cloning, cDNAs were enriched in full-length by Cap-Trapper, and in most cases, aggressively subtracted/normalized. We have produced 1,442,236 successful 3'-end sequences clustered into 171,144 groups, from which 60,770 clones were fully sequenced cDNAs annotated in the FANTOM-2 annotation. We have also produced 547,149 5'-end reads, which clustered into 124,258 groups. Altogether, these cDNAs were further grouped in 70,000 transcriptional units (TU), which represent the best coverage of a transcriptome so far. By monitoring the extent of normalization/subtraction, we define the tentative equivalent coverage (TEC), which was estimated to be equivalent to >12,000,000 ESTs derived from standard libraries. High coverage explains discrepancies between the very large numbers of clusters (and TUs) of this project, which also include non-protein-coding RNAs, and the lower gene number estimation of genome annotations. Altogether, 5'-end clusters identify regions that are potential promoters for 8637 known genes and 5'-end clusters suggest the presence of almost 63,000 transcriptional starting points. An estimate of the frequency of polyadenylation signals suggests that at least half of the singletons in the EST set represent real mRNAs. Clones accounting for about half of the predicted TUs await further sequencing. The continued high-discovery rate suggests that the task of transcriptome discovery is not yet complete.
Abstract:
Manual curation has long been held to be the gold standard for functional annotation of DNA sequence. Our experience with the annotation of more than 20,000 full-length cDNA sequences revealed problems with this approach, including inaccurate and inconsistent assignment of gene names, as well as many good assignments that were difficult to reproduce using only computational methods. For the FANTOM2 annotation of more than 60,000 cDNA clones, we developed a number of methods and tools to circumvent some of these problems, including an automated annotation pipeline that provides high-quality preliminary annotation for each sequence by introducing an uninformative filter that eliminates uninformative annotations, controlled vocabularies to accurately reflect both the functional assignments and the evidence supporting them, and a highly refined, Web-based manual annotation tool that allows users to view a wide array of sequence analyses and to assign gene names and putative functions using a consistent nomenclature. The ultimate utility of our approach is reflected in the low rate of reassignment of automated assignments by manual curation. Based on these results, we propose a new standard for large-scale annotation, in which the initial automated annotations are manually investigated and then computational methods are iteratively modified and improved based on the results of manual curation.
Abstract:
With the sequencing and annotation of genomes and transcriptomes of several eukaryotes, the importance of noncoding RNA (ncRNA), that is, RNA molecules that are not translated to protein products, has become more evident. A subclass of ncRNA transcripts are encoded by highly regulated, multi-exon, transcriptional units, are processed like typical protein-coding mRNAs and are increasingly implicated in regulation of many cellular functions in eukaryotes. This study describes the identification of candidate functional ncRNAs from among the RIKEN mouse full-length cDNA collection, which contains 60,770 sequences, by using a systematic computational filtering approach. We initially searched for previously reported ncRNAs and found nine murine ncRNAs and homologs of several previously described nonmouse ncRNAs. Through our computational approach to filter artifact-free clones that lack protein coding potential, we extracted 4280 transcripts as the largest candidate set. Many clones in the set had EST hits, potential CpG islands surrounding the transcription start sites, and homologies with the human genome. This implies that many candidates are indeed transcribed in a regulated manner. Our results demonstrate that ncRNAs are a major functional subclass of processed transcripts in mammals.
Abstract:
The current RIKEN transcript set represents a significant proportion of the mouse transcriptome but transcripts expressed in the innate and acquired immune systems are poorly represented. In the present study we have assessed the complexity of the transcriptome expressed in mouse macrophages before and after treatment with lipopolysaccharide, a global regulator of macrophage gene expression, using existing RIKEN 19K arrays. By comparison to array profiles of other cells and tissues, we identify a large set of macrophage-enriched genes, many of which have obvious functions in endocytosis and phagocytosis. In addition, a significant number of LPS-inducible genes were identified. The data suggest that macrophages are a complex source of mRNA for transcriptome studies. To assess complexity and identify additional macrophage expressed genes, cDNA libraries were created from purified populations of macrophage and dendritic cells, a functionally related cell type. Sequence analysis revealed a high incidence of novel mRNAs within these cDNA libraries. These studies provide insights into the depths of transcriptional complexity still untapped amongst products of inducible genes, and identify macrophage and dendritic cell populations as a starting point for sampling the inducible mammalian transcriptome.
Abstract:
We analyzed the FANTOM2 clone set of 60,770 RIKEN full-length mouse cDNA sequences and 44,122 public mRNA sequences. We developed a new computational procedure to identify and classify the forms of splice variation evident in this data set and organized the results into a publicly accessible database that can be used for future expression array construction, structural genomics, and analyses of the mechanism and regulation of alternative splicing. Statistical analysis shows that at least 41% and possibly as much as 60% of multiexon genes in mouse have multiple splice forms. Of the transcription units with multiple splice forms, 49% contain transcripts in which the apparent use of an alternative transcription start (stop) is accompanied by alternative splicing of the initial (terminal) exon. This implies that alternative transcription may frequently induce alternative splicing. The fact that 73% of all exons with splice variation fall within the annotated coding region indicates that most splice variation is likely to affect the protein form. Finally, we compared the set of constitutive (present in all transcripts) exons with the set of cryptic (present only in some transcripts) exons and found statistically significant differences in their length distributions, the nucleotide distributions around their splice junctions, and the frequencies of occurrence of several short sequence motifs.
Abstract:
The landscape of the Australian Wet Tropics can be described as islands of montane rainforest surrounded by warmer or more xeric habitats. Historical glaciation cycles have caused expansion and contraction of these rainforest islands leading to consistent patterns of genetic divergence within species of vertebrates. To explore whether this dynamic history has promoted speciation in endemic and diverse groups of insects, we used a combination of mtDNA sequencing and morphological characters to estimate relationships and the tempo of divergence among Australian representatives of the dung beetle genus Temnoplectron. This phylogenetic hypothesis shares a number of well-supported clades with a previously published phylogenetic hypothesis based on morphological data, though statistical support for several nodes is weak. Sister species relationships well-supported in both tree topologies, and a tree obtained by combining the two data sets, suggest that speciation has mostly been allopatric. We identify a number of speciation barriers, which coincide with phylogeographic breaks found in vertebrate species. Large sequence divergences between species emphasize that speciation events are ancient (pre-Pleistocene). The flightless, rainforest species appear to have speciated rapidly, but also in the distant past. (C) 2003 Elsevier Inc. All rights reserved.
Abstract:
Mutations of the MEN1 gene, encoding the tumor suppressor menin, predispose individuals to the cancer syndrome multiple endocrine neoplasia type 1, characterized by the development of tumors of the endocrine pancreas and anterior pituitary and parathyroid glands. We have targeted the murine Men1 gene by using Cre recombinase-loxP technology to develop both total and tissue-specific knockouts of the gene. Conditional homozygous inactivation of the Men1 gene in the pituitary gland and endocrine pancreas bypasses the embryonic lethality associated with a constitutional Men1(-/-) genotype and leads to beta-cell hyperplasia in less than 4 months and insulinomas and prolactinomas starting at 9 months. The pituitary gland and pancreas develop normally in the conditional absence of menin, but loss of this transcriptional cofactor is sufficient to cause beta-cell hyperplasia in some islets; however, such loss is not sufficient to initiate pituitary gland tumorigenesis, suggesting that additional genetic events are necessary for the latter.
Abstract:
To examine the effects of recent habitat fragmentation, we assayed genetic diversity in a rain forest endemic lizard, the prickly forest skink (Gnypetoscincus queenslandiae), from seven forest fragments and five sites in continuous forest on the Atherton tableland of northeastern Queensland, Australia. The rain forest in this region was fragmented by logging and clearing for dairy farms in the early 1900s and most forest fragments studied have been isolated for 50-80 years or nine to 12 skink generations. We genotyped 411 individuals at nine microsatellite DNA loci and found fewer alleles per locus in prickly forest skinks from small rain forest fragments and a lower ratio of allele number to allele size range in forest fragments than in continuous forest, indicative of a decrease in effective population size. In contrast, and as expected for populations with small neighbourhood sizes, neither heterozygosity nor variance in allele size differed between fragments and sites in continuous forests. Considering measures of among population differentiation, there was no increase in F_ST among fragments and a significant isolation by distance pattern was identified across all 12 sites. However, the relationship between genetic (F_ST) and geographical distance was significantly stronger for continuous forest sites than for fragments, consistent with disruption of gene flow among the latter. The observed changes in genetic diversity within and among populations are small, but in the direction predicted by the theory of genetic erosion in recently fragmented populations. The results also illustrate the inherent difficulty in detecting genetic consequences of recent habitat fragmentation, even in genetically variable species, and especially when effective population size and dispersal rates are low.
Abstract:
Clustering of the T cell integrin, LFA-1, at specialized regions of intercellular contact initiates integrin-mediated adhesion and downstream signaling, events that are necessary for a successful immunological response. But how clustering is achieved and sustained is not known. Here we establish that an LFA-1-associated molecule, PTA-1, is localized to membrane rafts and binds the carboxyl-terminal domain of isoforms of the actin-binding protein 4.1G. Protein 4.1 is known to associate with the membrane-associated guanylate kinase homologue, human discs large. We show that the carboxyl-terminal peptide of PTA-1 also can bind human discs large and that the presence or absence of this peptide greatly influences binding between PTA-1 and different isoforms of 4.1G. T cell stimulation with phorbol ester or PTA-1 cross-linking induces PTA-1 and 4.1G to associate tightly with the cytoskeleton, and the PTA-1 from such activated cells now can bind to the amino-terminal region of 4.1G. We propose that these dynamic associations provide the structural basis for a regulated molecular adhesive complex that serves to cluster and transport LFA-1 and associated molecules.