903 resultados para Human genome - Theses
Resumo:
The Mouse Tumor Biology (MTB) Database serves as a curated, integrated resource for information about tumor genetics and pathology in genetically defined strains of mice (i.e., inbred, transgenic and targeted mutation strains). Sources of information for the database include the published scientific literature and direct data submissions by the scientific community. Researchers access MTB using Web-based query forms and can use the database to answer such questions as ‘What tumors have been reported in transgenic mice created on a C57BL/6J background?’, ‘What tumors in mice are associated with mutations in the Trp53 gene?’ and ‘What pathology images are available for tumors of the mammary gland regardless of genetic background?’. MTB has been available on the Web since 1998 from the Mouse Genome Informatics web site (http://www.informatics.jax.org). We have recently implemented a number of enhancements to MTB including new query options, redesigned query forms and results pages for pathology and genetic data, and the addition of an electronic data submission and annotation tool for pathology data.
Resumo:
The ARKdb genome databases provide comprehensive public repositories for genome mapping data from farmed species and other animals (http://www.thearkdb.org) providing a resource similar in function to that offered by GDB or MGD for human or mouse genome mapping data, respectively. Because we have attempted to build a generic mapping database, the system has wide utility, particularly for those species for which development of a specific resource would be prohibitive. The ARKdb genome database model has been implemented for 10 species to date. These are pig, chicken, sheep, cattle, horse, deer, tilapia, cat, turkey and salmon. Access to the ARKdb databases is effected via the World Wide Web using the ARKdb browser and Anubis map viewer. The information stored includes details of loci, maps, experimental methods and the source references. Links to other information sources such as PubMed and EMBL/GenBank are provided. Responsibility for data entry and curation is shared amongst scientists active in genome research in the species of interest. Mirror sites in the United States are maintained in addition to the central genome server at Roslin.
Resumo:
Familial structural rearrangements of chromosomes represent a factor of malformation risk that could vary over a large range, making genetic counseling difficult. However, they also represent a powerful tool for increasing knowledge of the genome, particularly by studying breakpoints and viable imbalances of the genome. We have developed a collaborative database that now includes data on more than 4100 families, from which we have developed a web site called HC Forum® (http://HCForum.imag.fr). It offers geneticists assistance in diagnosis and in genetic counseling by assessing the malformation risk with statistical models. For researchers, interactive interfaces exhibit the distribution of chromosomal breakpoints and of the genome regions observed at birth in trisomy or in monosomy. Dedicated tools including an interactive pedigree allow electronic submission of data, which will be anonymously shown in a forum for discussions. After validation, data are definitively registered in the database with the email of the sender, allowing direct location of biological material. Thus HC Forum® constitutes a link between diagnosis laboratories and genome research centers, and after 1 year, more than 700 users from about 40 different countries already exist.
Resumo:
The Plasmodium falciparum Genome Database (http://PlasmoDB.org) integrates sequence information, automated analyses and annotation data emerging from the P.falciparum genome sequencing consortium. To date, raw sequence coverage is available for >90% of the genome, and two chromosomes have been finished and annotated. Data in PlasmoDB are organized by chromosome (1–14), and can be accessed using a variety of tools for graphical and text-based browsing or downloaded in various file formats. The GUS (Genomics Unified Schema) implementation of PlasmoDB provides a multi-species genomic relational database, incorporating data from human and mouse, as well as P.falciparum. The relational schema uses a highly structured format to accommodate diverse data sets related to genomic sequence and gene expression. Tools have been designed to facilitate complex biological queries, including many that are specific to Plasmodium parasites and malaria as a disease. Additional projects seek to integrate genomic information with the rich data sets now becoming available for RNA transcription, protein expression, metabolic pathways, genetic and physical mapping, antigenic and population diversity, and phylogenetic relationships with other apicomplexan parasites. The overall goal of PlasmoDB is to facilitate Internet- and CD-ROM-based access to both finished and unfinished sequence information by the global malaria research community.
Resumo:
The uptake and expression of extracellular DNA has been established as a mechanism for horizontal transfer of genes between bacterial species. Such transfer can support acquisition of advantageous elements, including determinants that affect the interactions between infectious organisms and their hosts. Here we show that erythrocyte-stage Plasmodium falciparum malaria parasites spontaneously take up DNA from the host cell cytoplasm into their nuclei. We have exploited this finding to produce levels of reporter expression in P.falciparum that are substantially improved over those obtained by electroporation protocols currently used to transfect malaria parasites. Parasites were transformed to a drug-resistant state when placed into cell culture with erythrocytes containing a plasmid encoding the human dihydrofolate reductase sequence. The findings reported here suggest that the malaria genome may be continually exposed to exogenous DNA from residual nuclear material in host erythrocytes.
Resumo:
We performed a genome-wide analysis of gene expression in primary human CD15+ myeloid progenitor cells. By using the serial analysis of gene expression (SAGE) technique, we obtained quantitative information for the expression of 37,519 unique SAGE-tag sequences. Of these unique tags, (i) 25% were detected at high and intermediate levels, whereas 75% were present as single copies, (ii) 53% of the tags matched known expressed sequences, 34% of which were matched to more than one known expressed sequence, and (iii) 47% of the tags had no matches and represent potentially novel genes. The correct genes were confirmed by application of the generation of longer cDNA fragments from SAGE tags for gene identification (GLGI) technique for high-copy tags with multiple matches. A set of genes known to be important in myeloid differentiation were expressed at various levels and used different spliced forms. This study provides a normal baseline for comparison of gene expression in myeloid diseases. The strategy of using SAGE and GLGI techniques in this study has broad applications to the genome-wide identification of expressed genes.
Resumo:
We report here that the DNA-dependent protein kinase (DNA-PK) affects the molecular fate of the recombinant adeno-associated virus (rAAV) genome in skeletal muscle. rAAV-human α1-antitrypsin (rAAV-hAAT) vectors were delivered by intramuscular injection to either C57BL/6 (DNA-PKcs+) or C57BL/6-SCID [severe combined immunodeficient (SCID), DNA-PKcs−] mice. In both strains, high levels of transgene expression were sustained for up to 1 year after a single injection. Southern blot analysis showed that rAAV genomes persisted as linear episomes for more than 1 year in SCID mice, whereas only circular episomal forms were observed in the C57BL/6 strain. These results indicate that DNA-PK is involved in the formation of circular rAAV episomes.
Resumo:
Ehrlichiae are responsible for important tick-transmitted diseases, including anaplasmosis, the most prevalent tick-borne infection of livestock worldwide, and the emerging human diseases monocytic and granulocytic ehrlichiosis. Antigenic variation of major surface proteins is a key feature of these pathogens that allows persistence in the mammalian host, a requisite for subsequent tick transmission. In Anaplasma marginale pseudogenes for two antigenically variable gene families, msp2 and msp3, appear in concert. These pseudogenes can be recombined into the functional expression site to generate new antigenic variants. Coordinated control of the recombination of these genes would allow these two gene families to act synergistically to evade the host immune response.
Resumo:
The complete genome sequence of Caulobacter crescentus was determined to be 4,016,942 base pairs in a single circular chromosome encoding 3,767 genes. This organism, which grows in a dilute aquatic environment, coordinates the cell division cycle and multiple cell differentiation events. With the annotated genome sequence, a full description of the genetic network that controls bacterial differentiation, cell growth, and cell cycle progression is within reach. Two-component signal transduction proteins are known to play a significant role in cell cycle progression. Genome analysis revealed that the C. crescentus genome encodes a significantly higher number of these signaling proteins (105) than any bacterial genome sequenced thus far. Another regulatory mechanism involved in cell cycle progression is DNA methylation. The occurrence of the recognition sequence for an essential DNA methylating enzyme that is required for cell cycle regulation is severely limited and shows a bias to intergenic regions. The genome contains multiple clusters of genes encoding proteins essential for survival in a nutrient poor habitat. Included are those involved in chemotaxis, outer membrane channel function, degradation of aromatic ring compounds, and the breakdown of plant-derived carbon sources, in addition to many extracytoplasmic function sigma factors, providing the organism with the ability to respond to a wide range of environmental fluctuations. C. crescentus is, to our knowledge, the first free-living α-class proteobacterium to be sequenced and will serve as a foundation for exploring the biology of this group of bacteria, which includes the obligate endosymbiont and human pathogen Rickettsia prowazekii, the plant pathogen Agrobacterium tumefaciens, and the bovine and human pathogen Brucella abortus.
Resumo:
The poly(A)-binding protein (PABP) recognizes the 3′ mRNA poly(A) tail and plays an essential role in eukaryotic translation initiation and mRNA stabilization/degradation. PABP is a modular protein, with four N-terminal RNA-binding domains and an extensive C terminus. The C-terminal region of PABP is essential for normal growth in yeast and has been implicated in mediating PABP homo-oligomerization and protein–protein interactions. A small, proteolytically stable, highly conserved domain has been identified within this C-terminal segment. Remarkably, this domain is also present in the hyperplastic discs protein (HYD) family of ubiquitin ligases. To better understand the function of this conserved region, an x-ray structure of the PABP-like segment of the human HYD protein has been determined at 1.04-Å resolution. The conserved domain adopts a novel fold resembling a right-handed supercoil of four α-helices. Sequence profile searches and comparative protein structure modeling identified a small ORF from the Arabidopsis thaliana genome that encodes a structurally similar but distantly related PABP/HYD domain. Phylogenetic analysis of the experimentally determined (HYD) and homology modeled (PABP) protein surfaces revealed a conserved feature that may be responsible for binding to a PABP interacting protein, Paip1, and other shared interaction partners.
Resumo:
The 1,852,442-bp sequence of an M1 strain of Streptococcus pyogenes, a Gram-positive pathogen, has been determined and contains 1,752 predicted protein-encoding genes. Approximately one-third of these genes have no identifiable function, with the remainder falling into previously characterized categories of known microbial function. Consistent with the observation that S. pyogenes is responsible for a wider variety of human disease than any other bacterial species, more than 40 putative virulence-associated genes have been identified. Additional genes have been identified that encode proteins likely associated with microbial “molecular mimicry” of host characteristics and involved in rheumatic fever or acute glomerulonephritis. The complete or partial sequence of four different bacteriophage genomes is also present, with each containing genes for one or more previously undiscovered superantigen-like proteins. These prophage-associated genes encode at least six potential virulence factors, emphasizing the importance of bacteriophages in horizontal gene transfer and a possible mechanism for generating new strains with increased pathogenic potential.
Resumo:
We have shown that the DNA demethylation complex isolated from chicken embryos has a G⋅T mismatch DNA glycosylase that also possesses 5-methylcytosine DNA glycosylase (5-MCDG) activity. Herein we show that human embryonic kidney cells stably transfected with 5-MCDG cDNA linked to a cytomegalovirus promoter overexpress 5-MCDG. A 15- to 20-fold overexpression of 5-MCDG results in the specific demethylation of a stably integrated ecdysone-retinoic acid responsive enhancer-promoter linked to a β-galactosidase reporter gene. Demethylation occurs in the absence of the ligand ponasterone A (an analogue of ecdysone). The state of methylation of the transgene was investigated by Southern blot analysis and by the bisulfite genomic sequencing reaction. Demethylation occurs downstream of the hormone response elements. No genome-wide demethylation was observed. The expression of an inactive mutant of 5-MCDG or the empty vector does not elicit any demethylation of the promoter-enhancer of the reporter gene. An increase in 5-MCDG activity does not influence the activity of DNA methyltransferase(s) when tested in vitro with a hemimethylated substrate. There is no change in the transgene copy number during selection of the clones with antibiotics. Immunoprecipitation combined with Western blot analysis showed that an antibody directed against 5-MCDG precipitates a complex containing the retinoid X receptor α. The association between retinoid receptor and 5-MCDG is not ligand dependent. These results suggest that a complex of the hormone receptor with 5-MCDG may target demethylation of the transgene in this system.
Comprehensive copy number and gene expression profiling of the 17q23 amplicon in human breast cancer
Resumo:
The biological significance of DNA amplification in cancer is thought to be due to the selection of increased expression of a single or few important genes. However, systematic surveys of the copy number and expression of all genes within an amplified region of the genome have not been performed. Here we have used a combination of molecular, genomic, and microarray technologies to identify target genes for 17q23, a common region of amplification in breast cancers with poor prognosis. Construction of a 4-Mb genomic contig made it possible to define two common regions of amplification in breast cancer cell lines. Analysis of 184 primary breast tumors by fluorescence in situ hybridization on tissue microarrays validated these results with the highest amplification frequency (12.5%) observed for the distal region. Based on GeneMap'99 information, 17 known genes and 26 expressed sequence tags were localized to the contig. Analysis of genomic sequence identified 77 additional transcripts. A comprehensive analysis of expression levels of these transcripts in six breast cancer cell lines was carried out by using complementary DNA microarrays. The expression patterns varied from one cell line to another, and several overexpressed genes were identified. Of these, RPS6KB1, MUL, APPBP2, and TRAP240 as well as one uncharacterized expressed sequence tag were located in the two common amplified regions. In summary, comprehensive analysis of the 17q23 amplicon revealed a limited number of highly expressed genes that may contribute to the more aggressive clinical course observed in breast cancer patients with 17q23-amplified tumors.
Resumo:
The recurrent t(1;22)(p13;q13) translocation is exclusively associated with infant acute megakaryoblastic leukemia. We have identified the two genes involved in this translocation. Both genes possess related sequences in the Drosophila genome. The chromosome 22 gene (megakaryocytic acute leukemia, MAL) product is predicted to be involved in chromatin organization, and the chromosome 1 gene (one twenty-two, OTT) product is related to the Drosophila split-end (spen) family of proteins. Drosophila genetic experiments identified spen as involved in connecting the Raf and Hox pathways. Because almost all of the sequences and all of the identified domains of both OTT and MAL proteins are included in the predicted fusion protein, the OTT-MAL fusion could aberrantly modulate chromatin organization, Hox differentiation pathways, or extracellular signaling.
Resumo:
The genetic basis for virulence in influenza virus is largely unknown. To explore the mutational basis for increased virulence in the lung, the H3N2 prototype clinical isolate, A/HK/1/68, was adapted to the mouse. Genomic sequencing provided the first demonstration, to our knowledge, that a group of 11 mutations can convert an avirulent virus to a virulent variant that can kill at a minimal dose. Thirteen of the 14 amino acid substitutions (93%) detected among clonal isolates were likely instrumental in adaptation because of their positive selection, location in functional regions, and/or independent occurrence in other virulent influenza viruses. Mutations in virulent variants repeatedly involved nuclear localization signals and sites of protein and RNA interaction, implicating them as novel modulators of virulence. Mouse-adapted variants with the same hemagglutinin mutations possessed different pH optima of fusion, indicating that fusion activity of hemagglutinin can be modulated by other viral genes. Experimental adaptation resulted in the selection of three mutations that were in common with the virulent human H5N1 isolate A/HK/156/97 and that may be instrumental in its extreme virulence. Analysis of viral adaptation by serial passage appears to provide the identification of biologically relevant mutations.