201 results for Human Genome Project.
Abstract:
Somatic copy number aberrations (CNA) represent a mutation type encountered in the majority of cancer genomes. Here, we present the 2014 edition of arrayMap (http://www.arraymap.org), a publicly accessible collection of pre-processed oncogenomic array data sets and CNA profiles, representing a vast range of human malignancies. Since the initial release, we have enhanced this resource both in content and especially with regard to data mining support. The 2014 release of arrayMap contains more than 64,000 genomic array data sets, representing about 250 tumor diagnoses. Data sets included in arrayMap have been assembled from public repositories as well as additional resources, and integrated by applying custom processing pipelines. Online tools have been upgraded for a more flexible array data visualization, including options for processing user provided, non-public data sets. Data integration has been improved by mapping to multiple editions of the human reference genome, with the majority of the data now being available for the UCSC hg18 as well as GRCh37 versions. The large amount of tumor CNA data in arrayMap can be freely downloaded by users to promote data mining projects, and to explore special events such as chromothripsis-like genome patterns.
Abstract:
Human and chimpanzee genomes are 98.8% identical within comparable sequences. However, they differ structurally in nine pericentric inversions, one fusion that gave rise to human chromosome 2, and the content and localization of heterochromatin and lineage-specific segmental duplications. The possible functional consequences of these cytogenetic and structural differences are not fully understood, and their possible involvement in speciation remains unclear. We show that subtelomeric regions (regions that have a species-specific organization, are more divergent in sequence, and are enriched in genes and recombination hotspots) are significantly enriched for species-specific histone modifications that decorate transcription start sites in different tissues in both human and chimpanzee. The human lineage-specific chromosome 2 fusion point and ancestral centromere locus, as well as the chromosome 1 and 18 pericentric inversion breakpoints, showed enrichment of human-specific H3K4me3 peaks in the prefrontal cortex. Our results reveal an association between plastic regions and potential novel regulatory elements.
Abstract:
The lymphatic vascular system, the body's second vascular system present in vertebrates, has emerged in recent years as a crucial player in normal and pathological processes. It participates in the maintenance of normal tissue fluid balance, the immune functions of cellular and antigen trafficking, and the absorption of fatty acids and lipid-soluble vitamins in the gut. Recent scientific discoveries have highlighted the role of the lymphatic system in a number of pathologic conditions, including lymphedema, inflammatory diseases, and tumor metastasis. The development of genetically modified animal models and the identification of lymphatic endothelial-specific markers and regulators, coupled with technological advances such as high-resolution imaging and genome-wide approaches, have been instrumental in understanding the major steps controlling growth and remodeling of lymphatic vessels. This review highlights the recent insights and developments in the field of lymphatic vascular biology.
Abstract:
Human genetic variation contributes to differences in susceptibility to HIV-1 infection. To search for novel host resistance factors, we performed a genome-wide association study (GWAS) in hemophilia patients highly exposed to potentially contaminated factor VIII infusions. Individuals with hemophilia A and a documented history of factor VIII infusions before the introduction of viral inactivation procedures (1979-1984) were recruited from 36 hemophilia treatment centers (HTCs), and their genome-wide genetic variants were compared with those from matched HIV-infected individuals. Homozygous carriers of known CCR5 resistance mutations were excluded. Single nucleotide polymorphisms (SNPs) and inferred copy number variants (CNVs) were tested using logistic regression. In addition, we performed a pathway enrichment analysis, a heritability analysis, and a search for epistatic interactions with CCR5 Δ32 heterozygosity. A total of 560 HIV-uninfected cases were recruited: 36 (6.4%) were homozygous for CCR5 Δ32 or m303. After quality control and SNP imputation, we tested 1 081 435 SNPs and 3686 CNVs for association with HIV-1 serostatus in 431 cases and 765 HIV-infected controls. No SNP or CNV reached genome-wide significance. The additional analyses did not reveal any strong genetic effect. Highly exposed, yet uninfected hemophiliacs form an ideal study group to investigate host resistance factors. Using a genome-wide approach, we did not detect any significant associations between SNPs and HIV-1 susceptibility, indicating that common genetic variants of major effect are unlikely to explain the observed resistance phenotype in this population.
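As background for "genome-wide significance": the conventional cutoff of P < 5 × 10^-8 is often motivated as a Bonferroni correction for roughly one million independent tests. The Python sketch below is illustrative arithmetic only. The abstract does not state which threshold the authors applied, and the function and variable names are hypothetical. It applies the same correction to the 1,081,435 SNPs tested and re-derives the reported 6.4% from the stated counts.

```python
# Illustrative arithmetic only; not the study's analysis code.
# Assumption: a family-wise error rate of 0.05, Bonferroni-corrected.

ALPHA = 0.05          # assumed family-wise error rate
N_SNPS = 1_081_435    # SNPs tested, as reported in the abstract


def bonferroni_threshold(alpha: float, n_tests: int) -> float:
    """Per-test significance threshold under a Bonferroni correction."""
    if n_tests <= 0:
        raise ValueError("n_tests must be positive")
    return alpha / n_tests


snp_threshold = bonferroni_threshold(ALPHA, N_SNPS)

# 36 of the 560 recruited HIV-uninfected individuals were homozygous
# for CCR5 resistance mutations (reported as 6.4%).
homozygous_fraction = 36 / 560

print(f"Bonferroni threshold for {N_SNPS} tests: {snp_threshold:.2e}")
print(f"Homozygous carriers among recruited cases: {homozygous_fraction:.1%}")
```

The resulting per-SNP threshold is close to the conventional 5 × 10^-8, which is why the two are often used interchangeably for arrays of about one million markers.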
Abstract:
The major mood disorders, which include bipolar disorder and major depressive disorder (MDD), are considered heritable traits, although previous genetic association studies have had limited success in robustly identifying risk loci. We performed a meta-analysis of five case-control cohorts for major mood disorder, including over 13,600 individuals genotyped on high-density SNP arrays. We identified SNPs at 3p21.1 associated with major mood disorders (rs2251219, P = 3.63 × 10^-8; odds ratio = 0.87; 95% confidence interval, 0.83-0.92), with supportive evidence for association observed in two out of three independent replication cohorts. These results provide an example of a shared genetic susceptibility locus for bipolar disorder and MDD.
Abstract:
Adult height is a model polygenic trait, but there has been limited success in identifying the genes underlying its normal variation. To identify genetic variants influencing adult human height, we used genome-wide association data from 13,665 individuals and genotyped 39 variants in an additional 16,482 samples. We identified 20 variants associated with adult height (P < 5 × 10^-7, with 10 reaching P < 1 × 10^-10). Combined, the 20 SNPs explain approximately 3% of height variation, with an approximately 5 cm difference between the 6.2% of people with 17 or fewer 'tall' alleles and the 5.5% with 27 or more 'tall' alleles. The loci we identified implicate genes in Hedgehog signaling (IHH, HHIP, PTCH1), extracellular matrix (EFEMP1, ADAMTSL3, ACAN) and cancer (CDK6, HMGA2, DLEU7) pathways, and provide new insights into human growth and developmental processes. Finally, our results provide insights into the genetic architecture of a classic quantitative trait.
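The 'tall'-allele counts in this abstract can be read as a simple additive score. The Python sketch below assumes that each person carries 0, 1 or 2 'tall' alleles at each of the 20 variants, so the count ranges from 0 to 40. The function names and the genotype encoding are hypothetical and are not taken from the study.

```python
from typing import Sequence

N_VARIANTS = 20  # height-associated variants reported in the abstract


def tall_allele_count(genotypes: Sequence[int]) -> int:
    """Sum the 'tall' alleles (0, 1 or 2 per variant) across all variants."""
    if len(genotypes) != N_VARIANTS:
        raise ValueError(f"expected {N_VARIANTS} genotypes")
    if any(g not in (0, 1, 2) for g in genotypes):
        raise ValueError("each genotype must be 0, 1 or 2")
    return sum(genotypes)


def score_group(count: int) -> str:
    """Assign a person to the tail groups compared in the abstract."""
    if count <= 17:
        return "17 or fewer"
    if count >= 27:
        return "27 or more"
    return "in between"


# Hypothetical individual: heterozygous at every variant.
example = [1] * N_VARIANTS
print(tall_allele_count(example), score_group(tall_allele_count(example)))
```

The abstract's comparison is between the two tail groups only; people with 18 to 26 'tall' alleles fall outside both.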
Abstract:
Summary: Research projects presented in this thesis aimed to investigate two major aspects of the arenavirus life cycle in the host cell: viral entry and the biosynthesis of the viral envelope glycoprotein. Old World arenaviruses (OWAV), such as Lassa virus (LASV) and lymphocytic choriomeningitis virus (LCMV), attach to the cell by binding to their receptor, alpha-dystroglycan. Virions are then internalized by a largely unknown pathway of endocytosis and delivered to the late endosome/lysosome, where fusion occurs at low pH. In the major project of my thesis, we sought to identify cellular factors involved in OWAV cell entry. Our work indicates that OWAV cell entry requires microtubular transport and a functional multivesicular body (MVB) compartment. Infection indeed depends on phosphatidyl inositol 3-kinase (PI3K) activity and lysobisphosphatidic acid (LBPA), a lipid found in membranes of intraluminal vesicles (ILVs) of the MVB. We further found a requirement for factors that are part of the endosomal sorting complex required for transport (ESCRT), which is involved in the formation of ILVs. This suggests an ESCRT-mediated sorting of the virus-receptor complex during the entry process. During viral replication, biosynthesis of the viral glycoprotein takes place in the endoplasmic reticulum (ER) of the host cell. When protein load exceeds the folding capacity of the ER, the accumulation of unfolded proteins is sensed by three ER-resident proteins, activating transcription factor 6 (ATF6), inositol-requiring enzyme 1 (IRE1) and PKR-like ER kinase (PERK), whose signaling induces the cellular unfolded protein response (UPR). Our results indicate that acute LCMV infection transiently induces the activation of the ATF6 branch of the UPR, whereas the PERK and IRE1 axes of the UPR are neither triggered nor blocked during infection.
Our data also demonstrate that activation of the ATF6 pathway is required for optimal viral replication during acute infection. The formation of the mature, fusion-active form of arenavirus glycoproteins requires proteolytic cleavage mediated by the cellular protease subtilisin kexin isozyme-1 (SKI-1)/site-1 protease (S1P). We show that targeting the SKI-1/S1P enzymatic activity with specific inhibitors is a powerful strategy to block productive arenavirus infection. Moreover, characterization of protease function highlights differences in processing between cellular and viral substrates, opening new possibilities in terms of drug development against human pathogenic arenaviruses.
Résumé (translated from the French): The research projects presented in this thesis aimed to study two aspects of the arenavirus life cycle: entry of the virus into the host cell and biosynthesis of the glycoprotein during viral replication. Old World arenaviruses (OWAV), such as Lassa virus (LASV) and lymphocytic choriomeningitis virus (LCMV), attach to the host cell by binding to their receptor, alpha-dystroglycan. Virions are then internalized by an unknown endocytic pathway and delivered to the late endosome/lysosome, where the acidic pH allows fusion between the viral envelope and the membrane of the compartment. The main project of my thesis was to identify the cellular factors involved in OWAV entry into the host cell. Our results indicate that OWAV entry requires microtubular transport and the presence of a functional multivesicular body (MVB). Infection indeed depends on phosphatidyl inositol 3-kinase (PI3K) activity and on lysobisphosphatidic acid (LBPA), a lipid present in the membranes of the intraluminal vesicles (ILVs) of the MVB. We also found an involvement of factors belonging to the endosomal sorting complex required for transport (ESCRT), which plays a role in the formation of ILVs.
These data suggest that the virus-receptor complex is incorporated into ILVs during the entry process. During viral replication, biosynthesis of the viral glycoprotein takes place in the endoplasmic reticulum (ER) of the host cell. When the load of newly synthesized proteins exceeds the protein-folding capacity of the ER, the accumulation of misfolded proteins is detected by three factors: activating transcription factor 6 (ATF6), inositol-requiring enzyme 1 (IRE1) and PKR-like ER kinase (PERK). Their signaling constitutes the cellular response to misfolded proteins (UPR). Our results show that acute infection with LCMV transiently induces activation of the ATF6 signaling pathway, whereas the PERK and IRE1 axes of the UPR are neither induced nor blocked during infection. Our data also demonstrate that activation of the ATF6 pathway is necessary for optimal viral replication during acute LCMV infection. Maturation of arenavirus glycoproteins requires proteolytic cleavage by the cellular protease subtilisin kexin isozyme-1 (SKI-1)/site-1 protease (S1P). We demonstrated that targeting the enzymatic activity of SKI-1/S1P with specific inhibitors is a promising strategy for blocking arenavirus infection. Characterization of the protease's mechanism of action also revealed differences in cleavage between cellular and viral substrates, which opens new perspectives for developing drugs against arenaviruses that are pathogenic to humans.
Abstract:
To develop a comprehensive overview of copy number aberrations (CNAs) in stage-II/III colorectal cancer (CRC), we characterized 302 tumors from the PETACC-3 clinical trial. Microsatellite-stable (MSS) samples (n = 269) had 66 minimal common CNA regions, with frequent gains on 20q (72.5%), 7 (41.8%), 8q (33.1%) and 13q (51.0%) and losses on 18 (58.6%), 4q (26%) and 21q (21.6%). MSS tumors have significantly more CNAs than microsatellite-instable (MSI) tumors; within the MSI tumors, a novel deletion of the tumor suppressor WWOX at 16q23.1 was identified (p < 0.01). Focal aberrations identified by the GISTIC method confirmed amplifications of oncogenes including EGFR, ERBB2, CCND1, MET, and MYC, and deletions of tumor suppressors including TP53, APC, and SMAD4; gene expression was highly concordant with copy number aberration for these genes. Novel amplicons included putative oncogenes such as WNK1 and HNF4A, which also showed high concordance between copy number and expression. Survival analysis associated a specific patient segment characterized by chromosome 20q gains with improved overall survival, which might be due to higher expression of genes such as EEF1B2 and PTK6. The CNA clustering also grouped tumors characterized by a poor-prognosis BRAF-mutant-like signature derived from mRNA data from this cohort. We further revealed non-random correlation between CNAs among unlinked loci, including positive correlation between 20q gain and 8q gain, and between 20q gain and chromosome 18 loss, consistent with co-selection of these CNAs. These results reinforce the non-random nature of somatic CNAs in stage-II/III CRC and highlight loci and genes that may play an important role in driving the development and outcome of this disease.
Abstract:
The identification of all human chromosome 21 (HC21) genes is a necessary step in understanding the molecular pathogenesis of trisomy 21 (Down syndrome). The first analysis of the sequence of 21q included 127 previously characterized genes and predicted an additional 98 novel anonymous genes. Recently we evaluated the quality of this annotation by characterizing a set of HC21 open reading frames (C21orfs) identified by mapping spliced expressed sequence tags (ESTs) and predicted genes (PREDs), identified only in silico. This study underscored the limitations of in silico-only gene prediction, as many PREDs were incorrectly predicted. To refine the HC21 annotation, we have developed a reliable algorithm to extract and stringently map sequences that contain bona fide 3' transcript ends to the genome. We then created a specific 21q graphical display allowing an integrated view of the data that incorporates new ESTs as well as features such as CpG islands, repeats, and gene predictions. Using these tools we identified 27 new putative genes. To validate these, we sequenced previously cloned cDNAs and carried out RT-PCR, 5'- and 3'-RACE procedures, and comparative mapping. These approaches substantiated 19 new transcripts, thus increasing the HC21 gene count by 9.5%. These transcripts were likely not previously identified because they are small and encode small proteins. We also identified four transcriptional units that are spliced but contain no obvious open reading frame. The HC21 data presented here further emphasize that current gene prediction algorithms miss a substantial number of transcripts that nevertheless can be identified using a combination of experimental approaches and multiple refined algorithms.
Abstract:
The male-to-female sex ratio at birth is constant across world populations, with an average of 1.06 (106 male to 100 female live births) for populations of European descent. The sex ratio is considered to be affected by numerous biological and environmental factors and to have a heritable component. The aim of this study was to investigate whether modest effects of common alleles at autosomal and chromosome X variants could explain the observed sex ratio at birth. We conducted a large-scale genome-wide association scan (GWAS) meta-analysis across 51 studies, comprising overall 114 863 individuals (61 094 women and 53 769 men) of European ancestry and 2 623 828 common (minor allele frequency >0.05) single-nucleotide polymorphisms (SNPs). Allele frequencies were compared between men and women for directly typed and imputed variants within each study. Forward-time simulations for unlinked, neutral, autosomal, common loci were performed under the demographic model for European populations with a fixed sex ratio and a random mating scheme to assess the probability of detecting significant allele frequency differences. We do not detect any genome-wide significant (P < 5 × 10^-8) common SNP differences between men and women in this well-powered meta-analysis. The simulated data provided results entirely consistent with these findings. This large-scale investigation across ~115 000 individuals shows no detectable contribution from common genetic variants to the observed skew in the sex ratio. The absence of sex-specific differences is useful in guiding genetic association study design, for example when using mixed controls for sex-biased traits.
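To make the comparison of allele frequencies between men and women concrete, the Python sketch below runs a Pearson chi-square test (1 degree of freedom) on a 2×2 table of allele counts for a single SNP. This is a simplified stand-in chosen for illustration. The abstract does not specify the exact test statistic or the meta-analysis procedure, and the function name and example counts are hypothetical.

```python
import math

GENOME_WIDE_P = 5e-8  # threshold quoted in the abstract


def allelic_chi_square(alt_men: int, ref_men: int,
                       alt_women: int, ref_women: int) -> tuple[float, float]:
    """Pearson chi-square test (1 df) on a 2x2 table of allele counts.

    Returns (chi2, p_value). Assumes every row and column total is positive.
    """
    table = [[alt_men, ref_men], [alt_women, ref_women]]
    row_totals = [sum(row) for row in table]
    col_totals = [table[0][j] + table[1][j] for j in range(2)]
    total = sum(row_totals)

    chi2 = 0.0
    for i in range(2):
        for j in range(2):
            expected = row_totals[i] * col_totals[j] / total
            chi2 += (table[i][j] - expected) ** 2 / expected

    # For 1 degree of freedom, P(X > chi2) = erfc(sqrt(chi2 / 2)).
    p_value = math.erfc(math.sqrt(chi2 / 2))
    return chi2, p_value


# Hypothetical allele counts for one autosomal SNP (two alleles per person).
chi2, p = allelic_chi_square(alt_men=32_300, ref_men=75_238,
                             alt_women=36_600, ref_women=85_588)
print(f"chi2 = {chi2:.3f}, p = {p:.3g}, "
      f"genome-wide significant: {p < GENOME_WIDE_P}")
```

In a real meta-analysis the per-study statistics would be combined across the 51 studies rather than pooled into one table; this sketch covers only the single-SNP, single-study step.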
Abstract:
In eukaryotes, homologous recombination proteins such as RAD51 and RAD52 play crucial roles in DNA repair and genome stability. Human RAD52 is a member of a large single-strand annealing protein (SSAP) family [1] and stimulates Rad51-dependent recombination [2, 3]. In prokaryotes and phages, it has been difficult to establish the presence of RAD52 homologs with conserved sequences. Putative SSAPs were recently found in several phages that infect strains of Lactococcus lactis [4]. One of these SSAPs was identified as Sak and was found in the virulent L. lactis phage ul36, which belongs to the Siphoviridae family [4, 5]. In this study, we show that Sak is homologous to the N terminus of human RAD52. Purified Sak binds single-stranded DNA (ssDNA) preferentially over double-stranded DNA (dsDNA) and promotes the renaturation of long complementary ssDNAs. Sak also binds RecA and stimulates homologous recombination reactions. Mutations shown to modulate RAD52 DNA binding [6] affect Sak similarly. Remarkably, electron-microscopic reconstruction of Sak reveals an undecameric (11-subunit) ring, similar to the crystal structure of the N-terminal fragment of human RAD52 [7, 8]. For the first time, we propose a viral homolog of RAD52 at the amino acid, phylogenetic, functional, and structural levels.
Abstract:
Inter-individual differences in gene expression are likely to account for an important fraction of phenotypic differences, including susceptibility to common disorders. Recent studies have shown extensive variation in gene expression levels in humans and other organisms, and that a fraction of this variation is under genetic control. We investigated the patterns of gene expression variation in a 25 Mb region of human chromosome 21, which has been associated with many Down syndrome (DS) phenotypes. Taqman real-time PCR was used to measure expression variation of 41 genes in lymphoblastoid cells of 40 unrelated individuals. For 25 genes found to be differentially expressed, additional analysis was performed in 10 CEPH families to determine heritabilities and map loci harboring regulatory variation. Seventy-six percent of the differentially expressed genes had significant heritabilities, and genome-wide linkage analysis led to the identification of significant eQTLs for nine genes. Most eQTLs were in trans, with the best result (P = 7.46 × 10^-8) obtained for TMEM1 on chromosome 12q24.33. A cis-eQTL identified for CCT8 was validated by performing an association study in 60 individuals from the HapMap project. SNP rs965951, located within CCT8, was found to be significantly associated with its expression levels (P = 2.5 × 10^-5), confirming cis-regulatory variation. The results of our study provide a representative view of expression variation of chromosome 21 genes, identify loci involved in their regulation, and suggest that genes for which expression differences are significantly larger than 1.5-fold in control samples are unlikely to be involved in DS phenotypes present in all affected individuals.
Abstract:
To analyze the neural basis of electric taste, we performed electrical neuroimaging analyses of event-related potentials (ERPs) recorded while participants received electrical pulses to the tongue. Pulses were presented at the individual taste threshold, to excite gustatory fibers selectively without concomitant excitation of trigeminal fibers, and at high intensity, evoking a prickling sensation and thus activating trigeminal fibers. Sour, salty and metallic tastes were reported at both intensities, while clear prickling was reported at high intensity only. ERPs exhibited augmented amplitudes and shorter latencies for high intensity. First activations of gustatory areas (bilateral anterior insula, medial orbitofrontal cortex) were observed at 70-80 ms. Common somatosensory regions were more strongly, but not exclusively, activated at high intensity. Our data provide a comprehensive view of the dynamics of cortical processing of the gustatory and trigeminal portions of electric taste and suggest that gustatory and trigeminal afferents project to overlapping cortical areas.
Abstract:
Dysregulation of intestinal epithelial cell performance is associated with an array of pathologies whose onset mechanisms are incompletely understood. While whole-genomics approaches have been valuable for studying the molecular basis of several intestinal diseases, a thorough analysis of gene expression along the healthy gastrointestinal tract is still lacking. The aim of this study was to map gene expression in gastrointestinal regions of healthy human adults and to implement a procedure for microarray data analysis that would allow its use as a reference when screening for pathological deviations. We analyzed the gene expression signature of antrum, duodenum, jejunum, ileum, and transverse colon biopsies using a biostatistical method based on a multivariate and univariate approach to identify region-selective genes. One hundred sixty-six genes were found responsible for distinguishing the five regions considered. Nineteen had never been described in the GI tract, including a semaphorin probably implicated in pathogen invasion and six novel genes. Moreover, by crossing these genes with those retrieved from an existing data set of gene expression in the intestine of ulcerative colitis and Crohn's disease patients, we identified genes that might be biomarkers of Crohn's and/or ulcerative colitis in ileum and/or colon. These include CLCA4 and SLC26A2, both implicated in ion transport. This study furnishes the first map of gene expression along the healthy human gastrointestinal tract. Furthermore, the approach implemented here, and validated by retrieving known gene profiles, allowed the identification of promising new leads in both healthy and disease states.
Abstract:
Using genome-wide data from 253,288 individuals, we identified 697 variants at genome-wide significance that together explained one-fifth of the heritability for adult height. By testing different numbers of variants in independent studies, we show that the most strongly associated ∼2,000, ∼3,700 and ∼9,500 SNPs explained ∼21%, ∼24% and ∼29% of phenotypic variance. Furthermore, all common variants together captured 60% of heritability. The 697 variants clustered in 423 loci were enriched for genes, pathways and tissue types known to be involved in growth and together implicated genes and pathways not highlighted in earlier efforts, such as signaling by fibroblast growth factors, WNT/β-catenin and chondroitin sulfate-related genes. We identified several genes and pathways not previously connected with human skeletal growth, including mTOR, osteoglycin and binding of hyaluronic acid. Our results indicate a genetic architecture for human height that is characterized by a very large but finite number (thousands) of causal variants.