934 resultados para Human Genome Project.
Resumo:
Extracellular calcium participates in several key physiological functions, such as control of blood coagulation, bone calcification or muscle contraction. Calcium homeostasis in humans is regulated in part by genetic factors, as illustrated by rare monogenic diseases characterized by hypo or hypercalcaemia. Both serum calcium and urinary calcium excretion are heritable continuous traits in humans. Serum calcium levels are tightly regulated by two main hormonal systems, i.e. parathyroid hormone and vitamin D, which are themselves also influenced by genetic factors. Recent technological advances in molecular biology allow for the screening of the human genome at an unprecedented level of detail and using hypothesis-free approaches, such as genome-wide association studies (GWAS). GWAS identified novel loci for calcium-related phenotypes (i.e. serum calcium and 25-OH vitamin D) that shed new light on the biology of calcium in humans. The substantial overlap (i.e. CYP24A1, CASR, GATA3; CYP2R1) between genes involved in rare monogenic diseases and genes located within loci identified in GWAS suggests a genetic and phenotypic continuum between monogenic diseases of calcium homeostasis and slight disturbances of calcium homeostasis in the general population. Future studies using whole-exome and whole-genome sequencing will further advance our understanding of the genetic architecture of calcium homeostasis in humans. These findings will likely provide new insight into the complex mechanisms involved in calcium homeostasis and hopefully lead to novel preventive and therapeutic approaches. Keyword: calcium, monogenic, genome-wide association studies, genetics.
Resumo:
BACKGROUND: It is unknown why patients with extensive ulcerative colitis (UC) have a higher risk of colorectal cancer compared with patients with left-sided UC. This study characterizes the inflammatory processes in left-sided UC, pancolitis, and UC-associated dysplasia at the transcriptional level to identify potential biomarkers and transcripts of importance for the carcinogenic behavior of chronic inflammation. METHODS: The Affymetrix GeneChip Human Genome U133 Plus 2.0 was applied on colonic biopsies from UC patients with left-sided UC, pancolitis, dysplasia, and controls. Reverse transcription polymerase chain reaction and immunohistochemistry were performed for validating selected transcripts in the initial cohort and in 2 independent cohorts of patients with UC. Microarray data were analyzed by principal component analysis, and reverse transcription polymerase chain reaction and immunohistochemistry data by the Wilcoxon's rank-sum test. RESULTS: The principal component analysis results revealed separate clusters for left-sided UC, pancolitis, dysplasia, and controls. Close clustering of dysplastic and pancolitic samples indicated similarities in gene expression. Indeed, 101 and 656 parallel upregulated and downregulated transcripts, respectively, were identified in specimens from dysplasia and pancolitis. Validation of selected transcripts hereof identified insulin receptor alpha (INSRA) and MAP kinase interacting serine/threonine kinase 2 (MKNK2) with an enhanced expression in dysplasia compared with left-sided UC and controls, whereas laminin γ2 (LAMC2) was found with a lower expression in dysplasia compared with the remaining 3 groups. CONCLUSIONS: This study demonstrates pancolitis and left-sided UC as distinct inflammatory processes at the transcriptional level, and identifies INSRA, MKNK2, and LAMC2 as potential critical transcripts in the inflammation-driven preneoplastic process of UC.
Resumo:
BACKGROUND: Membrane-bound organelles are a defining feature of eukaryotic cells, and play a central role in most of their fundamental processes. The Rab G proteins are the single largest family of proteins that participate in the traffic between organelles, with 66 Rabs encoded in the human genome. Rabs direct the organelle-specific recruitment of vesicle tethering factors, motor proteins, and regulators of membrane traffic. Each organelle or vesicle class is typically associated with one or more Rab, with the Rabs present in a particular cell reflecting that cell's complement of organelles and trafficking routes. RESULTS: Through iterative use of hidden Markov models and tree building, we classified Rabs across the eukaryotic kingdom to provide the most comprehensive view of Rab evolution obtained to date. A strikingly large repertoire of at least 20 Rabs appears to have been present in the last eukaryotic common ancestor (LECA), consistent with the 'complexity early' view of eukaryotic evolution. We were able to place these Rabs into six supergroups, giving a deep view into eukaryotic prehistory. CONCLUSIONS: Tracing the fate of the LECA Rabs revealed extensive losses with many extant eukaryotes having fewer Rabs, and none having the full complement. We found that other Rabs have expanded and diversified, including a large expansion at the dawn of metazoans, which could be followed to provide an account of the evolutionary history of all human Rabs. Some Rab changes could be correlated with differences in cellular organization, and the relative lack of variation in other families of membrane-traffic proteins suggests that it is the changes in Rabs that primarily underlies the variation in organelles between species and cell types.
Resumo:
One of the first useful products from the human genome will be a set of predicted genes. Besides its intrinsic scientific interest, the accuracy and completeness of this data set is of considerable importance for human health and medicine. Though progress has been made on computational gene identification in terms of both methods and accuracy evaluation measures, most of the sequence sets in which the programs are tested are short genomic sequences, and there is concern that these accuracy measures may not extrapolate well to larger, more challenging data sets. Given the absence of experimentally verified large genomic data sets, we constructed a semiartificial test set comprising a number of short single-gene genomic sequences with randomly generated intergenic regions. This test set, which should still present an easier problem than real human genomic sequence, mimics the approximately 200kb long BACs being sequenced. In our experiments with these longer genomic sequences, the accuracy of GENSCAN, one of the most accurate ab initio gene prediction programs, dropped significantly, although its sensitivity remained high. Conversely, the accuracy of similarity-based programs, such as GENEWISE, PROCRUSTES, and BLASTX was not affected significantly by the presence of random intergenic sequence, but depended on the strength of the similarity to the protein homolog. As expected, the accuracy dropped if the models were built using more distant homologs, and we were able to quantitatively estimate this decline. However, the specificities of these techniques are still rather good even when the similarity is weak, which is a desirable characteristic for driving expensive follow-up experiments. Our experiments suggest that though gene prediction will improve with every new protein that is discovered and through improvements in the current set of tools, we still have a long way to go before we can decipher the precise exonic structure of every gene in the human genome using purely computational methodology.
Resumo:
For the ∼1% of the human genome in the ENCODE regions, only about half of the transcriptionally active regions (TARs) identified with tiling microarrays correspond to annotated exons. Here we categorize this large amount of “unannotated transcription.” We use a number of disparate features to classify the 6988 novel TARs—array expression profiles across cell lines and conditions, sequence composition, phylogenetic profiles (presence/absence of syntenic conservation across 17 species), and locations relative to genes. In the classification, we first filter out TARs with unusual sequence composition and those likely resulting from cross-hybridization. We then associate some of those remaining with proximal exons having correlated expression profiles. Finally, we cluster unclassified TARs into putative novel loci, based on similar expression and phylogenetic profiles. To encapsulate our classification, we construct a Database of Active Regions and Tools (DART.gersteinlab.org). DART has special facilities for rapidly handling and comparing many sets of TARs and their heterogeneous features, synchronizing across builds, and interfacing with other resources. Overall, we find that ∼14% of the novel TARs can be associated with known genes, while ∼21% can be clustered into ∼200 novel loci. We observe that TARs associated with genes are enriched in the potential to form structural RNAs and many novel TAR clusters are associated with nearby promoters. To benchmark our classification, we design a set of experiments for testing the connectivity of novel TARs. Overall, we find that 18 of the 46 connections tested validate by RT-PCR and four of five sequenced PCR products confirm connectivity unambiguously.
Resumo:
Abstract : Gene duplication is an essential source of material for the origin of genetic novelty and the evolution of lineage- or species-specific phenotypic traits. The reverse transcription of source gene mRNA followed by the genomic insertion of the resulting cDNA - retroposition - has provided the human genome with a significant number of gene copies during the last ~63 million years (MYA) of primate evolution. We estimated that at least 1 new functional gene (retrogene) per MYA emerged by retroposition in the primate lineage leading to humans. Using a combination of comparative sequencing and evolutionary simulations, we obtained strong evidence of functionality for 7 primate specific retrogenes. Most of these genes are specifically expressed in testis suggesting that retroposition has contributed with genetic raw material necessary for the evolution ofmale-specific functions in primates. We characterized CDC14Bretro (identified in the previous survey) that originated from the retroposition of a cell cycle gene - CDC14B - in the common ancestor of humans and apes. We demonstrate that CDC14Bretro experienced a period of intense positive selection in the African ape ancestor. By virtue of the amino acid substitutions that occurred during this period CDC 14Bretro adapted to a new subcellular compartment in African apes. Further analyses indicate that this subcellular shift reflects the evolution of anew functional role of CDC 14Bretro. Prompted by this result, we used yeast (Saccharomyces cerevisiae) to investigate on a global scale the extent of functional diversification of duplicate genes through the subcellular adaptation of their encoded proteins. We found that duplicate proteins frequently evolved new cellular localization patterns, either by partitioning of ancestral localizations ("sublocalization"), or more frequently by relocalization to previously unoccupied compartments ("neolocalization"). Interestingly, proteins involved in processes with a wider subcellular distribution more frequently evolved new localization patterns suggesting that subcellular localization changes are dependent on progenitor gene functions. Relocated proteins adapted to their new subcellular environments and evolved new functional roles through changes of their physio-chemical properties, expression levels, and interaction partners. Our work suggests an important role of subcellular adaptation for the emergence of new gene functions.
Resumo:
Microsatellites are important highly polymorphic genetic markers dispersed in the human genome. Using a panel of 22 (CA)n repeat microsatellite markers mapped to recurrent breakpoint cluster regions specifically involved in leukemia, we investigated 114 adult leukemias (25 acute lymphocytic leukemia [ALL], 32 acute myeloid leukemia [AML], 36 chronic lymphocytic leukemia [CLL], and 21 chronic myeloid leukemia [CML] in chronic phase) for somatic mutations at these loci. In each patient, DNA from fresh leukemia samples was analyzed alongside normal constitutive DNA from buccal epithelium. We detected loss of heterozygosity (LOH) in 81 of 114 patients (ALL 16/25, AML 25/32, CLL 30/36, CML 10/21). Deletions were most often seen in ALL at 11q23 and 19p13; in AML at 8q22 and 11q23; in CLL at 13q14.3, 11q13, and 11q23; and in CML at 3q26. Only six deletions were reported in 74 karyotypes analyzed, whereas in these same cases, 91 LOH events were detected by microsatellites. Of 26 leukemias with a normal karyotype, 16 nevertheless showed at least one LOH by microsatellite analysis. Replication errors were found in 10 of 114 patients (8.8%). Thus, microsatellite instability is rare in leukemia in contrast to many solid tumors. Our findings suggest that in adult leukemia, LOH may be an important genetic event in addition to typical chromosomal translocations. LOH may point to the existence of tumor suppressor genes involved in leukemogenesis to a degree that has hitherto been underestimated.
Resumo:
Résumé II y a cinq ans, la découverte d'un nouveau domaine, le PYD domaine, lié aux domaines de la mort, a permis la description de la nouvelle famille des NALP protéines. L'analyse structurelle de cette famille de protéines révéla la présence de deux autres domaines, impliqués dans l'oligomerisation, NACHT, et la détection des ligands, Leucine rich repeats ou LRR. Cette architecture protéique est homologue à celle qui est décrite pour les NODs, les Tol1 récepteurs et tes protéines de résistance chez les plantes. Cette homologie suggère une possible implication des NALPs dans la régulation de l'immunité innée. Premièrement, nous avons décrit les composants minimaux qui permettent à l'inflammasomeNALP3 d'activer la caspase pro-inflammatoire, caspase-1. En comparaison à NALP1, NALP3 ne contient pas de FIIND domaine, ni de CARD domaine en C-terminus et n'interagit pas avec caspase-5. Nous avons découvert une protéine très homologue au C-terminus de NALP1, Cardinal, qui se lie au NACHT domaine de NALP2 et NALP3 par l'intermédiaire de son FIIND domaine. Cardinal possède la capacité d'interagir avec caspase-l, mais seul ASC semble être nécessaire à la maturation de la prointerleukine-1β suite à la stimulation de NALP3. Deuxièmement, notre étude s'est concentrée sur la nature du stimulus capable d'induire la formation et l'activation de l'inflammasome-NALP3. Nous avons démontré que l'ajout de muramyl dipeptide (MDP), produit à partir de la digestion enzymatique de peptidoglycaris bactériens, induit à la fois l'expression de la proIL-1β par la voie NOD2 et sa maturation en IL-1β active par la voie NALP3. Bien que le MDP active l'inflammasome-NALP3, il est incapable d'induire la sécrétion de l'IL-1β mature dans la lignée cellulaire THP1, comparé aux monocytes primaires humains. Cette différence pourrait être liée à l'absence, dans les THP1, de la protéine Filamin, qui est proposée d'interagir avec Cardinal. L'implication de NALP3 dans la maturation de l'IL-lb est confirmée suite à la découverte de mutations sur le gène CIAS1/NALP3/cryopyrin associées à trois maladies auto-inflammatoires : le syndrome de Muckle-Wells (MWS), l'urticaire familial au froid (FCU) et le syndrome CINCA/NOMID. Une élévation constitutive de la maturation et de la sécrétion de la proIL-1β en absence de stimulation MDP est détectée dans les macrophages des patients Muckle-Wells. En conclusion, nos études ont démontré que l'inflammasome-NALP3 doit être finement régulé pour éviter une activité incontrôlée qui représente la base moléculaire des symptômes associés aux syndromes auto-inflammatoires liés à NALP3. Summary Five years ago, the description of the NALP family originated from the discovery of a new death-domain fold family, the PYD domain. NALP contains aprotein-protein interaction domain (PYD), an oligomerization domain (NACHT) and a ligand-sensing domain, leucine rich repeats or LRR. This protein architecture shares similarity with receptors involved in immunity, such as NODS, Toll receptors (TLRs) and related plant resistance proteins, and points to an important role of NALPs in defense mechanisms. We first described the minimal complex involved in the pro-inflammatory Interleukin-1beta (IL-1β) cytokine maturation, called the inflammasome, which contains NALP3. In contrast to NALP1, NALP3, like other members of the NALP family, is devoid of C-terminal FIIND and CARD domains and does not interact with the pro-inflammatory caspase-5. Interestingly, a homolog of the C-terminal portion of NALP1 was found in the human genome and was named Cardinal. We found that NALP2 and NALP3 interact with the CARD-containing proteins Cardinal. Cardinal is able to bind to caspase-1 but is not required for IL-1β maturation through NALP3 activation, as demonstrated for the adaptor ASC. Secondly, our study focused on the stimuli involved in the activation of the NALP3 inflammasome. MDP was shown to induce the expression of proIL1β through NOD2 and then the maturation into active IL-1β by activation of the NALP3 inflammasome. However, in the monocytic THP1 cell line, secretion of IL-1β upon MDP stimulation seems to be independent of the inflammasome activation compared to human primary monocytes. This difference might be linked to a Cardinal-interacting protein, filamin. Until now, the role of Cardinal and filamin is still unknown and remains to be elucidated. Finally, mutations in the NALP3/cryopyrin/CIAS1 gene are associated with three autoinflammatory diseases: Muckle-Wells syndrome, familial cold autoinflammatory syndrome, and CINCA. Constitutive, elevated IL-1β maturation and secretion, even in the absence of MDP stimulation, was observed in macrophages from Muckle-Wells patients and confirmed a key role for the NALP3 inflammasome in innate immunity In conclusion, our studies describes the formation of the NALP3 inflammasome and suggests that this complex has to be tightly regulated to avoid an increased deregulated inflammasome activity that is the molecular basis for the symptoms associated with NALP3-dependent autoinflammatory disorders.
Resumo:
Understanding the genetic structure of human populations is of fundamental interest to medical, forensic and anthropological sciences. Advances in high-throughput genotyping technology have markedly improved our understanding of global patterns of human genetic variation and suggest the potential to use large samples to uncover variation among closely spaced populations. Here we characterize genetic variation in a sample of 3,000 European individuals genotyped at over half a million variable DNA sites in the human genome. Despite low average levels of genetic differentiation among Europeans, we find a close correspondence between genetic and geographic distances; indeed, a geographical map of Europe arises naturally as an efficient two-dimensional summary of genetic variation in Europeans. The results emphasize that when mapping the genetic basis of a disease phenotype, spurious associations can arise if genetic structure is not properly accounted for. In addition, the results are relevant to the prospects of genetic ancestry testing; an individual's DNA can be used to infer their geographic origin with surprising accuracy-often to within a few hundred kilometres.
Resumo:
Transposable elements, as major components of most eukaryotic organisms' genomes, define their structural organization and plasticity. They supply host genomes with functional elements, for example, binding sites of the pleiotropic master transcription factor p53 were identified in LINE1, Alu and LTR repeats in the human genome. Similarly, in this report we reveal the role of zebrafish (Danio rerio) EnSpmN6_DR non-autonomous DNA transposon in shaping the repertoire of the p53 target genes. The multiple copies of EnSpmN6_DR and their embedded p53 responsive elements drive in several instances p53-dependent transcriptional modulation of the adjacent gene, whose human orthologs were frequently previously annotated as p53 targets. These transposons define predominantly a set of target genes whose human orthologs contribute to neuronal morphogenesis, axonogenesis, synaptic transmission and the regulation of programmed cell death. Consistent with these biological functions the orthologs of the EnSpmN6_DR-colonized loci are enriched for genes expressed in the amygdala, the hippocampus and the brain cortex. Our data pinpoint a remarkable example of convergent evolution: the exaptation of lineage-specific transposons to shape p53-regulated neuronal morphogenesis-related pathways in both a hominid and a teleost fish.
Resumo:
Our current knowledge of the general factor requirement in transcription by the three mammalian RNA polymerases is based on a small number of model promoters. Here, we present a comprehensive chromatin immunoprecipitation (ChIP)-on-chip analysis for 28 transcription factors on a large set of known and novel TATA-binding protein (TBP)-binding sites experimentally identified via ChIP cloning. A large fraction of identified TBP-binding sites is located in introns or lacks a gene/mRNA annotation and is found to direct transcription. Integrated analysis of the ChIP-on-chip data and functional studies revealed that TAF12 hitherto regarded as RNA polymerase II (RNAP II)-specific was found to be also involved in RNAP I transcription. Distinct profiles for general transcription factors and TAF-containing complexes were uncovered for RNAP II promoters located in CpG and non-CpG islands suggesting distinct transcription initiation pathways. Our study broadens the spectrum of general transcription factor function and uncovers a plethora of novel, functional TBP-binding sites in the human genome.
Resumo:
The current availability of five complete genomes of different primate species allows the analysis of genetic divergence over the last 40 million years of evolution. We hypothesized that the interspecies differences observed in susceptibility to HIV-1 would be influenced by the long-range selective pressures on host genes associated with HIV-1 pathogenesis. We established a list of human genes (n = 140) proposed to be involved in HIV-1 biology and pathogenesis and a control set of 100 random genes. We retrieved the orthologous genes from the genome of humans and of four nonhuman primates (Pan troglodytes, Pongo pygmaeus abeli, Macaca mulatta, and Callithrix jacchus) and analyzed the nucleotide substitution patterns of this data set using codon-based maximum likelihood procedures. In addition, we evaluated whether the candidate genes have been targets of recent positive selection in humans by analyzing HapMap Phase 2 single-nucleotide polymorphisms genotyped in a region centered on each candidate gene. A total of 1,064 sequences were used for the analyses. Similar median K(A)/K(S) values were estimated for the set of genes involved in HIV-1 pathogenesis and for control genes, 0.19 and 0.15, respectively. However, genes of the innate immunity had median values of 0.37 (P value = 0.0001, compared with control genes), and genes of intrinsic cellular defense had K(A)/K(S) values around or greater than 1.0 (P value = 0.0002). Detailed assessment allowed the identification of residues under positive selection in 13 proteins: AKT1, APOBEC3G, APOBEC3H, CD4, DEFB1, GML, IL4, IL8RA, L-SIGN/CLEC4M, PTPRC/CD45, Tetherin/BST2, TLR7, and TRIM5alpha. A number of those residues are relevant for HIV-1 biology. The set of 140 genes involved in HIV-1 pathogenesis did not show a significant enrichment in signals of recent positive selection in humans (intraspecies selection). However, we identified within or near these genes 24 polymorphisms showing strong signatures of recent positive selection. Interestingly, the DEFB1 gene presented signatures of both interspecies positive selection in primates and intraspecies recent positive selection in humans. The systematic assessment of long-acting selective pressures on primate genomes is a useful tool to extend our understanding of genetic variation influencing contemporary susceptibility to HIV-1.
Resumo:
The complexity of sleep-wake regulation, in addition to the many environmental influences, includes genetic predisposing factors, which begin to be discovered. Most of the current progress in the study of sleep genetics comes from animal models (dogs, mice, and drosophila). Multiple approaches using both animal models and different genetic techniques are needed to follow the segregation and ultimately to identify 'sleep genes' and molecular bases of sleep disorders. Recent progress in molecular genetics and the development of detailed human genome map have already led to the identification of genetic factors in several complex disorders. Only a few genes are known for which a mutation causes a sleep disorder. However, single gene disorders are rare and most common disorders are complex in terms of their genetic susceptibility, environmental factors, gene-gene, and gene-environment interactions. We review here the current progress in the genetics of normal and pathological sleep and suggest a few future perspectives.
Resumo:
Objectives: Sequencing and annotation of the genome of Aspergillus fumigatus has dramatically changed our knowledge about the proteins potentially encoded by the fungus. Own analysis have resulted in at least 47 of them contain a signal for secretion. Among those list we want to characterize those enzymes that may have impact on fungal growth outside and particularly inside the host. We thereby want to learn more about their function in general and to identify possible novel drug targets suited to combat invasive aspergillosis. Methods: Four groups of secreted proteases have been chosen for further analysis: 1 Serine-carboxyl proteases (sedolisins). Four of them were expressed in yeast and partly in bacteria. Substrate-specificity studies and kinetics as well as protein characterization of the yeast derived proteases were performed according to standard methods. Enzyme specific polyclonal antibodies were raised in rabbits using the peptides expressed in bacteria. Expression of proteases in A. fumigatus was investigated with these antibodies and gene knockout mutants for each enzyme as a control. All the following mentioned proteases will be investigated accordingly. 2 Two metalloproteases from the M12-family, ADAM-A and ADAM-B. Both proteases are likely membrane associated and may have inherent sheddase function as their counterparts in mammals. 3 One metalloprotease of the M43 family. An orthologue of this protease in Coccidioides posadasii is known to posses immunomodulating activities. 4 One putative endoprotease of the S28-family. An orthologue in Aspergillus niger is known to digest proline-rich proteins. In A. fumigatus this enzyme may facilitate invasion through proline-rich proteins like collagen. Results: All sedolisins expressed in yeast were proteolytically active: Three of them were characterized as tripeptidyl-peptidases whereas one enzyme is an endoprotease. Corresponding knockout mutants did not reveal a specific phenotype. Expression and investigations on all above mentioned proteases as well as generation of corresponding knockout mutants and double knockout mutants for the ADAMs, respectively, is underway. Promising candidates will be investigated in animal studies for reduced virulence. Conclusions : The real existence of so far hypothetical proteases predicted by the genome project was already demonstrated for the sedolisins by a reverse genetic approach (from gene to protein). With the aim of improving basic knowledge on function of other proteases potentially crucial for fungal growth and thus for pathogenesis, other hypothetical enzymes will be investigated. Those enzymes may turn out to be ideal drug targets for antimycotic chemotherapy.