973 resultados para Human Genome
Resumo:
Plectin, a 500-kDa intermediate filament binding protein, has been proposed to provide mechanical strength to cells and tissues by acting as a cross-linking element of the cytoskeleton. To set the basis for future studies on gene regulation, tissue-specific expression, and pathological conditions involving this protein, we have cloned the human plectin gene, determined its coding sequence, and established its genomic organization. The coding sequence contains 32 exons that extend over 32 kb of the human genome. Most of the introns reside within a region encoding the globular N-terminal domain of the molecule, whereas the entire central rod domain and the entire C-terminal globular domain were found to be encoded by single exons of remarkable length, >3 kb and >6 kb, respectively. Overall, the organization of the human plectin gene was strikingly similar to that of human bullous pemphigoid antigen 1 (BPAG1), confirming that both proteins belong to the same gene family. Comparison of the deduced protein sequences for human and rat plectin revealed that they were 93% identical. By using fluorescence in situ hybridization, we have mapped the plectin gene to the long arm of chromosome 8 within the telomeric region. This gene locus (8q24) has previously been implicated in the human blistering skin disease epidermolysis bullosa simplex Ogna. Detailed knowledge of the structure of the plectin gene and its chromosome localization will aid in the elucidation of whether this or any other pathological conditions are linked to alterations in the plectin gene.
Resumo:
The development of a highly reliable physical map with landmark sites spaced an average of 100 kbp apart has been a central goal of the Human Genome Project. We have approached the physical mapping of human chromosome 11 with this goal as a primary target. We have focused on strategies that would utilize yeast artificial chromosome (YAC) technology, thus permitting long-range coverage of hundreds of kilobases of genomic DNA, yet we sought to minimize the ambiguities inherent in the use of this technology, particularly the occurrence of chimeric genomic DNA clones. This was achieved through the development of a chromosome 11-specific YAC library from a human somatic cell hybrid line that has retained chromosome 11 as its sole human component.To maximize the efficiency of YAC contig assembly and extension, we have employed an Alu-PCR-based hybridization screening system. This system eliminates many of the more costly and time-consuming steps associated with sequence tagged site content mapping such as sequencing, primer production, and hierarchical screening, resulting in greater efficiency with increased throughput and reduced cost. Using these approaches, we have achieved YAC coverage for >90% of human chromosome 11, with an average intermarker distance of <100 kbp. Cytogenetic localization has been determined for each contig by fluorescent in situ hybridization and/or sequence tagged site content. The YAC contigs that we have generated should provide a robust framework to move forward to sequence-ready templates for the sequencing efforts of the Human Genome Project as well as more focused positional cloning on chromosome 11.
Resumo:
Integration of human immunodeficiency virus (HIV) DNA into the human genome requires the virus-encoded integrase (IN) protein, and therefore the IN protein is a suitable target for antiviral strategies. To find a potent HIV IN inhibitor, we screened a "synthetic peptide combinatorial library." We identified a hexapeptide with the sequence HCKFWW that inhibits IN-mediated 3'-processing and integration with an IC50 of 2 microM. The peptide is active on IN proteins from other retroviruses such as HIV-2, feline immunodeficiency virus, and Moloney murine leukemia virus, supporting the notion that a conserved region of IN is targeted. The hexapeptide was also tested in the disintegration reaction. This phosphoryl-transfer reaction can be carried out by the catalytic core of IN alone, and the peptide HCKFWW was found to inhibit this reaction, suggesting that the hexapeptide acts at or near the catalytic site of IN. Identification of an IN hexapeptide inhibitor provides proof of concept for the approach, and, moreover, this peptide may be useful for structure-function analysis of IN.
Resumo:
Many features of Down syndrome might result from the overdosage of only a few genes located in a critical region of chromosome 21. To search for these genes, cosmids mapping in this region were isolated and used for trapping exons. One of the trapped exons obtained has a sequence very similar to part of the Drosophila single-minded (sim) gene, a master regulator of the early development of the fly central nervous system midline. Mapping data indicated that this exonic sequence is only present in the Down syndrome-critical region in the human genome. Hybridization of this exonic sequence with human fetal kidney poly(A)+ RNA revealed two transcripts of 6 and 4.3 kb. In situ hybridization of a probe derived from this exon with human and rat fetuses showed that the corresponding gene is expressed during early fetal life in the central nervous system and in other tissues, including the facial, skull, palate, and vertebra primordia. The expression pattern of this gene suggests that it might be involved in the pathogenesis of some of the morphological features and brain anomalies observed in Down syndrome.
Resumo:
We report a general mass spectrometric approach for the rapid identification and characterization of proteins isolated by preparative two-dimensional polyacrylamide gel electrophoresis. This method possesses the inherent power to detect and structurally characterize covalent modifications. Absolute sensitivities of matrix-assisted laser desorption ionization and high-energy collision-induced dissociation tandem mass spectrometry are exploited to determine the mass and sequence of subpicomole sample quantities of tryptic peptides. These data permit mass matching and sequence homology searching of computerized peptide mass and protein sequence data bases for known proteins and design of oligonucleotide probes for cloning unknown proteins. We have identified 11 proteins in lysates of human A375 melanoma cells, including: alpha-enolase, cytokeratin, stathmin, protein disulfide isomerase, tropomyosin, Cu/Zn superoxide dismutase, nucleoside diphosphate kinase A, galaptin, and triosephosphate isomerase. We have characterized several posttranslational modifications and chemical modifications that may result from electrophoresis or subsequent sample processing steps. Detection of comigrating and covalently modified proteins illustrates the necessity of peptide sequencing and the advantages of tandem mass spectrometry to reliably and unambiguously establish the identity of each protein. This technology paves the way for studies of cell-type dependent gene expression and studies of large suites of cellular proteins with unprecedented speed and rigor to provide information complementary to the ongoing Human Genome Project.
Resumo:
The human genome contains many repeated DNA sequences that vary in complexity of repeating unit from a single nucleotide to a whole gene. The repeat sequences can be widely dispersed or in simple tandem arrays. Arrays of up to 5 or 6 nt are known as simple tandem repeats, and these are widely dispersed and highly polymorphic. Members of one group of the simple tandem repeats, the trinucleotide repeats, can undergo an increase in copy number by a process of dynamic mutation. Dynamic mutations of the CCG trinucleotide give rise to one group of fragile sites on human chromosomes, the rare folate-sensitive group. One member of this group, the fragile X (FRAXA) is responsible for the most common familial form of mental retardation. Another member of the group FRAXE is responsible for a rarer mild form of mental retardation. Similar mutations of AGC repeats give rise to a number of neurological disorders. The expanded repeats are unstable between generations and somatically. The intergenerational instability gives rise to unusual patterns of inheritance--particularly anticipation, the increasing severity and/or earlier age of onset of the disorder in successive generations. Dynamic mutations have been found only in the human species, and possible reasons for this are considered. The mechanism of dynamic mutation is discussed, and a number of observations of simple tandem repeat mutation that could assist in understanding this phenomenon are commented on.
Resumo:
CpG island is a GC-rich motif occurred in gene promoter region, which can play important roles in gene silencing and imprinting. Here, we present a set of discriminant functions that can recognize the structural and compositional features of CpG islands in the putative promoter regions (PPRs) of human and mouse immunoglobulin (Ig) genes. We showed that the PPRs of both human and mouse Ig genes irrespective of gene chromosomal localization are apparently CpG island poor, with a low percentage of the CpG islands overlapped with the transcription start site (TSS). The human Ig genes that have CpG islands in the PPRs show a very narrow range of CpG densities. 47% of the Ig genes fall in the range of 3.5-4 CpGs/100 bp. In contrast, the non-Ig genes examined have a wide range of the density of CpG island, with 10.5% having the density of 8.1-15 CpGs/100 bp. Meantime, five patterns of the CpG distributions within the CpG islands have been classified: Pat A, B, C, D, and E. 21.6% and 10.8% of the Ig genes fall into the Pat B and Pat D groups, respectively, which were significantly higher than the non-Ig genes examined (8.2% and 3.8%). Moreover, the length of CpG islands is shorter in human Ig genes than in non-Ig genes but is much longer than in mouse orthologues. These findings provide a clear picture of non-neutral and nonrandom occurrence of the CpG islands in the PPRs of human and mouse Ig genes, which facilitate rational recommendations regarding their nomenclature. (C) 2005 Elsevier B.V. All rights reserved.
Resumo:
Cross-species comparative genomics is a powerful strategy for identifying functional regulatory elements within noncoding DNA. In this paper, comparative analysis of human and mouse intronic sequences in the breast cancer susceptibility gene (BRCA1) revealed two evolutionarily conserved noncoding sequences (CNS) in intron 2, 5 kb downstream of the core BRCA1 promoter. The functionality of these elements was examined using homologous-recombination-based mutagenesis of reporter gene-tagged cosmids incorporating these regions and flanking sequences from the BRCA1 locus. This showed that CNS-1 and CNS-2 have differential transcriptional regulatory activity in epithelial cell lines. Mutation of CNS-1 significantly reduced reporter gene expression to 30% of control levels. Conversely mutation of CNS-2 increased expression to 200% of control levels. Regulation is at the level of transcription and shows promoter specificity. Both elements also specifically bind nuclear proteins in vitro. These studies demonstrate that the combination of comparative genomics and functional analysis is a successful strategy to identify novel regulatory elements and provide the first direct evidence that conserved noncoding sequences in BRCA1 regulate gene expression. (c) 2005 Elsevier Inc. All rights reserved.
Resumo:
The zebrafish golden mutation is characterized by the production of small and irregular-shaped melanin granules, resulting in a lightening of the pigmented lateral stripes of the animal. The recent positional cloning and localization of the golden gene, combined with genotype-phenotype correlations of alleles of its human orthologue (SLC24A5) in African-American and African-Caribbean populations, provide insights into the genetic and molecular basis of human skin colour. SLC24A5 promotes melanin deposition through maturation of the melanosome, highlighting the importance of ion-exchange in the function of this organelle.
Resumo:
Improvements in genomic technology, both in the increased speed and reduced cost of sequencing, have expanded the appreciation of the abundance of human genetic variation. However the sheer amount of variation, as well as the varying type and genomic content of variation, poses a challenge in understanding the clinical consequence of a single mutation. This work uses several methodologies to interpret the observed variation in the human genome, and presents novel strategies for the prediction of allele pathogenicity.
Using the zebrafish model system as an in vivo assay of allele function, we identified a novel driver of Bardet-Biedl Syndrome (BBS) in CEP76. A combination of targeted sequencing of 785 cilia-associated genes in a cohort of BBS patients and subsequent in vivo functional assays recapitulating the human phenotype gave strong evidence for the role of CEP76 mutations in the pathology of an affected family. This portion of the work demonstrated the necessity of functional testing in validating disease-associated mutations, and added to the catalogue of known BBS disease genes.
Further study into the role of copy-number variations (CNVs) in a cohort of BBS patients showed the significant contribution of CNVs to disease pathology. Using high-density array comparative genomic hybridization (aCGH) we were able to identify pathogenic CNVs as small as several hundred bp. Dissection of constituent gene and in vivo experiments investigating epistatic interactions between affected genes allowed for an appreciation of several paradigms by which CNVs can contribute to disease. This study revealed that the contribution of CNVs to disease in BBS patients is much higher than previously expected, and demonstrated the necessity of consideration of CNV contribution in future (and retrospective) investigations of human genetic disease.
Finally, we used a combination of comparative genomics and in vivo complementation assays to identify second-site compensatory modification of pathogenic alleles. These pathogenic alleles, which are found compensated in other species (termed compensated pathogenic deviations [CPDs]), represent a significant fraction (from 3 – 10%) of human disease-associated alleles. In silico pathogenicity prediction algorithms, a valuable method of allele prioritization, often misrepresent these alleles as benign, leading to omission of possibly informative variants in studies of human genetic disease. We created a mathematical model that was able to predict CPDs and putative compensatory sites, and functionally showed in vivo that second-site mutation can mitigate the pathogenicity of disease alleles. Additionally, we made publically available an in silico module for the prediction of CPDs and modifier sites.
These studies have advanced the ability to interpret the pathogenicity of multiple types of human variation, as well as made available tools for others to do so as well.
Resumo:
A biologia molecular tem fornecido as ferramentas básicas para os geneticistas se aprofundarem nos mecanismos moleculares que influem na variação das doenças. Deve-se destacar a responsabilidade científica e moral dos pesquisadores, uma vez que os cientistas devem imaginar as consequências morais da aplicação comercial de testes genéticos, já que esse fato envolve não só o indivíduo e suas famílias, mas toda a população. Além de ser preciso, também, fazer uma reflexão sobre como essas informações do genoma humano serão utilizadas, para o bem ou mal. O objetivo desta revisão foi trazer à luz do conhecimento dados sobre características éticas da aplicação da biologia molecular, relacionando-a com os direitos do ser humano. Após análise bibliográfica, pôde-se observar que o Projeto Genoma Humano gerou várias possibilidades, como identificação de genes associados a doenças com propriedades sinergísticas, mas modificando às vezes comportamentos ao intervir geneticamente no ser humano, trazendo benefícios ou malefícios sociais. O grande desafio é decidir o que a humanidade pretende em relação a este gigantesco salto.
Resumo:
Background: The post-genomic era has brought new challenges regarding the understanding of the organization and function of the human genome. Many of these challenges are centered on the meaning of differential gene regulation under distinct biological conditions and can be performed by analyzing the Multiple Differential Expression (MDE) of genes associated with normal and abnormal biological processes. Currently MDE analyses are limited to usual methods of differential expression initially designed for paired analysis. Results: We proposed a web platform named ProbFAST for MDE analysis which uses Bayesian inference to identify key genes that are intuitively prioritized by means of probabilities. A simulated study revealed that our method gives a better performance when compared to other approaches and when applied to public expression data, we demonstrated its flexibility to obtain relevant genes biologically associated with normal and abnormal biological processes. Conclusions: ProbFAST is a free accessible web-based application that enables MDE analysis on a global scale. It offers an efficient methodological approach for MDE analysis of a set of genes that are turned on and off related to functional information during the evolution of a tumor or tissue differentiation. ProbFAST server can be accessed at http://gdm.fmrp.usp.br/probfast.
Resumo:
Background: High-throughput molecular approaches for gene expression profiling, such as Serial Analysis of Gene Expression (SAGE), Massively Parallel Signature Sequencing (MPSS) or Sequencing-by-Synthesis (SBS) represent powerful techniques that provide global transcription profiles of different cell types through sequencing of short fragments of transcripts, denominated sequence tags. These techniques have improved our understanding about the relationships between these expression profiles and cellular phenotypes. Despite this, more reliable datasets are still necessary. In this work, we present a web-based tool named S3T: Score System for Sequence Tags, to index sequenced tags in accordance with their reliability. This is made through a series of evaluations based on a defined rule set. S3T allows the identification/selection of tags, considered more reliable for further gene expression analysis. Results: This methodology was applied to a public SAGE dataset. In order to compare data before and after filtering, a hierarchical clustering analysis was performed in samples from the same type of tissue, in distinct biological conditions, using these two datasets. Our results provide evidences suggesting that it is possible to find more congruous clusters after using S3T scoring system. Conclusion: These results substantiate the proposed application to generate more reliable data. This is a significant contribution for determination of global gene expression profiles. The library analysis with S3T is freely available at http://gdm.fmrp.usp.br/s3t/.S3T source code and datasets can also be downloaded from the aforementioned website.
Resumo:
Interethnic differences exist in disease prevalence, especially with regard to cancer and cardiovascular diseases, which involve altered expression or activity of matrix metalloproteinases (MMPs). The hypothesis being tested in this study is that interethnic differences exist between blacks and whites with regard to the distribution of genetic variants of MMP polymorphisms and haplotypes. We examined the distribution of polymorphisms of MMP-2 and MMP-9 genes in 177 black and 140 white subjects. We studied the following polymorphisms: the C(-1306)T in the promoter of the MMP-2 gene, the C(-1562)T and a microsatellite -90(CA)(14-24) in the promoter, and the Q279R in exon 6 of the MMP-9 gene. We have also compared our results with those from Hapmap or Seattle SNPs Projects and estimated the haplotype frequency in these two ethnic groups. The ""C'' allele for the C(-1306)T polymorphism was more common in blacks (91.5%) than in whites (80.4%; p<0.0001). The ""T'' allele for the C(-1562)T polymorphism was more common in blacks (15.0%) than in whites (8.9%; p=0.0279), as well as the alleles with >21 repeats for the -90(CA)(14-24) were more common in blacks than in whites (61.9% in blacks and 49.3% in whites; p=0.0017). We found no interethnic differences for the Q279R polymorphism. Moreover, two haplotypes that combine ""detrimental'' alleles were found at higher frequencies in blacks than in whites (31% vs. 16.4%, respectively; p<0.05). The interethnic differences being reported here replicate those previously found with smaller number of subjects in the Hapmap or Seattle SNPs data and may help explain the higher prevalence of cancer and cardiovascular diseases in blacks compared with whites. Our findings suggest a proportional significance of these polymorphisms in each ethnic group.
Resumo:
The identification of alternatively spliced transcripts has contributed to a better comprehension of developmental mechanisms, tissue-specific physiological processes and human diseases. Polymerase chain reaction amplification of alternatively spliced variants commonly leads to the formation of heteroduplexes as a result of base pairing involving exons common between the two variants. S1 nuclease cleaves single-stranded loops of heteroduplexes and also nicks the opposite DNA strand. In order to establish a strategy for mapping alternative splice-prone sites in the whole transcriptome, we developed a method combining the formation of heteroduplexes between 2 distinct splicing variants and S1 nuclease digestion. For 20 consensuses identified here using this methodology, 5 revealed a conserved splice site after inspection of the cDNA alignment against the human genome (exact splice sites). For 8 other consensuses, conserved splice sites were mapped at 2 to 30 bp from the border, called proximal splice sites; for the other 7 consensuses, conserved splice sites were mapped at 40 to 800 bp, called distal splice sites. These latter cases showed a nonspecific activity of S1 nuclease in digesting double-strand DNA. From the 20 consensuses identified here, 5 were selected for reverse transcription-polymerase chain reaction validation, confirming the splice sites. These data showed the potential of the strategy in mapping splice sites. However, the lack of specificity of the S1 nuclease enzyme is a significant obstacle that impedes the use of this strategy in large-scale studies.