957 results for human genome variation
Abstract:
Background: High-throughput molecular approaches for gene expression profiling, such as Serial Analysis of Gene Expression (SAGE), Massively Parallel Signature Sequencing (MPSS) or Sequencing-by-Synthesis (SBS), are powerful techniques that provide global transcription profiles of different cell types through sequencing of short fragments of transcripts, called sequence tags. These techniques have improved our understanding of the relationships between these expression profiles and cellular phenotypes. Despite this, more reliable datasets are still necessary. In this work, we present a web-based tool named S3T: Score System for Sequence Tags, to index sequenced tags in accordance with their reliability. This is done through a series of evaluations based on a defined rule set. S3T allows the identification and selection of tags considered more reliable for further gene expression analysis. Results: This methodology was applied to a public SAGE dataset. In order to compare data before and after filtering, a hierarchical clustering analysis was performed on samples from the same type of tissue, in distinct biological conditions, using these two datasets. Our results provide evidence suggesting that it is possible to find more congruous clusters after using the S3T scoring system. Conclusion: These results substantiate the proposed application to generate more reliable data. This is a significant contribution to the determination of global gene expression profiles. The library analysis with S3T is freely available at http://gdm.fmrp.usp.br/s3t/. S3T source code and datasets can also be downloaded from the aforementioned website.
Abstract:
Interethnic differences exist in disease prevalence, especially with regard to cancer and cardiovascular diseases, which involve altered expression or activity of matrix metalloproteinases (MMPs). The hypothesis being tested in this study is that interethnic differences exist between blacks and whites with regard to the distribution of genetic variants of MMP polymorphisms and haplotypes. We examined the distribution of polymorphisms of MMP-2 and MMP-9 genes in 177 black and 140 white subjects. We studied the following polymorphisms: the C(-1306)T in the promoter of the MMP-2 gene, the C(-1562)T and a microsatellite -90(CA)(14-24) in the promoter, and the Q279R in exon 6 of the MMP-9 gene. We have also compared our results with those from the HapMap or Seattle SNPs Projects and estimated the haplotype frequency in these two ethnic groups. The "C" allele for the C(-1306)T polymorphism was more common in blacks (91.5%) than in whites (80.4%; p<0.0001). The "T" allele for the C(-1562)T polymorphism was more common in blacks (15.0%) than in whites (8.9%; p=0.0279). Likewise, the alleles with >21 repeats for the -90(CA)(14-24) microsatellite were more common in blacks than in whites (61.9% in blacks and 49.3% in whites; p=0.0017). We found no interethnic differences for the Q279R polymorphism. Moreover, two haplotypes that combine "detrimental" alleles were found at higher frequencies in blacks than in whites (31% vs. 16.4%, respectively; p<0.05). The interethnic differences being reported here replicate those previously found with smaller numbers of subjects in the HapMap or Seattle SNPs data and may help explain the higher prevalence of cancer and cardiovascular diseases in blacks compared with whites. Our findings suggest a proportional significance of these polymorphisms in each ethnic group.
Abstract:
The identification of alternatively spliced transcripts has contributed to a better comprehension of developmental mechanisms, tissue-specific physiological processes and human diseases. Polymerase chain reaction amplification of alternatively spliced variants commonly leads to the formation of heteroduplexes as a result of base pairing involving exons common between the two variants. S1 nuclease cleaves single-stranded loops of heteroduplexes and also nicks the opposite DNA strand. In order to establish a strategy for mapping alternative splice-prone sites in the whole transcriptome, we developed a method combining the formation of heteroduplexes between 2 distinct splicing variants and S1 nuclease digestion. For 20 consensuses identified here using this methodology, 5 revealed a conserved splice site after inspection of the cDNA alignment against the human genome (exact splice sites). For 8 other consensuses, conserved splice sites were mapped at 2 to 30 bp from the border, called proximal splice sites; for the other 7 consensuses, conserved splice sites were mapped at 40 to 800 bp, called distal splice sites. These latter cases showed a nonspecific activity of S1 nuclease in digesting double-strand DNA. From the 20 consensuses identified here, 5 were selected for reverse transcription-polymerase chain reaction validation, confirming the splice sites. These data showed the potential of the strategy in mapping splice sites. However, the lack of specificity of the S1 nuclease enzyme is a significant obstacle that impedes the use of this strategy in large-scale studies.
Abstract:
Background: Myelodysplastic syndromes (MDS) are a group of clonal hematological disorders characterized by ineffective hematopoiesis with morphological evidence of marrow cell dysplasia resulting in peripheral blood cytopenia. Microarray technology has permitted a refined high-throughput mapping of the transcriptional activity in the human genome. Non-coding RNAs (ncRNAs) transcribed from intronic regions of genes are involved in a number of processes related to post-transcriptional control of gene expression, and in the regulation of exon-skipping and intron retention. Characterization of ncRNAs in progenitor cells and stromal cells of MDS patients could be strategic for understanding gene expression regulation in this disease. Methods: In this study, gene expression profiles of CD34(+) cells of 4 patients with MDS of refractory anemia with ringed sideroblasts (RARS) subgroup and stromal cells of 3 patients with MDS-RARS were compared with healthy individuals using 44 k combined intron-exon oligoarrays, which included probes for exons of protein-coding genes, and for non-coding RNAs transcribed from intronic regions in either the sense or antisense strands. Real-time RT-PCR was performed to confirm the expression levels of selected transcripts. Results: In CD34(+) cells of MDS-RARS patients, 216 genes were significantly differentially expressed (q-value <= 0.01) in comparison to healthy individuals, of which 65 (30%) were non-coding transcripts. In stromal cells of MDS-RARS, 12 genes were significantly differentially expressed (q-value <= 0.05) in comparison to healthy individuals, of which 3 (25%) were non-coding transcripts. Conclusions: These results demonstrated, for the first time, the differential ncRNA expression profile between MDS-RARS and healthy individuals, in CD34(+) cells and stromal cells, suggesting that ncRNAs may play an important role during the development of myelodysplastic syndromes.
Abstract:
Sequencing technologies and new bioinformatics tools have led to the complete sequencing of various genomes. However, information regarding the human transcriptome and its annotation is yet to be completed. The Human Cancer Genome Project, using ORESTES (open reading frame EST sequences) methodology, contributed to this objective by generating data from about 1.2 million expressed sequence tags. Approximately 30% of these sequences did not align to ESTs in the public databases and were considered no-match ORESTES. On the basis that a set of these ESTs could represent new transcripts, we constructed a cDNA microarray. This platform was used to hybridize against 12 different normal or tumor tissues. We identified 3421 transcribed regions not associated with annotated transcripts, representing 83.3% of the platform. The total number of differentially expressed sequences was 1007. Also, 28% of analyzed sequences could represent noncoding RNAs. Our data reinforce the knowledge of the human genome being pervasively transcribed, and point out molecular marker candidates for different cancers. To reinforce our data, we confirmed, by real-time PCR, the differential expression of three out of eight potential tumor markers in prostate tissues. Lists of the 1007 differentially expressed sequences and the 291 potentially noncoding tumor markers were provided.
Abstract:
Despite many successes of conventional DNA sequencing methods, some DNAs remain difficult or impossible to sequence. Unsequenceable regions occur in the genomes of many biologically important organisms, including the human genome. Such regions range in length from tens to millions of bases, and may contain valuable information such as the sequences of important genes. The authors have recently developed a technique that renders a wide range of problematic DNAs amenable to sequencing. The technique is known as sequence analysis via mutagenesis (SAM). This paper presents a number of algorithms for analysing and interpreting data generated by this technique.
Abstract:
Familial hyperaldosteronism type II (FH-II) is caused by adrenocortical hyperplasia or aldosteronoma or both and is frequently transmitted in an autosomal dominant fashion. Unlike FH type I (FH-I), which results from fusion of the CYP11B1 and CYP11B2 genes, hyperaldosteronism in FH-II is not glucocorticoid remediable. A large family with FH-II was used for a genome-wide search and its members were evaluated by measuring the aldosterone:renin ratio. In those with an increased ratio, FH-II was confirmed by fludrocortisone suppression testing. After excluding most of the genome, genetic linkage was identified with a maximum two-point lod score of 3.26 at theta = 0, between FH-II in this family and the polymorphic markers D7S511, D7S517, and GATA24F03 on chromosome 7, a region that corresponds to cytogenetic band 7p22. This is the first identified locus for FH-II; its molecular elucidation may provide further insight into the aetiology of primary aldosteronism.
Abstract:
Pheochromocytomas are tumors of the adrenal medulla originating in the chromaffin cells derived from the neural crest. Ten percent of these tumors are associated with the familial cancer syndromes multiple endocrine neoplasia type 2, von Hippel-Lindau disease (VHL), and, rarely, neurofibromatosis type 1, in which germ-line mutations have been identified in RET, VHL, and NF1, respectively. In both the sporadic and familial forms of pheochromocytoma, allelic loss at 1p, 3p, 17p, and 22q has been reported, yet the molecular pathogenesis of these tumors is largely unknown. Allelic loss at chromosome 1p has also been reported in other endocrine tumors, such as medullary thyroid cancer and tumors of the parathyroid gland, as well as in tumors of neural crest origin including neuroblastoma and malignant melanoma. In this study, we performed fine structure mapping of deletions at chromosome 1p in familial and sporadic pheochromocytomas to identify discrete regions likely housing tumor suppressor genes involved in the development of these tumors. Ten microsatellite markers spanning a region of ~70 cM (1pter to 1p34.3) were used to screen 20 pheochromocytomas from 19 unrelated patients for loss of heterozygosity (LOH). LOH was detected at five or more loci in 8 of 13 (61%) sporadic samples and at five or more loci in four of five (80%) tumor samples from patients with multiple endocrine neoplasia type 2. No LOH at 1p was detected in pheochromocytomas from two VHL patients. Analysis of the combined sporadic and familial tumor data suggested three possible regions of common somatic loss, designated as PC1 (D1S243 to D1S244), PC2 (D1S228 to D1S507), and PC3 (D1S507 toward the centromere). We propose that chromosome 1p may be the site of at least three putative tumor suppressor loci involved in the tumorigenesis of pheochromocytomas. At least one of these loci, PC2, spanning an interval of <3.8 cM, is likely to have a broader role in the development of endocrine malignancies.
Abstract:
Fragile sites appear visually as nonstaining gaps on chromosomes that are inducible by specific cell culture conditions. Expansion of CGG/CCG repeats has been shown to be the molecular basis of all five folate-sensitive fragile sites characterized molecularly so far, i.e., FRAXA, FRAXE, FRAXF, FRA11B, and FRA16A. In the present study we have refined the localization of the FRA10A folate-sensitive fragile site by fluorescence in situ hybridization. Sequence analysis of a BAC clone spanning FRA10A identified a single, imperfect, but polymorphic CGG repeat that is part of a CpG island in the 5'UTR of a novel gene named FRA10AC1. The number of CGG repeats varied in the population from 8 to 13. Expansions exceeding 200 repeat units were methylated in all FRA10A fragile site carriers tested. The FRA10AC1 gene consists of 19 exons and is transcribed in the centromeric direction from the FRA10A repeat. The major transcript of ~1450 nt is ubiquitously expressed and codes for a highly conserved protein, FRA10AC1, of unknown function. Several splice variants leading to alternative 3' ends were identified (particularly in testis). These give rise to FRA10AC1 proteins with altered COOH-termini. Immunofluorescence analysis of full-length, recombinant EGFP-tagged FRA10AC1 protein showed that it was present exclusively in the nucleoplasm. We show that the expression of FRA10A, in parallel to the other cloned folate-sensitive fragile sites, is caused by an expansion and subsequent methylation of an unstable CGG trinucleotide repeat. Taking advantage of three cSNPs within the FRA10AC1 gene, we demonstrate that one allele of the gene is not transcribed in a FRA10A carrier. Our data also suggest that in the heterozygous state FRA10A is likely a benign folate-sensitive fragile site. (C) 2004 Elsevier Inc. All rights reserved.
Abstract:
We report here genome sequences and comparative analyses of three closely related parasitoid wasps: Nasonia vitripennis, N. giraulti, and N. longicornis. Parasitoids are important regulators of arthropod populations, including major agricultural pests and disease vectors, and Nasonia is an emerging genetic model, particularly for evolutionary and developmental genetics. Key findings include the identification of a functional DNA methylation tool kit; hymenopteran-specific genes including diverse venoms; lateral gene transfers among Pox viruses, Wolbachia, and Nasonia; and the rapid evolution of genes involved in nuclear-mitochondrial interactions that are implicated in speciation. Newly developed genome resources advance Nasonia for genetic research, accelerate mapping and cloning of quantitative trait loci, and will ultimately provide tools and knowledge for further increasing the utility of parasitoids as pest insect-control agents.
Abstract:
In this paper we describe the assembly and restriction map of a 1.05-Mb cosmid contig spanning the candidate region for familial Mediterranean fever (FMF), a recessively inherited disorder of inflammation localized to 16p13.3. Using a combination of cosmid walking and screening for P1, PAC, BAC, and YAC clones, we have generated a contig of genomic clones spanning ~1050 kb that contains the FMF critical region. The map consists of 179 cosmid, 15 P1, 10 PAC, 3 BAC, and 17 YAC clones, anchored by 27 STS markers. Eight additional STSs have been developed from the ~700 kb immediately centromeric to this genomic region. Five of the 35 STSs are microsatellites that have not been previously reported. NotI and EcoRI mapping of the overlapping cosmids, hybridization of restriction fragments from cosmids to one another, and STS analyses have been used to validate the assembly of the contig. Our contig totally subsumes the 250-kb interval recently reported, by founder haplotype analysis, to contain the FMF gene. Thus, our high-resolution clone map provides an ideal resource for transcriptional mapping toward the eventual identification of this disease gene. (C) 1997 Academic Press.
Abstract:
Previously we found that levels of LRRC49 (leucine rich repeat containing 49; FLJ20156) transcripts were elevated in ER-positive breast tumors compared with ER-negative breast tumors. The LRRC49 gene is located on chromosome 15q23 in close proximity to the THAP10 (THAP domain containing 10) gene. These two genes have a bidirectional organization being arranged head-to-head on opposite strands, possibly sharing the same promoter region. Analysis of the promoter region of this gene pair revealed the presence of potential estrogen response elements (EREs), suggesting the potential of this promoter to be under the control of estrogen. We used quantitative real-time PCR (qPCR) to evaluate the expression of LRRC49 and THAP10 in a series of 72 primary breast tumors, and found reduced LRRC49 and THAP10 expression in 61 and 46% of the primary breast tumors analyzed, respectively. In addition, the occurrence of LRRC49/THAP10 promoter hypermethylation was examined by methylation specific PCR (MSP) in a sub-group of the breast tumors. Hypermethylation was observed in 57.5% of the breast tumors analyzed, and the levels of mRNA expression of both genes were inversely correlated with promoter hypermethylation. We investigated the effects of 17 beta-estradiol on LRRC49 and THAP10 expression in MCF-7 breast cancer cells and found both transcripts to be up-regulated 2- to 3-fold upon 17 beta-estradiol treatment. Our results show that the transcripts of LRRC49/THAP10 bidirectional gene pair are co-regulated by estrogen and that hypermethylation of the bidirectional promoter region simultaneously silences both genes. Further studies will be necessary to elucidate the role of LRRC49/THAP10 down-regulation in breast cancer.
Abstract:
The identification of genes responsible for the rare cases of familial leukemia may afford insight into the mechanism underlying the more common sporadic occurrences. Here we test a single family with 11 relevant meioses transmitting autosomal dominant acute myelogenous leukemia (AML) and myelodysplasia for linkage to three potential candidate loci. In a different family with inherited AML, linkage to chromosome 21q22.1-22.2 was recently reported; we exclude linkage to 21q22.1-22.2, demonstrating that familial AML is a heterogeneous disease. After reviewing familial leukemia and observing anticipation in the form of a declining age of onset with each generation, we had proposed 9p21-22 and 16q22 as additional candidate loci. Whereas linkage to 9p21-22 can be excluded, the finding of a maximum two-point LOD score of 2.82 with the microsatellite marker D16S522 at a recombination fraction theta = 0 provides evidence supporting linkage to 16q22. Haplotype analysis reveals a 23.5-cM (17.9-Mb) commonly inherited region among all affected family members extending from D16S451 to D16S289. In order to extract maximum linkage information with missing individuals, incomplete informativeness with individual markers in this interval, and possible deviance from strict autosomal dominant inheritance, we performed nonparametric linkage analysis (NPL) and found a maximum NPL statistic corresponding to a P-value of .00098, close to the maximum conditional probability of linkage expected for a pedigree with this structure. Mutational analysis in this region specifically excludes expansion of the AT-rich minisatellite repeat FRA16B fragile site and the CAG trinucleotide repeat in the E2F-4 transcription factor. The "repeat expansion detection" method, capable of detecting dynamic mutation associated with anticipation, more generally excludes large CAG repeat expansion as a cause of leukemia in this family.
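As a rough illustration of the two-point LOD score mentioned in the abstract above, the following Python sketch computes the textbook phase-known LOD for a given number of informative meioses and recombinants. It is not the authors' analysis: the function name `two_point_lod` and the assumption that every meiosis is fully informative and phase-known are simplifications introduced here for illustration only.

```python
import math


def two_point_lod(n_meioses: int, n_recombinants: int, theta: float) -> float:
    """Phase-known two-point LOD score (hypothetical helper, textbook formula).

    LOD(theta) = log10( theta^k * (1 - theta)^(n - k) / 0.5^n )
    where n is the number of informative meioses and k the number of
    recombinants. Assumes every meiosis is fully informative.
    """
    if not 0.0 <= theta <= 0.5:
        raise ValueError("theta must lie in [0, 0.5]")
    if not 0 <= n_recombinants <= n_meioses:
        raise ValueError("n_recombinants must lie in [0, n_meioses]")

    n_nonrecombinants = n_meioses - n_recombinants
    # Likelihood of the data under linkage at recombination fraction theta.
    likelihood = theta ** n_recombinants * (1.0 - theta) ** n_nonrecombinants
    if likelihood == 0.0:
        # Any recombinant at theta = 0 makes the linkage hypothesis impossible.
        return float("-inf")
    # Compare against the null hypothesis of free recombination (theta = 0.5).
    return math.log10(likelihood) - n_meioses * math.log10(0.5)


if __name__ == "__main__":
    # Idealized ceiling for 11 fully informative meioses with no recombinants.
    print(round(two_point_lod(11, 0, 0.0), 2))
```

Under these idealized assumptions, 11 meioses with no recombinants give a ceiling of 11 x log10(2), about 3.31. The reported maximum of 2.82 is below that ceiling, which would be consistent with the incomplete marker informativeness the abstract mentions, although the sketch cannot confirm the cause.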
Abstract:
Cancer/testis antigens (CTAs) are immunogenic proteins with a restricted expression pattern in normal tissues and aberrant expression in different types of tumors, and are considered promising candidates for immunotherapy. We used the alignment between EST sequences and the human genome sequence to identify novel CT genes. By examining the EST tissue composition of known CT clusters, we defined parameters for the selection of 1184 EST clusters corresponding to putative CT genes. The expression pattern of 70 CT gene candidates was evaluated by RT-PCR in 21 normal tissues, 17 tumor cell lines and 160 primary tumors. We were able to identify 4 CT genes expressed in different types of tumors. The presence of antibodies against the protein encoded by 1 of these 4 CT genes (FAM46D) was exclusively detected in plasma samples from cancer patients. Due to its restricted expression pattern and immunogenicity, FAM46D represents a novel target for cancer immunotherapy. (c) 2009 Elsevier Inc. All rights reserved.
Abstract:
BACKGROUND & AIMS: A sustained virologic response (SVR) to therapy for hepatitis C virus (HCV) infection is defined as the inability to detect HCV RNA 24 weeks after completion of treatment. Although small studies have reported that the SVR is durable and lasts for long periods, it has not been conclusively shown. METHODS: The durability of treatment responses was examined in patients originally enrolled in one of 9 randomized multicenter trials (n = 1343). The study included patients who received pegylated interferon (peginterferon) alfa-2a alone (n = 166) or in combination with ribavirin (n = 1077, including 79 patients with normal alanine aminotransferase levels and 100 patients who were coinfected with human immunodeficiency virus and HCV) and whose serum samples were negative for HCV RNA (<50 IU/mL) at their final assessment. Patients were assessed annually, from the date of last treatment, for a mean of 3.9 years (range, 0.8-7.1 years). RESULTS: Most patients (99.1%) who achieved an SVR had undetectable levels of HCV RNA in serum samples throughout the follow-up period. Serum samples from 0.9% of the patients contained HCV RNA a mean of 1.8 years (range, 1.1-2.9 years) after treatment ended. It is not clear if these patients were reinfected or experienced a relapse. CONCLUSIONS: In a large cohort of patients monitored for the durability of an SVR, the SVR was maintained for almost 4 years after treatment with peginterferon alfa-2a alone or in combination with ribavirin. In patients with chronic hepatitis C infection, the SVR is durable and these patients should be considered as cured.