986 resultados para Genomic Regions


Relevância:

30.00% 30.00%

Publicador:

Resumo:

An important topic in genomic sequence analysis is the identification of protein coding regions. In this context, several coding DNA model-independent methods based on the occurrence of specific patterns of nucleotides at coding regions have been proposed. Nonetheless, these methods have not been completely suitable due to their dependence on an empirically predefined window length required for a local analysis of a DNA region. We introduce a method based on a modified Gabor-wavelet transform (MGWT) for the identification of protein coding regions. This novel transform is tuned to analyze periodic signal components and presents the advantage of being independent of the window length. We compared the performance of the MGWT with other methods by using eukaryote data sets. The results show that MGWT outperforms all assessed model-independent methods with respect to identification accuracy. These results indicate that the source of at least part of the identification errors produced by the previous methods is the fixed working scale. The new method not only avoids this source of errors but also makes a tool available for detailed exploration of the nucleotide occurrence.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Despite the success of conventional Sanger sequencing, significant regions of many genomes still present major obstacles to sequencing. Here we propose a novel approach with the potential to alleviate a wide range of sequencing difficulties. The technique involves extracting target DNA sequence from variants generated by introduction of random mutations. The introduction of mutations does not destroy original sequence information, but distributes it amongst multiple variants. Some of these variants lack problematic features of the target and are more amenable to conventional sequencing. The technique has been successfully demonstrated with mutation levels up to an average 18% base substitution and has been used to read previously intractable poly(A), AT-rich and GC-rich motifs.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

P>Approximately 50% of all carriers of 2q21-q31 deletions present epileptic seizures. The band 2q24 constitutes the smallest commonly deleted segment in these patients, and contains the voltage-gated sodium channel genes SCN1A and SCN2A, associated with Dravet syndrome and benign familial neonatal-infantile seizures, respectively. A further putative locus involving epilepsy in the region was previously identified through disruption of the SLC4A10 gene by translocation. In the course of performing high-resolution DNA copy number analyses on syndromic mentally impaired individuals, we encountered three patients with overlapping deletions in chromosome region 2q24. Two of these patients exhibited epileptic seizures in addition to mental deficiency. The deletion in one of the epileptic patients did not include the SCN cluster, demonstrating that a less severe form of epilepsy maps to an adjacent genomic region. This second region comprises about 3 Mb and contains the candidate gene SLC4A10, providing further support for the potential role of this gene in epilepsy.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Non-syndromic cleft lip with or without cleft palate (NS CL/P) is a complex disease in which heritability estimates vary widely depending on the population studied. To evaluate the importance of genetic contribution to NS CL/P in the Brazilian population, we conducted a study with 1,042 families from five different locations (Santarem, Fortaleza, Barbalha, Maceio, and Rio de Janeiro). We also evaluated the role of consanguinity and ethnic background. The proportion of familial cases varied significantly across locations, with the highest values found in Santarem (44%) and the lowest in Maceio (23%). Heritability estimates showed a higher genetic contribution to NS CL/P in Barbalha (85%), followed by Santarem (71%), Rio de Janeiro (70%), Fortaleza (64%), and Maceio (45%). Ancestry was not correlated with the occurrence of NS CL/P or with the variability in heritability. Only in Rio de Janeiro was the coefficient of inbreeding significantly larger in NS CL/P families than in the local population. Recurrence risk for the total sample was approximately 1.5-1.6%, varying according to the location studied (0.6-0.7% in Maceio to 2.2-2.8% in Barbalha). Our findings show that the degree of genetic contribution to NS CL/P varies according to the geographic region studied, and this difference cannot be attributed to consanguinity or ancestry. These findings suggest that Barbalha is a promising region for genetic studies. The data presented here will be useful in interpreting results from molecular analyses and show that care must be taken when pooling samples from different populations for association studies. (C) 2011 Wiley-Liss, Inc.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The molecular pathology of meningiomas and shwannomas involve the inactivation of the NF2 gene to generate grade I tumors. Genomic losses at 1p and 14q are observed in both neoplasms, although more frequently in meningiomas. The inactivation of unidentified genes located in these regions appears associated with tumor progression in meningiomas, but no clues to its molecular/clinical meaning are available in schwannomas. Recent microarray gene expression studies have demonstrated the existence of molecular subgroups in both entities. In the present study, we correlated the presence of genomic deletions at 1p, 14q, and 22q with the expression patterns of 96 tumor-related genes obtained by cDNA low-density microarrays in a series of 65 tumors including 42 meningiomas and 23 schwannomas. Two expression pattern groups were identified by cDNA mycroarray analysis when compared to the expression pattern in normal control RNA in both meningiomas and schwannomas, each one with patterns similar and different from the normal control. Meningioma and schwannoma subgroups differed in the expression of 38 and 16 genes, respectively. Using MLPA and microsatellites, we identified genomic losses at 1p, 14q, and 22q at nonrandom frequencies (12.5-69%) in meningiomas and schwannomas. Losses at 22q were almost equally frequent in both molecular expression subgroups in both neoplasms. However, deletions at 1p and 14q accumulated in meningiomas with a gene expression pattern different from the normal pattern, whereas the inverse situation occurred in schwannomas. Those anomalies characterized the schwannomas with expression pattern similar to the normal control. These findings suggest that deletions at 1p and 14q enhance the development of an abnormal tumor-related gene expression pattern in meningiomas, but this fact is not corroborated in schwannomas. (C) 2010 Elsevier Inc. All rights reserved.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

We have previously reported successful trans-complementation of defective Kunjin virus genomic RNAs with a range of large lethal deletions in the nonstructural genes NSI, NS3, and NS5 (A. A. Khromykh et al., J. Virol. 74:3253-3263, 2000). In this study we have mapped further the minimal region in the NS5 gene essential for efficient trans-complementation of genome-length RNAs in repBHK cells to the first 316 of the 905 codons. To allow amplification and easy detection of complemented defective RNAs with deletions apparently affecting virus assembly, we have developed a dual replicon complementation system. In this system defective replicon RNAs with a deletion(s) in the nonstructural genes also encoded the puromycin resistance gene (PAC gene) and the reporter gene for beta-galactosidase (beta-Gal). Complementation of these defective replicon RNAs in repBHK cells resulted in expression of PAC and beta-Gal which allowed establishment of cell lines stably producing replicating defective RNAs by selection with puromycin and comparison of replication efficiencies of complemented defective RNAs by beta-Gal assay. Using this system we demonstrated that deletions in the C-terminal 434 codons of NS3 (codons 178 to 611) were complemented for RNA replication, while any deletions in the first 178 codons were not. None of the genome-length RNAs containing deletions in NS3 shown to be complementable for RNA replication produced secreted defective viruses during complementation in repBHK cells. In contrast, structural proteins produced from these complemented defective RNAs were able to package helper replicon RNA. The results define minimal regions in the NS3 and NS5 genes essential for the formation of complementable replication complex and show a requirement of NS3 in cis for virus assembly.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

To define the location of potential oncogenes and tumor suppressor genes in ocular melanoma we carried out comparative genomic hybridization (CGH) analysis on a population-based series of 25 formalin-fixed, paraffin-embedded primary tumors comprising 17 choroidal, 2 ciliary body, 4 iris, and 2 conjunctival melanomas. Twelve (48%) of the 25 melanomas showed no chromosomal changes and 13 (52%) had at least one chromosomal gain or loss. The mean number of CGH changes in all tumors was 3.3, with similar mean numbers of chromosomal gains (1.5) and losses (1.8). The highest number of chromosomal changes (i.e., nine) occurred in a conjunctival melanoma and included four changes not observed in tumors at any other ocular site (gains in 22q and 11p and losses in 6p and 17p). The most frequent gains in all primary ocular melanomas were on chromosome arm 8q (69%), 6p (31%) and 8p (23%) and the most frequent losses were on 6q (38%), 10q (23%), and 16q (23%). The most common pairing was gain in 8p and gain in 8q, implying a whole chromosome copy number increase; gains in 8p occurred only in conjunction with gains in 8q. The smallest regions of copy number alteration were mapped to gain of 8q21 and loss of 6q21, 10q21, and 16q22. Sublocalization of these chromosomal changes to single-band resolution should accelerate the identification of genes involved in the genesis of ocular melanoma.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Our previous studies have shown that two distinct genotypes of Sindbis (SIN) virus occur in Australia. One of these, the Oriental/Australian type, circulates throughout most of the Australian continent, whereas the recently identified south-west (SW) genetic type appears to be restricted to a distinct geographic region located in the temperate south-west of Australia. We have now determined the complete nucleotide and translated amino acid sequences of a SW isolate of SIN virus (SW6562) and performed comparative analyses with other SIN viruses at the genomic level. The genome of SW6562 is 11,569 nucleotides in length, excluding the cap nucleotide and poly (A) tail. Overall this virus differs from the prototype SIN virus (strain AR339) by 23% in nucleotide sequence and 12.5% in amino acid sequence. Partial sequences of four regions of the genome of four SW isolates were determined and compared with the corresponding sequences from a number of SIN isolates from different regions of the World. These regions are the non-structural protein (nsP3), the E2 gene, the capsid gene, and the repeated sequence elements (RSE) of the 3'UTR. These comparisons revealed that the SW SIN viruses were more closely related to South African and European strains than to other Australian isolates of SIN virus. Thus the SW genotype of SIN virus may have been introduced into this region of Australia by viremic humans or migratory birds and subsequently evolved independently in the region. The sequence data also revealed that the SW genotype contains a unique deletion in the RSE of the 3'UTR region of the genome. Previous studies have shown that deletions in this region of the SIN genome can have significant effects on virus replication in mosquito and avian cells, which may explain the restricted distribution of this genotype of SIN virus.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The genomic sequences of the Envelope-Non-Structural protein 1 junction region (E/NS1) of 84 DEN-1 and 22 DEN-2 isolates from Brazil were determined. Most of these strains were isolated in the period from 1995 to 2001 in endemic and regions of recent dengue transmission in São Paulo State. Sequence data for DEN-1 and DEN-2 utilized in phylogenetic and split decomposition analyses also include sequences deposited in GenBank from different regions of Brazil and of the world. Phylogenetic analyses were done using both maximum likelihood and Bayesian approaches. Results for both DEN-1 and DEN-2 data are ambiguous, and support for most tree bipartitions are generally poor, suggesting that E/NS1 region does not contain enough information for recovering phylogenetic relationships among DEN-1 and DEN-2 sequences used in this study. The network graph generated in the split decomposition analysis of DEN-1 does not show evidence of grouping sequences according to country, region and clades. While the network for DEN-2 also shows ambiguities among DEN-2 sequences, it suggests that Brazilian sequences may belong to distinct subtypes of genotype III.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Soft tissue sarcomas (STS) with complex genomic profiles (50% of all STS) are predominantly composed of spindle cell/pleomorphic sarcomas, including leiomyosarcoma, myxofibrosarcoma, pleomorphic liposarcoma, pleomorphic rhabdomyosarcoma, malignant peripheral nerve sheath tumor, angiosarcoma, extraskeletal osteosarcoma, and spindle cell/pleomorphic unclassified sarcoma (previously called spindle cell/pleomorphic malignant fibrous histiocytoma). These neoplasms show, characteristically, gains and losses of numerous chromosomes or chromosome regions, as well as amplifications. Many of them share recurrent aberrations (e.g., gain of 5p13-p15) that seem to play a significant role in tumor progression and/or metastatic dissemination. In this paper, we review the cytogenetic, molecular genetic, and clinicopathologic characteristics of the most common STS displaying complex genomic profiles. Features of diagnostic or prognostic relevance will be discussed when needed.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Splenic marginal zone lymphoma (SMZL) is a low grade B-cell non-Hodgkin's lymphoma. The molecular pathology of this entity remains poorly understood. To characterise this lymphoma at the molecular level, we performed an integrated analysis of 1) genome wide genetic copy number alterations 2) gene expression profiles and 3) epigenetic DNA methylation profiles.We have previously shown that SMZL is characterised by recurrent alterations of chromosomes 7q, 6q, 3q, 9q and 18; however, gene resolution oligonucleotide array comparative genomic hybridisation did not reveal evidence of cryptic amplification or deletion in these regions. The most frequently lost 7q32 region contains a cluster of miRNAs. qRT-PCR revealed that three of these (miR-182/96/183) show underexpression in SMZL, and miR-182 is somatically mutated in >20% of cases of SMZL, as well as in >20% of cases of follicular lymphoma, and between 5-15% of cases of chronic lymphocytic leukaemia, MALT-lymphoma and hairy cell leukaemia. We conclude that miR-182 is a strong candidate novel tumour suppressor miRNA in lymphoma.The overall gene expression signature of SMZL was found to be strongly distinct fromthose of other lymphomas. Functional analysis of gene expression data revealed SMZL to be characterised by abnormalities in B-cell receptor signalling (especially through the CD19/21-PI3K/AKT pathway) and apoptotic pathways. In addition, genes involved in the response to viral infection appeared upregulated. SMZL shows a unique epigenetic profile, but analysis of differentially methylated genes showed few with methylation related transcriptional deregulation, suggesting that DNA methylation abnormalities are not a critical component of the SMZL malignant phenotype.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Functional RNA structures play an important role both in the context of noncoding RNA transcripts as well as regulatory elements in mRNAs. Here we present a computational study to detect functional RNA structures within the ENCODE regions of the human genome. Since structural RNAs in general lack characteristic signals in primary sequence, comparative approaches evaluating evolutionary conservation of structures are most promising. We have used three recently introduced programs based on either phylogenetic-stochastic context-free grammar (EvoFold) or energy directed folding (RNAz and AlifoldZ), yielding several thousand candidate structures (corresponding to approximately 2.7% of the ENCODE regions). EvoFold has its highest sensitivity in highly conserved and relatively AU-rich regions, while RNAz favors slightly GC-rich regions, resulting in a relatively small overlap between methods. Comparison with the GENCODE annotation points to functional RNAs in all genomic contexts, with a slightly increased density in 3'-UTRs. While we estimate a significant false discovery rate of approximately 50%-70% many of the predictions can be further substantiated by additional criteria: 248 loci are predicted by both RNAz and EvoFold, and an additional 239 RNAz or EvoFold predictions are supported by the (more stringent) AlifoldZ algorithm. Five hundred seventy RNAz structure predictions fall into regions that show signs of selection pressure also on the sequence level (i.e., conserved elements). More than 700 predictions overlap with noncoding transcripts detected by oligonucleotide tiling arrays. One hundred seventy-five selected candidates were tested by RT-PCR in six tissues, and expression could be verified in 43 cases (24.6%).

Relevância:

30.00% 30.00%

Publicador:

Resumo:

BACKGROUND: The Nuclear Factor I (NFI) family of DNA binding proteins (also called CCAAT box transcription factors or CTF) is involved in both DNA replication and gene expression regulation. Using chromatin immuno-precipitation and high throughput sequencing (ChIP-Seq), we performed a genome-wide mapping of NFI DNA binding sites in primary mouse embryonic fibroblasts. RESULTS: We found that in vivo and in vitro NFI DNA binding specificities are indistinguishable, as in vivo ChIP-Seq NFI binding sites matched predictions based on previously established position weight matrix models of its in vitro binding specificity. Combining ChIP-Seq with mRNA profiling data, we found that NFI preferentially associates with highly expressed genes that it up-regulates, while binding sites were under-represented at expressed but unregulated genes. Genomic binding also correlated with markers of transcribed genes such as histone modifications H3K4me3 and H3K36me3, even outside of annotated transcribed loci, implying NFI in the control of the deposition of these modifications. Positional correlation between + and - strand ChIP-Seq tags revealed that, in contrast to other transcription factors, NFI associates with a nucleosomal length of cleavage-resistant DNA, suggesting an interaction with positioned nucleosomes. In addition, NFI binding prominently occurred at boundaries displaying discontinuities in histone modifications specific of expressed and silent chromatin, such as loci submitted to parental allele-specific imprinted expression. CONCLUSIONS: Our data thus suggest that NFI nucleosomal interaction may contribute to the partitioning of distinct chromatin domains and to epigenetic gene expression regulation.NFI ChIP-Seq and input control DNA data were deposited at Gene Expression Omnibus (GEO) repository under accession number GSE15844. Gene expression microarray data for mouse embryonic fibroblasts are on GEO accession number GSE15871.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

In the last decade microsatellites have become one of the most useful genetic markers used in a large number of organisms due to their abundance and high level of polymorphism. Microsatellites have been used for individual identification, paternity tests, forensic studies and population genetics. Data on microsatellite abundance comes preferentially from microsatellite enriched libraries and DNA sequence databases. We have conducted a search in GenBank of more than 16,000 Schistosoma mansoni ESTs and 42,000 BAC sequences. In addition, we obtained 300 sequences from CA and AT microsatellite enriched genomic libraries. The sequences were searched for simple repeats using the RepeatMasker software. Of 16,022 ESTs, we detected 481 (3%) sequences that contained 622 microsatellites (434 perfect, 164 imperfect and 24 compounds). Of the 481 ESTs, 194 were grouped in 63 clusters containing 2 to 15 ESTs per cluster. Polymorphisms were observed in 16 clusters. The 287 remaining ESTs were orphan sequences. Of the 42,017 BAC end sequences, 1,598 (3.8%) contained microsatellites (2,335 perfect, 287 imperfect and 79 compounds). The 1,598 BAC end sequences 80 were grouped into 17 clusters containing 3 to 17 BAC end sequences per cluster. Microsatellites were present in 67 out of 300 sequences from microsatellite enriched libraries (55 perfect, 38 imperfect and 15 compounds). From all of the observed loci 55 were selected for having the longest perfect repeats and flanking regions that allowed the design of primers for PCR amplification. Additionally we describe two new polymorphic microsatellite loci.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

BACKGROUND: After age, sex is the most important risk factor for coronary artery disease (CAD). The mechanism through which women are protected from CAD is still largely unknown, but the observed sex difference suggests the involvement of the reproductive steroid hormone signaling system. Genetic association studies of the gene-encoding Estrogen Receptor α (ESR1) have shown conflicting results, although only a limited range of variation in the gene has been investigated. METHODS AND RESULTS: We exploited information made available by advanced new methods and resources in complex disease genetics to revisit the question of ESR1's role in risk of CAD. We performed a meta-analysis of 14 genome-wide association studies (CARDIoGRAM discovery analysis, N=≈87,000) to search for population-wide and sex-specific associations between CAD risk and common genetic variants throughout the coding, noncoding, and flanking regions of ESR1. In addition to samples from the MIGen (N=≈6000), WTCCC (N=≈7400), and Framingham (N=≈3700) studies, we extended this search to a larger number of common and uncommon variants by imputation into a panel of haplotypes constructed using data from the 1000 Genomes Project. Despite the widespread expression of ERα in vascular tissues, we found no evidence for involvement of common or low-frequency genetic variation throughout the ESR1 gene in modifying risk of CAD, either in the general population or as a function of sex. CONCLUSIONS: We suggest that future research on the genetic basis of sex-related differences in CAD risk should initially prioritize other genes in the reproductive steroid hormone biosynthesis system.