933 results for Italian Regions


Relevance:

20.00% 20.00%

Publisher:

Abstract:

Arising from either retrotransposition or genomic duplication of functional genes, pseudogenes are “genomic fossils” valuable for exploring the dynamics and evolution of genes and genomes. Pseudogene identification is an important problem in computational genomics, and is also critical for obtaining an accurate picture of a genome’s structure and function. However, no consensus computational scheme for defining and detecting pseudogenes has been developed thus far. As part of the ENCyclopedia Of DNA Elements (ENCODE) project, we have compared several distinct pseudogene annotation strategies and found that different approaches and parameters often resulted in rather distinct sets of pseudogenes. We subsequently developed a consensus approach for annotating pseudogenes (derived from protein coding genes) in the ENCODE regions, resulting in 201 pseudogenes, two-thirds of which originated from retrotransposition. A survey of orthologs for these pseudogenes in 28 vertebrate genomes showed that a significant fraction (∼80%) of the processed pseudogenes are primate-specific sequences, highlighting the increasing retrotransposition activity in primates. Analysis of sequence conservation and variation also demonstrated that most pseudogenes evolve neutrally, and processed pseudogenes appear to have lost their coding potential immediately or soon after their emergence. In order to explore the functional implication of pseudogene prevalence, we have extensively examined the transcriptional activity of the ENCODE pseudogenes. We performed a systematic series of pseudogene-specific RACE analyses. These, together with complementary evidence derived from tiling microarrays and high throughput sequencing, demonstrated that at least a fifth of the 201 pseudogenes are transcribed in one or more cell lines or tissues.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

Functional RNA structures play an important role both in the context of noncoding RNA transcripts and as regulatory elements in mRNAs. Here we present a computational study to detect functional RNA structures within the ENCODE regions of the human genome. Since structural RNAs in general lack characteristic signals in primary sequence, comparative approaches evaluating evolutionary conservation of structures are most promising. We have used three recently introduced programs based on either phylogenetic–stochastic context-free grammar (EvoFold) or energy directed folding (RNAz and AlifoldZ), yielding several thousand candidate structures (corresponding to ∼2.7% of the ENCODE regions). EvoFold has its highest sensitivity in highly conserved and relatively AU-rich regions, while RNAz favors slightly GC-rich regions, resulting in a relatively small overlap between methods. Comparison with the GENCODE annotation points to functional RNAs in all genomic contexts, with a slightly increased density in 3′-UTRs. While we estimate a significant false discovery rate of ∼50%–70%, many of the predictions can be further substantiated by additional criteria: 248 loci are predicted by both RNAz and EvoFold, and an additional 239 RNAz or EvoFold predictions are supported by the (more stringent) AlifoldZ algorithm. Five hundred seventy RNAz structure predictions fall into regions that show signs of selection pressure also on the sequence level (i.e., conserved elements). More than 700 predictions overlap with noncoding transcripts detected by oligonucleotide tiling arrays. One hundred seventy-five selected candidates were tested by RT-PCR in six tissues, and expression could be verified in 43 cases (24.6%).

Relevance:

20.00% 20.00%

Publisher:

Abstract:

For the ∼1% of the human genome in the ENCODE regions, only about half of the transcriptionally active regions (TARs) identified with tiling microarrays correspond to annotated exons. Here we categorize this large amount of “unannotated transcription.” We use a number of disparate features to classify the 6988 novel TARs—array expression profiles across cell lines and conditions, sequence composition, phylogenetic profiles (presence/absence of syntenic conservation across 17 species), and locations relative to genes. In the classification, we first filter out TARs with unusual sequence composition and those likely resulting from cross-hybridization. We then associate some of those remaining with proximal exons having correlated expression profiles. Finally, we cluster unclassified TARs into putative novel loci, based on similar expression and phylogenetic profiles. To encapsulate our classification, we construct a Database of Active Regions and Tools (DART.gersteinlab.org). DART has special facilities for rapidly handling and comparing many sets of TARs and their heterogeneous features, synchronizing across builds, and interfacing with other resources. Overall, we find that ∼14% of the novel TARs can be associated with known genes, while ∼21% can be clustered into ∼200 novel loci. We observe that TARs associated with genes are enriched in the potential to form structural RNAs and many novel TAR clusters are associated with nearby promoters. To benchmark our classification, we design a set of experiments for testing the connectivity of novel TARs. Overall, we find that 18 of the 46 connections tested validate by RT-PCR and four of five sequenced PCR products confirm connectivity unambiguously.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

This report presents systematic empirical annotation of transcript products from 399 annotated protein-coding loci across the 1% of the human genome targeted by the Encyclopedia of DNA Elements (ENCODE) pilot project using a combination of 5' rapid amplification of cDNA ends (RACE) and high-density resolution tiling arrays. We identified previously unannotated and often tissue- or cell-line-specific transcribed fragments (RACEfrags), both 5' distal to the annotated 5' terminus and internal to the annotated gene bounds for the vast majority (81.5%) of the tested genes. Half of the distal RACEfrags span large segments of genomic sequences away from the main portion of the coding transcript and often overlap with the upstream-annotated gene(s). Notably, at least 20% of the resultant novel transcripts have changes in their open reading frames (ORFs), most of them fusing ORFs of adjacent transcripts. A significant fraction of distal RACEfrags show expression levels comparable to those of known exons of the same locus, suggesting that they are not part of very minority splice forms. These results have significant implications concerning (1) our current understanding of the architecture of protein-coding genes; (2) our views on locations of regulatory regions in the genome; and (3) the interpretation of sequence polymorphisms mapping to regions hitherto considered to be "noncoding," ultimately relating to the identification of disease-related sequence alterations.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

Descriptors based on Molecular Interaction Fields (MIF) are highly suitable for drug discovery, but their size (thousands of variables) often limits their application in practice. Here we describe a simple and fast computational method that extracts from a MIF a handful of highly informative points (hot spots) which summarize the most relevant information. The method was specifically developed for drug discovery, is fast, and does not require human supervision, being suitable for its application on very large series of compounds. The quality of the results has been tested by running the method on the ligand structure of a large number of ligand-receptor complexes and then comparing the position of the selected hot spots with actual atoms of the receptor. As an additional test, the hot spots obtained with the novel method were used to obtain GRIND-like molecular descriptors which were compared with the original GRIND. In both cases the results show that the novel method is highly suitable for describing ligand-receptor interactions and compares favorably with other state-of-the-art methods.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

Background: The analysis of the promoter sequence of genes with similar expression patterns is a basic tool to annotate common regulatory elements. Multiple sequence alignments are the basis of most comparative approaches. The characterization of regulatory regions from coexpressed genes at the sequence level, however, does not yield satisfactory results on many occasions, as promoter regions of genes sharing similar expression programs often do not show nucleotide sequence conservation. Results: In a recent approach to circumvent this limitation, we proposed to align the maps of predicted transcription factors (referred to as TF-maps) instead of the nucleotide sequence of two related promoters, taking into account the label of the corresponding factor and the position in the primary sequence. We have now extended the basic algorithm to permit multiple promoter comparisons using the progressive alignment paradigm. In addition, non-collinear conservation blocks might now be identified in the resulting alignments. We have optimized the parameters of the algorithm in a small, but well-characterized collection of human-mouse-chicken-zebrafish orthologous gene promoters. Conclusion: Results in this dataset indicate that TF-map alignments are able to detect high-level regulatory conservation at the promoter and the 3'UTR gene regions, which cannot be detected by the typical sequence alignments. Three particular examples are introduced here to illustrate the power of the multiple TF-map alignments to characterize conserved regulatory elements in absence of sequence similarity. We consider this kind of approach can be extremely useful in the future to annotate potential transcription factor binding sites on sets of co-regulated genes from high-throughput expression experiments.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

Statewide and Regional projected industry employment 2002 - 2012

Relevance:

20.00% 20.00%

Publisher:

Abstract:

We address the problem of comparing and characterizing the promoter regions of genes with similar expression patterns. This remains a challenging problem in sequence analysis, because often the promoter regions of co-expressed genes do not show discernible sequence conservation. In our approach, thus, we have not directly compared the nucleotide sequence of promoters. Instead, we have obtained predictions of transcription factor binding sites, annotated the predicted sites with the labels of the corresponding binding factors, and aligned the resulting sequences of labels—to which we refer here as transcription factor maps (TF-maps). To obtain the global pairwise alignment of two TF-maps, we have adapted an algorithm initially developed to align restriction enzyme maps. We have optimized the parameters of the algorithm in a small, but well-curated, collection of human–mouse orthologous gene pairs. Results in this dataset, as well as in an independent much larger dataset from the CISRED database, indicate that TF-map alignments are able to uncover conserved regulatory elements, which cannot be detected by the typical sequence alignments.
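The idea of aligning sequences of factor labels rather than nucleotides can be illustrated with a minimal sketch. This is not the algorithm described in the abstract, which adapts a restriction-map alignment method and also uses site positions; it is a plain Needleman-Wunsch global alignment over label lists. The function name `align_labels`, the scoring values, and the example factor labels are all assumptions chosen for illustration.

```python
from typing import List, Optional, Tuple

Pair = Tuple[Optional[str], Optional[str]]


def align_labels(a: List[str], b: List[str],
                 match: int = 2, mismatch: int = -1,
                 gap: int = -1) -> Tuple[int, List[Pair]]:
    """Globally align two lists of factor labels (hypothetical scoring)."""
    n, m = len(a), len(b)

    def sub(i: int, j: int) -> int:
        # Score for pairing a[i-1] with b[j-1].
        return match if a[i - 1] == b[j - 1] else mismatch

    # Fill the dynamic-programming table.
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            score[i][j] = max(score[i - 1][j - 1] + sub(i, j),
                              score[i - 1][j] + gap,
                              score[i][j - 1] + gap)

    # Trace back from the bottom-right corner to recover one optimal alignment.
    pairs: List[Pair] = []
    i, j = n, m
    while i > 0 or j > 0:
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + sub(i, j):
            pairs.append((a[i - 1], b[j - 1]))
            i, j = i - 1, j - 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            pairs.append((a[i - 1], None))  # label in a aligned to a gap
            i -= 1
        else:
            pairs.append((None, b[j - 1]))  # label in b aligned to a gap
            j -= 1
    pairs.reverse()
    return score[n][m], pairs


# Example with made-up label sequences for two promoters.
best, alignment = align_labels(["SP1", "TBP", "CREB"], ["SP1", "CREB"])
print(best, alignment)
```

Two promoters with no detectable nucleotide similarity could still share the same order of predicted factors, which is the kind of signal a label-level alignment is meant to expose.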

Relevance:

20.00% 20.00%

Publisher:

Abstract:

MicroRNAs (miRNA) are recognized posttranscriptional gene repressors involved in the control of almost every biological process. Allelic variants in these regions may be an important source of phenotypic diversity and contribute to disease susceptibility. We analyzed the genomic organization of 325 human miRNAs (release 7.1, miRBase) to construct a panel of 768 single-nucleotide polymorphisms (SNPs) covering approximately 1 Mb of genomic DNA, including 131 isolated miRNAs (40%) and 194 miRNAs arranged in 48 miRNA clusters, as well as their 5-kb flanking regions. Of these miRNAs, 37% were inside known protein-coding genes, which were significantly associated with biological functions regarding neurological, psychological or nutritional disorders. SNP coverage analysis revealed a lower SNP density in miRNAs compared with the average of the genome, with only 24 SNPs located in the 325 miRNAs studied. Further genotyping of 340 unrelated Spanish individuals showed that more than half of the SNPs in miRNAs were either rare or monomorphic, in agreement with the reported selective constraint on human miRNAs. A comparison of the minor allele frequencies between Spanish and HapMap population samples confirmed the applicability of this SNP panel to the study of complex disorders among the Spanish population, and revealed two miRNA regions, hsa-mir-26a-2 in the CTDSP2 gene and hsa-mir-128-1 in the R3HDM1 gene, showing geographical allelic frequency variation among the four HapMap populations, probably because of differences in natural selection. The designed miRNA SNP panel could help to identify still hidden links between miRNAs and human disease.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

Purpose: To evaluate the suitability of an improved version of an automatic segmentation method based on geodesic active regions (GAR) for segmenting cerebral vasculature with aneurysms from 3D X-ray reconstruction angiography (3DRA) and time of flight magnetic resonance angiography (TOF-MRA) images available in the clinical routine. Methods: Three aspects of the GAR method have been improved: execution time, robustness to variability in imaging protocols and robustness to variability in image spatial resolutions. The improved GAR was retrospectively evaluated on images from patients containing intracranial aneurysms in the area of the Circle of Willis and imaged with two modalities: 3DRA and TOF-MRA. Images were obtained from two clinical centers, each using different imaging equipment. Evaluation included qualitative and quantitative analyses of the segmentation results on 20 images from 10 patients. The gold standard was built from 660 cross-sections (33 per image) of vessels and aneurysms, manually measured by interventional neuroradiologists. GAR has also been compared to an interactive segmentation method: iso-intensity surface extraction (ISE). In addition, since patients had been imaged with the two modalities, we performed an inter-modality agreement analysis with respect to both the manual measurements and each of the two segmentation methods. Results: Both GAR and ISE differed from the gold standard within acceptable limits compared to the imaging resolution. GAR (ISE, respectively) had an average accuracy of 0.20 (0.24) mm for 3DRA and 0.27 (0.30) mm for TOF-MRA, and had a repeatability of 0.05 (0.20) mm. Compared to ISE, GAR had a lower qualitative error in the vessel region and a lower quantitative error in the aneurysm region. The repeatability of GAR was superior to manual measurements and ISE. The inter-modality agreement was similar between GAR and the manual measurements. Conclusions: The improved GAR method outperformed ISE qualitatively as well as quantitatively and is suitable for segmenting 3DRA and TOF-MRA images from clinical routine.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

Among the large number of granitic intrusions within the Dora-Maira massif, several main types can be distinguished. In this study we report field, petrographic and geochemical investigations as well as zircon typology and conventional U-Pb zircon dating of plutons representing these types. The main results are as follows: the Punta Muret augengneiss is a polymetamorphosed peraluminous granite of anatectic origin. It is 457 +/- 2 Ma old and represents one of the numerous Caledonian orthogneisses of the Alpine basement. All other dated granites are of Late Variscan age. The Cavour leucogranite is an evolved granite of probably calc-alkaline affiliation, dated at 304 +/- 2 Ma. The dioritic and granodioritic facies of the Malanaggio diorite (auct.) are typical calc-alkaline rocks, whose respective ages of 290 +/- 2 and 288 +/- 2 Ma overlap within errors. The Sangone and Freidour granite types have very similar alkali-calcic characteristics; their ages are poorly constrained between 267-279 and 268-283 Ma, respectively. The new data for the Dora-Maira granites are in keeping with models of the overall evolution of the Late- to Post-Variscan magmatism in the Alpine area in terms of age distribution and progressive geochemical evolution towards alkaline melts. In a first approximation, granitic rocks across the Variscan belt seem to be increasingly younger towards the internal (southern) parts of the orogen. A Carboniferous, distensive Basin and Range situation is thought to be responsible for the magmatic activity. This tectonic context is comparable to the back-arc opening of an active continental margin. The observed southward migration of the magmatism could be linked to the roll-back of the subducting Paleotethyan oceanic plate along the Variscan cordillera.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

PURPOSE: Obesity represents a growing public health concern worldwide. The latest data in Switzerland rely on self-reported body mass index (BMI), leading to underestimation of prevalence. We reassessed the prevalence of obesity and overweight in a sample of the Swiss population using measured BMI and waist circumference (WC) and explored the association with nutritional factors and living in different linguistic-cultural regions. METHODS: Data of 1,505 participants of a cross-sectional population-based survey in the three linguistic regions of Switzerland were analyzed. BMI and WC were measured, and a 24-h urine collection was performed to evaluate dietary sodium, potassium and protein intake. RESULTS: The prevalence of overweight, obesity and abdominal obesity was 32.2, 14.2 and 33.6%, respectively. Significant differences were observed in the regional distribution, with a lower prevalence in the Italian-speaking population. Low educational level, current smoking, scarce physical activity and being a migrant were associated with a higher prevalence of obesity. Sodium, potassium and protein intake increased significantly across BMI categories. CONCLUSIONS: Obesity and overweight affect almost half of the Swiss adolescents and adults, and the prevalence appears to increase. Using BMI and WC to define obesity led to different prevalences. Differences were furthermore observed across Swiss linguistic-cultural regions, despite a common socio-economic and governmental framework. We found a positive association between obesity and salt intake, with a potential deleterious synergistic effect on cardiovascular risk.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

Microsatellites are important highly polymorphic genetic markers dispersed in the human genome. Using a panel of 22 (CA)n repeat microsatellite markers mapped to recurrent breakpoint cluster regions specifically involved in leukemia, we investigated 114 adult leukemias (25 acute lymphocytic leukemia [ALL], 32 acute myeloid leukemia [AML], 36 chronic lymphocytic leukemia [CLL], and 21 chronic myeloid leukemia [CML] in chronic phase) for somatic mutations at these loci. In each patient, DNA from fresh leukemia samples was analyzed alongside normal constitutive DNA from buccal epithelium. We detected loss of heterozygosity (LOH) in 81 of 114 patients (ALL 16/25, AML 25/32, CLL 30/36, CML 10/21). Deletions were most often seen in ALL at 11q23 and 19p13; in AML at 8q22 and 11q23; in CLL at 13q14.3, 11q13, and 11q23; and in CML at 3q26. Only six deletions were reported in 74 karyotypes analyzed, whereas in these same cases, 91 LOH events were detected by microsatellites. Of 26 leukemias with a normal karyotype, 16 nevertheless showed at least one LOH by microsatellite analysis. Replication errors were found in 10 of 114 patients (8.8%). Thus, microsatellite instability is rare in leukemia in contrast to many solid tumors. Our findings suggest that in adult leukemia, LOH may be an important genetic event in addition to typical chromosomal translocations. LOH may point to the existence of tumor suppressor genes involved in leukemogenesis to a degree that has hitherto been underestimated.
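The counts quoted in this abstract can be cross-checked with a few lines of arithmetic. The sketch below only re-adds the per-subtype figures given in the text and recomputes the replication-error percentage; the variable and function names are illustrative choices, not anything taken from the study.

```python
# Counts copied from the abstract: (patients with LOH, patients studied).
loh_by_type = {
    "ALL": (16, 25),
    "AML": (25, 32),
    "CLL": (30, 36),
    "CML": (10, 21),
}

# Totals implied by the per-subtype figures.
total_with_loh = sum(k for k, _ in loh_by_type.values())
total_patients = sum(n for _, n in loh_by_type.values())


def percent(k: int, n: int) -> float:
    """Return k out of n as a percentage rounded to one decimal place."""
    return round(100 * k / n, 1)


# Replication errors were reported in 10 of the patients studied.
replication_error_pct = percent(10, total_patients)

for name, (k, n) in loh_by_type.items():
    print(f"{name}: {k}/{n} = {100 * k / n:.1f}%")
print(f"LOH overall: {total_with_loh}/{total_patients}")
print(f"Replication errors: {replication_error_pct}%")
```

The subtype counts sum to the 81 of 114 patients stated in the abstract, and 10 of 114 rounds to the reported 8.8%.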

Relevance:

20.00% 20.00%

Publisher:

Abstract:

OBJECTIVE: The aim of this study was to develop the Italian version of the Spanish Burnout Inventory (SBI) and to examine its psychometric properties within a sample of nursing staff. METHOD: The study was cross-sectional and not randomized. The data were gathered using an anonymous, self-report questionnaire. The sample consisted of 391 staff nurses employed in three hospitals in the Northern Region of Italy. To evaluate burnout, the SBI and the Maslach Burnout Inventory were administered. RESULTS: An Exploratory Factor Analysis showed a four-factor structure close to the expected one. All Cronbach's alpha values were satisfactory. Furthermore, correlations support the concurrent validity. CONCLUSION: Overall, the results of this study provided evidence that the SBI is an adequate instrument to study burnout in the Italian nursing sample and indicated the feeling of guilt as an important dimension to gauge the structure of this phenomenon.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

Speech