Digital Library

951 results for Epididymidis regions

Structured RNAs in the ENCODE selected regions of the human genome

Relevance:

20.00% 20.00%

Publisher:

Abstract:

Functional RNA structures play an important role both in the context of noncoding RNA transcripts as well as regulatory elements in mRNAs. Here we present a computational study to detect functional RNA structures within the ENCODE regions of the human genome. Since structural RNAs in general lack characteristic signals in primary sequence, comparative approaches evaluating evolutionary conservation of structures are most promising. We have used three recently introduced programs based on either phylogenetic–stochastic context-free grammar (EvoFold) or energy directed folding (RNAz and AlifoldZ), yielding several thousand candidate structures (corresponding to ∼2.7% of the ENCODE regions). EvoFold has its highest sensitivity in highly conserved and relatively AU-rich regions, while RNAz favors slightly GC-rich regions, resulting in a relatively small overlap between methods. Comparison with the GENCODE annotation points to functional RNAs in all genomic contexts, with a slightly increased density in 3′-UTRs. While we estimate a significant false discovery rate of ∼50%–70% many of the predictions can be further substantiated by additional criteria: 248 loci are predicted by both RNAz and EvoFold, and an additional 239 RNAz or EvoFold predictions are supported by the (more stringent) AlifoldZ algorithm. Five hundred seventy RNAz structure predictions fall into regions that show signs of selection pressure also on the sequence level (i.e., conserved elements). More than 700 predictions overlap with noncoding transcripts detected by oligonucleotide tiling arrays. One hundred seventy-five selected candidates were tested by RT-PCR in six tissues, and expression could be verified in 43 cases (24.6%).

The DART classification of unannotated transcription within the ENCODE regions: associating transcription with known and novel loci

Relevance:

20.00% 20.00%

Publisher:

Abstract:

For the ∼1% of the human genome in the ENCODE regions, only about half of the transcriptionally active regions (TARs) identified with tiling microarrays correspond to annotated exons. Here we categorize this large amount of “unannotated transcription.” We use a number of disparate features to classify the 6988 novel TARs—array expression profiles across cell lines and conditions, sequence composition, phylogenetic profiles (presence/absence of syntenic conservation across 17 species), and locations relative to genes. In the classification, we first filter out TARs with unusual sequence composition and those likely resulting from cross-hybridization. We then associate some of those remaining with proximal exons having correlated expression profiles. Finally, we cluster unclassified TARs into putative novel loci, based on similar expression and phylogenetic profiles. To encapsulate our classification, we construct a Database of Active Regions and Tools (DART.gersteinlab.org). DART has special facilities for rapidly handling and comparing many sets of TARs and their heterogeneous features, synchronizing across builds, and interfacing with other resources. Overall, we find that ∼14% of the novel TARs can be associated with known genes, while ∼21% can be clustered into ∼200 novel loci. We observe that TARs associated with genes are enriched in the potential to form structural RNAs and many novel TAR clusters are associated with nearby promoters. To benchmark our classification, we design a set of experiments for testing the connectivity of novel TARs. Overall, we find that 18 of the 46 connections tested validate by RT-PCR and four of five sequenced PCR products confirm connectivity unambiguously.

Prominent use of distal 5’ transcription start sites and discovery of a large number of additional exons in ENCODE regions

Relevance:

20.00% 20.00%

Publisher:

Abstract:

This report presents systematic empirical annotation of transcript products from 399 annotated protein-coding loci across the 1% of the human genome targeted by the Encyclopedia of DNA elements (ENCODE) pilot project using a combination of 5' rapid amplification of cDNA ends (RACE) and high-density resolution tiling arrays. We identified previously unannotated and often tissue- or cell-line-specific transcribed fragments (RACEfrags), both 5' distal to the annotated 5' terminus and internal to the annotated gene bounds for the vast majority (81.5%) of the tested genes. Half of the distal RACEfrags span large segments of genomic sequences away from the main portion of the coding transcript and often overlap with the upstream-annotated gene(s). Notably, at least 20% of the resultant novel transcripts have changes in their open reading frames (ORFs), most of them fusing ORFs of adjacent transcripts. A significant fraction of distal RACEfrags show expression levels comparable to those of known exons of the same locus, suggesting that they are not part of very minority splice forms. These results have significant implications concerning (1) our current understanding of the architecture of protein-coding genes; (2) our views on locations of regulatory regions in the genome; and (3) the interpretation of sequence polymorphisms mapping to regions hitherto considered to be "noncoding," ultimately relating to the identification of disease-related sequence alterations.

Development and validation of AMANDA, a new algorithm for selecting highly relevant regions in Molecular Interaction Fields

Relevance:

20.00% 20.00%

Publisher:

Abstract:

Descriptors based on Molecular Interaction Fields (MIF) are highly suitable for drug discovery, but their size (thousands of variables) often limits their application in practice. Here we describe a simple and fast computational method that extracts from a MIF a handful of highly informative points (hot spots) which summarize the most relevant information. The method was specifically developed for drug discovery, is fast, and does not require human supervision, being suitable for its application on very large series of compounds. The quality of the results has been tested by running the method on the ligand structure of a large number of ligand-receptor complexes and then comparing the position of the selected hot spots with actual atoms of the receptor. As an additional test, the hot spots obtained with the novel method were used to obtain GRIND-like molecular descriptors which were compared with the original GRIND. In both cases the results show that the novel method is highly suitable for describing ligand-receptor interactions and compares favorably with other state-of-the-art methods.

Multiple non-collinear TF-map alignments of promoter regions

Relevance:

20.00% 20.00%

Publisher:

Abstract:

Background: The analysis of the promoter sequence of genes with similar expression patterns is a basic tool to annotate common regulatory elements. Multiple sequence alignments are the basis of most comparative approaches. The characterization of regulatory regions from coexpressed genes at the sequence level, however, does not yield satisfactory results on many occasions, as promoter regions of genes sharing similar expression programs often do not show nucleotide sequence conservation. Results: In a recent approach to circumvent this limitation, we proposed to align the maps of predicted transcription factors (referred to as TF-maps) instead of the nucleotide sequence of two related promoters, taking into account the label of the corresponding factor and the position in the primary sequence. We have now extended the basic algorithm to permit multiple promoter comparisons using the progressive alignment paradigm. In addition, non-collinear conservation blocks might now be identified in the resulting alignments. We have optimized the parameters of the algorithm in a small, but well-characterized collection of human-mouse-chicken-zebrafish orthologous gene promoters. Conclusion: Results in this dataset indicate that TF-map alignments are able to detect high-level regulatory conservation at the promoter and the 3'UTR gene regions, which cannot be detected by the typical sequence alignments. Three particular examples are introduced here to illustrate the power of the multiple TF-map alignments to characterize conserved regulatory elements in absence of sequence similarity. We consider this kind of approach can be extremely useful in the future to annotate potential transcription factor binding sites on sets of co-regulated genes from high-throughput expression experiments.

Industry Projections 2002 - 2012 Iowa Workforce Development Regions, 2002

Relevance:

20.00% 20.00%

Publisher:

Abstract:

Statewide and Regional projected industry employment 2002 - 2012

Transcription factor map alignment of promoter regions

Relevance:

20.00% 20.00%

Publisher:

Abstract:

We address the problem of comparing and characterizing the promoter regions of genes with similar expression patterns. This remains a challenging problem in sequence analysis, because often the promoter regions of co-expressed genes do not show discernible sequence conservation. In our approach, thus, we have not directly compared the nucleotide sequence of promoters. Instead, we have obtained predictions of transcription factor binding sites, annotated the predicted sites with the labels of the corresponding binding factors, and aligned the resulting sequences of labels—to which we refer here as transcription factor maps (TF-maps). To obtain the global pairwise alignment of two TF-maps, we have adapted an algorithm initially developed to align restriction enzyme maps. We have optimized the parameters of the algorithm in a small, but well-curated, collection of human–mouse orthologous gene pairs. Results in this dataset, as well as in an independent much larger dataset from the CISRED database, indicate that TF-map alignments are able to uncover conserved regulatory elements, which cannot be detected by the typical sequence alignments.
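
The idea of aligning sequences of factor labels rather than nucleotides can be illustrated with a minimal sketch. It is not the authors' algorithm: the abstract describes an adaptation of a restriction-map alignment method that also uses site positions, whereas the code below substitutes a plain Needleman-Wunsch global alignment over labels only. The function name `align_label_maps`, the scoring values, and the example factor labels are all assumptions made for illustration.

```python
def align_label_maps(map_a, map_b, match=2, mismatch=-1, gap=-1):
    """Globally align two sequences of TF labels (Needleman-Wunsch).

    Simplification: site positions are ignored; only labels are compared.
    Returns (score, list of aligned label pairs), with None marking a gap.
    """
    n, m = len(map_a), len(map_b)
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap

    def pair_score(i, j):
        # Score for aligning the i-th label of map_a with the j-th of map_b.
        return match if map_a[i - 1] == map_b[j - 1] else mismatch

    # Fill the dynamic-programming table.
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            score[i][j] = max(
                score[i - 1][j - 1] + pair_score(i, j),
                score[i - 1][j] + gap,
                score[i][j - 1] + gap,
            )

    # Trace back from the bottom-right corner to recover one optimal alignment.
    aligned = []
    i, j = n, m
    while i > 0 or j > 0:
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + pair_score(i, j):
            aligned.append((map_a[i - 1], map_b[j - 1]))
            i, j = i - 1, j - 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            aligned.append((map_a[i - 1], None))
            i -= 1
        else:
            aligned.append((None, map_b[j - 1]))
            j -= 1
    aligned.reverse()
    return score[n][m], aligned


# Hypothetical label maps for two promoters.
total, pairs = align_label_maps(["SP1", "TBP", "CREB"], ["SP1", "CREB"])
```

With these illustrative scores, the two shared labels are paired and the unmatched one is aligned to a gap, which is the behaviour one would want from a label-level alignment when the underlying nucleotide sequences have diverged.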

Design and evaluation of a panel of single-nucleotide polymorphisms in microRNA genomic regions for association studies in human disease

Relevance:

20.00% 20.00%

Publisher:

Abstract:

MicroRNAs (miRNA) are recognized posttranscriptional gene repressors involved in the control of almost every biological process. Allelic variants in these regions may be an important source of phenotypic diversity and contribute to disease susceptibility. We analyzed the genomic organization of 325 human miRNAs (release 7.1, miRBase) to construct a panel of 768 single-nucleotide polymorphisms (SNPs) covering approximately 1 Mb of genomic DNA, including 131 isolated miRNAs (40%) and 194 miRNAs arranged in 48 miRNA clusters, as well as their 5-kb flanking regions. Of these miRNAs, 37% were inside known protein-coding genes, which were significantly associated with biological functions regarding neurological, psychological or nutritional disorders. SNP coverage analysis revealed a lower SNP density in miRNAs compared with the average of the genome, with only 24 SNPs located in the 325 miRNAs studied. Further genotyping of 340 unrelated Spanish individuals showed that more than half of the SNPs in miRNAs were either rare or monomorphic, in agreement with the reported selective constraint on human miRNAs. A comparison of the minor allele frequencies between Spanish and HapMap population samples confirmed the applicability of this SNP panel to the study of complex disorders among the Spanish population, and revealed two miRNA regions, hsa-mir-26a-2 in the CTDSP2 gene and hsa-mir-128-1 in the R3HDM1 gene, showing geographical allelic frequency variation among the four HapMap populations, probably because of differences in natural selection. The designed miRNA SNP panel could help to identify still hidden links between miRNAs and human disease.

Automated Segmentation of Cerebral Vasculature with Aneurysms in 3DRA and TOF-MRA using Geodesic Active Regions: an Evaluation Study

Relevance:

20.00% 20.00%

Publisher:

Abstract:

Purpose: To evaluate the suitability of an improved version of an automatic segmentation method based on geodesic active regions (GAR) for segmenting cerebral vasculature with aneurysms from 3D X-ray reconstruction angiography (3DRA) and time of flight magnetic resonance angiography (TOF-MRA) images available in the clinical routine. Methods: Three aspects of the GAR method have been improved: execution time, robustness to variability in imaging protocols and robustness to variability in image spatial resolutions. The improved GAR was retrospectively evaluated on images from patients containing intracranial aneurysms in the area of the Circle of Willis and imaged with two modalities: 3DRA and TOF-MRA. Images were obtained from two clinical centers, each using different imaging equipment. Evaluation included qualitative and quantitative analyses of the segmentation results on 20 images from 10 patients. The gold standard was built from 660 cross-sections (33 per image) of vessels and aneurysms, manually measured by interventional neuroradiologists. GAR has also been compared to an interactive segmentation method: iso-intensity surface extraction (ISE). In addition, since patients had been imaged with the two modalities, we performed an inter-modality agreement analysis with respect to both the manual measurements and each of the two segmentation methods. Results: Both GAR and ISE differed from the gold standard within acceptable limits compared to the imaging resolution. GAR (ISE, respectively) had an average accuracy of 0.20 (0.24) mm for 3DRA and 0.27 (0.30) mm for TOF-MRA, and had a repeatability of 0.05 (0.20) mm. Compared to ISE, GAR had a lower qualitative error in the vessel region and a lower quantitative error in the aneurysm region. The repeatability of GAR was superior to manual measurements and ISE. The inter-modality agreement was similar between GAR and the manual measurements.
Conclusions: The improved GAR method outperformed ISE qualitatively as well as quantitatively and is suitable for segmenting 3DRA and TOF-MRA images from clinical routine.

Frequent clonal loss of heterozygosity but scarcity of microsatellite instability at chromosomal breakpoint cluster regions in adult leukemias.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

Microsatellites are important highly polymorphic genetic markers dispersed in the human genome. Using a panel of 22 (CA)n repeat microsatellite markers mapped to recurrent breakpoint cluster regions specifically involved in leukemia, we investigated 114 adult leukemias (25 acute lymphocytic leukemia [ALL], 32 acute myeloid leukemia [AML], 36 chronic lymphocytic leukemia [CLL], and 21 chronic myeloid leukemia [CML] in chronic phase) for somatic mutations at these loci. In each patient, DNA from fresh leukemia samples was analyzed alongside normal constitutive DNA from buccal epithelium. We detected loss of heterozygosity (LOH) in 81 of 114 patients (ALL 16/25, AML 25/32, CLL 30/36, CML 10/21). Deletions were most often seen in ALL at 11q23 and 19p13; in AML at 8q22 and 11q23; in CLL at 13q14.3, 11q13, and 11q23; and in CML at 3q26. Only six deletions were reported in 74 karyotypes analyzed, whereas in these same cases, 91 LOH events were detected by microsatellites. Of 26 leukemias with a normal karyotype, 16 nevertheless showed at least one LOH by microsatellite analysis. Replication errors were found in 10 of 114 patients (8.8%). Thus, microsatellite instability is rare in leukemia in contrast to many solid tumors. Our findings suggest that in adult leukemia, LOH may be an important genetic event in addition to typical chromosomal translocations. LOH may point to the existence of tumor suppressor genes involved in leukemogenesis to a degree that has hitherto been underestimated.

Regions of rationality: Maps for bounded agents

Relevance:

20.00% 20.00%

Publisher:

Abstract:

An important problem in descriptive and prescriptive research in decision making is to identify regions of rationality, i.e., the areas for which heuristics are and are not effective. To map the contours of such regions, we derive probabilities that heuristics identify the best of m alternatives (m > 2) characterized by k attributes or cues (k > 1). The heuristics include a single variable (lexicographic), variations of elimination-by-aspects, equal weighting, hybrids of the preceding, and models exploiting dominance. We use twenty simulated and four empirical datasets for illustration. We further provide an overview by regressing heuristic performance on factors characterizing environments. Overall, sensible heuristics generally yield similar choices in many environments. However, selection of the appropriate heuristic can be important in some regions (e.g., if there is low inter-correlation among attributes/cues). Since our work assumes a hit or miss decision criterion, we conclude by outlining extensions for exploring the effects of different loss functions.
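
Two of the heuristics named in this abstract, the single-variable (lexicographic) rule and equal weighting, can be sketched in a few lines to show how they may disagree on the same set of alternatives. This is an illustrative sketch under assumptions: the function names, the tie-breaking rule (falling through to the next cue), and the toy data are not taken from the paper.

```python
def lexicographic_choice(alternatives, cue_order):
    """Pick the alternative that is best on the most important cue.

    Ties are broken by moving to the next cue in cue_order (an assumed rule).
    Returns the index of the chosen alternative.
    """
    candidates = list(range(len(alternatives)))
    for cue in cue_order:
        best = max(alternatives[i][cue] for i in candidates)
        candidates = [i for i in candidates if alternatives[i][cue] == best]
        if len(candidates) == 1:
            break
    return candidates[0]


def equal_weight_choice(alternatives):
    """Pick the alternative with the largest unweighted sum of cue values."""
    totals = [sum(cues) for cues in alternatives]
    return totals.index(max(totals))


# Toy data: m = 3 alternatives described by k = 4 binary cues,
# with cues listed from most to least important.
alternatives = [
    (1, 0, 0, 0),
    (0, 1, 1, 1),
    (1, 0, 1, 0),
]
lex_pick = lexicographic_choice(alternatives, cue_order=[0, 1, 2, 3])
ew_pick = equal_weight_choice(alternatives)
```

Here the lexicographic rule selects the third alternative (index 2), because it wins on the first cue and then on the tie-breaking third cue, while equal weighting selects the second (index 1), which has the largest total. Under a hit-or-miss criterion, at most one of them can be right in such a case.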

The poor stay poor: Non-convergence across countries and regions

Relevance:

20.00% 20.00%

Publisher:

Abstract:

We study the issue of income convergence across countries and regions with a Bayesian estimator which allows us to use information in an efficient and flexible way. We argue that the very slow convergence rates to a common level of per-capita income found, e.g., by Barro and Xavier Sala-i-Martin, is due to a 'fixed effect bias' that their cross-sectional analysis introduces in the results. Our approach permits the estimation of different convergence rates to different steady states for each cross sectional unit. When this diversity is allowed, we find that convergence of each unit to (its own) steady state income level is much faster than previously estimated but that cross sectional differences persist: inequalities will only be reduced by a small amount by the passage of time. The cross country distribution of the steady state is largely explained by the cross country distribution of initial conditions.

The fauna of phlebotomines (Diptera, Psychodidae) in different phytogeographic regions of the state of Maranhão, Brazil

Relevance:

20.00% 20.00%

Publisher:

Abstract:

Phlebotomine specimens were captured in domiciliary and forest environments in 47 municipalities between 1982 and 2005 with the aid of CDC light traps. A total of 91 species were found, of which four belonged to genus Brumptomyia and 87 to genus Lutzomyia, distributed among the following subgenera: Evandromyia (6), Lutzomyia (5), Micropygomyia (2), Nyssomyia (9), Pintomyia (2), Pressatia (3), Psathyromyia (6), Psychodopygus (14), Sciopemyia (4), Trichophoromyia (2), Viannamyia (2); species groups: Aragaoi (2), Baityi (1), Dreisbachi (1), Migonei (12), Oswaldoi (8), Pilosa (1), Saulensis (2), Verrucarum (4) and ungrouped (1). Species diversity was greatest in areas where there was dense evergreen seasonal forest (52 species), ombrophilous forest (31) and meridional cerrados (23) and lowest in areas with mixed forest (forest with babassu palms, cerrado and caatinga). The greatest similarity index was observed for restinga and open evergreen seasonal forest (J=0.48). Dense evergreen seasonal forest had greatest similarity with ombrophilous forest (J=0.38). The phlebotomine fauna was species rich and unevenly distributed in Maranhão, reflecting the phytogeographical complexity of the state, which is a result of the great variety of ecosystems and climate zones.
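
The similarity values reported above (J=0.48, J=0.38) are presumably Jaccard indices, although the abstract does not say so explicitly. Under that assumption, the index for two vegetation types is the number of shared species divided by the number of species found in either. The sketch below uses made-up species labels; the function name `jaccard_index` is likewise illustrative.

```python
def jaccard_index(species_a, species_b):
    """Jaccard similarity: |A intersect B| / |A union B| for two species lists."""
    a, b = set(species_a), set(species_b)
    union = a | b
    if not union:
        # Two empty lists share nothing; define the index as 0 here.
        return 0.0
    return len(a & b) / len(union)


# Hypothetical species lists for two habitats (placeholders, not survey data).
habitat_1 = {"species_1", "species_2", "species_3"}
habitat_2 = {"species_2", "species_3", "species_4"}
similarity = jaccard_index(habitat_1, habitat_2)  # 2 shared out of 4 in total
```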

Do interregional transfers improve the economic performance of poor regions? The case of Spain

Relevance:

20.00% 20.00%

Publisher:

Abstract:

The 17 regional governments of Spain receive grants from both the central government and the European Union. The grants are generally redistributive and are intended to stimulate economic activity in the poorer regions. We evaluate the effectiveness of the grants by comparing the economic performance of the regions before and after the implementation of the grant programs using a differences-in-differences approach. We find that these policies have not been effective at stimulating private investment or improving the overall economies of the poorer regions.
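
The differences-in-differences comparison mentioned above can be sketched in its simplest two-group, two-period form: the change in the outcome for grant-receiving regions minus the change for comparison regions. The paper's actual specification is not given in the abstract, so the function below, its name, and the numbers are purely illustrative assumptions.

```python
def mean(values):
    """Arithmetic mean of a non-empty list of numbers."""
    return sum(values) / len(values)


def did_estimate(treated_pre, treated_post, control_pre, control_post):
    """Two-by-two differences-in-differences estimate.

    (post - pre change in the treated group) minus
    (post - pre change in the control group).
    """
    treated_change = mean(treated_post) - mean(treated_pre)
    control_change = mean(control_post) - mean(control_pre)
    return treated_change - control_change


# Made-up outcome values (e.g. an investment index), not data from the study.
effect = did_estimate(
    treated_pre=[10, 12],
    treated_post=[13, 15],
    control_pre=[20, 22],
    control_post=[24, 26],
)
```

In this toy example the treated group improves by 3 while the control group improves by 4, so the estimate is negative: the treated regions did not outperform the comparison regions, which is the kind of pattern a finding of "not effective" would correspond to.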

How do very open economies adjust to large immigration flows? Recent evidence from Spanish regions

Relevance:

20.00% 20.00%

Publisher:

Abstract:

In recent years, Spain has received unprecedented immigration flows. Between 2001 and 2006 the fraction of the population born abroad more than doubled, increasing from 4.8% to 10.8%. For Spanish provinces with above-median inflows (relative to population), immigration increased by 24% the number of high school dropouts while only increasing college graduates by 11%. We study different channels by which regional labor markets have absorbed the large increase in relative supply of low educated workers. We identify the exogenous supply shock using historical immigrant settlement patterns by country of origin. Using data from the Labor Force Survey and the decennial Census, we find a large expansion of employment in high immigration regions. Disaggregating by industry, the absorption operated through large increases in the share of low-educated workers, compared to the same industry in low-immigration regions. We do not find changes in sectoral specialization. Overall, and perhaps surprisingly, the pattern of absorption is very similar to the one found in the US.
