15 resultados para datasets

em Archivo Digital para la Docencia y la Investigación - Repositorio Institucional de la Universidad del País Vasco


Relevância:

20.00% 20.00%

Publicador:

Resumo:

More and more users aim at taking advantage of the existing Linked Open Data environment to formulate a query over a dataset and to then try to process the same query over different datasets, one after another, in order to obtain a broader set of answers. However, the heterogeneity of vocabularies used in the datasets on the one side, and the fact that the number of alignments among those datasets is scarce on the other, makes that querying task difficult for them. Considering this scenario we present in this paper a proposal that allows on demand translations of queries formulated over an original dataset, into queries expressed using the vocabulary of a targeted dataset. Our approach relieves users from knowing the vocabulary used in the targeted datasets and even more it considers situations where alignments do not exist or they are not suitable for the formulated query. Therefore, in order to favour the possibility of getting answers, sometimes there is no guarantee of obtaining a semantically equivalent translation. The core component of our proposal is a query rewriting model that considers a set of transformation rules devised from a pragmatic point of view. The feasibility of our scheme has been validated with queries defined in well known benchmarks and SPARQL endpoint logs, as the obtained results confirm.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

This paper explores how audio chord estimation could improve if information about chord boundaries or beat onsets is revealed by an oracle. Chord estimation at the frame level is compared with three simulations, each using an oracle of increasing powers. The beat and chord segments revealed by an oracle are used to compute a chord ranking at the segment level, and to compute the cumulative probability of finding the correct chord among the top ranked chords. Oracle results on two different audio datasets demonstrate the substantial potential of segment versus frame approaches for chord audio estimation. This paper also provides a comparison of the oracle results on the Beatles dataset, the standard dataset in this area, with the new Billboard Hot 100 chord dataset.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

[ES]El objetivo principal de esta tesis de máster es el estudio del comportamiento térmico del instrumento TriboLAB durante su estancia en la Estación Espacial Internacional, junto con la comparación de dicho comportamiento con el pronosticado por los modelos térmicos matemáticos empleados en el diseño de su sistema de control térmico. El trabajo realizado ha permitido profundizar de forma importante en el conocimiento del mencionado comportamiento. Ello permitirá poner a disposición de otros experimentadores interesados en ubicar sus instrumentos en los balcones exteriores de la Estación Espacial Internacional, información real acerca del comportamiento térmico de un equipo de las características del TriboLAB en dichas condiciones. Información de gran interés para ser empleada en el diseño del control térmico de sus instrumentos, especialmente ahora que la vida útil de la Estación Espacial Internacional ha sido prorrogada hasta 2020. El control térmico de los equipos espaciales es un aspecto clave para asegurar su supervivencia y correcto funcionamiento bajo las extremas condiciones existentes en el espacio. Su misión es mantener los distintos componentes dentro de su rango de temperaturas admisibles, puesto que en caso contrario no podrían funcionar o incluso ni siquiera sobrevivir más allá de esas temperaturas. Adicionalmente ha sido posible comprobar la aplicabilidad de distintas técnicas de análisis de datos funcionales en lo que respecta al estudio del tipo de datos aquí contemplado. Así mismo, se han comparado los resultados de la campaña de ensayos térmicos con los modelos térmicos matemáticos que han guiado el diseño del control térmico, y que son una pieza fundamental en el diseño del control térmico de cualquier instrumento espacial. Ello ha permitido verificar tanto la validez del sistema de control térmico diseñado para el TriboLAB como con la adecuada similitud existente entre los resultados de los modelos térmicos matemáticos y las temperaturas registradas en el equipo. Todo ello, ha sido realizado desde la perspectiva del análisis de datos funcionales.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

[ES] Como parte de este proyecto de investigación se realizó el siguiente proyecto fin de carrera:

Relevância:

10.00% 10.00%

Publicador:

Resumo:

[EN] This paper is an outcome of the ERASMUS IP program called TOPCART, there are more information about this project that can be accessed from the following item:

Relevância:

10.00% 10.00%

Publicador:

Resumo:

[EN]Measuring semantic similarity and relatedness between textual items (words, sentences, paragraphs or even documents) is a very important research area in Natural Language Processing (NLP). In fact, it has many practical applications in other NLP tasks. For instance, Word Sense Disambiguation, Textual Entailment, Paraphrase detection, Machine Translation, Summarization and other related tasks such as Information Retrieval or Question Answering. In this masther thesis we study di erent approaches to compute the semantic similarity between textual items. In the framework of the european PATHS project1, we also evaluate a knowledge-base method on a dataset of cultural item descriptions. Additionaly, we describe the work carried out for the Semantic Textual Similarity (STS) shared task of SemEval-2012. This work has involved supporting the creation of datasets for similarity tasks, as well as the organization of the task itself.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Background: Colorectal cancer (CRC) is a disease of complex aetiology, with much of the expected inherited risk being due to several common low risk variants. Genome-Wide Association Studies (GWAS) have identified 20 CRC risk variants. Nevertheless, these have only been able to explain part of the missing heritability. Moreover, these signals have only been inspected in populations of Northern European origin. Results: Thus, we followed the same approach in a Spanish cohort of 881 cases and 667 controls. Sixty-four variants at 24 loci were found to be associated with CRC at p-values <10-5. We therefore evaluated the 24 loci in another Spanish replication cohort (1481 cases and 1850 controls). Two of these SNPs, rs12080929 at 1p33 (P-replication=0.042; P-pooled=5.523x10(-03); OR (CI95%)=0.866(0.782-0.959)) and rs11987193 at 8p12 (P-replication=0.039; P-pooled=6.985x10(-5); OR (CI95%)=0.786(0.705-0.878)) were replicated in the second Phase, although they did not reach genome-wide statistical significance. Conclusions: We have performed the first CRC GWAS in a Southern European population and by these means we were able to identify two new susceptibility variants at 1p33 and 8p12 loci. These two SNPs are located near the SLC5A9 and DUSP4 loci, respectively, which could be good functional candidates for the association signals. We therefore believe that these two markers constitute good candidates for CRC susceptibility loci and should be further evaluated in other larger datasets. Moreover, we highlight that were these two SNPs true susceptibility variants, they would constitute a decrease in the CRC missing heritability fraction.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

In spite of over a century of research on cortical circuits, it is still unknown how many classes of cortical neurons exist. Neuronal classification has been a difficult problem because it is unclear what a neuronal cell class actually is and what are the best characteristics are to define them. Recently, unsupervised classifications using cluster analysis based on morphological, physiological or molecular characteristics, when applied to selected datasets, have provided quantitative and unbiased identification of distinct neuronal subtypes. However, better and more robust classification methods are needed for increasingly complex and larger datasets. We explored the use of affinity propagation, a recently developed unsupervised classification algorithm imported from machine learning, which gives a representative example or exemplar for each cluster. As a case study, we applied affinity propagation to a test dataset of 337 interneurons belonging to four subtypes, previously identified based on morphological and physiological characteristics. We found that affinity propagation correctly classified most of the neurons in a blind, non-supervised manner. In fact, using a combined anatomical/physiological dataset, our algorithm differentiated parvalbumin from somatostatin interneurons in 49 out of 50 cases. Affinity propagation could therefore be used in future studies to validly classify neurons, as a first step to help reverse engineer neural circuits.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Singular Value Decomposition (SVD) is a key linear algebraic operation in many scientific and engineering applications. In particular, many computational intelligence systems rely on machine learning methods involving high dimensionality datasets that have to be fast processed for real-time adaptability. In this paper we describe a practical FPGA (Field Programmable Gate Array) implementation of a SVD processor for accelerating the solution of large LSE problems. The design approach has been comprehensive, from the algorithmic refinement to the numerical analysis to the customization for an efficient hardware realization. The processing scheme rests on an adaptive vector rotation evaluator for error regularization that enhances convergence speed with no penalty on the solution accuracy. The proposed architecture, which follows a data transfer scheme, is scalable and based on the interconnection of simple rotations units, which allows for a trade-off between occupied area and processing acceleration in the final implementation. This permits the SVD processor to be implemented both on low-cost and highend FPGAs, according to the final application requirements.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Viral infections remain a serious global health issue. Metagenomic approaches are increasingly used in the detection of novel viral pathogens but also to generate complete genomes of uncultivated viruses. In silico identification of complete viral genomes from sequence data would allow rapid phylogenetic characterization of these new viruses. Often, however, complete viral genomes are not recovered, but rather several distinct contigs derived from a single entity are, some of which have no sequence homology to any known proteins. De novo assembly of single viruses from a metagenome is challenging, not only because of the lack of a reference genome, but also because of intrapopulation variation and uneven or insufficient coverage. Here we explored different assembly algorithms, remote homology searches, genome-specific sequence motifs, k-mer frequency ranking, and coverage profile binning to detect and obtain viral target genomes from metagenomes. All methods were tested on 454-generated sequencing datasets containing three recently described RNA viruses with a relatively large genome which were divergent to previously known viruses from the viral families Rhabdoviridae and Coronaviridae. Depending on specific characteristics of the target virus and the metagenomic community, different assembly and in silico gap closure strategies were successful in obtaining near complete viral genomes.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Lymphangioleiomyomatosis (LAM) is a rare lung-metastasizing neoplasm caused by the proliferation of smooth muscle-like cells that commonly carry loss-of-function mutations in either the tuberous sclerosis complex 1 or 2 (TSC1 or TSC2) genes. While allosteric inhibition of the mechanistic target of rapamycin (mTOR) has shown substantial clinical benefit, complementary therapies are required to improve response and/or to treat specific patients. However, there is a lack of LAM biomarkers that could potentially be used to monitor the disease and to develop other targeted therapies. We hypothesized that the mediators of cancer metastasis to lung, particularly in breast cancer, also play a relevant role in LAM. Analyses across independent breast cancer datasets revealed associations between low TSC1/2 expression, altered mTOR complex 1 (mTORC1) pathway signaling, and metastasis to lung. Subsequently, immunohistochemical analyses of 23 LAM lesions revealed positivity in all cases for the lung metastasis mediators fascin 1 (FSCN1) and inhibitor of DNA binding 1 (ID1). Moreover, assessment of breast cancer stem or luminal progenitor cell biomarkers showed positivity in most LAM tissue for the aldehyde dehydrogenase 1 (ALDH1), integrin-beta 3 (ITGB3/CD61), and/or the sex-determining region Y-box 9 (SOX9) proteins. The immunohistochemical analyses also provided evidence of heterogeneity between and within LAM cases. The analysis of Tsc2-deficient cells revealed relative over-expression of FSCN1 and ID1; however, Tsc2-deficient cells did not show higher sensitivity to ID1-based cancer inhibitors. Collectively, the results of this study reveal novel LAM biomarkers linked to breast cancer metastasis to lung and to cell stemness, which in turn might guide the assessment of additional or complementary therapeutic opportunities for LAM.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Elucidating the intricate relationship between brain structure and function, both in healthy and pathological conditions, is a key challenge for modern neuroscience. Recent progress in neuroimaging has helped advance our understanding of this important issue, with diffusion images providing information about structural connectivity (SC) and functional magnetic resonance imaging shedding light on resting state functional connectivity (rsFC). Here, we adopt a systems approach, relying on modular hierarchical clustering, to study together SC and rsFC datasets gathered independently from healthy human subjects. Our novel approach allows us to find a common skeleton shared by structure and function from which a new, optimal, brain partition can be extracted. We describe the emerging common structure-function modules (SFMs) in detail and compare them with commonly employed anatomical or functional parcellations. Our results underline the strong correspondence between brain structure and resting-state dynamics as well as the emerging coherent organization of the human brain.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

We review the appropriateness of using SNIa observations to detect potential signatures of anisotropic expansion in the Universe. We focus on Union2 and SNLS3 SNIa datasets and use the hemispherical comparison method to detect possible anisotropic features. Unlike some previous works where nondiagonal elements of the covariance matrix were neglected, we use the full covariance matrix of the SNIa data, thus obtaining more realistic and not underestimated errors. As a matter of fact, the significance of previously claimed detections of a preferred direction in the Union2 dataset completely disappears once we include the effects of using the full covariance matrix. Moreover, we also find that such apreferred direction is aligned with the orthogonal direction of the SDSS observational plane and this suggests a clear indication that the SDSS subsample of the Union2 dataset introduces a significant bias, making the detected preferred direction unphysical. We thus find that current SNIa surveys are inappropriate to test anisotropic features due to their highly non-homogeneous angular distribution in the sky. In addition, after removal of the highest in homogeneous sub-samples, the number of SNIa is too low. Finally, we take advantage of the particular distribution of SNLS SNIa sub- sample in the SNLS3 data set, in which the observations were taken along four different directions. We fit each direction independently and find consistent results at the 1 sigma level. Although the likelihoods peak at relatively different values of Omega(m), the low number of data along each direction gives rise to large errors so that the likelihoods are sufficiently broad as to overlap within 1 sigma. (C) 2014 The Authors. Published by Elsevier B.V. This is an open access article under the CC BY license (http:// creativecommons. org/licenses/by/4.0/).

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Background Ubiquitination is known to regulate physiological neuronal functions as well as to be involved in a number of neuronal diseases. Several ubiquitin proteomic approaches have been developed during the last decade but, as they have been mostly applied to non-neuronal cell culture, very little is yet known about neuronal ubiquitination pathways in vivo. Methodology/Principal Findings Using an in vivo biotinylation strategy we have isolated and identified the ubiquitinated proteome in neurons both for the developing embryonic brain and for the adult eye of Drosophila melanogaster. Bioinformatic comparison of both datasets indicates a significant difference on the ubiquitin substrates, which logically correlates with the processes that are most active at each of the developmental stages. Detection within the isolated material of two ubiquitin E3 ligases, Parkin and Ube3a, indicates their ubiquitinating activity on the studied tissues. Further identification of the proteins that do accumulate upon interference with the proteasomal degradative pathway provides an indication of the proteins that are targeted for clearance in neurons. Last, we report the proof-of-principle validation of two lysine residues required for nSyb ubiquitination. Conclusions/Significance These data cast light on the differential and common ubiquitination pathways between the embryonic and adult neurons, and hence will contribute to the understanding of the mechanisms by which neuronal function is regulated. The in vivo biotinylation methodology described here complements other approaches for ubiquitome study and offers unique advantages, and is poised to provide further insight into disease mechanisms related to the ubiquitin proteasome system.