10 results for Query paging

in Biblioteca Digital da Produção Intelectual da Universidade de São Paulo


Relevance: 10.00%

Abstract:

Background: Decreasing costs of DNA sequencing have made prokaryotic draft genome sequences increasingly common. A contig scaffold is an ordering of contigs in the correct orientation. A scaffold can help genome comparisons and guide gap closure efforts. One popular technique for obtaining contig scaffolds is to map contigs onto a reference genome. However, rearrangements that may exist between the query and reference genomes may result in incorrect scaffolds, if these rearrangements are not taken into account. Large-scale inversions are common rearrangement events in prokaryotic genomes. Even in draft genomes it is possible to detect the presence of inversions given sufficient sequencing coverage and a sufficiently close reference genome. Results: We present a linear-time algorithm that can generate a set of contig scaffolds for a draft genome sequence represented in contigs given a reference genome. The algorithm is aimed at prokaryotic genomes and relies on the presence of matching sequence patterns between the query and reference genomes that can be interpreted as the result of large-scale inversions; we call these patterns inversion signatures. Our algorithm is capable of correctly generating a scaffold if at least one member of every inversion signature pair is present in contigs and no inversion signatures have been overwritten in evolution. The algorithm is also capable of generating scaffolds in the presence of any kind of inversion, even though in this general case there is no guarantee that all scaffolds in the scaffold set will be correct. We compare the performance of SIS, the program that implements the algorithm, to seven other scaffold-generating programs. The results of our tests show that SIS has overall better performance. Conclusions: SIS is a new easy-to-use tool to generate contig scaffolds, available both as stand-alone and as a web server. 
The good performance of SIS in our tests adds evidence that large-scale inversions are widespread in prokaryotic genomes.

Relevance: 10.00%

Abstract:

We review recent visualization techniques aimed at supporting tasks that require the analysis of text documents, from approaches targeted at visually summarizing the relevant content of a single document to those aimed at assisting exploratory investigation of whole collections of documents. Techniques are organized considering their target input material (either single texts or collections of texts) and their focus, which may be on displaying content, emphasizing relevant relationships, highlighting the temporal evolution of a document or collection, or helping users to handle results from a query posed to a search engine. We describe the approaches adopted by distinct techniques and briefly review the strategies they employ to obtain meaningful text models, discuss how they extract the information required to produce representative visualizations, the tasks they intend to support and the interaction issues involved, and strengths and limitations. Finally, we show a summary of techniques, highlighting their goals and distinguishing characteristics. We also briefly discuss some open problems and research directions in the fields of visual text mining and text analytics.

Relevance: 10.00%

Abstract:

Spatial data warehouses (SDWs) allow for spatial analysis together with analytical multidimensional queries over huge volumes of data. The challenge is to retrieve data related to ad hoc spatial query windows according to spatial predicates, avoiding the high cost of joining large tables. Therefore, mechanisms to provide efficient query processing over SDWs are essential. In this paper, we propose two efficient indices for SDWs: the SB-index and the HSB-index. The proposed indices share the following characteristics. They enable multidimensional queries with spatial predicates for SDWs and also support predefined spatial hierarchies. Furthermore, they compute the spatial predicate and transform it into a conventional one, which can be evaluated together with other conventional predicates by accessing a star-join Bitmap index. While the SB-index has a sequential data structure, the HSB-index uses a hierarchical data structure to enable clustering of spatial objects and a specialized buffer pool to decrease the number of disk accesses. The advantages of the SB-index and the HSB-index over the DBMS resources for SDW indexing (i.e. star-join computation and materialized views) were investigated through performance tests, which issued roll-up operations extended with containment and intersection range queries. The performance results showed that improvements ranged from 68% up to 99% over both the star-join computation and the materialized view. Furthermore, the proposed indices proved to be very compact, adding less than 1% to the storage requirements. Therefore, both the SB-index and the HSB-index are excellent choices for SDW indexing. Choosing between the SB-index and the HSB-index mainly depends on the query selectivity of spatial predicates. While low query selectivity benefits the HSB-index, the SB-index provides better performance for higher query selectivity.
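The idea of turning a spatial predicate into a conventional one can be illustrated with a minimal Python sketch. It is not the authors' implementation: the names `Entry`, `spatial_to_conventional`, `build_bitmaps`, and `evaluate` are hypothetical, the sequential structure is assumed to hold (key, bounding box) pairs, and Python integers stand in for the bitmaps of a star-join Bitmap index.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Entry:
    key: int      # assumed: surrogate key of a spatial dimension row
    mbr: tuple    # assumed: bounding box as (xmin, ymin, xmax, ymax)

def intersects(a, b):
    """True when two axis-aligned boxes overlap or touch."""
    return a[0] <= b[2] and b[0] <= a[2] and a[1] <= b[3] and b[1] <= a[3]

def spatial_to_conventional(entries, window):
    """Scan the entries sequentially and return the keys that satisfy the
    spatial predicate; the result acts as a conventional 'key IN (...)' list."""
    return {e.key for e in entries if intersects(e.mbr, window)}

def build_bitmaps(fact_keys):
    """Toy bitmap index: one integer per key, bit i set when fact row i uses it."""
    bitmaps = {}
    for row, key in enumerate(fact_keys):
        bitmaps[key] = bitmaps.get(key, 0) | (1 << row)
    return bitmaps

def evaluate(bitmaps, keys, other_bitmap):
    """OR the bitmaps of the qualifying keys, then AND with the bitmap of
    another conventional predicate."""
    mask = 0
    for k in keys:
        mask |= bitmaps.get(k, 0)
    return mask & other_bitmap

def rows(mask):
    """List the fact-row numbers whose bit is set in the mask."""
    return [i for i in range(mask.bit_length()) if (mask >> i) & 1]

entries = [Entry(1, (0, 0, 2, 2)), Entry(2, (5, 5, 6, 6)), Entry(3, (1, 1, 3, 3))]
keys = spatial_to_conventional(entries, window=(1.5, 1.5, 4, 4))
bitmaps = build_bitmaps([1, 2, 3, 1, 2])          # fact rows 0..4
matching = rows(evaluate(bitmaps, keys, other_bitmap=0b01110))
```

In this toy setup the spatial step never touches the fact table: it only produces a key set, and the join is resolved by bitwise operations, which is the cost the abstract says the indices aim to avoid.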

Relevance: 10.00%

Abstract:

Background: The hypothalamus plays a pivotal role in numerous mechanisms highly relevant to the maintenance of body homeostasis, such as the control of food intake and energy expenditure. Impairment of these mechanisms has been associated with the metabolic disturbances involved in the pathogenesis of obesity. Since rodent species constitute important models for metabolism studies and the rat hypothalamus is poorly characterized by proteomic strategies, we performed experiments aimed at constructing a two-dimensional gel electrophoresis (2-DE) profile of rat hypothalamus proteins. Results: As a first step, we established the best conditions for tissue collection and protein extraction, quantification and separation. The extraction buffer composition selected for proteome characterization of rat hypothalamus was urea 7 M, thiourea 2 M, CHAPS 4%, Triton X-100 0.5%, followed by a precipitation step with chloroform/methanol. Two-dimensional (2-D) gels of hypothalamic extracts from four-month-old rats were analyzed; the protein spots were digested and identified by using tandem mass spectrometry and database query using the protein search engine MASCOT. Eighty-six hypothalamic proteins were identified, the majority of which were classified as participating in metabolic processes, consistent with the finding of a large number of proteins with catalytic activity. Genes encoding proteins identified in this study have been related to obesity development. Conclusion: The present results indicate that the 2-DE technique will be useful for nutritional studies focusing on hypothalamic proteins. The data presented herein will serve as a reference database for studies testing the effects of dietary manipulations on hypothalamic proteome. We trust that these experiments will lead to important knowledge on protein targets of nutritional variables potentially able to affect the complex central nervous system control of energy homeostasis.

Relevance: 10.00%

Abstract:

Current commercial and academic OLAP tools do not process XML data that contains XLink. To overcome this issue, this paper proposes an analytical system comprising LMDQL, an analytical query language. Also, the XLDM metamodel is given to model cubes of XML documents with XLink and to deal with syntactic, semantic and structural heterogeneities commonly found in XML documents. As current W3C query languages for navigating in XML documents do not support XLink, XLPath is discussed in this article to provide features for LMDQL query processing. A prototype system enabling the analytical processing of XML documents that use XLink is also detailed. This prototype includes a driver, named sql2xquery, which maps SQL queries into XQuery. To validate the proposed system, a case study and its performance evaluation are presented to analyze the impact of analytical processing over XML/XLink documents.

Relevance: 10.00%

Abstract:

Background: MicroRNAs (miRNAs) are small regulatory RNAs, some of which are conserved in diverse plant genomes. Therefore, computational identification and further experimental validation of miRNAs from non-model organisms is both feasible and instrumental for addressing miRNA-based gene regulation and evolution. Sugarcane (Saccharum spp.) is an important biofuel crop with publicly available expressed sequence tag and genomic survey sequence databases, but little is known about miRNAs and their targets in this highly polyploid species. Results: In this study, we have computationally identified 19 distinct sugarcane miRNA precursors, of which several are highly similar to their sorghum homologs at both nucleotide and secondary structure levels. The accumulation pattern of mature miRNAs varies in organs/tissues from the commercial sugarcane hybrid as well as in its corresponding founder species S. officinarum and S. spontaneum. Using sugarcane MIR827 as a query, we found a novel MIR827 precursor in the sorghum genome. Based on our computational tool, a total of 46 potential targets were identified for the 19 sugarcane miRNAs. Several targets for highly conserved miRNAs are transcription factors that play important roles in plant development. Conversely, target genes of lineage-specific miRNAs, such as SsCBP1, seem to play roles in diverse physiological processes. SsCBP1 was experimentally confirmed to be a target for the monocot-specific miR528. Our findings support the notion that the regulation of SsCBP1 by miR528 is shared at least within graminaceous monocots, and this miRNA-based post-transcriptional regulation evolved exclusively within the monocot lineage after the divergence from eudicots. Conclusions: Using publicly available nucleotide databases, 19 sugarcane miRNA precursors and one new sorghum miRNA precursor were identified and classified into 14 families. Comparative analyses between sugarcane and sorghum suggest that these two species retain homologous miRNAs and targets in their genomes. Such conservation may help to clarify specific aspects of miRNA regulation and evolution in the polyploid sugarcane. Finally, our dataset provides a framework for future studies on sugarcane RNAi-dependent regulatory mechanisms.

Relevance: 10.00%

Abstract:

Objective: To analyze the possible association between dental caries, fluorosis and the need for treatment in 12-year-old schoolchildren and the socioeconomic conditions of parents/guardians in the city of Franca, in the state of São Paulo. Methods: A random sample of schoolchildren aged 12 was obtained from the school records in Franca, using a systematic random technique. The epidemiological survey was carried out by a single calibrated examiner, on 258 public and private schoolchildren in order to obtain the prevalence of dental caries, the need for treatment and the severity of dental fluorosis. Parents/guardians were also interviewed to assess their socioeconomic conditions (education and per capita income). We used multiple correlation analysis to investigate associations between categorical variables. Results: It was possible to identify two distinct groups, with associations between the variables: the first group, represented by schoolchildren with average prevalence of caries, need for treatment, low level of parental education and income; and a second group represented by schoolchildren with low prevalence of caries, no need for treatment, high parental education and income. The two dimensions explained approximately 35% of total inertia. The factors within each group are related. Conclusion: High income and parental education are associated with the low prevalence of dental caries but there is no association with dental fluorosis.

Relevance: 10.00%

Abstract:

In this paper, we present a novel approach to perform similarity queries over medical images, maintaining the semantics of a given query posted by the user. Content-based image retrieval systems relying on relevance feedback techniques usually request the users to label relevant/irrelevant images. Thus, we present a highly effective strategy to survey user profiles, taking advantage of such labeling to implicitly gather the user perceptual similarity. The profiles maintain the settings desired for each user, allowing tuning of the similarity assessment, which encompasses the dynamic change of the distance function employed through an interactive process. Experiments on medical images show that the method is effective and can improve the decision making process during analysis.
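The notion of a per-user profile that changes the distance function in response to relevance labels can be sketched as follows. This is only an illustration under stated assumptions, not the method of the paper: the candidate distances, the pairwise scoring rule, and the names `CANDIDATES`, `pairwise_score`, `update_profile`, and `knn` are all hypothetical, and images are assumed to be already reduced to numeric feature vectors.

```python
import math

def manhattan(a, b):
    return sum(abs(x - y) for x, y in zip(a, b))

def euclidean(a, b):
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def chebyshev(a, b):
    return max(abs(x - y) for x, y in zip(a, b))

# Assumed pool of distance functions a profile may switch between.
CANDIDATES = {"L1": manhattan, "L2": euclidean, "Linf": chebyshev}

def pairwise_score(dist, query, relevant, irrelevant):
    """Fraction of (relevant, irrelevant) pairs in which the relevant image
    is strictly closer to the query; independent of each distance's scale."""
    pairs = [(r, i) for r in relevant for i in irrelevant]
    if not pairs:
        return 0.0
    good = sum(1 for r, i in pairs if dist(query, r) < dist(query, i))
    return good / len(pairs)

def update_profile(profile, user, query, relevant, irrelevant):
    """Store in the user's profile the candidate that best matches the labels."""
    best = max(
        CANDIDATES,
        key=lambda name: pairwise_score(CANDIDATES[name], query, relevant, irrelevant),
    )
    profile[user] = best
    return best

def knn(profile, user, query, database, k):
    """k-nearest-neighbor query using the distance stored in the profile
    (Euclidean when the user has no profile yet)."""
    dist = CANDIDATES[profile.get(user, "L2")]
    return sorted(database, key=lambda v: dist(query, v))[:k]

profile = {}
chosen = update_profile(profile, "user_a", (0, 0), relevant=[(3, 0)], irrelevant=[(2, 2)])
with_profile = knn(profile, "user_a", (0, 0), [(2, 2), (3, 0)], k=1)
without_profile = knn(profile, "user_b", (0, 0), [(2, 2), (3, 0)], k=1)
```

The same query returns different nearest neighbors for the two users, which is the effect a stored profile is meant to have: the labels given by one user tune that user's similarity assessment without affecting anyone else's.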

Relevance: 10.00%

Abstract:

This essay presents the construction of a research object based on the theory of the semiotics of culture. It reflects on the modeling systems involved in the cycle of scientific communication within a university research group, from information seeking to the publication of study results. Natural languages (human languages) and artificial languages (computer search languages and controlled vocabularies) are identified. From this, the object takes shape as the set of texts of the culture and the semiosphere itself, represented by the dialogues among the subjects of the culture and the communication process involved. Some challenges arise, such as: the need for deeper study of the theory of the semiotics of culture, the participation of the researcher as a subject of the research as well, and interdisciplinary work to study an object with strands in information science, biomedicine, semiotics, and other related disciplines.

Relevance: 10.00%

Abstract:

Given a large image set in which very few images have labels, how to guess labels for the remaining majority? How to spot images that need brand new labels different from the predefined ones? How to summarize these data to route the user’s attention to what really matters? Here we answer all these questions. Specifically, we propose QuMinS, a fast, scalable solution to two problems: (i) Low-labor labeling (LLL): given an image set in which very few images have labels, find the most appropriate labels for the rest; and (ii) Mining and attention routing: in the same setting, find clusters, the top-N_O outlier images, and the N_R images that best represent the data. Experiments on satellite images spanning up to 2.25 GB show that, in contrast to state-of-the-art labeling techniques, QuMinS scales linearly with the data size and is up to 40 times faster than top competitors (GCap) while still achieving better or equal accuracy; it spots images that potentially require unpredicted labels; and it works even with tiny initial label sets, i.e., nearly five examples. We also report a case study of our method’s practical usage to show that QuMinS is a viable tool for automatic coffee crop detection from remote sensing images.