998 resultados para Solution mining.
Resumo:
Volumes of data used in science and industry are growing rapidly. When researchers face the challenge of analyzing them, their format is often the first obstacle. Lack of standardized ways of exploring different data layouts requires an effort each time to solve the problem from scratch. Possibility to access data in a rich, uniform manner, e.g. using Structured Query Language (SQL) would offer expressiveness and user-friendliness. Comma-separated values (CSV) are one of the most common data storage formats. Despite its simplicity, with growing file size handling it becomes non-trivial. Importing CSVs into existing databases is time-consuming and troublesome, or even impossible if its horizontal dimension reaches thousands of columns. Most databases are optimized for handling large number of rows rather than columns, therefore, performance for datasets with non-typical layouts is often unacceptable. Other challenges include schema creation, updates and repeated data imports. To address the above-mentioned problems, I present a system for accessing very large CSV-based datasets by means of SQL. It's characterized by: "no copy" approach - data stay mostly in the CSV files; "zero configuration" - no need to specify database schema; written in C++, with boost [1], SQLite [2] and Qt [3], doesn't require installation and has very small size; query rewriting, dynamic creation of indices for appropriate columns and static data retrieval directly from CSV files ensure efficient plan execution; effortless support for millions of columns; due to per-value typing, using mixed text/numbers data is easy; very simple network protocol provides efficient interface for MATLAB and reduces implementation time for other languages. The software is available as freeware along with educational videos on its website [4]. It doesn't need any prerequisites to run, as all of the libraries are included in the distribution package. I test it against existing database solutions using a battery of benchmarks and discuss the results.
Resumo:
O assunto Brasil foi analisado na base de teses francesas DocThèses, compreendendo os anos de 1969 a 1999. Utilizou-se a técnica de Data Mining como ferramenta para obter inteligência e conhecimento. O software utilizado para a limpeza da base DocThèses foi o Infotrans, e, para a preparação dos dados, empregou-se o Dataview. Os resultados da análise foram ilustrados com a aplicação dos pressupostos da Lei de Zipf, classificando-se as informações em trivial, interessante e ruído, conforme a distribuição de freqüência. Conclui-se que a técnica do Data Mining associada a softwares especialistas é uma poderosa aliada no emprego de inteligência no processo decisório em todos os níveis, inclusive o nível macro, pois oferece subsídios para a consolidação, investimento e desenvolvimento de ações e políticas.
Resumo:
Etude critique de Charles Larmore, Modernité et morale (Paris, PUF, 1993). Cet article présente et discute le projet de son auteur de défendre l'idée d'une morale « pragmatiste » et « intuitionniste ». Restituant la position de l'auteur, il expose les arguments en faveur d'une conception pragmatiste de la vérité morale et ceux en faveur du recours à l'intuition pour découvrir le contenu de nos obligations morales. Dans une brève note critique finale, il suggère que le pragmatisme semble peu à même d'échapper tout à fait au reproche de relativisme.
Resumo:
Motivation: Genome-wide association studies have become widely used tools to study effects of genetic variants on complex diseases. While it is of great interest to extend existing analysis methods by considering interaction effects between pairs of loci, the large number of possible tests presents a significant computational challenge. The number of computations is further multiplied in the study of gene expression quantitative trait mapping, in which tests are performed for thousands of gene phenotypes simultaneously. Results: We present FastEpistasis, an efficient parallel solution extending the PLINK epistasis module, designed to test for epistasis effects when analyzing continuous phenotypes. Our results show that the algorithm scales with the number of processors and offers a reduction in computation time when several phenotypes are analyzed simultaneously. FastEpistasis is capable of testing the association of a continuous trait with all single nucleotide polymorphism ( SNP) pairs from 500 000 SNPs, totaling 125 billion tests, in a population of 5000 individuals in 29, 4 or 0.5 days using 8, 64 or 512 processors.
Resumo:
The objective of this study was to establish critical values of the N indices, namely soil-plant analysis development (SPAD), petiole sap N-NO3 and organic N in the tomato leaf adjacent to the first cluster (LAC), under soil and nutrient solution conditions, determined by different statistical approaches. Two experiments were conducted in randomized complete block design with four repli-cations. Tomato plants were grown in soil, in 3 L pot, with five N rates (0, 100, 200, 400 and 800 mg kg-1) and in solution at N rates of 0, 4, 8, 12 and 16 mmol L-1. Experiments in nutrient solution and soil were finished at thirty seven and forty two days after transplanting, respectively. At those times, SPAD index and petiole sap N-NO3 were evaluated in the LAC. Then, plants were harvested, separated in leaves and stem, dried at 70ºC, ground and weighted. The organic N was determined in LAC dry matter. Three statistical procedures were used to calculate critical N values. There were accentuated discrepancies for critical values of N indices obtained with plants grown in soil and nutrient solution as well as for different statistical procedures. Critical values of nitrogen indices at all situations are presented.
Resumo:
BACKGROUND: The annotation of protein post-translational modifications (PTMs) is an important task of UniProtKB curators and, with continuing improvements in experimental methodology, an ever greater number of articles are being published on this topic. To help curators cope with this growing body of information we have developed a system which extracts information from the scientific literature for the most frequently annotated PTMs in UniProtKB. RESULTS: The procedure uses a pattern-matching and rule-based approach to extract sentences with information on the type and site of modification. A ranked list of protein candidates for the modification is also provided. For PTM extraction, precision varies from 57% to 94%, and recall from 75% to 95%, according to the type of modification. The procedure was used to track new publications on PTMs and to recover potential supporting evidence for phosphorylation sites annotated based on the results of large scale proteomics experiments. CONCLUSIONS: The information retrieval and extraction method we have developed in this study forms the basis of a simple tool for the manual curation of protein post-translational modifications in UniProtKB/Swiss-Prot. Our work demonstrates that even simple text-mining tools can be effectively adapted for database curation tasks, providing that a thorough understanding of the working process and requirements are first obtained. This system can be accessed at http://eagl.unige.ch/PTM/.
Resumo:
En aquest article es presenten breument els diferents capítols d’un treball interdisciplinari per tal d’entendre el context de prohibició de la mineria de ferro a Goa a finals del 2012 i proporcionar la informació necessària per tal d’orientar i gestionar la presa de decisions sobre l’activitat minera en un futur. Els sis primers capítols consisteixen en l’estudi del medi abiòtic, medi biòtic, fluxos de materials, aspectes socials, aspectes econòmics i finalment aspectes polítics. En canvi, en els dos últims capítols s'avaluen i es gestionen els impactes ambientals de la mineria mitjançant, per una banda, una anàlisi DPSIR i, d'altra banda, es proposen tres escenaris per integrar les diferents variables i fomentar la participació en la presa de decisions. S’ha dut a terme una extensa recerca mitjançant la recopilació de dades, entrevistes i visites a les zones d’estudi d’interès per tal d’entendre el conflicte de la mineria a Goa.
Resumo:
The main objective of this Master Thesis is to discover more about Girona’s image as a tourism destination from different agents’ perspective and to study its differences on promotion or opinions. In order to meet this objective, three components of Girona’s destination image will be studied: attribute-based component, the holistic component, and the affective component. It is true that a lot of research has been done about tourism destination image, but it is less when we are talking about the destination of Girona. Some studies have already focused on Girona as a tourist destination, but they used a different type of sample and different methodological steps. This study is new among destination studies in the sense that it is based only on textual online data and it follows a methodology based on text-miming. Text-mining is a kind of methodology that allows people extract relevant information from texts. Also, after this information is extracted by this methodology, some statistical multivariate analyses are done with the aim of discovering more about Girona’s tourism image
Resumo:
A multicomponent indicator displacement assay ( MIDA) based on an organometallic receptor and three dyes can be used for the identification and quantification of nucleotides in aqueous solution at neutral pH.
Resumo:
We conducted an open, randomized, and prospective study to determine the effect of hypertonic saline on the secretion of antidiuretic hormone (ADH) and aldosterone in children with severe head injury (Glasgow coma scale <8). Thirty-one consecutive patients at a level III pediatric intensive care unit at a children's hospital received either lactated Ringer's solution (Ringer's group, n = 16) or hypertonic saline (Hypertonic Saline group, n = 15) over a 3-day period. Serum ADH levels were significantly larger in the Hypertonic Saline group as compared with the Ringer's group (P = 0.001; analysis of variance) and were correlated to sodium intake (Ringer's group: r = 0.39, R(2) = 0.15, P = 0.02; Hypertonic Saline group: r = 0.42, R(2) = 0.18, P = 0.02) and volume of fluids given IV (Ringer's group: r = 0.38, R(2) = 0.15, P = 0.02; Hypertonic Saline group: r = 0.32, R(2) = 0.1, P = not significant). Correlation of ADH to plasma osmolality was significant if plasma osmolality was >280 mOsm/kg (r = 0.5, R(2) = 0.25, P = 0.06), indicating an osmotic threshold for ADH release. Serum aldosterone levels were larger on the first day than during Days 2 and 3 in both groups and inversely correlated to serum sodium levels only in the Ringer's group (r = -0.55, R(2) = 0.3, P < 0.001). This group received a significantly larger fluid volume on Day 1 (P = 0.05, Mann-Whitney U-test) than did patients in the Hypertonic Saline group, indicating hypovolemia during the first day. Head-injured children have appropriate levels of ADH. They may be hypovolemic during the first day of treatment, especially if they receive lactated Ringer's solution. IMPLICATIONS: In head-injured patients, we recommend fluid restriction to avoid inappropriate secretion of antidiuretic hormone. In a prospective, randomized, and controlled study in 31 children, we were able to show that the antidiuretic hormone levels are appropriate in response to hypovolemia, sodium load, or both.
Resumo:
The Brianconnais area is explained as a large scale exotic terrain separating from Europe during the opening of the Valais ocean. It's displacement history during the Alpine evolution allows to replace older concepts of multiple oceans separating narrow strips of continental crust.
Resumo:
It is common practice in genome-wide association studies (GWAS) to focus on the relationship between disease risk and genetic variants one marker at a time. When relevant genes are identified it is often possible to implicate biological intermediates and pathways likely to be involved in disease aetiology. However, single genetic variants typically explain small amounts of disease risk. Our idea is to construct allelic scores that explain greater proportions of the variance in biological intermediates, and subsequently use these scores to data mine GWAS. To investigate the approach's properties, we indexed three biological intermediates where the results of large GWAS meta-analyses were available: body mass index, C-reactive protein and low density lipoprotein levels. We generated allelic scores in the Avon Longitudinal Study of Parents and Children, and in publicly available data from the first Wellcome Trust Case Control Consortium. We compared the explanatory ability of allelic scores in terms of their capacity to proxy for the intermediate of interest, and the extent to which they associated with disease. We found that allelic scores derived from known variants and allelic scores derived from hundreds of thousands of genetic markers explained significant portions of the variance in biological intermediates of interest, and many of these scores showed expected correlations with disease. Genome-wide allelic scores however tended to lack specificity suggesting that they should be used with caution and perhaps only to proxy biological intermediates for which there are no known individual variants. Power calculations confirm the feasibility of extending our strategy to the analysis of tens of thousands of molecular phenotypes in large genome-wide meta-analyses. We conclude that our method represents a simple way in which potentially tens of thousands of molecular phenotypes could be screened for causal relationships with disease without having to expensively measure these variables in individual disease collections.
Resumo:
The use of quantum dots (QDs) in the area of fingermark detection is currently receiving a lot of attention in the forensic literature. Most of the research efforts have been devoted to cadmium telluride (CdTe) quantum dots often applied as powders to the surfaces of interests. Both the use of cadmium and the nano size of these particles raise important issues in terms of health and safety. This paper proposes to replace CdTe QDs by zinc sulphide QDs doped with copper (ZnS:Cu) to address these issues. Zinc sulphide-copper doped QDs were successfully synthesized, characterized in terms of size and optical properties and optimized to be applied for the detection of impressions left in blood, where CdTe QDs proved to be efficient. Effectiveness of detection was assessed in comparison with CdTe QDs and Acid Yellow 7 (AY7, an effective blood reagent), using two series of depletive blood fingermarks from four donors prepared on four non-porous substrates, i.e. glass, transparent polypropylene, black polyethylene and aluminium foil. The marks were cut in half and processed separately with both reagents, leading to two comparison series (ZnS:Cu vs. CdTe, and ZnS:Cu vs. AY7). ZnS:Cu proved to be better than AY7 and at least as efficient as CdTe on most substrates. Consequently, copper-doped ZnS QDs constitute a valid substitute for cadmium-based QDs to detect blood marks on non-porous substrates and offer a safer alternative for routine use.