Biblioteca Digital

917 resultados para structuration of lexical data bases

Population-based incidence of myeloid malignancies: fifteen years of epidemiological data in the province of Girona, Spain

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Myeloid malignancies (MMs) are a heterogeneous group of hematologic malignancies presenting different incidence, prognosis and survival.1–3 Changing classifications (FAB 1994, WHO 2001 and WHO 2008) and few available epidemiological data complicate incidence comparisons.4,5 Taking this into account, the aims of the present study were: a) to calculate the incidence rates and trends of MMs in the Province of Girona, northeastern Spain, between 1994 and 2008 according to the WHO 2001 classification; and b) to predict the number of MMs cases in Spain during 2013. Data were extracted from the population-based Girona Cancer Registry (GCR) located in the north-east of Catalonia, Spain, and covering a population of 731,864 inhabitants (2008 census). Cases were registered according to the rules of the European Network for Cancer Registries and the Manual for Coding and Reporting Haematological Malignancies (HAEMACARE project). To ensure the complete coverage of MMs in the GCR, and especially myeloproliferative neoplasms (MPN) and myelodysplastic syndromes (MDS), a retrospective search was performed. The ICD-O-2 (1990) codes were converted into their corresponding ICD-O-3 (2000) codes, including MDS, polycythemia vera (PV) and essential thrombocythemia (ET) as malignant diseases. Results of crude rate (CR) and European standardized incidence rate (ASRE) were expressed per 100,000 inhabitants/year

Gridifying IBM's Generic Log Adapter to Speed-Up the Processing of Log Data

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Peer-reviewed

Regression analysis of compositional data when both the dependent variable and independent variable are components

Relevância:

100.00% 100.00%

Publicador:

Resumo:

It is well known that regression analyses involving compositional data need special attention because the data are not of full rank. For a regression analysis where both the dependent and independent variable are components we propose a transformation of the components emphasizing their role as dependent and independent variables. A simple linear regression can be performed on the transformed components. The regression line can be depicted in a ternary diagram facilitating the interpretation of the analysis in terms of components. An exemple with time-budgets illustrates the method and the graphical features

The single principle of compositional data analysis, continuing fallacies, confusionsand misunderstandings and some suggested remedies

Relevância:

100.00% 100.00%

Publicador:

Resumo:

In any discipline, where uncertainty and variability are present, it is important to haveprinciples which are accepted as inviolate and which should therefore drive statisticalmodelling, statistical analysis of data and any inferences from such an analysis.Despite the fact that two such principles have existed over the last two decades andfrom these a sensible, meaningful methodology has been developed for the statisticalanalysis of compositional data, the application of inappropriate and/or meaninglessmethods persists in many areas of application. This paper identifies at least tencommon fallacies and confusions in compositional data analysis with illustrativeexamples and provides readers with necessary, and hopefully sufficient, arguments topersuade the culprits why and how they should amend their ways

Extending the roughness of the data via transitive closures of similarity indexes

Relevância:

100.00% 100.00%

Publicador:

Resumo:

One main assumption in the theory of rough sets applied to information tables is that the elements that exhibit the same information are indiscernible (similar) and form blocks that can be understood as elementary granules of knowledge about the universe. We propose a variant of this concept defining a measure of similarity between the elements of the universe in order to consider that two objects can be indiscernible even though they do not share all the attribute values because the knowledge is partial or uncertain. The set of similarities define a matrix of a fuzzy relation satisfying reflexivity and symmetry but transitivity thus a partition of the universe is not attained. This problem can be solved calculating its transitive closure what ensure a partition for each level belonging to the unit interval [0,1]. This procedure allows generalizing the theory of rough sets depending on the minimum level of similarity accepted. This new point of view increases the rough character of the data because increases the set of indiscernible objects. Finally, we apply our results to a not real application to be capable to remark the differences and the improvements between this methodology and the classical one

Analysis of Satellite Data for Forest Segmentation

Relevância:

100.00% 100.00%

Publicador:

Resumo:

In this work we study the classification of forest types using mathematics based image analysis on satellite data. We are interested in improving classification of forest segments when a combination of information from two or more different satellites is used. The experimental part is based on real satellite data originating from Canada. This thesis gives summary of the mathematics basics of the image analysis and supervised learning , methods that are used in the classification algorithm. Three data sets and four feature sets were investigated in this thesis. The considered feature sets were 1) histograms (quantiles) 2) variance 3) skewness and 4) kurtosis. Good overall performances were achieved when a combination of ASTERBAND and RADARSAT2 data sets was used.

The Uexkü̈ll approach : evaluation of multivariate data organizations for support of visual information retrieval

Relevância:

100.00% 100.00%

Publicador:

Efficiency of distinct data mining algorithms for classifying stress level in piglets from their vocalization

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Among the challenges of pig farming in today's competitive market, there is factor of the product traceability that ensures, among many points, animal welfare. Vocalization is a valuable tool to identify situations of stress in pigs, and it can be used in welfare records for traceability. The objective of this work was to identify stress in piglets using vocalization, calling this stress on three levels: no stress, moderate stress, and acute stress. An experiment was conducted on a commercial farm in the municipality of Holambra, São Paulo State , where vocalizations of twenty piglets were recorded during the castration procedure, and separated into two groups: without anesthesia and local anesthesia with lidocaine base. For the recording of acoustic signals, a unidirectional microphone was connected to a digital recorder, in which signals were digitized at a frequency of 44,100 Hz. For evaluation of sound signals, Praat® software was used, and different data mining algorithms were applied using Weka® software. The selection of attributes improved model accuracy, and the best attribute selection was used by applying Wrapper method, while the best classification algorithms were the k-NN and Naive Bayes. According to the results, it was possible to classify the level of stress in pigs through their vocalization.

On Bayesian predictive modeling : with applications on the analysis of chromatography data in forensic investigations

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Med prediktion avses att man skattar det framtida värdet på en observerbar storhet. Kännetecknande för det bayesianska paradigmet är att osäkerhet gällande okända storheter uttrycks i form av sannolikheter. En bayesiansk prediktiv modell är således en sannolikhetsfördelning över de möjliga värden som en observerbar, men ännu inte observerad storhet kan anta. I de artiklar som ingår i avhandlingen utvecklas metoder, vilka bl.a. tillämpas i analys av kromatografiska data i brottsutredningar. Med undantag för den första artikeln, bygger samtliga metoder på bayesiansk prediktiv modellering. I artiklarna betraktas i huvudsak tre olika typer av problem relaterade till kromatografiska data: kvantifiering, parvis matchning och klustring. I den första artikeln utvecklas en icke-parametrisk modell för mätfel av kromatografiska analyser av alkoholhalt i blodet. I den andra artikeln utvecklas en prediktiv inferensmetod för jämförelse av två stickprov. Metoden tillämpas i den tredje artik eln för jämförelse av oljeprover i syfte att kunna identifiera den förorenande källan i samband med oljeutsläpp. I den fjärde artikeln härleds en prediktiv modell för klustring av data av blandad diskret och kontinuerlig typ, vilken bl.a. tillämpas i klassificering av amfetaminprover med avseende på produktionsomgångar.

Building Dynamic Data Centers for Fast Delivery of New Data and Data Updates

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Poster at Open Repositories 2014, Helsinki, Finland, June 9-13, 2014

The requirements and needs of global data usage in product lifecycle management

Relevância:

100.00% 100.00%

Publicador:

Resumo:

This study was done for ABB Ltd. Motors and Generators business unit in Helsinki. In this study, global data movement in large businesses is examined from a product data management (PDM) and enterprise resource planning (ERP) point-of-view. The purpose of this study was to understand and map out how a large global business handles its data in a multiple site structure and how it can be applied in practice. This was done by doing an empirical interview study on five different global businesses with design locations in multiple countries. Their master data management (MDM) solutions were inspected and analyzed to understand which solution would best benefit a large global architecture with many design locations. One working solution is a transactional hub which negates the effects of multisite transfers and reduces lead times. Also, the requirements and limitations of the current MDM architecture were analyzed and possible reform ideas given.

Data Governance and Automated Marketing – A Case Study of Expected Benefits of Organizing Data Governance in an ICT Company

Relevância:

100.00% 100.00%

Publicador:

Resumo:

This research is looking to find out what benefits employees expect the organization of data governance gains for an organization and how it benefits implementing automated marketing capabilities. Quality and usability of the data are crucial for organizations to meet various business needs. Organizations have more data and technology available what can be utilized for example in automated marketing. Data governance addresses the organization of decision rights and accountabilities for the management of an organization’s data assets. With automated marketing it is meant sending a right message, to a right person, at a right time, automatically. The research is a single case study conducted in Finnish ICT-company. The case company was starting to organize data governance and implementing automated marketing capabilities at the time of the research. Empirical material is interviews of the employees of the case company. Content analysis is used to interpret the interviews in order to find the answers to the research questions. Theoretical framework of the research is derived from the morphology of data governance. Findings of the research indicate that the employees expect the organization of data governance among others to improve customer experience, to improve sales, to provide abilities to identify individual customer’s life-situation, ensure that the handling of the data is according to the regulations and improve operational efficiency. The organization of data governance is expected to solve problems in customer data quality that are currently hindering implementation of automated marketing capabilities.

Comparison of baseline data between chronic kidney disease patients starting hemodialysis who live near and far from the dialysis center

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Introduction: The treatment offered to chronic kidney disease (CKD) patients before starting hemodialysis (HD) impacts prognosis. Objective: We seek differences among incident HD patients according to the distance between home and the dialysis center. Methods: We included 179 CKD patients undergoing HD. Patients were stratified in two groups: "living near the dialysis center" (patients whose hometown was in cities up to 100 km from the dialysis center) or as "living far from the dialysis center" (patients whose hometown was more than 100 km from the dialysis center). Socioeconomic status, laboratory results, awareness of CKD before starting HD, consultation with nephrologist before the first HD session, and type of vascular access when starting HD were compared between the two groups. Comparisons of continuous and categorical variables were performed using Student's t-test and the Chi-square test, respectively. Results: Ninety (50.3%) patients were classified as "living near the dialysis center" and 89 (49.7%) as "living far from the dialysis center". Patients living near the dialysis center were more likely to know about their condition of CKD than those living far from the dialysis center, respectively 46.6% versus 28.0% (p = 0.015). Although without statistical significance, patients living near the dialysis center had more frequent previous consultation with nephrologists (55.5% versus 42.6%; p = 0.116) and first HD by fistula (30.0% versus 19.1%; p = 0.128) than those living far from the dialysis center. Conclusion: There are potential advantages of CKD awareness, referral to nephrologists and starting HD through fistula among patients living near the dialysis center.

Model of Big Data Failure: Review of Information System Failure

Relevância:

100.00% 100.00%

Publicador:

Resumo:

In the new age of information technology, big data has grown to be the prominent phenomena. As information technology evolves, organizations have begun to adopt big data and apply it as a tool throughout their decision-making processes. Research on big data has grown in the past years however mainly from a technical stance and there is a void in business related cases. This thesis fills the gap in the research by addressing big data challenges and failure cases. The Technology-Organization-Environment framework was applied to carry out a literature review on trends in Business Intelligence and Knowledge management information system failures. A review of extant literature was carried out using a collection of leading information system journals. Academic papers and articles on big data, Business Intelligence, Decision Support Systems, and Knowledge Management systems were studied from both failure and success aspects in order to build a model for big data failure. I continue and delineate the contribution of the Information System failure literature as it is the principal dynamics behind technology-organization-environment framework. The gathered literature was then categorised and a failure model was developed from the identified critical failure points. The failure constructs were further categorized, defined, and tabulated into a contextual diagram. The developed model and table were designed to act as comprehensive starting point and as general guidance for academics, CIOs or other system stakeholders to facilitate decision-making in big data adoption process by measuring the effect of technological, organizational, and environmental variables with perceived benefits, dissatisfaction and discontinued use.

Model of Big Data Failure: Review of Information System Failure

Relevância:

100.00% 100.00%

Publicador:

Resumo:

In the new age of information technology, big data has grown to be the prominent phenomena. As information technology evolves, organizations have begun to adopt big data and apply it as a tool throughout their decision-making processes. Research on big data has grown in the past years however mainly from a technical stance and there is a void in business related cases. This thesis fills the gap in the research by addressing big data challenges and failure cases. The Technology-Organization-Environment framework was applied to carry out a literature review on trends in Business Intelligence and Knowledge management information system failures. A review of extant literature was carried out using a collection of leading information system journals. Academic papers and articles on big data, Business Intelligence, Decision Support Systems, and Knowledge Management systems were studied from both failure and success aspects in order to build a model for big data failure. I continue and delineate the contribution of the Information System failure literature as it is the principal dynamics behind technology-organization-environment framework. The gathered literature was then categorised and a failure model was developed from the identified critical failure points. The failure constructs were further categorized, defined, and tabulated into a contextual diagram. The developed model and table were designed to act as comprehensive starting point and as general guidance for academics, CIOs or other system stakeholders to facilitate decision-making in big data adoption process by measuring the effect of technological, organizational, and environmental variables with perceived benefits, dissatisfaction and discontinued use.

«
1
2
...
8
9
10
11
12
13
14
...
61
62
»