860 results for Frequent mining


Relevance:

20.00%

Publisher:

Abstract:

In the last decade, data mining has emerged as one of the most dynamic and lively areas in information technology. Although many algorithms and techniques for data mining have been proposed, they either focus on domain independent techniques or on very specific domain problems. A general requirement in bridging the gap between academia and business is to cater to general domain-related issues surrounding real-life applications, such as constraints, organizational factors, domain expert knowledge, domain adaption, and operational knowledge. Unfortunately, these either have not been addressed, or have not been sufficiently addressed, in current data mining research and development. Domain-Driven Data Mining (D3M) aims to develop general principles, methodologies, and techniques for modeling and merging comprehensive domain-related factors and synthesized ubiquitous intelligence surrounding problem domains with the data mining process, and discovering knowledge to support business decision-making. This paper aims to report original, cutting-edge, and state-of-the-art progress in D3M. It covers theoretical and applied contributions aiming to: 1) propose next-generation data mining frameworks and processes for actionable knowledge discovery, 2) investigate effective (automated, human and machine-centered and/or human-machined-co-operated) principles and approaches for acquiring, representing, modelling, and engaging ubiquitous intelligence in real-world data mining, and 3) develop workable and operational systems balancing technical significance and applications concerns, and converting and delivering actionable knowledge into operational applications rules to seamlessly engage application processes and systems.

Relevance:

20.00%

Publisher:

Abstract:

Background. The assembly of the tree of life has seen significant progress in recent years but algae and protists have been largely overlooked in this effort. Many groups of algae and protists have ancient roots and it is unclear how much data will be required to resolve their phylogenetic relationships for incorporation in the tree of life. The red algae, a group of primary photosynthetic eukaryotes more than a billion years old, provide the earliest fossil evidence for eukaryotic multicellularity and sexual reproduction. Despite this evolutionary significance, their phylogenetic relationships are understudied. This study aims to infer a comprehensive red algal tree of life at the family level from a supermatrix containing data mined from GenBank. We aim to locate remaining regions of low support in the topology, evaluate their causes and estimate the amount of data required to resolve them. Results. Phylogenetic analysis of a supermatrix of 14 loci and 98 red algal families yielded the most complete red algal tree of life to date. Visualization of statistical support showed the presence of five poorly supported regions. Causes for low support were identified with statistics about the age of the region, data availability and node density, showing that poor support has different origins in different parts of the tree. Parametric simulation experiments yielded optimistic estimates of how much data will be needed to resolve the poorly supported regions (ca. 10³ to ca. 10⁴ nucleotides for the different regions). Nonparametric simulations gave a markedly more pessimistic image, some regions requiring more than 2.8 × 10⁵ nucleotides or not achieving the desired level of support at all. The discrepancies between parametric and nonparametric simulations are discussed in light of our dataset and known attributes of both approaches. Conclusions. Our study takes the red algae one step closer to meaningful inclusion in the tree of life.
In addition to the recovery of stable relationships, the recognition of five regions in need of further study is a significant outcome of this work. Based on our analyses of current availability and future requirements of data, we make clear recommendations for forthcoming research.

Relevance:

20.00%

Publisher:

Abstract:

Background. We had previously established that inactivation of RUNX3 occurs by frequent promoter hypermethylation and protein mislocalization in invasive ductal carcinomas (IDC) of the breast. Here, we hypothesize that inactivation of RUNX3 occurring in ductal carcinoma in situ (DCIS) represents an early event in breast carcinogenesis. Methods. The study cohort of 40 patients included 17 pure DCIS cases and 23 cases of DCIS with associated IDC (DCIS-IDC). The DCIS and IDC components of mixed cases were manually microdissected to permit separate evaluation. All 63 samples (17 pure DCIS, plus 23 samples each of DCIS and IDC from the DCIS-IDC cases) were analyzed for RUNX3 protein expression using the R3-6E9 monoclonal antibody, and for promoter methylation status by methylation-specific PCR. Results. Compared to matched normal breast samples (4 of 40, 10%), DCIS (35 of 40, 88%) and IDC (21 of 23, 91%) exhibited significant RUNX3 inactivation (P

Relevance:

20.00%

Publisher:

Abstract:

This paper explores the roles of science and market devices in the commodification of ‘nature’ and the configuration of flows of speculative capital. It focuses on mineral prospecting and the market for shares in ‘junior’ mining companies. In recent years these companies have expanded the reach of their exploration activities overseas, taking advantage of innovations in exploration methodologies and the liberalisation of fiscal and property regimes in ‘emerging’ mineral rich developing countries. Recent literature has explored how the reconfiguration of notions of ‘risk’ has structured the uneven distribution of rents. It is increasingly evident that neoliberal framing of environmental, political, social and economic risks has set in motion overflows that multinational mining capital had not bargained for (e.g. nationalisation, violence and political resistance). However, the role of ‘geological risk’ in animating flows of mining finance is often assumed as a ‘technical’ given. Yet geological knowledge claims, translated locally, designed to travel globally, assemble heterogeneous elements within distanciated regimes of metrology, valuation and commodity production. This paper explores how knowledge of nature is enrolled within systems of property relations, focusing on the genealogy of the knowledge practices that animate contemporary circuits of speculative mining finance. It argues that the financing of mineral prospecting mobilises pragmatic and situated forms of knowledge rather than actuarially driven calculations that promise predictability. A Canadian public enquiry struck in the wake of scandal associated with Bre-X’s prospecting activities in Indonesia is used to glean insights into the ways in which the construction of a system of public warrant to underpin financial speculation is predicated upon particular subjectivities and the outworking of everyday practices and struggles over ‘value’. 
Reflection on practical investments in processes of standardisation, rituals of verification and systems of accreditation reveals much about how the materiality of things shapes the ways in which regional and global financial circuits are integrated, selectively transforming existing social relations and forms of knowledge production.

Relevance:

20.00%

Publisher:

Abstract:

We conducted data-mining analyses of genome-wide association (GWA) studies of the CATIE and MGS-GAIN datasets, and found 13 markers in the two physically linked genes, PTPN21 and EML5, showing nominally significant association with schizophrenia. Linkage disequilibrium (LD) analysis indicated that all 7 markers from PTPN21 shared high LD (r² > 0.8), including rs2274736 and rs2401751, the two non-synonymous markers with the most significant association signals (rs2401751, P = 1.10 × 10⁻³ and rs2274736, P = 1.21 × 10⁻³). In a meta-analysis of all 13 replication datasets with a total of 13,940 subjects, we found that the two non-synonymous markers are significantly associated with schizophrenia (rs2274736, OR = 0.92, 95% CI: 0.86-0.97, P = 5.45 × 10⁻³ and rs2401751, OR = 0.92, 95% CI: 0.86-0.97, P = 5.29 × 10⁻³). One SNP (rs7147796) in EML5 is also significantly associated with the disease (OR = 1.08, 95% CI: 1.02-1.14, P = 6.43 × 10⁻³). These 3 markers remain significant after Bonferroni correction. Furthermore, haplotype-conditioned analyses indicated that the association signals observed between rs2274736/rs2401751 and rs7147796 are statistically independent. Given that 2 non-synonymous markers in PTPN21 are associated with schizophrenia, further investigation of this locus is warranted.