33 results for Frequent itemsets mining

in Repositório Institucional UNESP - Universidade Estadual Paulista "Julio de Mesquita Filho"


Relevance:

100.00%

Publisher:

Abstract:

Multi-relational data mining enables pattern mining from multiple tables. Existing multi-relational association rule mining algorithms cannot process large volumes of data, because the memory they require exceeds the amount available. The proposed algorithm, MR-Radix, presents a framework that optimizes memory usage. It also uses the concept of partitioning to handle large volumes of data. The original contribution of this proposal is to enable superior performance compared to other related algorithms and, moreover, to successfully complete the task of mining association rules in large databases, bypassing the problem of available memory. One of the tests showed that MR-Radix uses fourteen times less memory than GFP-growth. © 2011 IEEE.
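The abstract does not describe how MR-Radix partitions its input, so the following is only a minimal sketch of the general partitioning idea for frequent itemset mining, not the MR-Radix algorithm. It assumes a classic two-pass scheme: itemsets that are frequent in at least one chunk become candidates, and a second pass counts those candidates over all chunks. The function names, the `min_pct` percentage threshold, and the `chunk_size` parameter are hypothetical, and the chunks are sliced from an in-memory list purely for illustration.

```python
from collections import Counter
from itertools import combinations


def count_itemsets(transactions, max_size):
    """Count every itemset of up to max_size items (brute force, one chunk at a time)."""
    counts = Counter()
    for transaction in transactions:
        items = sorted(set(transaction))
        for k in range(1, max_size + 1):
            counts.update(combinations(items, k))
    return counts


def is_frequent(count, n_rows, min_pct):
    # Integer arithmetic avoids floating-point rounding at the threshold.
    return count * 100 >= min_pct * n_rows


def mine_partitioned(transactions, min_pct, chunk_size, max_size=3):
    """Two-pass partitioned mining (illustrative assumption, not MR-Radix)."""
    chunks = [transactions[i:i + chunk_size]
              for i in range(0, len(transactions), chunk_size)]

    # Pass 1: an itemset that is frequent overall must be frequent
    # in at least one chunk, so local winners form the candidate set.
    candidates = set()
    for chunk in chunks:
        local = count_itemsets(chunk, max_size)
        candidates.update(
            s for s, c in local.items() if is_frequent(c, len(chunk), min_pct)
        )

    # Pass 2: count only the candidates across all chunks.
    totals = Counter()
    for chunk in chunks:
        for transaction in chunk:
            items = set(transaction)
            totals.update(c for c in candidates if items.issuperset(c))

    n_rows = len(transactions)
    return {s: c for s, c in totals.items() if is_frequent(c, n_rows, min_pct)}


baskets = [["a", "b"], ["a", "b", "c"], ["a", "c"], ["b", "c"]]
result = mine_partitioned(baskets, min_pct=50, chunk_size=2)
```

Only one chunk's itemset counts are held at a time during the first pass, which is the memory-saving property that partitioning is meant to provide. A real implementation would read each chunk from disk rather than slice a list.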

Relevance:

100.00%

Publisher:

Abstract:

The multi-relational data mining approach has emerged as an alternative for the analysis of structured data, such as relational databases. Unlike traditional algorithms, multi-relational proposals allow mining multiple tables directly, avoiding costly join operations. This paper presents a comparative study of the traditional PatriciaMine algorithm and its corresponding multi-relational proposal, MR-Radix, in order to evaluate the performance of the two approaches for mining association rules in relational databases. This study presents two original contributions: the proposal of a multi-relational algorithm, MR-Radix, which is efficient for use in relational databases in terms of both execution time and memory usage; and empirical evidence of the performance advantage of the multi-relational approach over several tables, which avoids the costly join operations across multiple tables. © 2011 IEEE.

Relevance:

100.00%

Publisher:

Abstract:

Background: The multi-relational approach has emerged as an alternative for analyzing structured data such as relational databases, since it allows data mining to be applied to multiple tables directly, thus avoiding expensive join operations and semantic losses. This work therefore proposes an algorithm that follows the multi-relational approach. Methods: To compare the performance of the traditional and multi-relational approaches for mining association rules, this paper presents an empirical study of PatriciaMine, a traditional algorithm, and its corresponding multi-relational proposal, MR-Radix. Results: This work showed the performance advantages of the multi-relational approach over several tables, which avoids the high cost of join operations across multiple tables and semantic losses. MR-Radix ran faster than PatriciaMine, despite handling complex multi-relational patterns. Memory usage showed a more conservative growth curve for MR-Radix than for PatriciaMine, which shows that a growing demand for frequent items in MR-Radix does not result in significant growth in memory usage, as it does in PatriciaMine. Conclusion: The comparative study of PatriciaMine and MR-Radix confirmed the efficacy of the multi-relational approach in the data mining process in terms of both execution time and memory usage. In addition, the proposed multi-relational algorithm, unlike other algorithms of this approach, is efficient for use in large relational databases.

Relevance:

20.00%

Publisher:

Abstract:

Garimpo gold mining released about 2,500 tons of mercury into the Brazilian Amazonian environment in the 1980-1995 period. The northern region of Mato Grosso State, an important gold mining and trading area during the Amazonian gold rush, is now at a turning point regarding its economic future. Nowadays, activities related to gold mining have little relevance to its economy. Thus, the local communities are looking for economic alternatives for the development of the region. Cooperative fish farming is one such alternative. However, some projects are implemented directly on areas degraded by the former garimpo activity, and the mercury left behind still poses risks, especially through its potential accumulation in fish. The objective of the present study was to evaluate the levels of mercury contamination in two fish farming areas, Paranaita and Alta Floresta, with and without records of past gold-washing activity, respectively. Data such as mercury concentration in fish of different trophic levels, sizes, and weights, as well as the physical and chemical parameters of the water, were measured and considered. These preliminary data have shown no significant difference between the two fish farming areas with respect to mercury levels in fish. (c) 2004 Elsevier B.V. All rights reserved.

Relevance:

20.00%

Publisher:

Abstract:

The main goal of our research was to search for SSRs in the Eucalyptus EST FORESTs database (using software for mining SSR motifs). With this objective, we created a database for cataloging Eucalyptus EST-derived SSRs, and developed a bioinformatics tool, named Satellyptus, for finding and analyzing microsatellites in the Eucalyptus EST database. The search for microsatellites in the FORESTs database containing 71,115 Eucalyptus EST sequences (52.09 Mb) revealed 20,530 SSRs in 15,621 ESTs. The SSR abundance detected in the Eucalyptus EST database (29% or one microsatellite every four sequences) is considered very high for plants. Among the categories of SSR motifs, the dimeric (37%) and trimeric ones (33%) predominated. The AG/CT motif was the most frequent (35.15%), followed by the trimeric CCG/CGG (12.81%). From a random sample of 1,217 sequences, 343 microsatellites in 265 SSR-containing sequences were identified. Approximately 48% of these ESTs containing microsatellites were homologous to proteins with known biological function. Most of the microsatellites detected in Eucalyptus ESTs were positioned at either the 5' or 3' end. Our next priority involves the design of flanking primers for codominant SSR loci, which could lead to the development of a set of microsatellite-based markers suitable for marker-assisted Eucalyptus breeding programs.
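The abstract does not say how Satellyptus detects SSR motifs, so the following is only a minimal sketch of one common way to find dimeric and trimeric microsatellites in a sequence, using a regular expression with a backreference. The function name `find_ssrs` and the minimum repeat counts (6 for dimers, 5 for trimers) are assumptions made for illustration, not values taken from the study.

```python
import re

# Hypothetical thresholds: motif length -> minimum number of tandem repeats.
DEFAULT_MIN_REPEATS = {2: 6, 3: 5}


def find_ssrs(sequence, min_repeats=None):
    """Return (start, motif, repeat_count) for each tandem repeat found."""
    if min_repeats is None:
        min_repeats = DEFAULT_MIN_REPEATS
    sequence = sequence.upper()
    hits = []
    for motif_len, repeats in min_repeats.items():
        # ([ACGT]{k}) captures a motif; \1{n-1,} requires it to repeat
        # at least n times in total, back to back.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (motif_len, repeats - 1))
        for match in pattern.finditer(sequence):
            motif = match.group(1)
            if len(set(motif)) == 1:
                continue  # skip homopolymer runs such as AAAAAA
            hits.append((match.start(), motif, len(match.group(0)) // motif_len))
    return sorted(hits)


example = "TT" + "AG" * 7 + "T" + "CCG" * 5 + "TA"
ssrs = find_ssrs(example)
```

A production tool would also need to treat a motif and its reverse complement (for example AG and CT) as the same class, as the AG/CT and CCG/CGG groupings in the abstract suggest; this sketch does not do that.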

Relevance:

20.00%

Publisher:

Abstract:

Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)

Relevance:

20.00%

Publisher:

Abstract:

Cancer is regarded as abnormal cellular multiplication that is not controlled by the organism, and its cells present differentiated DNA. Initially, the disease does not show clinical signs, but it can be diagnosed by laboratory examinations. When tumors are present in the maxillofacial area, they can entail the loss of organs in this area, which can lead to the patient's exclusion from the social environment. This paper aimed to show, through a literature review, the cancers that most commonly occur in the face and the possibilities for rehabilitating the mutilated patient through surgical reconstruction and prostheses.

Relevance:

20.00%

Publisher:

Abstract:

Background. Loss of heterozygosity (LOH) correlates with inactivated tumor suppressor genes. LOH at chromosome arm 22q has been found in a variety of human neoplasms, suggesting that this region contains a tumor suppressor gene(s) other than NF2 important to tumorigenesis. The aim of this study was to evaluate the presence of LOH on chromosome 22q11.2-13 and determine whether there was a relationship between loss in this genomic region and tumor histologic parameters, anatomic site, and survival in patients with squamous cell carcinoma of the head and neck (HNSCC). Methods. Fifty matched blood and HNSCC tumor samples taken at the time of surgical treatment were evaluated for LOH by use of four microsatellite markers mapping to 22q11.2-q13. Clinical information was available for all patients. The frequency and distribution of LOH was correlated with clinical (age, sex, use of tobacco and alcohol, site of primary tumor, clinical stage, adjuvant therapy and overall survival) and histologic parameters (histopathologic stage, tumor differentiation). Results. LOH at 22q was found in 19 of 50 (38%) informative tumors. The respective incidence of allelic loss for the patients was as follows: 28% at D22S421, 10% at D22S277, 8% at D22S44S, and 4% at D22S280. No statistical differences were apparent with a mean follow-up of 30 months. Laryngeal tumors showed a higher incidence of LOH compared with oral tumors. Conclusions. These results suggest that the D22S277 locus may be closely linked to a tumor suppressor gene (TSG) and involved in upper aerodigestive tract carcinogenesis. In particular, laryngeal tumors may harbor another putative TSG on 22q11.2-q12.3 that may play a role in aggressive stage III/IV disease. (C) 2000 John Wiley & Sons, Inc.

Relevance:

20.00%

Publisher:

Abstract:

Variations in the phenotypic expression of heterozygous beta thalassemia reflect the formation of different populations. To better understand the profile of heterozygous beta thalassemia in the Brazilian population, we aimed to establish parameters to guide the diagnosis of carriers and to calculate its frequency from information stored in an electronic database. Using a data mining tool, we evaluated information on 10,960 blood samples deposited in a relational database. Over the years, improved diagnostic technology has facilitated the elucidation of suspected beta thalassemia heterozygote cases, with an average frequency of 3.5% of referred cases. We also found that the Brazilian beta thalassemia trait has classic increases of Hb A2 and Hb F (60%), mainly caused by mutations in beta zero thalassemia, especially in the southeast of the country.

Relevance:

20.00%

Publisher:

Abstract:

Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES)

Relevance:

20.00%

Publisher:

Abstract:

Ore mines installed in the lower-middle portion of Ribeira de Iguape River (São Paulo State, southeastern Brazil), together with the Panelas Plant, are responsible for the contamination of the Iguape-Cananeia-Paranagua lagoon-estuarine complex. The lower-middle portion of Grande Creek Basin, located in the district of Adrianopolis (Parana State, southern Brazil), is under environmental impact because of mining activities. The mines of Perau, at Perau Creek, Canoas, at Canoas Creek, and Barrinha, at Barrinha Creek and Laranjal Creek, have been shut down. The transport of lead in fluvial sediments is mainly associated with organic matter, carbonates, the residual fraction, and adsorbables, whereas the transport of zinc is associated with the organic and residual fractions, oxides and hydroxides of iron and manganese, carbonate, and adsorbables. The transport of copper is associated with the residual fraction, oxides and hydroxides of iron and manganese, organic matter, carbonate, and adsorbables.

Relevance:

20.00%

Publisher:

Abstract:

This investigation reports the results of a study carried out in sand mining areas belonging to CRS-Mineracao Industria e Comercio Ltd. and Sibelco Mineracao Ltd. Both areas are located around Analandia municipality, near the center of São Paulo State, Brazil. Flow rate measurements and hydrochemical analyses were carried out over different periods of time, with the aim of evaluating the possible release of several constituents to the liquid phase, which may be a source of pollution of surface water resources. This is because some tributaries of the Corumbatai River may be suffering contamination, which would degrade water quality, a very important resource in the region, as the water is extensively used for drinking purposes, among others.

Relevance:

20.00%

Publisher:

Abstract:

Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq)

Relevance:

20.00%

Publisher:

Abstract:

The 1980-1990 Amazonian gold rush left an enormous liability, and gold mining has increasingly been replaced by fish aquaculture. This work aimed to identify the mercury levels in the environment associated with fish farms located in the north of Mato Grosso State, Southern Amazon. Sediment and soil samples were analyzed for total organic carbon and total mercury. Results indicate that the chemical characteristics of the sediment largely depend on the management procedures of the fish pond (liming, fish food used and fish population). The soils presented relatively low concentrations when compared with other data from the literature.

Relevance:

20.00%

Publisher:

Abstract:

Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq)