444 resultados para Duplicate tuples


Relevância:

100.00% 100.00%

Publicador:

Resumo:

Aiming to ensure greater reliability and consistency of data stored in the database, the data cleaning stage is set early in the process of Knowledge Discovery in Databases (KDD) and is responsible for eliminating problems and adjust the data for the later stages, especially for the stage of data mining. Such problems occur in the instance level and schema, namely, missing values, null values, duplicate tuples, values outside the domain, among others. Several algorithms were developed to perform the cleaning step in databases, some of them were developed specifically to work with the phonetics of words, since a word can be written in different ways. Within this perspective, this work presents as original contribution an optimization of algorithm for the detection of duplicate tuples in databases through phonetic based on multithreading without the need for trained data, as well as an independent environment of language to be supported for this. © 2011 IEEE.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Pós-graduação em Ciência da Computação - IBILCE

Relevância:

20.00% 20.00%

Publicador:

Resumo:

BACKGROUND: Gene duplication is the primary source of new genes with novel or altered functions. It is known that duplicates may obtain these new functional roles by evolving divergent expression patterns and/or protein functions after the duplication event. Here, using yeast (Saccharomyces cerevisiae) as a model organism, we investigate a previously little considered mode for the functional diversification of duplicate genes: subcellular adaptation of encoded proteins. RESULTS: We show that for 24-37% of duplicate gene pairs derived from the S. cerevisiae whole-genome duplication event, the two members of the pair encode proteins that localize to distinct subcellular compartments. The propensity of yeast duplicate genes to evolve new localization patterns depends to a large extent on the biological function of their progenitor genes. Proteins involved in processes with a wider subcellular distribution (for example, catabolism) frequently evolved new protein localization patterns after duplication, whereas duplicate proteins limited to a smaller number of organelles (for example, highly expressed biosynthesis/housekeeping proteins with a slow rate of evolution) rarely relocate within the cell. Paralogous proteins evolved divergent localization patterns by partitioning of ancestral localizations ('sublocalization'), but probably more frequently by relocalization to new compartments ('neolocalization'). We show that such subcellular reprogramming may occur through selectively driven substitutions in protein targeting sequences. Notably, our data also reveal that relocated proteins functionally adapted to their new subcellular environments and evolved new functional roles through changes of their physico-chemical properties, expression levels, and interaction partners. CONCLUSION: We conclude that protein subcellular adaptation represents a common mechanism for the functional diversification of duplicate genes.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

En 1940, Paul Erdős énonça une conjecture sur la distribution des classes inversibles modulo un entier. La présente thèse étudie la distribution des k-uplets de classes inversibles propose une preuve de la conjecture d'Erdős étendue au cas des k-uplets.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Introduction A high saturated fatty acid intake is a well recognized risk factor for coronary heart disease development. More recently a high intake of n-6 polyunsaturated fatty acids (PUFA) in combination with a low intake of the long chain n-3 PUFA, eicosapentaenoic acid and docosahexaenoic acid has also been implicated as an important risk factor. Aim To compare total dietary fat and fatty acid intake measured by chemical analysis of duplicate diets with nutritional database analysis of estimated dietary records, collected over the same 3-day study period. Methods Total fat was analysed using soxhlet extraction and subsequently the individual fatty acid content of the diet was determined by gas chromatography. Estimated dietary records were analysed using a nutrient database which was supplemented with a selection of dishes commonly consumed by study participants. Results Bland & Altman statistical analysis demonstrated a lack of agreement between the two dietary assessment techniques for determining dietary fat and fatty acid intake. Conclusion The lack of agreement observed between dietary evaluation techniques may be attributed to inadequacies in either or both assessment techniques. This study highlights the difficulties that may be encountered when attempting to accurately evaluate dietary fat intake among the population.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Oceanic sediments deposited at high rate close to continents are dominated by terrigenous material. Aside from dilution by biogenic components, their chemical compositions reflect those of nearby continental masses. This study focuses on oceanic sediments coming from the juvenile Canadian Cordillera and highlights systematic differences between detritus deriving from juvenile crust and detritus from old and mature crust. We report major and trace element concentrations for 68 sediments from the northernmost part of the Cascade forearc, drilled at ODP Sites 888 and 1027. The calculated weighted averages for each site can then be used in the future to quantify the contribution of subducted sediments to Cascades volcanism. The two sites have similar compositions but Site 888, located closer to the continent, has higher sandy turbidite contents and displays higher bulk SiO2/Al2O3 with lower bulk Nb/Zr, attributed to the presence of zircons in the coarse sands. Comparison with published data for other oceanic sedimentary piles demonstrates the existence of systematic differences between modern sediments deriving from juvenile terranes (juvenile sediments) and modern sediments derived from mature continental areas (cratonic sediments). The most striking systematic difference is for Th/Nb, Th/U, Nb/U and Th/Rb ratios: juvenile sediments have much lower ratios than cratonic sediments. The small enrichment of Th over Nb in cratonic sediments may be explained by intracrustal magmatic and metamorphic differentiation processes. In contrast, their elevated Th/U and Nb/U ratios (average values of 6.87 and 7.95, respectively) in comparison to juvenile sediments (Th/U ~ 3.09, Nb/U ~ 5.15) suggest extensive U and Rb losses on old cratons. Uranium and Rb losses are attributed to long-term leaching by rain and river water during exposure of the continental crust at the surface. Over geological times, the weathering effects create a slow but systematic increase of Th/U with exposure time.