58 resultados para classification aided by clustering
em Biblioteca Digital da Produção Intelectual da Universidade de São Paulo (BDPI/USP)
Resumo:
A rapid method for classification of mineral waters is proposed. The discrimination power was evaluated by a novel combination of chemometric data analysis and qualitative multi-elemental fingerprints of mineral water samples acquired from different regions of the Brazilian territory. The classification of mineral waters was assessed using only the wavelength emission intensities obtained by inductively coupled plasma optical emission spectrometry (ICP OES), monitoring different lines of Al, B, Ba, Ca, Cl, Cu, Co, Cr, Fe, K, Mg, Mn, Na, Ni, P, Pb, S, Sb, Si, Sr, Ti, V, and Zn, and Be, Dy, Gd, In, La, Sc and Y as internal standards. Data acquisition was done under robust (RC) and non-robust (NRC) conditions. Also, the combination of signal intensities of two or more emission lines for each element were evaluated instead of the individual lines. The performance of two classification-k-nearest neighbor (kNN) and soft independent modeling of class analogy (SIMCA)-and preprocessing algorithms, autoscaling and Pareto scaling, were evaluated for the ability to differentiate between the various samples in each approach tested (combination of robust or non-robust conditions with use of individual lines or sum of the intensities of emission lines). It was shown that qualitative ICP OES fingerprinting in combination with multivariate analysis is a promising analytical tool that has potential to become a recognized procedure for rapid authenticity and adulteration testing of mineral water samples or other material whose physicochemical properties (or origin) are directly related to mineral content.
Resumo:
Microarray gene expression profiling is a high-throughput system used to identify differentially expressed genes and regulation patterns, and to discover new tumor markers. As the molecular pathogenesis of meningiomas and schwannomas, characterized by NF2 gene alterations, remains unclear and suitable molecular targets need to be identified, we used low density cDNA microarrays to establish expression patterns of 96 cancer-related genes on 23 schwannomas, 42 meningiomas and 3 normal cerebral meninges. We also performed a mutational analysis of the NF2 gene (PCR, dHPLC, Sequencing and MLPA), a search for 22q LOH and an analysis of gene silencing by promoter hypermethylation (MS-MLPA). Results showed a high frequency of NF2 gene mutations (40%), increased 22q LOH as aggressiveness increased, frequent losses and gains by MLPA in benign meningiomas, and gene expression silencing by hypermethylation. Array analysis showed decreased expression of 7 genes in meningiomas. Unsupervised analyses identified 2 molecular subgroups for both meningiomas and schwannomas showing 38 and 20 differentially expressed genes, respectively, and 19 genes differentially expressed between the two tumor types. These findings provide a molecular subgroup classification for meningiomas and schwannomas with possible implications for clinical practice.
Resumo:
A chemotaxonomic analysis is described of a database containing various types of compounds from the Heliantheae tribe (Asteraceae) using Self-Organizing Maps (SOM). The numbers of occurrences of 9 chemical classes in different taxa of the tribe were used as variables. The study shows that SOM applied to chemical data can contribute to differentiate genera, subtribes, and groups of subtribes (subtribe branches), as well as to tribal and subtribal classifications of Heliantheae, exhibiting a high hit percentage comparable to that of an expert performance, and in agreement with the previous tribe classification proposed by Stuessy.
Resumo:
Studying joint noise is an important parameter for diagnosing temporomandibular dysfunction. In this study, eight groups (n=9) were formed according to joint dysfunction classification, provided by employing vibration analysis equipment. Parameters for analyzing joint noise were: total vibration energy, peak amplitude, and peak frequency. Mouth opening range was also analyzed. Statistical analysis results for each parameter were significant at 1 %. Each analyzed group presented different noise characteristics. This allowed for inclusion of the groups within a determined value category. The patient group with normal condyle/disk relationship always presented the lowest values. The type of joint noise was characterized by analyzing total integral noise, peak amplitude, peak frequency, and mouth opening. Analyzing joint noise using electrovibratography suggests the type of joint dysfunction and may help to establish a diagnosis, as well as a treatment plan.
Resumo:
Ozone and inhalable particulate matter are the major air pollutants in the Metropolitan Area of São Paulo, Brazil, a region that has more than 19 million inhabitants and approximately 7 million registered vehicles. Proximity of roadways, adjacent land use, and local circulation are just some of the factors that can affect the results of monitoring of pollutant concentrations. The so-called weekend effect (higher ozone concentrations on weekends than on weekdays) might be related to the fact that concentrations of ozone precursors, such as nitrogen oxides (NOx) and Non Methane-Hydrocarbon (NMHC), are relatively lower on weekends. This phenomenon has been reported in some areas of the United States since the 1970s. The differences between the concentrations of ozone in period of weekend and weekday, were obtained from analysis of data hourly average of CETESB for 2004, studied the precursors to the formation of troposphere ozone, the meteorological variables and traffic profile for RMSP. Because of the proximity to sources of emissions from the station Pinheiros showed higher concentrations of NO and NO² and greater variations to the periods weekend and weekday. With fewer vehicles circulating during the weekend, and consequently less emission of pollutants, it has cleaner air and less concentration of NO and NO², there is the ideal setting to the formation of troposphere ozone, despite the lower concentration of NO². The proximity with the source emissions, aided by the increased availability of solar radiation and the presence of ozone precursors, were factors conditions for the occurrence of weekend effect.
Resumo:
Geographic Data Warehouses (GDW) are one of the main technologies used in decision-making processes and spatial analysis, and the literature proposes several conceptual and logical data models for GDW. However, little effort has been focused on studying how spatial data redundancy affects SOLAP (Spatial On-Line Analytical Processing) query performance over GDW. In this paper, we investigate this issue. Firstly, we compare redundant and non-redundant GDW schemas and conclude that redundancy is related to high performance losses. We also analyze the issue of indexing, aiming at improving SOLAP query performance on a redundant GDW. Comparisons of the SB-index approach, the star-join aided by R-tree and the star-join aided by GiST indicate that the SB-index significantly improves the elapsed time in query processing from 25% up to 99% with regard to SOLAP queries defined over the spatial predicates of intersection, enclosure and containment and applied to roll-up and drill-down operations. We also investigate the impact of the increase in data volume on the performance. The increase did not impair the performance of the SB-index, which highly improved the elapsed time in query processing. Performance tests also show that the SB-index is far more compact than the star-join, requiring only a small fraction of at most 0.20% of the volume. Moreover, we propose a specific enhancement of the SB-index to deal with spatial data redundancy. This enhancement improved performance from 80 to 91% for redundant GDW schemas.
Resumo:
A água como tema no contexto educacional é abordada a partir de diversas perspectivas. Diante das discussões em relação à crise socioambiental atual, acreditamos que a educação para a água deva ser realizada a partir da abordagem das dimensões espacial e temporal, considerando nesta última o tempo geológico e a história humana, sem a qual não é possível enfrentar a fragmentação do conhecimento que predomina no ambiente escolar. A abordagem do local, tendo como unidade de estudo a bacia hidrográfica, auxiliada pelos conteúdos das geociências e por metodologias interdisciplinares, proporciona uma visão integrada e contextualizada do tema para a construção do conhecimento.
Resumo:
Objective: To describe and compare foot anthropometry in healthy and diabetic subjects using Medial Longitudinal Arch (MLA) classificatory indexes: Arch Index (AI), Chippaux-Smirak Index (CSI) and (A) over cap Angle ((A) over cap), as well as to compare the classification of these methods in each group. Materials and Methods: Control Group (CG) composed by 21 healthy subjects and Diabetic Group (DG), with 46 diabetic neuropathy subjects. The indexes were calculated from footprints. Results: A larger proportion of flat feet was seen in DG for the three indexes (At: 32,2%, CSI: 59,7%, A: 17,5%), while highly arched feet acted oppositely. The groups were statistically different for the proportion of flat feet in (A) over cap (p=0,0080) and CSI (p=0,0000) and high feet in A (p=0,0036). There were significant differences when compared GC and GD in the three indexes: IA (p 0,0027), CSI (p=0,0064), (A) over cap (p=0,0296). Conclusion: Data showed motor and orthopedic changes originated by peripheral neuropathy, which is responsible for foot changes, causing longitudinal arch crumbling. It was seen that A Angle strongly disagreed when compared with the arch classification made by the other two indexes and therefore, its application needs care.
Resumo:
Carios mimon is an argasid tick common on Chiroptera, originally described from larvae collected on bats Mimon crenulatum from Bolivia and Eptesicus brasiliensis from Uruguay. Later it was also registered from Argentina and recently included among the Brazilian tick fauna. In Brazil, this species is very aggressive to man, resulting in intense inflammatory response and pain. It is known only by the larval description and its morphology resembles that from other species currently included into the genus Carios, formerly classified into the subgenus Alectorobius, genus Ornithodoros. Here we describe adults and redescribe the larva of C. mimon, based on light and scanning electron microscopy. Remarks about its morphological similarity with other species of this genus are also discussed. Molecular analysis inferred from a portion of the 16S rRNA mitochondrial gene placed C. mimon in a cluster supported by maximal bootstrap value (100%) with other argasid species (mostly bat parasites in the New World), which have been classified into either the genus Ornithodoros or Carios, depending on the Argasidae classification adopted by different authors.
Resumo:
Searching in a dataset for elements that are similar to a given query element is a core problem in applications that manage complex data, and has been aided by metric access methods (MAMs). A growing number of applications require indices that must be built faster and repeatedly, also providing faster response for similarity queries. The increase in the main memory capacity and its lowering costs also motivate using memory-based MAMs. In this paper. we propose the Onion-tree, a new and robust dynamic memory-based MAM that slices the metric space into disjoint subspaces to provide quick indexing of complex data. It introduces three major characteristics: (i) a partitioning method that controls the number of disjoint subspaces generated at each node; (ii) a replacement technique that can change the leaf node pivots in insertion operations; and (iii) range and k-NN extended query algorithms to support the new partitioning method, including a new visit order of the subspaces in k-NN queries. Performance tests with both real-world and synthetic datasets showed that the Onion-tree is very compact. Comparisons of the Onion-tree with the MM-tree and a memory-based version of the Slim-tree showed that the Onion-tree was always faster to build the index. The experiments also showed that the Onion-tree significantly improved range and k-NN query processing performance and was the most efficient MAM, followed by the MM-tree, which in turn outperformed the Slim-tree in almost all the tests. (C) 2010 Elsevier B.V. All rights reserved.
Resumo:
Here we present the results of magneto resistance measurements in tilted magnetic field and compare them with calculations. The comparison between calculated and measured spectra for the case of perpendicular fields enable us to estimate the dependence of the valley splitting as a function of the magnetic field and the total Lande g-factor (which is assumed to be independent of the magnetic field). Since both the exchange contribution to the Zeeman splitting as well as the valley splitting are properties associated with the 2D quantum confinement, they depend only on the perpendicular component of the magnetic field, while the bare Zeeman splitting depends on the total magnetic field. This information aided by the comparison between experimental and calculated gray scale maps permits to obtain separately the values of the exchange and the bare contribution to the g-factor.
Resumo:
Back-scattered imaging, X-ray element mapping and electron microprobe analyzer (EMPA) chemical dating reveal complex compositional and age zoning in monazite crystals from different layers and textural positions in a garnet-bearing migmatite in SE Brazil. Y-rich (variable Y(2)O(3), averaging 2.5 wt.%) relict cores are preserved in mesosome and melanosome monazite, and correspond to 793 +/- 6 Ma inherited crystals possibly generated in a previous metamorphic event. These cores are overgrown and widely replaced by two generations of monazite, which are present in all migmatite layers. The first, also Y-rich (average 2.5 wt.% Y(2)O(3)), was produced at similar to 635 Ma during prograde metamorphism under subsolidus conditions, while the second has an Y-poor (<1.5 wt.% Y(2)O(3)), low Th/U signature, and precipitated from low Y and HREE anatectic melts produced by reactions in which garnet was inert. Quartz-rich trondhjemitic leucosome represents lower temperature melt (bearing some subsolidus quartz and garnet with included monazite) formed at temperatures below muscovite breakdown; its Y-poor monazite indicates an age of 617 +/- 6 Ma. Granitic leucosomes formed close to peak metamorphic conditions (T>750 degrees C) above muscovite breakdown have their slightly younger character confirmed by a 609 +/- 7 Ma low-Y monazite age. A similar 606 +/- 5 Ma age was obtained for low-Y monazite rims and domains in mesosome and melanosome, and reflects the time of monazite saturation in interstitial granitic melt that was trapped in these layers. Our results confirm that inherited monazite crystals can be preserved during partial melting at temperatures above muscovite breakdown. Moreover, careful textural control aided by X-ray chemical mapping may allow monazite generated at different stages in a similar to 25 Myr prograde metamorphic path to be identified and dated using an electron microprobe. (C) 2008 Elsevier B.V. All rights reserved.
Resumo:
Efficient automatic protein classification is of central importance in genomic annotation. As an independent way to check the reliability of the classification, we propose a statistical approach to test if two sets of protein domain sequences coming from two families of the Pfam database are significantly different. We model protein sequences as realizations of Variable Length Markov Chains (VLMC) and we use the context trees as a signature of each protein family. Our approach is based on a Kolmogorov-Smirnov-type goodness-of-fit test proposed by Balding et at. [Limit theorems for sequences of random trees (2008), DOI: 10.1007/s11749-008-0092-z]. The test statistic is a supremum over the space of trees of a function of the two samples; its computation grows, in principle, exponentially fast with the maximal number of nodes of the potential trees. We show how to transform this problem into a max-flow over a related graph which can be solved using a Ford-Fulkerson algorithm in polynomial time on that number. We apply the test to 10 randomly chosen protein domain families from the seed of Pfam-A database (high quality, manually curated families). The test shows that the distributions of context trees coming from different families are significantly different. We emphasize that this is a novel mathematical approach to validate the automatic clustering of sequences in any context. We also study the performance of the test via simulations on Galton-Watson related processes.
Resumo:
The properties of recycled aggregate produced from mixed (masonry and concrete) construction and demolition (C&D) waste are highly variable, and this restricts the use of such aggregate in structural concrete production. The development of classification techniques capable of reducing this variability is instrumental for quality control purposes and the production of high quality C&D aggregate. This paper investigates how the classification of C&D mixed coarse aggregate according to porosity influences the mechanical performance of concrete. Concretes using a variety of C&D aggregate porosity classes and different water/cement ratios were produced and the mechanical properties measured. For concretes produced with constant volume fractions of water, cement, natural sand and coarse aggregate from recycled mixed C&D waste, the compressive strength and Young modulus are direct exponential functions of the aggregate porosity. Sink and float technique is a simple laboratory density separation tool that facilitates the separation of cement particles with lower porosity, a difficult task when done only by visual sorting. For this experiment, separation using a 2.2 kg/dmA(3) suspension produced recycled aggregate (porosity less than 17%) which yielded good performance in concrete production. Industrial gravity separators may lead to the production of high quality recycled aggregate from mixed C&D waste for structural concrete applications.
Resumo:
Oxidative stress is a physiological condition that is associated with atherosclerosis. and it can be influenced by diet. Our objective was to group fifty-seven individuals with dyslipidaemia controlled by statins according to four oxidative biomarkers, and to evaluate the diet pattern and blood biochemistry differences between these groups. Blood samples were collected and the following parameters were evaluated: diet intake; plasma fatty acids; lipoprotein concentration; glucose; oxidised LDL (oxLDL); malondialdehyde (MDA): total antioxidant activity by 2,2-diphenyl-1-picrylhydrazyl (DPPH) and ferric reducing ability power assays. Individuals were separated into five groups by cluster analysis. All groups showed a difference with respect to at least one of the four oxidative stress biomarkers. The separation of individuals in the first axis was based upon their total antioxidant activity. Clusters located on the right side showed higher total antioxidant activity, higher myristic fatty acid and lower arachidonic fatty acid proportions than clusters located on the left side. A negative correlation was observed between DPPH and the peroxidability index. The second axis showed differences in oxidation status as measured by MDA and oxLDL concentrations. Clusters located on the Upper side showed higher oxidative status and lower HDL cholesterol concentration than clusters located on the lower side. There were no differences in diet among the five clusters. Therefore, fatty acid synthesis and HDL cholesterol concentration seem to exert a more significant effect on the oxidative conditions of the individuals with dyslipidaemia controlled by statins than does their food intake.