31 resultados para Automatic Analysis of Multivariate Categorical Data Sets
Resumo:
This paper aims to find relations between the socioeconomic characteristics, activity participation, land use patterns and travel behavior of the residents in the Sao Paulo Metropolitan Area (SPMA) by using Exploratory Multivariate Data Analysis (EMDA) techniques. The variables influencing travel pattern choices are investigated using: (a) Cluster Analysis (CA), grouping and characterizing the Traffic Zones (17), proposing the independent variable called Origin Cluster and, (b) Decision Tree (DT) to find a priori unknown relations among socioeconomic characteristics, land use attributes of the origin TZ and destination choices. The analysis was based on the origin-destination home-interview survey carried out in SPMA in 1997. The DT application revealed the variables of greatest influence on the travel pattern choice. The most important independent variable considered by DT is car ownership, followed by the Use of Transportation ""credits"" for Transit tariff, and, finally, activity participation variables and Origin Cluster. With these results, it was possible to analyze the influence of a family income, car ownership, position of the individual in the family, use of transportation ""credits"" for transit tariff (mainly for travel mode sequence choice), activities participation (activity sequence choice) and Origin Cluster (destination/travel distance choice). (c) 2010 Elsevier Ltd. All rights reserved.
Resumo:
This study aimed to examine the sensory characteristics of the grains of 21 cultivars of Coffea arabica L. and Coffea canephora Pierre from the essays of genetic improvement of EPAMIG, located in Patrocinio Municipality, Minas Gerais State, where they were collected through cloths stripping method and washed. Subsequently to dry (11 to 12% moisture b.u.), we obtained the coffee designated as natural. The evaluated varieties were: Acaia Cerrado MG 1474; Bourbon Vermelho DATERRA; Catigua MG 1; Catigua MG 2; Catual Amarelo IAC 62; Catuai Vermelho IAC 15; H 419-3-1-4-2; H 419-6-2 -5-2; H 419-6-2-5-3; H 419-6-2-7-3 Vermelho; H 493-1-2-10; H 514-7-10-1 Vermelho; H 514-7-10-6; H 515-4-2-2; H 518-3-6-1; Icatu Amarelo IAC 3282; Mundo Novo 379-19; Mundo Novo TAO 376-4; Rubi MG 1192; Sacramento MG 1 and Topazio MG 1190, from 2005/2006 and 2006/2007 seasons. The cultivars according to the first principal component with notes above 80 points, regarded as superior drink according to attributes with the highest scores (flavor, sweetness, balance, acidity, clean drink, and aspect) were: Catigua MG2, Rubi MG 1192, 514-7-10-6 H, H 419-3-1-4-2, H 419-6-2-5-2, 493-1-2-10 H, H 514-7-10-1 Vermelho, Catigua MG1, Sacramento MG1, 419-6-2-5-3 H, H 515-9-2-2 and Catuai Amarelo IAC 62.
Resumo:
The stock market suffers uncertain relations throughout the entire negotiation process, with different variables exerting direct and indirect influence on stock prices. This study focuses on the analysis of certain aspects that may influence these values offered by the capital market, based on the Brazil Index of the Sao Paulo Stock Exchange (Bovespa), which selects 100 stocks among the most traded on Bovespa in terms of number of trades and financial volume. The selected variables are characterized by the companies` activity area and the business volume in the month of data collection, i.e. April/2007. This article proposes an analysis that joins the accounting view of the stock price variables that can be influenced with the use of multivariate qualitative data analysis. Data were explored through Correspondence Analysis (Anacor) and Homogeneity Analysis (Homals). According to the research, the selected variables are associated with the values presented by the stocks, which become an internal control instrument and a decision-making tool when it comes to choosing investments.
Resumo:
The success of plant reproduction depends on pollen-pistil interactions occurring at the stigma/style. These interactions vary depending on the stigma type: wet or dry. Tobacco (Nicotiana tabacum) represents a model of wet stigma, and its stigmas/styles express genes to accomplish the appropriate functions. For a large-scale study of gene expression during tobacco pistil development and preparation for pollination, we generated 11,216 high-quality expressed sequence tags (ESTs) from stigmas/styles and created the TOBEST database. These ESTs were assembled in 6,177 clusters, from which 52.1% are pistil transcripts/genes of unknown function. The 21 clusters with the highest number of ESTs (putative higher expression levels) correspond to genes associated with defense mechanisms or pollen-pistil interactions. The database analysis unraveled tobacco sequences homologous to the Arabidopsis (Arabidopsis thaliana) genes involved in specifying pistil identity or determining normal pistil morphology and function. Additionally, 782 independent clusters were examined by macroarray, revealing 46 stigma/style preferentially expressed genes. Real-time reverse transcription-polymerase chain reaction experiments validated the pistil-preferential expression for nine out of 10 genes tested. A search for these 46 genes in the Arabidopsis pistil data sets demonstrated that only 11 sequences, with putative equivalent molecular functions, are expressed in this dry stigma species. The reverse search for the Arabidopsis pistil genes in the TOBEST exposed a partial overlap between these dry and wet stigma transcriptomes. The TOBEST represents the most extensive survey of gene expression in the stigmas/styles of wet stigma plants, and our results indicate that wet and dry stigmas/styles express common as well as distinct genes in preparation for the pollination process.
Resumo:
Oral squamous cell carcinoma (OSCC) accounts for more than 95% of all malignant neoplasms in the oral cavity. Although several studies have shown the epidemiology of this cancer in Brazil, there do not seem to be any studies that describe the prognostic factors related to OSCC in the Amazon region. Therefore, the aim of this study was to determine the survival rate and prognostic significance of different factors in patients from this region affected by OSCC. Data from 85 patients with histologically confirmed squamous cell carcinoma of the tongue and floor of the mouth identified from the Ofir Loyola Hospital archives were collected and analyzed using univariate (log-rank test) and multivariate (Cox proportional hazard model) tests. The overall 5-year survival rate was found to be 27%. Univariate analysis showed that the 5-year survival rate was significantly higher for younger (<= 45 y) female patients, patients with T1-2 tumors and clinically clear neck nodes (N0), patients with early stage cancers (AJCC stage I-II), and patients treated with surgical procedures. However, multivariate analysis showed that the 5-year survival rate was significantly higher only in the younger patients and those who underwent surgical treatment. The age of the patient at the moment of diagnosis and treatment with surgical procedures were the only independent prognostic factors that affected the 5-year survival rate of the patients in this region.
Resumo:
Chemotherapy-induced oral mucositis is a frequent therapeutic challenge in cancer patients. The purpose of this retrospective study was to estimate the prevalence and risk factors of oral mucositis in 169 acute lymphoblastic leukaemia (ALL) patients treated according to different chemotherapeutic trials at the Darcy Vargas Children`s Hospital from 1994 to 2005. Demographic data, clinical history, chemotherapeutic treatment and patients` follow-up were recorded. The association of oral mucositis with age, gender, leucocyte counts at diagnosis and treatment was assessed by the chi-squared test and multivariate regression analysis. Seventy-seven ALL patients (46%) developed oral mucositis during the treatment. Patient age (P = 0.33), gender (P = 0.08) and leucocyte counts at diagnosis (P = 0.34) showed no correlation with the occurrence of oral mucositis. Multivariate regression analysis showed a significant risk for oral mucositis (P = 0.009) for ALL patients treated according to the ALL-BFM-95 protocol. These results strongly suggest the greater stomatotoxic effect of the ALL-BFM-95 trial when compared with Brazilian trials. We concluded that chemotherapy-induced oral mucositis should be systematically analysed prospectively in specialized centres for ALL treatment to establish the degree of toxicity of chemotherapeutic drugs and to improve the quality of life of patients based on more effective therapeutic and prophylactic approaches for prevention of its occurrence. Oral Diseases (2008) 14, 761-766
An improved estimate of leaf area index based on the histogram analysis of hemispherical photographs
Resumo:
Leaf area index (LAI) is a key parameter that affects the surface fluxes of energy, mass, and momentum over vegetated lands, but observational measurements are scarce, especially in remote areas with complex canopy structure. In this paper we present an indirect method to calculate the LAI based on the analyses of histograms of hemispherical photographs. The optimal threshold value (OTV), the gray-level required to separate the background (sky) and the foreground (leaves), was analytically calculated using the entropy crossover method (Sahoo, P.K., Slaaf, D.W., Albert, T.A., 1997. Threshold selection using a minimal histogram entropy difference. Optical Engineering 36(7) 1976-1981). The OTV was used to calculate the LAI using the well-known gap fraction method. This methodology was tested in two different ecosystems, including Amazon forest and pasturelands in Brazil. In general, the error between observed and calculated LAI was similar to 6%. The methodology presented is suitable for the calculation of LAI since it is responsive to sky conditions, automatic, easy to implement, faster than commercially available software, and requires less data storage. (C) 2008 Elsevier B.V. All rights reserved.
Resumo:
The South American (SA) rainy season is studied in this paper through the application of a multivariate Empirical Orthogonal Function (EOF) analysis to a SA gridded precipitation analysis and to the components of Lorenz Energy Cycle (LEC) derived from the National Centers for Environmental Prediction (NCEP) reanalysis. The EOF analysis leads to the identification of patterns of the rainy season and the associated mechanisms in terms of their energetics. The first combined EOF represents the northwest-southeast dipole of the precipitation between South and Central America, the South American Monsoon System (SAMS). The second combined EOF represents a synoptic pattern associated with the SACZ (South Atlantic convergence zone) and the third EOF is in spatial quadrature to the second EOF. The phase relationship of the EOFs, as computed from the principal components (PCs), suggests a nonlinear transition from the SACZ to the fully developed SAMS mode by November and between both components describing the SACZ by September-October (the rainy season onset). According to the LEC, the first mode is dominated by the eddy generation term at its maximum, the second by both baroclinic and eddy generation terms and the third by barotropic instability previous to the connection to the second mode by September-October. The predominance of the different LEC components at each phase of the SAMS can be used as an indicator of the onset of the rainy season in terms of physical processes, while the existence of the outstanding spectral peaks in the time dependence of the EOFs at the intraseasonal time scale could be used for monitoring purposes. Copyright (C) 2009 Royal Meteorological Society
Resumo:
Krameria plants are found in arid regions of the Americas and present a floral system that attracts oil-collecting bees. Niche modeling and multivariate tools were applied to examine ecological and geographical aspects of the 18 species of this genus, using occurrence data obtained from herbaria and literature. Niche modeling showed the potential areas of occurrence for each species and the analysis of climatic variables suggested that North American species occur mostly in deserted or xeric ecoregions with monthly precipitation below 140 mm and large temperature ranges. South American species are mainly found in deserted ecoregions and subtropical savannas where monthly precipitation often exceeds 150 mm and temperature ranges are smaller. Principal Component Analysis (PCA) performed with values of temperature and precipitation showed that the distribution limits of Krameria species are primarily associated with maximum and minimum temperatures. Modeling of Krameria species proved to be a useful tool for analyzing the influence of the ecological niche variables in the geographical distribution of species, providing new information to guide future investigations. (C) 2011 Elsevier Ltd. All rights reserved.
Resumo:
Background Recent studies indicate an increased frequency of mutations in the gene encoding glucocerebrosidase (GBA), a deficiency of which causes Gaucher`s disease, among patients with Parkinson`s disease. We aimed to ascertain the frequency of GBA mutations in an ethnically diverse group of patients with Parkinson`s disease. Methods Sixteen centers participated in our international, collaborative study: five from the Americas, six from Europe, two from Israel, and three from Asia. Each center genotyped a standard DNA panel to permit comparison of the genotyping results across centers. Genotypes and phenotypic data from a total of 5691 patients with Parkinson`s disease (780 Ashkenazi Jews) and 4898 controls (387 Ashkenazi Jews) were analyzed, with multivariate logistic-regression models and the Mantel-Haenszel procedure used to estimate odds ratios across centers. Results All 16 centers could detect two GBA mutations, L444P and N370S. Among Ashkenazi Jewish subjects, either mutation was found in 15% of patients and 3% of controls, and among non-Ashkenazi Jewish subjects, either mutation was found in 3% of patients and less than 1% of controls. GBA was fully sequenced for 1883 non-Ashkenazi Jewish patients, and mutations were identified in 7%, showing that limited mutation screening can miss half the mutant alleles. The odds ratio for any GBA mutation in patients versus controls was 5.43 across centers. As compared with patients who did not carry a GBA mutation, those with a GBA mutation presented earlier with the disease, were more likely to have affected relatives, and were more likely to have atypical clinical manifestations. Conclusions Data collected from 16 centers demonstrate that there is a strong association between GBA mutations and Parkinson`s disease.
Resumo:
Evolutionary novelties in the skeleton are usually expressed as changes in the timing of growth of features intrinsically integrated at different hierarchical levels of development(1). As a consequence, most of the shape- traits observed across species do vary quantitatively rather than qualitatively(2), in a multivariate space(3) and in a modularized way(4,5). Because most phylogenetic analyses normally use discrete, hypothetically independent characters(6), previous attempts have disregarded the phylogenetic signals potentially enclosed in the shape of morphological structures. When analysing low taxonomic levels, where most variation is quantitative in nature, solving basic requirements like the choice of characters and the capacity of using continuous, integrated traits is of crucial importance in recovering wider phylogenetic information. This is particularly relevant when analysing extinct lineages, where available data are limited to fossilized structures. Here we show that when continuous, multivariant and modularized characters are treated as such, cladistic analysis successfully solves relationships among main Homo taxa. Our attempt is based on a combination of cladistics, evolutionary- development- derived selection of characters, and geometric morphometrics methods. In contrast with previous cladistic analyses of hominid phylogeny, our method accounts for the quantitative nature of the traits, and respects their morphological integration patterns. Because complex phenotypes are observable across different taxonomic groups and are potentially informative about phylogenetic relationships, future analyses should point strongly to the incorporation of these types of trait.
Resumo:
The order Scorpiones is one of the most cytogenetically interesting groups within Arachnida by virtue of the combination of chromosome singularities found in the 59 species analyzed so far. In this work, mitotic and meiotic chromosomes of 2 species of the family Bothriuridae were detailed. This family occupies a basal position within the superfamily Scorpionoidea. Furthermore, review of the cytogenetic data of all previously studied scorpions is presented. Light microscopy chromosome analysis showed that Bothriurus araguayae and Bothriurus rochensis possess low diploid numbers compared with those of species belonging to closely related families. Gonadal cells examined under light and in transmission electron microscopy revealed, for the first time, that the Bothriuridae species possess typical monocentric chromosomes, and male meiosis presented chromosomes with synaptic and achiasmatic behavior. Moreover, in the sample of B. araguayae studied, heterozygous translocations were verified. The use of techniques to highlight specific chromosomal regions also revealed additional differences between the 2 Bothriurus species. The results herein recorded and the overview elaborated using the available cytogenetic information of Scorpiones elucidated current understanding regarding the processes of chromosome evolution that have occurred in Bothriuridae and in Scorpiones as a whole.
Resumo:
Most multidimensional projection techniques rely on distance (dissimilarity) information between data instances to embed high-dimensional data into a visual space. When data are endowed with Cartesian coordinates, an extra computational effort is necessary to compute the needed distances, making multidimensional projection prohibitive in applications dealing with interactivity and massive data. The novel multidimensional projection technique proposed in this work, called Part-Linear Multidimensional Projection (PLMP), has been tailored to handle multivariate data represented in Cartesian high-dimensional spaces, requiring only distance information between pairs of representative samples. This characteristic renders PLMP faster than previous methods when processing large data sets while still being competitive in terms of precision. Moreover, knowing the range of variation for data instances in the high-dimensional space, we can make PLMP a truly streaming data projection technique, a trait absent in previous methods.
Resumo:
In the present study, we propose a theoretical graph procedure to investigate multiple pathways in brain functional networks. By taking into account all the possible paths consisting of h links between the nodes pairs of the network, we measured the global network redundancy R (h) as the number of parallel paths and the global network permeability P (h) as the probability to get connected. We used this procedure to investigate the structural and dynamical changes in the cortical networks estimated from a dataset of high-resolution EEG signals in a group of spinal cord injured (SCI) patients during the attempt of foot movement. In the light of a statistical contrast with a healthy population, the permeability index P (h) of the SCI networks increased significantly (P < 0.01) in the Theta frequency band (3-6 Hz) for distances h ranging from 2 to 4. On the contrary, no significant differences were found between the two populations for the redundancy index R (h) . The most significant changes in the brain functional network of SCI patients occurred mainly in the lower spectral contents. These changes were related to an improved propagation of communication between the closest cortical areas rather than to a different level of redundancy. This evidence strengthens the hypothesis of the need for a higher functional interaction among the closest ROIs as a mechanism to compensate the lack of feedback from the peripheral nerves to the sensomotor areas.
Dynamic Changes in the Mental Rotation Network Revealed by Pattern Recognition Analysis of fMRI Data
Resumo:
We investigated the temporal dynamics and changes in connectivity in the mental rotation network through the application of spatio-temporal support vector machines (SVMs). The spatio-temporal SVM [Mourao-Miranda, J., Friston, K. J., et al. (2007). Dynamic discrimination analysis: A spatial-temporal SVM. Neuroimage, 36, 88-99] is a pattern recognition approach that is suitable for investigating dynamic changes in the brain network during a complex mental task. It does not require a model describing each component of the task and the precise shape of the BOLD impulse response. By defining a time window including a cognitive event, one can use spatio-temporal fMRI observations from two cognitive states to train the SVM. During the training, the SVM finds the discriminating pattern between the two states and produces a discriminating weight vector encompassing both voxels and time (i.e., spatio-temporal maps). We showed that by applying spatio-temporal SVM to an event-related mental rotation experiment, it is possible to discriminate between different degrees of angular disparity (0 degrees vs. 20 degrees, 0 degrees vs. 60 degrees, and 0 degrees vs. 100 degrees), and the discrimination accuracy is correlated with the difference in angular disparity between the conditions. For the comparison with highest accuracy (08 vs. 1008), we evaluated how the most discriminating areas (visual regions, parietal regions, supplementary, and premotor areas) change their behavior over time. The frontal premotor regions became highly discriminating earlier than the superior parietal cortex. There seems to be a parcellation of the parietal regions with an earlier discrimination of the inferior parietal lobe in the mental rotation in relation to the superior parietal. The SVM also identified a network of regions that had a decrease in BOLD responses during the 100 degrees condition in relation to the 0 degrees condition (posterior cingulate, frontal, and superior temporal gyrus). This network was also highly discriminating between the two conditions. In addition, we investigated changes in functional connectivity between the most discriminating areas identified by the spatio-temporal SVM. We observed an increase in functional connectivity between almost all areas activated during the 100 degrees condition (bilateral inferior and superior parietal lobe, bilateral premotor area, and SMA) but not between the areas that showed a decrease in BOLD response during the 100 degrees condition.