924 resultados para Automatic Analysis of Multivariate Categorical Data Sets
Resumo:
OBJECTIVE: To estimate the spatial intensity of urban violence events using wavelet-based methods and emergency room data. METHODS: Information on victims attended at the emergency room of a public hospital in the city of São Paulo, Southeastern Brazil, from January 1, 2002 to January 11, 2003 were obtained from hospital records. The spatial distribution of 3,540 events was recorded and a uniform random procedure was used to allocate records with incomplete addresses. Point processes and wavelet analysis technique were used to estimate the spatial intensity, defined as the expected number of events by unit area. RESULTS: Of all georeferenced points, 59% were accidents and 40% were assaults. There is a non-homogeneous spatial distribution of the events with high concentration in two districts and three large avenues in the southern area of the city of São Paulo. CONCLUSIONS: Hospital records combined with methodological tools to estimate intensity of events are useful to study urban violence. The wavelet analysis is useful in the computation of the expected number of events and their respective confidence bands for any sub-region and, consequently, in the specification of risk estimates that could be used in decision-making processes for public policies.
Resumo:
Background: Community and clinical data have suggested there is an association between trauma exposure and suicidal behavior (i.e., suicide ideation, plans and attempts). However, few studies have assessed which traumas are uniquely predictive of: the first onset of suicidal behavior, the progression from suicide ideation to plans and attempts, or the persistence of each form of suicidal behavior over time. Moreover, few data are available on such associations in developing countries. The current study addresses each of these issues. Methodology/Principal Findings: Data on trauma exposure and subsequent first onset of suicidal behavior were collected via structured interviews conducted in the households of 102,245 (age 18+) respondents from 21 countries participating in the WHO World Mental Health Surveys. Bivariate and multivariate survival models tested the relationship between the type and number of traumatic events and subsequent suicidal behavior. A range of traumatic events are associated with suicidal behavior, with sexual and interpersonal violence consistently showing the strongest effects. There is a dose-response relationship between the number of traumatic events and suicide ideation/attempt; however, there is decay in the strength of the association with more events. Although a range of traumatic events are associated with the onset of suicide ideation, fewer events predict which people with suicide ideation progress to suicide plan and attempt, or the persistence of suicidal behavior over time. Associations generally are consistent across high-, middle-, and low-income countries. Conclusions/Significance: This study provides more detailed information than previously available on the relationship between traumatic events and suicidal behavior and indicates that this association is fairly consistent across developed and developing countries. These data reinforce the importance of psychological trauma as a major public health problem, and highlight the significance of screening for the presence and accumulation of traumatic exposures as a risk factor for suicide ideation and attempt.
Resumo:
Background: The inherent complexity of statistical methods and clinical phenomena compel researchers with diverse domains of expertise to work in interdisciplinary teams, where none of them have a complete knowledge in their counterpart's field. As a result, knowledge exchange may often be characterized by miscommunication leading to misinterpretation, ultimately resulting in errors in research and even clinical practice. Though communication has a central role in interdisciplinary collaboration and since miscommunication can have a negative impact on research processes, to the best of our knowledge, no study has yet explored how data analysis specialists and clinical researchers communicate over time. Methods/Principal Findings: We conducted qualitative analysis of encounters between clinical researchers and data analysis specialists (epidemiologist, clinical epidemiologist, and data mining specialist). These encounters were recorded and systematically analyzed using a grounded theory methodology for extraction of emerging themes, followed by data triangulation and analysis of negative cases for validation. A policy analysis was then performed using a system dynamics methodology looking for potential interventions to improve this process. Four major emerging themes were found. Definitions using lay language were frequently employed as a way to bridge the language gap between the specialties. Thought experiments presented a series of ""what if'' situations that helped clarify how the method or information from the other field would behave, if exposed to alternative situations, ultimately aiding in explaining their main objective. Metaphors and analogies were used to translate concepts across fields, from the unfamiliar to the familiar. Prolepsis was used to anticipate study outcomes, thus helping specialists understand the current context based on an understanding of their final goal. Conclusion/Significance: The communication between clinical researchers and data analysis specialists presents multiple challenges that can lead to errors.
Resumo:
Background: High-density tiling arrays and new sequencing technologies are generating rapidly increasing volumes of transcriptome and protein-DNA interaction data. Visualization and exploration of this data is critical to understanding the regulatory logic encoded in the genome by which the cell dynamically affects its physiology and interacts with its environment. Results: The Gaggle Genome Browser is a cross-platform desktop program for interactively visualizing high-throughput data in the context of the genome. Important features include dynamic panning and zooming, keyword search and open interoperability through the Gaggle framework. Users may bookmark locations on the genome with descriptive annotations and share these bookmarks with other users. The program handles large sets of user-generated data using an in-process database and leverages the facilities of SQL and the R environment for importing and manipulating data. A key aspect of the Gaggle Genome Browser is interoperability. By connecting to the Gaggle framework, the genome browser joins a suite of interconnected bioinformatics tools for analysis and visualization with connectivity to major public repositories of sequences, interactions and pathways. To this flexible environment for exploring and combining data, the Gaggle Genome Browser adds the ability to visualize diverse types of data in relation to its coordinates on the genome. Conclusions: Genomic coordinates function as a common key by which disparate biological data types can be related to one another. In the Gaggle Genome Browser, heterogeneous data are joined by their location on the genome to create information-rich visualizations yielding insight into genome organization, transcription and its regulation and, ultimately, a better understanding of the mechanisms that enable the cell to dynamically respond to its environment.
Resumo:
Using the published KTeV samples of K(L) -> pi(+/-)e(-/+)nu and K(L) -> pi(+/-)mu(-/+)nu decays, we perform a reanalysis of the scalar and vector form factors based on the dispersive parametrization. We obtain phase-space integrals I(K)(e) = 0.15446 +/- 0.00025 and I(K)(mu) = 0.10219 +/- 0.00025. For the scalar form factor parametrization, the only free parameter is the normalized form factor value at the Callan-Treiman point (C); our best-fit results in InC = 0.1915 +/- 0.0122. We also study the sensitivity of C to different parametrizations of the vector form factor. The results for the phase-space integrals and C are then used to make tests of the standard model. Finally, we compare our results with lattice QCD calculations of F(K)/F(pi) and f(+)(0).
Resumo:
Chlorocatechol 1,2-dioxygenase from the Gram-negative bacterium Pseudomonas putida (Pp 1,2-CCD) is considered to be an important biotechnological tool owing to its ability to process a broad spectrum of organic pollutants. In the current work, the crystallization, crystallographic characterization and phasing of the recombinant Pp 1,2-CCD enzyme are described. Reddish-brown crystals were obtained in the presence of polyethylene glycol and magnesium acetate by utilizing the vapour-diffusion technique in sitting drops. Crystal dehydration was the key step in obtaining data sets, which were collected on the D03B-MX2 beamline at the CNPEM/MCT - LNLS using a MAR CCD detector. Pp 1,2-CCD crystals belonged to space group P6(1)22 and the crystallographic structure of Pp 1,2-CCD has been solved by the MR-SAD technique using Fe atoms as scattering centres and the coordinates of 3-chlorocatechol 1,2-dioxygenase from Rhodococcus opacus (PDB entry
Resumo:
Artificial neural networks have been used to analyze a number of engineering problems, including settlement caused by different tunneling methods in various types of ground mass. This paper focuses on settlement over shotcrete- supported tunnels on Sao Paulo subway line 2 (West Extension) that were excavated in Tertiary sediments using the sequential excavation method. The adjusted network is a good tool for predicting settlement above new tunnels to be excavated in similar conditions. The influence of network training parameters on the quality of results is also discussed. (C) 2007 Elsevier Ltd. All rights reserved.
Resumo:
This paper aims to find relations between the socioeconomic characteristics, activity participation, land use patterns and travel behavior of the residents in the Sao Paulo Metropolitan Area (SPMA) by using Exploratory Multivariate Data Analysis (EMDA) techniques. The variables influencing travel pattern choices are investigated using: (a) Cluster Analysis (CA), grouping and characterizing the Traffic Zones (17), proposing the independent variable called Origin Cluster and, (b) Decision Tree (DT) to find a priori unknown relations among socioeconomic characteristics, land use attributes of the origin TZ and destination choices. The analysis was based on the origin-destination home-interview survey carried out in SPMA in 1997. The DT application revealed the variables of greatest influence on the travel pattern choice. The most important independent variable considered by DT is car ownership, followed by the Use of Transportation ""credits"" for Transit tariff, and, finally, activity participation variables and Origin Cluster. With these results, it was possible to analyze the influence of a family income, car ownership, position of the individual in the family, use of transportation ""credits"" for transit tariff (mainly for travel mode sequence choice), activities participation (activity sequence choice) and Origin Cluster (destination/travel distance choice). (c) 2010 Elsevier Ltd. All rights reserved.
Resumo:
This study aimed to examine the sensory characteristics of the grains of 21 cultivars of Coffea arabica L. and Coffea canephora Pierre from the essays of genetic improvement of EPAMIG, located in Patrocinio Municipality, Minas Gerais State, where they were collected through cloths stripping method and washed. Subsequently to dry (11 to 12% moisture b.u.), we obtained the coffee designated as natural. The evaluated varieties were: Acaia Cerrado MG 1474; Bourbon Vermelho DATERRA; Catigua MG 1; Catigua MG 2; Catual Amarelo IAC 62; Catuai Vermelho IAC 15; H 419-3-1-4-2; H 419-6-2 -5-2; H 419-6-2-5-3; H 419-6-2-7-3 Vermelho; H 493-1-2-10; H 514-7-10-1 Vermelho; H 514-7-10-6; H 515-4-2-2; H 518-3-6-1; Icatu Amarelo IAC 3282; Mundo Novo 379-19; Mundo Novo TAO 376-4; Rubi MG 1192; Sacramento MG 1 and Topazio MG 1190, from 2005/2006 and 2006/2007 seasons. The cultivars according to the first principal component with notes above 80 points, regarded as superior drink according to attributes with the highest scores (flavor, sweetness, balance, acidity, clean drink, and aspect) were: Catigua MG2, Rubi MG 1192, 514-7-10-6 H, H 419-3-1-4-2, H 419-6-2-5-2, 493-1-2-10 H, H 514-7-10-1 Vermelho, Catigua MG1, Sacramento MG1, 419-6-2-5-3 H, H 515-9-2-2 and Catuai Amarelo IAC 62.
Resumo:
Regression analyses of a long series of light-trap catches at Narrabri, Australia, were used to describe the seasonal dynamics of Helicoverpa armigera (Hubner). The size of the second generation was significantly related to the size of the first generation, to winter rainfall, which had a positive effect, and to spring rainfall which had a negative effect. These variables accounted for up to 96% of the variation in size of the second generation from year to year. Rainfall and crop hosts were also important for the size of the third generation. The area and tonnage of many potential host crops were significantly correlated with winter rain. When winter rain was omitted from the analysis, the sizes of both the second and third generations could be expressed as a function of the size of the previous generation and of the areas planted to lucerne, sorghum and maize. Lucerne and maize always had positive coefficients and sorghum a negative one. We extended our analysis to catches of H. punctigera (Wallengren), which declines in abundance after the second generation. Winter rain had a positive effect on the sizes of the second and third generations, and rain in spring or early summer had a negative effect. Only the area grown to lucerne had a positive effect on abundance. Forecasts of pest levels from a few months to a few weeks in advance are discussed, along with the improved understanding of the seasonal dynamics of both species and the significance of crops in the management of insecticide resistance for H. armigera.
Resumo:
Dual-energy X-ray absorptiometry (DXA) is a widely used method for measuring bone mineral in the growing skeleton. Because scan analysis in children offers a number of challenges, we compared DXA results using six analysis methods at the total proximal femur (PF) and five methods at the femoral neck (FN), In total we assessed 50 scans (25 boys, 25 girls) from two separate studies for cross-sectional differences in bone area, bone mineral content (BMC), and areal bone mineral density (aBMD) and for percentage change over the short term (8 months) and long term (7 years). At the proximal femur for the short-term longitudinal analysis, there was an approximate 3.5% greater change in bone area and BMC when the global region of interest (ROI) was allowed to increase in size between years as compared with when the global ROI was held constant. Trend analysis showed a significant (p < 0.05) difference between scan analysis methods for bone area and BMC across 7 years. At the femoral neck, cross-sectional analysis using a narrower (from default) ROI, without change in location, resulted in a 12.9 and 12.6% smaller bone area and BMC, respectively (both p < 0.001), Changes in FN area and BMC over 8 months were significantly greater (2.3 %, p < 0.05) using a narrower FN rather than the default ROI, Similarly, the 7-year longitudinal data revealed that differences between scan analysis methods were greatest when the narrower FN ROI was maintained across all years (p < 0.001), For aBMD there were no significant differences in group means between analysis methods at either the PF or FN, Our findings show the need to standardize the analysis of proximal femur DXA scans in growing children.
Resumo:
The stock market suffers uncertain relations throughout the entire negotiation process, with different variables exerting direct and indirect influence on stock prices. This study focuses on the analysis of certain aspects that may influence these values offered by the capital market, based on the Brazil Index of the Sao Paulo Stock Exchange (Bovespa), which selects 100 stocks among the most traded on Bovespa in terms of number of trades and financial volume. The selected variables are characterized by the companies` activity area and the business volume in the month of data collection, i.e. April/2007. This article proposes an analysis that joins the accounting view of the stock price variables that can be influenced with the use of multivariate qualitative data analysis. Data were explored through Correspondence Analysis (Anacor) and Homogeneity Analysis (Homals). According to the research, the selected variables are associated with the values presented by the stocks, which become an internal control instrument and a decision-making tool when it comes to choosing investments.
Resumo:
The success of plant reproduction depends on pollen-pistil interactions occurring at the stigma/style. These interactions vary depending on the stigma type: wet or dry. Tobacco (Nicotiana tabacum) represents a model of wet stigma, and its stigmas/styles express genes to accomplish the appropriate functions. For a large-scale study of gene expression during tobacco pistil development and preparation for pollination, we generated 11,216 high-quality expressed sequence tags (ESTs) from stigmas/styles and created the TOBEST database. These ESTs were assembled in 6,177 clusters, from which 52.1% are pistil transcripts/genes of unknown function. The 21 clusters with the highest number of ESTs (putative higher expression levels) correspond to genes associated with defense mechanisms or pollen-pistil interactions. The database analysis unraveled tobacco sequences homologous to the Arabidopsis (Arabidopsis thaliana) genes involved in specifying pistil identity or determining normal pistil morphology and function. Additionally, 782 independent clusters were examined by macroarray, revealing 46 stigma/style preferentially expressed genes. Real-time reverse transcription-polymerase chain reaction experiments validated the pistil-preferential expression for nine out of 10 genes tested. A search for these 46 genes in the Arabidopsis pistil data sets demonstrated that only 11 sequences, with putative equivalent molecular functions, are expressed in this dry stigma species. The reverse search for the Arabidopsis pistil genes in the TOBEST exposed a partial overlap between these dry and wet stigma transcriptomes. The TOBEST represents the most extensive survey of gene expression in the stigmas/styles of wet stigma plants, and our results indicate that wet and dry stigmas/styles express common as well as distinct genes in preparation for the pollination process.
Resumo:
Oral squamous cell carcinoma (OSCC) accounts for more than 95% of all malignant neoplasms in the oral cavity. Although several studies have shown the epidemiology of this cancer in Brazil, there do not seem to be any studies that describe the prognostic factors related to OSCC in the Amazon region. Therefore, the aim of this study was to determine the survival rate and prognostic significance of different factors in patients from this region affected by OSCC. Data from 85 patients with histologically confirmed squamous cell carcinoma of the tongue and floor of the mouth identified from the Ofir Loyola Hospital archives were collected and analyzed using univariate (log-rank test) and multivariate (Cox proportional hazard model) tests. The overall 5-year survival rate was found to be 27%. Univariate analysis showed that the 5-year survival rate was significantly higher for younger (<= 45 y) female patients, patients with T1-2 tumors and clinically clear neck nodes (N0), patients with early stage cancers (AJCC stage I-II), and patients treated with surgical procedures. However, multivariate analysis showed that the 5-year survival rate was significantly higher only in the younger patients and those who underwent surgical treatment. The age of the patient at the moment of diagnosis and treatment with surgical procedures were the only independent prognostic factors that affected the 5-year survival rate of the patients in this region.