62 results for Variable-selection Problems
in Scielo Saúde Pública - SP
Abstract:
Geographical information systems (GIS) are tools that have been recently tested for improving our understanding of the spatial distribution of disease. The objective of this paper was to further develop the GIS technology to model and control schistosomiasis using environmental, social, biological and remote-sensing variables. A final regression model (R² = 0.39) was established, after a variable selection phase, with a set of spatial variables including the presence or absence of Biomphalaria glabrata, winter enhanced vegetation index, summer minimum temperature and percentage of houses with water coming from a spring or well. A regional model was also developed by splitting the state of Minas Gerais (MG) into four regions and establishing a linear regression model for each of the four regions: 1 (R² = 0.97), 2 (R² = 0.60), 3 (R² = 0.63) and 4 (R² = 0.76). Based on these models, a schistosomiasis risk map was built for MG. In this paper, geostatistics was also used to make inferences about the presence of Biomphalaria spp. The result was a map of species and risk areas. The obtained risk map permits the association of uncertainties, which can be used to qualify the inferences and it can be thought of as an auxiliary tool for public health strategies.
Abstract:
A genetic algorithm was used for variable selection in the simultaneous determination of glucose, maltose and fructose in mixtures by mid-infrared spectroscopy. Different models, using partial least squares (PLS) and multiple linear regression (MLR) with and without data pre-processing, were evaluated. The results showed that a simpler model (multiple linear regression with variable selection by genetic algorithm) produces results comparable to those of more complex methods (partial least squares). The relative error obtained for the best model was around 3% for the sugar determination, which is acceptable for this kind of determination.
Abstract:
A model based on chemical structure was developed for the accurate prediction of the octanol/water partition coefficient (K_OW) of polychlorinated biphenyls (PCBs), which are molecules of environmental interest. Partial least squares (PLS) was used to build the regression model. Topological indices were used as molecular descriptors. Variable selection was performed by Hierarchical Cluster Analysis (HCA). In the modeling process, the experimental K_OW values measured for 30 PCBs by thin-layer chromatography - retention time (TLC-RT) were used. The developed model (Q² = 0.990 and r² = 0.994) was used to estimate the log K_OW values for the 179 PCB congeners whose K_OW data have not yet been measured by the TLC-RT method. The results showed that topological indices can be very useful for predicting K_OW.
Abstract:
Calibration transfer has received considerable attention in the recent literature. Several standardization methods have been proposed for transferring calibration models between instruments. The goal of this paper is to present a general review of calibration transfer techniques. Basic concepts are reviewed, as well as the main advantages and drawbacks of each technique. A case study based on a set of 80 NIR spectra of maize samples recorded on two different instruments is used to illustrate the main calibration transfer techniques (direct standardization, piecewise direct standardization, orthogonal signal correction and robust variable selection).
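As an illustration of the first technique named above, direct standardization is commonly formulated as a least-squares mapping from spectra measured on the secondary ("slave") instrument to those measured on the primary ("master") instrument. The sketch below assumes that textbook formulation and uses synthetic data; the function name, array sizes and distortion matrix are hypothetical and are not taken from the paper's maize case study.

```python
import numpy as np

def direct_standardization(master_spectra, slave_spectra):
    """Estimate F so that slave_spectra @ F approximates master_spectra.

    Both inputs are (samples x wavelengths) arrays measured on the same
    transfer samples. The pseudoinverse gives the least-squares solution.
    """
    return np.linalg.pinv(slave_spectra) @ master_spectra

# Synthetic transfer set (hypothetical): 10 samples, 6 wavelengths.
rng = np.random.default_rng(0)
master = rng.normal(size=(10, 6))
# Assumed instrument difference: a mild linear distortion of the master spectra.
distortion = 0.9 * np.eye(6) + 0.05 * rng.normal(size=(6, 6))
slave = master @ distortion

F = direct_standardization(master, slave)
corrected = slave @ F  # slave spectra mapped into the master instrument's domain
```

With real spectra the number of wavelengths usually exceeds the number of transfer samples, so the mapping is underdetermined; that limitation is one motivation for the piecewise variant mentioned in the abstract.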
Abstract:
QSAR modeling is a novel computer program developed to generate and validate QSAR or QSPR (quantitative structure-activity or structure-property relationship) models. With QSAR modeling, users can build partial least squares (PLS) regression models, perform variable selection with the ordered predictors selection (OPS) algorithm, and validate models by using y-randomization and leave-N-out cross validation. An additional new feature is outlier detection carried out by simultaneous comparison of sample leverage with the respective Studentized residuals. The program was developed using Java version 6, and runs on any operating system that supports Java Runtime Environment version 6. The use of the program is illustrated. This program is available for download at lqta.iqm.unicamp.br.
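The leverage versus Studentized-residual comparison mentioned above can be sketched as follows. This is a minimal illustration that assumes an ordinary least-squares model with an intercept as a stand-in; the program itself works with PLS models, and its exact computation is not described in the abstract. The function name and the data are hypothetical.

```python
import numpy as np

def leverage_and_studentized_residuals(X, y):
    """Return (leverage, internally Studentized residuals) for an OLS fit."""
    X1 = np.column_stack([np.ones(len(X)), X])   # add an intercept column
    hat = X1 @ np.linalg.pinv(X1)                # hat (projection) matrix
    leverage = np.diag(hat)
    residuals = y - hat @ y
    dof = X1.shape[0] - X1.shape[1]              # residual degrees of freedom
    sigma = np.sqrt(residuals @ residuals / dof)
    return leverage, residuals / (sigma * np.sqrt(1.0 - leverage))

# Hypothetical data: ten regular samples plus one far from the others in x.
x = np.append(np.arange(10.0), 30.0)
y = 2.0 * x + 1.0 + 0.1 * (-1.0) ** np.arange(11)
h, r = leverage_and_studentized_residuals(x.reshape(-1, 1), y)
```

Samples that combine high leverage with a large absolute Studentized residual are the usual outlier candidates; the cut-off values applied to each quantity are a modeling choice and are not specified here.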
Abstract:
This study developed and validated a method for moisture determination in artisanal Minas cheese, using near-infrared spectroscopy and partial least squares. Model robustness was assured by broad sample diversity, real conditions of routine analysis, variable selection, outlier detection and analytical validation. The model was built over the range 28.5-55.5% w/w, with a root-mean-square error of prediction of 1.6%. After its adoption, the stability of the method was confirmed over a period of two years through the development of a control chart. Besides this specific method, the present study sought to provide an example of a multivariate metrological methodology with potential for application in several areas, including new aspects such as a more stringent evaluation of the linearity of multivariate methods.
Abstract:
The aim of the present work was to provide a faster, simpler and less expensive way to determine the sulfur content of diesel samples than the standard methods currently used. Samples of diesel fuel with sulfur concentrations ranging from 400 to 2500 mg kg⁻¹ were analyzed by two methodologies: X-ray fluorescence, according to ASTM D4294, and Fourier transform infrared spectrometry (FTIR). The spectral data obtained from FTIR were used to build multivariate calibration models by partial least squares (PLS). Four models were built in three different ways: 1) a model using the full spectra (665 to 4000 cm⁻¹), 2) two models using specific spectral regions and 3) a model with variables selected by the classical stepwise variable selection method. The model obtained by stepwise variable selection and the model built with the spectral regions between 665 and 856 cm⁻¹ and between 1145 and 2717 cm⁻¹ showed better results in the determination of sulfur content.
Abstract:
INTRODUCTION: The correct identification of the underlying cause of death and its precise assignment to a code from the International Classification of Diseases are important for achieving accurate and universally comparable mortality statistics. These factors, among others, led to the development of computer software programs to identify the underlying cause of death automatically. OBJECTIVE: This work was conceived to compare the underlying causes of death processed respectively by the Automated Classification of Medical Entities (ACME) and the "Sistema de Seleção de Causa Básica de Morte" (SCB) programs. MATERIAL AND METHOD: The comparative evaluation of the underlying causes of death processed respectively by the ACME and SCB systems was performed using the input data file for the ACME system, which included deaths that occurred in the State of S. Paulo from June to December 1993, totalling 129,104 records of the corresponding death certificates. The differences between the underlying causes selected by the ACME and SCB systems found in the month of June, when considered to be SCB errors, were used to correct and improve the SCB processing logic and its decision tables. RESULTS: The processing of the underlying causes of death by the ACME and SCB systems resulted in 3,278 differences, which were analysed and ascribed to lack of answer to dialogue boxes during processing, to deaths due to human immunodeficiency virus [HIV] disease for which there was no specific provision in either system, to coding and/or keying errors and to actual problems. Detailed analysis of the latter showed that the majority of the underlying causes of death processed by the SCB system were correct, that each system interpreted the mortality coding rules differently, that some particular problems could not be explained with the available documentation and that a smaller proportion of problems were identified as SCB errors. CONCLUSION: These results, disclosing a very low and insignificant number of actual problems, support the use of the version of the SCB system for the Ninth Revision of the International Classification of Diseases and assure the continuity of the work being undertaken on the Tenth Revision version.
Abstract:
OBJECTIVE: To determine the prevalence and severity of occlusal problems in populations at the ages of deciduous and permanent dentition and to carry out a meta-analysis to estimate the weighted odds ratio for occlusal problems comparing both groups. METHODS: Data from a probabilistic sample (n=985) of schoolchildren aged 5 and 12 from an epidemiological study in the municipality of São Paulo, Brazil, were analyzed using multiple logistic regression (MLR). Results of cross-sectional studies published in the last 70 years were examined in the meta-analysis. RESULTS: The prevalence of occlusal problems increased from 49.0% (95% CI =47.4%-50.6%) in the deciduous dentition to 71.3% (95% CI =70.3%-72.3%) in the permanent dentition (p<0.001). Dentition was the only variable significantly associated with the severity of malocclusion (OR=1.87; 95% CI =1.43-2.45; p<0.001). The variables sex, type of school and ethnic group were not significant. The meta-analysis showed a weighted OR of 1.95 (1.91; 1.98) when comparing the second dentition period with the deciduous and mixed dentition. CONCLUSIONS: In planning oral health services, some activities are indicated to reduce the proportion of moderate/severe malocclusion to levels that are socially more acceptable and economically sustainable.
Abstract:
Two populations of the wasp Trypoxylon rogenhoferi Kohl, 1884 from São Carlos and Luís Antônio, State of São Paulo, Brazil, were observed and sampled from May 1999 to February 2001 using trap-nests. This mass-provisioning wasp was used to test some aspects of optimal sex allocation theory. Both populations fit all the predictions of the models of Green and Brockmann and Grafen. Maternal provisions determined the size of each offspring, and females allocated well-stocked brood cells to daughters, the sex that benefits most from being large. This strategy resulted in a difference in size between the sexes. In São Carlos, female weight at emergence was 1.18 times that of males; in Luís Antônio this value was 1.13. The brood cell volume was correlated with both wing length and weight at emergence in both sexes, and the chance that a given brood cell contained a male offspring decreased with increasing brood cell volume. In T. rogenhoferi, female body size was related to fitness. Larger females were able to collect a greater mass of spiders per day, the spiders they captured were heavier, and they provisioned more brood cells per day. They also produced larger daughters. For males, no relationship between body size and fitness was found, but the data were scarce. Since the patterns of provisioning varied among females in both study sites, it is possible that the females do not follow a single strategy for sex allocation. The sex ratio and/or investment ratio was female-biased in the São Carlos population and male-biased in Luís Antônio. In spite of the influence of trap-nest diameters on male production in Luís Antônio, there is some evidence that in the São Carlos population the local availability of prey and/or a lower rate of parasitism may be major forces in determining the observed sex ratio, but further studies are necessary to verify this hypothesis.
Abstract:
The objective of this study was to select lipase-producing microorganisms for the removal of oils and grease in the pre-treatment of biodiesel washing wastewater. To this end, the physicochemical characteristics of the biodiesel washing wastewater were analyzed, and microorganisms were then isolated and selected by determining their lipase activity. Next, a fat biodegradation test was carried out under the following conditions: pH 5.95, temperature 35 °C, agitation at 180 rpm and ammonium sulfate as nitrogen source (3 g L⁻¹), with the two preselected microorganisms and the time (24, 48, 72, 96 and 120 h) as variables. The biodiesel purification wastewater showed a high potential for environmental impact, with an oils and grease (O&G) concentration of 6.76 g L⁻¹. Of the six isolated microbial cultures, two microorganisms (A and B) were selected, with enzymatic indices of 0.56 and 0.57, respectively. Treatment of the wastewater with the isolated microorganism (Klebsiella oxytoca) removed 80% of the fat in 48 h.
Abstract:
Products developed by industries, institutes and research centers are expected to have a high level of quality and performance with minimum waste, which requires efficient and robust tools to numerically simulate stringent project conditions with great reliability. In this context, Computational Fluid Dynamics (CFD) plays an important role, and the present work shows two numerical algorithms that are used in the CFD community to solve the Euler and Navier-Stokes equations applied to typical aerospace and aeronautical problems. In particular, unstructured discretization of the spatial domain has gained special attention from the international community because of the ease with which it handles complex spatial domains. The main objective of this work is to illustrate some advantages and disadvantages of numerical algorithms using structured and unstructured spatial discretization of the flow governing equations. The numerical methods use a finite volume formulation, and the Euler and Navier-Stokes equations are applied to solve a transonic nozzle problem, a low supersonic airfoil problem and a hypersonic inlet problem. In a structured context, these problems are solved using MacCormack's implicit algorithm with Steger and Warming's flux vector splitting technique, while, in an unstructured context, Jameson and Mavriplis' explicit algorithm is used. Convergence acceleration is obtained using a spatially variable time stepping procedure.
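A spatially variable (local) time step is typically obtained by applying a CFL condition cell by cell, so that each cell advances with the largest step its own size and wave speed allow. The sketch below assumes the common one-dimensional inviscid form, dt_i = CFL * dx_i / (|u_i| + a_i); the function name, the CFL value and the sample data are hypothetical, and the paper's actual procedure may differ.

```python
import numpy as np

def local_time_step(cell_size, velocity, sound_speed, cfl=0.5):
    """Per-cell time step from a CFL condition (1D inviscid estimate)."""
    # Fastest characteristic speed in each cell: |u| + a.
    wave_speed = np.abs(velocity) + sound_speed
    return cfl * cell_size / wave_speed

# Hypothetical mesh with three cells of very different sizes.
dx = np.array([0.1, 0.05, 0.01])      # cell sizes in m
u = np.array([100.0, 300.0, -50.0])   # flow velocities in m/s
a = np.array([340.0, 340.0, 340.0])   # speeds of sound in m/s
dt = local_time_step(dx, u, a)
```

Because each cell uses its own step, the intermediate solution is not time-accurate; the approach is suited to steady-state problems, where only the converged result matters.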
Abstract:
We describe the expression of an anti-Z-DNA single chain variable region antibody fragment (scFv) on a filamentous phage surface. Four vectors for phage display were constructed. Two of them are able to display multiple copies of the antibody fragment, and the others can be used to make monovalent libraries. The vectors use different promoter/leader sequences to direct the expression of the fused proteins. All were able to promote the assembly of fusion virion particles. In this paper we also show the affinity selection (biopanning) of those phage-antibodies based on the capacity of their products to recognize the antigen. We used biotinylated Z-DNA and the selection was performed in a solution phase fashion. The data presented here indicate that these vectors can be further used to construct anti-nucleic acid antibody fragment libraries that can be used to study the basis of nucleic acid-protein interaction and its role in autoimmunity mechanisms.
Abstract:
This study aimed to evaluate the efficiency of simultaneous selection (selection indices) using estimated genetic gains in yellow passion fruit and to compare the methodologies of Mulamba & Mock and Elston. The study was conducted with 26 sib progenies of yellow passion fruit for intrinsic production characteristics, including fruit number, fruit mass, fruit length and diameter, and for the fruit characteristics skin thickness, soluble solids and acidity. Two methodologies were applied: first, the joint analysis of fruit characteristics and intrinsic production characteristics in a single phase of selection; and second, an analysis in two phases, in which priority was given to the intrinsic production characteristics in the first phase and, in the second phase, the best fruit characteristics were chosen among the progenies selected in the first phase. Analysis of variance was applied to the data to detect genetic variability among progenies. Elston's selection index was unable to provide a distribution of genetic gains consistent with the purposes of the study, as it selected a single progeny of passion fruit. However, the index based on the sum of ranks of Mulamba & Mock was more suitable, as it provided a balanced distribution of gains, selecting a larger number of progenies. The methodology of selection using indices is advantageous in passion fruit, since it contributes to higher genetic gains for all the traits evaluated, and selection in a single phase proved efficient for progeny selection.
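The sum-of-ranks index of Mulamba & Mock can be sketched as follows: each progeny is ranked on every trait in the favorable direction, and the ranks are summed, so that the lowest totals indicate the best overall progenies. The sketch assumes unweighted traits and no ties; the function name, the trait values and the assumption that thinner skin is desirable are hypothetical and do not come from the study.

```python
import numpy as np

def rank_sum_index(trait_means, higher_is_better):
    """Sum of per-trait ranks for each progeny (rank 1 = best on a trait)."""
    trait_means = np.asarray(trait_means, dtype=float)
    # Flip the sign of traits to be maximized so that smaller is always better.
    signs = np.where(higher_is_better, -1.0, 1.0)
    oriented = trait_means * signs
    # Double argsort converts values to 0-based ranks within each column.
    ranks = oriented.argsort(axis=0).argsort(axis=0) + 1
    return ranks.sum(axis=1)

# Hypothetical progeny means: fruit number, fruit mass (g), skin thickness (mm).
means = [
    [30, 200, 6],
    [25, 220, 5],
    [35, 180, 7],
    [20, 150, 8],
]
index = rank_sum_index(means, higher_is_better=[True, True, False])
best_progeny = int(np.argmin(index))  # row with the lowest rank sum
```

In practice the traits may be given economic weights before the ranks are summed, and several of the best-ranked progenies are selected rather than only one.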