16 results for random forest data analysis
in Scielo Saúde Pública - SP
Abstract:
Structural modeling of spatial dependence using a geostatistical approach is an indispensable tool for determining the parameters that define this structure, which are then used to interpolate values at unsampled points by kriging. However, parameter estimation can be greatly affected by the presence of atypical observations in the sampled data. The purpose of this study was to use diagnostic techniques in Gaussian spatial linear models in geostatistics to evaluate the sensitivity of maximum likelihood and restricted maximum likelihood estimators to small perturbations in these data. For this purpose, studies with simulated and experimental data were conducted. Results with simulated data showed that the diagnostic techniques were efficient in identifying the perturbation in the data. The results with real data indicated that atypical values among the sampled data may have a strong influence on thematic maps by changing the spatial dependence structure. The application of diagnostic techniques should be part of any geostatistical analysis to ensure better-quality information from thematic maps.
Abstract:
Objective: To evaluate the association between Hashimoto's thyroiditis (HT) and papillary thyroid carcinoma (PTC). Materials and Methods: The patients were evaluated by ultrasonography-guided fine needle aspiration cytology. Typical cytopathological aspects and/or classical histopathological findings were taken into consideration in the diagnosis of HT, and only histopathological results were considered in the diagnosis of PTC. Results: Among 1,049 patients with multi- or uninodular goiter (903 women and 146 men), 173 (16.5%) had cytopathological features of thyroiditis. Thirty-three (67.4%) out of the 49 operated patients had PTC, 9 (27.3%) of them with histopathological features of HT. Five (31.3%) out of the 16 patients with non-malignant disease also had HT. In the groups with HT, PTC, and PTC+HT, the female prevalence rate was 100%, 91.6%, and 77.8%, respectively. Mean age was 41.5, 43.3, and 48.5 years, respectively. No association was observed between the two diseases in the present study, where HT occurred in 31.1% of the benign cases and in 27.3% of malignant cases (p = 0.8). Conclusion: Despite the absence of association between HT and PTC, the possibility of malignancy in HT should always be considered because of the coexistence of the two diseases already reported in the literature.
Abstract:
GLUT4 protein expression in white adipose tissue (WAT) and skeletal muscle (SM) was investigated in 2-month-old, 12-month-old spontaneously obese or 12-month-old calorie-restricted lean Wistar rats, by considering different parameters of analysis, such as tissue and body weight, and total protein yield of the tissue. In WAT, a ~70% decrease was observed in plasma membrane and microsomal GLUT4 protein, expressed as µg protein or g tissue, in both 12-month-old obese and 12-month-old lean rats compared to 2-month-old rats. However, when plasma membrane and microsomal GLUT4 tissue contents were expressed as g body weight, they were the same. In SM, GLUT4 protein content, expressed as µg protein, was similar in 2-month-old and 12-month-old obese rats, whereas it was reduced in 12-month-old obese rats, when expressed as g tissue or g body weight, which may play an important role in insulin resistance. Weight loss did not change the SM GLUT4 content. These results show that altered insulin sensitivity is accompanied by modulation of GLUT4 protein expression. However, the true role of WAT and SM GLUT4 contents in whole-body or tissue insulin sensitivity should be determined considering not only GLUT4 protein expression, but also the strong morphostructural changes in these tissues, which require different types of data analysis.
Abstract:
This study sought to evaluate the acceptance of "dulce de leche" with coffee and whey. The results were analyzed through response surface, ANOVA, test of averages, histograms, and a preference map correlating the global impression data with the results of physical, physicochemical and sensory analyses. The response surface methodology, by itself, was not enough to find the best formulation. ANOVA, the test of averages, and the preference map showed that the consumers' favorite "dulce de leche" samples were those of formulation 1 (10% whey and 1% coffee) and formulation 2 (30% whey and 1% coffee), followed by formulation 9 (20% whey and 1.25% coffee). The acceptance of samples 1 and 2 was influenced by their higher flavor acceptability and by their higher pH, L*, and b* values. Samples 1 and 2 also had higher purchase approval scores and higher percentages of responses in the 'ideal' category for sweetness and coffee flavor. Consumers preferred the samples with low concentrations of coffee regardless of the concentration of whey, which enables the use of whey and coffee in the manufacture of dulce de leche to obtain a new product.
Abstract:
Epidemiological studies have shown the effect of diet on the incidence of chronic diseases; however, proper planning, designing, and statistical modeling are necessary to obtain precise and accurate food consumption data. Evaluation methods used for short-term assessment of food consumption of a population, such as tracking of food intake over 24h or food diaries, can be affected by random errors or biases inherent to the method. Statistical modeling is used to handle random errors, whereas proper designing and sampling are essential for controlling biases. The present study aimed to analyze potential biases and random errors and determine how they affect the results. We also aimed to identify ways to prevent them and/or to use statistical approaches in epidemiological studies involving dietary assessments.
Abstract:
The precise sampling of soil, biological or microclimatic attributes in tropical forests, which are characterized by a high diversity of species and complex spatial variability, is a difficult task. We found few basic studies to guide sampling procedures. The objective of this study was to define a sampling strategy and data analysis for some parameters frequently used in nutrient cycling studies, i.e., litter amount, total nutrient amounts in litter and its composition (Ca, Mg, K, N and P), and soil attributes at three depths (organic matter, P content, cation exchange capacity and base saturation). A natural remnant forest in the west of São Paulo State (Brazil) was selected as the study area and samples were collected in July 1989. The total amount of litter and its total nutrient amounts had a high spatially independent variance. Conversely, the variance of litter composition was lower and the spatial dependency was peculiar to each nutrient. The sampling strategy for estimating litter amounts and the amount of nutrients in litter should therefore differ from the sampling strategy for nutrient composition. For the estimation of litter amounts and the amount of nutrients in litter (related to quantity), a large number of randomly distributed determinations are needed. In contrast, for the estimation of litter nutrient composition (related to quality), a smaller number of spatially located samples should be analyzed. The sampling required for soil attributes differed according to depth. Overall, surface samples (0-5 cm) showed high spatially dependent variance over short distances, whereas subsurface samples exhibited spatial dependency over longer distances. Short transects with a sampling interval of 5-10 m are recommended for surface sampling. Subsurface samples must also be spatially located, but with transects or grids with longer distances between sampling points over the entire area.
Composite soil samples would not provide a complete understanding of the relation between soil properties and surface dynamic processes or landscape aspects. The precise distribution of P was difficult to estimate.
Abstract:
In general, laboratory activities are costly in terms of time, space, and money. As such, the ability to provide realistically simulated laboratory data that enable students to practice data analysis techniques as a complementary activity would be expected to reduce these costs while opening up very interesting possibilities. In the present work, a novel methodology is presented for the design of analytical chemistry instrumental analysis exercises that can be automatically personalized for each student and whose results can be evaluated immediately. The proposed system provides each student with a different set of experimental data generated randomly while satisfying a set of constraints, rather than using data obtained from actual laboratory work. This allows the instructor to provide students with a set of practical problems to complement their regular laboratory work, along with the corresponding feedback provided by the system's automatic evaluation process. To this end, the Goodle Grading Management System (GMS), an innovative web-based educational tool for automating the collection and assessment of practical exercises for engineering and scientific courses, was developed. The proposed methodology takes full advantage of the Goodle GMS fusion code architecture. The design of a particular exercise is provided ad hoc by the instructor and requires basic Matlab knowledge. The system has been employed with satisfactory results in several university courses. To demonstrate the automatic evaluation process, three exercises are presented in detail. The first exercise involves a linear regression analysis of data and the calculation of the quality parameters of an instrumental analysis method. The second and third exercises address two different comparison tests: a comparison test of the mean and a paired t-test.
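The idea of per-student randomized data with automatic checking can be illustrated with a minimal sketch. It is not the Goodle GMS code (which the abstract says relies on Matlab); the function names, the seeding by student ID, the parameter ranges, and the tolerance are all hypothetical choices made only for illustration. The detection limit shown uses one common convention (3 s / slope); the abstract does not say which quality parameters or formulas the exercise uses.

```python
import math
import random


def make_calibration_data(student_id, n_points=6):
    """Generate a calibration set that is different, but reproducible, per student."""
    rng = random.Random(student_id)       # hypothetical: seed the generator by student ID
    slope = rng.uniform(0.8, 1.2)         # assumed constraint on sensitivity
    intercept = rng.uniform(0.0, 0.05)    # assumed constraint on blank signal
    conc = [float(i) for i in range(n_points)]  # concentrations, arbitrary units
    signal = [intercept + slope * c + rng.gauss(0.0, 0.01) for c in conc]
    return conc, signal


def fit_line(x, y):
    """Ordinary least squares: slope, intercept, r squared, residual std. deviation."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxx = sum((xi - mx) ** 2 for xi in x)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    syy = sum((yi - my) ** 2 for yi in y)
    slope = sxy / sxx
    intercept = my - slope * mx
    ss_res = sum((yi - (intercept + slope * xi)) ** 2 for xi, yi in zip(x, y))
    r2 = 1.0 - ss_res / syy
    s_res = math.sqrt(ss_res / (n - 2))
    return slope, intercept, r2, s_res


def grade(student_answer, reference, rel_tol=0.01):
    """Accept an answer within a relative tolerance of the reference value."""
    return math.isclose(student_answer, reference, rel_tol=rel_tol)


conc, signal = make_calibration_data(student_id=42)
slope, intercept, r2, s_res = fit_line(conc, signal)
lod = 3.0 * s_res / slope  # one common detection-limit convention, assumed here
print(grade(student_answer=slope, reference=slope))
```

Because the generator is seeded, the grader can regenerate the same data set for a given student and compare the submitted slope, intercept, or detection limit against its own computed reference values.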
Abstract:
A Geographic Information System (GIS) is an indispensable software tool in forest planning. In forestry transportation, GIS can manage the data on the road network and solve some transportation problems, such as route planning. Therefore, the aim of this study was to determine the pattern of the road network and define transport routes using GIS technology. The present research was conducted in a forestry company in the state of Minas Gerais, Brazil. The criteria used to classify the pattern of forest roads were horizontal and vertical geometry and pavement type. In order to determine transport routes, a network analysis data model was created in ArcGIS using the Network Analyst extension, which made it possible to find a route that is shorter in distance and faster. The results showed a predominance of the horizontal geometry classes "average" (3) and "bad" (4), indicating the presence of winding roads. For the vertical geometry criterion, the class of highly mountainous relief (4) accounted for the greatest extent of roads. Regarding the type of pavement, secondary coating was the most frequent (75%), followed by primary coating (20%) and asphalt pavement (5%). The best route was the one that allowed the transport vehicle to travel at a higher specific speed as a function of the road pattern found in the study.
Abstract:
The purpose of this study is to analyse the climatic aspects of the data collected at a forest site in comparison with conventional data obtained at different sites, such as clearing, rural and urban areas. The results showed that diverse climatic conditions do exist among the sites: the urban site showed higher temperature and lower relative humidity. In addition, evapotranspiration (potential and actual rates) was computed from the forest data set using the classical Penman-Monteith equation. The actual evapotranspiration is 30% of the potential value during the dry period and seems to be almost constant throughout the year (typically 2.0 to 2.5 mm day⁻¹).
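For readers unfamiliar with the Penman-Monteith equation mentioned above, the following sketch shows its standard form. The abstract gives neither the input data nor the exact formulation used, so the input values, the default psychrometric constant, the latent heat value, and the choice of treating the zero-surface-resistance case as the "potential" rate are all assumptions made here for illustration.

```python
LAMBDA_V = 2.45e6  # latent heat of vaporization in J kg^-1 (approximate, assumed)


def penman_monteith(delta, rn, g, rho_a, cp, vpd, ra, rs, gamma=0.066):
    """Latent heat flux in W m^-2 from the Penman-Monteith equation.

    delta : slope of the saturation vapour pressure curve (kPa K^-1)
    rn, g : net radiation and soil heat flux (W m^-2)
    rho_a : air density (kg m^-3); cp : specific heat of air (J kg^-1 K^-1)
    vpd   : vapour pressure deficit (kPa)
    ra, rs: aerodynamic and surface resistances (s m^-1)
    gamma : psychrometric constant (kPa K^-1), assumed near-sea-level value
    """
    numerator = delta * (rn - g) + rho_a * cp * vpd / ra
    denominator = delta + gamma * (1.0 + rs / ra)
    return numerator / denominator


def to_mm_per_day(latent_heat_flux):
    """Convert W m^-2 to mm day^-1 (1 kg of water per m^2 equals 1 mm)."""
    return latent_heat_flux / LAMBDA_V * 86400.0


# Illustrative inputs only; none of these values come from the study.
inputs = dict(delta=0.2, rn=150.0, g=0.0, rho_a=1.2, cp=1013.0, vpd=1.0, ra=20.0)
potential = penman_monteith(rs=0.0, **inputs)    # assumed: no surface resistance
actual = penman_monteith(rs=100.0, **inputs)     # assumed canopy resistance
print(to_mm_per_day(potential), to_mm_per_day(actual), actual / potential)
```

Increasing the surface resistance `rs` lowers the computed flux, which is how an actual rate can fall well below the potential rate during a dry period.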
Abstract:
Changes in the floristic composition over an eight-year period in a logged area at the Tapajós National Forest in Brazilian Amazonia are discussed. Two treatments with different logging intensities were compared with an undisturbed (control) forest. Data were collected from permanent sample plots. The effects of logging on floristic composition were stronger in the more heavily logged treatment. The number of species decreased immediately after logging, but started to increase before the fifth year after logging and was higher at the end of the study period than before logging. The more heavily logged plots responded more to disturbances, as judged by the increase in the number of species during the period after logging. This forest appears to recover its initial floristic composition after disturbance without intervention.
Abstract:
The Amazon forest is rich in plant species diversity. Among these species, Piranhea trifoliata stands out; it is popularly known as piranheira because its fruits are eaten by fish. Its bark is used in baths for uterine inflammation and as a tea in malaria treatment. This study aimed to fractionate the dichloromethane extract and the dichloromethane phase from the methanolic extract of leaves of Piranhea trifoliata. The leaves were dried, ground and extracted with dichloromethane, methanol and water. The methanol extract was partitioned with dichloromethane and ethyl acetate. The chromatographic fractionation yielded six pentacyclic triterpenoids: friedelan-3-one, 28-hydroxy-friedelan-3-one, 30-hydroxy-friedelan-3-one, lupeol, and a mixture of α- and β-amyrin, in addition to a mixture of the steroids β-sitosterol and stigmasterol. The structures of the substances were identified by 1H- and 13C-Nuclear Magnetic Resonance (NMR) analysis and comparison with literature data. This is the first report describing the chemical study of P. trifoliata leaves.
Abstract:
The geographic information system approach has permitted integration between demographic, socio-economic and environmental data, providing correlation between information from several data banks. In the current work, the occurrence of human and canine visceral leishmaniasis and of insect vectors (Lutzomyia longipalpis), as well as biogeographic information related to the 9 areas that comprise the city of Belo Horizonte, Brazil, between April 2001 and March 2002, were correlated and georeferenced. By using this technique it was possible to define concentration loci of canine leishmaniasis in the following regions: East; Northeast; Northwest; West; and Venda Nova. For human leishmaniasis, however, it was not possible to perform the same analysis. Data analysis also showed that 84.2% of the human leishmaniasis cases were related to canine leishmaniasis cases. In the analysis of biogeographic factors (altitude, area of vegetation influence, hydrography, and areas of poverty), only altitude was shown to influence the emergence of leishmaniasis cases. A total of 4,673 canine leishmaniasis cases and 64 human leishmaniasis cases were georeferenced, of which 67.5% and 71.9%, respectively, were located between 780 and 880 m above sea level. At these same altitudes, a large number of phlebotomine sand flies were collected. Therefore, we suggest control measures for leishmaniasis in the city of Belo Horizonte that give priority to canine leishmaniasis foci and to regions at altitudes between 780 and 880 m.
Abstract:
With the advent of mechanized farming and the intensive use of agricultural machinery and implements on farms, the soil began to receive a greater load of machinery traffic, which can increase soil compaction. The aim of this study was to evaluate the spatial variability of soil mechanical resistance to penetration (RP) in the layers of 0.00-0.10, 0.10-0.20, 0.20-0.30 and 0.30-0.40 m, using geostatistics, in an area cultivated with mango on a Haplic Vertisol in the northeastern semi-arid region, with a mobile unit equipped with an electronic penetrometer. The RP data were collected at 56 points in an area of 3 ha, and random soil samples were collected to determine soil moisture and texture. For RP data analysis we used descriptive statistics and geostatistics. The soil mechanical resistance to penetration showed increased variability, and spherical and exponential semivariograms were fitted to the layers. We found that 42% of the area in the 0.10-0.20 m layer showed RP values above 2.70 MPa. Maximum values of RP were found in the layer of 0.19-0.27 m, predominantly in 56% of the area.
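The spherical and exponential semivariogram models named above, and the empirical semivariogram they are fitted to, can be sketched as follows. This is a generic illustration, not the authors' procedure: the parameter names, the factor of 3 in the exponential model (the "practical range" convention), and the simple distance binning are assumptions, and the abstract does not report fitted nugget, sill, or range values.

```python
import math


def spherical(h, nugget, sill, a):
    """Spherical model; `sill` is the partial sill and `a` the range."""
    if h <= 0.0:
        return 0.0
    if h >= a:
        return nugget + sill
    r = h / a
    return nugget + sill * (1.5 * r - 0.5 * r ** 3)


def exponential(h, nugget, sill, a):
    """Exponential model with practical range `a` (factor 3 is an assumed convention)."""
    if h <= 0.0:
        return 0.0
    return nugget + sill * (1.0 - math.exp(-3.0 * h / a))


def empirical_semivariogram(coords, values, lag_width, n_lags):
    """Half the mean squared difference of value pairs, grouped by distance bin."""
    sums = [0.0] * n_lags
    counts = [0] * n_lags
    for i in range(len(values)):
        for j in range(i + 1, len(values)):
            d = math.dist(coords[i], coords[j])
            k = int(d // lag_width)           # index of the distance bin
            if k < n_lags:
                sums[k] += (values[i] - values[j]) ** 2
                counts[k] += 1
    # Bins with no pairs are reported as None.
    return [s / (2 * c) if c else None for s, c in zip(sums, counts)]
```

In practice the model parameters would be chosen, per soil layer, so that the model curve follows the binned empirical values; the fitted model then supplies the weights used for kriging maps of RP.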
Abstract:
Aulonemia aristulata (Döll) McClure is a lignified bamboo species endemic to Brazil. This species occurs in southeastern forests and can reach high density at forest edges, dominating the understory of canopy-disturbed forest patches. The goal of this study was to describe the flowering period, floral biology, fruiting and seedling recruitment of A. aristulata in natural conditions in two areas located in a segment of the Atlantic Forest. Data on the morphology of the synflorescences and florets, the timing and sequence of the anthesis events, and floral visitors were recorded. Natural pollinators (open pollination or control) as well as spontaneous self-pollination were also checked. Pollen viability was estimated using the acetocarmine technique. Aulonemia aristulata is monocarpic (semelparous) with gregarious flowering. All culms in both studied areas blossomed and fruited between August and November 2007, and subsequently died between December 2007 and April 2008. Two types of synflorescences and flowers were observed: terminal, with bisexual and protandric florets and anthesis lasting for 3-4 days; and axillary, with morphologically bisexual but functionally female florets and anthesis lasting for 3-4 days. The latter were also observed in the rhizome of plants whose aerial portion had been removed. The presence of axillary synflorescences with pistillate flowers is described here for the first time in Aulonemia species. Moreover, this is the first report of gynomonoecy in woody bamboo. Fruiting from bisexual florets under natural conditions (35%) was higher than that obtained from bagged synflorescences (11.5%). Fruiting from functionally female florets was around 20%. Pollen viability averaged 90%. The results suggest that Aulonemia aristulata is anemophilous. The massive bamboo seedling recruitment observed after dieback, together with the ability to colonize open areas, could promote the regeneration of Aulonemia aristulata.
Abstract:
Results of subgroup analysis (SA) reported in randomized clinical trials (RCT) cannot be adequately interpreted without information about the methods used in the study design and the data analysis. Our aim was to show how often inaccurate or incomplete reports occur. First, we selected eight methodological aspects of SA on the basis of their importance to a reader in determining the confidence that should be placed in the author's conclusions regarding such analysis. Then, we reviewed the current practice of reporting these methodological aspects of SA in clinical trials in four leading journals, i.e., the New England Journal of Medicine, the Journal of the American Medical Association, the Lancet, and the American Journal of Public Health. Eight consecutive reports from each journal published after July 1, 1998 were included. Of the 32 trials surveyed, 17 (53%) had at least one SA. Overall, the proportion of RCT reporting a particular methodological aspect ranged from 23 to 94%. Information on whether the SA preceded/followed the analysis was reported in only 7 (41%) of the studies. Of the total possible number of items to be reported, NEJM, JAMA, Lancet and AJPH clearly mentioned 59, 67, 58 and 72%, respectively. We conclude that current reporting of SA in RCT is incomplete and inaccurate. The results of such SA may have harmful effects on treatment recommendations if accepted without judicious scrutiny. We recommend that editors improve the reporting of SA in RCT by giving authors a list of the important items to be reported.