54 results for Data clustering. Fuzzy C-Means. Cluster centers initialization. Validation indices
Abstract:
GC/MS/FID analyses of volatile compounds from cladodes and inflorescences from male and female specimens of Baccharis trimera (Less.) DC. collected in the states of Paraná and Santa Catarina, Brazil, showed that carquejyl acetate was the primary volatile component (38% to 73%), while carquejol and ledol were identified in lower concentrations. Data were subjected to hierarchical cluster analysis and principal component analysis, which confirmed that the chemical compositions of all samples were similar. The results presented here highlight the occurrence of the same chemotype of B. trimera in three southern states of Brazil.
Abstract:
To study the dendritic morphology of retinal ganglion cells in wild-type mice we intracellularly injected these cells with Lucifer yellow in an in vitro preparation of the retina. Subsequently, quantified values of dendritic thickness, number of branching points and level of stratification of 73 Lucifer yellow-filled ganglion cells were analyzed by statistical methods, resulting in a classification into 9 groups. The variables dendritic thickness, number of branching points per cell and level of stratification were independent of each other. Number of branching points and level of stratification were independent of eccentricity, whereas dendritic thickness was positively dependent (r = 0.37) on it. The frequency distribution of dendritic thickness tended to be multimodal, indicating the presence of at least two cell populations composed of neurons with dendritic diameters either smaller or larger than 1.8 µm ("thin" or "thick" dendrites, respectively). Three cells (4.5%) were bistratified, having thick dendrites, and the others (95.5%) were monostratified. Using k-means cluster analysis, monostratified cells with either thin or thick dendrites were further subdivided according to level of stratification and number of branching points: cells with thin dendrites were divided into 2 groups with outer stratification (0-40%) and 2 groups with inner (50-100%) stratification, whereas cells with thick dendrites were divided into one group with outer and 3 groups with inner stratification. We postulate that one group of cells with thin dendrites resembles cat β-cells, whereas one group of cells with thick dendrites includes cells that resemble cat α-cells.
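The k-means step mentioned in this abstract can be illustrated with a minimal sketch. It assumes plain Lloyd iterations on two features (level of stratification in percent, number of branching points), and the cell values, initial centers and function names are hypothetical, not the study's data or code. In practice the features would usually be rescaled first so that one does not dominate the distance.

```python
def squared_distance(p, q):
    """Squared Euclidean distance between two feature tuples."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def assign(points, centers):
    """Return, for each point, the index of its nearest center."""
    return [
        min(range(len(centers)), key=lambda k: squared_distance(p, centers[k]))
        for p in points
    ]


def kmeans(points, initial_centers, max_iter=100):
    """Lloyd's algorithm: alternate assignment and center update until stable."""
    centers = [tuple(float(v) for v in c) for c in initial_centers]
    for _ in range(max_iter):
        labels = assign(points, centers)
        new_centers = []
        for k, old in enumerate(centers):
            members = [p for p, lab in zip(points, labels) if lab == k]
            if members:
                # New center is the per-feature mean of the cluster members.
                new_centers.append(
                    tuple(sum(col) / len(members) for col in zip(*members))
                )
            else:
                # Keep an empty cluster's center where it was.
                new_centers.append(old)
        if new_centers == centers:
            break
        centers = new_centers
    return assign(points, centers), centers


# Hypothetical cells: (level of stratification in %, number of branching points)
cells = [(10, 3), (20, 4), (80, 9), (90, 10)]
labels, centers = kmeans(cells, initial_centers=[(0, 0), (100, 10)])
```

With these made-up values the two outer-stratifying cells and the two inner-stratifying cells end up in separate groups. The result of k-means depends on the initial centers, which is why center initialization matters in this family of methods.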
Abstract:
The range of 25-hydroxyvitamin D (25OHD) concentration was determined in a young healthy population based on bone metabolism parameters and environmental and behavioral aspects. We studied 121 healthy young volunteers (49 men, 72 women) living in São Paulo (23° 34' south latitude) belonging to three occupational categories: indoor workers (N = 28), medical school students (N = 44), and resident physicians (N = 49). Fasting morning blood samples were collected once from each volunteer from August 2002 to February 2004, and 25OHD, total calcium, albumin, alkaline phosphatase, phosphorus, creatinine, intact parathyroid hormone, osteocalcin, and type I collagen carboxyterminal telopeptide were measured. Data are reported as means ± SD. Mean subject age was 24.7 ± 2.68 years and mean 25OHD level for the entire group was 78.7 ± 33.1 nM. 25OHD levels were lower (P < 0.05) among resident physicians (67.1 ± 27.0 nM) than among students (81.5 ± 35.8 nM) and workers (94.0 ± 32.6 nM), with the last two categories displaying no difference. Parathyroid hormone was higher (P < 0.05) and osteocalcin was lower (P < 0.05) among resident physicians compared to non-physicians. Solar exposure and frequency of beach outings showed a positive association with 25OHD (P < 0.001), and summer samples showed higher values than winter ones (97.8 ± 33.5 and 62.9 ± 23.5 nM, respectively). To define normal levels, parameters such as occupational activity, seasonality and habits related to solar exposure should be taken into account. Based on these data, we considered concentrations above 74.5 nM to be desired optimal 25OHD levels, which were obtained during the summer for 75% of the non-physicians.
Abstract:
The Diagnosis and Recommendation Integrated System (DRIS) can improve interpretations of leaf analysis to determine the nutrient status. Diagnoses by this method require DRIS norms, which, however, are not known for oil content of soybean seeds. The aims of this study were to establish and test the DRIS method for oil content of soybean seed (maturity group II cultivars). Soybean leaves (207 samples) in the full flowering stage were analyzed for macro- and micronutrients, and the DRIS was applied to assess the relationship between nutrient ratios and the seed oil content. Samples from experimental and farm field sites of the southernmost Brazilian state, Rio Grande do Sul (28°-29° southern latitude; 52°-53° western longitude), were assessed in two growing seasons (2007/2008 and 2008/2009). The DRIS norms related to seed oil content differed between the studied years. A unique DRIS norm was established for seed oil content higher than 18.68% based on data of the 2007/2008 growing season. Higher DRIS indices of B, Ca, Mg and S were associated with a higher oil content, while the opposite was found for K, N and P. The DRIS can be used to evaluate the leaf nutrient status of soybean to improve the seed oil content of the crop.
Abstract:
Fingerprinting of Mycobacterium tuberculosis strains from tuberculosis (TB) patients seen at Community Health Centers (CHCs) of Rio de Janeiro was performed to verify possible risk factors for TB transmission. A prospective community-based study was performed during the period of July 1996 to December 1996 by collecting sputum samples of 489 patients in 11 different CHCs in four different planning areas (APs) of the city. Bacteriological, clinical, and epidemiological information was collected and M. tuberculosis genotypes defined after restriction fragment length polymorphism (IS6110-RFLP) and double repetitive element (DRE) fingerprinting of RFLP-clustered cases. Risk factors for TB transmission were sought using three levels of cluster stringency. Among 349 (71%) positive cultures obtained, IS6110-RFLP typing could be performed on strains from 153 different patients. When using identity of RFLP patterns as cluster definition, 49 (32%) of the strains belonged to a cluster and none of the clinical or epidemiologic characteristics was associated with higher clustering levels. However, a higher clustering level was observed in the AP including the central region of the city when compared to others. This strongly suggests that more recent transmission occurs in that area and this may be related to the higher incidence of TB and HIV in this region.
Abstract:
This study aimed at identifying different conditions of coffee plants after the harvesting period, using data mining and spectral behavior profiles from the Hyperion/EO1 sensor. The Hyperion image, with a spatial resolution of 30 m, was acquired on August 28, 2008, at the end of the coffee harvest season in the studied area. For image pre-processing, atmospheric and signal/noise effect corrections were carried out using the FLAASH and MNF (Minimum Noise Fraction Transform) algorithms, respectively. Spectral behavior profiles (38) of different coffee varieties were generated from 150 Hyperion bands. The spectral behavior profiles were analyzed by the Expectation-Maximization (EM) algorithm considering 2, 3, 4 and 5 clusters. A t-test at the 5% significance level was used to verify the similarity among the wavelength cluster means. The results demonstrated that it is possible to separate five different clusters, which corresponded to different coffee crop conditions, making it possible to improve future intervention actions.
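As a rough illustration of EM-based clustering as named in this abstract, the sketch below fits a one-dimensional Gaussian mixture. The abstract does not describe the model or the software used, so the single-feature setup, the reflectance values, the starting means and the function names are all assumptions made for illustration.

```python
import math


def normal_pdf(x, mean, var):
    """Density of a univariate normal distribution."""
    return math.exp(-((x - mean) ** 2) / (2 * var)) / math.sqrt(2 * math.pi * var)


def em_gmm_1d(values, initial_means, n_iter=50, min_var=1e-6):
    """EM for a 1-D Gaussian mixture; returns weights, means, variances, labels."""
    k = len(initial_means)
    n = len(values)
    means = list(initial_means)
    overall_mean = sum(values) / n
    pooled_var = max(sum((v - overall_mean) ** 2 for v in values) / n, min_var)
    variances = [pooled_var] * k
    weights = [1.0 / k] * k
    for _ in range(n_iter):
        # E-step: posterior probability of each component for each value.
        resp = []
        for v in values:
            dens = [w * normal_pdf(v, m, s)
                    for w, m, s in zip(weights, means, variances)]
            total = sum(dens)
            resp.append([d / total for d in dens])
        # M-step: re-estimate weights, means and variances from the posteriors.
        for j in range(k):
            nj = sum(r[j] for r in resp)
            weights[j] = nj / n
            means[j] = sum(r[j] * v for r, v in zip(resp, values)) / nj
            variances[j] = max(
                sum(r[j] * (v - means[j]) ** 2 for r, v in zip(resp, values)) / nj,
                min_var,  # floor keeps a component from collapsing to zero width
            )
    # Hard labels: the most probable component from the last E-step.
    labels = [max(range(k), key=lambda j: r[j]) for r in resp]
    return weights, means, variances, labels


# Hypothetical reflectance values at one wavelength for six profiles.
reflectance = [0.10, 0.12, 0.11, 0.40, 0.42, 0.41]
weights, means, variances, labels = em_gmm_1d(reflectance, initial_means=[0.1, 0.4])
```

Running the same routine with different numbers of components (the abstract mentions 2 to 5 clusters) and comparing the fits is one common way to choose the cluster count. Full spectral profiles would need a multivariate mixture rather than this one-feature version.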
Abstract:
Coronary artery disease (CAD) is a worldwide leading cause of death. The standard method for evaluating critical partial occlusions is coronary arteriography, a catheterization technique which is invasive, time consuming, and costly. There are noninvasive approaches for the early detection of CAD. The basis for the noninvasive diagnosis of CAD has been laid in a sequential analysis of the risk factors, and the results of the treadmill test and myocardial perfusion scintigraphy (MPS). Many investigators have demonstrated that the diagnostic applications of MPS are appropriate for patients who have an intermediate likelihood of disease. Although this information is useful, it is only partially utilized in clinical practice because of the difficulty of properly classifying patients. Since the seminal work of Lotfi Zadeh, fuzzy logic has been applied in numerous areas. In the present study, we proposed and tested a model to select patients for MPS based on fuzzy set theory. A group of 1053 patients was used to develop the model and another group of 1045 patients was used to test it. Receiver operating characteristic curves were used to compare the performance of the fuzzy model against expert physician opinions, and showed that the performance of the fuzzy model was equal or superior to that of the physicians. Therefore, we conclude that the fuzzy model could be a useful tool to assist the general practitioner in the selection of patients for MPS.
Abstract:
OBJECTIVE: To estimate the incidence rate of type 1 diabetes in the urban area of Santiago, Chile, from March 21, 1997 to March 20, 1998, and to assess the spatio-temporal clustering of cases during that period. METHODS: All sixty-one incident cases were located temporally (day of diagnosis) and spatially (place of residence) in the area of study. Knox's method was used to assess spatio-temporal clustering of incident cases. RESULTS: The overall incidence rate of type 1 diabetes was 4.11 cases per 100,000 children aged less than 15 years per year (95% confidence interval: 3.06-5.14). The incidence rate seems to have increased since the last estimate of the incidence calculated for the years 1986-1992 in the metropolitan region of Santiago. Different combinations of space-time intervals were evaluated to assess spatio-temporal clustering. The smallest p-value was found for the combination of critical distances of 750 meters and 60 days (uncorrected p-value = 0.048). CONCLUSIONS: Although these are preliminary results regarding space-time clustering in Santiago, exploratory analysis of the data suggests a possible aggregation of incident cases in space-time coordinates.
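The idea behind Knox's method can be sketched as follows: count the pairs of cases that are close in both space and time, then judge whether that count is larger than expected. The abstract does not say how significance was computed, so the Monte Carlo permutation of diagnosis days used here is an assumption, and the coordinates, days and function names are hypothetical.

```python
import random
from itertools import combinations
from math import hypot


def knox_statistic(cases, max_dist_m=750.0, max_days=60):
    """Count case pairs within both the critical distance and the critical time."""
    close_pairs = 0
    for (x1, y1, d1), (x2, y2, d2) in combinations(cases, 2):
        near_in_space = hypot(x1 - x2, y1 - y2) <= max_dist_m
        near_in_time = abs(d1 - d2) <= max_days
        if near_in_space and near_in_time:
            close_pairs += 1
    return close_pairs


def knox_permutation_p(cases, max_dist_m=750.0, max_days=60, n_perm=999, seed=0):
    """One-sided p-value from shuffling diagnosis days across fixed locations."""
    rng = random.Random(seed)
    observed = knox_statistic(cases, max_dist_m, max_days)
    locations = [(x, y) for x, y, _ in cases]
    days = [d for _, _, d in cases]
    at_least_as_extreme = 0
    for _ in range(n_perm):
        rng.shuffle(days)  # breaks any link between place and time
        shuffled = [(x, y, d) for (x, y), d in zip(locations, days)]
        if knox_statistic(shuffled, max_dist_m, max_days) >= observed:
            at_least_as_extreme += 1
    return (at_least_as_extreme + 1) / (n_perm + 1)


# Hypothetical cases: (x in meters, y in meters, day of diagnosis)
cases = [(0, 0, 0), (100, 0, 10), (5000, 5000, 200), (5100, 5000, 400)]
observed = knox_statistic(cases)
p_value = knox_permutation_p(cases, n_perm=199)
```

Because the abstract reports trying several distance and time thresholds and quotes the smallest p-value as uncorrected, a repeated use of a routine like this one would need a multiple-comparison adjustment before drawing firm conclusions.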
Abstract:
OBJECTIVE: To identify factors associated with poor glycemic control among diabetic patients seen at primary health care centers. METHODS: A cross-sectional study was carried out in a sample of 372 diabetic patients attending 32 primary health care centers in southern Brazil. Data on three hierarchical levels of health unit infrastructure, medical care and patient characteristics were collected. RESULTS: The frequency of poor glycemic control was 50.5%. Multivariate analysis (multilevel method) showed that patients with body mass indexes below 27 kg/m², patients on oral hypoglycemic agents or insulin, and patients diagnosed as diabetic more than five years before the interview were more likely to present poor glycemic control when compared to their counterparts. CONCLUSIONS: Given the hierarchical data structuring, all associations found suggest that factors associated with hyperglycemia are related to patient-level characteristics.
Abstract:
A total of 730 children aged less than 7 years, attending 8 day-care centers (DCCs) in Belém, Brazil, were followed up from January to December 1997 to investigate the occurrence of human herpesvirus 6 (HHV-6) infection in these institutional settings. Between October and December 1997 there were outbreaks of a febrile and exanthematous disease, affecting at least 15-20% of children in each of the DCCs. Both serum and plasma samples were obtained from 401 (55%) of the 730 participating children for the detection of HHV-6 antibodies by enzyme-linked immunosorbent assay (ELISA), and viral DNA amplification by nested PCR. Recent HHV-6 infection was diagnosed in 63.8% (256/401) of them, as defined by the presence of both IgM and IgG-specific antibodies (IgM+/IgG+); of these, 114 (44.5%) were symptomatic and 142 (55.5%) had no symptoms (p = 0.03). A subgroup of 123 (30.7%) children were found to be IgM-/IgG+, whereas the remaining 22 (5.5%) children had neither IgM nor IgG HHV-6 antibodies (IgM-/IgG-). Of the 118 children reacting strongly IgM-positive (≥ 30 PANBIO units), 26 (22.0%) were found to harbour HHV-6 DNA, as demonstrated by nested PCR. Taking the ELISA-IgM-positive and nested PCR-positive results together, HHV-6 infection was shown to have occurred in 5 of the 8 DCCs under follow-up. Serological evidence of recent infections by Epstein-Barr virus (EBV) and parvovirus B19 was identified in 2.0% (8/401) and 1.5% (6/401) of the children, respectively. Our data provide strong evidence that HHV-6 is a common cause of outbreaks of febrile/exanthematous diseases among children attending DCCs in the Belém area.
Abstract:
In this study, we concentrate on modelling gross primary productivity using two simple approaches to simulate canopy photosynthesis: "big leaf" and "sun/shade" models. Two approaches for calibration are used: scaling up of canopy photosynthetic parameters from the leaf to the canopy level and fitting canopy biochemistry to eddy covariance fluxes. Validation of the models is achieved by using eddy covariance data from the LBA site C14. Comparing the performance of both models, we conclude that numerically (in terms of goodness of fit) and qualitatively (in terms of residual response to different environmental variables) the sun/shade model performs better. Compared to the sun/shade model, the big leaf model shows a lower goodness of fit and fails to respond to variations in the diffuse fraction, also having skewed responses to temperature and VPD. The separate treatment of sun and shade leaves, in combination with the separation of the incoming light into direct beam and diffuse, makes sun/shade a strong modelling tool that captures more of the observed variability in canopy fluxes as measured by eddy covariance. In conclusion, the sun/shade approach is a relatively simple and effective tool for modelling photosynthetic carbon uptake that could be easily included in many terrestrial carbon models.
Abstract:
OBJECTIVE - A population-based prospective study was analysed to: a) determine the prevalence of hypertension; b) investigate the clustering of other cardiovascular risk factors and c) verify whether older differed from younger adults in the pattern of clustering. METHODS - The data comprised a representative sample of the population of Bambuí, Brazil. Multiple logistic regression was used to investigate the independent association between hypertension and selected factors. RESULTS - A total of 820 younger adults (82.5%) and 1494 older adults (85.9%) participated in this study. The overall prevalence of hypertension was 24.8% (SE = 1.4%), being higher in women (26.9 ± 1.5%) than in men (22.0 ± 1.7%) (p = 0.033). Hypertension was positively and significantly associated with physical inactivity, overweight, hypercholesterolemia, hyperglycemia and hypertriglyceridemia. The coexistence of hypertension with 4 or more of these risk factors occurred 6 times more than expected by chance, after adjusting for age and sex (OR = 6.3; 95% CI: 3.4-11.9). The pattern of risk factor clustering in hypertensive individuals differed with age. CONCLUSION - Our results reinforce the need to increase detection and treatment of hypertension and to approach patients' global risk profiles.
Abstract:
Striking similarities at the morphological, molecular and biological levels exist between many trypanosomatids isolated from sylvatic insects and/or vertebrate reservoir hosts that make the identification of medically important parasites demanding. Some molecular data have pointed to the relationship between some Leishmania species and Endotrypanum, which has an important epidemiological significance and can be helpful to understand the evolution of those parasites. In this study, we have demonstrated a close genetic relationship between Endotrypanum and two new leishmanial species, L. (V.) colombiensis and L. (V.) equatorensis. We have used (a) numerical zymotaxonomy and (b) the variability of the internal transcribed spacers of the rRNA genes to examine relationships in this group. The evolutionary trees obtained revealed high genetic similarity between L. (V.) colombiensis, L. (V.) equatorensis and Endotrypanum, forming a tight cluster of parasites. Based on further results of (c) minicircle kDNA heterogeneity analysis and (d) measurement of the sialidase activity these parasites were also grouped together.
Abstract:
The study of the genome of Schistosoma mansoni, one of the etiologic agents of human schistosomiasis, is essential for a better understanding of the biology and development of this parasite. In order to get an overview of all S. mansoni catalogued gene sequences, we performed a clustering analysis of the parasite mRNA sequences available in public databases. This was done using the PHRAP and CAP3 software packages. The consensus sequences, generated after the alignment of cluster constituent sequences, allowed the identification by database homology searches of the most expressed genes in the worm. We analyzed these genes and looked for a correlation between their high expression and parasite metabolism and biology. We observed that the majority of these genes are related to the maintenance of basic cell functions, encoding products related to the cytoskeleton, intracellular transport and energy metabolism. Evidence is presented here that genes for aerobic energy metabolism are expressed in all the developmental stages analyzed. Some of the most expressed genes could not be identified by homology searches and may have some specific functions in the parasite.
Estimation of surface roughness in a semiarid region from C-band ERS-1 synthetic aperture radar data
Abstract:
In this study, we investigated the feasibility of using the C-band European Remote Sensing Satellite (ERS-1) synthetic aperture radar (SAR) data to estimate surface soil roughness in a semiarid rangeland. Radar backscattering coefficients were extracted from a dry and a wet season SAR image and were compared with 47 in situ soil roughness measurements obtained in the rocky soils of the Walnut Gulch Experimental Watershed, southeastern Arizona, USA. Both the dry and the wet season SAR data showed exponential relationships with root mean square (RMS) height measurements. The dry season C-band ERS-1 SAR data were strongly correlated (R² = 0.80), while the wet season SAR data showed somewhat higher secondary variation (R² = 0.59). This lower correlation was probably caused by the stronger influence of soil moisture, which may not be negligible in the wet season SAR data. We concluded that single-configuration C-band SAR data are useful to estimate surface roughness of rocky soils in a semiarid rangeland.