895 results for discriminant analysis and cluster analysis
Abstract:
The objective of this study was to define production environments by grouping different environmental factors and, consequently, to assess genotype by production environment interactions on weaning weight (WW) in the Angus populations of Brazil and Uruguay. Climatic conditions were represented by monthly temperature means (°C), minimum and maximum temperatures in winter and summer, respectively, and accumulated rainfall (mm/year). The modal month of birth and of weaning, and calf weight (kg) and age (days) at weaning, were used as indicators of management conditions of 33 and 161 herds in 13 and 34 regions in Uruguay and Brazil, respectively. Two approaches were developed: (a) a bi-character analysis of extreme sub-datasets within each environmental factor (bottom and top 33% of regions), and (b) a cluster analysis using standardized environmental factors, which defined three different production environments (including farms from both countries). To identify the variables that influenced cluster formation, a discriminant analysis was carried out beforehand. Management (month, age and weight at weaning) and climatic factors (accumulated rainfall and winter and summer temperatures) were the most important factors in the clustering of farms. Bivariate or trivariate analyses were performed with MTDFREML software to estimate heritabilities and genetic correlations for WW in extreme sub-datasets within each environmental factor or between clusters. Heritability estimates of WW in the first approach ranged from 0.27 to 0.54, and genetic correlations between top and bottom sub-datasets within environmental factors ranged from -0.29 to 0.70. In the cluster approach, heritabilities were 0.58±0.04 for Cluster 1, 0.31±0.01 for Cluster 2 and 0.40±0.02 for Cluster 3. Genetic correlations were 0.27±0.08, 0.32±0.09 and 0.33±0.09 between Clusters 1 and 2, 1 and 3, and 2 and 3, respectively.
Both approaches suggest the existence of genotype × environment interaction for weaning weight in the Angus breed of Brazil and Uruguay. © 2012 Elsevier B.V.
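The abstract above mentions clustering on standardized environmental factors. As a minimal sketch of why standardization matters, the snippet below converts two factors to z-scores before computing a distance between herds. The herd values, variable names and the choice of Euclidean distance are assumptions made for illustration; they are not taken from the study.

```python
from statistics import mean, pstdev

def standardize(column):
    """Convert one environmental factor to z-scores (mean 0, SD 1)."""
    mu, sigma = mean(column), pstdev(column)
    return [(v - mu) / sigma for v in column]

def euclidean(p, q):
    """Euclidean distance between two herds described by the same factors."""
    return sum((a - b) ** 2 for a, b in zip(p, q)) ** 0.5

# Hypothetical values for four herds (illustrative only, not study data).
rainfall_mm = [1100.0, 1400.0, 1250.0, 1700.0]
winter_min_c = [4.0, 9.0, 6.0, 12.0]

z_rain = standardize(rainfall_mm)
z_temp = standardize(winter_min_c)
herds = list(zip(z_rain, z_temp))

# On raw units the rainfall difference (hundreds of mm) swamps temperature.
raw_distance = euclidean((rainfall_mm[0], winter_min_c[0]),
                         (rainfall_mm[1], winter_min_c[1]))
# After standardization both factors contribute on a comparable scale.
z_distance = euclidean(herds[0], herds[1])
print(raw_distance, z_distance)
```

Any clustering algorithm that relies on distances could then be run on `herds` instead of the raw columns.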
Abstract:
China is an important center of origin for the genus Citrus L. of the family Rutaceae and is rich in wild Citrus species. The taxonomy of Citrus has been a subject of controversy for more than half a century. We propose that the metabolite profiles of Chinese native Citrus species can be used for classification and for understanding the taxonomic relationships within the Citrus germplasm. In this study, triplicate gas chromatography-mass spectrometry (GC-MS) metabolite profiles of 20 Citrus species/varieties were acquired, including 10 native varieties originating in China. R-(+)-limonene, alpha-pinene, sabinene and alpha-terpinene were found to be major characteristic components of the essential oils analyzed in this study, and these compounds contributed greatly to the metabolic classification. The three basic species of the subgenus Eucitrus (Swingle's system), i.e., C. reticulata Blanco, C. medica L. and C. grandis Osb., were clearly differentiated based upon their metabolite profiles using hierarchical cluster analysis (HCA) and partial least squares discriminant analysis (PLS-DA). All the presumed hybrid genotypes, including sweet orange (C. sinensis Osb.), sour orange (C. aurantium L.), lemon (C. limon Burm.f.), rough lemon (C. jambhiri Lush.), rangpur lime (C. limonia Osb.) and grapefruit (C. paradisi Macf.), were grouped closely together with one of their suggested parent species in the HCA dendrogram and the PLS-DA score plot. These results clearly demonstrate that the metabolite profiles of Citrus species can be utilized for the taxonomic classification of the genus and are complementary to the existing taxonomic evidence, especially for the identification and differentiation of hybrid species.
Abstract:
This paper compares the responses of conventional and transgenic soybean to glyphosate application in terms of the contents of 17 detectable soluble amino acids in leaves, analyzed by HPLC with fluorescence detection. Glutamate, histidine, asparagine, arginine + alanine, glycine + threonine and isoleucine increased in conventional soybean leaves when compared to transgenic soybean leaves, whereas for the other amino acids no significant differences were recorded. Univariate analysis allowed an approximate differentiation between conventional and transgenic lines by observing the changes in some variables after glyphosate application. In addition, multivariate analysis using principal components analysis (PCA), cluster analysis (CA) and linear discriminant analysis (LDA) made it possible to identify and discriminate different groups based on the soybean genetic origin. © 2011 Elsevier Inc. All rights reserved.
Abstract:
In this work, 50 ceramic fragments from the Lago Grande archaeological site and 30 from the Osvaldo archaeological site were compared to assess elemental similarities. The aim is to perform a preliminary comparison between the sites, which are located in the central Amazon, Brazil. The analytical technique employed to obtain the elemental composition of the ceramics was instrumental neutron activation analysis (INAA). The data set obtained was explored by the multivariate statistical techniques of cluster, principal component and discriminant analysis. The analyzed elements were Na, Lu, U, Yb, La, Th, Cr, Cs, Sc, Fe, Eu, Ce and Hf. The results showed the existence of at least two compositional groups for Lago Grande and Osvaldo. Each compositional group of the Osvaldo archaeological site matches one group of Lago Grande. Correlated with the archaeological background, the results suggest commercial or cultural exchange in the region, which is indicative of socio-cultural interactions between those sites.
Abstract:
Background: Prostate cancer is a leading cause of death in the male population; therefore, a comprehensive study of the genes and the molecular networks involved in the tumoral prostate process is necessary. In order to understand the biological process behind potential biomarkers, we have analyzed a set of 57 cDNA microarrays containing ~25,000 genes. Results: Principal Component Analysis (PCA) combined with Maximum-entropy Linear Discriminant Analysis (MLDA) was applied in order to identify genes with the most discriminative information between normal and tumoral prostatic tissues. Data analysis was carried out using three different approaches, namely: (i) differences in gene expression levels between normal and tumoral conditions from a univariate point of view; (ii) in a multivariate fashion using MLDA; and (iii) with a dependence network approach. Our results show that malignant transformation in the prostatic tissue is more related to functional connectivity changes in the dependence networks than to differential gene expression. The MYLK, KLK2, KLK3, HAN11, LTF, CSRP1 and TGM4 genes presented significant changes in their functional connectivity between normal and tumoral conditions and were also classified as the top seven most informative genes for the prostate cancer genesis process by our discriminant analysis. Moreover, among the identified genes we found classically known biomarkers and genes closely related to tumoral prostate, such as KLK3 and KLK2, as well as several other potential ones. Conclusion: We have demonstrated that changes in functional connectivity may be implicit in the biological process that renders some genes more informative for discriminating between normal and tumoral conditions. Using the proposed method, MLDA, to analyze the multivariate characteristics of genes, it was possible to capture the changes in dependence networks that are related to cell transformation.
Abstract:
Analysts, politicians and international players from all over the world look at China as one of the most powerful countries on the international scene, and as a country whose economic development can significantly affect the economies of the rest of the world. However, many aspects of this country have yet to be investigated. First among these is the still fundamental role played by Chinese rural areas in the general development of the country from a political, economic and social point of view. In particular, the way in which rural areas have influenced the social stability of the whole country has been widely discussed because of their close relationship with the urban areas, to which most people from the countryside emigrate in search of a job and a better life. In recent years many studies have focused mostly on the urbanization phenomenon, with little interest in the living conditions in rural areas and in the deep changes that have occurred in some mainly agricultural provinces. An analysis of the level of infrastructure is one of the main aspects that highlight the principal differences in living conditions between rural and urban areas. In this thesis, I first carried out the analysis through a multivariate statistics approach (Principal Component Analysis and Cluster Analysis) in order to define a new map of rural areas based on the analysis of living conditions. In the second part I elaborated an index (Living Conditions Index, LCI) through a Fuzzy Expert/Inference System. Finally, I compared this index to the results obtained from the cluster analysis by drawing geographic maps. The data source is the second national agricultural census of China, carried out in 2006. In particular, I analysed data that refer to villages but are aggregated at the province level.
Abstract:
Dahl salt-sensitive (DS) and salt-resistant (DR) inbred rat strains represent a well-established animal model for cardiovascular research. Upon prolonged administration of a high-salt-containing diet, DS rats develop systemic hypertension and, as a consequence, left ventricular hypertrophy followed by heart failure. The aim of this work was to explore whether this animal model is suitable for identifying biomarkers that characterize defined stages of cardiac pathophysiological conditions. The work had to be performed in two stages: in the first, proteomic differences attributable to the two separate rat lines (DS and DR) had to be established, and in the second, the development of heart failure due to feeding the rats a high-salt-containing diet had to be monitored. This work describes the results of the first stage, namely protein expression profiles of left ventricular tissues of DS and DR rats kept on a low-salt diet. A substantial extent of quantitative and qualitative expression differences between the two strains of Dahl rats in heart tissue was detected. Using Principal Component Analysis, Linear Discriminant Analysis and other statistical means, we have established sets of differentially expressed proteins that are candidates for further molecular analysis of heart failure mechanisms.
Abstract:
Classical liquid-state high-resolution (HR) NMR spectroscopy has proved a powerful tool in the metabonomic analysis of liquid food samples such as fruit juices. In this paper the application of 1H high-resolution magic angle spinning (HR-MAS) NMR spectroscopy to apple tissue is presented, probing its potential for metabonomic studies. The 1H HR-MAS NMR spectra are discussed in terms of the chemical composition of apple tissue and compared to liquid-state NMR spectra of apple juice. Differences indicate that specific metabolic changes are induced by juice preparation. The feasibility of HR-MAS NMR-based multivariate analysis is demonstrated by a study distinguishing three different apple cultivars by principal component analysis (PCA). Preliminary results are shown from subsequent studies comparing three different cultivation methods by means of PCA and partial least squares discriminant analysis (PLS-DA) of the HR-MAS NMR data. The compounds responsible for discriminating organically grown apples are discussed. Finally, an outlook on our ongoing work is given, including a longitudinal study on apples.
Abstract:
High altitude periodic breathing (PB) shares some common pathophysiologic aspects with sleep apnea, Cheyne-Stokes respiration and PB in heart failure patients. Methods that allow quantifying instabilities of respiratory control provide valuable insights into physiologic mechanisms and help to identify therapeutic targets. Under the hypothesis that high altitude PB appears even during physical activity and can be identified, in comparison to visual analysis, in conditions of low signal-to-noise ratio (SNR), this study aims to identify PB by characterizing the respiratory pattern through the respiratory volume signal. A number of spectral parameters are extracted from the power spectral density (PSD) of the volume signal, derived from respiratory inductive plethysmography, and evaluated through a linear discriminant analysis. A dataset of 34 healthy mountaineers ascending Mt. Muztagh Ata, China (7,546 m), visually labeled as PB and non-periodic breathing (nPB), is analyzed. All climbing periods within all the ascents are considered (total climbing periods: 371 nPB and 40 PB). The best cross-validated result classifying PB and nPB is obtained with Pm (power of the modulation frequency band) and R (ratio between modulation and respiration power), with an accuracy of 80.3% and an area under the receiver operating characteristic curve of 84.5%. The effect of acclimatization is evaluated by comparing the subjects from the first and second ascents (at the same altitudes, but the latter more acclimatized). SaO2 and periodic breathing cycles significantly increased with acclimatization (p-value < 0.05). Higher Pm and higher respiratory frequencies are observed at lower SaO2, through a significant negative correlation (p-value < 0.01). Higher Pm is observed at climbing periods visually labeled as PB with > 5 periodic breathing cycles, through a significant positive correlation (p-value < 0.01).
Our data demonstrate that quantification of the respiratory volume signal using spectral analysis is suitable to identify effects of hypobaric hypoxia on control of breathing.
Abstract:
OBJECT: In this study, 1H magnetic resonance (MR) spectroscopy was prospectively tested as a reliable method for presurgical grading of neuroepithelial brain tumors. METHODS: Using a database of tumor spectra obtained in patients with histologically confirmed diagnoses, 94 consecutive untreated patients were studied using single-voxel 1H spectroscopy (point-resolved spectroscopy; TE 135 msec, TR 1500 msec). A total of 90 tumor spectra obtained in patients with diagnostic 1H MR spectroscopy examinations were analyzed using commercially available software (MRUI/VARPRO) and classified using linear discriminant analysis as World Health Organization (WHO) Grade I/II, WHO Grade III, or WHO Grade IV lesions. In all cases, the classification results were matched with histopathological diagnoses that were made according to the WHO classification criteria after serial stereotactic biopsy procedures or open surgery. Histopathological studies revealed 30 Grade I/II tumors, 29 Grade III tumors, and 31 Grade IV tumors. The reliability of the histological diagnoses was validated considering a minimum postsurgical follow-up period of 12 months (range 12-37 months). Classifications based on spectroscopic data yielded 31 tumors in Grade I/II, 32 in Grade III, and 27 in Grade IV. Incorrect classifications included two Grade II tumors, one of which was identified as Grade III and one as Grade IV; two Grade III tumors identified as Grade II; two Grade III lesions identified as Grade IV; and six Grade IV tumors identified as Grade III. Furthermore, one glioblastoma (WHO Grade IV) was classified as WHO Grade I/II. This represents an overall success rate of 86%, and a 95% success rate in differentiating low-grade from high-grade tumors. CONCLUSIONS: The authors conclude that in vivo 1H MR spectroscopy is a reliable technique for grading neuroepithelial brain tumors.
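The counts in the abstract above are enough to rebuild a confusion matrix and recompute the success rates. The sketch below does that under two assumptions: the listed misclassifications are exhaustive, and "low-grade" means Grade I/II while "high-grade" means Grades III and IV. The variable names are illustrative.

```python
grades = ["I/II", "III", "IV"]

# Rows: histopathological grade; columns: spectroscopic classification.
# Off-diagonal entries come from the misclassifications listed in the abstract;
# diagonal entries are the row totals (30, 29, 31) minus those errors.
confusion = [
    [28, 1, 1],   # Grade I/II: one called III, one called IV
    [2, 25, 2],   # Grade III: two called II, two called IV
    [1, 6, 24],   # Grade IV: one called I/II, six called III
]

true_totals = [sum(row) for row in confusion]
predicted_totals = [sum(col) for col in zip(*confusion)]
correct = sum(confusion[i][i] for i in range(len(grades)))
total = sum(true_totals)
overall_accuracy = correct / total

# Low-grade vs high-grade: confusing III with IV no longer counts as an error.
low_high_correct = confusion[0][0] + sum(
    confusion[i][j] for i in (1, 2) for j in (1, 2)
)
low_high_accuracy = low_high_correct / total

print(true_totals, predicted_totals)
print(f"overall: {overall_accuracy:.1%}, low vs high: {low_high_accuracy:.1%}")
```

The column totals reproduce the reported 31, 32 and 27 spectroscopic classifications, and 77 of 90 correct gives the reported 86%. The two-class figure computed this way is 85 of 90, about 94.4%, which is close to but not exactly the reported 95%; the abstract does not say how that figure was derived, so the grouping used here may differ from the authors'.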
Abstract:
Recently it has been proposed that the evaluation of effects of pollutants on aquatic organisms can provide an early warning system of potential environmental and human health risks (NRC 1991). Unfortunately, there are few methods available to aquatic biologists to conduct assessments of the effects of pollutants on aquatic animal community health. The primary goal of this research was to develop and evaluate the feasibility of such a method. Specifically, the primary objective of this study was to develop a prototype rapid bioassessment technique similar to the Index of Biotic Integrity (IBI) for the upper Texas and Northwestern Gulf of Mexico coastal tributaries. The IBI consists of a series of "metrics" which describe specific attributes of the aquatic community. Each of these metrics is given a score, and the scores are then summed to derive a total assessment of the "health" of the aquatic community. This IBI procedure may provide an additional assessment tool for professionals in water quality management. The experimental design consisted primarily of compiling previously collected data from monitoring conducted by the Texas Natural Resource Conservation Commission (TNRCC) at five bayous classified according to potential for anthropogenic impact and salinity regime. Standardized hydrological, chemical, and biological monitoring had been conducted in each of these watersheds. The identification and evaluation of candidate metrics for inclusion in the estuarine IBI was conducted through the use of correlation analysis, cluster analysis, stepwise and normal discriminant analysis, and evaluation of cumulative distribution frequencies. Scores of each included metric were determined based on exceedances of specific percentiles. Individual scores were summed and a total IBI score and rank for the community computed. Results of these analyses yielded the proposed metrics and rankings listed in this report.
Based on the results of this study, incorporation of an estuarine IBI method as a water quality assessment tool is warranted. Adopted metrics were correlated to seasonal trends and less so to salinity gradients observed during the study (0-25 ppt). Further refinement of this method is needed using a larger, more inclusive data set which includes additional habitat types, salinity ranges, and temporal variation.
Abstract:
The purpose of this study was to examine the relationship between enterotoxigenic Escherichia coli (ETEC) and travelers' diarrhea over a period of five years in Guadalajara, Mexico. Specifically, this study identified and characterized ETEC from travelers with diarrhea. The objectives were to study the colonization factor antigens, toxins and antibiotic sensitivity patterns in ETEC from 1992 to 1997 and to study the molecular epidemiology of ETEC by plasmid content and DNA restriction fragment patterns. In this survey of travelers' diarrhea in Guadalajara, Mexico, 928 travelers with diarrhea were screened for enteric pathogens between 1992 and 1997. ETEC were isolated in 195 (19.9%) of the patients, representing the most frequent enteric pathogen identified. A total of 31 antimicrobial susceptibility patterns were identified among ETEC isolates over the five-year period. The 195 ETEC isolates contained two to six plasmids each, which ranged in size from 2.0 to 23 kbp. Three different reproducible rRNA gene restriction patterns (ribotypes R-1 to R-3) were obtained among the 195 isolates with the enzyme HindIII. Colonization factor antigens (CFAs) were identified in 99 (51%) of the 195 ETEC strains studied. Cluster analyses of the observations from the four assays all confirmed five distinct groups of study-year strains of ETEC. Each group had a >95% similarity level of strains within the group and a <60% similarity level between the groups. In addition, discriminant analysis of the assay variables used in predicting the ETEC strains revealed a >80% relationship between both the plasmid and rRNA content of ETEC strains and study-year. These findings, based on laboratory observations of the differences in biochemical, antimicrobial susceptibility, plasmid and ribotype content, suggest a complex epidemiology for ETEC strains in a population with travelers' diarrhea.
The findings of this study may have implications for our understanding of the epidemiology, transmission, treatment, control and prevention of the disease. It has been suggested that an ETEC vaccine for humans should contain the most prevalent CFAs. Therefore, it is important to know the prevalence of these factors in ETEC in various geographical areas. CFAs described in this dissertation may be used in different epidemiological studies in which the prevalence of CFAs and other properties of ETEC will be evaluated. Furthermore, in spite of an intense search in nearly 200 ETEC isolates for strains that may have a clonal relationship, we failed to identify such strains. However, further studies are in progress to construct suitable live vaccine strains and to introduce several CFAs in the same host organism by recombinant DNA techniques (Dr. Ann-Mari Svennerholm's lab). (Abstract shortened by UMI.)
Abstract:
The role of Soil Organic Carbon (SOC) in mitigating climate change and in indicating soil quality and ecosystem function has created research interest in the nature of SOC at the landscape level. The objective of this study was to examine the variation and distribution of SOC under long-term land management at watershed and plot level. This study was based on a meta-analysis of three case studies and 128 surface soil samples from Ethiopia. Three sites (Gununo, Anjeni and Maybar) were compared after considering two Land Management Categories (LMC) and three Land Use Types (LUT) in a quasi-experimental design. Shapiro-Wilk tests showed a non-normal distribution of the data (p = 0.002, α = 0.05). The SOC median value showed the effect of long-term land management, with values of 2.29 and 2.38 g kg⁻¹ for less and better-managed watersheds, respectively. SOC values were 1.7, 2.8 and 2.6 g kg⁻¹ for Crop (CLU), Grass (GLU) and Forest Land Use (FLU), respectively. The rank order for SOC variability was FLU>GLU>CLU. Mann-Whitney U and Kruskal-Wallis tests showed a significant difference in the medians and distribution of SOC among the LUT and between soil profiles (p<0.05, confidence interval 95%, α = 0.05), while the difference was not significant (p>0.05) for LMC. The mean and sum of ranks from the Mann-Whitney U and Kruskal-Wallis tests also showed the difference at watershed and plot level. Using SOC as a predictor, cross-validated correct classification with discriminant analysis was 46% and 49% for LUT and LMC, respectively. The study showed how to categorize landscapes using SOC with respect to land management for decision-makers.
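The Mann-Whitney U statistic used in the abstract above can be illustrated with a short pairwise count. This is a sketch only: the SOC values are hypothetical, the function name is an assumption, and no p-value is computed.

```python
def mann_whitney_u(a, b):
    """Return (U_a, U_b): pairwise wins of each sample, ties counting one half."""
    u_a = sum((x > y) + 0.5 * (x == y) for x in a for y in b)
    return u_a, len(a) * len(b) - u_a

# Hypothetical SOC values in g/kg for two land uses (not data from the study).
crop_soc = [1.5, 1.7, 1.9, 1.6]
grass_soc = [2.6, 2.8, 3.0, 2.7]

u_crop, u_grass = mann_whitney_u(crop_soc, grass_soc)
print(u_crop, u_grass)
```

A U of zero for one group means every value in it lies below every value in the other group, which is the most extreme separation the statistic can express. The test works on ranks rather than raw values, which is why it suits data that fail a normality check.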
Abstract:
Objective: This study examined the recent trends and characteristics of reported pertussis in Harris County from 2005 to 2010. Methods: The study population included surveillance data from all pertussis cases reported to Harris County Public Health and Environmental Services (HCPHES) from January 1, 2005 to December 31, 2010. We calculated incidence and attack rates for varying age groups, race/ethnicity, and gender. Hot spot and cluster analyses of incident cases were conducted across Harris County census tracts. Maps were constructed using a geographic information system. Results: Age-specific incidence rates of reported cases of pertussis were highest among infants under a year of age and lowest among adults age 20 and older. Hispanics accounted for more reported cases than any other racial or ethnic group (42% of 483 cases). Age-adjusted rates were highest in 2009 at 9.81 cases per 100,000 population. Only 31.2% of people received at least four of the recommended five doses of vaccine. Spatial analyses revealed statistically significant clusters within the northeast region of Harris County. Conclusions: Hispanic infants are the group most at risk for pertussis. Although 70% of cases had a history of immunization, 41.8% of infants were appropriately vaccinated for their age. Increased vaccination coverage may decrease the incidence of pertussis.
Abstract:
Background: Objective assessment of psychomotor skills has become an important challenge in the training of minimally invasive surgical (MIS) techniques. Currently, no gold standard defining surgical competence exists for classifying residents according to their surgical skills. Supervised classification has been proposed as a means for objectively establishing competence thresholds in psychomotor skills evaluation. This report presents a study comparing three classification methods to establish their validity in a set of tasks for basic skills assessment. Methods: Linear discriminant analysis (LDA), support vector machines (SVM), and adaptive neuro-fuzzy inference systems (ANFIS) were used. A total of 42 participants, divided into an experienced group (4 expert surgeons and 14 residents with >10 laparoscopic surgeries performed) and a nonexperienced group (16 students and 8 residents with <10 laparoscopic surgeries performed), performed three box trainer tasks validated for assessment of MIS psychomotor skills. Instrument movements were captured using the TrEndo tracking system, and nine motion analysis parameters (MAPs) were analyzed. The performance of the classifiers was measured by leave-one-out cross-validation using the scores obtained by the participants. Results: The mean accuracies of the classifiers were 71% (LDA), 78.2% (SVM), and 71.7% (ANFIS). No statistically significant differences in performance were identified between the classifiers. Conclusions: The three proposed classifiers showed good performance in the discrimination of skills, especially when information from all MAPs and tasks combined was considered. A correlation between the surgeons' previous experience and their execution of the tasks could be ascertained from the results. However, misclassifications across all the classifiers could imply the existence of other factors influencing psychomotor competence.
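Leave-one-out cross-validation, as used in the abstract above, can be sketched with a deliberately simple stand-in classifier. The nearest-class-mean rule below is what LDA reduces to with a single feature, equal priors and a shared variance; it is not the study's multi-parameter LDA, SVM or ANFIS. The scores, labels and function names are hypothetical.

```python
from statistics import mean

def fit_class_means(xs, ys):
    """Estimate one mean per class from the training scores."""
    return {c: mean(x for x, y in zip(xs, ys) if y == c) for c in set(ys)}

def predict(class_means, x):
    """Assign x to the class whose mean is closest."""
    return min(class_means, key=lambda c: abs(x - class_means[c]))

def loocv_accuracy(xs, ys):
    """Hold out each participant once, train on the rest, and score the guess."""
    hits = 0
    for i in range(len(xs)):
        model = fit_class_means(xs[:i] + xs[i + 1:], ys[:i] + ys[i + 1:])
        hits += predict(model, xs[i]) == ys[i]
    return hits / len(xs)

# Hypothetical task scores (e.g. completion time in seconds), not study data.
scores = [12.0, 14.5, 13.2, 15.1, 22.3, 24.0, 21.7, 19.0]
labels = ["experienced"] * 4 + ["nonexperienced"] * 4

accuracy = loocv_accuracy(scores, labels)
print(accuracy)
```

With small samples such as the 42 participants described, holding out one case at a time uses nearly all the data for training in every fold, which is a common reason for choosing this scheme.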