822 resultados para discriminant analysis and cluster analysis


Relevância:

100.00% 100.00%

Publicador:

Resumo:

Mycoplasma hyopneumoniae is the etiological agent of enzootic pneumonia in swine. Various reports indicate that different strains are circulating in the swine population. We investigated the variety of M. hyopneumoniae strains by a newly developed genetic typing method based on the polyserine repeat motif of the LppS homolog P146. PCR amplification using M. hyopneumoniae specific, conserved primers flanking the region encoding the repeat motif, followed by sequencing and cluster analysis was carried out. The study included strains isolated from different geographic regions as well as lysates from lung swabs from a series of pig farms in Switzerland. High diversity of M. hyopneumoniae was observed but farms being in close geographic or operative contact generally seemed to be affected by the same strains. Moreover, analysis of multiple samples from single pig farms indicated that these harbored the same, farm-specific strain. The results indicate that multiple strains of M. hyopneumoniae are found in the swine population but that specific strains or clones are responsible for local outbreaks. The method presented is a highly reproducible epidemiologic tool allowing direct typing of M. hyopneumoniae from clinical material without prior isolation and cultivation of strains.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

In 1998-2001 Finland suffered the most severe insect outbreak ever recorded, over 500,000 hectares. The outbreak was caused by the common pine sawfly (Diprion pini L.). The outbreak has continued in the study area, Palokangas, ever since. To find a good method to monitor this type of outbreaks, the purpose of this study was to examine the efficacy of multi-temporal ERS-2 and ENVISAT SAR imagery for estimating Scots pine (Pinus sylvestris L.) defoliation. Three methods were tested: unsupervised k-means clustering, supervised linear discriminant analysis (LDA) and logistic regression. In addition, I assessed if harvested areas could be differentiated from the defoliated forest using the same methods. Two different speckle filters were used to determine the effect of filtering on the SAR imagery and subsequent results. The logistic regression performed best, producing a classification accuracy of 81.6% (kappa 0.62) with two classes (no defoliation, >20% defoliation). LDA accuracy was with two classes at best 77.7% (kappa 0.54) and k-means 72.8 (0.46). In general, the largest speckle filter, 5 x 5 image window, performed best. When additional classes were added the accuracy was usually degraded on a step-by-step basis. The results were good, but because of the restrictions in the study they should be confirmed with independent data, before full conclusions can be made that results are reliable. The restrictions include the small size field data and, thus, the problems with accuracy assessment (no separate testing data) as well as the lack of meteorological data from the imaging dates.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

OBJECTIVE: To explore the feasibility and psychometric properties of a self-administered version of the 24-item Geriatric Pain Measure (GPM-24-SA). DESIGN: Secondary analysis of baseline data from the Prevention in Older People-Assessment in Generalists' practices trial, an international multi-center study of a health-risk appraisal system. PARTICIPANTS: One thousand seventy-two community dwelling nondisabled older adults self-reporting pain from London, UK; Hamburg, Germany; and Solothurn, Switzerland. OUTCOME MEASURES: GPM-24-SA as part of a multidimensional Health Risk Appraisal Questionnaire including self-reported demographic and health-related information. RESULTS: Among the 1,072 subjects, 655 had complete GPM-24-SA data, 404 had and 13 had >30% missing GPM-24-SA data. In psychometric analyses across the three European populations with complete GPM-24-SA data, the measure exhibited stable internal consistency, good convergent, divergent and discriminant validity, and produced stable pain measurements. However, factor analysis indicated differences in the GPM-24-SA across sites with discrepancies mainly related to items of a single subscale that failed to load appropriately. Analyses including imputation for subjects with and uncertainty in factor structure, further refinement and psychometric evaluation of the GPM-24-SA is needed before it could be recommended for widespread use.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

In this study, we demonstrate the power of applying complementary DNA (cDNA) microarray technology to identifying candidate loci that exhibit subtle differences in expression levels associated with a complex trait in natural populations of a nonmodel organism. Using a highly replicated experimental design involving 180 cDNA microarray experiments, we measured gene-expression levels from 1098 transcript probes in 90 individuals originating from six brown trout (Salmo trutta) and one Atlantic salmon (Salmo salar) population, which follow either a migratory or a sedentary life history. We identified several candidate genes associated with preparatory adaptations to different life histories in salmonids, including genes encoding for transaldolase 1, constitutive heat-shock protein HSC70-1 and endozepine. Some of these genes clustered into functional groups, providing insight into the physiological pathways potentially involved in the expression of life-history related phenotypic differences. Such differences included the down-regulation of genes involved in the respiratory system of future migratory individuals. In addition, we used linear discriminant analysis to identify a set of 12 genes that correctly classified immature individuals as migratory or sedentary with high accuracy. Using the expression levels of these 12 genes, 17 out of 18 individuals used for cross-validation were correctly assigned to their respective life-history phenotype. Finally, we found various candidate genes associated with physiological changes that are likely to be involved in preadaptations to seawater in anadromous populations of the genus Salmo, one of which was identified to encode for nucleophosmin 1. Our findings thus provide new molecular insights into salmonid life-history variation, opening new perspectives in the study of this complex trait.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The cultivation of dessert apples has to meet the consumer's increasing demand for high fruit quality and a sustainable mostly residue-free production while ensuring a competitive agricultural productivity. It is therefore of great interest to know the impact of different cultivation methods on the fruit quality and the chemical composition, respectively. Previous studies have demonstrated the feasibility of High Resolution Magic Angle Spinning (HR-MAS) NMR spectroscopy directly performed on apple tissue as analytical tool for metabonomic studies. In this study, HR-MAS NMR spectroscopy is applied to apple tissue to analyze the metabolic profiles of apples grown under 3 different cultivation methods. Golden Delicious apples were grown applying organic (Bio), integrated (IP) and low-input (LI) plant protection strategies. A total of 70 1H HR-MAS NMR spectra were analyzed by means of principle component analysis (PCA) and partial least squares discriminant analysis (PLS-DA). Apples derived from Bio-production could be well separated from the two other cultivation methods applying both, PCA and PLS-DA. Apples obtained from integrated (IP) and low-input (LI) production discriminated when taking the third PLS-component into account. The identified chemical composition and the compounds responsible for the separation, i.e. the PLS-loadings, are discussed. The results are compared with fruit quality parameters assessed by conventional methods. The present study demonstrates the potential of HR-MAS NMR spectroscopy of fruit tissue as analytical tool for finding markers for specific fruit production conditions like the cultivation method.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Background: Few studies have examined the 20% of individuals who never experience an episode of low back pain (LBP). To date, no investigation has been undertaken that examines a group who claim to have never experienced LBP in their lifetime in comparison to two population-based case–control groups with and without momentary LBP. This study investigates whether LBP-resilient workers between 50 and 65 years had better general health, demonstrated more positive health behaviour and were better able to achieve routine activities compared with both case–control groups. Methods: Forty-two LBP-resilient participants completed the same pain assessment questionnaire as a population-based LBP sample from a nationwide, large-scale cross-sectional survey in Switzerland. The LBP-resilient participants were pairwise compared to the propensity score-matched case controls by exploring differences in demographic and work characteristics, and by calculating odds ratios (ORs) and effect sizes. A discriminant analysis explored group differences, while the multiple logistic regression analysis specified single indicators which accounted for group differences. Results: LBP-resilient participants were healthier than the case controls with momentary LBP and achieved routine activities more easily. Compared to controls without momentary LBP, LBP-resilient participants had a higher vitality, a lower workload, a healthier attitude towards health and behaved more healthily by drinking less alcohol. Conclusions: By demonstrating a difference between LBP-resilient participants and controls without momentary LBP, the question that arises is what additional knowledge can be attained. Three underlying traits seem to be relevant about LBP-resilient participants: personality, favourable work conditions and subjective attitudes/attributions towards health. These rationales have to be considered with respect to LBP prevention.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

This study compared three body measurements, height, hip width (bitrochanteric) and foot length, in 120 Hispanic women who had their first birth by cesarean section (N = 60) or by spontaneous vaginal delivery (N = 60). The objective of the study was to see if there were differences in these measurements that could be useful in predicting cephalopelvic disproportion. Data were collected from two public hospitals in Houston Texas over a 10 month period from December 1994 to October 1995. The statistical technique used to evaluate the measures was discriminant analysis.^ Women who delivered by cesarean section were older, shorter, had shorter feet and delivered heavier infants. There were no differences in the bitrochanteric widths of the women or in the mean gestational age or Apgar scores of the infants.^ Significantly more of the mothers and infants were ill following cesarean section delivery. Maternal illness was usually infection; infant illness was primarily infection or respiratory difficulties.^ Discriminant analysis is a technique which allows for classification and prediction to which group a particular entity will belong given a certain set of variables. Using discriminant analysis, with a probability of cesarean section 50 percent, the best combination to classify who would have a cesarean section was height and hip width, correctly classifying 74.2 percent of those who needed surgery. When the probability of cesarean section was 10 percent and probability of vaginal delivery was 90 percent, the best predictor of who would need operative delivery was height, hip width and age, correctly classifying 56.2 percent. In the population from which the study participants were selected the incidence of cephalopelvic disproportion was low, approximately 1 percent.^ With the technologic assistance available in most of the developed world, it is likely that the further pursuit of different measures and their use would not be of much benefit in attempting to predict and diagnose disproportion. However, in areas of the world where much of obstetrics is "hands on", the availability of technology extremely limited, and the incidence of disproportion larger, the use of anthropometric measures might be useful and of some potential benefit. ^

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Most cows encounter a state of negative energy balance during the periparturient period, which may lead to metabolic disorders and impaired fertility. The aim of this study was to assess the potential of milk fatty acids as diagnostic tools of detrimental levels of blood plasma nonesterified fatty acids (NEFA), defined as NEFA concentrations beyond 0.6 mmol/L, in a data set of 92 early lactating cows fed a glucogenic or lipogenic diet and subjected to 0-, 30-, or 60-d dry period before parturition. Milk was collected in wk 2, 3, 4, and 8 (n = 368) and blood was sampled weekly from wk 2 to 8 after parturition. Milk was analyzed for milk fatty acids and blood plasma for NEFA. Data were classified as "at risk of detrimental blood plasma NEFA" (NEFA ≥ 0.6 mmol/L) and "not at risk of detrimental blood plasma NEFA" (NEFA <0.6 mmol/L). Concentrations of 45 milk fatty acids and milk fat C18:1 cis-9-to-C15:0 ratio were subjected to a discriminant analysis. Milk fat C18:1 cis-9 revealed the most discriminating variable to identify detrimental blood plasma NEFA. A false positive rate of 10% allowed us to diagnose 46% of the detrimental blood plasma NEFA cases based on a milk fat C18:1 cis-9 concentration of at least 230 g/kg of milk fatty acids. Additionally, it was assessed whether the milk fat C18:1 cis-9 concentrations of wk 2 could be used as an early warning for detrimental blood plasma NEFA risk during the first 8 wk in lactation. Cows with at least 240 g/kg of C18:1 cis-9 in milk fat had about 50% chance to encounter blood plasma NEFA values of 0.6 mmol/L or more during the first 8 wk of lactation, with a false positive rate of 11.4%. Profit simulations were based on costs for cows suffering from detrimental blood plasma NEFA, and costs for preventive treatment based on daily dosing of propylene glycol for 3 wk. Given the relatively low incidence rate (8% of all observations), continuous monitoring of milk fatty acids during the first 8 wk of lactation to diagnose detrimental blood plasma NEFA does not seem cost effective. On the contrary, milk fat C18:1 cis-9 of the second lactation week could be an early warning of cows at risk of detrimental blood NEFA. In this case, selective treatment may be cost effective.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

In population studies, most current methods focus on identifying one outcome-related SNP at a time by testing for differences of genotype frequencies between disease and healthy groups or among different population groups. However, testing a great number of SNPs simultaneously has a problem of multiple testing and will give false-positive results. Although, this problem can be effectively dealt with through several approaches such as Bonferroni correction, permutation testing and false discovery rates, patterns of the joint effects by several genes, each with weak effect, might not be able to be determined. With the availability of high-throughput genotyping technology, searching for multiple scattered SNPs over the whole genome and modeling their joint effect on the target variable has become possible. Exhaustive search of all SNP subsets is computationally infeasible for millions of SNPs in a genome-wide study. Several effective feature selection methods combined with classification functions have been proposed to search for an optimal SNP subset among big data sets where the number of feature SNPs far exceeds the number of observations. ^ In this study, we take two steps to achieve the goal. First we selected 1000 SNPs through an effective filter method and then we performed a feature selection wrapped around a classifier to identify an optimal SNP subset for predicting disease. And also we developed a novel classification method-sequential information bottleneck method wrapped inside different search algorithms to identify an optimal subset of SNPs for classifying the outcome variable. This new method was compared with the classical linear discriminant analysis in terms of classification performance. Finally, we performed chi-square test to look at the relationship between each SNP and disease from another point of view. ^ In general, our results show that filtering features using harmononic mean of sensitivity and specificity(HMSS) through linear discriminant analysis (LDA) is better than using LDA training accuracy or mutual information in our study. Our results also demonstrate that exhaustive search of a small subset with one SNP, two SNPs or 3 SNP subset based on best 100 composite 2-SNPs can find an optimal subset and further inclusion of more SNPs through heuristic algorithm doesn't always increase the performance of SNP subsets. Although sequential forward floating selection can be applied to prevent from the nesting effect of forward selection, it does not always out-perform the latter due to overfitting from observing more complex subset states. ^ Our results also indicate that HMSS as a criterion to evaluate the classification ability of a function can be used in imbalanced data without modifying the original dataset as against classification accuracy. Our four studies suggest that Sequential Information Bottleneck(sIB), a new unsupervised technique, can be adopted to predict the outcome and its ability to detect the target status is superior to the traditional LDA in the study. ^ From our results we can see that the best test probability-HMSS for predicting CVD, stroke,CAD and psoriasis through sIB is 0.59406, 0.641815, 0.645315 and 0.678658, respectively. In terms of group prediction accuracy, the highest test accuracy of sIB for diagnosing a normal status among controls can reach 0.708999, 0.863216, 0.639918 and 0.850275 respectively in the four studies if the test accuracy among cases is required to be not less than 0.4. On the other hand, the highest test accuracy of sIB for diagnosing a disease among cases can reach 0.748644, 0.789916, 0.705701 and 0.749436 respectively in the four studies if the test accuracy among controls is required to be at least 0.4. ^ A further genome-wide association study through Chi square test shows that there are no significant SNPs detected at the cut-off level 9.09451E-08 in the Framingham heart study of CVD. Study results in WTCCC can only detect two significant SNPs that are associated with CAD. In the genome-wide study of psoriasis most of top 20 SNP markers with impressive classification accuracy are also significantly associated with the disease through chi-square test at the cut-off value 1.11E-07. ^ Although our classification methods can achieve high accuracy in the study, complete descriptions of those classification results(95% confidence interval or statistical test of differences) require more cost-effective methods or efficient computing system, both of which can't be accomplished currently in our genome-wide study. We should also note that the purpose of this study is to identify subsets of SNPs with high prediction ability and those SNPs with good discriminant power are not necessary to be causal markers for the disease.^

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Using a retrospective cross-sectional approach, this study quantitatively analyzed foodborne illness data, restaurant inspection data, and census-derived socioeconomic and demographic data within Harris County, Texas between 2005 and 2010. The main research question investigated involved determining the extent to which contextual and regulatory conditions distinguish outbreak and non-outbreak establishments within Harris County. Two groups of Harris County establishments were analyzed: outbreak and non-outbreak restaurants. STATA 11 was employed to determine the average profiles of each category across both the regulatory and socioeconomic (contextual) variables. Cross tabulations of all of the non-quantitative variables were also performed, and finally, a discriminant analysis was conducted to assess how well the variables were able to allocate the restaurants into their respective categories. Contextual and regulatory conditions were found to be minimally associated with the occurrence of foodborne outbreaks within Harris County. Across both the categories (outbreak and non-outbreak establishments), variables included were extremely similar in means, and when possible to observe, distributions. The variables analyzed in this study, both regulatory and contextual, were not found to significantly allocate the establishments into their correct outbreak or non-outbreak categories. The implications of these findings are that regulatory processes and guidelines in place in Harris County do not effectively to distinguish outbreak from non-outbreak restaurants. Additionally, no socioeconomic or racial/ethnic patterns are apparent in the incidence of foodborne disease in the county. ^

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The evolution of the Southern Ocean climate during the late Eocene-late Oligocene interval is examined through highresolution, quantitative calcareous nannofossil analyses on samples from the Southern Ocean sections on Maud Rise and Kerguelen Plateau. We determined the abundance patterns of the counted species to clarify the biostratigraphy, which we correlated with high-resolution magnetostratigraphy [Roberts, A.P., Bicknell, S.J., Byatt, J., Bohaty, S.M., Florindo, F., Harwood, D.M., 2003a. Magnetostratigraphic calibration of Southern Ocean diatom datums from the Eocene-Oligocene of Kerguelen Plateau (Ocean Drilling Program Sites 744 and 748). In: Florindo, F., Cooper, A.K., O'Brien, P.A. (Eds.), Antarctic Cenozoic Palaeoenvironments: Geologic Record and Models. Palaeogeogr., Palaeoclimatol., Palaeoecol. 198 145-168; Florindo, F., Roberts, A.P., in press. Eocene-Oligocene magnetobiochronology of ODP Sites 689 and 690, Maud Rise, Weddell Sea, Antarctica. Geol. Soc. Am. Bull.], and used this data to interpret paleoceanographic changes through the late Eocene to late Oligocene. Percentage plots of the individual species, compared with R-mode principal component and cluster analysis results, allowed us to divide the assemblages into three groups: temperate-water taxa, cool-water taxa, and no temperature-affinity taxa. We attempt correlations between these paleoecological groups and the major sea-surface temperature (SST) variations with tectonic and paleoceanographic changes in the Southern Ocean. During the late Eocene, the nannofossil assemblage data reveal that there were several minor SST decreases (coolings) from 36 to 34 Ma, before the Eocene/Oligocene (E/O) boundary. A sharp cooling event, dated at 33.54 Ma (earliest Oligocene), occurred about 160 kyr after the E/O boundary, which is dated at 33.7 Ma. Relatively stable, cool conditions are interpreted to persist until the latest Oligocene, when an increase in abundance of temperate-water taxa, which corresponds to an antithetical decrease in abundance of cool-water indicators, is recorded. On the basis of our dating, the opening of the Drake Passage, allowing shallow-water circulation, began by 33.54 Ma at the latest, while the establishment of deep-water connections through the Tasmanian Gateway occurred at 33 Ma, as suggested by Exon et al. [Proc. ODP, Init. Rep. 189 (2001) 1].

Relevância:

100.00% 100.00%

Publicador:

Resumo:

At the NW-slope of Eckernforder Bay (Western Baltic) between 14 and 21 m water depth 7 sand cores were taken with a vibrocorer. The cores were between 85 and 250 cm long. The sand was analysed for grain size distribution, proportions of organic carbon and carbonate, and contents of microfossils. The radiometric age and stable carbon isotope ratios were determined on organic material from 14 sample. With regard to benthic foraminifera and other microorganisms four different types of depositional conditions could be distinguished: Types 1 and 2: two types of offshore sand areas. Type 3: lagoon and nearshore. Type 4: subaerial or limnic. Using sedimentological and geochemical parameters two formation areas could be distinguished with the aid of a discriminant analysis: offshore (types 1 and 2) and nearshore (types 3 and 4). A juxtaposition of core sections indicated two distinct profiles. Their ages fit into the picture of the assumed postglacial sea-level rise. The lagoon- and nearshore sands are interpreted as the result of sea-level stagnation at 17-18 m below present sea-level. The accumulation rates of the sand in the offshore areas are, with a maximum of 0.15 mm/yr., an order of magnitude smaller than in the mud areas, located several hundred metres away.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

In order to assess recent submarine volcanic contributions to the sediments from the active Kolbeinsey Ridge, surface samples were analyzed chemically. The contribution of major and trace elements studied differ within the study area. A statistical analysis of the geochemical variables using factor analysis and cluster method allows to distinguish possible sample groups. Cluster method identifies three distinct sediment groups located in different areas of sedimentation. Group 1 is characterized by highest contents of Fe2O3, V, Co, Ni, Cu and Zn demonstrating the input of volcaniclastic material. Group 2 comprises high values of CaCO3, CaO and Sr representing biogenic carbonate. Group 3 is characterized by the elements K, Rb, Cs, La and Pb indicating the terrigenous component. The absolute percentage of the volcanic, biogenic and terrigenous components in the bulk sediments was calculated by using a normative sediment method. The highest volcanic component (> 60% on a carbonate free basis) is found on the ridge crest. The biogenic component is highest (10-30%) in the eastern part of the Spar Fracture Zone influenced by the East Iceland Current. Samples from the western and southeastern region of the study area contain more than 90% of terrigenous component which appears to be mainly controlled by input of ice-rafted debris.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The Middle Eocene diatom and silicoflagellate record of ODP Site 1260A (Demerara Rise) is studied quantitatively in order to throw light on the changes that siliceous phytoplankton communities experienced during a Middle Eocene warming event that occurred between 44.0 and 42.0 Ma. Both Pianka's overlap index, calculated per couple of successive samples, and cluster analysis, point to a number of significant turnover events highlighted by changes in the structure of floristic communities. The pre-warming flora, dominated by cosmopolitan species of the diatom genus Triceratium, is replaced during the warming interval by a new and more diverse assemblage, dominated by Paralia sulcata (an indicator of high productivity) and two endemic tropical species of the genus Hemiaulus. The critical warming interval is characterized by a steady increase in biogenic silica and a comparable increase in excess Ba, both reflecting an increase in productivity. In general, it appears that high productivity not only increased the flux of biogenic silica, but also sustained a higher diversity in the siliceous phytoplankton communities. The microflora preserved above the critical interval is once again of low diversity and dominated by various species of the diatom genus Hemiaulus. All assemblages in the studied material are characterized by the total absence of continental and benthic diatoms and the relative abundance of neritic forms, suggesting a transitional depositional environment between the neritic and the oceanic realms.