881 resultados para multiple discriminant analysis
Resumo:
A microwave-assisted extraction (MAE) procedure to isolate phenolic compounds from almond skin byproducts was optimized. A three-level, three-factor Box–Behnken design was used to evaluate the effect of almond skin weight, microwave power, and irradiation time on total phenolic content (TPC) and antioxidant activity (DPPH). Almond skin weight was the most important parameter in the studied responses. The best extraction was achieved using 4 g, 60 s, 100 W, and 60 mL of 70% (v/v) ethanol. TPC, antioxidant activity (DPPH, FRAP), and chemical composition (HPLC-DAD-ESI-MS/MS) were determined by using the optimized method from seven different almond cultivars. Successful discrimination was obtained for all cultivars by using multivariate linear discriminant analysis (LDA), suggesting the influence of cultivar type on polyphenol content and antioxidant activity. The results show the potential of almond skin as a natural source of phenolics and the effectiveness of MAE for the reutilization of these byproducts.
Resumo:
Many multifactorial biologic effects, particularly in the context of complex human diseases, are still poorly understood. At the same time, the systematic acquisition of multivariate data has become increasingly easy. The use of such data to analyze and model complex phenotypes, however, remains a challenge. Here, a new analytic approach is described, termed coreferentiality, together with an appropriate statistical test. Coreferentiality is the indirect relation of two variables of functional interest in respect to whether they parallel each other in their respective relatedness to multivariate reference data, which can be informative for a complex effect or phenotype. It is shown that the power of coreferentiality testing is comparable to multiple regression analysis, sufficient even when reference data are informative only to a relatively small extent of 2.5%, and clearly exceeding the power of simple bivariate correlation testing. Thus, coreferentiality testing uses the increased power of multivariate analysis, however, in order to address a more straightforward interpretable bivariate relatedness. Systematic application of this approach could substantially improve the analysis and modeling of complex phenotypes, particularly in the context of human study where addressing functional hypotheses by direct experimentation is often difficult.
Resumo:
The aim of this study was to apply multifailure survival methods to analyze time to multiple occurrences of basal cell carcinoma (BCC). Data from 4.5 years of follow-up in a randomized controlled trial, the Nambour Skin Cancer Prevention Trial (1992-1996), to evaluate skin cancer prevention were used to assess the influence of sunscreen application on the time to first BCC and the time to subsequent BCCs. Three different approaches of time to ordered multiple events were applied and compared: the Andersen-Gill, Wei-Lin-Weissfeld, and Prentice-Williams-Peterson models. Robust variance estimation approaches were used for all multifailure survival models. Sunscreen treatment was not associated with time to first occurrence of a BCC (hazard ratio = 1.04, 95% confidence interval: 0.79, 1.45). Time to subsequent BCC tumors using the Andersen-Gill model resulted in a lower estimated hazard among the daily sunscreen application group, although statistical significance was not reached (hazard ratio = 0.82, 95% confidence interval: 0.59, 1.15). Similarly, both the Wei-Lin-Weissfeld marginal-hazards and the Prentice-Williams-Peterson gap-time models revealed trends toward a lower risk of subsequent BCC tumors among the sunscreen intervention group. These results demonstrate the importance of conducting multiple-event analysis for recurring events, as risk factors for a single event may differ from those where repeated events are considered.
Resumo:
Researchers often use 3-way interactions in moderated multiple regression analysis to test the joint effect of 3 independent variables on a dependent variable. However, further probing of significant interaction terms varies considerably and is sometimes error prone. The authors developed a significance test for slope differences in 3-way interactions and illustrate its importance for testing psychological hypotheses. Monte Carlo simulations revealed that sample size, magnitude of the slope difference, and data reliability affected test power. Application of the test to published data yielded detection of some slope differences that were undetected by alternative probing techniques and led to changes of results and conclusions. The authors conclude by discussing the test's applicability for psychological research. Copyright 2006 by the American Psychological Association.
Resumo:
Electronic commerce (e-commerce) has become an increasingly important initiative among organisations. The factors affecting adoption decisions have been well-documented, but there is a paucity of empirical studies that examine the adoption of e-commerce in developing economies in the Arab world. The aim of this study is to provide insights into the salient e-commerce adoption issues by focusing on Saudi Arabian businesses. Based on the Technology-Organisational-Environmental framework, an integrated research model was developed that explains the relative influence of 19 known determinants. A measurement scale was developed from prior empirical studies and revised based on feedback from the pilot study. Non-interactive adoption, interactive adoption and stabilisation of e-commerce adoption were empirically investigated using survey data collected from Saudi manufacturing and service companies. Multiple discriminant function analysis (MDFA) was used to analyse the data and research hypotheses. The analysis demonstrates that (1) regarding the non-interactive adoption of e-commerce, IT readiness, management team support, learning orientation, strategic orientation, pressure from business partner, regulatory and legal environment, technology consultants‘ participation and economic downturn are the most important factors, (2) when e-commerce interactive adoption is investigated, IT readiness, management team support, regulatory environment and technology consultants‘ participation emerge as the strongest drivers, (3) pressure from customers may not have much effect on the non-interactive adoption of e-commerce by companies, but does significantly influence the stabilisation of e-commerce use by firms, and (4) Saudi Arabia has a strong ICT infrastructure for supporting e-commerce practices. Taken together, these findings on the multi-dimensionality of e-commerce adoption show that non-interactive adoption, interactive adoption and stabilisation of e-commerce are not only different measures of e-commerce adoption, but also have different determinants. Findings from this study may be valuable for both policy and practice as it can offer a substantial understanding of the factors that enhance the widespread use of B2B e-commerce. Also, the integrated model provides a more comprehensive explanation of e-commerce adoption in organisations and could serve as a foundation for future research on information systems.
Resumo:
This research evaluates pattern recognition techniques on a subclass of big data where the dimensionality of the input space (p) is much larger than the number of observations (n). Specifically, we evaluate massive gene expression microarray cancer data where the ratio κ is less than one. We explore the statistical and computational challenges inherent in these high dimensional low sample size (HDLSS) problems and present statistical machine learning methods used to tackle and circumvent these difficulties. Regularization and kernel algorithms were explored in this research using seven datasets where κ < 1. These techniques require special attention to tuning necessitating several extensions of cross-validation to be investigated to support better predictive performance. While no single algorithm was universally the best predictor, the regularization technique produced lower test errors in five of the seven datasets studied.
Resumo:
The article investigates the division between member states of the European Union considering the aspect of their level of information and communication technology (ICT) development focusing on e-learning. With the help of discriminant analysis the countries are categorized into groups based on their ICT maturity and e-learning literacy level of development. Making a comparison with a benchmarking tool, the ITU (International Telecommunication Union)’s ICT Development Index (IDI) the results are confirmed partly correct. The article tries to find economical explanations for the re-grouping of the countries ranking. Finally the author examines the reliability of Hungary’s ranking results and the factors which may affect this divergence from the real picture.
Resumo:
This dissertation reports the results of a study that examined differences between genders in a sample of adolescents from a residential substance abuse treatment facility. The sample included 72 males and 65 females, ages 12 through 17. The data were archival, having been originally collected for a study of elopement from treatment. The current study included 23 variables. The variables were from multiple dimensions, including socioeconomic, legal, school, family, substance abuse, psychological, social support, and treatment histories. Collectively, they provided information about problem behaviors and psychosocial problems that are correlates of adolescent substance abuse. The study hypothesized that these problem behaviors and psychosocial problems exist in different patterns and combinations between genders.^ Further, it expected that these patterns and combinations would constitute profiles important for treatment. K-means cluster analysis identified differential profiles between genders in all three areas: problem behaviors, psychosocial problems, and treatment profiles. In the dimension of problem behaviors, the predominantly female group was characterized as suicidal and destructive, while the predominantly male group was identified as aggressive and low achieving. In the dimension of psychosocial problems, the predominantly female group was characterized as abused depressives, while the male group was identified as asocial, low problem severity. A third group, neither predominantly female or male, was characterized as social, high problem severity. When these dimensions were combined to form treatment profiles, the predominantly female group was characterized as abused, self-harmful, and social, and the male group was identified as aggressive, destructive, low achieving, and asocial. Finally, logistic regression and discriminant analysis were used to determine whether a history of sexual and physical abuse impacted problem behavior differentially between genders. Sexual abuse had a substantially greater influence in producing self-mutilating and suicidal behavior among females than among males. Additionally, a model including sexual abuse, physical abuse, low family support, and low support from friends showed a moderate capacity to predict unusual harmful behavior (fire-starting and cruelty to animals) among males. Implications for social work practice, social work research, and systems science are discussed. ^
Resumo:
This study is an exploratory analysis of an operational measure for resource development strategies, and an exploratory analysis of internal organizational contingencies influencing choices of these strategies in charitable nonprofit organizations. The study provides conceptual guidance for advancing understanding about resource development in the nonprofit sector. The statistical findings are, however, inconclusive without further rigorous examination. A three category typology based on organization technology is initially presented to define the strategies. Three dimensions of internal organizational contingencies explored represent organization identity, professional staff, and boards of directors. Based on relevant literature and key informant interviews, an original survey was administered by mail to a national sample of nonprofit organizations. The survey collected data on indicators of the proposed strategy types and selected contingencies. Factor analysis extracted two of the initial categories in the typology. The Building Resource Development Infrastructure Strategy encompasses information technology, personnel, legal structures, and policies facilitating fund development. The Building Resource Development Infrastructure Strategy encompasses the mission, service niche, and type of service delivery forming the basis for seeking financial support. Linear regressions with each strategy type as the dependent variable identified distinct and common contingencies which may partly explain choices of strategies. Discriminant analysis suggests the potential predictive accuracy of the contingencies. Follow-up case studies with survey respondents provide additional criteria for operationalizing future measures of resource development strategies, and support and expand the analysis on contingencies. The typology offers a beginning framework for defining alternative approaches to resource development, and for exploring organization capacity specific to each approach. Contingencies that may be integral components of organization capacity are funding, leadership frame, background and experience, staff and volunteer effort, board member support, and relationships in the external environment. Based on these findings, management questions are offered for nonprofit organization stakeholders to consider in planning for resource development. Lessons learned in designing and conducting this study are also provided to enhance future related research. ^
Resumo:
Eating disorders can lead to a negative impact on students' academic growth, nutrition and can cause death (Claude-Pierre, 1997; Manley, Rickson, & Standeven, 2000; Romeo, 1996). Early intervention by referring students to professional counseling might help counter these negative consequences. The teacher is in the position to assist students by providing health information, identifying those with problems, and intervening for a variety of dysfunctions that may include the eating disorders called anorexia nervosa and bulimia nervosa (Myers-Clark & Christopher, 2000). However teachers are in a difficult position to know when to address student concerns and judge what action to take (Ransley, 1999). Teachers' engagement seems crucial (Smolak, Harris, Levine, & Shisslak, 2001) since eating disorders are being identified in younger children. The purpose of this study was to examine (a) the relationships of the theoretical constructs, attitude, subjective norm, and perceived behavioral control of the theory of planned behavior as predictors of behavioral intention (Ajzen & Fishbein, 1980) of middle school teachers to identify and refer suspected anorexia nervosa (AN) and/or bulimia nervosa (BN) students for professional help; and (b) the actual behavior of middle school teachers who reported having ever referred a student suspected of having AN and BN and those teachers who reported not having made such a referral. One hundred fourteen middle school teachers in Broward County, Florida volunteered to participate in the ex post facto research. Data were collected from a questionnaire. Multiple regression analysis revealed that the constructs of subjective norm (perception of what others think about one's performance of behavior combined with motivation to comply) and perceived behavioral control (perception regarding the extent of the difficulty of performing the behavior) were predictive of teachers' intent (likelihood of engaging in a behavior) to refer. However, the analysis revealed that attitude (overall positive or negative feeling with respect to performing the behavior) was not predictive of teachers' intent. Discriminant function analysis revealed that both intent and perceived behavioral control were predictive of group membership, either having referred a student suspected of having an eating disorder for counseling or not having made such a referral. Attitude and subjective norm were not predictive of group membership.
Resumo:
Classification procedures, including atmospheric correction satellite images as well as classification performance utilizing calibration and validation at different levels, have been investigated in the context of a coarse land-cover classification scheme for the Pachitea Basin. Two different correction methods were tested against no correction in terms of reflectance correction towards a common response for pseudo-invariant features (PIF). The accuracy of classifications derived from each of the three methods was then assessed in a discriminant analysis using crossvalidation at pixel, polygon, region, and image levels. Results indicate that only regression adjusted images using PIFs show no significant difference between images in any of the bands. A comparison of classifications at different levels suggests though that at pixel, polygon, and region levels the accuracy of the classifications do not significantly differ between corrected and uncorrected images. Spatial patterns of land-cover were analyzed in terms of colonization history, infrastructure, suitability of the land, and landownership. The actual use of the land is driven mainly by the ability to access the land and markets as is obvious in the distribution of land cover as a function of distance to rivers and roads. When considering all rivers and roads a threshold distance at which disproportional agro-pastoral land cover switches from over represented to under represented is at about 1km. Best land use suggestions seem not to affect the choice of land use. Differences in abundance of land cover between watersheds are more prevailing than differences between colonist and indigenous groups.
Resumo:
The problems faced by scientists in charge of managing Atlantic salmon (Salmo salar) stocks are : i) how to maintain spawning runs consisting of repeat spawners and large multi-sea-winter (MSW) adults in the face of selective homewater and distant commercial fisheries and , ii) how to more accurately predict returns of adults. Using data from scales collected from maiden Atlantic salmon grilse from two locations on the Northern Peninsula of Newfoundland, St. Barbe Bay and Western Arm Brook, their length at smolting was back calculated. These data were then used to examine whether the St. Barbe commercial fishery is selective for salmon of particular smolt age and/or size. Analysis indicated that come commercial fishery selected larger, but not necessarily older adults that those escaping to Western Arm Brook over the period of this study, 1978-1987. It was determined that less than average size smolts survived better than above average size smolts. Slection for repeat spawners, large MSW salmon, and larger grilse has meant reductions in the proportions of these adults in the spawning runs on Western Arm Brook. This may impact the Western Arm Brook salmon stock by increasing the population instability. Sea survival was significantly correlated with selection by the commercial fishery. Characteristics of adults in Western Arm Brook during the period of study (1978-1987) did not help in explaining yearly variation in sea survival. The characteristics of smolts, however, when subjected to multiple regression analysis explained 57.2 percent of the yearly variation in sea survival.
Resumo:
A certain type of bacterial inclusion, known as a bacterial microcompartment, was recently identified and imaged through cryo-electron tomography. A reconstructed 3D object from single-axis limited angle tilt-series cryo-electron tomography contains missing regions and this problem is known as the missing wedge problem. Due to missing regions on the reconstructed images, analyzing their 3D structures is a challenging problem. The existing methods overcome this problem by aligning and averaging several similar shaped objects. These schemes work well if the objects are symmetric and several objects with almost similar shapes and sizes are available. Since the bacterial inclusions studied here are not symmetric, are deformed, and show a wide range of shapes and sizes, the existing approaches are not appropriate. This research develops new statistical methods for analyzing geometric properties, such as volume, symmetry, aspect ratio, polyhedral structures etc., of these bacterial inclusions in presence of missing data. These methods work with deformed and non-symmetric varied shaped objects and do not necessitate multiple objects for handling the missing wedge problem. The developed methods and contributions include: (a) an improved method for manual image segmentation, (b) a new approach to 'complete' the segmented and reconstructed incomplete 3D images, (c) a polyhedral structural distance model to predict the polyhedral shapes of these microstructures, (d) a new shape descriptor for polyhedral shapes, named as polyhedron profile statistic, and (e) the Bayes classifier, linear discriminant analysis and support vector machine based classifiers for supervised incomplete polyhedral shape classification. Finally, the predicted 3D shapes for these bacterial microstructures belong to the Johnson solids family, and these shapes along with their other geometric properties are important for better understanding of their chemical and biological characteristics.
Resumo:
A compositional multivariate approach is used to analyse regional scale soil geochemical data obtained as part of the Tellus Project generated by the Geological Survey Northern Ireland (GSNI). The multi-element total concentration data presented comprise XRF analyses of 6862 rural soil samples collected at 20cm depths on a non-aligned grid at one site per 2 km2. Censored data were imputed using published detection limits. Using these imputed values for 46 elements (including LOI), each soil sample site was assigned to the regional geology map provided by GSNI initially using the dominant lithology for the map polygon. Northern Ireland includes a diversity of geology representing a stratigraphic record from the Mesoproterozoic, up to and including the Palaeogene. However, the advance of ice sheets and their meltwaters over the last 100,000 years has left at least 80% of the bedrock covered by superficial deposits, including glacial till and post-glacial alluvium and peat. The question is to what extent the soil geochemistry reflects the underlying geology or superficial deposits. To address this, the geochemical data were transformed using centered log ratios (clr) to observe the requirements of compositional data analysis and avoid closure issues. Following this, compositional multivariate techniques including compositional Principal Component Analysis (PCA) and minimum/maximum autocorrelation factor (MAF) analysis method were used to determine the influence of underlying geology on the soil geochemistry signature. PCA showed that 72% of the variation was determined by the first four principal components (PC’s) implying “significant” structure in the data. Analysis of variance showed that only 10 PC’s were necessary to classify the soil geochemical data. To consider an improvement over PCA that uses the spatial relationships of the data, a classification based on MAF analysis was undertaken using the first 6 dominant factors. Understanding the relationship between soil geochemistry and superficial deposits is important for environmental monitoring of fragile ecosystems such as peat. To explore whether peat cover could be predicted from the classification, the lithology designation was adapted to include the presence of peat, based on GSNI superficial deposit polygons and linear discriminant analysis (LDA) undertaken. Prediction accuracy for LDA classification improved from 60.98% based on PCA using 10 principal components to 64.73% using MAF based on the 6 most dominant factors. The misclassification of peat may reflect degradation of peat covered areas since the creation of superficial deposit classification. Further work will examine the influence of underlying lithologies on elemental concentrations in peat composition and the effect of this in classification analysis.
Resumo:
Purpose – The objective of this exploratory study is to investigate the “flow-through” or relationship between top-line measures of hotel operating performance (occupancy, average daily rate and revenue per available room) and bottom-line measures of profitability (gross operating profit and net operating income), before and during the recent great recession. Design/methodology/approach – This study uses data provided by PKF Hospitality Research for the period from 2007-2009. A total of 714 hotels were analyzed and various top-line and bottom-line profitability changes were computed using both absolute levels and percentages. Multiple regression analysis was used to examine the relationship between top and bottom line measures, and to derive flow-through ratios. Findings – The results show that average daily rate (ADR) and occupancy are significantly and positively related to gross operating profit per available room (GOPPAR) and net operating income per available room (NOIPAR). The evidence indicates that ADR, rather than occupancy, appears to be the stronger predictor and better measure of RevPAR growth and bottom-line profitability. The correlations and explained variances are also higher than those reported in prior research. Flow-through ratios range between 1.83 and 1.91 for NOIPAR, and between 1.55 and 1.65 for GOPPAR, across all chain-scales. Research limitations/implications – Limitations of this study include the limited number of years in the study period, limited number of hotels in a competitive set, and self-selection of hotels by the researchers. Practical implications – While ADR and occupancy work in combination to drive profitability, the authors' study shows that ADR is the stronger predictor of profitability. Hotel managers can use flow-through ratios to make financial forecasts, or use them as inputs in valuation models, to forecast future profitability. Originality/value – This paper extends prior research on the relationship between top-line measures and bottom-line profitability and serves to inform lodging owners, operators and asset managers about flow-through ratios, and how these ratios impact hotel profitability.