10 resultados para categorical and mix datasets

em Brock University, Canada


Relevância:

100.00% 100.00%

Publicador:

Resumo:

Feature selection plays an important role in knowledge discovery and data mining nowadays. In traditional rough set theory, feature selection using reduct - the minimal discerning set of attributes - is an important area. Nevertheless, the original definition of a reduct is restrictive, so in one of the previous research it was proposed to take into account not only the horizontal reduction of information by feature selection, but also a vertical reduction considering suitable subsets of the original set of objects. Following the work mentioned above, a new approach to generate bireducts using a multi--objective genetic algorithm was proposed. Although the genetic algorithms were used to calculate reduct in some previous works, we did not find any work where genetic algorithms were adopted to calculate bireducts. Compared to the works done before in this area, the proposed method has less randomness in generating bireducts. The genetic algorithm system estimated a quality of each bireduct by values of two objective functions as evolution progresses, so consequently a set of bireducts with optimized values of these objectives was obtained. Different fitness evaluation methods and genetic operators, such as crossover and mutation, were applied and the prediction accuracies were compared. Five datasets were used to test the proposed method and two datasets were used to perform a comparison study. Statistical analysis using the one-way ANOVA test was performed to determine the significant difference between the results. The experiment showed that the proposed method was able to reduce the number of bireducts necessary in order to receive a good prediction accuracy. Also, the influence of different genetic operators and fitness evaluation strategies on the prediction accuracy was analyzed. It was shown that the prediction accuracies of the proposed method are comparable with the best results in machine learning literature, and some of them outperformed it.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Purpose: The influence of environment in the development of overweight and obesity is an ongoing concern. This investigation examined the influence of urbanization on the rates of childhood overweight and obesity. Method: 2167 (1090M, 1077F) grade four children from 75 schools in Ontario's Niagara Region were sampled. A sophisticated algorithm overlaying electoral boundaries, population densities, and the knowledge of community members was used to classify schools into one of three location categories: urban {N= 1588), urban fringe {N= 379), and rural (A^= 234). Each subject was measured for: height, weight, and aerobic performance (Leger). Physical activity was evaluated with the self-report Participation Questionnaire (free-time and organized sport activities), and teacher's evaluations of student activity. Overweight (overweight and obesity combined) was measured both as a continuous (BMI) and categorical variable (BMI category), to evaluate the prevalence by location. A multivariate analysis was used to test for a suppression effect. Results: BMI and BMI category did not differ significantly by location or gender, and no evidence of a gender interaction existed. According to both a linear and logistic regression, physical activity or fitness levels did not suppress the influence of location on BMI and BMI category. Age, gender, free-time activity, organized sports, fitness level, and number of siblings, were all found to significantly influence overweight. Conclusions: It is plausible that the prevalence of overweight does not differ in urban and rural children from the Niagara Region. Further investigation is recommended, examining subjects by individual location of residence, in multiple regions throughout Ontario.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The thesis presents a comparison of the national energy policies of the Federal Republic of Germany and Canada from 1973 until the late 1980s. The purpose of this paper is to analyze whether economic and/or environmental concerns were responsible for changes in the· West-German and Canadian national energy policies. Furthermore, the feasibility of implementing a soft energy path in West-Germany and Canada is examined. For better comprehension of the policy-making process and implemented changes in the national energy policies of the two states, the West-German and Canadian parliamentary systems and the political cultures were compared. For the analysis, several events with international impact were taken as guidelines. Furthermore, based on statistical data, the West-German and Canadian energy production and consumption were analyzed. With reference to these results the degree of the de facto changes in the national energy policies were analyzed. In addition, the thesis discusses the possibilities which a soft energy path offers to both national governments to renounce themselves from the dependencies on a few energy resources. The thesis reveals that changes in the West-German and Canadian national energy policies, in their energy production and consumption are correlated to various world events. In particular, governmental reponses security of energy supply by the two international oil crises of 1973 and 1979/1980 demonstrate that changes in the West-German and Canadian national energy policies were implemented in reaction to economic concerns than environmental ones. With the policies "away from oil" and "off oil", the West-German and Canadian government implemented the i i substitution of oil through various diverse energy supply resources. However, energy savings concepts and policies were initiated through the first oil crisis in 1973. The world recessions in 1975 and 1982 had no 'profound impacts on the agenda of West-German and Canadian energy policies. As a consequence of the stagnation or the negative growth of the world economic market, changes in their energy production and consumption can be perceived. However, the West-German and Canadian energy production and consumption intensified with the augmentation of the world economy. During the period of study, environmental concerns were taken into account in the energy policy agendas of the Federal Republic of Germany and Canada but they were not of primary concern. wi thin the decade of. the 1980s notably more environmental considerations were taken into account in the energy policies of the two states. The two nuclear reactor accidents in 1979 and 1986 sharpened to various degrees West-German and Canadian public discourse of present energy supply mix and attitude towards energy production and consumption. The statistical data reflects yet no changes in the energy policies in regard to the position of nuclear power. However, in the next several years possible changes can be observed through statistical data, because the planning, the construction and possible phase out of nuclear power requires several years. Finally, the thesis reveals that the implementation of a soft energy path requires profound changes in the consumer behaviour. As several studies indicate, a soft energy path is technological and economically feasible for the Federal Republic of Germany and Canada, its implementation remains to be a political decision.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Phe Ihesis examines the evolution of the -policies of the People fs Jtenublie of China towards J?hail°nd, PTal ysia, Singapore, Iidonesia pad the Philip-pines, organised in the Association of Southeast Asian Nations from 1969 to 1975• 2ze first central point of this study is an *ir sumption that the foreign relations of The People's Tepublic of Chi la Towards Southeast ^sia have been motivated by a dynamic interplay of t^o main factors: (1) Farxist-Leninist ideology and ICao J^e-tung Ph^ught, which dictate to China to behave as a revolutionary Dover vhich must assist the insurgent movements in the area in their strug fle to overthrow the local governments; (2) national interest, vhich demands of China to safeguard the southern flank of her territory bordering on Southerst 'sia through Friendly relations, trade and ot*»er conventional inrtniments of diplomacy. hile the tvo main motive factors are nuTually antagonistic and exclusivet the Chinere leaders are nevertheless at te mi ting to oring them iirco a coherent policy under Mao's theory of tve {hniity of op-nosites," vhich believes that it is -possible to reconcile these co-posing tendencies into a dynamic enuibrium through vhich both opnosites could be promoted at the same time although not to the same extent* la other words, the Chinese leaders conceive the dynamic equilibrium as a continuum between them in a mix in which one or the other orientation predominates in different •periods* Bins we might see China1 s conduct motivated in one period by mostly ideological considerations at the expense of the staire-to-state relations, then ve might see her policy in the middle of the continuum and suf ering from immo bill sine and just muddling through, or finally ?fe might see her emphasising friendly ties at the expense of support of revolutionary movements at the other extreme -point of the spectrum* !fhe mechanism vhich enables Peking to move from one pole to the other of the spectrum is activated by the following elementsJ (1) the result of an internal power struggle within the leadership in Peking between ideologically radical and moderate elements, which enables the victorious faction to initiate nev policies; (2) Peking's assessment of the changing intentions and capabilities of the major powers in the area; (3) internal changes within the countries of the area and the changing attitudes of their governments towards China; (4) changing fortunes of revolutionary movements operating in the area* 'Phe second major point of this study is an assertion that while China's conduct toward Southeast *lsia after her foundation in 1949 was primarily based upon ideological considerations, the beginning of the seventies saw the national interest reasserting itself as the leading motive factor* Thus China talks with her neighbours in Southeast asia in terms of relevance of fllong historical ties," casting herself into the role of a benevolent "older brother11 who is entitled to reopect and deference in exchange for patronage and protection* Hence the traditional echoes of the past are emerging ever stronger and influencing her postures towards the region, while the open support to revolutionary moevments is underplayed at the moment.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

This thesis reports on the optical properties of the dilute magnetic semiconductors, Sb1.97 V 0.03 Te3 and Sb1.94Cr0.06Te3, along with the parent compound Sb2Te3' These materials develop a ferromagnetic state at low temperature with Curie temperatures of 22 K and 16 K respectively. All three samples were oriented such that the electric field vector of the light was perpendicular to the c-axis. The reflectance profile of these samples in the mid-infrared (500 to 3000 cm-1) shows a pronounced plasma edge which retracts with decreasing temperature. The far-infrared region of these samples exhibits a phonon at ~ 60 cm-1 which softens as temperature decreases. Kramers-Kronig analysis and a Drude-Lorentz model were employed to determine the optical constants of the bulk samples. The real part of the optical conductivity is shown to consist of intraband contributions at frequencies below the energy gap (~0.26 eV) and interband contributions at frequencies above the energy gap. The temperature dependence of the scattering rate show that a mix of phonon and impurity scattering are present, while the signature of traditional spin disorder (magnetic) scattering was difficult to confirm.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Remote sensing techniques involving hyperspectral imagery have applications in a number of sciences that study some aspects of the surface of the planet. The analysis of hyperspectral images is complex because of the large amount of information involved and the noise within that data. Investigating images with regard to identify minerals, rocks, vegetation and other materials is an application of hyperspectral remote sensing in the earth sciences. This thesis evaluates the performance of two classification and clustering techniques on hyperspectral images for mineral identification. Support Vector Machines (SVM) and Self-Organizing Maps (SOM) are applied as classification and clustering techniques, respectively. Principal Component Analysis (PCA) is used to prepare the data to be analyzed. The purpose of using PCA is to reduce the amount of data that needs to be processed by identifying the most important components within the data. A well-studied dataset from Cuprite, Nevada and a dataset of more complex data from Baffin Island were used to assess the performance of these techniques. The main goal of this research study is to evaluate the advantage of training a classifier based on a small amount of data compared to an unsupervised method. Determining the effect of feature extraction on the accuracy of the clustering and classification method is another goal of this research. This thesis concludes that using PCA increases the learning accuracy, and especially so in classification. SVM classifies Cuprite data with a high precision and the SOM challenges SVM on datasets with high level of noise (like Baffin Island).

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Lexical processing among bilinguals is often affected by complex patterns of individual experience. In this paper we discuss the psychocentric perspective on language representation and processing, which highlights the centrality of individual experience in psycholinguistic experimentation. We discuss applications to the investigation of lexical processing among multilinguals and explore the advantages of using high-density experiments with multilinguals. High density experiments are designed to co-index measures of lexical perception and production, as well as participant profiles. We discuss the challenges associated with the characterization of participant profiles and present a new data visualization technique, that we term Facial Profiles. This technique is based on Chernoff faces developed over 40 years ago. The Facial Profile technique seeks to overcome some of the challenges associated with the use of Chernoff faces, while maintaining the core insight that recoding multivariate data as facial features can engage the human face recognition system and thus enhance our ability to detect and interpret patterns within multivariate datasets. We demonstrate that Facial Profiles can code participant characteristics in lexical processing studies by recoding variables such as reading ability, speaking ability, and listening ability into iconically-related relative sizes of eye, mouth, and ear, respectively. The balance of ability in bilinguals can be captured by creating composite facial profiles or Janus Facial Profiles. We demonstrate the use of Facial Profiles and Janus Facial Profiles in the characterization of participant effects in the study of lexical perception and production.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The curse of dimensionality is a major problem in the fields of machine learning, data mining and knowledge discovery. Exhaustive search for the most optimal subset of relevant features from a high dimensional dataset is NP hard. Sub–optimal population based stochastic algorithms such as GP and GA are good choices for searching through large search spaces, and are usually more feasible than exhaustive and deterministic search algorithms. On the other hand, population based stochastic algorithms often suffer from premature convergence on mediocre sub–optimal solutions. The Age Layered Population Structure (ALPS) is a novel metaheuristic for overcoming the problem of premature convergence in evolutionary algorithms, and for improving search in the fitness landscape. The ALPS paradigm uses an age–measure to control breeding and competition between individuals in the population. This thesis uses a modification of the ALPS GP strategy called Feature Selection ALPS (FSALPS) for feature subset selection and classification of varied supervised learning tasks. FSALPS uses a novel frequency count system to rank features in the GP population based on evolved feature frequencies. The ranked features are translated into probabilities, which are used to control evolutionary processes such as terminal–symbol selection for the construction of GP trees/sub-trees. The FSALPS metaheuristic continuously refines the feature subset selection process whiles simultaneously evolving efficient classifiers through a non–converging evolutionary process that favors selection of features with high discrimination of class labels. We investigated and compared the performance of canonical GP, ALPS and FSALPS on high–dimensional benchmark classification datasets, including a hyperspectral image. Using Tukey’s HSD ANOVA test at a 95% confidence interval, ALPS and FSALPS dominated canonical GP in evolving smaller but efficient trees with less bloat expressions. FSALPS significantly outperformed canonical GP and ALPS and some reported feature selection strategies in related literature on dimensionality reduction.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The curse of dimensionality is a major problem in the fields of machine learning, data mining and knowledge discovery. Exhaustive search for the most optimal subset of relevant features from a high dimensional dataset is NP hard. Sub–optimal population based stochastic algorithms such as GP and GA are good choices for searching through large search spaces, and are usually more feasible than exhaustive and determinis- tic search algorithms. On the other hand, population based stochastic algorithms often suffer from premature convergence on mediocre sub–optimal solutions. The Age Layered Population Structure (ALPS) is a novel meta–heuristic for overcoming the problem of premature convergence in evolutionary algorithms, and for improving search in the fitness landscape. The ALPS paradigm uses an age–measure to control breeding and competition between individuals in the population. This thesis uses a modification of the ALPS GP strategy called Feature Selection ALPS (FSALPS) for feature subset selection and classification of varied supervised learning tasks. FSALPS uses a novel frequency count system to rank features in the GP population based on evolved feature frequencies. The ranked features are translated into probabilities, which are used to control evolutionary processes such as terminal–symbol selection for the construction of GP trees/sub-trees. The FSALPS meta–heuristic continuously refines the feature subset selection process whiles simultaneously evolving efficient classifiers through a non–converging evolutionary process that favors selection of features with high discrimination of class labels. We investigated and compared the performance of canonical GP, ALPS and FSALPS on high–dimensional benchmark classification datasets, including a hyperspectral image. Using Tukey’s HSD ANOVA test at a 95% confidence interval, ALPS and FSALPS dominated canonical GP in evolving smaller but efficient trees with less bloat expressions. FSALPS significantly outperformed canonical GP and ALPS and some reported feature selection strategies in related literature on dimensionality reduction.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Digital Terrain Models (DTMs) are important in geology and geomorphology, since elevation data contains a lot of information pertaining to geomorphological processes that influence the topography. The first derivative of topography is attitude; the second is curvature. GIS tools were developed for derivation of strike, dip, curvature and curvature orientation from Digital Elevation Models (DEMs). A method for displaying both strike and dip simultaneously as colour-coded visualization (AVA) was implemented. A plug-in for calculating strike and dip via Least Squares Regression was created first using VB.NET. Further research produced a more computationally efficient solution, convolution filtering, which was implemented as Python scripts. These scripts were also used for calculation of curvature and curvature orientation. The application of these tools was demonstrated by performing morphometric studies on datasets from Earth and Mars. The tools show promise, however more work is needed to explore their full potential and possible uses.