7 resultados para High-dimensional data

em Brock University, Canada


Relevância:

90.00% 90.00%

Publicador:

Resumo:

The curse of dimensionality is a major problem in the fields of machine learning, data mining and knowledge discovery. Exhaustive search for the most optimal subset of relevant features from a high dimensional dataset is NP hard. Sub–optimal population based stochastic algorithms such as GP and GA are good choices for searching through large search spaces, and are usually more feasible than exhaustive and deterministic search algorithms. On the other hand, population based stochastic algorithms often suffer from premature convergence on mediocre sub–optimal solutions. The Age Layered Population Structure (ALPS) is a novel metaheuristic for overcoming the problem of premature convergence in evolutionary algorithms, and for improving search in the fitness landscape. The ALPS paradigm uses an age–measure to control breeding and competition between individuals in the population. This thesis uses a modification of the ALPS GP strategy called Feature Selection ALPS (FSALPS) for feature subset selection and classification of varied supervised learning tasks. FSALPS uses a novel frequency count system to rank features in the GP population based on evolved feature frequencies. The ranked features are translated into probabilities, which are used to control evolutionary processes such as terminal–symbol selection for the construction of GP trees/sub-trees. The FSALPS metaheuristic continuously refines the feature subset selection process whiles simultaneously evolving efficient classifiers through a non–converging evolutionary process that favors selection of features with high discrimination of class labels. We investigated and compared the performance of canonical GP, ALPS and FSALPS on high–dimensional benchmark classification datasets, including a hyperspectral image. Using Tukey’s HSD ANOVA test at a 95% confidence interval, ALPS and FSALPS dominated canonical GP in evolving smaller but efficient trees with less bloat expressions. FSALPS significantly outperformed canonical GP and ALPS and some reported feature selection strategies in related literature on dimensionality reduction.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

The curse of dimensionality is a major problem in the fields of machine learning, data mining and knowledge discovery. Exhaustive search for the most optimal subset of relevant features from a high dimensional dataset is NP hard. Sub–optimal population based stochastic algorithms such as GP and GA are good choices for searching through large search spaces, and are usually more feasible than exhaustive and determinis- tic search algorithms. On the other hand, population based stochastic algorithms often suffer from premature convergence on mediocre sub–optimal solutions. The Age Layered Population Structure (ALPS) is a novel meta–heuristic for overcoming the problem of premature convergence in evolutionary algorithms, and for improving search in the fitness landscape. The ALPS paradigm uses an age–measure to control breeding and competition between individuals in the population. This thesis uses a modification of the ALPS GP strategy called Feature Selection ALPS (FSALPS) for feature subset selection and classification of varied supervised learning tasks. FSALPS uses a novel frequency count system to rank features in the GP population based on evolved feature frequencies. The ranked features are translated into probabilities, which are used to control evolutionary processes such as terminal–symbol selection for the construction of GP trees/sub-trees. The FSALPS meta–heuristic continuously refines the feature subset selection process whiles simultaneously evolving efficient classifiers through a non–converging evolutionary process that favors selection of features with high discrimination of class labels. We investigated and compared the performance of canonical GP, ALPS and FSALPS on high–dimensional benchmark classification datasets, including a hyperspectral image. Using Tukey’s HSD ANOVA test at a 95% confidence interval, ALPS and FSALPS dominated canonical GP in evolving smaller but efficient trees with less bloat expressions. FSALPS significantly outperformed canonical GP and ALPS and some reported feature selection strategies in related literature on dimensionality reduction.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Polarized reflectance measurements of the quasi I-D charge-transfer salt (TMTSFh CI04 were carried out using a Martin-Puplett-type polarizing interferometer and a 3He refrigerator cryostat, at several temperatures between 0.45 K and 26 K, in the far infrared, in the 10 to 70 cm- 1 frequency range. Bis-tetramethyl-tetraselena-fulvalene perchlorate crystals, grown electrochemically and supplied by K. Behnia, of dimensions 2 to 4 by 0.4 by 0.2 mm, were assembled on a flat surface to form a mosaic of 1.5 by 3 mm. The needle shaped crystals were positioned parallel to each other along their long axis, which is the stacking direction of the planar TMTSF cations, exposing the ab plane face (parallel to which the sheets of CI04 anions are positioned). Reflectance measurements were performed with radiation polarized along the stacking direction in the sample. Measurements were carried out following either a fast (15-20 K per minute) or slow (0.1 K per minute) cooling of the sample. Slow cooling permits the anions to order near 24 K, and the sample is expected to be superconducting below 1.2 K, while fast cooling yields an insulating state at low temperatures. Upon the slow cooling the reflectance shows dependence with temperature and exhibits the 28 cm- 1 feature reported previously [1]. Thermoreflectance for both the 'slow' and 'fast' cooling of the sample calculated relative to the 26 K reflectance data indicates that the reflectance is temperature dependent, for the slow cooling case only. A low frequency edge in the absolute reflectance is assigned an electronic origin given its strong temperature dependence in the relaxed state. We attribute the peak in the absolute reflectance near 30 cm-1 to a phonon coupled to the electronic background. Both the low frequency edge and the 30 cm-1 feature are noted te shift towards higher frequcncy, upon cntering the superconducting state, by an amount of the order of the expected superconducting energy gap. Kramers-Kronig analysis was carried out to determine the optical conductivity for the slowly cooled sample from the measured reflectance. In order to do so the low frequency data was extrapolated to zero frequency using a Hagen-Ru bens behaviour, and the high frequency data was extended with the data of Cao et al. [2], and Kikuchi et al. [3]. The real part of the optical conductivity exhibits an asymmetric peak at 35 cm-1, and its background at lower frequencies seems to be losing spectral weight with lowering of the temperature, leading us to presume that a narrow peak is forming at even lower frequencies.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

This research focuses on generating aesthetically pleasing images in virtual environments using the particle swarm optimization (PSO) algorithm. The PSO is a stochastic population based search algorithm that is inspired by the flocking behavior of birds. In this research, we implement swarms of cameras flying through a virtual world in search of an image that is aesthetically pleasing. Virtual world exploration using particle swarm optimization is considered to be a new research area and is of interest to both the scientific and artistic communities. Aesthetic rules such as rule of thirds, subject matter, colour similarity and horizon line are all analyzed together as a multi-objective problem to analyze and solve with rendered images. A new multi-objective PSO algorithm, the sum of ranks PSO, is introduced. It is empirically compared to other single-objective and multi-objective swarm algorithms. An advantage of the sum of ranks PSO is that it is useful for solving high-dimensional problems within the context of this research. Throughout many experiments, we show that our approach is capable of automatically producing images satisfying a variety of supplied aesthetic criteria.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Although the link between macroeconomic news announcements and exchange rates is well documented in recent literature, this connection may be unstable. By using a broad set of macroeconomic news announcements and high frequency forex data for the Euro/Dollar, Pound/Dollar and Yen/Dollar from Nov 1, 2004 to Mar 31, 2014, we obtain two major findings with regards to this instability. First, many macroeconomic news announcements exhibit unstable effects with certain patterns in foreign exchange rates. These news effects may change in magnitude and even in their sign over time, over business cycles and crises within distinctive contexts. This finding is robust because the results are obtained by applying a Two-Regime Smooth Transition Regression Model, a Breakpoints Regression Model, and an Efficient Test of Parameter Instability which are all consistent with each other. Second, when we explore the source of this instability, we find that global risks and the reaction by central bank monetary policy to these risks to be possible factors causing this instability.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

This thesis investigates how macroeconomic news announcements affect jumps and cojumps in foreign exchange markets, especially under different business cycles. We use 5-min interval from high frequency data on Euro/Dollar, Pound/Dollar and Yen/Dollar from Nov. 1, 2004 to Feb. 28, 2015. The jump detection method was proposed by Andersen et al. (2007c), Lee & Mykland (2008) and then modified by Boudt et al. (2011a) for robustness. Then we apply the two-regime smooth transition regression model of Teräsvirta (1994) to explore news effects under different business cycles. We find that scheduled news related to employment, real activity, forward expectations, monetary policy, current account, price and consumption influences forex jumps, but only FOMC Rate Decisions has consistent effects on cojumps. Speeches given by major central bank officials near a crisis also significantly affect jumps and cojumps. However, the impacts of some macroeconomic news are not the same under different economic states.