981 resultados para Hybrid feature selections
Resumo:
Musical genre classification has been paramount in the last years, mainly in large multimedia datasets, in which new songs and genres can be added at every moment by anyone. In this context, we have seen the growing of musical recommendation systems, which can improve the benefits for several applications, such as social networks and collective musical libraries. In this work, we have introduced a recent machine learning technique named Optimum-Path Forest (OPF) for musical genre classification, which has been demonstrated to be similar to the state-of-the-art pattern recognition techniques, but much faster for some applications. Experiments in two public datasets were conducted against Support Vector Machines and a Bayesian classifier to show the validity of our work. In addition, we have executed an experiment using very recent hybrid feature selection techniques based on OPF to speed up feature extraction process. © 2011 International Society for Music Information Retrieval.
Resumo:
Computational Intelligence and Feature Selection provides a high level audience with both the background and fundamental ideas behind feature selection with an emphasis on those techniques based on rough and fuzzy sets, including their hybridizations. It introduces set theory, fuzzy set theory, rough set theory, and fuzzy-rough set theory, and illustrates the power and efficacy of the feature selections described through the use of real-world applications and worked examples. Program files implementing major algorithms covered, together with the necessary instructions and datasets, are available on the Web.
Resumo:
Feature selection is important in medical field for many reasons. However, selecting important variables is a difficult task with the presence of censoring that is a unique feature in survival data analysis. This paper proposed an approach to deal with the censoring problem in endovascular aortic repair survival data through Bayesian networks. It was merged and embedded with a hybrid feature selection process that combines cox's univariate analysis with machine learning approaches such as ensemble artificial neural networks to select the most relevant predictive variables. The proposed algorithm was compared with common survival variable selection approaches such as; least absolute shrinkage and selection operator LASSO, and Akaike information criterion AIC methods. The results showed that it was capable of dealing with high censoring in the datasets. Moreover, ensemble classifiers increased the area under the roc curves of the two datasets collected from two centers located in United Kingdom separately. Furthermore, ensembles constructed with center 1 enhanced the concordance index of center 2 prediction compared to the model built with a single network. Although the size of the final reduced model using the neural networks and its ensembles is greater than other methods, the model outperformed the others in both concordance index and sensitivity for center 2 prediction. This indicates the reduced model is more powerful for cross center prediction.
Resumo:
Feature selection and feature weighting are useful techniques for improving the classification accuracy of K-nearest-neighbor (K-NN) rule. The term feature selection refers to algorithms that select the best subset of the input feature set. In feature weighting, each feature is multiplied by a weight value proportional to the ability of the feature to distinguish pattern classes. In this paper, a novel hybrid approach is proposed for simultaneous feature selection and feature weighting of K-NN rule based on Tabu Search (TS) heuristic. The proposed TS heuristic in combination with K-NN classifier is compared with several classifiers on various available data sets. The results have indicated a significant improvement in the performance in classification accuracy. The proposed TS heuristic is also compared with various feature selection algorithms. Experiments performed revealed that the proposed hybrid TS heuristic is superior to both simple TS and sequential search algorithms. We also present results for the classification of prostate cancer using multispectral images, an important problem in biomedicine.
Resumo:
Hybrid face recognition, using image (2D) and structural (3D) information, has explored the fusion of Nearest Neighbour classifiers. This paper examines the effectiveness of feature modelling for each individual modality, 2D and 3D. Furthermore, it is demonstrated that the fusion of feature modelling techniques for the 2D and 3D modalities yields performance improvements over the individual classifiers. By fusing the feature modelling classifiers for each modality with equal weights the average Equal Error Rate improves from 12.60% for the 2D classifier and 12.10% for the 3D classifier to 7.38% for the Hybrid 2D+3D clasiffier.
Resumo:
A distinctive feature of Chinese test is that a Chinese document is a sequence of Chinese with no space or boundary between Chinese words. This feature makes Chinese information retrieval more difficult since a retrieved document which contains the query term as a sequence of Chinese characters may not be really relevant to the query since the query term (as a sequence Chinese characters) may not be a valid Chinese word in that documents. On the other hand, a document that is actually relevant may not be retrieved because it does not contain the query sequence but contains other relevant words. In this research, we propose a hybrid Chinese information retrieval model by incorporating word-based techniques with the traditional character-based techniques. The aim of this approach is to investigate the influence of Chinese segmentation on the performance of Chinese information retrieval. Two ranking methods are proposed to rank retrieved documents based on the relevancy to the query calculated by combining character-based ranking and word-based ranking. Our experimental results show that Chinese segmentation can improve the performance of Chinese information retrieval, but the improvement is not significant if it incorporates only Chinese segmentation with the traditional character-based approach.
Resumo:
Opening up a band gap and finding a suitable substrate material are two big challenges for building graphene-based nanodevices. Using state-of-the-art hybrid density functional theory incorporating long range dispersion corrections, we investigate the interface between optically active graphitic carbon nitride (g-C3N4) and electronically active graphene. We find an inhomogeneous planar substrate (g-C3N4) promotes electronrich and hole-rich regions, i.e., forming a well-defined electron−hole puddle, on the supported graphene layer. The composite displays significant charge transfer from graphene to the g-C3N4 substrate, which alters the electronic properties of both components. In particular, the strong electronic coupling at the graphene/g-C3N4 interface opens a 70 meV gap in g-C3N4-supported graphene, a feature that can potentially allow overcoming the graphene’s band gap hurdle in constructing field effect transistors. Additionally, the 2-D planar structure of g-C3N4 is free of dangling bonds, providing an ideal substrate for graphene to sit on. Furthermore, when compared to a pure g-C3N4 monolayer, the hybrid graphene/g-C3N4 complex displays an enhanced optical absorption in the visible region, a promising feature for novel photovoltaic and photocatalytic applications.
Resumo:
This paper presents a new algorithm based on a Hybrid Particle Swarm Optimization (PSO) and Simulated Annealing (SA) called PSO-SA to estimate harmonic state variables in distribution networks. The proposed algorithm performs estimation for both amplitude and phase of each harmonic currents injection by minimizing the error between the measured values from Phasor Measurement Units (PMUs) and the values computed from the estimated parameters during the estimation process. The proposed algorithm can take into account the uncertainty of the harmonic pseudo measurement and the tolerance in the line impedances of the network as well as uncertainty of the Distributed Generators (DGs) such as Wind Turbines (WT). The main feature of proposed PSO-SA algorithm is to reach quickly around the global optimum by PSO with enabling a mutation function and then to find that optimum by SA searching algorithm. Simulation results on IEEE 34 bus radial and a realistic 70-bus radial test networks are presented to demonstrate the speed and accuracy of proposed Distribution Harmonic State Estimation (DHSE) algorithm is extremely effective and efficient in comparison with the conventional algorithms such as Weight Least Square (WLS), Genetic Algorithm (GA), original PSO and Honey Bees Mating Optimization (HBMO) algorithm.
Resumo:
Spontaneous mutation: In 1996, vegetative material (later designated ‘TL2’) taken from a disease resistant mutant plant on the fifteenth green at Novotel Palm Cove resort course near Cairns was included an on-going program of selection and testing of promising ‘Tifgreen’ mutants by Tropical Lawns Pty Ltd. Selection criteria: healthy vigorous growth during the tropical wet season, dense fine-textured appearance under close mowing, and dark green leaves. In subsequent trials, ‘TL2’ was identified as the outstanding plant among selections of mutant ‘Tifgreen’ genotypes from other north Queensland sites in terms of colour, texture and density for greens use. Propagation: vegetative. Breeder: Terry Anderlini, Gordonvale, QLD. PBR Certificate Number 2639, Application Number 2002/268, granted 24 February 2005.
Resumo:
Part classification and coding is still considered as laborious and time-consuming exercise. Keeping in view, the crucial role, which it plays, in developing automated CAPP systems, the attempts have been made in this article to automate a few elements of this exercise using a shape analysis model. In this study, a 24-vector directional template is contemplated to represent the feature elements of the parts (candidate and prototype). Various transformation processes such as deformation, straightening, bypassing, insertion and deletion are embedded in the proposed simulated annealing (SA)-like hybrid algorithm to match the candidate part with their prototype. For a candidate part, searching its matching prototype from the information data is computationally expensive and requires large search space. However, the proposed SA-like hybrid algorithm for solving the part classification problem considerably minimizes the search space and ensures early convergence of the solution. The application of the proposed approach is illustrated by an example part. The proposed approach is applied for the classification of 100 candidate parts and their prototypes to demonstrate the effectiveness of the algorithm. (C) 2003 Elsevier Science Ltd. All rights reserved.
Resumo:
Outlier detection in high dimensional categorical data has been a problem of much interest due to the extensive use of qualitative features for describing the data across various application areas. Though there exist various established methods for dealing with the dimensionality aspect through feature selection on numerical data, the categorical domain is actively being explored. As outlier detection is generally considered as an unsupervised learning problem due to lack of knowledge about the nature of various types of outliers, the related feature selection task also needs to be handled in a similar manner. This motivates the need to develop an unsupervised feature selection algorithm for efficient detection of outliers in categorical data. Addressing this aspect, we propose a novel feature selection algorithm based on the mutual information measure and the entropy computation. The redundancy among the features is characterized using the mutual information measure for identifying a suitable feature subset with less redundancy. The performance of the proposed algorithm in comparison with the information gain based feature selection shows its effectiveness for outlier detection. The efficacy of the proposed algorithm is demonstrated on various high-dimensional benchmark data sets employing two existing outlier detection methods.
Resumo:
In the 17th and 18th centuries, the European hunting horn was an instrument associated with the thrill of the chase, rustic scenes, and the pageantry of royal hunts. The sound of this horn and the music that evolved from its heritage would inspire composers writing for the modern horn. TALLY HOrn! Echoes of Hoofbeats and Haunts: Recorded Selections for Horn and Piano are two compact discs of works for horn and piano as well as solo horn reflecting this association with the hunt and include additional works of expressive contrast – lyrical pieces and nocturnes. Nine of the pieces reflect the modern horn’s beginnings and they are Abbot’s Alla Caccia, Berge’s Horn-Lokk, Boutry’s Chassacor, Bozza’s En Fôret, Büsser’s La Chasse de St. Hubert, Pessard’s In the Forest, Piantoni’s Air de Chasse, Schmid’s Im Tiefsten Walde, and Vinter’s Hunter’s Moon. By contrast, three of the selections, Gliére’s Nocturne, Strauss’ Nocturno, and Van Ecchaute’s Nachtpoëma are nocturnes that showcase the expressive quality of the horn. In addition, Bush’s Autumn Poem, Bacon’s Song After the Rain, and Webber’s Summer Pastures are included for their lyric qualities and evocations of nature. The largest work of the project is John Williams’s Concerto for Horn. Its five movements feature both hunting horn traits and expressive qualities. Alejandro Hernandez-Valdez and Grace Cho were the two pianists engaged for the recording. The recording engineer was Edward Kelly. The producer was Carlos Rodriquez. The pieces were recorded at the Spencerville Seventh-day Adventist Church sanctuary in Spencerville, Maryland between November 2010 and February 2011.