773 resultados para Traditional clustering


Relevância:

20.00% 20.00%

Publicador:

Resumo:

Performance comparisons between File Signatures and Inverted Files for text retrieval have previously shown several significant shortcomings of file signatures relative to inverted files. The inverted file approach underpins most state-of-the-art search engine algorithms, such as Language and Probabilistic models. It has been widely accepted that traditional file signatures are inferior alternatives to inverted files. This paper describes TopSig, a new approach to the construction of file signatures. Many advances in semantic hashing and dimensionality reduction have been made in recent times, but these were not so far linked to general purpose, signature file based, search engines. This paper introduces a different signature file approach that builds upon and extends these recent advances. We are able to demonstrate significant improvements in the performance of signature file based indexing and retrieval, performance that is comparable to that of state of the art inverted file based systems, including Language models and BM25. These findings suggest that file signatures offer a viable alternative to inverted files in suitable settings and positions the file signatures model in the class of Vector Space retrieval models.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

This paper describes the concept of innovation strategies for traditional souvenir craft industries. There are many traditional souvenir craft industries in Indonesia, and they have to compete in today‘s global markets. The craftsmanship and uniqueness of traditional crafts must be developed to attract a larger market. This competition is not easy for craftspeople, neither financially nor culturally. The authors propose some innovation strategies to facilitate craftspeople in generating ideas based on their traditional value, to ensure their sustainability in global context. However, even though there are a number of studies about the craft industry and souvenirs, there is little research focused on the souvenir product development process, especially in the traditional craft industry. Considering that souvenirs are products for pleasure which require hedonic value more than utilitarian value, the offered innovation strategy refers to the strategy applied in existing industries that produce hedonic products. Innovation strategy in the fashion industry is selected as a reference, which is discussed by considering the context of the traditional souvenir craft industry. This investigation will support further research about knowledge sharing systems to enable collaborative learning within traditional craftspeople.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Microwave heating technology is a cost-effective alternative way for heating and curing of used in polymer processing of various alternate materials. The work presented in this paper addresses the attempts made by the authors to study the glass transition temperature and curing of materials such as casting resins R2512, R2515 and laminating resin GPR 2516 in combination with two hardeners ADH 2403 and ADH 2409. The magnetron microwave generator used in this research is operating at a frequency of 2.45 GHz with a hollow rectangular waveguide. During this investigation it has been noted that microwave heated mould materials resulted with higher glass transition temperatures and better microstructure. It also noted that Microwave curing resulted in a shorter curing time to reach the maximum percentage cure. From this study it can be concluded that microwave technology can be efficiently and effectively used to cure new generation alternate polymer materials for manufacture of injection moulds in a rapid and efficient manner. Microwave curing resulted in a shorter curing time to reach the maximum percentage cure.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

This paper proposes the use of eigenvoice modeling techniques with the Cross Likelihood Ratio (CLR) as a criterion for speaker clustering within a speaker diarization system. The CLR has previously been shown to be a robust decision criterion for speaker clustering using Gaussian Mixture Models. Recently, eigenvoice modeling techniques have become increasingly popular, due to its ability to adequately represent a speaker based on sparse training data, as well as an improved capture of differences in speaker characteristics. This paper hence proposes that it would be beneficial to capitalize on the advantages of eigenvoice modeling in a CLR framework. Results obtained on the 2002 Rich Transcription (RT-02) Evaluation dataset show an improved clustering performance, resulting in a 35.1% relative improvement in the overall Diarization Error Rate (DER) compared to the baseline system.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Dealing with product yield and quality in manufacturing industries is getting more difficult due to the increasing volume and complexity of data and quicker time to market expectations. Data mining offers tools for quick discovery of relationships, patterns and knowledge in large databases. Growing self-organizing map (GSOM) is established as an efficient unsupervised datamining algorithm. In this study some modifications to the original GSOM are proposed for manufacturing yield improvement by clustering. These modifications include introduction of a clustering quality measure to evaluate the performance of the programme in separating good and faulty products and a filtering index to reduce noise from the dataset. Results show that the proposed method is able to effectively differentiate good and faulty products. It will help engineers construct the knowledge base to predict product quality automatically from collected data and provide insights for yield improvement.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Sutchi catfish (Pangasianodon hypophthalmus) – known more universally by the Vietnamese name ‘Tra’ is an economically important freshwater fish in the Mekong Delta in Vietnam that constitutes an important food resource. Artificial propagation technology for Tra catfish has only recently been developed along the main branches of the Mekong River where more than 60% of the local human population participate in fishing or aquaculture. Extensive support for catfish culture in general, and that of Tra (P. hypophthalmus) in particular, has been provided by the Vietnamese government to increase both the scale of production and to develop international export markets. In 2006, total Vietnamese catfish exports reached approximately 286,602 metric tons (MT) and were valued at 736.87 $M with a number of large new export destinations being developed. Total value of production from catfish culture has been predicted to increase to approximately USD 1 billion by 2020. While freshwater catfish culture in Vietnam has a promising future, concerns have been raised about long-term quality of fry and the effectiveness of current brood stock management practices, issues that have been largely neglected to date. In this study, four DNA markers (microsatellite loci: CB4, CB7, CB12 and CB13) that were developed specifically for Tra (P. hypophthalmus) in an earlier study were applied to examine the genetic quality of artificially propagated Tra fry in the Mekong Delta in Vietnam. The goals of the study were to assess: (i) how well available levels of genetic variation in Tra brood stock used for artificial propagation in the Mekong Delta of Vietnam (breeders from three private hatcheries and Research Institute of Aquaculture No2 (RIA2) founders) has been conserved; and (ii) whether or not genetic diversity had declined significantly over time in a stock improvement program for Tra catfish at RIA2. A secondary issue addressed was how genetic markers could best be used to assist industry development. DNA was extracted from fins of catfish collected from the two main branches of the Mekong River inf Vietnam, three private hatcheries and samples from the Tra improvement program at RIA2. Study outcomes: i) Genetic diversity estimates for Tra brood stock samples were similar to, and slightly higher than, wild reference samples. In addition, the relative contribution by breeders to fry in commercial private hatcheries strongly suggest that the true Ne is likely to be significantly less than the breeder numbers used; ii) in a stock improvement program for Tra catfish at RIA2, no significant differences were detected in gene frequencies among generations (FST=0.021, P=0.036>0.002 after Bonferroni correction); and only small differences were observed in alleles frequencies among sample populations. To date, genetic markers have not been applied in the Tra catfish industry, but in the current project they were used to evaluate the levels of genetic variation in the Tra catfish selective breeding program at RIA2 and to undertake genetic correlations between genetic marker and trait variation. While no associations were detected using only four loci, they analysis provided training in the practical applications of the use of molecular markers in aquaculture in general, and in Tra culture, in particular.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Road asset managers are overwhelmed with a high volume of raw data which they need to process and utilise in supporting their decision making. This paper presents a method that processes road-crash data of a whole road network and exposes hidden value inherent in the data by deploying the clustering data mining method. The goal of the method is to partition the road network into a set of groups (classes) based on common data and characterise the class crash types to produce a crash profiles for each cluster. By comparing similar road classes with differing crash types and rates, insight can be gained into these differences that are caused by the particular characteristics of their roads. These differences can be used as evidence in knowledge development and decision support.

Relevância:

20.00% 20.00%

Publicador:

Relevância:

20.00% 20.00%

Publicador:

Resumo:

This paper proposes an innovative instance similarity based evaluation metric that reduces the search map for clustering to be performed. An aggregate global score is calculated for each instance using the novel idea of Fibonacci series. The use of Fibonacci numbers is able to separate the instances effectively and, in hence, the intra-cluster similarity is increased and the inter-cluster similarity is decreased during clustering. The proposed FIBCLUS algorithm is able to handle datasets with numerical, categorical and a mix of both types of attributes. Results obtained with FIBCLUS are compared with the results of existing algorithms such as k-means, x-means expected maximization and hierarchical algorithms that are widely used to cluster numeric, categorical and mix data types. Empirical analysis shows that FIBCLUS is able to produce better clustering solutions in terms of entropy, purity and F-score in comparison to the above described existing algorithms.