948 resultados para Query-by-example
Resumo:
Biological experiments often produce enormous amount of data, which are usually analyzed by data clustering. Cluster analysis refers to statistical methods that are used to assign data with similar properties into several smaller, more meaningful groups. Two commonly used clustering techniques are introduced in the following section: principal component analysis (PCA) and hierarchical clustering. PCA calculates the variance between variables and groups them into a few uncorrelated groups or principal components (PCs) that are orthogonal to each other. Hierarchical clustering is carried out by separating data into many clusters and merging similar clusters together. Here, we use an example of human leukocyte antigen (HLA) supertype classification to demonstrate the usage of the two methods. Two programs, Generating Optimal Linear Partial Least Square Estimations (GOLPE) and Sybyl, are used for PCA and hierarchical clustering, respectively. However, the reader should bear in mind that the methods have been incorporated into other software as well, such as SIMCA, statistiXL, and R.
Resumo:
The expansion of the Internet has made the task of searching a crucial one. Internet users, however, have to make a great effort in order to formulate a search query that returns the required results. Many methods have been devised to assist in this task by helping the users modify their query to give better results. In this paper we propose an interactive method for query expansion. It is based on the observation that documents are often found to contain terms with high information content, which can summarise their subject matter. We present experimental results, which demonstrate that our approach significantly shortens the time required in order to accomplish a certain task by performing web searches.
Resumo:
Very often the experimental data are the realization of the process, fully determined by some unknown function, being distorted by hindrances. Treatment and experimental data analysis are substantially facilitated, if these data to represent as analytical expression. The experimental data processing algorithm and the example of using this algorithm for spectrographic analysis of oncologic preparations of blood is represented in this article.
Resumo:
Similar to Genetic algorithm, Evolution strategy is a process of continuous reproduction, trial and selection. Each new generation is an improvement on the one that went before. This paper presents two different proposals based on the vector space model (VSM) as a traditional model in information Retrieval (TIR). The first uses evolution strategy (ES). The second uses the document centroid (DC) in query expansion technique. Then the results are compared; it was noticed that ES technique is more efficient than the other methods.
Resumo:
∗ Supported by Research grants GAUK 190/96 and GAUK 1/1998
Resumo:
∗ The first named author’s research was partially supported by GAUK grant no. 350, partially by the Italian CNR. Both supports are gratefully acknowledged. The second author was supported by funds of Italian Ministery of University and by funds of the University of Trieste (40% and 60%).
Resumo:
Usually, generalization is considered as a function of learning from a set of examples. In present work on the basis of recent neural network assembly memory model (NNAMM), a biologically plausible 'grandmother' model for vision, where each separate memory unit itself can generalize, has been proposed. For such a generalization by computation through memory, analytical formulae and numerical procedure are found to calculate exactly the perfectly learned memory unit's generalization ability. The model's memory has complex hierarchical structure, can be learned from one example by a one-step process, and may be considered as a semi-representational one. A simple binary neural network for bell-shaped tuning is described.
Resumo:
The paper treats the task for cluster analysis of a given assembly of objects on the basis of the information contained in the description table of these objects. Various methods of cluster analysis are briefly considered. Heuristic method and rules for classification of the given assembly of objects are presented for the cases when their division into classes and the number of classes is not known. The algorithm is checked by a test example and two program products (PP) – learning systems and software for company management. Analysis of the results is presented.
Resumo:
2000 Mathematics Subject Classification: 62P10, 92C20
Resumo:
2000 Mathematics Subject Classification: 35L05, 35P25, 47A40.
Resumo:
This research focuses on automatically adapting a search engine size in response to fluctuations in query workload. Deploying a search engine in an Infrastructure as a Service (IaaS) cloud facilitates allocating or deallocating computer resources to or from the engine. Our solution is to contribute an adaptive search engine that will repeatedly re-evaluate its load and, when appropriate, switch over to a dierent number of active processors. We focus on three aspects and break them out into three sub-problems as follows: Continually determining the Number of Processors (CNP), New Grouping Problem (NGP) and Regrouping Order Problem (ROP). CNP means that (in the light of the changes in the query workload in the search engine) there is a problem of determining the ideal number of processors p active at any given time to use in the search engine and we call this problem CNP. NGP happens when changes in the number of processors are determined and it must also be determined which groups of search data will be distributed across the processors. ROP is how to redistribute this data onto processors while keeping the engine responsive and while also minimising the switchover time and the incurred network load. We propose solutions for these sub-problems. For NGP we propose an algorithm for incrementally adjusting the index to t the varying number of virtual machines. For ROP we present an ecient method for redistributing data among processors while keeping the search engine responsive. Regarding the solution for CNP, we propose an algorithm determining the new size of the search engine by re-evaluating its load. We tested the solution performance using a custom-build prototype search engine deployed in the Amazon EC2 cloud. Our experiments show that when we compare our NGP solution with computing the index from scratch, the incremental algorithm speeds up the index computation 2{10 times while maintaining a similar search performance. The chosen redistribution method is 25% to 50% faster than other methods and reduces the network load around by 30%. For CNP we present a deterministic algorithm that shows a good ability to determine a new size of search engine. When combined, these algorithms give an adapting algorithm that is able to adjust the search engine size with a variable workload.
Resumo:
A hálózatos iparágakban, ahogy a postai szolgáltatásoknál is, a forgalomban lévő készpénz nagyméretű működőtőkét jelenthet. A Magyar Posta a levél- és csomagkézbesítésen kívül jelentős készpénzforgalmat bonyolít le: nyugdíjakat, segélyeket és készpénz-átutalási megbízásokat továbbít. A forgalom napi ingadozása a vállalat likvideszköz-igényét jelentősen meghatározza. A posta esetében a postahivatalok készpénzgazdálkodása jól működő hüvelykujjszabályokon keresztül történik, ezek a szabályok döntési teret hagynak a hálózat heterogén egyedi szereplőinek. Az egyedi készletezési viselkedést a vállalati működőtőke meghatározásakor figyelembe kell venni. A tanulmány az egyedi készletezési szokások modellezésére új módszertant ajánl, majd a viselkedésmintákat csoportosítva a pénzkészletezésnek, a vállalati működőtőke szintjének és a vállalati likviditási pozíciónak a kapcsolatát elemzi. / === / The cash in circulation within network industries such as post-office services can repre-sent a sizeable quantity of operating capital. The Hungarian Post Office, besides han-dling mail, handles a significant amount of cash turnover, forwarding pensions, welfare benefits, and cash orders. Fluctuation in the daily volume of these is a strong factor in determining the company's liquidity requirements. The management of cash in post of-fices is governed by rules of thumb that operate well; the regulations leave decision-making scope for the diverse individual actors in the network. Attention has to be paid to individual cash holding when determining the corporate operating capital. The study suggests a new methodology for modelling the individual cash-holding habits, and goes on to group the behaviour patterns by analysing the connection between cash holding, level of corporate operating capital, and corporate liquidity position.
Resumo:
Ez a tanulmány a projektvezetési szakirodalomban kialakult ismeretanyagot szem előtt tartva (noha tételesen nem hivatkozva arra) tárja fel azt a sajátos és tipikusnak nevezhető kontextust, amelyben a projektalapú szervezetek projektmarketing tevékenysége megnyilvánul. A tanulmány célja tehát nem magának a projektmarketingnek a kérdéskörére irányul, hanem elsősorban annak projektspecifikus kontextusára. Jellegét illetően a tanulmány spekulatív jellegű, vagyis lényegét tekintve nem empirikus kutatási eredményekből levont következtetésekre épül. _____ The author analyses the cognitive level of individual decisions by placing the adaptive decision-maker in the centre of interest. The main question is how do adaptive processes evolve and what factors determine the adaptive mechanism. The author builds on his own qualitative study conducted with the Grounded Theory Methodology in the SME sector. The supplier selection decision is chosen from the wide range of business decisions. From the research results the two elements of the adaptive mechanism – the metastructure and the attitude set –, the process of their evolution and the factors determining this process are presented. The findings here are a middle-range theory, which can be elaborated further, but they provide some interesting insights already.
Resumo:
Az államigazgatásban – itthon és külföldön is – a projektek jelentős százaléka időben csúszik, nem azt eredményezi, amit eredetileg elvártak, a szakmai résztvevők szerint túladminisztrált, a munkatársak tevékenysége nem áttekinthető. Ezeknek a problémáknak a nagy része a projektszervezet és a hierarchikusfunkcionális- hivatali szervezet egymás mellett éléséből és a nehezen szinkronizálható együttműködésből fakad. A cikkben egy, a gyakorlatban bevált módszertant mutat be a szerző, amely adott feltételrendszer mellett nagymértékben kiküszöböli a fent említett hiányosságokat és a szervezet napi működésébe illeszkedő tevékenységek sorozatára vezeti vissza a projekttevékenységeket. A módszer egy gyakorlati problémából – a volt APEH-es és VP-s rendszerek integrálása a NAV-ba – indult ki, azonban a szerző véleménye szerint alkalmazható más, funkcionális alapokon felépülő szervezetnél is. _____ The high percentage of public sector projects slips in time, the result is not that what was expected initially, those are overadministrated by according to the professional participants’ opinion, and the activity of staff does not clear. In this article the author describes a best practice methodology, which led the project activities to series of activities which fit to the organization’s daily operations. The method started from a practical problem, but according to the author’s opinion it can be applied to other structured functional basis organizations.
Resumo:
In the last decade, non-technological and particularly organisational innovations have gained more and more importance and research focus. However, there is no consensus among the academic community either about the definition or about the broader theoretical and methodological foundations of this phenomena. In the present study the authors intend to partly improve this knowledge deficiency syndrome by analysing the most important theoretical contributions of organisational innovation and by reviewing the development in the methodological tools aimed to measure organisational innovation on a European level. By doing so, the authors will focus on the various waves of the Community Innovation Survey (CIS) as an employer-oriented and of the European Working Conditions Survey (EWCS) as an employee-oriented survey. Finally, they will formulate some remarks for further empirical research streams.