939 results for Information retrieval - Australia


Relevance:

100.00% 100.00%

Publisher:

Abstract:

In any data mining application, automated retrieval of text, and of text and images, is needed. This becomes essential with the growth of the Internet and digital libraries. Our approach is based on latent semantic indexing (LSI) and the corresponding term-by-document matrix suggested by Berry and his co-authors. Instead of using deterministic methods to find the required number of first "k" singular triplets, we propose a stochastic approach. First, we use a Monte Carlo method to sample and build a much smaller term-by-document matrix (e.g. a k x k matrix), from which we then find the first "k" triplets using standard deterministic methods. Second, we investigate how we can reduce the problem to finding the "k" largest eigenvalues using parallel Monte Carlo methods. We apply these methods to the initial matrix and also to the reduced one. The algorithms run on a cluster of workstations under MPI. Results of experiments arising in textual retrieval of Web documents, as well as a comparison of the proposed stochastic methods, are presented. (C) 2003 IMACS. Published by Elsevier Science B.V. All rights reserved.
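The reduction from singular triplets to eigenvalues mentioned in the abstract rests on a standard fact: the singular values of a term-by-document matrix A are the square roots of the eigenvalues of A^T A. The sketch below illustrates only that relationship. It uses plain power iteration, a deterministic stand-in, and is not the parallel Monte Carlo estimator the authors describe; the toy matrix and all function names are assumptions made for illustration.

```python
import math

def matvec(m, v):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(a * b for a, b in zip(row, v)) for row in m]

def gram(a):
    """Return A^T A for a term-by-document matrix given as a list of rows."""
    cols = list(zip(*a))
    return [[sum(x * y for x, y in zip(ci, cj)) for cj in cols] for ci in cols]

def largest_eigenpair(m, iterations=200):
    """Power iteration: a deterministic stand-in, not the paper's stochastic method."""
    v = [1.0] * len(m)
    for _ in range(iterations):
        w = matvec(m, v)
        norm = math.sqrt(sum(x * x for x in w))
        v = [x / norm for x in w]
    # Rayleigh quotient of the normalised vector estimates the eigenvalue.
    eigenvalue = sum(x * y for x, y in zip(v, matvec(m, v)))
    return eigenvalue, v

# Hypothetical toy matrix: rows are terms, columns are documents.
A = [
    [2.0, 0.0],
    [0.0, 1.0],
    [0.0, 0.0],
]

eigenvalue, vector = largest_eigenpair(gram(A))
sigma_1 = math.sqrt(eigenvalue)  # largest singular value of A
```

For further singular values one would deflate the matrix and repeat; a stochastic estimator would replace `largest_eigenpair` while leaving the rest of the pipeline unchanged.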

Relevance:

100.00% 100.00%

Publisher:

Abstract:

In recent years, we have witnessed many developmental trends in information security. As a consequence, the dimensions of information security, once a single disciplinary area, have become multifaceted and convoluted. This paper aims to (1) recapitulate these key developments; (2) argue that the emergence of many complex information security dimensions is the result of 'constant change agents' (CCAs); (3) discuss the implications for Australia's society, i.e. government, companies and individuals; and (4) propose key consideration areas and possible solutions. We hope that the discussion presented here will position Australia to make better aligned information security and strategic plans, such as choosing appropriate investments and adopting effective solutions to strengthen and secure Australia's national information security posture.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

This study investigates information literacy and scholarly communication within the processes of doctoral research and supervision at a distance. Both doctoral candidates and supervisors acknowledge information literacy deficiencies and it is suggested that disintermediation and the proliferation of information may contribute to those deficiencies. Further to this, the influence of pedagogic continuity—particularly in relation to the information seeking behaviour of candidates—is investigated, as is the concomitant aspect of how doctoral researchers practise scholarly communication. The well-documented and enduring problem for candidates of isolation from the research cultures of their universities is also scrutinised. The contentious issue of more formally involving librarians in the doctoral process is also considered, from the perspective of candidates and supervisors. Superimposed upon these topical and timely issues is the theoretical framework of adult learning theory, in particular the tenets of andragogy. The pedagogical-andragogical orientation of candidates and supervisors is established, demonstrating both the differences and similarities between candidates and supervisors, as are a number of independent variables, including a comparison of on-campus and off-campus candidates. Other independent variables include age, gender, DETYA (Department of Education, Training & Youth Affairs) category, enrolment type, stage of candidature, employment and status, type of doctorate, and English/non-English speaking background. The research methodology uses qualitative and quantitative techniques encompassing both data and methodological triangulation. The study uses two sets of questionnaires and a series of in-depth interviews with a sample of on-campus and off-campus doctoral candidates and supervisors from four Australian universities. 
Major findings include that NESB candidates are more pedagogical than their ESB counterparts, and that candidates and supervisors from the Sciences are more pedagogical than those from Arts, Humanities and Social Sciences, or Education. Candidates make a transition from a more dependent and pedagogically oriented approach to learning towards a more independent and andragogical orientation over the duration of their candidature. However, over time both on-campus and off-campus candidates become more isolated from the research cultures of their universities, and less happy with the support received from their supervisors in relation to their literature reviews. The study found large discrepancies in perception between the support supervisors believed they gave to candidates in relation to the literature review, and the support candidates believed they received. Information seeking becomes easier over time, but candidates face a dilemma with the proliferation of information, suggesting that disintermediation has exacerbated the challenges of evaluating and organising information. The concept of pedagogic continuity was recognised by supervisors and especially by candidates, as both a negative and a positive influence. The findings are critically analysed and synthesised using the metaphor of a scholarly 'Club', for which obtaining a doctorate is a rite of passage. Recommendations are made for changes in professional practice, and topics that may warrant further research are suggested.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

In this thesis, the author designed three sets of preference-based ranking algorithms for information retrieval and provided corresponding applications for the algorithms. The main goal is to return recommended, highly similar and valuable ranked results to users.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

An online transaction always retrieves a large amount of information before making decisions. Currently, the parallel methods for retrieving such information provide performance only similar to that of serial methods. In this paper we first perform an analysis to determine the factors that affect the performance of existing methods, i.e., HQR and EHQR, and show that several of these factors are not considered by these methods. Motivated by this, we propose a new dispatch scheme called AEHQR, which takes into account the features of parallel dispatching. In addition, we provide cost models that determine the optimal performance achievable by any parallel dispatching method. Using experimental comparison, we illustrate that AEHQR significantly outperforms HQR and EHQR under all conditions.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

The central problem of automatic retrieval from unformatted text is that computational devices are not adequately trained to look for associated information. However, complete understanding and information retrieval would require a complete artificial intelligence to be built. This paper describes a method for achieving significant information retrieval by using a semantic search engine. The underlying semantic information is stored in a network of clarified words, linked by logical connections. We employ simple scoring techniques on collections of paths in this network to establish a degree of relevance between a document and a clarified search criterion. This technique has been applied with success to test examples and can be easily scaled up to search large documents.
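The abstract does not say how paths in the word network are scored, so the following is only a minimal sketch of one plausible reading: words are nodes, links are undirected edges, and a document scores higher when its words sit fewer links away from the search terms. The network contents, the 1 / (1 + path length) weighting, and every name below are assumptions made for illustration.

```python
from collections import deque

# Hypothetical network: each word maps to the words it is linked to.
NETWORK = {
    "car": {"vehicle", "engine"},
    "vehicle": {"car", "truck"},
    "truck": {"vehicle"},
    "engine": {"car"},
    "banana": {"fruit"},
    "fruit": {"banana"},
}

def path_length(start, goal):
    """Breadth-first search; returns the number of links, or None if unreachable."""
    if start == goal:
        return 0
    seen = {start}
    queue = deque([(start, 0)])
    while queue:
        word, dist = queue.popleft()
        for neighbour in NETWORK.get(word, ()):
            if neighbour == goal:
                return dist + 1
            if neighbour not in seen:
                seen.add(neighbour)
                queue.append((neighbour, dist + 1))
    return None

def relevance(query_terms, document_words):
    """Sum, over query terms, the best 1 / (1 + path length) to any document word."""
    total = 0.0
    for term in query_terms:
        best = 0.0
        for word in document_words:
            dist = path_length(term, word)
            if dist is not None:
                best = max(best, 1.0 / (1 + dist))
        total += best
    return total
```

With this toy network, a document containing "truck" receives a non-zero score for the query term "car" (two links away), while a document containing only "banana" scores zero because no path connects the two words.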

Relevance:

100.00% 100.00%

Publisher:

Abstract:

The central objective of research in Information Retrieval (IR) is to discover new techniques to retrieve relevant information in order to satisfy an Information Need. The Information Need is satisfied when relevant information can be provided to the user. In IR, relevance is a fundamental concept which has changed over time, from popular to personal: what was previously considered relevant was information for the whole population, whereas what is considered relevant now is information specific to each user. Hence, there is a need to connect the behavior of the system to the condition of a particular person and his or her social context; from this need an interdisciplinary sector called Human-Centered Computing was born. For the modern search engine, the information extracted for the individual user is crucial. According to Personalized Search (PS), two different techniques are necessary to personalize a search: contextualization (interconnected conditions that occur in an activity) and individualization (characteristics that distinguish an individual). This shift of focus to the individual's need undermines the rigid linearity of the classical model, which has been overtaken by the "berry picking" model. That model explains that search terms change thanks to the informational feedback received from the search activity, introducing the concept of the evolution of search terms. The development of Information Foraging theory, which observed the correlations between animal foraging and human information foraging, also contributed to this transformation through attempts to optimize the cost-benefit ratio. This thesis arose from the need to satisfy human individuality when searching for information, and it develops a synergistic collaboration between the frontiers of technological innovation and the recent advances in IR.
The search method developed exploits what is relevant for the user by radically changing the way in which an Information Need is expressed: it is now expressed through the generation of the query and its own context. The method was conceived with the aim of improving the quality of search by rewriting the query based on contexts automatically generated from a local knowledge base. Furthermore, the idea of optimizing any IR system led to developing the method as middleware between the user and the IR system. The system therefore has just two possible actions: rewriting the query and reordering the results. Equivalent actions are described in the PS approach, which generally exploits information derived from the analysis of user behavior, while the proposed approach exploits knowledge provided by the user. The thesis went further and produced a novel assessment procedure, following the "Cranfield paradigm", to evaluate this type of IR system. The results achieved are interesting considering both the effectiveness achieved and the innovative approach undertaken, together with the several applications inspired by the use of a local knowledge base.
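The two middleware actions named in the abstract, rewriting the query and reordering the results, can be sketched as follows. The thesis does not specify its rewriting or reordering rules here, so this is only an assumed illustration: the knowledge base is a hypothetical dictionary of user-supplied context terms, rewriting is simple term expansion, and reordering counts context-term overlap. None of the names below come from the source.

```python
# Hypothetical local knowledge base: query term -> context terms supplied by the user.
KNOWLEDGE_BASE = {
    "jaguar": ["car", "engine"],
}

def rewrite_query(query, knowledge_base):
    """Append context terms from the knowledge base to the query (assumed rule)."""
    terms = query.lower().split()
    expanded = list(terms)
    for term in terms:
        for context_term in knowledge_base.get(term, []):
            if context_term not in expanded:
                expanded.append(context_term)
    return " ".join(expanded)

def reorder_results(results, context_terms):
    """Rank results by how many context terms each one contains (assumed rule)."""
    def overlap(text):
        words = set(text.lower().split())
        return sum(1 for term in context_terms if term in words)
    # sorted() is stable, so results with equal overlap keep the IR system's order.
    return sorted(results, key=overlap, reverse=True)

rewritten = rewrite_query("Jaguar speed", KNOWLEDGE_BASE)
reranked = reorder_results(
    ["jaguar the big cat", "jaguar car with a new engine"],
    KNOWLEDGE_BASE["jaguar"],
)
```

Because both functions sit outside the IR system, they can wrap any search backend: the rewritten query is sent to it, and the returned list is passed through `reorder_results` before being shown to the user.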

Relevance:

100.00% 100.00%

Publisher: