924 resultados para NUDIST (Information retrieval system)


Relevância:

100.00% 100.00%

Publicador:

Resumo:

Retrieving information from Twitter is always challenging due to its large volume, inconsistent writing and noise. Most existing information retrieval (IR) and text mining methods focus on term-based approach, but suffers from the problems of terms variation such as polysemy and synonymy. This problem deteriorates when such methods are applied on Twitter due to the length limit. Over the years, people have held the hypothesis that pattern-based methods should perform better than term-based methods as it provides more context, but limited studies have been conducted to support such hypothesis especially in Twitter. This paper presents an innovative framework to address the issue of performing IR in microblog. The proposed framework discover patterns in tweets as higher level feature to assign weight for low-level features (i.e. terms) based on their distributions in higher level features. We present the experiment results based on TREC11 microblog dataset and shows that our proposed approach significantly outperforms term-based methods Okapi BM25, TF-IDF and pattern based methods, using precision, recall and F measures.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Due to the development of XML and other data models such as OWL and RDF, sharing data is an increasingly common task since these data models allow simple syntactic translation of data between applications. However, in order for data to be shared semantically, there must be a way to ensure that concepts are the same. One approach is to employ commonly usedschemas—called standard schemas —which help guarantee that syntactically identical objects have semantically similar meanings. As a result of the spread of data sharing, there has been widespread adoption of standard schemas in a broad range of disciplines and for a wide variety of applications within a very short period of time. However, standard schemas are still in their infancy and have not yet matured or been thoroughly evaluated. It is imperative that the data management research community takes a closer look at how well these standard schemas have fared in real-world applications to identify not only their advantages, but also the operational challenges that real users face. In this paper, we both examine the usability of standard schemas in a comparison that spans multiple disciplines, and describe our first step at resolving some of these issues in our Semantic Modeling System. We evaluate our Semantic Modeling System through a careful case study of the use of standard schemas in architecture, engineering, and construction, which we conducted with domain experts. We discuss how our Semantic Modeling System can help the broader problem and also discuss a number of challenges that still remain.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Vietnam is currently undergoing a metamorphosis from a relatively closed society with a centrally planned economy, to a rapidly urbanising one with a global outlook. These changes have been the catalyst for an exciting ferment of activity in popular culture. This volume contains contributions from scholars engaged in the most up-to-date social research in Vietnam, as well as some of Vietnam's most popular cultural producers who are forging new ways of imagining the present whilst at the same time engaging actively in reinterpreting the past. The diverse ways that Vietnam is culturally and socially negotiating the future are examined as the book addresses issues of indigenisation of cultural influences, ambivalence surrounding change, and the consistent blurring of boundaries between informal, non-state cultural activities and formal institutional structures in the evolution of a civil society in Vietnam.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

This book develops tools and techniques that will help urban residents gain access to urban computing. Metaphorically speaking, it is taking computing to the street by giving the general public – rather than just researchers and professionals – the power to leverage available city infrastructure and create solutions tailored to their individual needs. It brings together five chapters that are based on presentations given at the Street Computing Workshop held on 24 November 2009 in Melbourne in conjunction with the Australian Computer-Human Interaction Conference (OZCHI 2009). This book focuses on applying urban informatics, urban and community sensing and open application programming interfaces (APIs) to the public space through the delivery of online services, on demand and in real time. It then offers a case study of how the city of Singapore has harnessed the potential of an online infrastructure so that residents and visitors can access services electronically. This book was published as a special issue of the Journal of Urban Technology, 19(2), 2012.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

With the explosive growth of resources available through the Internet, information mismatching and overload have become a severe concern to users. Web users are commonly overwhelmed by huge volume of information and are faced with the challenge of finding the most relevant and reliable information in a timely manner. Personalised information gathering and recommender systems represent state-of-the-art tools for efficient selection of the most relevant and reliable information resources, and the interest in such systems has increased dramatically over the last few years. However, web personalization has not yet been well-exploited; difficulties arise while selecting resources through recommender systems from a technological and social perspective. Aiming to promote high quality research in order to overcome these challenges, this paper provides a comprehensive survey on the recent work and achievements in the areas of personalised web information gathering and recommender systems. The report covers concept-based techniques exploited in personalised information gathering and recommender systems.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

This paper addresses the problem of scheduling a cane transport system involving both rail transport and road transport, where the road transport operates from several sidings in the rail network. An iterative approach for scheduling the rail transport system has been developed using existing rail transport scheduling tools. The assumption that harvesters serviced by road transport are effectively operating from the rail siding from which their bins are supplied seems a reasonable starting point for the analysis. There is a need to manually modify the schedule to take into account the road transport schedule to ensure that full bins are not collected before the road transport system delivers them back to the rail siding.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

This chapter introduces the changing role of copyright in China from a historical perspective. It begins by briefly tracing the history of copyright, from a censorship-related system associated with the emergence of the printing press in imperial China, through modernisation during the Republican period, abolition under communism and finally to the introduction of the People's Republic of China's (PRC) first copyright law in 1990 and the nation's entry into the World Trade Organisation (WTO) in 2001.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

A user’s query is considered to be an imprecise description of their information need. Automatic query expansion is the process of reformulating the original query with the goal of improving retrieval effectiveness. Many successful query expansion techniques ignore information about the dependencies that exist between words in natural language. However, more recent approaches have demonstrated that by explicitly modeling associations between terms significant improvements in retrieval effectiveness can be achieved over those that ignore these dependencies. State-of-the-art dependency-based approaches have been shown to primarily model syntagmatic associations. Syntagmatic associations infer a likelihood that two terms co-occur more often than by chance. However, structural linguistics relies on both syntagmatic and paradigmatic associations to deduce the meaning of a word. Given the success of dependency-based approaches and the reliance on word meanings in the query formulation process, we argue that modeling both syntagmatic and paradigmatic information in the query expansion process will improve retrieval effectiveness. This article develops and evaluates a new query expansion technique that is based on a formal, corpus-based model of word meaning that models syntagmatic and paradigmatic associations. We demonstrate that when sufficient statistical information exists, as in the case of longer queries, including paradigmatic information alone provides significant improvements in retrieval effectiveness across a wide variety of data sets. More generally, when our new query expansion approach is applied to large-scale web retrieval it demonstrates significant improvements in retrieval effectiveness over a strong baseline system, based on a commercial search engine.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Many successful query expansion techniques ignore information about the term dependencies that exist within natural language. However, researchers have recently demonstrated that consistent and significant improvements in retrieval effectiveness can be achieved by explicitly modelling term dependencies within the query expansion process. This has created an increased interest in dependency-based models. State-of-the-art dependency-based approaches primarily model term associations known within structural linguistics as syntagmatic associations, which are formed when terms co-occur together more often than by chance. However, structural linguistics proposes that the meaning of a word is also dependent on its paradigmatic associations, which are formed between words that can substitute for each other without effecting the acceptability of a sentence. Given the reliance on word meanings when a user formulates their query, our approach takes the novel step of modelling both syntagmatic and paradigmatic associations within the query expansion process based on the (pseudo) relevant documents returned in web search. The results demonstrate that this approach can provide significant improvements in web re- trieval effectiveness when compared to a strong benchmark retrieval system.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The article focuses on how the information seeker makes decisions about relevance. It will employ a novel decision theory based on quantum probabilities. This direction derives from mounting research within the field of cognitive science showing that decision theory based on quantum probabilities is superior to modelling human judgements than standard probability models [2, 1]. By quantum probabilities, we mean decision event space is modelled as vector space rather than the usual Boolean algebra of sets. In this way,incompatible perspectives around a decision can be modelled leading to an interference term which modifies the law of total probability. The interference term is crucial in modifying the probability judgements made by current probabilistic systems so they align better with human judgement. The goal of this article is thus to model the information seeker user as a decision maker. For this purpose, signal detection models will be sketched which are in principle applicable in a wide variety of information seeking scenarios.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

In response to current developments In the tertiary education sector, the Queensland University of Technology Library has mounted an Intensive course - Advanced Information Retrieval Skills - for higher degree students. In determining need for such a course, a survey of postgraduate students and their supervisors was conducted. Results of this survey are discussed and details of the four credit point subjects are outlined.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Tag recommendation is a specific recommendation task for recommending metadata (tag) for a web resource (item) during user annotation process. In this context, sparsity problem refers to situation where tags need to be produced for items with few annotations or for user who tags few items. Most of the state of the art approaches in tag recommendation are rarely evaluated or perform poorly under this situation. This paper presents a combined method for mitigating sparsity problem in tag recommendation by mainly expanding and ranking candidate tags based on similar items’ tags and existing tag ontology. We evaluated the approach on two public social bookmarking datasets. The experiment results show better accuracy for recommendation in sparsity situation over several state of the art methods.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The higher education sector in Australia is under increasing pressure to prove quality and efficacy of education provision, including graduate outcomes. One of the central tasks of higher education has become to prepare nascent professionals as far as possible for initial employment and future working lives beyond this (Boden & Nedeva, 2010). Tertiary educators in the creative arts face significant and distinctive challenges in demonstrating graduate employability, and creative graduates consistently have the poorest outcomes of any subject grouping. In part, this is because the national graduate destinations survey (Graduate Careers Council of Australia, 2012) does not cater to the distinctive ‘portfolio’ nature of creative careers, or take account of the fact that creative careers can take concerted effort over several years to establish (e.g., McCowan & Wyganowska, 2010). However, it is worth asking whether we as tertiary arts educators are doing enough to prepare creative arts students for the world of work, particularly given that the majority of them will be self-employed to some degree (Bureau of Labour Statistics, 2011, Throsby & Zednik, 2010), and will be challenged to build their own careers without recourse to the support of HR departments or intra-firm promotion schemes. It has been demonstrated empirically that career management and creative enterprise skills are among the most important graduate capabilities in determining early creative career success (Bridgstock, 2011), although these skills do not appear in the Learning and Teaching Academic Standards for the Creative and Performing Arts (2010). This paper explores the nature and development of enterprise capabilities for creative arts students (as distinct from students of the business school), examines best practice in the field internationally, and proposes a theoretically-driven creative arts-specific enterprise curriculum model which commences in first year, for demonstrable impact on student enterprise behaviours (such as grant seeking, professional networking and intention to start an enterprise) and employability.