23 results for Alignement de phrases
Abstract:
In this paper, we discuss our participation in the INEX 2008 Link-the-Wiki track. We used a sliding-window-based algorithm to extract frequent terms and phrases. Using the extracted phrases and terms as descriptive vectors, the anchors and relevant links (both incoming and outgoing) are recognized efficiently.
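The abstract does not spell out the algorithm, so the following is only a minimal sketch of what sliding-window extraction of frequent terms and phrases could look like. The function name `frequent_phrases`, the window sizes, and the frequency threshold are illustrative assumptions, not details from the paper.

```python
from collections import Counter


def frequent_phrases(text, max_len=3, min_count=2):
    """Count every window of 1..max_len tokens and keep the frequent ones.

    Hypothetical helper: whitespace tokenisation and the default
    parameters are assumptions made for this illustration.
    """
    tokens = text.lower().split()
    counts = Counter()
    for n in range(1, max_len + 1):
        # Slide a window of n tokens across the token sequence.
        for i in range(len(tokens) - n + 1):
            counts[" ".join(tokens[i:i + n])] += 1
    return {phrase: c for phrase, c in counts.items() if c >= min_count}


sample = "link the wiki track uses link the wiki anchors"
phrases = frequent_phrases(sample)
```

The resulting dictionary could then serve as a crude descriptive vector for a page; how the paper actually weights or compares such vectors is not stated in the abstract.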
Abstract:
This paper presents a symbolic navigation system that uses spatial language descriptions to inform goal-directed exploration in unfamiliar office environments. An abstract map is created from a collection of natural language phrases describing the spatial layout of the environment. The spatial representation in the abstract map is controlled by a constraint-based interpretation of each natural language phrase. In goal-directed exploration of an unseen office environment, the robot links the information in the abstract map to observed symbolic information and its grounded world representation. This paper demonstrates the ability of the system, in both simulated and real-world trials, to efficiently find target rooms in environments that it has never visited. In three unexplored environments, it is shown that on average the system travels only 8.42% further than the optimal path when using only natural language phrases to complete navigation tasks.
Abstract:
This research investigates how the world’s leading companies, the Fortune Global 500, use sports-related terms and phrases on their Web sites. An automated process mirrored leading transnational corporations’ Web presence and then searched their sites. Analysis of about four gigabytes of Web-based text revealed regional and industry differences in how the world’s largest corporations use sports terms on their sites.
Abstract:
In this paper, we propose an unsupervised segmentation approach, named "n-gram mutual information", or NGMI, which is used to segment Chinese documents into n-character words or phrases, using language statistics drawn from the Chinese Wikipedia corpus. The approach alleviates the tremendous effort required to prepare and maintain manually segmented Chinese text for training purposes, and to manually maintain ever-expanding lexicons. Previously, mutual information was used to achieve automated segmentation into 2-character words. The NGMI approach extends this to handle longer n-character words. Experiments with heterogeneous documents from the Chinese Wikipedia collection show good results.
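To illustrate the general idea of mutual-information segmentation mentioned above, here is a minimal sketch that scores adjacent character pairs by pointwise mutual information and places a word boundary wherever the score drops below a threshold. This is not the NGMI algorithm itself, whose details the abstract does not give; the toy corpus, the threshold, and the names `pmi_table` and `segment` are all assumptions made for illustration.

```python
import math
from collections import Counter


def pmi_table(corpus):
    """Build a pointwise-mutual-information scorer for adjacent characters."""
    chars = Counter()
    pairs = Counter()
    for line in corpus:
        chars.update(line)
        pairs.update(line[i:i + 2] for i in range(len(line) - 1))
    n_chars = sum(chars.values())
    n_pairs = sum(pairs.values())

    def pmi(a, b):
        if pairs[a + b] == 0:
            return float("-inf")  # unseen pair: always split
        p_ab = pairs[a + b] / n_pairs
        return math.log(p_ab / ((chars[a] / n_chars) * (chars[b] / n_chars)))

    return pmi


def segment(text, pmi, threshold):
    """Join adjacent characters whose PMI reaches the threshold."""
    if not text:
        return []
    words = [text[0]]
    for a, b in zip(text, text[1:]):
        if pmi(a, b) >= threshold:
            words[-1] += b
        else:
            words.append(b)
    return words


# Toy corpus standing in for statistics drawn from a large collection.
corpus = ["北京大学", "北京", "大学", "北京人", "大学生"]
pmi = pmi_table(corpus)
words = segment("北京大学", pmi, threshold=1.5)
```

With real data the statistics would come from a large corpus such as the Chinese Wikipedia, and the threshold would need tuning; both are left open here.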
Abstract:
This paper reports research undertaken as part of a larger project in which we examined whether and how values and beliefs communicated by Australian politicians have shaped decades of health policy and influenced health outcomes for Aboriginal and Torres Strait Islander Peoples of Australia. To first characterise those values and beliefs we analysed the public statements of the politicians responsible nationally for the health of Aboriginal and Torres Strait Islander Peoples 1972–2001, using critical discourse analysis. We found that four discourses, communicated through words, phrases, sentences and grammatical structures, dominated public statements over the study period. These four discourses focused on the competence and capacity of Aboriginal and Torres Strait Islander Peoples to “manage”; matters of control of and responsibility for the health of Aboriginal and Torres Strait Islander Peoples; Aboriginal and Torres Strait Islander Peoples as “Other”; and the nature of the “problem” concerning the health of Aboriginal and Torres Strait Islander Peoples. Analysis of the discursive elements contributing to shaping these four discourses is reported in this paper.
Abstract:
This paper gives an overview of the INEX 2009 Ad Hoc Track. The main goals of the Ad Hoc Track were three-fold. The first goal was to investigate the impact of the collection scale and markup, by using a new collection that is again based on the Wikipedia but is over 4 times larger, with longer articles and additional semantic annotations. For this reason the Ad Hoc Track tasks stayed unchanged, and the Thorough Task of INEX 2002–2006 returns. The second goal was to study the impact of more verbose queries on retrieval effectiveness, by using the available markup as structural constraints (now using both the Wikipedia’s layout-based markup and the enriched semantic markup) and by the use of phrases. The third goal was to compare different result granularities by allowing systems to retrieve XML elements, ranges of XML elements, or arbitrary passages of text. This investigates the value of the internal document structure (as provided by the XML markup) for retrieving relevant information. The INEX 2009 Ad Hoc Track featured four tasks: For the Thorough Task a ranked list of results (elements or passages) by estimated relevance was needed. For the Focused Task a ranked list of non-overlapping results (elements or passages) was needed. For the Relevant in Context Task non-overlapping results (elements or passages) were returned grouped by the article from which they came. For the Best in Context Task a single starting point (element start tag or passage start) for each article was needed. We discuss the setup of the track, and the results for the four tasks.
Abstract:
In this video, phrases sourced from social networking websites are reformatted into a slowly rotating mandala form. As the text changes colour, an abstract, atmospheric soundtrack develops. “In the Beyond” examines how we construct and communicate notions of identity. It mixes language gleaned from social networks and fuses it with a mandala form. Drawing on Zygmunt Bauman’s theoretical work on “liquid modernity”, this work questions how and where we find space for contemplation and reflection in a contemporary context increasingly defined by temporary social bonds and consumer choices.
Abstract:
In this video, phrases float around the screen offering a heartfelt account of a failed romance. Accompanied by a stock music piano soundtrack, the words initially flow at a readable pace. However, as the video progresses, the screen becomes more intensely filled with text so that it cannot be read all at once. This work engages with the signifying processes of romantic clichés. It questions how we communicate grand ideas and emotions using words that seem to have all their sincerity sucked dry. By emphasizing the constructed nature of communication, and by problematizing the process of reading, this work draws attention to the fragmentary and dissociated processes that pervade signifying acts.
Abstract:
In this video, white words and phrases fade quickly in and out amongst small bursting dots of colour. Set to an energetic, synthesised soundtrack, the animated text combines, and sometimes confuses, an internal monologue with dialogue and overheard conversations. The unfolding narrative follows an unnamed narrator through a crowded social event. By visually and textually mixing self-conscious reflections with polite conversations, “Mingling” explores the social niceties and nervous behaviours that often inform our interpersonal experiences. Through its fast-paced and disjointed rendering of verbal communication, the work playfully draws out the sometimes-awkward and uncomfortable dialogues that exist between personal and social, internal and external, imagined and actual. This work was commissioned by the Museum of Contemporary Art, Australia (Sydney), to celebrate the organisation’s major redevelopment and acknowledge the generosity of individuals and organisations who supported the capital project.
Abstract:
“Spin” borrows idioms and metaphors from sports commentary and squeezes them into a single emotional rollercoaster. Accompanied by a driving soundtrack, text appears and disappears one word at a time. As the work progresses, multiple words fade in and out at the same time, filling the screen and testing our ability to read and assimilate these well-worn phrases. On the one hand, the work mimes some of what we enjoy about sport – its ability to take us to another place, to incite passion and emotion, and to enable us to share in common experiences, goals and desires. On the other hand, it plays up the hyperbolic language often associated with sports broadcasting. The very language that helps take us to another place, incite passion and make us feel part of something bigger than ourselves, is pushed to its extreme and starts to burst at the seams. This work was commissioned for “Kick Off: contemporary video art program” at Metricon Stadium, Gold Coast, and supported by Project Services, Department of Public Works, Queensland Government.
Abstract:
In order to promote green building practice in Australia, the Green Building Council of Australia (GBCA) launched the Green Star rating tools for various types of buildings built since 2003. Of these, the Green Star-Education rating tool addresses sustainability issues during the design and construction phases of education facility development. It covers a number of categories, including Management, Indoor Environment Quality, Energy, Transport, Water, Materials, Land Use & Ecology, Emissions and Innovation. This paper reviews the use of the Green Star system in Australian education facilities construction and the potential challenges associated with Green Star-Education implementation. Score sheets of 34 education projects across Australia that achieved Green Star certification were collected and analysed. The percentages of Green Star points obtained within each category and sub-category (credits) for each project were analysed to illustrate the achievement of credits. The results show that management-related credits are the easiest to obtain and ecology-related credits the most difficult. The study also indicated that 6 Green Star education projects obtained particularly high percentages in the Innovation category. The investigation of points obtained in each category provides prospective Green Star applicants with insights into credit achievement for future projects.
Abstract:
“Tranquility Falls” depicts a computer-generated waterfall set to sentimental stock music. As the water gushes, text borrowed from a popular talk show host’s self-help advice fades in and out graphically down the screen. As the animated phrases increase in tempo, the sounds of the waterfall begin to overwhelm the tender music. By creating overtly fabricated sensations of inspiration and awe, the work questions how and where we experience contemplation, wonderment and guidance in a contemporary context. “Tranquility Falls” contributes to studies in the field of contemporary art. It is particularly concerned with representations of spirituality and nature. These have been important themes in art practice for some time. For example, artists such as Olafur Eliasson and James Turrell have created artificial insertions in nature in order to question contemporary experiences of the natural environment. Other artists such as Nam June Paik have more directly addressed the changing relationship between spirituality and popular culture. Using a practice-led research methodology, “Tranquility Falls” extends these creative inquiries. By presenting an overtly synthetic but strangely evocative pun on a ‘fountain of knowledge’, it questions whether we are informed less by traditional engagements with organised religions and natural wonder, and are instead increasingly reliant on the mechanisms of popular culture for moments of insight and reflection. “Tranquility Falls” has been exhibited internationally at LA Louver Gallery, Venice, California in 2013 and nationally with GBK as part of Art Month Sydney, also in 2013. It has been critically reviewed in The Los Angeles Times.
Abstract:
For humans and robots to communicate using natural language, it is necessary for the robots to develop concepts and associated terms that correspond to the human use of words. Time and space are foundational concepts in human language, and to develop a set of words that correspond to human notions of time and space, it is necessary to take into account the way that they are used in natural human conversations, where terms and phrases such as 'soon', 'in a while', or 'near' are often used. We present language learning robots called Lingodroids that can learn and use simple terms for time and space. In previous work, the Lingodroids were able to learn terms for space. In this work we extend their abilities by adding temporal variables which allow them to learn terms for time. The robots build their own maps of the world and interact socially to form a shared lexicon for location and duration terms. The robots successfully use the shared lexicons to communicate places and times to meet again.
Abstract:
Guaranteeing the quality of extracted features that describe relevant knowledge to users or topics is a challenge because of the large number of extracted features. Most popular term-based feature selection methods extract noisy features that are irrelevant to user needs. One popular method is to extract phrases or n-grams to describe the relevant knowledge. However, extracted n-grams and phrases usually contain a lot of noise. This paper proposes a method for reducing the noise in n-grams. The method first extracts more specific features (terms) to remove noisy features. The method then uses an extended random set to accurately weight n-grams based on their distribution in the documents and the distribution of their terms in n-grams. The proposed approach not only reduces the number of extracted n-grams but also improves the performance. The experimental results on the Reuters Corpus Volume 1 (RCV1) data collection and TREC topics show that the proposed method significantly outperforms the state-of-the-art methods underpinned by Okapi BM25, tf*idf and Rocchio.