891 resultados para 080704 Information Retrieval and Web Search
Resumo:
Mode of access: Internet.
Resumo:
The approaches to the analysis of various information resources pertinent to user requirements at a semantic level are determined by the thesauruses of the appropriate subject domains. The algorithms of formation and normalization of the multilinguistic thesaurus, and also methods of their comparison are given.
Resumo:
Due to the rapid growth of the number of digital media elements like image, video, audio, graphics on Internet, there is an increasing demand for effective search and retrieval techniques. Recently, many search engines have made image search as an option like Google, AlltheWeb, AltaVista, Freenet. In addition to this, Ditto, Picsearch, can search only the images on Internet. There are also other domain specific search engines available for graphics and clip art, audio, video, educational images, artwork, stock photos, science and nature [www.faganfinder.com/img]. These entire search engines are directory based. They crawls the entire Internet and index all the images in certain categories. They do not display the images in any particular order with respect to the time and context. With the availability of MPEG-7, a standard for describing multimedia content, it is now possible to store the images with its metadata in a structured format. This helps in searching and retrieving the images. The MPEG-7 standard uses XML to describe the content of multimedia information objects. These objects will have metadata information in the form of MPEG-7 or any other similar format associated with them. It can be used in different ways to search the objects. In this paper we propose a system, which can do content based image retrieval on the World Wide Web. It displays the result in user-defined order.
Resumo:
The information architecture supports information retrieval by users in Web environment. The design should be center in the information user, favoring usability. The Faculty of Industrial Engineering and Tourism of the Universidad Central "Marta Abreu" de Las Villas, lacks a site that enhances the disclosure of information to its members. Are presented as objectives of the study: 1) conduct a user survey to identify information needs of users, 2) establish guidelines for information architecture for the institution focused on users, 3) designing the information architecture for the institution and 4) designed to evaluate the proposal. Are presented as objectives of the study: 1) to realize a user study to identify the information needs of users, 2) establish guidelines for information architecture for the institution focused on users, 3) to design the information architecture for the institution and 4) to evaluate the proposal designed. To obtain results are used methods in the theoretical and empirical levels. Besides, are use techniques that favored the design and evaluation. Is designed the intranet of the Faculty of Industrial Engineering and Tourism. Is evaluated the proposed design for the validation of the results.
Resumo:
Many years have passed since Berners-Lee envi- sioned the Web as it should be (1999), but still many information professionals do not know their precise role in its development, especially con- cerning ontologies –considered one of its main elements. Why? May it still be a lack of under- standing between the different academic commu- nities involved (namely, Computer Science, Lin- guistics and Library and Information Science), as reported by Soergel (1999)? The idea behind the Semantic Web is that of several technologies working together to get optimum information re- trieval performance, which is based on proper resource description in a machine-understandable way, by means of metadata and vocabularies (Greenberg, Sutton and Campbell, 2003). This is obviously something that Library and Information Science professionals can do very well, but, are we doing enough? When computer scientists put on stage the ontology paradigm they were asking for semantically richer vocabularies that could support logical inferences in artificial intelligence as a way to improve information retrieval systems. Which direction should vocabulary development take to contribute better to that common goal? The main objective of this paper is twofold: 1) to identify main trends, issues and problems con- cerning ontology research and 2) to identify pos- sible contributions from the Library and Information Science area to the development of ontologies for the semantic web. To do so, our paper has been structured in the following manner. First, the methodology followed in the paper is reported, which is based on a thorough literature review, where main contributions are analysed. Then, the paper presents a discussion of the main trends, issues and problems concerning ontology re- search identified in the literature review. Recom- mendations of possible contributions from the Library and Information Science area to the devel- opment of ontologies for the semantic web are finally presented.
Resumo:
Introduction: Internet users are increasingly using the worldwide web to search for information relating to their health. This situation makes it necessary to create specialized tools capable of supporting users in their searches. Objective: To apply and compare strategies that were developed to investigate the use of the Portuguese version of Medical Subject Headings (MeSH) for constructing an automated classifier for Brazilian Portuguese-language web-based content within or outside of the field of healthcare, focusing on the lay public. Methods: 3658 Brazilian web pages were used to train the classifier and 606 Brazilian web pages were used to validate it. The strategies proposed were constructed using content-based vector methods for text classification, such that Naive Bayes was used for the task of classifying vector patterns with characteristics obtained through the proposed strategies. Results: A strategy named InDeCS was developed specifically to adapt MeSH for the problem that was put forward. This approach achieved better accuracy for this pattern classification task (0.94 sensitivity, specificity and area under the ROC curve). Conclusions: Because of the significant results achieved by InDeCS, this tool has been successfully applied to the Brazilian healthcare search portal known as Busca Saude. Furthermore, it could be shown that MeSH presents important results when used for the task of classifying web-based content focusing on the lay public. It was also possible to show from this study that MeSH was able to map out mutable non-deterministic characteristics of the web. (c) 2010 Elsevier Inc. All rights reserved.
Resumo:
This article discusses issues related to the organization and reception of information in the context of services and public information systems driven by technology. It stems from the assumption that in a ""technologized"" society, the distance between users and information is almost always of cognitive and socio-cultural nature, a product of our effort to design communication. In this context, we favor the approach of the information sign, seeking to answer how a documentary message turns into information, i.e. a structure recognized as socially useful. Observing the structural, cognitive and communicative aspects of the documentary message, based on Documentary Linguistics, Terminology, as well as on Textual Linguistics, the policy of knowledge management and innovation of the Government of the State of Sao Paulo is analyzed, which authorizes the use of Web 2.0, also questioning to what extent this initiative represents innovation in the environment of libraries.
Impact of Commercial Search Engines and International Databases on Engineering Teaching and Research
Resumo:
For the last three decades, the engineering higher education and professional environments have been completely transformed by the "electronic/digital information revolution" that has included the introduction of personal computer, the development of email and world wide web, and broadband Internet connections at home. Herein the writer compares the performances of several digital tools with traditional library resources. While new specialised search engines and open access digital repositories may fill a gap between conventional search engines and traditional references, these should be not be confused with real libraries and international scientific databases that encompass textbooks and peer-reviewed scholarly works. An absence of listing in some Internet search listings, databases and repositories is not an indication of standing. Researchers, engineers and academics should remember these key differences in assessing the quality of bibliographic "research" based solely upon Internet searches.
Resumo:
This paper examines the effects of information request ambiguity and construct incongruence on end user's ability to develop SQL queries with an interactive relational database query language. In this experiment, ambiguity in information requests adversely affected accuracy and efficiency. Incongruities among the information request, the query syntax, and the data representation adversely affected accuracy, efficiency, and confidence. The results for ambiguity suggest that organizations might elicit better query development if end users were sensitized to the nature of ambiguities that could arise in their business contexts. End users could translate natural language queries into pseudo-SQL that could be examined for precision before the queries were developed. The results for incongruence suggest that better query development might ensue if semantic distances could be reduced by giving users data representations and database views that maximize construct congruence for the kinds of queries in typical domains. (C) 2001 Elsevier Science B.V. All rights reserved.
Resumo:
Most Internet search engines are keyword-based. They are not efficient for the queries where geographical location is important, such as finding hotels within an area or close to a place of interest. A natural interface for spatial searching is a map, which can be used not only to display locations of search results but also to assist forming search conditions. A map-based search engine requires a well-designed visual interface that is intuitive to use yet flexible and expressive enough to support various types of spatial queries as well as aspatial queries. Similar to hyperlinks for text and images in an HTML page, spatial objects in a map should support hyperlinks. Such an interface needs to be scalable with the size of the geographical regions and the number of websites it covers. In spite of handling typically a very large amount of spatial data, a map-based search interface should meet the expectation of fast response time for interactive applications. In this paper we discuss general requirements and the design for a new map-based web search interface, focusing on integration with the WWW and visual spatial query interface. A number of current and future research issues are discussed, and a prototype for the University of Queensland is presented. (C) 2001 Published by Elsevier Science Ltd.
Resumo:
With the advent of wearable sensing and mobile technologies, biosignals have seen an increasingly growing number of application areas, leading to the collection of large volumes of data. One of the difficulties in dealing with these data sets, and in the development of automated machine learning systems which use them as input, is the lack of reliable ground truth information. In this paper we present a new web-based platform for visualization, retrieval and annotation of biosignals by non-technical users, aimed at improving the process of ground truth collection for biomedical applications. Moreover, a novel extendable and scalable data representation model and persistency framework is presented. The results of the experimental evaluation with possible users has further confirmed the potential of the presented framework.
Resumo:
Constrained and unconstrained Nonlinear Optimization Problems often appear in many engineering areas. In some of these cases it is not possible to use derivative based optimization methods because the objective function is not known or it is too complex or the objective function is non-smooth. In these cases derivative based methods cannot be used and Direct Search Methods might be the most suitable optimization methods. An Application Programming Interface (API) including some of these methods was implemented using Java Technology. This API can be accessed either by applications running in the same computer where it is installed or, it can be remotely accessed through a LAN or the Internet, using webservices. From the engineering point of view, the information needed from the API is the solution for the provided problem. On the other hand, from the optimization methods researchers’ point of view, not only the solution for the problem is needed. Also additional information about the iterative process is useful, such as: the number of iterations; the value of the solution at each iteration; the stopping criteria, etc. In this paper are presented the features added to the API to allow users to access to the iterative process data.
Resumo:
ABSTRACT - The problem of how to support “intentions to make behavioural changes” (IBC) and “behaviour changes” (BC) in smoking cessation when there is a scarcity of resources is a pressing issue in public health terms. The present research focuses on the use of information and communications technologies and their role in smoking cessation. It is developed in Portugal after the ratification of WHO Framework Convention on Tobacco Control (on 8 November 2005). The prevalence of smokers over fifteen years of age within the population stood at 20.9% (30.9% for men and 11.8% for women). While the strategy of helping people to quit smoking has been emphasised at National Health Service (NHS) level, the uptake of cessation assistance has exceeded the capacity of the service. This induced the search of new theoretical and practical venues to offer alternative options to people willing to stop smoking. Among these, the National Health Plan (NHP) of Portugal (2004-2010), identifies the use of information technologies in smoking cessation. eHealth and the importance of health literacy as a means of empowering people to make behavioural changes is recurrently considered an option worth investigating. The overall objective of this research is to understand, in the Portuguese context, the use of the Internet to help people to stop smoking. Research questions consider factors that may contribute to “intentions to make behavioural changes” (IBC) and “behavioural changes” (BC) while using a Web-Assisted Tobacco Intervention Probe (WATIP). Also consideration is given to the trade-off on the use of the Web as a tool for smoking cessation: can it reach a vast number of people for a small cost (efficiency) demonstrating to work in the domain of smoking cessation (efficacy)”? In addition to the introduction, there is a second chapter in which the use of tobacco is discussed as a public health menace. The health gains achieved by stopping smoking and the means of quitting are also examined, as is the use of the Internet in smoking cessation. Then, several research issues are introduced. These include background theory and the theoretical framework for the Sense of Coherence. The research model is also discussed. A presentation of the methods, materials and of the Web-Assisted Tobacco Intervention Probe (WATIP) follows. In chapter four the results of the use of the Web-Assisted Tobacco Intervention Probe (WATIP) are presented. This study is divided into two sections. The first describes results related to quality control in relation to the Web-Assisted Tobacco Intervention Probe (WATIP) and gives an overview of its users. Of these, 3,150 answered initial eligibility questions. In the end, 1,463 met all eligibility requirements, completed intake, decided on a day to quit smoking (Dday) and declared their “intentions to make behavioural changes” (IBC) while a second targeted group of 650 did not decide on a Dday. With two quit attempts made before joining the platform, most of the participants had experienced past failures while wanting to stop. The smoking rate averaged 21 cigarettes per day. With a mean age of 35, of the participants 55% were males. Among several other considerations, gender and the Sense of Coherence (SOC) influenced the success of participants in their IBC and endeavour to set quit dates. The results of comparing males and females showed that, for current smokers, establishing a Dday was related to gender differences, not favouring males (OR=0.76, p<0.005). Belonging to higher Socio-economic strata (SES) was associated with the intention to consider IBC (when compared to lower SES condition) (OR=1.57, p<0.001) and higher number of school years (OR=0.70, p<0.005) favoured the decision to smoking cessation. Those who demonstrated higher confidence in their likelihood of success in stopping in the shortest time had a higher rate of setting a Dday (OR=0.51, p<0.001). There were differences between groups in IBC reflecting the high and low levels of the SOC score (OR=1.43, p=0.006), as those who considered setting a Dday had higher levels of SOC. After adjusting for all variables, stages of readiness to change and SOC were kept in the model. This is the first Arm of this research where the focus is a discussion of the system’s implications for the participants’ “intentions to make behavioural changes” (IBC). Moreover, a second section of this study (second Arm) offers input collected from 77 in-depth interviews with the Web-Assisted Tobacco Intervention Probe (WATIP) users. Here, “Behaviour Change” (BC) and the usability of the platform are explored a year after IBC was declared. A percentage of 32.9% of self-reported, 12-month quitters in continuous abstinence from smoking from Dday to the 12-month follow- up point of the use of the Web-Assisted Tobacco Intervention Probe (WATIP) has been assessed. Comparing the Sense of Coherence (SOC) scores of participants by their respective means, according to the two groups, there was a significant difference in these scores of non smokers (BC) (M=144,66, SD=22,52) and Sense of Coherence (SOC) of smokers (noBC) (M=131,51, SD=21,43) p=0.014. This WATIP strategy and its contents benefit from the strengthening of the smoker’s sense of coherence (SOC), so that the person’s progress towards a life without tobacco may be experienced as comprehensible, manageable and meaningful. In this sample the sense of coherence (SOC) effect is moderate although it is associated with the day to quit smoking (Dday). Some of the limitations of this research have to do with self-selection bias, sample size (power) and self-reporting (no biochemical validation). The enrolment of participants was therefore not representative of the smoking population. It is not possible to verify the Web-Assisted Tobacco Intervention Probe (WATIP) evaluation of external validity; consequently, the results obtained cannot be applied generalized. No participation bias is provided. Another limitation of this study is the associated limitations of interviews. Interviewees’ perception that fabricating answers could benefit them more than telling the simple truth in response to questions is a risk that is not evaluated (with no external validation like measuring participants’ carbon monoxide levels). What emerges in this analysis is the relevance of the process that leads to the establishment of the quit day (Dday) to stop using tobacco. In addition, technological issues, when tailoring is the focus, are key elements for scrutiny. The high number of dropouts of users of the web platform mandates future research that should concentrate on the matters of the user-centred design of portals. The focus on gains in health through patient-centred care needs more research, so that technology usability be considered within the context of best practices in smoking cessation.
Resumo:
The Smart Drug Search is publicly accessible at http://sing.ei.uvigo.es/sds/. The BIOMedical Search Engine Framework is freely available for non-commercial use at https://github.com/agjacome/biomsef
Resumo:
High throughput genome (HTG) and expressed sequence tag (EST) sequences are currently the most abundant nucleotide sequence classes in the public database. The large volume, high degree of fragmentation and lack of gene structure annotations prevent efficient and effective searches of HTG and EST data for protein sequence homologies by standard search methods. Here, we briefly describe three newly developed resources that should make discovery of interesting genes in these sequence classes easier in the future, especially to biologists not having access to a powerful local bioinformatics environment. trEST and trGEN are regularly regenerated databases of hypothetical protein sequences predicted from EST and HTG sequences, respectively. Hits is a web-based data retrieval and analysis system providing access to precomputed matches between protein sequences (including sequences from trEST and trGEN) and patterns and profiles from Prosite and Pfam. The three resources can be accessed via the Hits home page (http://hits. isb-sib.ch).