10 results for Query expansion, Text mining, Information retrieval, Chinese IR

in DigitalCommons@The Texas Medical Center


Relevance:

100.00%

Abstract:

OBJECTIVE: To characterize PubMed usage over a typical day and compare it to previous studies of user behavior on Web search engines. DESIGN: We performed a lexical and semantic analysis of 2,689,166 queries issued on PubMed over 24 consecutive hours on a typical day. MEASUREMENTS: We measured the number of queries, number of distinct users, queries per user, terms per query, common terms, Boolean operator use, common phrases, result set size, MeSH categories, used semantic measurements to group queries into sessions, and studied the addition and removal of terms from consecutive queries to gauge search strategies. RESULTS: The size of the result sets from a sample of queries showed a bimodal distribution, with peaks at approximately 3 and 100 results, suggesting that a large group of queries was tightly focused and another was broad. Like Web search engine sessions, most PubMed sessions consisted of a single query. However, PubMed queries contained more terms. CONCLUSION: PubMed's usage profile should be considered when educating users, building user interfaces, and developing future biomedical information retrieval systems.
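The per-user and per-query measurements listed in this abstract can be illustrated with a minimal Python sketch. It is not the study's method: the log format `(user_id, timestamp_seconds, query_text)`, the function name `summarize_queries`, and the 30-minute inactivity gap used to split sessions are all assumptions made for illustration (the abstract says sessions were grouped with semantic measurements, which this sketch does not attempt).

```python
from collections import defaultdict
from statistics import mean


def summarize_queries(log, session_gap=1800):
    """Summarize a query log of (user_id, timestamp_seconds, query_text) tuples.

    Sessions are split on an inactivity gap (an assumption of this sketch,
    not the semantic grouping described in the abstract).
    """
    by_user = defaultdict(list)
    for user, ts, query in log:
        by_user[user].append((ts, query))

    sessions = []  # each session is a list of query strings
    for events in by_user.values():
        events.sort()  # order each user's queries by timestamp
        current = [events[0][1]]
        for (prev_ts, _), (ts, query) in zip(events, events[1:]):
            if ts - prev_ts > session_gap:
                sessions.append(current)
                current = []
            current.append(query)
        sessions.append(current)

    n_queries = sum(len(events) for events in by_user.values())
    return {
        "queries": n_queries,
        "users": len(by_user),
        "queries_per_user": n_queries / len(by_user),
        "mean_terms_per_query": mean(
            len(q.split()) for s in sessions for q in s
        ),
        "single_query_session_share": sum(len(s) == 1 for s in sessions)
        / len(sessions),
    }
```

Splitting terms on whitespace is the simplest possible tokenization; a real analysis would need to handle quoted phrases, Boolean operators, and field tags separately.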

Relevance:

100.00%

Abstract:

OBJECTIVE: To determine whether algorithms developed for the World Wide Web can be applied to the biomedical literature in order to identify articles that are important as well as relevant. DESIGN AND MEASUREMENTS: A direct comparison of eight algorithms: simple PubMed queries, clinical queries (sensitive and specific versions), vector cosine comparison, citation count, journal impact factor, PageRank, and machine learning based on polynomial support vector machines. The objective was to prioritize important articles, defined as being included in a pre-existing bibliography of important literature in surgical oncology. RESULTS: Citation-based algorithms were more effective than noncitation-based algorithms at identifying important articles. The most effective strategies were simple citation count and PageRank, which on average identified over six important articles in the first 100 results compared to 0.85 for the best noncitation-based algorithm (p < 0.001). The authors saw similar differences between citation-based and noncitation-based algorithms at 10, 20, 50, 200, 500, and 1,000 results (p < 0.001). Citation lag affects performance of PageRank more than simple citation count. However, in spite of citation lag, citation-based algorithms remain more effective than noncitation-based algorithms. CONCLUSION: Algorithms that have proved successful on the World Wide Web can be applied to biomedical information retrieval. Citation-based algorithms can help identify important articles within large sets of relevant results. Further studies are needed to determine whether citation-based algorithms can effectively meet actual user information needs.
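As a rough illustration of the two citation-based strategies named in this abstract, the sketch below ranks a toy citation graph by simple citation count and by PageRank, then counts how many "important" articles land in the top k. The graph representation, the function names, the damping factor of 0.85, and the fixed iteration count are assumptions of this sketch, not details taken from the study.

```python
def all_articles(citations):
    """Every article that cites or is cited in the graph."""
    return set(citations) | {c for cited in citations.values() for c in cited}


def citation_count(citations):
    """Number of incoming citations per article."""
    counts = {article: 0 for article in all_articles(citations)}
    for cited in citations.values():
        for dst in cited:
            counts[dst] += 1
    return counts


def pagerank(citations, damping=0.85, iterations=50):
    """Power-iteration PageRank; `citations` maps article -> articles it cites."""
    nodes = all_articles(citations)
    n = len(nodes)
    rank = {node: 1.0 / n for node in nodes}
    for _ in range(iterations):
        # Rank held by articles that cite nothing is spread evenly over all nodes.
        dangling = sum(rank[v] for v in nodes if not citations.get(v))
        new_rank = {v: (1 - damping) / n + damping * dangling / n for v in nodes}
        for src, cited in citations.items():
            for dst in cited:
                new_rank[dst] += damping * rank[src] / len(cited)
        rank = new_rank
    return rank


def important_in_top_k(scores, important, k):
    """How many articles from the `important` set appear in the top k by score."""
    top = sorted(scores, key=scores.get, reverse=True)[:k]
    return sum(1 for article in top if article in important)
```

The helper `important_in_top_k` mirrors the kind of comparison the abstract reports (important articles found within the first 10, 20, 50, 100, ... results), applied here to whichever score dictionary is passed in.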

Relevance:

100.00%

Abstract:

Early Employee Assistance Programs (EAPs) had their origin in humanitarian motives, and there was little concern for their cost/benefit ratios; however, as some programs began accumulating data and analyzing it over time, even with single variables such as absenteeism, it became apparent that the humanitarian reasons for a program could be reinforced by cost savings, particularly when the existence of the program was subject to justification. Today there is general agreement that cost/benefit analyses of EAPs are desirable, but the specific models for such analyses, particularly those making use of sophisticated but simple computer-based data management systems, are few. The purpose of this research and development project was to develop a method, a design, and a prototype for gathering, managing, and presenting information about EAPs. This scheme provides information retrieval and analyses relevant to such aspects of EAP operations as: (1) EAP personnel activities, (2) supervisory training effectiveness, (3) client population demographics, (4) assessment and referral effectiveness, (5) treatment network efficacy, and (6) economic worth of the EAP. This scheme has been implemented and made operational at The University of Texas Employee Assistance Programs for more than three years. Application of the scheme in the various programs has defined certain variables which remained necessary in all programs. Depending on the degree of aggressiveness for data acquisition maintained by program personnel, other program-specific variables are also defined.

Relevance:

100.00%

Abstract:

Background: During the orientation process, new students are often inundated with manuals, maps, and other materials essential to their success as students. The experience can leave students feeling overwhelmed, unable to sift through the substantial amount of information that has been given to them. Wikis, in contrast, are well-suited for facilitating user interaction with vast amounts of diverse information. [See PDF for complete abstract]

Relevance:

40.00%

Abstract:

OBJECTIVES: To determine the characteristics of popular breast cancer related websites and whether more popular sites are of higher quality. DESIGN: The search engine Google was used to generate a list of websites about breast cancer. Google ranks search results by measures of link popularity, that is, the number of links to a site from other sites. The top 200 sites returned in response to the query "breast cancer" were divided into "more popular" and "less popular" subgroups by three different measures of link popularity: Google rank and number of links reported independently by Google and by AltaVista (another search engine). MAIN OUTCOME MEASURES: Type and quality of content. RESULTS: More popular sites according to Google rank were more likely than less popular ones to contain information on ongoing clinical trials (27% v 12%, P=0.01), results of trials (12% v 3%, P=0.02), and opportunities for psychosocial adjustment (48% v 23%, P<0.01). These characteristics were also associated with higher number of links as reported by Google and AltaVista. More popular sites by number of linking sites were also more likely to provide updates on other breast cancer research, information on legislation and advocacy, and a message board service. Measures of quality such as display of authorship, attribution or references, currency of information, and disclosure did not differ between groups. CONCLUSIONS: Popularity of websites is associated with type rather than quality of content. Sites that include content correlated with popularity may best meet the public's desire for information about breast cancer.

Relevance:

40.00%

Abstract:

Technological and cultural factors influence access to health information on the web in multifarious ways. We evaluated structural differences and availability of communication services on the web in three diverse language and cultural groups: Chinese, English, and Spanish. A total of 382 web sites were analyzed: 144 were English language sites (38%), 129 were Chinese language sites (34%), and 108 were Spanish language sites (28%). We did not find technical differences in the number of outgoing links per domain or the total availability of communication services between the three groups. There were differences in the distribution of available services between Chinese and English sites. In the Chinese sites, there were more communication services between consumers and health experts. Our results suggest that the health-related web presence of these three cultural groups is technologically comparable, but reflects differences that may be attributable to cultural factors.

Relevance:

40.00%

Abstract:

People often use tools to search for information. In order to improve the quality of an information search, it is important to understand how internal information, which is stored in the user’s mind, and external information, represented by the interface of tools, interact with each other. How information is distributed between internal and external representations significantly affects information search performance. However, few studies have examined the relationship between types of interface and types of search task in the context of information search. For a distributed information search task, how data are distributed, represented, and formatted significantly affects the user's search performance in terms of response time and accuracy. Guided by UFuRT (User, Function, Representation, Task), a human-centered process, I propose a search model and a task taxonomy. The model defines its relationship with other existing information models. The taxonomy clarifies the legitimate operations for each type of search task of relational data. Based on the model and taxonomy, I have also developed interface prototypes for the search tasks of relational data. These prototypes were used for experiments. The experiments described in this study are of a within-subject design with a sample of 24 participants recruited from the graduate schools located in the Texas Medical Center. Participants performed one-dimensional nominal search tasks over nominal, ordinal, and ratio displays, and searched one-dimensional nominal, ordinal, interval, and ratio tasks over table and graph displays. Participants also performed the same task and display combination for two-dimensional searches. Distributed cognition theory has been adopted as a theoretical framework for analyzing and predicting the search performance of relational data. It has been shown that the representation dimensions and data scales, as well as the search task types, are main factors in determining search efficiency and effectiveness. In particular, the more external representations used, the better the search task performance, and the results suggest the ideal search performance occurs when the question type and corresponding data scale representation match. The implications of the study lie in contributing to the effective design of search interfaces for relational data, especially laboratory results, which are often used in healthcare activities.

Relevance:

40.00%

Abstract:

Resistance of tumors to pharmacologic agents poses a significant problem in the treatment of human malignancies. This study provides an overview of the scope of clinical resistance and focuses upon current research into the phenomenon of multidrug resistance (MDR). The objective of this investigation was to determine whether gene amplification had a role in the development of the MDR phenotype in Chinese hamster ovary cells (CHO) primarily selected for resistance to vincristine (VCR). A DNA fragment, previously shown to be amplified in two independently derived Chinese hamster cell lines exhibiting the MDR phenotype, was also amplified in VCR hamster lines. Sequences flanking this fragment were shown to contain coding information for a 4.3 kb transcript overproduced in VCR cells. These sequences were not enriched in double minute DNA preparations isolated from VCR cells. There was an approximately forty-fold increase in both the level of gene amplification and transcript overproduction in the VCR cell lines, independent of the level of primary resistance. This DNA amplification and overproduction of the 4.3 kb transcript was also demonstrated in CHO cells independently selected for resistance to Adriamycin and vinblastine. All the DNA sequences of two hamster cDNA clones containing 785 and 932 base pair inserts showed direct homology to the published mouse mdr sequences (about 90%). This sequence conservation held for only portions of the gene when the human mdr1 sequences were compared with those from either the mouse or hamster. Somatic cell hybrids, constructed between VCR CHO cells and sensitive murine cells, were used to determine whether there was a functional relationship between the chromosome bearing the amplified sequences and the MDR phenotype. Concordant segregation between vincristine resistance, the MDR phenotype, the presence of MDR-associated amplified sequences, overexpression of the mRNA encoded by these sequences, and CHO chromosome Z1 was consistent with the hypothesis that there is an amplified gene on chromosome Z1 of the VCR CHO cells which is responsible for MDR in these cells.

Relevance:

40.00%

Abstract:

Academic and industrial research in the late 90s has brought about an exponential explosion of DNA sequence data. Automated expert systems are being created to help biologists extract patterns, trends and links from this ever-deepening ocean of information. Two such systems aimed at retrieving and subsequently utilizing phylogenetically relevant information have been developed in this dissertation, the major objective of which was to automate the often difficult and confusing phylogenetic reconstruction process. Popular phylogenetic reconstruction methods, such as distance-based methods, attempt to find an optimal tree topology (that reflects the relationships among related sequences and their evolutionary history) by searching through the topology space. Various compromises between the fast (but incomplete) and exhaustive (but computationally prohibitive) search heuristics have been suggested. An intelligent compromise algorithm that relies on a flexible “beam” search principle from the Artificial Intelligence domain and uses the pre-computed local topology reliability information to adjust the beam search space continuously is described in the second chapter of this dissertation. However, sometimes even a (virtually) complete distance-based method is inferior to the significantly more elaborate (and computationally expensive) maximum likelihood (ML) method. In fact, depending on the nature of the sequence data in question either method might prove to be superior. Therefore, it is difficult (even for an expert) to tell a priori which phylogenetic reconstruction method, whether distance-based, ML, or maybe maximum parsimony (MP), should be chosen for any particular data set. A number of factors, often hidden, influence the performance of a method. For example, it is generally understood that for a phylogenetically “difficult” data set more sophisticated methods (e.g., ML) tend to be more effective and thus should be chosen. However, it is the interplay of many factors that one needs to consider in order to avoid choosing an inferior method (potentially a costly mistake, both in terms of computational expenses and in terms of reconstruction accuracy). Chapter III of this dissertation details a phylogenetic reconstruction expert system that automatically selects the proper method. It uses a classifier (a Decision Tree-inducing algorithm) to map a new data set to the proper phylogenetic reconstruction method.
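The "beam" search principle mentioned in this abstract can be sketched generically in Python. This is a plain fixed-width beam search on a toy path-building problem, not the dissertation's algorithm: the abstract describes a beam that is adjusted continuously using local topology reliability, which is not reproduced here. The function names, the distance matrix, and the beam width are hypothetical.

```python
import heapq


def beam_search(start, expand, score, is_complete, beam_width=3):
    """Keep only the `beam_width` lowest-scoring partial solutions per step."""
    beam = [start]
    while not all(is_complete(state) for state in beam):
        candidates = []
        for state in beam:
            if is_complete(state):
                candidates.append(state)
            else:
                candidates.extend(expand(state))
        beam = heapq.nsmallest(beam_width, candidates, key=score)
    return min(beam, key=score)


# Toy problem (hypothetical data): order four items, starting at item 0,
# so that the summed distance between consecutive items is small.
DIST = [
    [0, 2, 9, 10],
    [2, 0, 6, 4],
    [9, 6, 0, 3],
    [10, 4, 3, 0],
]


def path_length(path):
    return sum(DIST[a][b] for a, b in zip(path, path[1:]))


def extend_path(path):
    return [path + (j,) for j in range(len(DIST)) if j not in path]


best = beam_search(
    start=(0,),
    expand=extend_path,
    score=path_length,
    is_complete=lambda path: len(path) == len(DIST),
    beam_width=2,
)
```

With `beam_width=1` the search degenerates to a greedy heuristic (fast but incomplete), and with an unbounded width it becomes exhaustive; intermediate widths are the kind of compromise the abstract refers to.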

Relevance:

40.00%

Abstract:

Despite the proven efficacy of the Pap test, Asian populations still have low Pap screening compliance. The purpose of this dissertation was to investigate factors that influence women's decision to obtain a Pap test, and to describe the development and evaluation of a cervical cancer educational program promoting Pap screening behavior among women in Taiwan. The first study examined factors associated with Pap screening compliance. Psychometric properties of measurement instruments were also assessed. The scale reliabilities were as follows: Cronbach's alpha 0.70 for the knowledge scale, 0.88 for the pros scale, 0.68 for the cons scale, and 0.72 for the perceived norms scale. Results from multiple logistic regression analysis, after adjusting for marital status, showed that women who were compliant with Pap screening guidelines had significantly higher knowledge, higher perceived benefits (pros), lower perceived barriers (cons), and higher perceived norms to receive a Pap test. The second study described the development of a program called “Love yourself before you take care of your family”, designed to increase Pap screening behavior among women in Taiwan. The development of this program was guided by Intervention Mapping (IM), an innovative process of intervention design. The program used methods such as information transmission, modeling, persuasion, and facilitation. Strategies included direct mail campaigns, role model stories with women's testimonials, and phone intervention. The third study examined the effectiveness of the carefully designed intervention in a randomized trial (N = 424). Participants were female family members of inpatients admitted to one of the major teaching hospitals in Taiwan during August and September 1999. Women in the intervention group reported a higher rate of receiving a Pap test than women in the control group (50% versus 32%) after a three-month intervention (p = 0.002). Women in the intervention group showed increased knowledge (p = 0.016), perceived pros (p = 0.008), and susceptibility (p = 0.011) between baseline and follow-up. They also showed higher perceived pros of Pap tests than women in the control group at follow-up (p = 0.031). This result suggested that program development based on theory and evidence could maximize the intervention impact for a specific target population.
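The scale reliabilities in this abstract are reported as Cronbach's alpha. As a reminder of how that coefficient is computed, here is a minimal Python sketch using the standard formula alpha = k/(k-1) * (1 - sum of item variances / variance of total scores). The function name and the response matrix layout (one row per respondent, one column per item) are assumptions of this sketch; no data from the study is used.

```python
from statistics import variance


def cronbach_alpha(responses):
    """Cronbach's alpha for a list of respondent rows, one score per item.

    Uses sample variances throughout; requires at least two items and
    at least two respondents whose total scores are not all identical.
    """
    k = len(responses[0])  # number of items in the scale
    items = list(zip(*responses))  # transpose to one tuple per item
    item_variance_sum = sum(variance(item) for item in items)
    total_variance = variance([sum(row) for row in responses])
    return k / (k - 1) * (1 - item_variance_sum / total_variance)
```

Values near 1 indicate that the items vary together across respondents; a common rule of thumb treats roughly 0.70 and above as acceptable, which is the range of most of the scales reported above.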