168 resultados para Ranking de diversificação


Relevância:

10.00% 10.00%

Publicador:

Resumo:

The presence of spam in a document ranking is a major issue for Web search engines. Common approaches that cope with spam remove from the document rankings those pages that are likely to contain spam. These approaches are implemented as post-retrieval processes, that filter out spam pages only after documents have been retrieved with respect to a user’s query. In this paper we suggest to remove spam pages at indexing time, therefore obtaining a pruned index that is virtually “spam-free”. We investigate the benefits of this approach from three points of view: indexing time, index size, and retrieval performances. Not surprisingly, we found that the strategy decreases both the time required by the indexing process and the space required for storing the index. Surprisingly instead, we found that by considering a spam-pruned version of a collection’s index, no difference in retrieval performance is found when compared to that obtained by traditional post-retrieval spam filtering approaches.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

In this paper we define two models of users that require diversity in search results; these models are theoretically grounded in the notion of intrinsic and extrinsic diversity. We then examine Intent-Aware Expected Reciprocal Rank (ERR-IA), one of the official measures used to assess diversity in TREC 2011-12, with respect to the proposed user models. By analyzing ranking preferences as expressed by the user models and those estimated by ERR-IA, we investigate whether ERR-IA assesses document rankings according to the requirements of the diversity retrieval task expressed by the two models. Empirical results demonstrate that ERR-IA neglects query-intents coverage by attributing excessive importance to redundant relevant documents. ERR-IA behavior is contrary to the user models that require measures to first assess diversity through the coverage of intents, and then assess the redundancy of relevant intents. Furthermore, diversity should be considered separately from document relevance and the documents positions in the ranking.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Novelty-biased cumulative gain (α-NDCG) has become the de facto measure within the information retrieval (IR) community for evaluating retrieval systems in the context of sub-topic retrieval. Setting the incorrect value of parameter α in α-NDCG prevents the measure from behaving as desired in particular circumstances. In fact, when α is set according to common practice (i.e. α = 0.5), the measure favours systems that promote redundant relevant sub-topics rather than provide novel relevant ones. Recognising this characteristic of the measure is important because it affects the comparison and the ranking of retrieval systems. We propose an approach to overcome this problem by defining a safe threshold for the value of α on a query basis. Moreover, we study its impact on system rankings through a comprehensive simulation.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

In this paper we describe the approaches adopted to generate the runs submitted to ImageCLEFPhoto 2009 with an aim to promote document diversity in the rankings. Four of our runs are text based approaches that employ textual statistics extracted from the captions of images, i.e. MMR [1] as a state of the art method for result diversification, two approaches that combine relevance information and clustering techniques, and an instantiation of Quantum Probability Ranking Principle. The fifth run exploits visual features of the provided images to re-rank the initial results by means of Factor Analysis. The results reveal that our methods based on only text captions consistently improve the performance of the respective baselines, while the approach that combines visual features with textual statistics shows lower levels of improvements.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

PURPOSE To examine correlates and consequences of parents' encouragement of girls' physical activity (PA) for weight loss (ENCLOSS). METHODS Data were collected for 181 girls, mothers and fathers when girls were 9, 11, and 13 years old. Mothers and fathers completed a self-report questionnaire of ENCLOSS (e.g., “I have talked to my daughter about how to exercise to lose weight”). Correlates of ENCLOSS that were assessed include girls' Body Mass Index (BMI) z-score and parents' modeling of and logistic support for PA. Dependent variables assessed at age 13 include girls' self-reported and objectively-measured PA, enjoyment of physical activity, and weight concerns. Associations between ENCLOSS, girls' BMI, and parent's support for PA were assessed using spearman rank correlations. To examine links between ENCLOSS and the outcome variables, scores for ENCLOSS were divided into tertiles at each age. Three groups were created including girls who were in the highest tertile at each age (high ENCLOSS), girls who were in the lowest tertile at each age (low ENCLOSS), and girls who varied in their tertile ranking (mid ENCLOSS). Group differences in the outcome variables were assessed using regression analysis (referent group: low ENCLOSS), controlling for girls' BMI and the outcome variable at age 9. RESULTS Girls' with higher BMI had mothers and fathers who reported higher ENCLOSS (r = .61-. 69, p<. 0001). Parents'reports of ENCLOSS were not associated with modeling of or logistic support for PA. Girls in the high ENCLOSS group reported significantly lower enjoyment of PA and higher weight concerns at age 13, independent of covariates. No differences in PA were noted. CONCLUSION Parents who encourage their daughters to be active for weight loss do not model PA or facilitate girls' PA. Persistent encouragement of PA for weight loss may lead to low enjoyment of PA and higher weight concerns among adolescent girls.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

INEX investigates focused retrieval from structured documents by providing large test collections of structured documents, uniform evaluation measures, and a forum for organizations to compare their results. This paper reports on the INEX 2008 evaluation campaign, which consisted of a wide range of tracks: Ad hoc, Book, Efficiency, Entity Ranking, Interactive, QA, Link the Wiki, and XML Mining.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Indians tend to have lower lean body mass than other ethnic groups which increases the risk of chronic diseases. Three complementary studies included in this thesis advanced knowledge on determinants of lean body mass in Indians and the techniques to measure it. The first study examined the determinants of lean body mass in young Indian adults and highlighted the importance of diet and physical activity for development of lean body mass. This study has important implications for policy on prevention of chronic diseases in India. The other two studies helped refinement of the techniques of lean body mass measurement and are expected to facilitate future research in this area. The thesis is presented in the form of publications in high ranking journals.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Age-related Macular Degeneration (AMD) is one of the major causes of vision loss and blindness in ageing population. Currently, there is no cure for AMD, however early detection and subsequent treatment may prevent the severe vision loss or slow the progression of the disease. AMD can be classified into two types: dry and wet AMDs. The people with macular degeneration are mostly affected by dry AMD. Early symptoms of AMD are formation of drusen and yellow pigmentation. These lesions are identified by manual inspection of fundus images by the ophthalmologists. It is a time consuming, tiresome process, and hence an automated diagnosis of AMD screening tool can aid clinicians in their diagnosis significantly. This study proposes an automated dry AMD detection system using various entropies (Shannon, Kapur, Renyi and Yager), Higher Order Spectra (HOS) bispectra features, Fractional Dimension (FD), and Gabor wavelet features extracted from greyscale fundus images. The features are ranked using t-test, Kullback–Lieber Divergence (KLD), Chernoff Bound and Bhattacharyya Distance (CBBD), Receiver Operating Characteristics (ROC) curve-based and Wilcoxon ranking methods in order to select optimum features and classified into normal and AMD classes using Naive Bayes (NB), k-Nearest Neighbour (k-NN), Probabilistic Neural Network (PNN), Decision Tree (DT) and Support Vector Machine (SVM) classifiers. The performance of the proposed system is evaluated using private (Kasturba Medical Hospital, Manipal, India), Automated Retinal Image Analysis (ARIA) and STructured Analysis of the Retina (STARE) datasets. The proposed system yielded the highest average classification accuracies of 90.19%, 95.07% and 95% with 42, 54 and 38 optimal ranked features using SVM classifier for private, ARIA and STARE datasets respectively. This automated AMD detection system can be used for mass fundus image screening and aid clinicians by making better use of their expertise on selected images that require further examination.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Age-related macular degeneration (AMD) affects the central vision and subsequently may lead to visual loss in people over 60 years of age. There is no permanent cure for AMD, but early detection and successive treatment may improve the visual acuity. AMD is mainly classified into dry and wet type; however, dry AMD is more common in aging population. AMD is characterized by drusen, yellow pigmentation, and neovascularization. These lesions are examined through visual inspection of retinal fundus images by ophthalmologists. It is laborious, time-consuming, and resource-intensive. Hence, in this study, we have proposed an automated AMD detection system using discrete wavelet transform (DWT) and feature ranking strategies. The first four-order statistical moments (mean, variance, skewness, and kurtosis), energy, entropy, and Gini index-based features are extracted from DWT coefficients. We have used five (t test, Kullback–Lieber Divergence (KLD), Chernoff Bound and Bhattacharyya Distance, receiver operating characteristics curve-based, and Wilcoxon) feature ranking strategies to identify optimal feature set. A set of supervised classifiers namely support vector machine (SVM), decision tree, k -nearest neighbor ( k -NN), Naive Bayes, and probabilistic neural network were used to evaluate the highest performance measure using minimum number of features in classifying normal and dry AMD classes. The proposed framework obtained an average accuracy of 93.70 %, sensitivity of 91.11 %, and specificity of 96.30 % using KLD ranking and SVM classifier. We have also formulated an AMD Risk Index using selected features to classify the normal and dry AMD classes using one number. The proposed system can be used to assist the clinicians and also for mass AMD screening programs.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Technical dinitrotoluene (DNT) is a mixture of 2,4- and 2,6-DNT. In humans, industrial or environmental exposure can occur orally, by inhalation, or by skin contact. The classification of DNT as an 'animal carcinogen' is based on the formation of malignant tumors in kidneys, liver, and mammary glands of rats and mice. Clear signs of toxic nephropathy were found in rats dosed with DNT, and the concept was derived of an interrelation between renal toxicity and carcinogenicity. Recent data point to the carcinogenicity of DNT on the urinary tract of exposed humans. Between 1984 and 1997, 6 cases of urothelial cancer and 14 cases of renal cell cancer were diagnosed in a group of 500 underground mining workers in the copper mining industry of the former GDR and having high exposures to explosives containing technical DNT. The incidences of both urothelial and renal cell tumors in this group were 4.5 and 14.3 times higher, respectively, than anticipated on the basis of the cancer registers of the GDR. The genotyping of all identified tumor patients for the polymorphic enzymes NAT2, GSTM1, and GSTT1 identified the urothelial tumor cases as exclusively 'slow acetylates'. A group of 161 miners highly exposed to DNT was investigated for signs of subclinical renal damage. The exposures were categorized semi-quantitatively into 'low', 'medium', 'high', and 'very high'. A straight dose-dependence of the excretion of urinary biomarker proteins with the ranking of exposure was seen. Biomarker excretion (alpha1-microglobulin, glutathione S-transferases alpha and pi) indicated that DNT-induced damage was directed toward the tubular system. New data on DNT-exposed humans appear consistent with the concept of cancer initiation by DNT isomers and the subsequent promotion of renal carcinogenesis by selective damage to the proximal tubule. The differential pathways of metabolic activation of DNT appear to apply to the proximal tubule of the kidney and to the urothelium of the renal pelvis and lower urinary tract as target tissues of carcinogenicity.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

A cohort of 161 underground miners who had been highly exposed to dinitrotoluene (DNT) in the copper-mining industry of the former German Democratic Republic was reinvestigated for signs of subclinical renal damage. The study included a screening of urinary proteins excreted by sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS-PAGE), and quantitations of the specific urinary proteins α 1-microglobulin and glutathione-S-transferase α (GST α) as biomarkers for damage of the proximal tubule and glutathione-S-transferase π (GST π) for damage of the distal tubule. The exposures were categorized semiquantitatively (low, medium, high, and very high), according to the type and duration of professional contact with DNT. A straight dose-dependence of pathological protein excretion patterns with the semiquantitative ranking of DNT exposure was seen. Most of the previously reported cancer cases of the urinary tract, especially those in the higher exposed groups, were confined to pathological urinary protein excretion patterns. The damage from DNT was directed toward the tubular system. In many cases, the appearance of Tamm-Horsfall protein, a 105-kD protein marker, was noted. Data on the biomarkers α 1-microglobulin, GST α, and GST π consistently demonstrated a dose-dependent increase in tubular damage, which confirmed the results of screening by SDS-PAGE and clearly indicated a nephrotoxic effect of DNT under the given conditions of exposure. Within the cluster of cancer patients observed among the DNT-exposed workers, only in exceptional cases were normal biomarker excretions found.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

The kinetics of acid-catalyzed hydrolysis of seven methylated aliphatic epoxides - R1R2C(O)CR3R4 (A: R1=R2=R3=R4=H; B: R1=R2=R3=H, R4=Me; C: R1=R2=H, R3=R4=Me; D: R1=R3=H, R2=R4=Me(trans); E: R1=R3=H, R2=R4=Me(cis); F: R1=R3=R4=Me, R2=H; G: R1=R2=R3=R4=Me) - has been studied at 36 ± 1.5°C. Compounds with two methyl groups at the same carbon atom of the oxirane ring exhibit highest rate constants (k(eff) in reciprocal molar concentration per second: 11.0 ± 1.3 for C, 10.7 ± 2.1 for F, and 8.7 ± 0.7 for G as opposed to 0.124 ± 0.003 for B, 0.305 ± 0.003 for D, and 0.635 ± 0.036 for E). Ethylene oxide (A) displays the lowest rate of hydrolysis (0.027 M-1 s-1). The results are consistent with literature data available for compounds A, B, and C. To model the reactivities we have employed quantum chemical calculations (MNDO, AM1, PM3, and MINDO/3) of the main reaction species. There is a correlation of the logarithm k(eff) with the total energy of epoxide ring opening. The best correlation coefficients (r) were obtained using the AM1 and MNDO methods (0.966 and 0.957, respectively). However, unlike MNDO, AM1 predicts approximately zero energy barriers for the oxirane ring opening of compounds B, C, E and G, which is not consistent with published kinetic data. Thus, the MNDO method provides a preferential means of modeling the acidic hydrolysis of the series of methylated oxiranes. The general ranking of mutagenicity in vitro, A > B > C, is in line with the concept that this sequence also gradually leaves the expoxide reactivity optimal for genotoxicity toward reactivities leading to higher biological detoxifications.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Competition for research funding is intense and the opinions of an expert peer reviewer can mean the difference between success and failure in securing funding. The allocation of expert peer reviewers is therefore vitally important and funding agencies strive to avoid using reviewers who have real or perceived conflicts of interest. This article examines the impact of including or excluding peer reviewers based on their conflicts of interest, and the final ranking of funding proposals. Two 7-person review panels assessed a sample of National Health and Medical Research Council (NHMRC) of Australia proposals in Basic Science or Public Health. Using a pre-post comparison, the proposals were first scored after the exclusion of reviewers with a high or medium conflict, and re-scored after the return of reviewers with medium conflicts. The main outcome measures are the agreements in ranks and funding success before and after excluding the medium conflicts. Including medium conflicts of interest had little impact on the ranks or funding success. The Bland–Altman 95% limits of agreement were ± 3.3 ranks and ± 3.4 ranks in the two panels which both assessed 36 proposals. Overall there were three proposals (4%) that had a reversed funding outcome after including medium conflicts. Relaxing the conflict of interest rules would increase the number of expert reviewers included in the panel discussions which could increase the quality of peer review and make it easier to find reviewers.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

This research falls in the area of enhancing the quality of tag-based item recommendation systems. It aims to achieve this by employing a multi-dimensional user profile approach and by analyzing the semantic aspects of tags. Tag-based recommender systems have two characteristics that need to be carefully studied in order to build a reliable system. Firstly, the multi-dimensional correlation, called as tag assignment , should be appropriately modelled in order to create the user profiles [1]. Secondly, the semantics behind the tags should be considered properly as the flexibility with their design can cause semantic problems such as synonymy and polysemy [2]. This research proposes to address these two challenges for building a tag-based item recommendation system by employing tensor modeling as the multi-dimensional user profile approach, and the topic model as the semantic analysis approach. The first objective is to optimize the tensor model reconstruction and to improve the model performance in generating quality rec-ommendation. A novel Tensor-based Recommendation using Probabilistic Ranking (TRPR) method [3] has been developed. Results show this method to be scalable for large datasets and outperforming the benchmarking methods in terms of accuracy. The memory efficient loop implements the n-mode block-striped (matrix) product for tensor reconstruction as an approximation of the initial tensor. The probabilistic ranking calculates the probabil-ity of users to select candidate items using their tag preference list based on the entries generated from the reconstructed tensor. The second objective is to analyse the tag semantics and utilize the outcome in building the tensor model. This research proposes to investigate the problem using topic model approach to keep the tags nature as the “social vocabulary” [4]. For the tag assignment data, topics can be generated from the occurrences of tags given for an item. However there is only limited amount of tags availa-ble to represent items as collection of topics, since an item might have only been tagged by using several tags. Consequently, the generated topics might not able to represent the items appropriately. Furthermore, given that each tag can belong to any topics with various probability scores, the occurrence of tags cannot simply be mapped by the topics to build the tensor model. A standard weighting technique will not appropriately calculate the value of tagging activity since it will define the context of an item using a tag instead of a topic.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Conservation decision tools based on cost-effectiveness analysis are used to assess threat management strategies for improving species persistence. These approaches rank alternative strategies by their benefit to cost ratio but may fail to identify the optimal sets of strategies to implement under limited budgets because they do not account for redundancies. We devised a multi objective optimization approach in which the complementarity principle is applied to identify the sets of threat management strategies that protect the most species for any budget. We used our approach to prioritize threat management strategies for 53 species of conservation concern in the Pilbara, Australia. We followed a structured elicitation approach to collect information on the benefits and costs of implementing 17 different conservation strategies during a 3-day workshop with 49 stakeholders and experts in the biodiversity, conservation, and management of the Pilbara. We compared the performance of our complementarity priority threat management approach with a current cost-effectiveness ranking approach. A complementary set of 3 strategies: domestic herbivore management, fire management and research, and sanctuaries provided all species with >50% chance of persistence for $4.7 million/year over 20 years. Achieving the same result cost almost twice as much ($9.71 million/year) when strategies were selected by their cost-effectiveness ranks alone. Our results show that complementarity of management benefits has the potential to double the impact of priority threat management approaches.