540 resultados para Cross-lingual document retrieval
Resumo:
The use of ‘topic’ concepts has shown improved search performance, given a query, by bringing together relevant documents which use different terms to describe a higher level concept. In this paper, we propose a method for discovering and utilizing concepts in indexing and search for a domain specific document collection being utilized in industry. This approach differs from others in that we only collect focused concepts to build the concept space and that instead of turning a user’s query into a concept based query, we experiment with different techniques of combining the original query with a concept query. We apply the proposed approach to a real-world document collection and the results show that in this scenario the use of concept knowledge at index and search can improve the relevancy of results.
Resumo:
Clustering is an important technique in organising and categorising web scale documents. The main challenges faced in clustering the billions of documents available on the web are the processing power required and the sheer size of the datasets available. More importantly, it is nigh impossible to generate the labels for a general web document collection containing billions of documents and a vast taxonomy of topics. However, document clusters are most commonly evaluated by comparison to a ground truth set of labels for documents. This paper presents a clustering and labeling solution where the Wikipedia is clustered and hundreds of millions of web documents in ClueWeb12 are mapped on to those clusters. This solution is based on the assumption that the Wikipedia contains such a wide range of diverse topics that it represents a small scale web. We found that it was possible to perform the web scale document clustering and labeling process on one desktop computer under a couple of days for the Wikipedia clustering solution containing about 1000 clusters. It takes longer to execute a solution with finer granularity clusters such as 10,000 or 50,000. These results were evaluated using a set of external data.
Resumo:
BACKGROUND: Registered nurses and midwives play an essential role in detecting patients at risk of deterioration through ongoing assessment and action in response to changing health status. Yet, evidence suggests that clinical deterioration frequently goes unnoticed in hospitalised patients. While much attention has been paid to early warning and rapid response systems, little research has examined factors related to physical assessment skills. OBJECTIVES: To determine a minimum data set of core skills used during nursing assessment of hospitalised patients and identify nurse and workplace predictors of the use of physical assessment to detect patient deterioration. DESIGN: The study used a single-centre, cross-sectional survey design. SETTING and PARTICIPANTS: The study included 434 registered nurses and midwives (Grades 5-7) involved in clinical care of patients on acute care wards, including medicine, surgery, oncology, mental health and maternity service areas, at a 929-bed tertiary referral teaching hospital in Southeast Queensland, Australia. METHODS: We conducted a hospital-wide survey of registered nurses and midwives using the 133-item Physical Assessment Skills Inventory and the 58-item Barriers to Registered Nurses’ Use of Physical Assessment scale. Median frequency for each physical assessment skill was calculated to determine core skills. To explore predictors of core skill utilisation, backward stepwise general linear modelling was conducted. Means and regression coefficients are reported with 95% confidence intervals. A p value < .05 was considered significant for all analyses. RESULTS: Core skills used by most nurses every time they worked included assessment of temperature, oxygen saturation, blood pressure, breathing effort, skin, wound and mental status. Reliance on others and technology (F = 35.77, p < .001), lack of confidence (F = 5.52, p = .02), work area (F = 3.79, p = .002), and clinical role (F = 44.24, p < .001) were significant predictors of the extent of physical assessment skill use. CONCLUSIONS: The increasing acuity of the acute care patient plausibly warrants more than vital signs assessment; however, our study confirms nurses’ physical assessment core skill set is mainly comprised of vital signs. The focus on these endpoints of deterioration as dictated by early warning and rapid response systems may divert attention from and devalue comprehensive nursing assessment that could detect subtle changes in health status earlier in the patient's hospitalisation.
Resumo:
Background Currently, care providers and policy-makers internationally are working to promote normal birth. In Australia, such initiatives are being implemented without any evidence of the prevalence or determinants of normal birth as a multidimensional construct. This study aimed to better understand the determinants of normal birth (defined as without induction of labour, epidural/spinal/general anaesthesia, forceps/vacuum, caesarean birth, or episiotomy) using secondary analyses of data from a population survey of women in Queensland, Australia. Methods Women who birthed in Queensland during a two-week period in 2009 were mailed a survey approximately three months after birth. Women (n=772) provided retrospective data on their pregnancy, labour and birth preferences and experiences, socio-demographic characteristics, and reproductive history. A series of logistic regressions were conducted to determine factors associated with having labour, having a vaginal birth, and having a normal birth. Findings Overall, 81.9% of women had labour, 66.4% had a vaginal birth, and 29.6% had a normal birth. After adjusting for other significant factors, women had significantly higher odds of having labour if they birthed in a public hospital and had a pre-existing preference for a vaginal birth. Of women who had labour, 80.8% had a vaginal birth. Women who had labour had significantly higher odds of having a vaginal birth if they attended antenatal classes, did not have continuous fetal monitoring, felt able to ‘take their time’ in labour, and had a pre-existing preference for a vaginal birth. Of women who had a vaginal birth, 44.7% had a normal birth. Women who had a vaginal birth had significantly higher odds of having a normal birth if they birthed in a public hospital, birthed outside regular business hours, had mobility in labour, did not have continuous fetal monitoring, and were non-supine during birth. Conclusions These findings provide a strong foundation on which to base resources aimed at increasing informed decision-making for maternity care consumers, providers, and policy-makers alike. Research to evaluate the impact of modifying key clinical practices (e.g., supporting women׳s mobility during labour, facilitating non-supine positioning during birth) on the likelihood of a normal birth is an important next step.
Resumo:
This thesis targets on a challenging issue that is to enhance users' experience over massive and overloaded web information. The novel pattern-based topic model proposed in this thesis can generate high-quality multi-topic user interest models technically by incorporating statistical topic modelling and pattern mining. We have successfully applied the pattern-based topic model to both fields of information filtering and information retrieval. The success of the proposed model in finding the most relevant information to users mainly comes from its precisely semantic representations to represent documents and also accurate classification of the topics at both document level and collection level.
Resumo:
This cross disciplinary study was conducted as two research and development projects. The outcome is a multimodal and dynamic chronicle, which incorporates the tracking of spatial, temporal and visual elements of performative practice-led and design-led research journeys. The distilled model provides a strong new approach to demonstrate rigour in non-traditional research outputs including provenance and an 'augmented web of facticity'.
Resumo:
Previous qualitative research has highlighted that temporality plays an important role in relevance for clinical records search. In this study, an investigation is undertaken to determine the effect that the timespan of events within a patient record has on relevance in a retrieval scenario. In addition, based on the standard practise of document length normalisation, a document timespan normalisation model that specifically accounts for timespans is proposed. Initial analysis revealed that in general relevant patient records tended to cover a longer timespan of events than non-relevant patient records. However, an empirical evaluation using the TREC Medical Records track supports the opposite view that shorter documents (in terms of timespan) are better for retrieval. These findings highlight that the role of temporality in relevance is complex and how to effectively deal with temporality within a retrieval scenario remains an open question.
Resumo:
Concept mapping involves determining relevant concepts from a free-text input, where concepts are defined in an external reference ontology. This is an important process that underpins many applications for clinical information reporting, derivation of phenotypic descriptions, and a number of state-of-the-art medical information retrieval methods. Concept mapping can be cast into an information retrieval (IR) problem: free-text mentions are treated as queries and concepts from a reference ontology as the documents to be indexed and retrieved. This paper presents an empirical investigation applying general-purpose IR techniques for concept mapping in the medical domain. A dataset used for evaluating medical information extraction is adapted to measure the effectiveness of the considered IR approaches. Standard IR approaches used here are contrasted with the effectiveness of two established benchmark methods specifically developed for medical concept mapping. The empirical findings show that the IR approaches are comparable with one benchmark method but well below the best benchmark.
Resumo:
Recent advances in neural language models have contributed new methods for learning distributed vector representations of words (also called word embeddings). Two such methods are the continuous bag-of-words model and the skipgram model. These methods have been shown to produce embeddings that capture higher order relationships between words that are highly effective in natural language processing tasks involving the use of word similarity and word analogy. Despite these promising results, there has been little analysis of the use of these word embeddings for retrieval. Motivated by these observations, in this paper, we set out to determine how these word embeddings can be used within a retrieval model and what the benefit might be. To this aim, we use neural word embeddings within the well known translation language model for information retrieval. This language model captures implicit semantic relations between the words in queries and those in relevant documents, thus producing more accurate estimations of document relevance. The word embeddings used to estimate neural language models produce translations that differ from previous translation language model approaches; differences that deliver improvements in retrieval effectiveness. The models are robust to choices made in building word embeddings and, even more so, our results show that embeddings do not even need to be produced from the same corpus being used for retrieval.
Resumo:
The information on climate variations is essential for the research of many subjects, such as the performance of buildings and agricultural production. However, recorded meteorological data are often incomplete. There may be a limited number of locations recorded, while the number of recorded climatic variables and the time intervals can also be inadequate. Therefore, the hourly data of key weather parameters as required by many building simulation programmes are typically not readily available. To overcome this gap in measured information, several empirical methods and weather data generators have been developed. They generally employ statistical analysis techniques to model the variations of individual climatic variables, while the possible interactions between different weather parameters are largely ignored. Based on a statistical analysis of 10 years historical hourly climatic data over all capital cities in Australia, this paper reports on the finding of strong correlations between several specific weather variables. It is found that there are strong linear correlations between the hourly variations of global solar irradiation (GSI) and dry bulb temperature (DBT), and between the hourly variations of DBT and relative humidity (RH). With an increase in GSI, DBT would generally increase, while the RH tends to decrease. However, no such a clear correlation can be found between the DBT and atmospheric pressure (P), and between the DBT and wind speed. These findings will be useful for the research and practice in building performance simulation.
Resumo:
Purpose: In the present study, we consider mechanical properties of phosphate glasses under high temperatureinduced and under friction-induced cross-linking, which enhance the modulus of elasticity. Design/methodology/approach: Two nanomechanical properties are evaluated, the first parameter is the modulus of elasticity (E) (or Young's modulus) and the second parameter is the hardness (H). Zinc meta-, pyro - and orthophosphates were recognized as amorphous-colloidal nanoparticles were synthesized under laboratory conditions and showed antiwear properties in engine oil. Findings: Young's modulus of the phosphate glasses formed under high temperature was in the 60-89 GPa range. For phosphate tribofilm formed under friction hardness and the Young's modulus were in the range of 2-10 GPa and 40-215 GPa, respectively. The degree of cross-linking during friction is provided by internal pressure of about 600 MPa and temperature close to 1000°C enhancing mechanical properties by factor of 3 (see Fig 1). Research limitations/implications: The addition of iron or aluminum ions to phosphate glasses under high temperature - and friction-induced amorphization of zinc metaphosphate and pyrophosphate tends to provide more cross-linking and mechanically stronger structures. Iron and aluminum (FeO4 or AlO4 units), incorporated into phosphate structure as network formers, contribute to the anion network bonding by converting the P=O bonds into bridging oxygen. Future work should consider on development of new of materials prepared by solgel processes, eg., zinc (II)-silicic acid. Originality/value: This paper analyses the friction pressure-induced and temperature–induced the two factors lead phosphate tribofilm glasses to chemically advanced glass structures, which may enhance the wear inhibition. Adding the coordinating ions alters the pressure at which cross-linking occurs and increases the antiwear properties of the surface material significantly.