994 results for Document description


Relevance:

20.00% 20.00%

Publisher:

Abstract:

Traditional information filtering (IF) models often assume that the documents in a collection relate to only one topic. In reality, however, users’ interests can be diverse, and the documents in a collection often involve multiple topics. Topic modelling was proposed to generate statistical models that represent multiple topics in a collection of documents, but in a topic model, topics are represented by distributions over words, which are limited in their ability to represent the semantics of topics distinctively. Patterns are always thought to be more discriminative than single terms and are able to reveal the inner relations between words. This paper proposes a novel information filtering model, the Significant matched Pattern-based Topic Model (SPBTM). The SPBTM represents user information needs in terms of multiple topics, and each topic is represented by patterns. More importantly, the patterns are organized into groups based on their statistical and taxonomic features, from which the more representative patterns, called Significant Matched Patterns, can be identified and used to estimate document relevance. Experiments on benchmark data sets demonstrate that the SPBTM significantly outperforms the state-of-the-art models.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

Objective: To synthesise recent research on the use of machine learning approaches to mining textual injury surveillance data. Design: Systematic review. Data sources: The electronic databases searched included PubMed, Cinahl, Medline, Google Scholar, and Proquest. The bibliography of all relevant articles was examined, and associated articles were identified using a snowballing technique. Selection criteria: For inclusion, articles were required to meet the following criteria: (a) used a health-related database, (b) focused on injury-related cases, and (c) used machine learning approaches to analyse textual data. Methods: The papers identified through the search were screened, resulting in 16 papers selected for review. Articles were reviewed to describe the databases and methodology used, the strengths and limitations of different techniques, and the quality assurance approaches used. Due to heterogeneity between studies, meta-analysis was not performed. Results: Occupational injuries were the focus of half of the machine learning studies, and the most common methods described were Bayesian probability or Bayesian network based methods to either predict injury categories or extract common injury scenarios. Models were evaluated through comparison with gold standard data, content expert evaluation, or statistical measures of quality. Machine learning was found to provide high precision and accuracy when predicting a small number of categories, and was valuable for visualisation of injury patterns and prediction of future outcomes. However, difficulties related to generalizability, source data quality, complexity of models, and integration of content and technical knowledge were discussed. Conclusions: The use of narrative text for injury surveillance has grown in popularity, complexity and quality over recent years. With advances in data mining techniques, increased capacity for analysis of large databases, and involvement of computer scientists in the injury prevention field, along with more comprehensive use and description of quality assurance methods in text mining approaches, it is likely that we will see continued growth and advancement in knowledge of text mining in the injury field.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

Non-thermal plasma (NTP) has been introduced over the past several years as a promising method for nitrogen oxide (NOx) removal. The intent, when using NTP, is to selectively transfer input electrical energy to the electrons rather than expend it in heating the entire gas stream; the energized electrons generate free radicals through collisions and promote the desired chemical changes in the exhaust gases. The generated active species react with the pollutant molecules and decompose them. This paper reviews and summarizes relevant literature regarding various aspects of the application of NTP technology to NOx removal from exhaust gases. A comprehensive description of available scientific literature on NOx removal using NTP technology is presented, including various types of NTP, e.g. dielectric barrier discharge, corona discharge and electron beam. Furthermore, the combination of NTP with catalyst and adsorbent for better NOx removal efficiency is presented in detail. The removal of NOx from both simulated gases and real diesel engines is also considered in this review paper. As NTP is a new technique and is not yet commercialized, there is a need for more studies to be performed in this field.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

This report documents the recent progress (current as of December 2014) of the research project investigating novice driver safety in Oman. Included in this report is a summary of progress with publications to date, as well as a description of the preliminary results of the first phase of the quantitative survey with young drivers. With regard to the publications which have resulted from this research, two journal articles have been published in print, one is under review, and a fourth is in the late stages of development for submission...

Relevance:

20.00% 20.00%

Publisher:

Abstract:

The rehabilitation programs for bone-anchored prostheses relying on either the OPRA (Integrum, Sweden) or the ILP (Orthodynamics, Germany) fixation involve some form of static load bearing exercises (LBE). So far, most biomechanical studies of these static LBEs have focused on direct measurement of the actual forces and moments applied on the OPRA fixation of individuals with transfemoral amputation (TFA). To date, the proof-of-concept of an apparatus to conduct these kinetic measurements has been presented, along with some preliminary data. Understanding the kinetic data is essential to improve rehabilitation programs as well as the design of upcoming loading frames. However, kinetic information alone is difficult to interpret without concomitant kinematic data. The purpose of this preliminary study was to introduce a qualitative analysis describing the different body postures during LBE for a group of TFAs.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

This cross-disciplinary study was conducted as two research and development projects. The outcome is a multimodal and dynamic chronicle, which incorporates the tracking of spatial, temporal and visual elements of performative practice-led and design-led research journeys. The distilled model provides a strong new approach to demonstrating rigour in non-traditional research outputs, including provenance and an 'augmented web of facticity'.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

This paper uses discourse analysis techniques associated with Foucauldian archaeology to examine a teacher education accreditation document from Australia to reveal how graduating teachers are constructed through the discourses presented. The findings reveal a discursive site of contestation within the document itself and a mismatch between the identified policy discourses and those from the academic archive. The authors suggest that rather than contradictory representations of what constitutes graduating teacher quality and professionalism, what is needed is an accreditation process that agrees on constructions of graduate identity and professional practice that enact an intellectual and reflexive form of professionalism.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

This research examined the function of Queensland Health's Root Cause Analysis (RCA) to improve patient safety through an investigation of patient harm events where permanent harm and preventable death, Severity Assessment Code 1, were the outcome of healthcare. Unedited and highly legislated RCAs from across Queensland Health public hospitals from 2009, 2010 and 2011 comprised the data. A document analysis revealed the RCAs opposed organisational policy and dominant theoretical directives. If we accept the prevailing assumption that patient harm is a systemic issue, then the RCA is failing to address harm events in healthcare.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

Previous qualitative research has highlighted that temporality plays an important role in relevance for clinical records search. In this study, an investigation is undertaken to determine the effect that the timespan of events within a patient record has on relevance in a retrieval scenario. In addition, based on the standard practice of document length normalisation, a document timespan normalisation model that specifically accounts for timespans is proposed. Initial analysis revealed that, in general, relevant patient records tended to cover a longer timespan of events than non-relevant patient records. However, an empirical evaluation using the TREC Medical Records track supports the opposite view: that shorter documents (in terms of timespan) are better for retrieval. These findings highlight that the role of temporality in relevance is complex, and how to deal effectively with temporality within a retrieval scenario remains an open question.
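
As a point of reference for the document length normalisation that the abstract builds on, the BM25 term weight is one widely used form. The abstract does not state the proposed timespan model, so the second expression is only a hypothetical analogue in which an assumed document timespan $\tau_d$ and collection average $\bar{\tau}$ take the place of the length terms; the symbols $\tau_d$, $\bar{\tau}$ and $b_\tau$ are introduced here for illustration and are not from the source.

```latex
% BM25 term weight with document length normalisation (standard form)
w(t,d) = \mathrm{idf}(t)\,
  \frac{f(t,d)\,(k_1 + 1)}
       {f(t,d) + k_1\left(1 - b + b\,\frac{|d|}{\mathrm{avgdl}}\right)}

% Hypothetical timespan analogue (an assumption, not the authors' model)
w_\tau(t,d) = \mathrm{idf}(t)\,
  \frac{f(t,d)\,(k_1 + 1)}
       {f(t,d) + k_1\left(1 - b_\tau + b_\tau\,\frac{\tau_d}{\bar{\tau}}\right)}
```

In this sketch, setting $b_\tau = 0$ switches the timespan penalty off, while larger values increasingly favour records covering a shorter timespan.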

Relevance:

20.00% 20.00%

Publisher:

Abstract:

Document clustering is one of the prominent methods for mining important information from the vast amount of data available on the web. However, document clustering generally suffers from the curse of dimensionality. Providentially, in high dimensional space, data points tend to be more concentrated in some areas of clusters. We take advantage of this phenomenon by introducing a novel concept of dynamic cluster representation named loci. Clusters’ loci are efficiently calculated using documents’ ranking scores generated from a search engine. We propose a fast loci-based semi-supervised document clustering algorithm that uses clusters’ loci instead of conventional centroids for assigning documents to clusters. Empirical analysis on real-world datasets shows that the proposed method produces cluster solutions with promising quality and is substantially faster than several benchmarked centroid-based semi-supervised document clustering methods.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

We propose a robust method for mosaicing of document images using features derived from connected components. Each connected component is described using the Angular Radial Transform (ART). To ensure geometric consistency during feature matching, the ART coefficients of a connected component are augmented with those of its two nearest neighbors. The proposed method addresses two critical issues often encountered in correspondence matching: (i) the stability of features and (ii) robustness against false matches due to the multiple instances of characters in a document image. The use of connected components guarantees a stable localization across images. The augmented features ensure a successful correspondence matching even in the presence of multiple similar regions within the page. We illustrate the effectiveness of the proposed method on camera-captured document images exhibiting large variations in viewpoint, illumination and scale.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

This thesis studies document signatures, which are small representations of documents and other objects that can be stored compactly and compared for similarity. This research finds that document signatures can be effectively and efficiently used to both search and understand relationships between documents in large collections, scalable enough to search a billion documents in a fraction of a second. Deliverables arising from the research include an investigation of the representational capacity of document signatures, the publication of an open-source signature search platform and an approach for scaling signature retrieval to operate efficiently on collections containing hundreds of millions of documents.
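
The abstract does not say how the signatures are built, so the following is only a minimal sketch of one common way to obtain compact, comparable signatures: a SimHash-style bit vote over term hashes, compared by Hamming distance. The function names (`term_hash`, `signature`, `hamming`), the 64-bit width and the whitespace tokenisation are all assumptions made for illustration, not details of the thesis or its search platform.

```python
import hashlib
from collections import Counter

BITS = 64  # assumed signature width; the thesis may use a different size


def term_hash(term: str) -> int:
    """Deterministic 64-bit hash of a term."""
    digest = hashlib.blake2b(term.encode("utf-8"), digest_size=8).digest()
    return int.from_bytes(digest, "big")


def signature(text: str) -> int:
    """SimHash-style signature: each term votes on every bit, weighted by count."""
    counts = Counter(text.lower().split())
    votes = [0] * BITS
    for term, weight in counts.items():
        h = term_hash(term)
        for bit in range(BITS):
            # A set bit adds the term weight, a clear bit subtracts it.
            votes[bit] += weight if (h >> bit) & 1 else -weight
    sig = 0
    for bit, vote in enumerate(votes):
        if vote > 0:
            sig |= 1 << bit
    return sig


def hamming(a: int, b: int) -> int:
    """Number of differing bits; smaller means more similar signatures."""
    return bin(a ^ b).count("1")


if __name__ == "__main__":
    s1 = signature("document signatures support fast similarity search")
    s2 = signature("document signatures support rapid similarity search")
    print(hamming(s1, s2))
```

Because each signature is a fixed-width integer, a collection can be stored compactly and compared with bitwise operations, which is the property the abstract attributes to signatures in general.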

Relevance:

20.00% 20.00%

Publisher:

Abstract:

The theory for time-resolved, pump-probe, photoemission spectroscopy and other pump-probe experiments is developed. The formal development is completely general, incorporating all of the nonequilibrium effects of the pump pulse and the finite time width of the probe pulse, and including possibilities for taking into account band structure and matrix element effects, surface states, and the interaction of the photoexcited electrons with the system leading to corrections to the sudden approximation. We also illustrate the effects of windowing that arise from the finite width of the probe pulse in a simple model system by assuming the quasiequilibrium approximation.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

With the extension of the work of the preceding paper, the relativistic front form for Maxwell's equations for electromagnetism is developed and shown to be particularly suited to the description of paraxial waves. The generators of the Poincaré group in a form applicable directly to the electric and magnetic field vectors are derived. It is shown that the effect of a thin lens on a paraxial electromagnetic wave is given by a six-dimensional transformation matrix, constructed out of certain special generators of the Poincaré group. The method of construction guarantees that the free propagation of such waves as well as their transmission through ideal optical systems can be described in terms of the metaplectic group, exactly as found for scalar waves by Bacry and Cadilhac. An alternative formulation in terms of a vector potential is also constructed. It is chosen in a gauge suggested by the front form and by the requirement that the lens transformation matrix act locally in space. Pencils of light with accompanying polarization are defined for statistical states in terms of the two-point correlation function of the vector potential. Their propagation and transmission through lenses are briefly considered in the paraxial limit. This paper extends Fourier optics and completes it by formulating it for the Maxwell field. We stress that the derivations depend explicitly on the "henochromatic" idealization as well as the identification of the ideal lens with a quadratic phase shift and are heuristic to this extent.
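
For orientation, the identification of an ideal lens with a quadratic phase shift can be written in the familiar scalar form below. This is the textbook scalar expression, not the six-dimensional transformation matrix derived in the paper; the focal length $f$, wavelength $\lambda$, wavenumber $k$ and transverse coordinates $(x, y)$ are notation assumed here, and the sign of the exponent depends on the time convention chosen.

```latex
% Thin lens of focal length f acting on a paraxial field (scalar form)
\psi_{\mathrm{out}}(x,y)
  = \exp\!\left[-\frac{i k}{2 f}\,\bigl(x^{2} + y^{2}\bigr)\right]
    \psi_{\mathrm{in}}(x,y),
\qquad k = \frac{2\pi}{\lambda}
```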

Relevance:

20.00% 20.00%

Publisher:

Abstract:

In this work, we theoretically examine recent pump/probe photoemission experiments on the strongly correlated charge-density-wave insulator TaS2. We describe the general nonequilibrium many-body formulation of time-resolved photoemission in the sudden approximation, and then solve the problem using dynamical mean-field theory (DMFT) with the numerical renormalization group and a bare density of states calculated from density functional theory, including the charge-density-wave distortion of the ion cores and spin-orbit coupling. We find a number of interesting results: (i) the bare band structure actually has more dispersion in the perpendicular direction than in the two-dimensional planes; (ii) the DMFT approach can produce upper and lower Hubbard bands that resemble those in the experiment, but the upper bands will overlap in energy with other higher energy bands; (iii) the effect of the finite width of the probe pulse is minimal on the shape of the photoemission spectra; and (iv) the quasiequilibrium approximation does not fully describe the behavior in this system.