38 resultados para Precision and recall

em QUB Research Portal - Research Directory and Institutional Repository for Queen's University Belfast


Relevância:

100.00% 100.00%

Publicador:

Resumo:

Learning or writing regular expressions to identify instances of a specific
concept within text documents with a high precision and recall is challenging.
It is relatively easy to improve the precision of an initial regular expression
by identifying false positives covered and tweaking the expression to avoid the
false positives. However, modifying the expression to improve recall is difficult
since false negatives can only be identified by manually analyzing all documents,
in the absence of any tools to identify the missing instances. We focus on partially
automating the discovery of missing instances by soliciting minimal user
feedback. We present a technique to identify good generalizations of a regular
expression that have improved recall while retaining high precision. We empirically
demonstrate the effectiveness of the proposed technique as compared to
existing methods and show results for a variety of tasks such as identification of
dates, phone numbers, product names, and course numbers on real world datasets

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Background and aims: Machine learning techniques for the text mining of cancer-related clinical documents have not been sufficiently explored. Here some techniques are presented for the pre-processing of free-text breast cancer pathology reports, with the aim of facilitating the extraction of information relevant to cancer staging.

Materials and methods: The first technique was implemented using the freely available software RapidMiner to classify the reports according to their general layout: ‘semi-structured’ and ‘unstructured’. The second technique was developed using the open source language engineering framework GATE and aimed at the prediction of chunks of the report text containing information pertaining to the cancer morphology, the tumour size, its hormone receptor status and the number of positive nodes. The classifiers were trained and tested respectively on sets of 635 and 163 manually classified or annotated reports, from the Northern Ireland Cancer Registry.

Results: The best result of 99.4% accuracy – which included only one semi-structured report predicted as unstructured – was produced by the layout classifier with the k nearest algorithm, using the binary term occurrence word vector type with stopword filter and pruning. For chunk recognition, the best results were found using the PAUM algorithm with the same parameters for all cases, except for the prediction of chunks containing cancer morphology. For semi-structured reports the performance ranged from 0.97 to 0.94 and from 0.92 to 0.83 in precision and recall, while for unstructured reports performance ranged from 0.91 to 0.64 and from 0.68 to 0.41 in precision and recall. Poor results were found when the classifier was trained on semi-structured reports but tested on unstructured.

Conclusions: These results show that it is possible and beneficial to predict the layout of reports and that the accuracy of prediction of which segments of a report may contain certain information is sensitive to the report layout and the type of information sought.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Two experiments investigated the consequences of action at encoding and recall on the ability to follow sequences of instructions. Children aged 7–9 years recalled sequences of spoken action commands under presentation and recall conditions that either did or did not involve their physical performance. In both experiments, recall was enhanced by carrying out the instructions as they were being initially presented and also by performing them at recall. In contrast, the accuracy of instruction-following did not improve above spoken presentation alone, either when the instructions were silently read or heard by the child (Experiment 1), or when the child repeated the spoken instructions as they were presented (Experiment 2). These findings suggest that the enactment advantage at presentation does not simply reflect a general benefit of a dual exposure to instructions, and that it is not a result of their self-production at presentation. The benefits of action-based recall were reduced following enactment during presentation, suggesting that the positive effects of action at encoding and recall may have a common origin. It is proposed that the benefits of physical movement arise from the existence of a short-term motor store that maintains the temporal, spatial, and motoric features of either planned or already executed actions.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Background: The COMET (Core Outcome Measures in Effectiveness Trials) Initiative is developing a publicly accessible online resource to collate the knowledge base for core outcome set development (COS) and the applied work from different health conditions. Ensuring that the database is as comprehensive as possible and keeping it up to date are key to its value for users. This requires the development and application of an optimal, multi-faceted search strategy to identify relevant material. This paper describes the challenges of designing and implementing such a search, outlining the development of the search strategy for studies of COS development, and, in turn, the process for establishing a database of COS.

Methods: We investigated the performance characteristics of this strategy including sensitivity, precision and numbers needed to read. We compared the contribution of databases towards identifying included studies to identify the best combination of methods to retrieve all included studies.

Results: Recall of the search strategies ranged from 4% to 87%, and precision from 0.77% to 1.13%. MEDLINE performed best in terms of recall, retrieving 216 (87%) of the 250 included records, followed by Scopus (44%). The Cochrane Methodology Register found just 4% of the included records. MEDLINE was also the database with the highest precision. The number needed to read varied between 89 (MEDLINE) and 130 (SCOPUS).

Conclusions: We found that two databases and hand searching were required to locate all of the studies in this review. MEDLINE alone retrieved 87% of the included studies, but actually 97% of the included studies were indexed on MEDLINE. The Cochrane Methodology Register did not contribute any records that were not found in the other databases, and will not be included in our future searches to identify studies developing COS. SCOPUS had the lowest precision rate (0.77) and highest number needed to read (130). In future COMET searches for COS a balance needs to be struck between the work involved in screening large numbers of records, the frequency of the searching and the likelihood that eligible studies will be identified by means other than the database searches.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Cancer clinical trials have been one of the key foundations for significant advances in oncology. However, there is a clear recognition within the academic, care delivery and pharmaceutical/biotech communities that our current model of clinical trial discovery and development is no longer fit for purpose. Delivering transformative cancer care should increasingly be our mantra, rather than maintaining the status quo of, at best, the often miniscule incremental benefits that are observed with many current clinical trials. As we enter the era of precision medicine for personalised cancer care (precision and personalised medicine), it is important that we capture and utilise our greater understanding of the biology of disease to drive innovative approaches in clinical trial design and implementation that can lead to a step change in cancer care delivery. A number of advances have been practice changing (e.g. imatinib mesylate in chronic myeloid leukaemia, Herceptin in erb-B2-positive breast cancer), and increasingly we are seeing the promise of a number of newer approaches, particularly in diseases like lung cancer and melanoma. Targeting immune checkpoints has recently yielded some highly promising results. New algorithms that maximise the effectiveness of clinical trials, through for example a multi-stage, multi-arm type design are increasingly gaining traction. However, our enthusiasm for the undoubted advances that have been achieved are being tempered by a realisation that these new approaches may have significant cost implications. This article will address these competing issues, mainly from a European perspective, highlight the problems and challenges to healthcare systems and suggest potential solutions that will ensure that the cost/value rubicon is addressed in a way that allows stakeholders to work together to deliver optimal cost-effective cancer care, the benefits of which can be transferred directly to our patients.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

As James Scott’s Seeing Like a State attests, forests played a central role in the rise of the modern state, specifically as test spaces for evolving methods of managing state resources at a distance, and as the location for grand state schemes. Together, such ambitions necessitated both the elimination of local understandings of forest management – to be replaced by centrally controlled scientific precisionand a narrowing of state vision. Forests thus began to be conflated with trees (and their timber) alone. All other aspects of the forest, both human and non-human, were ignored. Through the lens of the 18th and early 19th century New Forest in southern England, this paper examines the impact of government attempts to shift the focus of state forests from being remnant medieval hunting spaces to spaces of income generation through the creation of vast sylvicultural plantations. This state scheme not only reworked the relationship between the metropole and the provinces – something effected through systematic surveys and novel bureaucratic procedures – but also dramatically impacted upon the biophysical and cultural geographies of the forest. By equating forest space with trees alone, the British state failed to legislate for the actions of both local commoners and non-human others in resisting their schemes. Indeed, subsequent oppositions proved not only the tenacity of commoners in protecting their livelihoods but also the destructive power of non-human actants, specifically rabbits and mice. The paper concludes that grand state schemes necessarily fail due to their own internal illogic: the narrowing of state vision creates blind spots in which human and non-human lives assert their own visions.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

An experiment to quantify intra- and interobserver error in anatomical measurements found that interobserver measurements can vary by over 14% of mean specimen length; disparity in measurement increases logarithmically with the number of contributors; instructions did not reduce variation or measurement disparity; scale of the specimen influenced the precision of measurement (relative error increasing with specimen size); different methods of taking a measurement yielded different results, although they did not differ in terms of precision, and topographical complexity of the elements being considered may potentially influence error (error increasing with complexity). These results highlight concerns about introduction of noise and potential bias that should be taken into account when compiling composite datasets and meta-analyses.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

A gas chromatographic/mass spectrometric method is described for the detection of clenbuterol residues in liver, muscle, urine and retina. Tissue samples are first digested using protease and any clenbuterol present is extracted using a simple liquid/liquid extraction procedure. The dried extracts are then derivatized using methylboronic acid and the derivatives are subjected to gas chromatography/mass spectrometry on a magnetic sector instrument. The detection limit of the assay is 0.05 ng g-1 clenbuterol in liver, muscle or urine using a 10 g sample size, and 4 ng g-1 in retina using a 0.5 g sample size. The assay is made very specific by using selected ion monitoring of three ions at a resolution of 3500 and by ion ratio measurements. The precision and reproducibility of the assay are enhanced by the use of a deuterated internal standard, with a typical coefficient of variation of 3%.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Positive deviations from linear sea-level trends represent important climate signals if they are persistent and geographically widespread. This paper documents rapid sea-level rise reconstructed from sedimentary records obtained from salt marshes in the Southwest Pacific region (Tasmania and New Zealand). A new late Holocene relative sea-level record from eastern Tasmania was dated by AMS(14)C (conventional, high precision and bomb-spike), Cs-137, Pb-210, stable Pb isotopic ratios, trace metals, pollen and charcoal analyses. Palaeosea-level positions were determined by foraminiferal analyses. Relative sea level in Tasmania was within half a metre of present sea level for much of the last 6000 yr. Between 1900 and 1950 relative sea level rose at an average rate of 4.2 +/- 0.1 mm/yr. During the latter half of the 20th century the reconstructed rate of relative sea-level rise was 0.7 +/- 0.6 mm/yr. Our study is consistent with a similar pattern of relative sea-level change recently reconstructed for southern New Zealand. The change in the rate of sea-level rise in the SW Pacific during the early 20th century was larger than in the North Atlantic and could suggest that northern hemisphere land-based ice was the most significant melt source for global sea-level rise. (C) 2011 Elsevier B.V. All rights reserved.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Experimental values for the carbon dioxide solubility in eight pure electrolyte solvents for lithium ion batteries – such as ethylene carbonate (EC), propylene carbonate (PC), dimethyl carbonate (DMC), ethyl methyl carbonate (EMC), diethyl carbonate (DEC), ?-butyrolactone (?BL), ethyl acetate (EA) and methyl propionate (MP) – are reported as a function of temperature from (283 to 353) K and atmospheric pressure. Based on experimental solubility data, the Henry’s law constant of the carbon dioxide in these solvents was then deduced and compared with reported values from the literature, as well as with those predicted by using COSMO-RS methodology within COSMOthermX software and those calculated by the Peng–Robinson equation of state implemented into Aspen plus. From this work, it appears that the CO2 solubility is higher in linear carbonates (such as DMC, EMC, DEC) than in cyclic ones (EC, PC, ?BL). Furthermore, the highest CO2 solubility was obtained in MP and EA solvents, which are comparable to the solubility values reported in classical ionicliquids. The precision and accuracy of the experimental values, considered as the per cent of the relative average absolute deviations of the Henry’s law constants from appropriate smoothing equations and from literature values, are close to (1% and 15%), respectively. From the variation of the Henry’s law constants with temperature, the partial molar thermodynamic functions of dissolution such as the standard Gibbs free energy, the enthalpy, and the entropy are calculated, as well as the mixing enthalpy of the solvent with CO2 in its hypothetical liquid state.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

We present in this study the effect of nature and concentration of lithium salt, such as the lithium hexafluorophosphate, LiPF6; lithium tris(pentafluoroethane)-trifluorurophosphate LiFAP; lithium bis(trifluoromethylsulfonyl)imide, LiTFSI, on the CO2 solubility in four electrolytes for lithium ion batteries based on pure solvent that include ethylene carbonate (EC), dimethyl carbonate (DMC), ethyl methyl carbonate (EMC), diethyl carbonate (DEC), as well as, in the EC:DMC, EC:EMC and EC:DEC (50:50) wt.% binary mixtures as a function of temperature from (283 to 353) K and atmospheric pressure. Based on experimental solubility values, the Henry’s law constant of the carbon dioxide in these solutions with the presence or absence of lithium salt was then deduced and compared with reported values from the literature, as well as with those predicted by using COSMO-RS methodology within COSMOThermX software. From this study, it appears that the addition of 1 mol · dm-3 LiPF6 salt in alkylcarbonate solvents decreases their CO2 capture capacity. By using the same experimental conditions, an opposite CO2 solubility trend was generally observed in the case of the addition of LiFAP or LiTFSI salts in these solutions. Additionally, in all solutions investigated during this work, the CO2 solubility is greater in electrolytes containing the LiFAP salt, followed by those based on the LiTFSI case. The precision and accuracy of the experimental data reported therein, which are close to (1 and 15)%, respectively. From the variation of the Henry’s law constant with temperature, the partial molar thermodynamic functions of dissolution such as the standard Gibbs energy, the enthalpy, and the entropy, as well as the mixing enthalpy of the solvent with CO2 in its hypothetical liquid state were calculated. Finally, a quantitative analysis of the CO2 solubility evolution was carried out in the EC:DMC (50:50) wt.% binary mixture as the function of the LiPF6 or LiTFSI concentration in solution to elucidate how ionic species modify the CO2 solubility in alkylcarbonates-based Li-ion electrolytes by investigating the salting effects at T = 298.15 K and atmospheric pressure.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Situational awareness is achieved naturally by the human senses of sight and hearing in combination. Automatic scene understanding aims at replicating this human ability using microphones and cameras in cooperation. In this paper, audio and video signals are fused and integrated at different levels of semantic abstractions. We detect and track a speaker who is relatively unconstrained, i.e., free to move indoors within an area larger than the comparable reported work, which is usually limited to round table meetings. The system is relatively simple: consisting of just 4 microphone pairs and a single camera. Results show that the overall multimodal tracker is more reliable than single modality systems, tolerating large occlusions and cross-talk. System evaluation is performed on both single and multi-modality tracking. The performance improvement given by the audio–video integration and fusion is quantified in terms of tracking precision and accuracy as well as speaker diarisation error rate and precisionrecall (recognition). Improvements vs. the closest works are evaluated: 56% sound source localisation computational cost over an audio only system, 8% speaker diarisation error rate over an audio only speaker recognition unit and 36% on the precisionrecall metric over an audio–video dominant speaker recognition method.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Thirty-six 12-month-old hill hoggets were used in a 2 genotype (18 Scottish Blackface vs. 18 Swaledale×Scottish Blackface)×3 diet (fresh vs. ensiled vs. pelleted ryegrass) factorial design experiment to evaluate the effects of hogget genotype and forage type on enteric methane (CH4) emissions and nitrogen (N) utilisation. The hoggets were offered 3 diets ad libitum with no concentrate supplementation in a single period study with 6 hoggets for each of the 6 genotype×diet combinations (n=6). Fresh ryegrass was harvested daily in the morning. Pelleted ryegrass was sourced from a commercial supplier (Aylescott Driers & Feeds, Burrington, UK) and the ryegrass silage was ensiled with Ecosyl (Lactobacillus plantarum, Volac International Limited, Hertfordshire, UK) as an additive. The hoggets were housed in individual pens for at least 14 d before being transferred to individual respiration chambers for a further 4 d with feed intake, faeces and urine outputs and CH4 emissions measured. There was no significant interaction between genotype and forage type on any parameter evaluated. Sheep offered pelleted grass had greater feed intake (e.g. DM, energy and N) but less energy and nutrient apparent digestibility (e.g. DM, N and neutral detergent fibre (NDF)) than those given fresh grass or grass silage (P<0.001). Feeding pelleted grass, rather than fresh grass or grass silage, reduced enteric CH4 emissions as a proportion of DM intake and gross energy (GE) intake (P<0.01). Sheep offered fresh grass had a significantly lower acid detergent fibre (ADF) apparent digestibility, and CH4 energy output (CH4-E) as a proportion of GE intake than those offered grass silage (P<0.001). There was no significant difference, in CH4 emission rate or N utilisation efficiency when compared between Scottish Blackface and Swaledale × Scottish Blackface. Linear and multiple regression techniques were used to develop relationships between CH4 emissions or N excretion and dietary and animal variables using data from sheep offered fresh ryegrass and grass silage. The equation relating CH4-E (MJ/d) to GE intake (GEI, MJ/d), energy apparent digestibility (DE/GE) and metabolisability (ME/GE) resulted in a high r2 (CH4-E=0.074 GEI+9.2 DE/GE−10.2 ME/GE−0.37, r2=0.93). N intake (NI) was the best predictor for manure N excretion (Manure N=0.66 NI+0.96, r2=0.85). The use of these relationships can potentially improve the precision and decrease the uncertainty in predicting CH4 emissions and N excretion for sheep production systems managed under the current feeding conditions.