3 resultados para Extraction of premolars
Resumo:
Background and aims: Machine learning techniques for the text mining of cancer-related clinical documents have not been sufficiently explored. Here some techniques are presented for the pre-processing of free-text breast cancer pathology reports, with the aim of facilitating the extraction of information relevant to cancer staging.
Materials and methods: The first technique was implemented using the freely available software RapidMiner to classify the reports according to their general layout: ‘semi-structured’ and ‘unstructured’. The second technique was developed using the open source language engineering framework GATE and aimed at the prediction of chunks of the report text containing information pertaining to the cancer morphology, the tumour size, its hormone receptor status and the number of positive nodes. The classifiers were trained and tested respectively on sets of 635 and 163 manually classified or annotated reports, from the Northern Ireland Cancer Registry.
Results: The best result of 99.4% accuracy – which included only one semi-structured report predicted as unstructured – was produced by the layout classifier with the k nearest algorithm, using the binary term occurrence word vector type with stopword filter and pruning. For chunk recognition, the best results were found using the PAUM algorithm with the same parameters for all cases, except for the prediction of chunks containing cancer morphology. For semi-structured reports the performance ranged from 0.97 to 0.94 and from 0.92 to 0.83 in precision and recall, while for unstructured reports performance ranged from 0.91 to 0.64 and from 0.68 to 0.41 in precision and recall. Poor results were found when the classifier was trained on semi-structured reports but tested on unstructured.
Conclusions: These results show that it is possible and beneficial to predict the layout of reports and that the accuracy of prediction of which segments of a report may contain certain information is sensitive to the report layout and the type of information sought.
Resumo:
The cobas® (Roche) portfolio of companion diagnostics in oncology currently has three assays CE-marked for in vitro diagnostics. Two of these (EGFR and BRAF) are also US FDA-approved. These assays detect clinically relevant mutations that are correlated with response (BRAF, EGFR) or lack of response (KRAS) to targeted therapies such as selective mutant BRAF inhibitors in malignant melanoma, tyrosine kinases inhibitor in non-small cell lung cancer and anti-EGFR monoclonal antibodies in colorectal cancer, respectively. All these assays are run on a single platform using DNA extracted from a single 5 µm section of a formalin-fixed paraffin-embedded tissue block. The assays provide an ‘end-to-end’ solution from extraction of DNA to automated analysis and report on the cobas z 480. The cobas tests have shown robust and reproducible performance, with high sensitivity and specificity and low limit of detection, making them suitable as companion diagnostics for clinical use.
Resumo:
We study work extraction from the Dicke model achieved using simple unitary cyclic transformations keeping into account both a non optimal unitary protocol, and the energetic cost of creating the initial state. By analyzing the role of entanglement, we find that highly entangled states can be inefficient for energy storage when considering the energetic cost of creating the state. Such surprising result holds notwithstanding the fact that the criticality of the model at hand can sensibly improve the extraction of work. While showing the advantages of using a many-body system for work extraction, our results demonstrate that entanglement is not necessarily advantageous for energy storage purposes, when non optimal processes are considered. Our work shows the importance of better understanding the complex interconnections between non-equilibrium thermodynamics of quantum systems and correlations among their subparts.