2 resultados para Non-thresholding speech noise reduction
Resumo:
Background and aims: Machine learning techniques for the text mining of cancer-related clinical documents have not been sufficiently explored. Here some techniques are presented for the pre-processing of free-text breast cancer pathology reports, with the aim of facilitating the extraction of information relevant to cancer staging.
Materials and methods: The first technique was implemented using the freely available software RapidMiner to classify the reports according to their general layout: ‘semi-structured’ and ‘unstructured’. The second technique was developed using the open source language engineering framework GATE and aimed at the prediction of chunks of the report text containing information pertaining to the cancer morphology, the tumour size, its hormone receptor status and the number of positive nodes. The classifiers were trained and tested respectively on sets of 635 and 163 manually classified or annotated reports, from the Northern Ireland Cancer Registry.
Results: The best result of 99.4% accuracy – which included only one semi-structured report predicted as unstructured – was produced by the layout classifier with the k nearest algorithm, using the binary term occurrence word vector type with stopword filter and pruning. For chunk recognition, the best results were found using the PAUM algorithm with the same parameters for all cases, except for the prediction of chunks containing cancer morphology. For semi-structured reports the performance ranged from 0.97 to 0.94 and from 0.92 to 0.83 in precision and recall, while for unstructured reports performance ranged from 0.91 to 0.64 and from 0.68 to 0.41 in precision and recall. Poor results were found when the classifier was trained on semi-structured reports but tested on unstructured.
Conclusions: These results show that it is possible and beneficial to predict the layout of reports and that the accuracy of prediction of which segments of a report may contain certain information is sensitive to the report layout and the type of information sought.
Resumo:
Since core-collapse supernova simulations still struggle to produce robust neutrino-driven explosions in 3D, it has been proposed that asphericities caused by convection in the progenitor might facilitate shock revival by boosting the activity of non-radial hydrodynamic instabilities in the post-shock region. We investigate this scenario in depth using 42 relativistic 2D simulations with multigroup neutrino transport to examine the effects of velocity and density perturbations in the progenitor for different perturbation geometries that obey fundamental physical constraints (like the anelastic condition). As a framework for analysing our results, we introduce semi-empirical scaling laws relating neutrino heating, average turbulent velocities in the gain region, and the shock deformation in the saturation limit of non-radial instabilities. The squared turbulent Mach number, 〈Ma2〉, reflects the violence of aspherical motions in the gain layer, and explosive runaway occurs for 〈Ma2〉 ≳ 0.3, corresponding to a reduction of the critical neutrino luminosity by ∼25∼25 per cent compared to 1D. In the light of this theory, progenitor asphericities aid shock revival mainly by creating anisotropic mass flux on to the shock: differential infall efficiently converts velocity perturbations in the progenitor into density perturbations δρ/ρ at the shock of the order of the initial convective Mach number Maprog. The anisotropic mass flux and ram pressure deform the shock and thereby amplify post-shock turbulence. Large-scale (ℓ = 2, ℓ = 1) modes prove most conducive to shock revival, whereas small-scale perturbations require unrealistically high convective Mach numbers. Initial density perturbations in the progenitor are only of the order of Ma2progMaprog2 and therefore play a subdominant role.