92 results for Data detection
Abstract:
Data mining can be defined as the extraction of previously unknown and potentially useful information from large datasets. The main principle is to devise computer programs that run through databases and automatically seek deterministic patterns. It is applied in fields such as remote sensing, biometry and speech recognition, but has seldom been applied to forensic case data. The intrinsic difficulty of using such data lies in its heterogeneity, which comes from the many different sources of information. The aim of this study is to highlight potential uses of pattern recognition that would provide relevant results from a criminal intelligence point of view. The role of data mining within a global crime analysis methodology is to detect all types of structures in a dataset. Once filtered and interpreted, those structures can point to previously unseen criminal activities. The interpretation of patterns for intelligence purposes is the final stage of the process. It allows the researcher to validate the whole methodology and to refine each step if necessary. The approach was applied to cutting agents found in illicit drug seizures. A combinatorial approach was taken, using the presence and absence of products. Methods from graph theory were used to extract patterns in data consisting of links between products and the place and date of seizure. A data mining process carried out using graph techniques is called "graph mining". Patterns were detected that had to be interpreted and compared with preliminary knowledge to establish their relevance. The illicit drug profiling process is in fact an intelligence process that uses preliminary illicit drug classes to classify new samples. The methods proposed in this study could be used a priori to compare structures from preliminary and post-detection patterns. This new knowledge of a repeated structure may provide valuable complementary information to profiling and become a source of intelligence.
Abstract:
The recent approval of crizotinib for the treatment of anaplastic lymphoma kinase (ALK)-rearranged advanced non-small cell lung cancer (NSCLC) in the US and other countries has provoked intense interest in ALK rearrangements as oncogenic drivers, and promises to revolutionise the way in which NSCLC is diagnosed and treated. Here, we review clinical data to date for the use of crizotinib to treat patients with advanced, ALK-positive NSCLC and consider issues surrounding the detection of ALK-positivity including the use of fluorescence in situ hybridisation and the other potential techniques available, and their suitability for ALK screening. We also discuss the emergence of resistance to crizotinib therapy and the range of other ALK inhibitors currently in development.
Abstract:
Recently, kernel-based Machine Learning methods have gained great popularity in many data analysis and data mining fields: pattern recognition, biocomputing, speech and vision, engineering, remote sensing etc. The paper describes the use of kernel methods to approach the processing of large datasets from environmental monitoring networks. Several typical problems of the environmental sciences and their solutions provided by kernel-based methods are considered: classification of categorical data (soil type classification), mapping of environmental and pollution continuous information (pollution of soil by radionuclides), mapping with auxiliary information (climatic data from Aral Sea region). The promising developments, such as automatic emergency hot spot detection and monitoring network optimization are discussed as well.
Abstract:
Until recently, the hard X-ray, phase-sensitive imaging technique called grating interferometry was thought to provide information only in real space. However, by utilizing an alternative approach to data analysis we demonstrated that the angular resolved ultra-small angle X-ray scattering distribution can be retrieved from experimental data. Thus, reciprocal space information is accessible by grating interferometry in addition to real space. Naturally, the quality of the retrieved data strongly depends on the performance of the employed analysis procedure, which involves deconvolution of periodic and noisy data in this context. The aim of this article is to compare several deconvolution algorithms to retrieve the ultra-small angle X-ray scattering distribution in grating interferometry. We quantitatively compare the performance of three deconvolution procedures (i.e., Wiener, iterative Wiener and Lucy-Richardson) in the case of realistically modeled, noisy and periodic input data. The simulations showed that the Lucy-Richardson algorithm is the most reliable and most efficient as a function of the characteristics of the signals in the given context. The availability of a reliable data analysis procedure is essential for future developments in grating interferometry.
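As an illustration of the Lucy-Richardson procedure named in this abstract, the following is a minimal sketch and not the authors' implementation. It assumes a one-dimensional periodic signal, a non-negative kernel of the same length that sums to one, and circular convolution; the function names `circ_conv` and `richardson_lucy` and the parameter `eps` are hypothetical.

```python
def circ_conv(signal, kernel):
    """Circular (periodic) convolution of two equal-length sequences."""
    n = len(signal)
    return [sum(signal[(i - j) % n] * kernel[j] for j in range(n))
            for i in range(n)]


def richardson_lucy(observed, kernel, iterations=50, eps=1e-12):
    """Multiplicative Richardson-Lucy update for periodic, non-negative data.

    Assumes (for this sketch) that `kernel` is non-negative, sums to one,
    and has the same length as `observed`.
    """
    n = len(observed)
    # Index-reversed kernel: the adjoint of circular convolution with `kernel`.
    mirrored = [kernel[(-j) % n] for j in range(n)]
    # Start from a flat estimate carrying the same total signal.
    estimate = [sum(observed) / n] * n
    for _ in range(iterations):
        blurred = circ_conv(estimate, kernel)
        # Guard against division by zero where the blurred estimate vanishes.
        ratio = [o / max(b, eps) for o, b in zip(observed, blurred)]
        correction = circ_conv(ratio, mirrored)
        estimate = [e * c for e, c in zip(estimate, correction)]
    return estimate
```

The update keeps the estimate non-negative and, for a normalised kernel, preserves the total signal, which is one reason the method is often preferred for count-like data. Noise handling and stopping criteria are deliberately left out of this sketch.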
Abstract:
Aim: The imperfect detection of species may lead to erroneous conclusions about species-environment relationships. Accuracy in species detection usually requires temporal replication at sampling sites, a time-consuming and costly monitoring scheme. Here, we applied a lower-cost alternative based on a double-sampling approach to incorporate the reliability of species detection into regression-based species distribution modelling. Location: Doñana National Park (south-western Spain). Methods: Using species-specific monthly detection probabilities, we estimated the detection reliability as the probability of having detected the species given the species-specific survey time. Such reliability estimates were used to account explicitly for data uncertainty by weighting each absence. We illustrated how this novel framework can be used to evaluate four competing hypotheses as to what constitutes primary environmental control of amphibian distribution: breeding habitat, aestivating habitat, spatial distribution of surrounding habitats and/or major ecosystems zonation. The study was conducted on six pond-breeding amphibian species during a 4-year period. Results: Non-detections should not be considered equivalent to real absences, as their reliability varied considerably. The occurrence of Hyla meridionalis and Triturus pygmaeus was related to a particular major ecosystem of the study area, where suitable habitat for these species seemed to be widely available. Characteristics of the breeding habitat (area and hydroperiod) were of high importance for the occurrence of Pelobates cultripes and Pleurodeles waltl. Terrestrial characteristics were the most important predictors of the occurrence of Discoglossus galganoi and Lissotriton boscai, along with spatial distribution of breeding habitats for the last species. Main conclusions: We did not find a single best supported hypothesis valid for all species, which stresses the importance of multiscale and multifactor approaches. More importantly, this study shows that estimating the reliability of non-detection records, an exercise that had been previously seen as a naïve goal in species distribution modelling, is feasible and could be promoted in future studies, at least in comparable systems.
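The reliability estimate described in the Methods of this abstract can be illustrated with a minimal sketch. It assumes, which the abstract does not state, that surveys in different months are independent, so the probability of detecting a species at least once is one minus the product of the monthly non-detection probabilities. The function name and the example probabilities are hypothetical.

```python
from math import prod


def detection_reliability(monthly_detection_probs):
    """Probability of at least one detection over the surveyed months.

    Assumes independent surveys (an assumption of this sketch only).
    """
    return 1.0 - prod(1.0 - p for p in monthly_detection_probs)


# Hypothetical site surveyed in two months with detection probability 0.5 each;
# the result could serve as the weight given to a recorded absence at that site.
absence_weight = detection_reliability([0.5, 0.5])
```

Under this assumption an absence recorded after more, or better-timed, surveys receives a larger weight in the regression than one recorded after a single low-probability visit.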
Abstract:
MOTIVATION: High-throughput sequencing technologies enable the genome-wide analysis of the impact of genetic variation on molecular phenotypes at unprecedented resolution. However, although powerful, these technologies can also introduce unexpected artifacts. RESULTS: We investigated the impact of library amplification bias on the identification of allele-specific (AS) molecular events from high-throughput sequencing data derived from chromatin immunoprecipitation assays (ChIP-seq). Putative AS DNA binding activity for RNA polymerase II was determined using ChIP-seq data derived from lymphoblastoid cell lines of two parent-daughter trios. We found that, at high sequencing depth, many significant AS binding sites suffered from an amplification bias, as evidenced by a larger number of clonal reads representing one of the two alleles. To alleviate this bias, we devised an amplification bias detection strategy, which filters out sites with low read complexity and sites featuring a significant excess of clonal reads. This method will be useful for AS analyses involving ChIP-seq and other functional sequencing assays.
Abstract:
OBJECTIVE: To evaluate the power of various parameters of the vestibulo-ocular reflex (VOR) in detecting unilateral peripheral vestibular dysfunction and in characterizing certain inner ear pathologies. STUDY DESIGN: Prospective study of consecutive ambulatory patients presenting with acute onset of peripheral vertigo and spontaneous nystagmus. SETTING: Tertiary referral center. PATIENTS: Seventy-four patients (40 females, 34 males) and 22 normal subjects (11 females, 11 males) were included in the study. Patients were classified into three main diagnoses: vestibular neuritis: 40; viral labyrinthitis: 22; Meniere's disease: 12. METHODS: The VOR function was evaluated by standard caloric and impulse rotary tests (velocity step). A mathematical model of vestibular function was used to characterize the VOR response to rotational stimulation. The diagnostic value of the different VOR parameters was assessed by uni- and multivariable logistic regression. RESULTS: In univariable analysis, caloric asymmetry emerged as the most powerful VOR parameter in identifying unilateral vestibular deficit, with a boundary limit set at 20%. In multivariable analysis, the combination of caloric asymmetry and rotational time constant asymmetry significantly improved the discriminatory power over caloric asymmetry alone (p<0.0001) and produced a detection score with a correct classification of 92.4%. In discriminating labyrinthine diseases, different combinations of the VOR parameters were obtained for each diagnosis (p<0.003), supporting the view that the VOR characteristics differ between the three inner ear disorders. However, the clinical usefulness of these characteristics in separating the pathologies was limited. CONCLUSION: We propose a powerful logistic model combining the indices of caloric and time constant asymmetries to detect a peripheral vestibular loss, with an accuracy of 92.4%. Based on vestibular data only, the discrimination between the different inner ear diseases is statistically possible, which supports different pathophysiologic changes in labyrinthine pathologies.
Abstract:
BACKGROUND: The purpose of the optic nerve sheath diameter (ONSD) research group project is to establish an individual patient-level database from high quality studies of ONSD ultrasonography for the detection of raised intracranial pressure (ICP), and to perform a systematic review and an individual patient data meta-analysis (IPDMA), which will provide a cutoff value to help physicians make decisions and encourage further research. Previous meta-analyses were able to assess the diagnostic accuracy of ONSD ultrasonography in detecting raised ICP but failed to determine a precise cutoff value. Thus, the ONSD research group was founded to synthesize data from several recent studies on the subject and to provide evidence on the diagnostic accuracy of ONSD ultrasonography in detecting raised ICP. METHODS: This IPDMA will be conducted in different phases. First, we will systematically search for eligible studies. To be eligible, studies must have compared ONSD ultrasonography to invasive intracranial devices, the current reference standard for diagnosing raised ICP. Subsequently, we will assess the quality of the included studies based on the QUADAS-2 tool, and then collect and validate individual patient data. The objectives of the primary analyses will be to assess the diagnostic accuracy of ONSD ultrasonography and to determine a precise cutoff value for detecting raised ICP. Secondly, we will construct a logistic regression model to assess whether patient and study characteristics influence diagnostic accuracy. DISCUSSION: We believe that this IPDMA will provide the most reliable basis for the assessment of the diagnostic accuracy of ONSD ultrasonography for detecting raised ICP and will provide a cutoff value. We also hope that the creation of the ONSD research group will encourage further study. TRIAL REGISTRATION: PROSPERO registration number: CRD42012003072.
Abstract:
The objective of this study was to evaluate the contribution of ultrasound scanning to the prenatal detection of trisomy 21 in a large unselected European population. Data from 19 congenital malformation registers in 11 European countries were included. The prenatal ultrasound screening programs in the countries ranged from no routine screening to three ultrasound investigations per patient. Routine serum screening was offered in four of the 11 countries and routine screening by amniocentesis on the basis of maternal age in all. The results show that overall 53% of cases of trisomy 21 were detected prenatally, with a range from 3% in Lithuania to 88% in Paris. Ninety-eight percent of women whose babies were diagnosed before 24 weeks gestation chose to terminate the pregnancy. Centres/countries that offer serum screening do not have a significantly higher detection rate of trisomy 21 when compared to those that offer maternal age amniocentesis and anomaly scanning only. Fifty percent of trisomy 21 cases were born to women aged 35 years or more. In conclusion, second trimester ultrasound plays an important role in the prenatal diagnosis of trisomy 21. Of those cases prenatally diagnosed, 64% of cases in women <35 years and 36% of those in women ≥35 years were detected because of an ultrasound finding. Ultrasound soft markers accounted for 84% of the scan diagnoses. There is evidence of increasing maternal age across Europe, with 50% of cases of trisomy 21 born to women aged 35 years or more.
Abstract:
CONTEXT AND OBJECTIVES: A multicentric study was set up to assess the feasibility for Swiss cancer registries of actively retrieving 3 additional variables of epidemiological and aetiological relevance for melanoma, and of potential use for the evaluation of prevention campaigns. MATERIAL AND METHODS: The skin type, family history of melanoma and precise anatomical site were retrieved for melanoma cases registered in 5 Swiss cantons (Neuchâtel, St-Gall and Appenzell, Vaud and Wallis) over 3 to 6 consecutive years (1995-2002). Data were obtained via a short questionnaire administered by the physicians, mostly dermatologists, who originally excised the lesions. As the detailed body site was routinely collected in Ticino, data from this Cancer Registry were included in the body site analysis. Relative melanoma density (RMD) was computed as the ratio of observed to expected numbers of melanomas allowing for body site surface areas, and further adjusted for site-specific melanocyte density. RESULTS: Of the 1,645 questionnaires sent, 1,420 (86.3%) were returned. The detailed cutaneous site and skin type were reliably obtained for 84.7% and 78.7% of questionnaires, and family history was known in 76% of instances. The prevalences of sun-sensitive subjects and of patients with melanoma-affected first-degree relatives, two target groups for early detection and surveillance campaigns, were 54.1% and 3.4%, respectively. After translation into the 4th digit of the International Classification of Diseases for Oncology, the anatomical site codes from printed (original information) and pictorial support (body chart from the questionnaire) concurred for 94.6% of lesions. Discrepancies occurred mostly for lesions on the upper, outer part of the shoulder for which the clinician's textual description was "shoulder blade". This differential misclassification suggests an under-estimation of about 10% for melanomas of the upper limbs and an over-estimation of 5% for truncal melanomas. Sites of highest melanoma risk were the face, the shoulder and the upper arm for both sexes, the back for men and the leg for women. Three major features of this series were: (1) an unexpectedly high RMD for the face in women (6.2 vs 4.2 in men), (2) the absence of a male predominance for melanomas on the ears, and (3) for the upper limbs, a steady gradient of increasing melanoma density with increasing proximity to the trunk, regardless of sex. DISCUSSION AND CONCLUSION: The feasibility of retrieving the skin type, the precise anatomical location and family history of melanoma in a reliable manner was demonstrated thanks to the collaboration of Swiss dermatologists. Use of a schematic body drawing improves the quality of the anatomical site data and facilitates the reporting task of doctors. Age and sex patterns of RMD paralleled general indicators of sun exposure and behaviour, except for the hand (RMD=0.2). These Swiss results support some site or sun exposure specificity in the aetiology of melanoma.
Abstract:
Paul Ehrlich's inspired concept of 'magic bullets' for the cure of diseases has been revitalized by recent advances in immunology. In particular, the development of cell fusion technology allowing the production of monoclonal antibodies (Mabs) with exquisite specificities triggered new hopes that we may now have the perfect carrier molecules with which to deliver cytotoxic drugs or toxins to the hidden cancer cells. This article reviews data on one aspect of the magic bullet concept, the use of radiolabelled antibodies as tracers for tumour localization. It will also discuss the very recent clinical use of 131I-labelled Mabs against carcinoembryonic antigen (CEA) to detect carcinoma either by conventional external photoscanning or by single photon emission computerized tomography (SPECT). This alliance of the most modern tools from immunology (Mabs) and nuclear medicine (SPECT) appears promising as a way to improve the sensitivity of 'immunoscintigraphy'. However, this approach is not yet ready for widespread clinical use.
Abstract:
The research considers the problem of spatial data classification using machine learning algorithms: probabilistic neural networks (PNN) and support vector machines (SVM). A simple k-nearest neighbor algorithm is considered as a benchmark model. PNN is a neural network reformulation of well-known nonparametric principles of probability density modeling using a kernel density estimator and Bayesian optimal or maximum a posteriori decision rules. PNN is well suited to problems where not only predictions but also quantification of accuracy and integration of prior information are necessary. An important property of PNNs is that they can be easily used in decision support systems dealing with problems of automatic classification. The support vector machine is an implementation of the principles of statistical learning theory for classification tasks. SVMs have recently been applied successfully to different environmental topics: classification of soil types and hydro-geological units, optimization of monitoring networks, susceptibility mapping of natural hazards. In the present paper both simulated and real data case studies (low and high dimensional) are considered. Particular attention is paid to the detection and learning of spatial patterns by the algorithms applied.
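The PNN principle summarised in this abstract (a kernel density estimate per class combined with a maximum a posteriori decision rule) can be sketched as follows. This is an illustrative sketch, not the paper's implementation: it assumes an isotropic Gaussian kernel with a single bandwidth `sigma` shared by all classes, and the function name, parameters and example data are hypothetical.

```python
from math import exp


def pnn_posteriors(train_points, train_labels, query, sigma=1.0, priors=None):
    """Posterior class probabilities from per-class Gaussian kernel densities.

    With one shared `sigma`, the Gaussian normalising constant is identical
    for every class and cancels in the posterior, so it is omitted here.
    If `priors` is None, class frequencies in the training set are used.
    """
    scores = {}
    for label in sorted(set(train_labels)):
        members = [p for p, l in zip(train_points, train_labels) if l == label]
        # Parzen-window density estimate at the query point for this class.
        density = sum(
            exp(-sum((a - b) ** 2 for a, b in zip(p, query)) / (2 * sigma ** 2))
            for p in members
        ) / len(members)
        prior = priors[label] if priors is not None else len(members) / len(train_points)
        scores[label] = prior * density
    total = sum(scores.values())  # may underflow to zero far from all data
    return {label: s / total for label, s in scores.items()}


# Hypothetical two-class spatial example with (x, y) coordinates.
points = [(0.0, 0.0), (0.0, 1.0), (5.0, 5.0), (5.0, 6.0)]
labels = ["a", "a", "b", "b"]
posterior = pnn_posteriors(points, labels, query=(0.0, 0.5))
predicted = max(posterior, key=posterior.get)
```

Returning the full posterior rather than only the winning label reflects the point made in the abstract that a PNN quantifies the confidence of a prediction and can take prior information into account through `priors`.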
Abstract:
Aims: Perfusion-cardiac magnetic resonance (CMR) has emerged as a potential alternative to single-photon emission computed tomography (SPECT) to assess myocardial ischaemia non-invasively. The goal was to compare the diagnostic performance of perfusion-CMR and SPECT for the detection of coronary artery disease (CAD) using conventional X-ray coronary angiography (CXA) as the reference standard. Methods and results: In this multivendor trial, 533 patients, eligible for CXA or SPECT, were enrolled in 33 centres (USA and Europe) with 515 patients receiving MR contrast medium. Single-photon emission computed tomography and CXA were performed within 4 weeks before or after CMR in all patients. The prevalence of CAD in the sample was 49%. Drop-out rates for CMR and SPECT were 5.6 and 3.7%, respectively (P = 0.21). The primary endpoint was non-inferiority of CMR vs. SPECT for both sensitivity and specificity for the detection of CAD. Readers were blinded to clinical data, CXA, and imaging results. As a secondary endpoint, the safety profile of the CMR examination was evaluated. For CMR and SPECT, the sensitivity scores were 0.67 and 0.59, respectively, with the lower confidence level for the difference of +0.02, indicating superiority of CMR over SPECT. The specificity scores for CMR and SPECT were 0.61 and 0.72, respectively (lower confidence level for the difference: -0.17), indicating inferiority of CMR vs. SPECT. No severe adverse events occurred in the 515 patients. Conclusion: In this large multicentre, multivendor study, the sensitivity of perfusion-CMR to detect CAD was superior to SPECT, while its specificity was inferior to SPECT. Cardiac magnetic resonance is a safe alternative to SPECT to detect perfusion deficits in CAD.
Abstract:
With the trend in molecular epidemiology towards both genome-wide association studies and complex modelling, the need for large sample sizes to detect small effects and to allow for the estimation of many parameters within a model continues to increase. Unfortunately, most methods of association analysis have been restricted to either a family-based or a case-control design, resulting in the lack of synthesis of data from multiple studies. Transmission disequilibrium-type methods for detecting linkage disequilibrium from family data were developed as an effective way of preventing the detection of association due to population stratification. Because these methods condition on parental genotype, however, they have precluded the joint analysis of family and case-control data, although methods for case-control data may not protect against population stratification and do not allow for familial correlations. We present here an extension of a family-based association analysis method for continuous traits that will simultaneously test for, and if necessary control for, population stratification. We further extend this method to analyse binary traits (and therefore family and case-control data together) and accurately to estimate genetic effects in the population, even when using an ascertained family sample. Finally, we present the power of this binary extension for both family-only and joint family and case-control data, and demonstrate the accuracy of the association parameter and variance components in an ascertained family sample.
Abstract:
Whether for investigative or intelligence aims, crime analysts often face the need to analyse the spatiotemporal distribution of crimes or traces left by suspects. This article presents a visualisation methodology supporting recurrent practical analytical tasks such as the detection of crime series or the analysis of traces left by digital devices like mobile phones or GPS devices. The proposed approach has led to the development of a dedicated tool that has proven its effectiveness in real inquiries and intelligence practices. It supports a more fluent visual analysis of the collected data and may provide critical clues to support police operations, as exemplified by the presented case studies.