956 resultados para False positives
Resumo:
The Bloom filter is a space efficient randomized data structure for representing a set and supporting membership queries. Bloom filters intrinsically allow false positives. However, the space savings they offer outweigh the disadvantage if the false positive rates are kept sufficiently low. Inspired by the recent application of the Bloom filter in a novel multicast forwarding fabric, this paper proposes a variant of the Bloom filter, the optihash. The optihash introduces an optimization for the false positive rate at the stage of Bloom filter formation using the same amount of space at the cost of slightly more processing than the classic Bloom filter. Often Bloom filters are used in situations where a fixed amount of space is a primary constraint. We present the optihash as a good alternative to Bloom filters since the amount of space is the same and the improvements in false positives can justify the additional processing. Specifically, we show via simulations and numerical analysis that using the optihash the false positives occurrences can be reduced and controlled at a cost of small additional processing. The simulations are carried out for in-packet forwarding. In this framework, the Bloom filter is used as a compact link/route identifier and it is placed in the packet header to encode the route. At each node, the Bloom filter is queried for membership in order to make forwarding decisions. A false positive in the forwarding decision is translated into packets forwarded along an unintended outgoing link. By using the optihash, false positives can be reduced. The optimization processing is carried out in an entity termed the Topology Manger which is part of the control plane of the multicast forwarding fabric. This processing is only carried out on a per-session basis, not for every packet. The aim of this paper is to present the optihash and evaluate its false positive performances via simulations in order to measure the influence of different parameters on the false positive rate. The false positive rate for the optihash is then compared with the false positive probability of the classic Bloom filter.
Resumo:
Some unexpected promiscuous inhibitors were observed in a virtual screening protocol applied to select cruzain inhibitors from the ZINC database. Physical-chemical and pharmacophore model filters were used to reduce the database size. The selected compounds were docked into the cruzain active site. Six hit compounds were tested as inhibitors. Although the compounds were designed to be nucleophilically attacked by the catalytic cysteine of cruzain, three of them showed typical promiscuous behavior, revealing that false positives are a prevalent concern in VS programs. (C) 2007 Elsevier Ltd. All rights reserved.
Resumo:
Abstract Background A large number of probabilistic models used in sequence analysis assign non-zero probability values to most input sequences. To decide when a given probability is sufficient the most common way is bayesian binary classification, where the probability of the model characterizing the sequence family of interest is compared to that of an alternative probability model. We can use as alternative model a null model. This is the scoring technique used by sequence analysis tools such as HMMER, SAM and INFERNAL. The most prevalent null models are position-independent residue distributions that include: the uniform distribution, genomic distribution, family-specific distribution and the target sequence distribution. This paper presents a study to evaluate the impact of the choice of a null model in the final result of classifications. In particular, we are interested in minimizing the number of false predictions in a classification. This is a crucial issue to reduce costs of biological validation. Results For all the tests, the target null model presented the lowest number of false positives, when using random sequences as a test. The study was performed in DNA sequences using GC content as the measure of content bias, but the results should be valid also for protein sequences. To broaden the application of the results, the study was performed using randomly generated sequences. Previous studies were performed on aminoacid sequences, using only one probabilistic model (HMM) and on a specific benchmark, and lack more general conclusions about the performance of null models. Finally, a benchmark test with P. falciparum confirmed these results. Conclusions Of the evaluated models the best suited for classification are the uniform model and the target model. However, the use of the uniform model presents a GC bias that can cause more false positives for candidate sequences with extreme compositional bias, a characteristic not described in previous studies. In these cases the target model is more dependable for biological validation due to its higher specificity.
Resumo:
False-positive and false-negative values were calculated for five different designs of the trend test and it was demonstrated that a design suggested by Portier and Hoel in 1984 for a different problem produced the lowest false-positive and false-negative rates when applied to historical spontaneous tumor rate data for Fischer Rats. ^
Resumo:
This study investigated the role of contextual factors in personnel selection. Specifically, I explored if specific job factors such as the wage, training, available applicant pool and security concerns around a job, influenced personnel decisions. Additionally, I explored if the individual differences of decision makers played a role in how the previously mentioned job factors affected their decisions. A policy-capturing methodology was employed to determine the weight participants place on the job factors when selecting candidates for different jobs. Regression and correlational analyses were computed with the beta weights obtained from individual regression analyses. The results obtained from the two samples (student and general population) revealed that specific job characteristics did indeed influence personnel decisions. Participants were more concerned with making mistakes and thus less likely to accept candidates when selecting candidates for jobs having high salary and/or high training requirements.
Resumo:
Thesis (Ph.D.)--University of Washington, 2016-08
Resumo:
The unresolved issue of false-positive D-dimer results in the diagnostic workup of pulmonary embolism Pulmonary embolism (PE) remains a difficult diagnosis as it lacks specific symptoms and clinical signs. After the determination of the pretest PE probability by a validated clinical score, D-dimers (DD) is the initial blood test in the majority of patients whose probability is low or intermediate. The low specificity of DD results in a high number of false-positives that then require thoracic angio-CT. A new clinical decision rule, called the Pulmonary Embolism Rule-out criteria (PERC), identifies patients at such low risk that PE can be safely ruled-out without a DD test. Its safety has been confirmed in US emergency departments, but retrospective European studies showed that it would lead to 5-7% of undiagnosed PE. Alternative strategies are needed to reduce the proportion of false-positive DD results.
Resumo:
Recent studies have indicated that research practices in psychology may be susceptible to factors that increase false-positive rates, raising concerns about the possible prevalence of false-positive findings. The present article discusses several practices that may run counter to the inflation of false-positive rates. Taking these practices into account would lead to a more balanced view on the false-positive issue. Specifically, we argue that an inflation of false-positive rates would diminish, sometimes to a substantial degree, when researchers (a) have explicit a priori theoretical hypotheses, (b) include multiple replication studies in a single paper, and (c) collect additional data based on observed results. We report findings from simulation studies and statistical evidence that support these arguments. Being aware of these preventive factors allows researchers not to overestimate the pervasiveness of false-positives in psychology and to gauge the susceptibility of a paper to possible false-positives in practical and fair ways.
Resumo:
Chest radiography (CXR) is inferior to Thin-section computed tomography in the detection of asbestos related interstitial and pleural abnormalities. It remains unclear, however, whether these limitations are large enough to impair CXR´s ability in detecting the expected reduction in the frequency of these asbestos-related abnormalities (ARA) as exposure decreases. Clinical evaluation, CXR, Thin-section CT and spirometry were obtained in 1418 miners and millers who were exposed to progressively lower airborne concentrations of asbestos. They were separated into four groups according to the type, period and measurements of exposure and/or procedures for controlling exposure: Group I (1940-1966/tremolite and chrysotile, without measurements of exposure and procedures for controlling exposure); Group II (1967-1976/chrysotile only, without measurements of exposure and procedures for controlling exposure); Group III (1977-1980/chrysotile only, initiated measurements of exposure and procedures for controlling exposure) and Group IV (after 1981/chrysotile only, implemented measurements of exposure and a comprehensive procedures for controlling exposure). In all groups, CXR suggested more frequently interstitial abnormalities and less frequently pleural plaques than observed on Thin-section CT (p<0.050). The odds for asbestosis in groups of decreasing exposure diminished to greater extent at Thin-section CT than on CXR. Lung function was reduced in subjects who had pleural plaques evident only on Thin-section CT (p<0.050). In a longitudinal evaluation of 301 subjects without interstitial and pleural abnormalities on CXR and Thin-section CT in a previous evaluation, only Thin-section CT indicated that these ARA reduced as exposure decreased. CXR compared to Thin-section CT was associated with false-positives for interstitial abnormalities and false-negatives for pleural plaques, regardless of the intensity of asbestos exposure. Also, CXR led to a substantial misinformation of the effects of the progressively lower asbestos concentrations in the occurrence of asbestos-related diseases in miners and millers.
Resumo:
Introdução: Revisar os casos de doenças febris exantemáticas com IgM reagente contra o sarampo, no estado de São Paulo, Brasil, durante os cinco anos seguidos a interrupção da transmissão do vírus do sarampo. Métodos: Nós revisamos 463 casos de doenças febris exantemáticas com IgM reagente contra o sarampo, no estado de São Paulo, Brasil, de 2000 a 2004. Indivíduos vacinados contra o sarampo 56 dias antes da coleta de amostra foram considerados expostos à vacina. Soros da fase aguda e de convalescença foram testados para a evidência de infecção de sarampo, rubéola, parvovírus B19 e herpes vírus 6. Na ausência de soroconversão para imunoglobulina G contra o sarampo, casos com IgM reagente contra o sarampo foram considerados falsos positivos em pessoas com evidência de outras infecções virais. Resultados: Entre as 463 pessoas com doenças febris exantemáticas que testaram positivo para anticorpos IgM contra o sarampo durante o período, 297 (64 por cento) pessoas foram classificadas como expostas à vacina. Entre os 166 casos não expostos à vacina, 109 (66 por cento) foram considerados falsos positivos baseado na ausência de soroconversão, dos quais 21 (13 por cento) tiveram evidência de infecção por vírus da rubéola, 49 (30 por cento) parvovírus B19 e 28 (17 por cento) infecção por herpes vírus humano 6. Conclusões: Após a interrupção da transmissão do vírus do sarampo é necessária exaustiva investigação dos casos com IgM reagente contra o sarampo, especialmente dos casos não expostos à vacina. Testes laboratoriais para etiologias das doenças febris exantemáticas ajudam na interpretação destes casos
Resumo:
Aims. We report the discovery of very shallow (Delta F/F approximate to 3.4 x 10(-4)), periodic dips in the light curve of an active V = 11.7 G9V star observed by the CoRoT satellite, which we interpret as caused by a transiting companion. We describe the 3-colour CoRoT data and complementary ground-based observations that support the planetary nature of the companion. Methods. We used CoRoT colours information, good angular resolution ground-based photometric observations in- and out- of transit, adaptive optics imaging, near-infrared spectroscopy, and preliminary results from radial velocity measurements, to test the diluted eclipsing binary scenarios. The parameters of the host star were derived from optical spectra, which were then combined with the CoRoT light curve to derive parameters of the companion. Results. We examined all conceivable cases of false positives carefully, and all the tests support the planetary hypothesis. Blends with separation >0.40 '' or triple systems are almost excluded with a 8 x 10(-4) risk left. We conclude that, inasmuch we have been exhaustive, we have discovered a planetary companion, named CoRoT-7b, for which we derive a period of 0.853 59 +/- 3 x 10(-5) day and a radius of R(p) = 1.68 +/- 0.09 R(Earth). Analysis of preliminary radial velocity data yields an upper limit of 21 M(Earth) for the companion mass, supporting the finding. Conclusions. CoRoT-7b is very likely the first Super-Earth with a measured radius. This object illustrates what will probably become a common situation with missions such as Kepler, namely the need to establish the planetary origin of transits in the absence of a firm radial velocity detection and mass measurement. The composition of CoRoT-7b remains loosely constrained without a precise mass. A very high surface temperature on its irradiated face, approximate to 1800-2600 K at the substellar point, and a very low one, approximate to 50 K, on its dark face assuming no atmosphere, have been derived.
Resumo:
Background: The ideal malaria parasite populations for initial mapping of genomic regions contributing to phenotypes such as drug resistance and virulence, through genome-wide association studies, are those with high genetic diversity, allowing for numerous informative markers, and rare meiotic recombination, allowing for strong linkage disequilibrium (LD) between markers and phenotype-determining loci. However, levels of genetic diversity and LD in field populations of the major human malaria parasite P. vivax remain little characterized. Results: We examined single-nucleotide polymorphisms (SNPs) and LD patterns across a 100-kb chromosome segment of P. vivax in 238 field isolates from areas of low to moderate malaria endemicity in South America and Asia, where LD tends to be more extensive than in holoendemic populations, and in two monkey-adapted strains (Salvador-I, from El Salvador, and Belem, from Brazil). We found varying levels of SNP diversity and LD across populations, with the highest diversity and strongest LD in the area of lowest malaria transmission. We found several clusters of contiguous markers with rare meiotic recombination and characterized a relatively conserved haplotype structure among populations, suggesting the existence of recombination hotspots in the genome region analyzed. Both silent and nonsynonymous SNPs revealed substantial between-population differentiation, which accounted for similar to 40% of the overall genetic diversity observed. Although parasites clustered according to their continental origin, we found evidence for substructure within the Brazilian population of P. vivax. We also explored between-population differentiation patterns revealed by loci putatively affected by natural selection and found marked geographic variation in frequencies of nucleotide substitutions at the pvmdr-1 locus, putatively associated with drug resistance. Conclusion: These findings support the feasibility of genome-wide association studies in carefully selected populations of P. vivax, using relatively low densities of markers, but underscore the risk of false positives caused by population structure at both local and regional levels.
Resumo:
Background: There are several studies in the literature depicting measurement error in gene expression data and also, several others about regulatory network models. However, only a little fraction describes a combination of measurement error in mathematical regulatory networks and shows how to identify these networks under different rates of noise. Results: This article investigates the effects of measurement error on the estimation of the parameters in regulatory networks. Simulation studies indicate that, in both time series (dependent) and non-time series (independent) data, the measurement error strongly affects the estimated parameters of the regulatory network models, biasing them as predicted by the theory. Moreover, when testing the parameters of the regulatory network models, p-values computed by ignoring the measurement error are not reliable, since the rate of false positives are not controlled under the null hypothesis. In order to overcome these problems, we present an improved version of the Ordinary Least Square estimator in independent (regression models) and dependent (autoregressive models) data when the variables are subject to noises. Moreover, measurement error estimation procedures for microarrays are also described. Simulation results also show that both corrected methods perform better than the standard ones (i.e., ignoring measurement error). The proposed methodologies are illustrated using microarray data from lung cancer patients and mouse liver time series data. Conclusions: Measurement error dangerously affects the identification of regulatory network models, thus, they must be reduced or taken into account in order to avoid erroneous conclusions. This could be one of the reasons for high biological false positive rates identified in actual regulatory network models.
Resumo:
The mRNA differential display technique was used to compare mRNAs between normal mammary gland and turner-derived epithelial cells from female Sprague-Dawley rat mammary gland tumors induced by the heterocyclic amine 2-amino-1-methyl-6-phenylimidazo[4,5-b]pyridine (PhIP) and promoted by a high-fat diet (23.5% corn oil). Two genes, beta-casein and transferrin, were identified as differentially expressed. The expression of these genes was examined across a bank of rat mammary gland tumors derived from animals on a low-fat diet (5% corn oil) or the high-fat diet. Carcinomas had over a 10- and 50-fold lower expression of beta-casein and transferrin, respectively than normal mammary gland. In addition, carcinomas from animals on the high-fat diet showed on average a 5-fold higher expression of beta-casein, and transferrin than carcinomas from animals on the low-fat diet. The results indicate the process of mammary gland tumorigenesis alters the expression of certain genes in the mammary gland, and that the level of dietary fat further modulates the expression of these genes.
Resumo:
Although the utility of the acetylcholinesterase (AChE) histochemistry on rectal suction biopsy in diagnosing Hirschsprung`s disease (HD) has been documented, few reports address a great number of biopsies and patients. Our aim is to present a 17-year experience on the method of rectal suction biopsy and AChE histochemical staining for diagnosis of intestinal dysganglionoses. Between August 1989 and July 2006, 297 children suspected of having HD were submitted to rectal suction biopsies that were evaluated by the same two surgeons. There were 18 complications (6.0%), namely one self-limited rectal bleeding and 17 (5.7%) inadequate procedures that were repeated. A total of 157 patients (52.8%) showed no increased AChE activity and the remaining patients (140-47.2.0%) presented patterns of increased AChE activity confirming the diagnosis of HD or neuronal intestinal dysplasia. Among the 140 cases suspected as having HD, in 131 children the diagnosis of HD was confirmed and they were operated on. The histological studies showed that 111 children presented the classic form of HD or a long spastic segment. Sixteen children presented total colonic aganglionosis and four children proved to have intestinal neuronal dysplasia, according to histological and radiological criteria. Nine (6.6%) newborns were identified as false-positives and no false-negative results were verified. The rectal suction biopsy combined with AChE staining is advantageous for the differentiation between normal bowel and intestinal dysganglionoses. The rectal suction method is simple and can easily be performed by experienced surgeons. The histological evaluation is very objective and can be performed by a non-pathologist.