53 resultados para Fault prediction


Relevância:

20.00% 20.00%

Publicador:

Resumo:

The completion of the sequencing of the mouse genome promises to help predict human genes with greater accuracy. While current ab initio gene prediction programs are remarkably sensitive (i.e., they predict at least a fragment of most genes), their specificity is often low, predicting a large number of false-positive genes in the human genome. Sequence conservation at the protein level with the mouse genome can help eliminate some of those false positives. Here we describe SGP2, a gene prediction program that combines ab initio gene prediction with TBLASTX searches between two genome sequences to provide both sensitive and specific gene predictions. The accuracy of SGP2 when used to predict genes by comparing the human and mouse genomes is assessed on a number of data sets, including single-gene data sets, the highly curated human chromosome 22 predictions, and entire genome predictions from ENSEMBL. Results indicate that SGP2 outperforms purely ab initio gene prediction methods. Results also indicate that SGP2 works about as well with 3x shotgun data as it does with fully assembled genomes. SGP2 provides a high enough specificity that its predictions can be experimentally verified at a reasonable cost. SGP2 was used to generate a complete set of gene predictions on both the human and mouse by comparing the genomes of these two species. Our results suggest that another few thousand human and mouse genes currently not in ENSEMBL are worth verifying experimentally.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Background: Recent advances on high-throughput technologies have produced a vast amount of protein sequences, while the number of high-resolution structures has seen a limited increase. This has impelled the production of many strategies to built protein structures from its sequence, generating a considerable amount of alternative models. The selection of the closest model to the native conformation has thus become crucial for structure prediction. Several methods have been developed to score protein models by energies, knowledge-based potentials and combination of both.Results: Here, we present and demonstrate a theory to split the knowledge-based potentials in scoring terms biologically meaningful and to combine them in new scores to predict near-native structures. Our strategy allows circumventing the problem of defining the reference state. In this approach we give the proof for a simple and linear application that can be further improved by optimizing the combination of Zscores. Using the simplest composite score () we obtained predictions similar to state-of-the-art methods. Besides, our approach has the advantage of identifying the most relevant terms involved in the stability of the protein structure. Finally, we also use the composite Zscores to assess the conformation of models and to detect local errors.Conclusion: We have introduced a method to split knowledge-based potentials and to solve the problem of defining a reference state. The new scores have detected near-native structures as accurately as state-of-art methods and have been successful to identify wrongly modeled regions of many near-native conformations.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Background: A number of studies have used protein interaction data alone for protein function prediction. Here, we introduce a computational approach for annotation of enzymes, based on the observation that similar protein sequences are more likely to perform the same function if they share similar interacting partners. Results: The method has been tested against the PSI-BLAST program using a set of 3,890 protein sequences from which interaction data was available. For protein sequences that align with at least 40% sequence identity to a known enzyme, the specificity of our method in predicting the first three EC digits increased from 80% to 90% at 80% coverage when compared to PSI-BLAST. Conclusion: Our method can also be used in proteins for which homologous sequences with known interacting partners can be detected. Thus, our method could increase 10% the specificity of genome-wide enzyme predictions based on sequence matching by PSI-BLAST alone.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

We present simple procedures for the prediction of a real valued sequence. The algorithms are based on a combinationof several simple predictors. We show that if the sequence is a realization of a bounded stationary and ergodic random process then the average of squared errors converges, almost surely, to that of the optimum, given by the Bayes predictor. We offer an analog result for the prediction of stationary gaussian processes.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Sequential randomized prediction of an arbitrary binary sequence isinvestigated. No assumption is made on the mechanism of generating the bit sequence. The goal of the predictor is to minimize its relative loss, i.e., to make (almost) as few mistakes as the best ``expert'' in a fixed, possibly infinite, set of experts. We point out a surprising connection between this prediction problem and empirical process theory. First, in the special case of static (memoryless) experts, we completely characterize the minimax relative loss in terms of the maximum of an associated Rademacher process. Then we show general upper and lower bounds on the minimaxrelative loss in terms of the geometry of the class of experts. As main examples, we determine the exact order of magnitude of the minimax relative loss for the class of autoregressive linear predictors and for the class of Markov experts.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Until recently farm management made little use of accounting and agriculture has been largely excluded from the scope of accounting standards. This article examines the current use of accounting in agriculture and points theneed to establish accounting standards for agriculture. Empirical evidence shows that accounting can make a significant contribution to agricultural management and farm viability and could also be important for other agents involved in agricultural decision making. Existing literature on failureprediction models and farm viability prediction studies provide the starting point for our research, in which two dichotomous logit models were applied to subsamples of viable and unviable farms in Catalonia, Spain. The firstmodel considered only non-financial variables, while the other also considered financial ones. When accounting variables were added to the model, a significant reduction in deviance was observed.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

We present a simple randomized procedure for the prediction of a binary sequence. The algorithm uses ideas from recent developments of the theory of the prediction of individual sequences. We show that if thesequence is a realization of a stationary and ergodic random process then the average number of mistakes converges, almost surely, to that of the optimum, given by the Bayes predictor.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The aim of this work was the use of NIR technology by direct application of a fiber optic probe on back fat to analyze the fatty acid composition of CLA fed boars and gilts. 265 animals were fed 3 different diets and the fatty acid profile of back fat from Gluteus medius was analyzed using gas chromatography and FT-NIR. Spectra were acquired using a Bruker Optics Matrix-F duplex spectrometer equipped with a fiber optic probe (IN-268-2). Oleic and stearic fatty acids were predicted accurately; myristic, vaccenic and linoleic fatty acids were predicted with lower accuracy, while palmitic and α-linolenic fatty acids were poorly predicted. The relative percentage of fatty acids and NIR spectra showed differences in fatty acid composition of back fat from pigs fed CLA which increased the relative percentage of SFA and PUFA while MUFA decreased. Results suggest that a NIR fiber optic probe can be used to predict total saturated and unsaturated fatty acid composition, as well as the percentage of stearic and oleic. NIR showed potential as a rapid and easily implemented method to discriminate carcasses from animals fed different diets.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The control and prediction of wastewater treatment plants poses an important goal: to avoid breaking the environmental balance by always keeping the system in stable operating conditions. It is known that qualitative information — coming from microscopic examinations and subjective remarks — has a deep influence on the activated sludge process. In particular, on the total amount of effluent suspended solids, one of the measures of overall plant performance. The search for an input–output model of this variable and the prediction of sudden increases (bulking episodes) is thus a central concern to ensure the fulfillment of current discharge limitations. Unfortunately, the strong interrelationbetween variables, their heterogeneity and the very high amount of missing information makes the use of traditional techniques difficult, or even impossible. Through the combined use of several methods — rough set theory and artificial neural networks, mainly — reasonable prediction models are found, which also serve to show the different importance of variables and provide insight into the process dynamics

Relevância:

20.00% 20.00%

Publicador:

Resumo:

[spa] La mayoría de siniestros con daños corporales se liquidan mediante negociación, llegando a juicio menos del 5% de los casos. Una estrategia de negociación bien definida es, por tanto, fundamental para las compañías aseguradoras. En este artículo asumimos que la compensación monetaria concedida en juicio es la máxima cuantía que debería ser ofrecida por el asegurador en el proceso de negociación. Usando una base de datos real, implementamos un modelo log-lineal para estimar la máxima oferta de negociación. Perturbaciones no-esféricas son detectadas. Correlación ocurre cuando más de una siniestro se liquida en la misma sentencia judicial. Heterocedasticidad por grupos se debe a la influencia de la valoración del forense en la indemnización final.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

[spa] La mayoría de siniestros con daños corporales se liquidan mediante negociación, llegando a juicio menos del 5% de los casos. Una estrategia de negociación bien definida es, por tanto, fundamental para las compañías aseguradoras. En este artículo asumimos que la compensación monetaria concedida en juicio es la máxima cuantía que debería ser ofrecida por el asegurador en el proceso de negociación. Usando una base de datos real, implementamos un modelo log-lineal para estimar la máxima oferta de negociación. Perturbaciones no-esféricas son detectadas. Correlación ocurre cuando más de una siniestro se liquida en la misma sentencia judicial. Heterocedasticidad por grupos se debe a la influencia de la valoración del forense en la indemnización final.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Substantial collective flow is observed in collisions between lead nuclei at Large Hadron Collider (LHC) as evidenced by the azimuthal correlations in the transverse momentum distributions of the produced particles. Our calculations indicate that the global v1-flow, which at RHIC peaked at negative rapidities (named third flow component or antiflow), now at LHC is going to turn toward forward rapidities (to the same side and direction as the projectile residue). Potentially this can provide a sensitive barometer to estimate the pressure and transport properties of the quark-gluon plasma. Our calculations also take into account the initial state center-of-mass rapidity fluctuations, and demonstrate that these are crucial for v1 simulations. In order to better study the transverse momentum flow dependence we suggest a new "symmetrized" v1S(pt) function, and we also propose a new method to disentangle global v1 flow from the contribution generated by the random fluctuations in the initial state. This will enhance the possibilities of studying the collective Global v1 flow both at the STAR Beam Energy Scan program and at LHC.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Evaluating other individuals with respect to personality characteristics plays a crucial role in human relations and it is the focus of attention for research in diverse fields such as psychology and interactive computer systems. In psychology, face perception has been recognized as a key component of this evaluation system. Multiple studies suggest that observers use face information to infer personality characteristics. Interactive computer systems are trying to take advantage of these findings and apply them to increase the natural aspect of interaction and to improve the performance of interactive computer systems. Here, we experimentally test whether the automatic prediction of facial trait judgments (e.g. dominance) can be made by using the full appearance information of the face and whether a reduced representation of its structure is sufficient. We evaluate two separate approaches: a holistic representation model using the facial appearance information and a structural model constructed from the relations among facial salient points. State of the art machine learning methods are applied to a) derive a facial trait judgment model from training data and b) predict a facial trait value for any face. Furthermore, we address the issue of whether there are specific structural relations among facial points that predict perception of facial traits. Experimental results over a set of labeled data (9 different trait evaluations) and classification rules (4 rules) suggest that a) prediction of perception of facial traits is learnable by both holistic and structural approaches; b) the most reliable prediction of facial trait judgments is obtained by certain type of holistic descriptions of the face appearance; and c) for some traits such as attractiveness and extroversion, there are relationships between specific structural features and social perceptions.