975 results for Selection Algorithms
Abstract:
In the context of cancer diagnosis and treatment, we consider the problem of constructing an accurate prediction rule on the basis of a relatively small number of tumor tissue samples of known type containing the expression data on very many (possibly thousands of) genes. Recently, results have been presented in the literature suggesting that it is possible to construct a prediction rule from only a few genes such that it has a negligible prediction error rate. However, in these results the test error or the leave-one-out cross-validated error is calculated without allowance for the selection bias. There is no allowance because the rule is either tested on tissue samples that were used in the first instance to select the genes being used in the rule or because the cross-validation of the rule is not external to the selection process; that is, gene selection is not performed in training the rule at each stage of the cross-validation process. We describe how in practice the selection bias can be assessed and corrected for by either performing a cross-validation or applying the bootstrap external to the selection process. We recommend using 10-fold rather than leave-one-out cross-validation, and concerning the bootstrap, we suggest using the so-called .632+ bootstrap error estimate designed to handle overfitted prediction rules. Using two published data sets, we demonstrate that when correction is made for the selection bias, the cross-validated error is no longer zero for a subset of only a few genes.
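The distinction between internal and external cross-validation can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration rather than the authors' procedure: the function names, the simple mean-difference gene score, and the nearest-centroid rule are all hypothetical stand-ins. With `external=True`, genes are re-selected inside every training fold; with `external=False`, they are selected once from all samples, which is the biased practice the abstract warns against.

```python
import numpy as np

def select_genes(X, y, k):
    """Return indices of the k genes with the largest scaled class-mean difference.

    Hypothetical scoring rule, chosen only to keep the sketch short.
    """
    m0 = X[y == 0].mean(axis=0)
    m1 = X[y == 1].mean(axis=0)
    score = np.abs(m0 - m1) / (X.std(axis=0) + 1e-12)
    return np.argsort(score)[-k:]

def nearest_centroid_predict(X_train, y_train, X_test):
    """Assign each test sample to the class whose training centroid is closer."""
    c0 = X_train[y_train == 0].mean(axis=0)
    c1 = X_train[y_train == 1].mean(axis=0)
    d0 = ((X_test - c0) ** 2).sum(axis=1)
    d1 = ((X_test - c1) ** 2).sum(axis=1)
    return (d1 < d0).astype(int)

def cv_error(X, y, k_genes, n_folds=10, external=True, seed=0):
    """Cross-validated error rate for a two-class problem (labels 0 and 1).

    external=True  -> gene selection is repeated on each training fold.
    external=False -> genes are chosen once using all samples (selection bias).
    """
    rng = np.random.default_rng(seed)
    idx = rng.permutation(len(y))
    folds = np.array_split(idx, n_folds)
    genes_all = select_genes(X, y, k_genes)  # only used when external=False
    n_errors = 0
    for test in folds:
        train = np.setdiff1d(idx, test)
        if external:
            genes = select_genes(X[train], y[train], k_genes)
        else:
            genes = genes_all
        pred = nearest_centroid_predict(
            X[train][:, genes], y[train], X[test][:, genes]
        )
        n_errors += int((pred != y[test]).sum())
    return n_errors / len(y)

# Pure-noise data: no gene carries any information about the class label.
rng = np.random.default_rng(1)
X_noise = rng.normal(size=(40, 2000))
y_noise = np.repeat([0, 1], 20)
print("internal CV error:", cv_error(X_noise, y_noise, 10, external=False))
print("external CV error:", cv_error(X_noise, y_noise, 10, external=True))
```

On data with no real signal, the internal estimate tends to look optimistic while the external estimate stays near chance, which is the selection bias in miniature.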
Abstract:
The reconstruction of a complex scene from multiple images is a fundamental problem in the field of computer vision. Volumetric methods have proven to be a strong alternative to traditional correspondence-based methods due to their flexible visibility models. In this paper we analyse existing methods for volumetric reconstruction and identify three key properties of voxel colouring algorithms: a water-tight surface model, a monotonic carving order, and causality. We present a new Voxel Colouring algorithm which embeds all reconstructions of a scene into a single output. While modelling exact visibility for arbitrary camera locations, Embedded Voxel Colouring removes the need for the a priori threshold selection present in previous work. An efficient implementation is given along with results demonstrating the advantages of a posteriori threshold selection.
Abstract:
There are many techniques for electricity market price forecasting. However, most of them are designed for expected price analysis rather than price spike forecasting. An effective method of predicting the occurrence of spikes has not yet been reported in the literature. In this paper, a data mining based approach is presented to give a reliable forecast of the occurrence of price spikes. Combined with the spike value prediction techniques developed by the same authors, the proposed approach aims at providing a comprehensive tool for price spike forecasting. Feature selection techniques are first described to identify the attributes relevant to the occurrence of spikes. A simple introduction to the classification techniques is given for completeness. Two algorithms, a support vector machine and a probability classifier, are chosen as the spike occurrence predictors and are discussed in detail. Realistic market data are used to test the proposed model, with promising results.
Abstract:
The BR algorithm is a novel and efficient method to find all eigenvalues of upper Hessenberg matrices and has never been applied to eigenanalysis for power system small signal stability. This paper analyzes differences between the BR and the QR algorithms, with a performance comparison in terms of CPU time based on stopping criteria and storage requirement. The BR algorithm utilizes accelerating strategies to improve its performance when computing eigenvalues of narrowly banded, nearly tridiagonal upper Hessenberg matrices. These strategies significantly reduce the computation time at a reasonable level of precision. Compared with the QR algorithm, the BR algorithm requires fewer iteration steps and less storage space without sacrificing appropriate precision in solving eigenvalue problems of large-scale power systems. Numerical examples demonstrate the efficiency of the BR algorithm in eigenanalysis tasks for 39-, 68-, 115-, 300-, and 600-bus systems. Experimental results suggest that the BR algorithm is a more efficient algorithm for large-scale power system small signal stability eigenanalysis.
Abstract:
Algorithms for explicit integration of structural dynamics problems with multiple time steps (subcycling) are investigated. Only one such algorithm, due to Smolinski and Sleith, has proved to be stable in a classical sense. A simplified version of this algorithm that retains its stability is presented. However, as with the original version, it can be shown to sacrifice accuracy to achieve stability. Another algorithm in use is shown to be only statistically stable, in that a probability of stability can be assigned if appropriate time step limits are observed. This probability improves rapidly with the number of degrees of freedom in a finite element model. The stability problems are shown to be a property of the central difference method itself, which is modified to give the subcycling algorithm. A related problem is shown to arise when a constraint equation in time is introduced into a time-continuous space-time finite element model. (C) 1998 Elsevier Science S.A.
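As background to the stability discussion, the following minimal sketch shows the plain (single time step) central difference method applied to an undamped single-degree-of-freedom oscillator u'' + ω²u = 0. It does not implement subcycling or any algorithm from the paper; the function name and parameters are hypothetical. It only illustrates the classical conditional stability of the scheme, which is stable for Δt ≤ 2/ω and grows without bound beyond that limit.

```python
def central_difference(omega, dt, n_steps, u0=1.0, v0=0.0):
    """Integrate u'' + omega**2 * u = 0 with the explicit central difference scheme.

    Returns the list of displacements [u_0, u_1, ..., u_n_steps].
    """
    # Fictitious displacement at t = -dt, from a Taylor expansion about t = 0.
    a0 = -(omega ** 2) * u0
    u_prev = u0 - dt * v0 + 0.5 * dt ** 2 * a0
    u = u0
    history = [u0]
    for _ in range(n_steps):
        # (u_next - 2u + u_prev) / dt**2 = -omega**2 * u
        u_next = 2.0 * u - u_prev - (omega * dt) ** 2 * u
        u_prev, u = u, u_next
        history.append(u)
    return history

# Below the limit dt = 2/omega the response stays bounded; above it, it blows up.
stable = central_difference(omega=1.0, dt=1.0, n_steps=50)
unstable = central_difference(omega=1.0, dt=2.1, n_steps=50)
print("max |u|, dt = 1.0:", max(abs(u) for u in stable))
print("max |u|, dt = 2.1:", max(abs(u) for u in unstable))
```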
Abstract:
A novel screening strategy has been developed for the identification of alpha-chymotrypsin inhibitors from a phage peptide library. In this strategy, the standard affinity selection protocol was modified by adding a proteolytic cleavage period to avoid recovery of alpha-chymotrypsin substrates. After four cycles of selection and a further activity assay, a group of related peptides was identified by DNA sequencing. These peptides share the consensus sequence motif (S/T)RVPR(R/H). A corresponding short peptide (Ac-ASRVPRRG-NH2) was then synthesized chemically and proved to be an inhibitor of alpha-chymotrypsin. The present work provides a useful way of searching for proteinase inhibitors without detailed knowledge of the molecular structure.
Abstract:
Aspergillus foetidus ACR I 3996 (=FRR 3558) and three strains of Aspergillus niger, ACM 4992 (=ATCC 9142), ACM 4993 (=ATCC 10577) and ACM 4994 (=ATCC 12846), were compared for the production of citric acid from pineapple peel in solid-state fermentation. A. niger ACM 4992 produced the highest amount of citric acid, with a yield of 19.4 g of citric acid per 100 g of dry fermented pineapple waste under optimum conditions, representing a yield of 0.74 g citric acid/g sugar consumed. Optimal conditions were 65% (w/w) initial moisture content, 3% (v/w) methanol, 30 °C, an unadjusted initial pH of 3.4, a particle size of 2 mm and 5 ppm Fe2+. Citric acid production was best in flasks, with lower yields being obtained in tray and rotating drum bioreactors.
Abstract:
A 12 week kayak training programme was evaluated in children who either had or did not have the anthropometric characteristics identified as being unique to senior elite sprint kayakers. Altogether, 234 male and female school children were screened to select 10 children with and 10 children without the identified key anthropometric characteristics. Before and after training, the children completed an all-out 2 min kayak ergometer simulation test; measures of oxygen consumption, plasma lactate and total work accomplished were recorded. In addition, a 500 m time trial was performed at weeks 3 and 12. The coaches were unaware which 20 children possessed those anthropometric characteristics deemed to favour development of kayak ability. All children improved in both the 2 min ergometer simulation test and 500 m time trial. However, boys who were selected according to favourable anthropometric characteristics showed greater improvement than those without such characteristics in the 2 min ergometer test only. In summary, in a small group of children selected according to anthropometric data unique to elite adult kayakers, 12 weeks of intensive kayak training did not influence the rate of improvement of on-water sprint kayak performance.
Abstract:
Extended gcd calculation has a long history and plays an important role in computational number theory and linear algebra. Recent results have shown that finding optimal multipliers in extended gcd calculations is difficult. We present an algorithm which uses lattice basis reduction to produce small integer multipliers x(1), ..., x(m) for the equation s = gcd(s(1), ..., s(m)) = x(1)s(1) + ... + x(m)s(m), where s(1), ..., s(m) are given integers. The method generalises to produce small unimodular transformation matrices for computing the Hermite normal form of an integer matrix.
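To make the equation concrete, the sketch below computes some valid set of multipliers by applying the classical two-argument extended Euclidean algorithm repeatedly. This is not the lattice basis reduction method of the paper, and the multipliers it returns are generally not small; the function names are hypothetical. It only demonstrates what it means for x(1), ..., x(m) to satisfy s = x(1)s(1) + ... + x(m)s(m).

```python
def ext_gcd(a, b):
    """Return (g, x, y) with g = gcd(a, b) >= 0 and a*x + b*y == g."""
    old_r, r = a, b
    old_x, x = 1, 0
    old_y, y = 0, 1
    while r != 0:
        q = old_r // r
        old_r, r = r, old_r - q * r
        old_x, x = x, old_x - q * x
        old_y, y = y, old_y - q * y
    if old_r < 0:
        # Normalise so that the gcd is non-negative.
        old_r, old_x, old_y = -old_r, -old_x, -old_y
    return old_r, old_x, old_y

def ext_gcd_many(values):
    """Return (g, xs) with g = gcd(values) and sum(x*s for x, s in zip(xs, values)) == g."""
    g, coeffs = 0, []
    for s in values:
        # g_new = u*g + v*s, and g is already a combination of earlier values.
        g_new, u, v = ext_gcd(g, s)
        coeffs = [u * c for c in coeffs] + [v]
        g = g_new
    return g, coeffs

values = [30, 42, 70, 105]
g, xs = ext_gcd_many(values)
print("gcd:", g)
print("multipliers:", xs)
print("check:", sum(x * s for x, s in zip(xs, values)) == g)
```

Because the multipliers from this naive chaining can grow quickly with m, it gives a baseline against which the benefit of a reduction-based method can be judged.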
Abstract:
We examined the effect of age-specific fecundity, mated status, and egg load on host-plant selection by Helicoverpa armigera under laboratory conditions. The physiological state of a female moth (number of mature eggs produced) greatly influences her host-plant specificity and propensity to oviposit (oviposition motivation). Female moths were less discriminating against cowpea (a low-ranked host) relative to maize (a high-ranked host) as egg load increased. Similarly, increased egg load led to a greater propensity to oviposit on both cowpea and maize. Distribution of oviposition with age of mated females peaked shortly after mating and declined steadily thereafter until death. Most mated females (88%) carried only a single spermatophore; a few females (12%) contained two. The significance of these findings in relation to host-plant selection by H. armigera, and its management, is discussed.
Abstract:
We describe a strategy for the selection and amplification of foreign gene expression in Chinese hamster ovary (CHO) cells employing a metallothionein gene-containing expression vector. This report describes an amplification procedure that results in an enrichment of clones exhibiting high levels of recombinant protein production and reduces the labour required for screening recombinant cell lines.
Abstract:
Life histories are generally assumed to evolve via antagonistic pleiotropy (negative genetic correlations) among traits, and trade-offs between life-history traits are typically studied using either phenotypic manipulations or selection experiments. We investigated the trade-off between egg size and fecundity in Drosophila melanogaster by examining both the phenotypic and genetic relationships between these traits after artificial selection for large and small eggs, relative to female body size. Egg size responded strongly to selection in both directions, increasing in the large-egg selected lines and decreasing in the small-egg selected lines. Phenotypic correlations between egg size and fecundity in the large-egg selected lines were negative, but no relationship between these traits occurred in either the control or small-egg selected lines. There was no negative genetic correlation between egg size and fecundity. Total reproductive allocation decreased in the small-egg selected lines but did not increase in the large-egg lines. Our results have three implications. First, our selection procedure may have forced females selected for large eggs into a physiological trade-off not reflected in a negative genetic correlation between these traits. Second, the lack of a negative genetic correlation between egg size and number suggests that the phenotypic trade-off frequently observed between egg size and number in other organisms may not evolve over the short term via a direct genetic trade-off whereby increases in egg size are automatically accompanied by decreased fecundity. Finally, total reproductive allocation may not evolve independently of egg size as commonly assumed.