946 resultados para universal prediction


Relevância:

70.00% 70.00%

Publicador:

Resumo:

Sequential randomized prediction of an arbitrary binary sequence isinvestigated. No assumption is made on the mechanism of generating the bit sequence. The goal of the predictor is to minimize its relative loss, i.e., to make (almost) as few mistakes as the best ``expert'' in a fixed, possibly infinite, set of experts. We point out a surprising connection between this prediction problem and empirical process theory. First, in the special case of static (memoryless) experts, we completely characterize the minimax relative loss in terms of the maximum of an associated Rademacher process. Then we show general upper and lower bounds on the minimaxrelative loss in terms of the geometry of the class of experts. As main examples, we determine the exact order of magnitude of the minimax relative loss for the class of autoregressive linear predictors and for the class of Markov experts.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

We investigate on-line prediction of individual sequences. Given a class of predictors, the goal is to predict as well as the best predictor in the class, where the loss is measured by the self information (logarithmic) loss function. The excess loss (regret) is closely related to the redundancy of the associated lossless universal code. Using Shtarkov's theorem and tools from empirical process theory, we prove a general upper bound on the best possible (minimax) regret. The bound depends on certain metric properties of the class of predictors. We apply the bound to both parametric and nonparametric classes ofpredictors. Finally, we point out a suboptimal behavior of the popular Bayesian weighted average algorithm.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

In Natural Language Processing (NLP) symbolic systems, several linguistic phenomena, for instance, the thematic role relationships between sentence constituents, such as AGENT, PATIENT, and LOCATION, can be accounted for by the employment of a rule-based grammar. Another approach to NLP concerns the use of the connectionist model, which has the benefits of learning, generalization and fault tolerance, among others. A third option merges the two previous approaches into a hybrid one: a symbolic thematic theory is used to supply the connectionist network with initial knowledge. Inspired on neuroscience, it is proposed a symbolic-connectionist hybrid system called BIO theta PRED (BIOlogically plausible thematic (theta) symbolic-connectionist PREDictor), designed to reveal the thematic grid assigned to a sentence. Its connectionist architecture comprises, as input, a featural representation of the words (based on the verb/noun WordNet classification and on the classical semantic microfeature representation), and, as output, the thematic grid assigned to the sentence. BIO theta PRED is designed to ""predict"" thematic (semantic) roles assigned to words in a sentence context, employing biologically inspired training algorithm and architecture, and adopting a psycholinguistic view of thematic theory.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Geostatistics has been successfully used to analyze and characterize the spatial variability of environmental properties. Besides giving estimated values at unsampled locations, it provides a measure of the accuracy of the estimate, which is a significant advantage over traditional methods used to assess pollution. In this work universal block kriging is novelty used to model and map the spatial distribution of salinity measurements gathered by an Autonomous Underwater Vehicle in a sea outfall monitoring campaign, with the aim of distinguishing the effluent plume from the receiving waters, characterizing its spatial variability in the vicinity of the discharge and estimating dilution. The results demonstrate that geostatistical methodology can provide good estimates of the dispersion of effluents that are very valuable in assessing the environmental impact and managing sea outfalls. Moreover, since accurate measurements of the plume’s dilution are rare, these studies might be very helpful in the future to validate dispersion models.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Pensions together with savings and investments during active life are key elements of retirement planning. Motivation for personal choices about the standard of living, bequest and the replacement ratio of pension with respect to last salary income must be considered. This research contributes to the financial planning by helping to quantify long-term care economic needs. We estimate life expectancy from retirement age onwards. The economic cost of care per unit of service is linked to the expected time of needed care and the intensity of required services. The expected individual cost of long-term care from an onset of dependence is estimated separately for men and women. Assumptions on the mortality of the dependent people compared to the general population are introduced. Parameters defining eligibility for various forms of coverage by the universal public social care of the welfare system are addressed. The impact of the intensity of social services on individual predictions is assessed, and a partial coverage by standard private insurance products is also explored. Data were collected by the Spanish Institute of Statistics in two surveys conducted on the general Spanish population in 1999 and in 2008. Official mortality records and life table trends were used to create realistic scenarios for longevity. We find empirical evidence that the public long-term care system in Spain effectively mitigates the risk of incurring huge lifetime costs. We also find that the most vulnerable categories are citizens with moderate disabilities that do not qualify to obtain public social care support. In the Spanish case, the trends between 1999 and 2008 need to be further explored.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Erosion is deleterious because it reduces the soil's productivity capacity for growing crops and causes sedimentation and water pollution problems. Surface and buried crop residue, as well as live and dead plant roots, play an important role in erosion control. An efficient way to assess the effectiveness of such materials in erosion reduction is by means of decomposition constants as used within the Revised Universal Soil Loss Equation - RUSLE's prior-land-use subfactor - PLU. This was investigated using simulated rainfall on a 0.12 m m-1 slope, sandy loam Paleudult soil, at the Agriculture Experimental Station of the Federal University of Rio Grande do Sul, in Eldorado do Sul, State of Rio Grande do Sul, Brazil. The study area had been covered by native grass pasture for about fifteen years. By the middle of March 1996, the sod was mechanically mowed and the crop residue removed from the field. Late in April 1996, the sod was chemically desiccated with herbicide and, about one month later, the following treatments were established and evaluated for sod biomass decomposition and soil erosion, from June 1996 to May 1998, on duplicated 3.5 x 11.0 m erosion plots: (a) and (b) soil without tillage, with surface residue and dead roots; (c) soil without tillage, with dead roots only; (d) soil tilled conventionally every two-and-half months, with dead roots plus incorporated residue; and (e) soil tilled conventionally every six months, with dead roots plus incorporated residue. Simulated rainfall was applied with a rotating-boom rainfall simulator, at an intensity of 63.5 mm h-1 for 90 min, eight to nine times during the experimental period (about every two-and-half months). Surface and subsurface sod biomass amounts were measured before each rainfall test along with the erosion measurements of runoff rate, sediment concentration in runoff, soil loss rate, and total soil loss. Non-linear regression analysis was performed using an exponential and a power model. Surface sod biomass decomposition was better depicted by the exponential model, while subsurface sod biomass was by the power model. Subsurface sod biomass decomposed faster and more than surface sod biomass, with dead roots in untilled soil without residue on the surface decomposing more than dead roots in untilled soil with surface residue. Tillage type and frequency did not appreciably influence subsurface sod biomass decomposition. Soil loss rates increased greatly with both surface sod biomass decomposition and decomposition of subsurface sod biomass in the conventionally tilled soil, but they were minimally affected by subsurface sod biomass decomposition in the untilled soil. Runoff rates were little affected by the studied treatments. Dead roots plus incorporated residues were effective in reducing erosion in the conventionally tilled soil, while consolidation of the soil surface was important in no-till. The residual effect of the turned soil on erosion diminished gradually with time and ceased after two years.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Pensions together with savings and investments during active life are key elements of retirement planning. Motivation for personal choices about the standard of living, bequest and the replacement ratio of pension with respect to last salary income must be considered. This research contributes to the financial planning by helping to quantify long-term care economic needs. We estimate life expectancy from retirement age onwards. The economic cost of care per unit of service is linked to the expected time of needed care and the intensity of required services. The expected individual cost of long-term care from an onset of dependence is estimated separately for men and women. Assumptions on the mortality of the dependent people compared to the general population are introduced. Parameters defining eligibility for various forms of coverage by the universal public social care of the welfare system are addressed. The impact of the intensity of social services on individual predictions is assessed, and a partial coverage by standard private insurance products is also explored. Data were collected by the Spanish Institute of Statistics in two surveys conducted on the general Spanish population in 1999 and in 2008. Official mortality records and life table trends were used to create realistic scenarios for longevity. We find empirical evidence that the public long-term care system in Spain effectively mitigates the risk of incurring huge lifetime costs. We also find that the most vulnerable categories are citizens with moderate disabilities that do not qualify to obtain public social care support. In the Spanish case, the trends between 1999 and 2008 need to be further explored.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The draft forces of soil engaging tines and theoretical analysis compared to existing mathematical models, have yet not been studied in Rio Grande do Sul soils. From the existing models, those which can get the closest fitting draft forces to real measure on field have been established for two of Rio Grande do Sul soils. An Albaqualf and a Paleudult were evaluated. From the studied models, those suggested by Reece, so called "Universal Earthmoving Equation", Hettiaratchi and Reece, and Godwin and Spoor were the best fitting ones, comparing the calculated results with those measured "in situ". Allowing for the less complexity of Reece's model, it is suggested that this model should be used for modeling draft forces prediction for narrow tines in Albaqualf and Paleudut.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The three-body recombination coefficient of an ultracold atomic system, together with the corresponding two-body scattering length a, allow us to predict the energy E 3 of the shallow trimer bound state, using a universal scaling function. The production of dimers in trapped Bose-Einstein condensates, from three-body recombination processes, in the regime of short magnetic pulses near a Feshbach resonance, is also studied in line with the experimental observation.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Protein-protein interactions (PPIs) are essential for understanding the function of biological systems and have been characterized using a vast array of experimental techniques. These techniques detect only a small proportion of all PPIs and are labor intensive and time consuming. Therefore, the development of computational methods capable of predicting PPIs accelerates the pace of discovery of new interactions. This paper reports a machine learning-based prediction model, the Universal In Silico Predictor of Protein-Protein Interactions (UNISPPI), which is a decision tree model that can reliably predict PPIs for all species (including proteins from parasite-host associations) using only 20 combinations of amino acids frequencies from interacting and non-interacting proteins as learning features. UNISPPI was able to correctly classify 79.4% and 72.6% of experimentally supported interactions and non-interacting protein pairs, respectively, from an independent test set. Moreover, UNISPPI suggests that the frequencies of the amino acids asparagine, cysteine and isoleucine are important features for distinguishing between interacting and non-interacting protein pairs. We envisage that UNISPPI can be a useful tool for prioritizing interactions for experimental validation. © 2013 Valente et al.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Genome predictions based on selected genes would be a very welcome approach for taxonomic studies, including DNA-DNA similarity, G+C content and representative phylogeny of bacteria. At present, DNA-DNA hybridizations are still considered the gold standard in species descriptions. However, this method is time-consuming and troublesome, and datasets can vary significantly between experiments as well as between laboratories. For the same reasons, full matrix hybridizations are rarely performed, weakening the significance of the results obtained. The authors established a universal sequencing approach for the three genes recN, rpoA and thdF for the Pasteurellaceae, and determined if the sequences could be used for predicting DNA-DNA relatedness within the family. The sequence-based similarity values calculated using a previously published formula proved most useful for species and genus separation, indicating that this method provides better resolution and no experimental variation compared to hybridization. By this method, cross-comparisons within the family over species and genus borders easily become possible. The three genes also serve as an indicator of the genome G+C content of a species. A mean divergence of around 1 % was observed from the classical method, which in itself has poor reproducibility. Finally, the three genes can be used alone or in combination with already-established 16S rRNA, rpoB and infB gene-sequencing strategies in a multisequence-based phylogeny for the family Pasteurellaceae. It is proposed to use the three sequences as a taxonomic tool, replacing DNA-DNA hybridization.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

In this study, we estimate the statistical significance of structure prediction by threading. We introduce a single parameter ɛ that serves as a universal measure determining the probability that the best alignment is indeed a native-like analog. Parameter ɛ takes into account both length and composition of the query sequence and the number of decoys in threading simulation. It can be computed directly from the query sequence and potential of interactions, eliminating the need for sequence reshuffling and realignment. Although our theoretical analysis is general, here we compare its predictions with the results of gapless threading. Finally we estimate the number of decoys from which the native structure can be found by existing potentials of interactions. We discuss how this analysis can be extended to determine the optimal gap penalties for any sequence-structure alignment (threading) method, thus optimizing it to maximum possible performance.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

A cultura da cana-de-açúcar é a segunda maior movimentação econômica na cadeia do agronegócio no Brasil. Gera riquezas através da fabricação de açúcar, etanol e cogeração de energia elétrica, além de outros subprodutos. Considerada fonte de energia renovável, a cana-de- açúcar a princípio tinha sua imagem associada a impactos negativos principalmente devido as queimadas realizadas nas lavouras para colheita manual. Nos últimos anos, baseado em decretos e no Protocolo Agro-ambiental, essa prática vem sendo abolida. Para se manter e até mesmo aumentar o rendimento das colhedoras nos canaviais, os gestores têm adotado práticas para reduzir os terraços agrícolas, com impacto nos sistemas de conservação de solos. Assim, este trabalho teve como objetivo identificar os impactos ambientais provocados pela mecanização agrícola decorrente do mau manejo e dimensionamento dos mecanismos de conservação do solo. Neste estudo também se realizou uma análise à Equação Universal de Perdas de Solo (EUPS), como ferramenta para o dimensionamento de terraços agrícolas. O estudo foi realizado em uma microbacia hidrográfica, denominada Ribeirão da Bocaina, localizada na UGRHI-13 (Tietê - Jacaré). Foi possível identificar a variabilidade amostral do solo para o dimensionamento conservacionista, gerando curvas de nível com Desníveis Verticais (D.V) desuniformes, contrariando a sistemática atual de terraços que respeita cotas múltiplas ou mesmo dimensionamentos empiristas, segundo o conhecimento local e o histórico recente da área. Algumas sugestões também foram feitas afim de torná-la uma ferramenta ainda mais eficiente, considerando condições particulares à cultura da cana-de-açúcar, tais como a influência da palhada, sulcos de plantio e diversos tipos de terraços como meios de controle à erosão. A metodologia foi satisfatória, no que tange a compreensão pelos meios de correlação entre as práticas conservacionistas e modelos de predição de perda de solo, trazendo luz à ciência na interpretação das ferramentas existentes e as lacunas a serem preenchidas.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

36