988 resultados para least absolute deviation (LAD) fitting


Relevância:

100.00% 100.00%

Publicador:

Resumo:

The generalized Bonferroni mean is able to capture some interaction effects between variables and model mandatory requirements. We present a number of weights identification algorithms we have developed in the R programming language in order to model data using the generalized Bonferroni mean subject to various preferences. We then compare its accuracy when fitting to the journal ranks dataset.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

We discuss the problem of learning fuzzy measures from empirical data. Values of the discrete Choquet integral are fitted to the data in the least absolute deviation sense. This problem is solved by linear programming techniques. We consider the cases when the data are given on the numerical and interval scales. An open source programming library which facilitates calculations involving fuzzy measures and their learning from data is presented.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

This study focuses on multiple linear regression models relating six climate indices (temperature humidity THI, environmental stress ESI, equivalent temperature index ETI, heat load HLI, modified HLI (HLI new), and respiratory rate predictor RRP) with three main components of cow’s milk (yield, fat, and protein) for cows in Iran. The least absolute shrinkage selection operator (LASSO) and the Akaike information criterion (AIC) techniques are applied to select the best model for milk predictands with the smallest number of climate predictors. Uncertainty estimation is employed by applying bootstrapping through resampling. Cross validation is used to avoid over-fitting. Climatic parameters are calculated from the NASA-MERRA global atmospheric reanalysis. Milk data for the months from April to September, 2002 to 2010 are used. The best linear regression models are found in spring between milk yield as the predictand and THI, ESI, ETI, HLI, and RRP as predictors with p-value < 0.001 and R2 (0.50, 0.49) respectively. In summer, milk yield with independent variables of THI, ETI, and ESI show the highest relation (p-value < 0.001) with R2 (0.69). For fat and protein the results are only marginal. This method is suggested for the impact studies of climate variability/change on agriculture and food science fields when short-time series or data with large uncertainty are available.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

This article examines the construction of aggregation functions from data by minimizing the least absolute deviation criterion. We formulate various instances of such problems as linear programming problems. We consider the cases in which the data are provided as intervals, and the outputs ordering needs to be preserved, and show that linear programming formulation is valid for such cases. This feature is very valuable in practice, since the standard simplex method can be used.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

We consider the use of Ordered Weighted Averaging (OWA) in linear regression. Our goal is to replace the traditional least squares, least absolute deviation, and maximum likelihood criteria with an OWA function of the residuals. We obtain several high breakdown robust regression methods as special cases (least median, least trimmed squares, trimmed likelihood methods). We also present new formulations of regression problem. OWA-based regression is particularly useful in the presence of outliers.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

We consider an application of fuzzy logic connectives to statistical regression. We replace the standard least squares, least absolute deviation, and maximum likelihood criteria with an ordered weighted averaging (OWA) function of the residuals. Depending on the choice of the weights, we obtain the standard regression problems, high-breakdown robust methods (least median, least trimmed squares, and trimmed likelihood methods), as well as new formulations. We present various approaches to numerical solution of such regression problems. OWA-based regression is particularly useful in the presence of outliers, and we illustrate the performance of the new methods on several instances of linear regression problems with multiple outliers.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

ACM Computing Classification System (1998): I.4.9, I.4.10.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

2010 Mathematics Subject Classification: 62F12, 62M05, 62M09, 62M10, 60G42.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

O fator de compressibilidade (Z) de gás natural é utilizado em vários cálculos na engenharia de petróleo (avaliação de formações, perda de carga em tubulações, gradiente de pressão em poços de gás, cálculos de balanço de massa, medição de gás, compressão e processamento de gás). As fontes mais comuns de valores de Z são medições experimentais, caras e demoradas. Essa propriedade também é estimada por correlações empíricas, modelos baseados no princípio dos estados correspondentes ou equações de estado (EOS). Foram avaliadas as capacidades das EOS de Soave-Redlich-Kwong (SRK), Peng-Robinson (PR), Patel-Teja (PT), Patel-Teja-Valderrama (PTV), Schmidt-Wenzel (SW), Lawal-Lake-Silberberg (LLS) e AGA-8 para previsão desta propriedade em aproximadamente 2200 pontos de dados experimentais. Estes pontos foram divididos em quatro grupos: Grupo 1 (Presença de frações C7+, Grupo 2 (temperaturas inferiores a 258,15 K), Grupo 3 (pressões superiores a 10000 kPa) e Grupo 4 (pressões inferiores a 10000 kPa). Os cálculos utilizando as equações de estado sob diferentes esquemas de previsão de coeficientes binários de interação foram cuidadosamente investigados. Os resultados sugerem que a EOS AGA-8 apresenta os menores erros para pressões de até 70000 kPa. Entretanto, observou-se uma tendência de aumento nos desvios médios absolutos em função das concentrações de CO2 e H2S. As EOS PTV e a EOS SW são capazes de predizer o fator de compressibilidade (Z) com desvios médios absolutos entre os valores calculados e experimentais com precisão satisfatória para a maioria das aplicações, para uma variada faixa de temperatura e pressão. Este estudo também apresenta uma avaliação de 224 métodos de cálculo de Z onde foram utilizadas 8 correlações combinadas com 4 regras de mistura para estimativa de temperaturas e pressões pseudorreduzidas das amostras, junto com 7 métodos de caracterização das propriedades críticas da fração C7+, quando presente na composição do gás. Em função dos resultados são sugeridas, para diferentes tipos de sistemas, as melhores combinações de correlações com regras de mistura capazes de predizer fatores de compressibilidade (Z) com os menores erros absolutos médios relativos

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Experimental values for the solubility of carbon dioxide, ethane, methane, oxygen, nitrogen, hydrogen, argon and carbon monoxide in 1-butyl-3-methylimidazolium hexafluorophosphate, [bmim][PF6] - a room temperature ionic liquid - are reported as a function of temperature between 283 and 343 K and at pressures close to atmospheric. Carbon dioxide is the most soluble and hydrogen is the least soluble of the gases studied with mole fraction solubilities of the order of 10-2 and 10-4, respectively. All the mole fraction solubilities decrease with temperature except for hydrogen for which a maximum is observed at temperatures close to 310 K. From the variation of solubility, expressed as Henry's law constants, with temperature, the partial molar thermodynamic functions of solvation such as the standard Gibbs energy, the enthalpy, and the entropy are calculated. The precision of the experimental data, considered as the average absolute deviation of the Henry's law constants from appropriate smoothing equations, is better than ±1%. © 2005 Elsevier B.V. All rights reserved.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES)

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Genomewide marker information can improve the reliability of breeding value predictions for young selection candidates in genomic selection. However, the cost of genotyping limits its use to elite animals, and how such selective genotyping affects predictive ability of genomic selection models is an open question. We performed a simulation study to evaluate the quality of breeding value predictions for selection candidates based on different selective genotyping strategies in a population undergoing selection. The genome consisted of 10 chromosomes of 100 cM each. After 5,000 generations of random mating with a population size of 100 (50 males and 50 females), generation G(0) (reference population) was produced via a full factorial mating between the 50 males and 50 females from generation 5,000. Different levels of selection intensities (animals with the largest yield deviation value) in G(0) or random sampling (no selection) were used to produce offspring of G(0) generation (G(1)). Five genotyping strategies were used to choose 500 animals in G(0) to be genotyped: 1) Random: randomly selected animals, 2) Top: animals with largest yield deviation values, 3) Bottom: animals with lowest yield deviations values, 4) Extreme: animals with the 250 largest and the 250 lowest yield deviations values, and 5) Less Related: less genetically related animals. The number of individuals in G(0) and G(1) was fixed at 2,500 each, and different levels of heritability were considered (0.10, 0.25, and 0.50). Additionally, all 5 selective genotyping strategies (Random, Top, Bottom, Extreme, and Less Related) were applied to an indicator trait in generation G(0), and the results were evaluated for the target trait in generation G(1), with the genetic correlation between the 2 traits set to 0.50. The 5 genotyping strategies applied to individuals in G(0) (reference population) were compared in terms of their ability to predict the genetic values of the animals in G(1) (selection candidates). Lower correlations between genomic-based estimates of breeding values (GEBV) and true breeding values (TBV) were obtained when using the Bottom strategy. For Random, Extreme, and Less Related strategies, the correlation between GEBV and TBV became slightly larger as selection intensity decreased and was largest when no selection occurred. These 3 strategies were better than the Top approach. In addition, the Extreme, Random, and Less Related strategies had smaller predictive mean squared errors (PMSE) followed by the Top and Bottom methods. Overall, the Extreme genotyping strategy led to the best predictive ability of breeding values, indicating that animals with extreme yield deviations values in a reference population are the most informative when training genomic selection models.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The main purpose of this study is to assess the relationship between six bioclimatic indices for cattle (temperature humidity (THI), environmental stress (ESI), equivalent temperature (ESI), heat load (HLI), modified heat load (HLInew) and respiratory rate predictor(RRP)) and fundamental milk components (fat, protein, and milk yield) considering uncertainty. The climate parameters used to calculate the climate indices were taken from the NASA-Modern Era Retrospective-Analysis for Research and Applications (NASA-MERRA) reanalysis from 2002 to 2010. Cow milk data were considered for the same period from April to September when cows use natural pasture, with possibility for cows to choose to stay in the barn or to graze on the pasture in the pasturing system. The study is based on a linear regression analysis using correlations as a summarizing diagnostic. Bootstrapping is used to represent uncertainty estimation through resampling in the confidence intervals. To find the relationships between climate indices (THI, ETI, HLI, HLInew, ESI and RRP) and main components of cow milk (fat, protein and yield), multiple liner regression is applied. The least absolute shrinkage selection operator (LASSO) and the Akaike information criterion (AIC) techniques are applied to select the best model for milk predictands with the smallest number of climate predictors. Cross validation is used to avoid over-fitting. Based on results of investigation the effect of heat stress indices on milk compounds separately, we suggest the use of ESI and RRP in the summer and ESI in the spring. THI and HLInew are suggested for fat content and HLInew also is suggested for protein content in the spring season. The best linear models are found in spring between milk yield as predictands and THI, ESI,HLI, ETI and RRP as predictors with p-value < 0.001 and R2 0.50, 0.49. In summer, milk yield with independent variables of THI, ETI and ESI show the highest relation (p-value < 0.001) with R2 (0.69). For fat and protein the results are only marginal. It is strongly suggested that new and significant indices are needed to control critical heat stress conditions that consider more predictors of the effect of climate variability on animal products, such as sunshine duration, quality of pasture, the number of days of stress (NDS), the color of skin with attention to large black spots, and categorical predictors such as breed, welfare facility, and management system. This methodology is suggested for studies investigating the impacts of climate variability/change on food quality/security, animal science and agriculture using short term data considering uncertainty or data collection is expensive, difficult, or data with gaps.