950 resultados para CROSS-VALIDATION


Relevância:

60.00% 60.00%

Publicador:

Resumo:

Twenty-four hepatitis C virus patients coinfected with human T-lymphotropic virus type 1 were compared with six coinfected with HTLV-2 and 55 with HCV alone, regarding clinical, epidemiological, laboratory and histopathological data. Fischer's discriminant analysis was applied to define functions capable of differentiating between the study groups (HCV, HCV/HTLV-1 and HCV/HTLV-2). The discriminant accuracy was evaluated by cross-validation. Alcohol consumption, use of intravenous drugs or inhaled cocaine and sexual partnership with intravenous drug users were more frequent in the HCV/HTLV-2 group, whereas patients in the HCV group more often reported abdominal pain or a sexual partner with hepatitis. Coinfected patients presented higher platelet counts, but aminotransferase and gamma-glutamyl transpeptidase levels were higher among HCV-infected subjects. No significant difference between the groups was seen regarding liver histopathological findings. Through discriminant analysis, classification functions were defined, including sex, age group, intravenous drug use and sexual partner with hepatitis. Cross-validation revealed high discriminant accuracy for the HCV group.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

As florestas são uma fonte importante de recursos naturais, desempenhando um papel fulcral na sustentabilidade ambiental. A sua gestão quer territorial quer económica, conduz a uma maximização da produção, sem alteração da qualidade da matéria-prima. Portugal apresenta mais de um terço do seu território coberto por floresta, apresentando uma possibilidade de aplicação de sistemas de gestão, territorial e económica que maximizem a sua produção. Os Sistemas de Informação Geográfica (SIG) são modelos da realidade em que é possível integrar toda a informação disponível sobre um assunto tendo por base um campo comum a todos as variáveis, a localização geográfica. Os SIG podem contribuir de diversas formas para um maior desenvolvimento das rotinas e ferramentas de planeamento e gestão florestal. A sua integração com modelos quantitativos para planeamento e gestão de florestas é uma mais-valia nesta área. Nesta dissertação apresentam-se modelos geoestatísticos, com recurso a Sistemas de Informação Geográfica, de apoio e suporte à produção de pinha em Pinheiro-manso (Pinus pinea L.). Procurando estimar as áreas com melhor propensão à produção, a partir de dados amostrais. Estes foram previamente estudados tendo sido selecionadas quatro variáveis: largura da copa, área basal, altura da árvore e produção de pinha. A geoestatística aplicada, inclui modelos de correlação espacial: kriging, onde são atribuídos pesos às amostras a partir de uma análise espacial baseada no variograma experimental. Foi utilizada a extensão Geostatistical Analyst do ArcGis da ESRI, para realizar 96 krigings para as quatro variáveis em estudo, com diferentes parametrizações, destes foram selecionados 8 krigings. Com base nos critérios de adequação dos modelos e da análise de resultados da predição dos erros - cross validation. O resultado deste estudo é apresentado através de mapas de previsão para a produção de pinha em Pinheiro manso, em que foram analisadas áreas com maior e menor probabilidade de produção tendo-se realizado análises de comparação de variáveis. Através da interseção de todas as variáveis com a produção, podemos concluir que os concelhos com maiores áreas de probabilidade de produção de pinha em Pinheiro manso, da área de estudo, são Alcácer do Sal, Montemor-o-Novo, Vendas Novas, Coruche e Chamusca. Com a realização de um cruzamento de dados entre os resultados obtidos dos krigings, e a Carta de Uso e Ocupação do Solo de Portugal Continental para 2007 (COS2007), realizaram-se mapas de previsão para a expansão do Pinheiro manso. Nas áreas de expansão conseguimos atingir aumentos mínimos na ordem dos 11% e máximo na ordem dos 61%. No total consegue-se atingir aproximadamente 128 mil ha para área de expansão do Pinheiro manso. Superando, os valores esperados pelos Planos Regionais de Ordenamento Florestal, abrangidos pela área da amostra em estudo, em que é esperado um incremento de cerca de 130 mil hectares de área de Pinheiro manso para 2030.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

This paper aims at developing a collision prediction model for three-leg junctions located in national roads (NR) in Northern Portugal. The focus is to identify factors that contribute for collision type crashes in those locations, mainly factors related to road geometric consistency, since literature is scarce on those, and to research the impact of three modeling methods: generalized estimating equations, random-effects negative binomial models and random-parameters negative binomial models, on the factors of those models. The database used included data published between 2008 and 2010 of 177 three-leg junctions. It was split in three groups of contributing factors which were tested sequentially for each of the adopted models: at first only traffic, then, traffic and the geometric characteristics of the junctions within their area of influence; and, lastly, factors which show the difference between the geometric characteristics of the segments boarding the junctionsâ area of influence and the segment included in that area were added. The choice of the best modeling technique was supported by the result of a cross validation made to ascertain the best model for the three sets of researched contributing factors. The models fitted with random-parameters negative binomial models had the best performance in the process. In the best models obtained for every modeling technique, the characteristics of the road environment, including proxy measures for the geometric consistency, along with traffic volume, contribute significantly to the number of collisions. Both the variables concerning junctions and the various national highway segments in their area of influence, as well as variations from those characteristics concerning roadway segments which border the already mentioned area of influence have proven their relevance and, therefore, there is a rightful need to incorporate the effect of geometric consistency in the three-leg junctions safety studies.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Nowadays the main honey producing countries require accurate labeling of honey before commercialization, including floral classification. Traditionally, this classification is made by melissopalynology analysis, an accurate but time-consuming task requiring laborious sample pre-treatment and high-skilled technicians. In this work the potential use of a potentiometric electronic tongue for pollinic assessment is evaluated, using monofloral and polyfloral honeys. The results showed that after splitting honeys according to color (white, amber and dark), the novel methodology enabled quantifying the relative percentage of the main pollens (Castanea sp., Echium sp., Erica sp., Eucaliptus sp., Lavandula sp., Prunus sp., Rubus sp. and Trifolium sp.). Multiple linear regression models were established for each type of pollen, based on the best sensors sub-sets selected using the simulated annealing algorithm. To minimize the overfitting risk, a repeated K-fold cross-validation procedure was implemented, ensuring that at least 10-20% of the honeys were used for internal validation. With this approach, a minimum average determination coefficient of 0.91 ± 0.15 was obtained. Also, the proposed technique enabled the correct classification of 92% and 100% of monofloral and polyfloral honeys, respectively. The quite satisfactory performance of the novel procedure for quantifying the relative pollen frequency may envisage its applicability for honey labeling and geographical origin identification. Nevertheless, this approach is not a full alternative to the traditional melissopalynologic analysis; it may be seen as a practical complementary tool for preliminary honey floral classification, leaving only problematic cases for pollinic evaluation.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Olive oils may be commercialized as intense, medium or light, according to the intensity perception of fruitiness, bitterness and pungency attributes, assessed by a sensory panel. In this work, the capability of an electronic tongue to correctly classify olive oils according to the sensory intensity perception levels was evaluated. Cross-sensitivity and non-specific lipid polymeric membranes were used as sensors. The sensor device was firstly tested using quinine monohydrochloride standard solutions. Mean sensitivities of 14±2 to 25±6 mV/decade, depending on the type of plasticizer used in the lipid membranes, were obtained showing the device capability for evaluating bitterness. Then, linear discriminant models based on sub-sets of sensors, selected by a meta-heuristic simulated annealing algorithm, were established enabling to correctly classify 91% of olive oils according to their intensity sensory grade (leave-one-out cross-validation procedure). This capability was further evaluated using a repeated K-fold cross-validation procedure, showing that the electronic tongue allowed an average correct classification of 80% of the olive oils used for internal-validation. So, the electronic tongue can be seen as a taste sensor, allowing differentiating olive oils with different sensory intensities, and could be used as a preliminary, complementary and practical tool for panelists during olive oil sensory analysis.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Olive oil quality grading is traditionally assessed by human sensory evaluation of positive and negative attributes (olfactory, gustatory, and final olfactorygustatory sensations). However, it is not guaranteed that trained panelist can correctly classify monovarietal extra-virgin olive oils according to olive cultivar. In this work, the potential application of human (sensory panelists) and artificial (electronic tongue) sensory evaluation of olive oils was studied aiming to discriminate eight single-cultivar extra-virgin olive oils. Linear discriminant, partial least square discriminant, and sparse partial least square discriminant analyses were evaluated. The best predictive classification was obtained using linear discriminant analysis with simulated annealing selection algorithm. A low-level data fusion approach (18 electronic tongue signals and nine sensory attributes) enabled 100 % leave-one-out cross-validation correct classification, improving the discrimination capability of the individual use of sensor profiles or sensory attributes (70 and 57 % leave-one-out correct classifications, respectively). So, human sensory evaluation and electronic tongue analysis may be used as complementary tools allowing successful monovarietal olive oil discrimination.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Tese de Doutoramento em Engenharia Civil.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Natural mineral waters (still), effervescent natural mineral waters (sparkling) and aromatized waters with fruit-flavors (still or sparkling) are an emerging market. In this work, the capability of a potentiometric electronic tongue, comprised with lipid polymeric membranes, to quantitatively estimate routinely quality physicochemical parameters (pH and conductivity) as well as to qualitatively classify water samples according to the type of water was evaluated. The study showed that a linear discriminant model, based on 21 sensors selected by the simulated annealing algorithm, could correctly classify 100 % of the water samples (leave-one out cross-validation). This potential was further demonstrated by applying a repeated K-fold cross-validation (guaranteeing that at least 15 % of independent samples were only used for internal-validation) for which 96 % of correct classifications were attained. The satisfactory recognition performance of the E-tongue could be attributed to the pH, conductivity, sugars and organic acids contents of the studied waters, which turned out in significant differences of sweetness perception indexes and total acid flavor. Moreover, the E-tongue combined with multivariate linear regression models, based on sub-sets of sensors selected by the simulated annealing algorithm, could accurately estimate waters pH (25 sensors: R 2 equal to 0.99 and 0.97 for leave-one-out or repeated K-folds cross-validation) and conductivity (23 sensors: R 2 equal to 0.997 and 0.99 for leave-one-out or repeated K-folds cross-validation). So, the overall satisfactory results achieved, allow envisaging a potential future application of electronic tongue devices for bottled water analysis and classification.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Recently there has been a renewed research interest in the properties of non survey updates of input-output tables and social accounting matrices (SAM). Along with the venerable and well known scaling RAS method, several alternative new procedures related to entropy minimization and other metrics have been suggested, tested and used in the literature. Whether these procedures will eventually substitute or merely complement the RAS approach is still an open question without a definite answer. The performance of many of the updating procedures has been tested using some kind of proximity or closeness measure to a reference input-output table or SAM. The first goal of this paper, in contrast, is the proposal of checking the operational performance of updating mechanisms by way of comparing the simulation results that ensue from adopting alternative databases for calibration of a reference applied general equilibrium model. The second goal is to introduce a new updatin! g procedure based on information retrieval principles. This new procedure is then compared as far as performance is concerned to two well-known updating approaches: RAS and cross-entropy. The rationale for the suggested cross validation is that the driving force for having more up to date databases is to be able to conduct more current, and hopefully more credible, policy analyses.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Introduction: As part of the MicroArray Quality Control (MAQC)-II project, this analysis examines how the choice of univariate feature-selection methods and classification algorithms may influence the performance of genomic predictors under varying degrees of prediction difficulty represented by three clinically relevant endpoints. Methods: We used gene-expression data from 230 breast cancers (grouped into training and independent validation sets), and we examined 40 predictors (five univariate feature-selection methods combined with eight different classifiers) for each of the three endpoints. Their classification performance was estimated on the training set by using two different resampling methods and compared with the accuracy observed in the independent validation set. Results: A ranking of the three classification problems was obtained, and the performance of 120 models was estimated and assessed on an independent validation set. The bootstrapping estimates were closer to the validation performance than were the cross-validation estimates. The required sample size for each endpoint was estimated, and both gene-level and pathway-level analyses were performed on the obtained models. Conclusions: We showed that genomic predictor accuracy is determined largely by an interplay between sample size and classification difficulty. Variations on univariate feature-selection methods and choice of classification algorithm have only a modest impact on predictor performance, and several statistically equally good predictors can be developed for any given classification problem.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Lean meat percentage (LMP) is an important carcass quality parameter. The aim of this work is to obtain a calibration equation for the Computed Tomography (CT) scans with the Partial Least Square Regression (PLS) technique in order to predict the LMP of the carcass and the different cuts and to study and compare two different methodologies of the selection of the variables (Variable Importance for Projection — VIP- and Stepwise) to be included in the prediction equation. The error of prediction with cross-validation (RMSEPCV) of the LMP obtained with PLS and selection based on VIP value was 0.82% and for stepwise selection it was 0.83%. The prediction of the LMP scanning only the ham had a RMSEPCV of 0.97% and if the ham and the loin were scanned the RMSEPCV was 0.90%. Results indicate that for CT data both VIP and stepwise selection are good methods. Moreover the scanning of only the ham allowed us to obtain a good prediction of the LMP of the whole carcass.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

BACKGROUND: Little information is available on the validity of simple and indirect body-composition methods in non-Western populations. Equations for predicting body composition are population-specific, and body composition differs between blacks and whites. OBJECTIVE: We tested the hypothesis that the validity of equations for predicting total body water (TBW) from bioelectrical impedance analysis measurements is likely to depend on the racial background of the group from which the equations were derived. DESIGN: The hypothesis was tested by comparing, in 36 African women, TBW values measured by deuterium dilution with those predicted by 23 equations developed in white, African American, or African subjects. These cross-validations in our African sample were also compared, whenever possible, with results from other studies in black subjects. RESULTS: Errors in predicting TBW showed acceptable values (1.3-1.9 kg) in all cases, whereas a large range of bias (0.2-6.1 kg) was observed independently of the ethnic origin of the sample from which the equations were derived. Three equations (2 from whites and 1 from blacks) showed nonsignificant bias and could be used in Africans. In all other cases, we observed either an overestimation or underestimation of TBW with variable bias values, regardless of racial background, yielding no clear trend for validity as a function of ethnic origin. CONCLUSIONS: The findings of this cross-validation study emphasize the need for further fundamental research to explore the causes of the poor validity of TBW prediction equations across populations rather than the need to develop new prediction equations for use in Africa.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Lean meat percentage (LMP) is the criterion for carcass classification and it must be measured on line objectively. The aim of this work was to compare the error of the prediction (RMSEP) of the LMP measured with the following different devices: Fat-O-Meat’er (FOM), UltraFOM (UFOM), AUTOFOM and -VCS2000. For this reason the same 99 carcasses were measured using all 4 apparatus and dissected according to the European Reference Method. Moreover a subsample of the carcasses (n=77) were fully scanned with a X-ray Computed Tomography equipment (CT). The RMSEP calculated with cross validation leave-one-out was lower for FOM and AUTOFOM (1.8% and 1.9%, respectively) and higher for UFOM and VCS2000 (2.3% for both devices). The error obtained with CT was the lowest (0.96%) in accordance with previous results, but CT cannot be used on line. It can be concluded that FOM and AUTOFOM presented better accuracy than UFOM and VCS2000.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

La predicció del grau d’adaptació de les persones que ingressen als centres penitenciaris és un element clau per poder minimitzar problemes regimentals i, alhora, facilitar el procés de rehabilitació. Els elements predictors es poden fonamentar en variables psicomètriques (mesurades mitjançant qüestionaris psicològics) i també en judicis tècnics professionals, basats en l’acumulació d’experiència obtinguda pel treball. En aquesta investigació, es validen dos qüestionaris psicomètrics que poden ser útils per predir el grau d’adaptació regimental: un de personalitat, el CPS (Cuestionario de Personalidad Situacional) i l’altre d’impulsivitat, el BARRAT (BIS-11). Els resultats demostren que les variables del CPS prediuen millor el comportament conflictiu, mentre que la impulsivitat mesurada pel BARRAT prediu millor el comportament adaptat. També es validen els criteris tècnics psicològic (judici tècnic emès per part del psicòleg) i criminològic (judici tècnic emès per part del jurista criminòleg). El pas següent ha estat fer una validació creuada que consisteix en la comparació mútua entre el criteris psicomètrics i els tècnics, per veure quines variables són finalment seleccionades de cara a la predicció de la variable més rellevant de l’adaptació regimental: haver tingut o no una regressió de grau de tractament penitenciari. Els resultats demostren que no hi ha una superioritat dels criteris psicomètrics sobre els criteris tècnics i que no estan oposats, sinó que es reforcen mútuament per millorar la predicció del grau d’adaptació regimental. El poder predictiu de les variables seleccionades és encara més alt quan es tracta de predir l’adaptació al règim penitenciari dels interns primaris (els que ingressen per primera vegada a la presó) que quan es refereix als reincidents. D’altra banda, s’ha baremat el qüestionari d’impulsivitat de BARRAT (BIS-11) en població penitenciària per convertir-lo en una eina de pronòstic, tant de comportament regimental com de reincidència penal.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Models predicting species spatial distribution are increasingly applied to wildlife management issues, emphasising the need for reliable methods to evaluate the accuracy of their predictions. As many available datasets (e.g. museums, herbariums, atlas) do not provide reliable information about species absences, several presence-only based analyses have been developed. However, methods to evaluate the accuracy of their predictions are few and have never been validated. The aim of this paper is to compare existing and new presenceonly evaluators to usual presence/absence measures. We use a reliable, diverse, presence/absence dataset of 114 plant species to test how common presence/absence indices (Kappa, MaxKappa, AUC, adjusted D-2) compare to presenceonly measures (AVI, CVI, Boyce index) for evaluating generalised linear models (GLM). Moreover we propose a new, threshold-independent evaluator, which we call "continuous Boyce index". All indices were implemented in the B10MAPPER software. We show that the presence-only evaluators are fairly correlated (p > 0.7) to the presence/absence ones. The Boyce indices are closer to AUC than to MaxKappa and are fairly insensitive to species prevalence. In addition, the Boyce indices provide predicted-toexpected ratio curves that offer further insights into the model quality: robustness, habitat suitability resolution and deviation from randomness. This information helps reclassifying predicted maps into meaningful habitat suitability classes. The continuous Boyce index is thus both a complement to usual evaluation of presence/absence models and a reliable measure of presence-only based predictions.