Biblioteca Digital

970 resultados para Multivariate data

Evaluation of FAO Penman-Monteith and alternative methods for estimating reference evapotranspiration with missing data in Southern Ontario, Canada

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Grass reference evapotranspiration (ETo) is an important agrometeorological parameter for climatological and hydrological studies, as well as for irrigation planning and management. There are several methods to estimate ETo, but their performance in different environments is diverse, since all of them have some empirical background. The FAO Penman-Monteith (FAD PM) method has been considered as a universal standard to estimate ETo for more than a decade. This method considers many parameters related to the evapotranspiration process: net radiation (Rn), air temperature (7), vapor pressure deficit (Delta e), and wind speed (U); and has presented very good results when compared to data from lysimeters Populated with short grass or alfalfa. In some conditions, the use of the FAO PM method is restricted by the lack of input variables. In these cases, when data are missing, the option is to calculate ETo by the FAD PM method using estimated input variables, as recommended by FAD Irrigation and Drainage Paper 56. Based on that, the objective of this study was to evaluate the performance of the FAO PM method to estimate ETo when Rn, Delta e, and U data are missing, in Southern Ontario, Canada. Other alternative methods were also tested for the region: Priestley-Taylor, Hargreaves, and Thornthwaite. Data from 12 locations across Southern Ontario, Canada, were used to compare ETo estimated by the FAD PM method with a complete data set and with missing data. The alternative ETo equations were also tested and calibrated for each location. When relative humidity (RH) and U data were missing, the FAD PM method was still a very good option for estimating ETo for Southern Ontario, with RMSE smaller than 0.53 mm day(-1). For these cases, U data were replaced by the normal values for the region and Delta e was estimated from temperature data. The Priestley-Taylor method was also a good option for estimating ETo when U and Delta e data were missing, mainly when calibrated locally (RMSE = 0.40 mm day(-1)). When Rn was missing, the FAD PM method was not good enough for estimating ETo, with RMSE increasing to 0.79 mm day(-1). When only T data were available, adjusted Hargreaves and modified Thornthwaite methods were better options to estimate ETo than the FAO) PM method, since RMSEs from these methods, respectively 0.79 and 0.83 mm day(-1), were significantly smaller than that obtained by FAO PM (RMSE = 1.12 mm day(-1). (C) 2009 Elsevier B.V. All rights reserved.

Spatio-temporal modeling of agricultural yield data with an application to pricing crop insurance contracts

Relevância:

20.00% 20.00%

Publicador:

Resumo:

This article presents a statistical model of agricultural yield data based on a set of hierarchical Bayesian models that allows joint modeling of temporal and spatial autocorrelation. This method captures a comprehensive range of the various uncertainties involved in predicting crop insurance premium rates as opposed to the more traditional ad hoc, two-stage methods that are typically based on independent estimation and prediction. A panel data set of county-average yield data was analyzed for 290 counties in the State of Parana (Brazil) for the period of 1990 through 2002. Posterior predictive criteria are used to evaluate different model specifications. This article provides substantial improvements in the statistical and actuarial methods often applied to the calculation of insurance premium rates. These improvements are especially relevant to situations where data are limited.

Establishing a soybean germplasm core collection

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Core collections are of strategic importance as they allow the use of a small part of a germplasm collection that is representative of the total collection. The objective of this study was to develop a soybean core collection of the USDA Soybean Germplasm Collection by comparing the results of random, proportional, logarithmic, multivariate proportional and multivariate logarithmic sampling strategies. All but the random sampling strategy used stratification of the entire collection based on passport data and maturity group classification. The multivariate proportional and multivariate logarithmic strategies made further use of qualitative and quantitative trait data to select diverse accessions within each stratum. The 18 quantitative trait data distribution parameters were calculated for each core and for the entire collection for pairwise comparison to validate the sampling strategies. All strategies were adequate for assembling a core collection. The random core collection best represented the entire collection in statistical terms. Proportional and logarithmic strategies did not maximize statistical representation but were better in selecting maximum variability. Multivariate proportional and multivariate logarithmic strategies produced the best core collections as measured by maximum variability conservation. The soybean core collection was established using the multivariate proportional selection strategy. (C) 2010 Elsevier B.V. All rights reserved.

Canonical discriminant analysis applied to broiler chicken performance

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The mechanisms involved in the control of growth in chickens are too complex to be explained only under univariate analysis because all related traits are biologically correlated. Therefore, we evaluated broiler chicken performance under a multivariate approach, using the canonical discriminant analysis. A total of 1920 chicks from eight treatments, defined as the combination of four broiler chicken strains (Arbor Acres, AgRoss 308, Cobb 500 and RX) from both sexes, were housed in 48 pens. Average feed intake, average live weight, feed conversion and carcass, breast and leg weights were obtained for days 1 to 42. Canonical discriminant analysis was implemented by SAS((R)) CANDISC procedure and differences between treatments were obtained by the F-test (P < 0.05) over the squared Mahalanobis` distances. Multivariate performance from all treatments could be easily visualised because one graph was obtained from two first canonical variables, which explained 96.49% of total variation, using a SAS((R)) CONELIP macro. A clear distinction between sexes was found, where males were better than females. Also between strains, Arbor Acres, AgRoss 308 and Cobb 500 (commercial) were better than RX (experimental), Evaluation of broiler chicken performance was facilitated by the fact that the six original traits were reduced to only two canonical variables. Average live weight and carcass weight (first canonical variable) were the most important traits to discriminate treatments. The contrast between average feed intake and average live weight plus feed conversion (second canonical variable) were used to classify them. We suggest analysing performance data sets using canonical discriminant analysis.

Endophytic Colonization of Potato (Solanum tuberosum L.) by a Novel Competent Bacterial Endophyte, Pseudomonas putida Strain P9, and Its Effect on Associated Bacterial Communities

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Pseudomonas putida strain P9 is a novel competent endophyte from potato. P9 causes cultivar-dependent suppression of Phytophthora infestans. Colonization of the rhizoplane and endosphere of potato plants by P9 and its rifampin-resistant derivative P9R was studied. The purposes of this work were to follow the fate of P9 inside growing potato plants and to establish its effect on associated microbial communities. The effects of P9 and P9R inoculation were studied in two separate experiments. The roots of transplants of three different cultivars of potato were dipped in suspensions of P9 or P9R cells, and the plants were planted in soil. The fate of both strains was followed by examining colony growth and by performing PCR-denaturing gradient gel electrophoresis (PCR-DGGE). Colonies of both strains were recovered from rhizoplane and endosphere samples of all three cultivars at two growth stages. A conspicuous band, representing P9 and P9R, was found in all Pseudomonas PCR-DGGE fingerprints for treated plants. The numbers of P9R CFU and the P9R-specific band intensities for the different replicate samples were positively correlated, as determined by linear regression analysis. The effects of plant growth stage, genotype, and the presence of P9R on associated microbial communities were examined by multivariate and unweighted-pair group method with arithmetic mean cluster analyses of PCR-DGGE fingerprints. The presence of strain P9R had an effect on bacterial groups identified as Pseudomonas azotoformans, Pseudomonas veronii, and Pseudomonas syringae. In conclusion, strain P9 is an avid colonizer of potato plants, competing with microbial populations indigenous to the potato phytosphere. Bacterization with a biocontrol agent has an important and previously unexplored effect on plant-associated communities.

Genetic population data of 12 STR loci of the PowerPlex (R) Y system in the state of Sao Paulo population (Southeast of Brazil)

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Allele frequency distributions and population data for 12 Y-chromosomal short tandem repeats (STRs) included in the PowerPlex (R) Y Systems (Promega) were obtained for a sample of 200 healthy unrelated males living in S (a) over tildeo Paulo State (Southeast of Brazil). A total of 192 haplotypes were identified, of which 184 were unique and 8 were found in 2 individuals. The average gene diversity of the 12 Y-STR was 0.6746 and the haplotype diversity was 0.9996. Pairwise analysis confirmed that our population is more similar with the Italy, North Portugal and Spain, being more distant of the Japan. (c) 2007 Elsevier Ireland Ltd. All rights reserved.

Evaluation of natural and synthetic compounds according to their antioxidant activity using a multivariate approach

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The antioxidant activity of natural and synthetic compounds was evaluated using five in vitro methods: ferric reducing/antioxidant power (FRAP), 2,2-diphenyl-1-picrylhydradzyl (DPPH), oxygen radical absorption capacity (ORAL), oxidation of an aqueous dispersion of linoleic acid accelerated by azo-initiators (LAOX), and oxidation of a meat homogenate submitted to a thermal treatment (TBARS). All results were expressed as Trolox equivalents. The application of multivariate statistical techniques suggested that the phenolic compounds (caffeic acid, carnosic acid, genistein and resveratrol), beyond their high antioxidant activity measured by the DPPH, FRAP and TBARS methods, showed the highest ability to react with the radicals in the ORAC methodology, compared to the other compounds evaluated in this study (ascorbic acid, erythorbate, tocopherol, BHT, Trolox, tryptophan, citric acid, EDTA, glutathione, lecithin, methionine and tyrosine). This property was significantly correlated with the number of phenolic rings and catecholic structure present in the molecule. Based on a multivariate analysis, it is possible to select compounds from different clusters and explore their antioxidant activity interactions in food products.

Brazilian Network of Food Data Systems and LATINFOODS Regional Technical Compilation Committee: Food composition activities (2006-2009)

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The Brazilian Network of Food Data Systems (BRASILFOODS) has been keeping the Brazilian Food Composition Database-USP (TBCA-USP) (http://www.fcf.usp.br/tabela) since 1998. Besides the constant compilation, analysis and update work in the database, the network tries to innovate through the introduction of food information that may contribute to decrease the risk for non-transmissible chronic diseases, such as the profile of carbohydrates and flavonoids in foods. In 2008, data on carbohydrates, individually analyzed, of 112 foods, and 41 data related to the glycemic response produced by foods widely consumed in the country were included in the TBCA-USP. Data (773) about the different flavonoid subclasses of 197 Brazilian foods were compiled and the quality of each data was evaluated according to the USDAs data quality evaluation system. In 2007, BRASILFOODS/USP and INFOODS/FAO organized the 7th International Food Data Conference ""Food Composition and Biodiversity"". This conference was a unique opportunity for interaction between renowned researchers and participants from several countries and it allowed the discussion of aspects that may improve the food composition area. During the period, the LATINFOODS Regional Technical Compilation Committee and BRASILFOODS disseminated to Latin America the Form and Manual for Data Compilation, version 2009, ministered a Food Composition Data Compilation course and developed many activities related to data production and compilation. (C) 2010 Elsevier Inc. All rights reserved.

Migrating eprints.org data to a Fez repository

Relevância:

20.00% 20.00%

Publicador:

Resumo:

This document records the process of migrating eprints.org data to a Fez repository. Fez is a Web-based digital repository and workflow management system based on Fedora (http://www.fedora.info/). At the time of migration, the University of Queensland Library was using EPrints 2.2.1 [pepper] for its ePrintsUQ repository. Once we began to develop Fez, we did not upgrade to later versions of eprints.org software since we knew we would be migrating data from ePrintsUQ to the Fez-based UQ eSpace. Since this document records our experiences of migration from an earlier version of eprints.org, anyone seeking to migrate eprints.org data into a Fez repository might encounter some small differences. Moving UQ publication data from an eprints.org repository into a Fez repository (hereafter called UQ eSpace (http://espace.uq.edu.au/) was part of a plan to integrate metadata (and, in some cases, full texts) about all UQ research outputs, including theses, images, multimedia and datasets, in a single repository. This tied in with the plan to identify and capture the research output of a single institution, the main task of the eScholarshipUQ testbed for the Australian Partnership for Sustainable Repositories project (http://www.apsr.edu.au/). The migration could not occur at UQ until the functionality in Fez was at least equal to that of the existing ePrintsUQ repository. Accordingly, as Fez development occurred throughout 2006, a list of eprints.org functionality not currently supported in Fez was created so that programming of such development could be planned for and implemented.

A simple data logger for student-designed rocket experiments.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The final-year project for Mechanical & Space Engineering students at UQ often involves the design and flight testing of an experiment. This report describes the design and use of a simple data logger that should be suitable for collecting data from the students' flight experiments. The exercise here was taken as far as the construction of a prototype device that is suitable for ground-based testing, say, the static firing of a hybrid rocket motor.

Correlation of habitual physical activity levels with flow mediated dilation of the brachial artery in 5-10 year old children

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Endothelial dysfunction is an early key event of atherogenesis. Both fitness level and exercise intervention have been shown to positively influence endothelial function. In a cross-sectional study of 47 children, the relationship between habitual physical activity and flow-mediated dilation (FMD) of the brachial artery was explored. Habitual physical activity levels (PALs) were assessed using a validated stable isotope technique, and FMD of the brachial artery was measured via high-resolution ultrasound. The results showed that habitual physical activity significantly correlated with FMD (r=0.39, P=0.007), and remained the most influential variable on dilation in multivariate analysis. Although both fitness level and exercise intervention have previously been shown to positively influence FMD, this is the first time that a relationship with normal PALs has been investigated, especially, at such a young age. These data support the concept that physical activity exerts its protective effect on cardiovascular health via the endothelium and add further emphasis to the importance of physical activity in childhood.

Equational Reasoning as a Tool for Data Analysis

Relevância:

20.00% 20.00%

Publicador:

Resumo:

A combination of deductive reasoning, clustering, and inductive learning is given as an example of a hybrid system for exploratory data analysis. Visualization is replaced by a dialogue with the data.

Leadership Attributes and Cultural Values in Australia and New Zealand Compared: An Initial Report Based on GLOBE Data.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

This paper reports a comparative study of Australian and New Zealand leadership attributes, based on the GLOBE (Global Leadership and Organizational Behavior Effectiveness) program. Responses from 344 Australian managers and 184 New Zealand managers in three industries were analyzed using exploratory and confirmatory factor analysis. Results supported some of the etic leadership dimensions identified in the GLOBE study, but also found some emic dimensions of leadership for each country. An interesting finding of the study was that the New Zealand data fitted the Australian model, but not vice versa, suggesting asymmetric perceptions of leadership in the two countries.

Selection bias in gene extraction on the basis of microarray gene-expression data

Relevância:

20.00% 20.00%

Publicador:

Resumo:

In the context of cancer diagnosis and treatment, we consider the problem of constructing an accurate prediction rule on the basis of a relatively small number of tumor tissue samples of known type containing the expression data on very many (possibly thousands) genes. Recently, results have been presented in the literature suggesting that it is possible to construct a prediction rule from only a few genes such that it has a negligible prediction error rate. However, in these results the test error or the leave-one-out cross-validated error is calculated without allowance for the selection bias. There is no allowance because the rule is either tested on tissue samples that were used in the first instance to select the genes being used in the rule or because the cross-validation of the rule is not external to the selection process; that is, gene selection is not performed in training the rule at each stage of the cross-validation process. We describe how in practice the selection bias can be assessed and corrected for by either performing a cross-validation or applying the bootstrap external to the selection process. We recommend using 10-fold rather than leave-one-out cross-validation, and concerning the bootstrap, we suggest using the so-called. 632+ bootstrap error estimate designed to handle overfitted prediction rules. Using two published data sets, we demonstrate that when correction is made for the selection bias, the cross-validated error is no longer zero for a subset of only a few genes.

Special issue on advances in data mining and its applications

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Data mining is the process to identify valid, implicit, previously unknown, potentially useful and understandable information from large databases. It is an important step in the process of knowledge discovery in databases, (Olaru & Wehenkel, 1999). In a data mining process, input data can be structured, seme-structured, or unstructured. Data can be in text, categorical or numerical values. One of the important characteristics of data mining is its ability to deal data with large volume, distributed, time variant, noisy, and high dimensionality. A large number of data mining algorithms have been developed for different applications. For example, association rules mining can be useful for market basket problems, clustering algorithms can be used to discover trends in unsupervised learning problems, classification algorithms can be applied in decision-making problems, and sequential and time series mining algorithms can be used in predicting events, fault detection, and other supervised learning problems (Vapnik, 1999). Classification is among the most important tasks in the data mining, particularly for data mining applications into engineering fields. Together with regression, classification is mainly for predictive modelling. So far, there have been a number of classification algorithms in practice. According to (Sebastiani, 2002), the main classification algorithms can be categorized as: decision tree and rule based approach such as C4.5 (Quinlan, 1996); probability methods such as Bayesian classifier (Lewis, 1998); on-line methods such as Winnow (Littlestone, 1988) and CVFDT (Hulten 2001), neural networks methods (Rumelhart, Hinton & Wiliams, 1986); example-based methods such as k-nearest neighbors (Duda & Hart, 1973), and SVM (Cortes & Vapnik, 1995). Other important techniques for classification tasks include Associative Classification (Liu et al, 1998) and Ensemble Classification (Tumer, 1996).

«
1
2
...
29
30
31
32
33
34
35
...
64
65
»