972 results for data transformation


Relevance: 30.00%

Abstract:

Intensification processes in homegardens of the Nuba Mountains, Sudan, raise concerns about strongly positive carbon (C) and nutrient balances, which are expected to lead to substantial element losses from these agroecosystems, in particular via soil gaseous emissions. This thesis therefore aimed at quantifying the C, nitrogen (N), phosphorus (P) and potassium (K) input and output fluxes, with a special focus on soil gaseous losses, and at calculating the respective element balances. A further focus of this thesis was rainfall, a valuable resource for rain-fed agriculture in the Nuba Mountains. To minimize the negative consequences of the high variability of rainfall, rain-fed farmers have developed risk-reducing mechanisms that may lose their efficacy under the climate change effects predicted for East Africa. The second objective of this study was therefore to examine possible changes in rainfall amounts during the last 60 years and to provide rain-fed farmers in the Nuba Mountains with reliable risk and probability statements for rainfall-induced events of agricultural importance. Soil gaseous emissions of C (as CO₂) and N (as NH₃ and N₂O) from two traditional and two intensified homegardens were determined with a portable dynamic closed-chamber system. Gaseous emission rates of C peaked at the onset of the rainy season (2,325 g CO₂-C ha⁻¹ h⁻¹ in an intensified garden type) and those of N during the rainy season (16 g NH₃-N ha⁻¹ h⁻¹ and 11.3 g N₂O-N ha⁻¹ h⁻¹, in a traditional garden type). The data indicated cumulative annual emissions of 5,893 kg CO₂-C ha⁻¹, 37 kg NH₃-N ha⁻¹, and 16 kg N₂O-N ha⁻¹. To assess the long-term productivity of the two types of homegardens and to identify pathways of substantial element losses, a C and nutrient budget approach was used. Observation plots were selected in three traditional and three intensified homegardens, and the following variables were quantified on each plot between June and December 2010: soil amendments, irrigation, biomass removal, symbiotic N₂ fixation, C fixation by photosynthesis, atmospheric wet and dry deposition, leaching and soil gaseous emissions. Annual balances for C and nutrients amounted to -21 kg C ha⁻¹, -70 kg N ha⁻¹, 9 kg P ha⁻¹ and -117 kg K ha⁻¹ in intensified homegardens and to -1,722 kg C ha⁻¹, -167 kg N ha⁻¹, -9 kg P ha⁻¹ and -74 kg K ha⁻¹ in traditional homegardens. For the analysis of rainfall data, the INSTAT+ software was used to aggregate long-term daily rainfall records from the Kadugli and Rashad weather stations into daily, monthly and annual intervals and to calculate rainfall-induced events of agricultural importance. These calculated values and events were then checked for possible monotonic trends with Mann-Kendall tests. Over the period from 1970 to 2009, annual rainfall did not change significantly at either station. During this period, however, an increase in low rainfall events, coinciding with a decline in the number of medium daily rainfall events, was observed in Rashad. Furthermore, the availability of daily rainfall data enabled frequency and conditional probability calculations, which showed either no statistically significant changes or trends resulting only in minor changes of probabilities.
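
As a rough illustration of the trend-testing step described above, the following Python sketch implements a basic Mann-Kendall test (without tie correction) and applies it to synthetic annual rainfall totals; the data, and the simplified test itself, are placeholders rather than the thesis' INSTAT+ workflow.

```python
# Minimal Mann-Kendall trend test, as applied to annual rainfall totals.
# The rainfall values below are synthetic, not the Kadugli/Rashad records.
import numpy as np
from scipy.stats import norm

def mann_kendall(x):
    """Return the Mann-Kendall S statistic and two-sided p-value (no tie correction)."""
    x = np.asarray(x, dtype=float)
    n = len(x)
    # S counts concordant minus discordant pairs ordered in time.
    s = sum(np.sign(x[j] - x[i]) for i in range(n - 1) for j in range(i + 1, n))
    var_s = n * (n - 1) * (2 * n + 5) / 18.0                    # variance of S under H0
    z = (s - np.sign(s)) / np.sqrt(var_s) if s != 0 else 0.0    # continuity correction
    p = 2 * (1 - norm.cdf(abs(z)))
    return s, p

rng = np.random.default_rng(0)
annual_rainfall = 600 + rng.normal(0, 80, size=40)   # 40 years of synthetic totals (mm)
s, p = mann_kendall(annual_rainfall)
print(f"S = {s:.0f}, p = {p:.3f}")   # p > 0.05 -> no significant monotonic trend
```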

Relevance: 30.00%

Abstract:

It is well known that regression analyses involving compositional data need special attention because the data are not of full rank. For a regression analysis where both the dependent and the independent variable are components, we propose a transformation of the components that emphasizes their roles as dependent and independent variables. A simple linear regression can then be performed on the transformed components. The regression line can be depicted in a ternary diagram, facilitating the interpretation of the analysis in terms of components. An example with time-budgets illustrates the method and its graphical features.
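
The paper's specific transformation is not reproduced here; the sketch below only illustrates the general idea of log-ratio transforming both compositions (here with an additive log-ratio, alr) and fitting a simple linear regression between single transformed coordinates, on invented time-budget-like data.

```python
# Illustrative sketch only: not the paper's transformation.
# Log-ratio transform both compositions (alr, last part as reference) and fit an
# ordinary least-squares line between single transformed coordinates.
import numpy as np

def alr(parts):
    """Additive log-ratio transform of rows that sum to 1 (last part is the reference)."""
    parts = np.asarray(parts, dtype=float)
    return np.log(parts[:, :-1] / parts[:, -1:])

rng = np.random.default_rng(1)
# Synthetic 3-part time budgets (rows sum to 1) for the explanatory and response sides.
x = rng.dirichlet([4, 3, 2], size=50)
y = np.clip(x + rng.normal(0, 0.02, x.shape), 1e-3, None)
y = y / y.sum(axis=1, keepdims=True)

x1, y1 = alr(x)[:, 0], alr(y)[:, 0]          # first alr coordinate of each composition
slope, intercept = np.polyfit(x1, y1, 1)      # simple linear regression in alr space
print(f"y1 ~ {slope:.2f} * x1 + {intercept:.2f}")
```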

Relevance: 30.00%

Abstract:

In a seminal paper, Aitchison and Lauder (1985) introduced classical kernel density estimation techniques in the context of compositional data analysis. They gave two options for the choice of the kernel to be used in the kernel estimator. One of these kernels is based on the use of the alr transformation on the simplex S^D jointly with the normal distribution on R^(D-1). However, these authors themselves recognized that this method has some deficiencies. A method for overcoming these difficulties, based on recent developments in compositional data analysis and multivariate kernel estimation theory and combining the ilr transformation with the use of the normal density with a full bandwidth matrix, was recently proposed in Martín-Fernández, Chacón and Mateu-Figueras (2006). Here we present an extensive simulation study that compares both methods in practice, thus exploring the finite-sample behaviour of both estimators.
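
A minimal sketch of the ilr-plus-normal-kernel idea, on synthetic 3-part compositions, using scipy's gaussian_kde (whose bandwidth is a scaled full covariance matrix) as a stand-in for the estimators compared in the paper:

```python
# Hedged sketch: transform 3-part compositions to ilr coordinates and estimate the
# density there with a Gaussian kernel having a full bandwidth matrix.
import numpy as np
from scipy.stats import gaussian_kde

def ilr(parts):
    """Isometric log-ratio coordinates of 3-part compositions (one standard basis)."""
    p = np.asarray(parts, dtype=float)
    z1 = np.sqrt(1.0 / 2.0) * np.log(p[:, 0] / p[:, 1])
    z2 = np.sqrt(2.0 / 3.0) * np.log(np.sqrt(p[:, 0] * p[:, 1]) / p[:, 2])
    return np.vstack([z1, z2])

rng = np.random.default_rng(2)
composition = rng.dirichlet([5, 3, 2], size=200)   # synthetic data on the simplex
coords = ilr(composition)                           # shape (2, n), as gaussian_kde expects
kde = gaussian_kde(coords)                          # normal kernel, full bandwidth matrix
print(kde(ilr(np.array([[0.5, 0.3, 0.2]]))))        # density (in ilr space) at one point
```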

Relevance: 30.00%

Abstract:

In an earlier investigation (Burger et al., 2000), five sediment cores near the Rodrigues Triple Junction in the Indian Ocean were studied using classical statistical methods (fuzzy c-means clustering, a linear mixing model, principal component analysis) for the extraction of endmembers and the evaluation of the spatial and temporal variation of geochemical signals. Three main factors of sedimentation were expected by the marine geologists: a volcano-genetic, a hydrothermal and an ultra-basic factor. The display of fuzzy membership values and/or factor scores versus depth provided consistent results for two factors only; the ultra-basic component could not be identified. The reason for this may be that only traditional statistical methods were applied, i.e. the untransformed components were used, with the cosine-theta coefficient as the similarity measure. During the last decade considerable progress in compositional data analysis was made, and many case studies were published using new tools for the exploratory analysis of such data. It therefore makes sense to check whether the application of suitable data transformations, reduction of the D-part simplex to two or three factors and visual interpretation of the factor scores would lead to a revision of the earlier results and to answers to open questions. In this paper we follow the lines of a paper by R. Tolosana-Delgado et al. (2005), starting with a problem-oriented interpretation of the biplot scattergram, extracting compositional factors, ilr-transforming the components and visualizing the factor scores in a spatial context: the compositional factors will be plotted versus depth (time) of the core samples in order to facilitate the identification of the expected sources of the sedimentary process. Key words: compositional data analysis, biplot, deep sea sediments
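
The following sketch, on invented core data, shows the kind of exploratory step described: clr-transform the compositions, compute an SVD (the basis of the compositional biplot) and list the first two factor scores against sample depth.

```python
# Sketch only: clr-transform synthetic core compositions, run an SVD and list the
# first two factor scores against depth. Data and depths are invented.
import numpy as np

def clr(parts):
    """Centred log-ratio transform: log parts minus the row-wise log geometric mean."""
    logp = np.log(np.asarray(parts, dtype=float))
    return logp - logp.mean(axis=1, keepdims=True)

rng = np.random.default_rng(3)
depth = np.arange(0, 500, 50)                        # cm below the sea floor (synthetic)
comp = rng.dirichlet([6, 3, 2, 1], size=len(depth))  # 4-part geochemical composition

z = clr(comp)
u, s, vt = np.linalg.svd(z - z.mean(axis=0), full_matrices=False)
scores = u[:, :2] * s[:2]                            # first two biplot factor scores
for d, (f1, f2) in zip(depth, scores):
    print(f"depth {d:3d} cm: factor1 = {f1:+.2f}, factor2 = {f2:+.2f}")
```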

Relevance: 30.00%

Abstract:

Many multivariate methods that are apparently distinct can be linked by introducing one or more parameters in their definition. Methods that can be linked in this way are correspondence analysis, unweighted or weighted logratio analysis (the latter also known as "spectral mapping"), nonsymmetric correspondence analysis, principal component analysis (with and without logarithmic transformation of the data) and multidimensional scaling. In this presentation I will show how several of these methods, which are frequently used in compositional data analysis, may be linked through parametrizations such as power transformations, linear transformations and convex linear combinations. Since the methods of interest here all lead to visual maps of data, a "movie" can be made in which the linking parameter is allowed to vary in small steps: the results are recalculated "frame by frame" and one can see the smooth change from one method to another. Several of these "movies" will be shown, giving a deeper insight into the similarities and differences between these methods.
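
As a toy illustration of one such linking device, the sketch below sweeps the generic power (Box-Cox type) transform, which tends to the logarithm as its parameter goes to zero; this is only the familiar transform the abstract alludes to, not the presentation's exact parametrization.

```python
# The power transform (x**alpha - 1) / alpha approaches log(x) as alpha -> 0, so
# sweeping alpha in small steps moves smoothly from an "untransformed" analysis
# towards a logratio one. Generic transform only, not the presentation's method.
import numpy as np

def power_transform(x, alpha):
    x = np.asarray(x, dtype=float)
    return np.log(x) if abs(alpha) < 1e-12 else (x**alpha - 1.0) / alpha

x = np.array([0.2, 0.5, 1.0, 2.0, 5.0])
for alpha in (1.0, 0.5, 0.1, 0.0):                 # "frames" of the movie
    print(f"alpha = {alpha:3.1f}:", np.round(power_transform(x, alpha), 3))
```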

Relevance: 30.00%

Abstract:

Factor analysis, a frequently used technique for multivariate data inspection, is widely applied to compositional data as well. The usual way is to use a centered logratio (clr) transformation to obtain the random vector y of dimension D. The factor model is then y = Λf + e (1), with the factors f of dimension k < D, the error term e, and the loadings matrix Λ. Using the usual model assumptions (see, e.g., Basilevsky, 1994), the factor analysis model (1) can be written as Cov(y) = ΛΛ^T + ψ (2), where ψ = Cov(e) has a diagonal form. The diagonal elements of ψ as well as the loadings matrix Λ are estimated from an estimate of Cov(y), given observed clr-transformed data Y as realizations of the random vector y. Outliers or deviations from the idealized model assumptions of factor analysis can severely affect the parameter estimation. As a way out, robust estimation of the covariance matrix of Y will lead to robust estimates of Λ and ψ in (2), see Pison et al. (2003). Well-known robust covariance estimators with good statistical properties, like the MCD or the S-estimators (see, e.g., Maronna et al., 2006), rely on a full-rank data matrix Y, which is not the case for clr-transformed data (see, e.g., Aitchison, 1986). The isometric logratio (ilr) transformation (Egozcue et al., 2003) solves this singularity problem: the data matrix Y is transformed to a matrix Z by using an orthonormal basis of lower dimension. Using the ilr-transformed data, a robust covariance matrix C(Z) can be estimated. The result can be back-transformed to the clr space by C(Y) = V C(Z) V^T, where the matrix V with orthonormal columns comes from the relation between the clr and the ilr transformation. Now the parameters in model (2) can be estimated (Basilevsky, 1994) and the results have a direct interpretation, since the links to the original variables are still preserved. The above procedure will be applied to data from geochemistry. Our special interest is in comparing the results with those of Reimann et al. (2002) for the Kola project data.
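
A hedged sketch of this workflow, on synthetic 4-part compositions rather than the Kola data: ilr-transform, estimate a robust covariance with the MCD (here via scikit-learn's MinCovDet), and back-transform to clr space with C(Y) = V C(Z) V^T.

```python
# Sketch of the back-transformation step described above, with invented data.
import numpy as np
from sklearn.covariance import MinCovDet

def ilr_basis(d):
    """D x (D-1) matrix V with orthonormal, zero-sum columns (Helmert-type basis)."""
    h = np.zeros((d - 1, d))
    for j in range(1, d):
        h[j - 1, :j] = 1.0
        h[j - 1, j] = -j
        h[j - 1] /= np.sqrt(j * (j + 1))
    return h.T

def clr(parts):
    logp = np.log(np.asarray(parts, dtype=float))
    return logp - logp.mean(axis=1, keepdims=True)

rng = np.random.default_rng(4)
x = rng.dirichlet([8, 4, 2, 1], size=300)            # synthetic 4-part compositions
V = ilr_basis(x.shape[1])
Z = clr(x) @ V                                        # ilr coordinates (full rank, D-1 columns)
C_Z = MinCovDet(random_state=0).fit(Z).covariance_   # robust covariance in ilr space
C_Y = V @ C_Z @ V.T                                   # back-transformed clr-space covariance
print(np.round(C_Y, 3))
```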

Relevance: 30.00%

Abstract:

An emerging consensus in cognitive science views the biological brain as a hierarchically-organized predictive processing system. This is a system in which higher-order regions are continuously attempting to predict the activity of lower-order regions at a variety of (increasingly abstract) spatial and temporal scales. The brain is thus revealed as a hierarchical prediction machine that is constantly engaged in the effort to predict the flow of information originating from the sensory surfaces. Such a view seems to afford a great deal of explanatory leverage when it comes to a broad swathe of seemingly disparate psychological phenomena (e.g., learning, memory, perception, action, emotion, planning, reason, imagination, and conscious experience). In the most positive case, the predictive processing story seems to provide our first glimpse of what a unified (computationally tractable and neurobiologically plausible) account of human psychology might look like. This obviously marks out one reason why such models should be the focus of current empirical and theoretical attention. Another reason, however, is rooted in the potential of such models to advance the current state-of-the-art in machine intelligence and machine learning. Interestingly, the vision of the brain as a hierarchical prediction machine is one that establishes contact with work that goes under the heading of 'deep learning'. Deep learning systems often attempt to make use of predictive processing schemes and (increasingly abstract) generative models as a means of supporting the analysis of large data sets. But are such computational systems sufficient (by themselves) to provide a route to general human-level analytic capabilities? I will argue that they are not, and that closer attention to a broader range of forces and factors (many of which are not confined to the neural realm) may be required to understand what it is that gives human cognition its distinctive (and largely unique) flavour. The vision that emerges is one of 'homomimetic deep learning systems', systems that situate a hierarchically-organized predictive processing core within a larger nexus of developmental, behavioural, symbolic, technological and social influences. Relative to that vision, I suggest that we should see the Web as a form of 'cognitive ecology', one that is as much involved with the transformation of machine intelligence as it is with the progressive reshaping of our own cognitive capabilities.

Relevance: 30.00%

Abstract:

The main premise of Vygotsky’s cultural-historical theory is that to promote learning, and thus development, educators must intervene in, and change, the students’ socio-cultural context. Vygotsky’s theory, however, has been misinterpreted, and the opposite approach has been accepted: the teaching is adapted according to the context. The result is widespread failure in schools. This article reclaims the true transformative meaning of Vygotskian theory and shows how successful schools in several countries implement various actions to transform their social and cultural environment. Data are presented from six case studies of successful schools conducted in five European countries. The analysis shows that these actions improve instrumental learning and, consequently, cognitive development. All these efforts focus on teaching methods that aim to increase the amount that students learn.

Relevance: 30.00%

Abstract:

Asymmetry in a distribution can arise from a long tail of values in the underlying process or from outliers that belong to another population and contaminate the primary process. The first paper of this series examined the effects of the former on the variogram, and this paper examines the effects of asymmetry arising from outliers. Simulated annealing was used to create normally distributed random fields of different sizes that are realizations of known processes described by variograms with different nugget:sill ratios. These primary data sets were then contaminated with randomly located and spatially aggregated outliers from a secondary process to produce different degrees of asymmetry. Experimental variograms were computed from these data by Matheron's estimator and by three robust estimators. The effects of standard data transformations on the coefficient of skewness and on the variogram were also investigated. Cross-validation was used to assess the performance of models fitted to experimental variograms computed from a range of data contaminated by outliers for kriging. The results showed that where skewness was caused by outliers the variograms retained their general shape, but showed an increase in the nugget and sill variances and in the nugget:sill ratios. This effect was only slightly greater for the smallest data set than for the two larger data sets, and there was little difference between the results for the latter. Overall, the effect of the size of the data set was small for all analyses. The nugget:sill ratio showed a consistent decrease after transformation to both square roots and logarithms; the decrease was generally larger for the latter, however. Aggregated outliers had different effects on the variogram shape from those that were randomly located, and this also depended on whether they were aggregated near the edge or the centre of the field. The results of cross-validation showed that the robust estimators and the removal of outliers were the most effective ways of dealing with outliers for variogram estimation and kriging.
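
As a small illustration (not the paper's code), the sketch below computes an experimental variogram of a synthetic, outlier-contaminated 1-D transect with Matheron's classical estimator and with the robust Cressie-Hawkins estimator, showing how outliers inflate the former more strongly.

```python
# Experimental variogram of a synthetic 1-D transect: Matheron vs Cressie-Hawkins.
import numpy as np

def variograms(z, max_lag):
    """Return lags, Matheron and Cressie-Hawkins semivariance estimates."""
    lags = np.arange(1, max_lag + 1)
    matheron, cressie = [], []
    for h in lags:
        d = z[h:] - z[:-h]                         # all pairs separated by lag h
        n = d.size
        matheron.append(0.5 * np.mean(d**2))
        ch = np.mean(np.sqrt(np.abs(d))) ** 4 / (2 * (0.457 + 0.494 / n))
        cressie.append(ch)
    return lags, np.array(matheron), np.array(cressie)

rng = np.random.default_rng(5)
z = np.cumsum(rng.normal(0, 1, 200)) * 0.1 + rng.normal(0, 0.5, 200)  # correlated field
z[rng.choice(200, 5, replace=False)] += 8.0        # contaminate with 5 randomly located outliers
lags, g_m, g_ch = variograms(z, 10)
for h, a, b in zip(lags, g_m, g_ch):
    print(f"lag {h:2d}: Matheron = {a:6.2f}, Cressie-Hawkins = {b:6.2f}")
```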

Relevance: 30.00%

Abstract:

Sequential crystallization of poly(L-lactide) (PLLA) followed by poly(epsilon-caprolactone) (PCL) in double crystalline PLLA-b-PCL diblock copolymers is studied by differential scanning calorimetry (DSC), polarized optical microscopy (POM), wide-angle X-ray scattering (WAXS) and small-angle X-ray scattering (SAXS). Three samples with different compositions are studied. The sample with the shortest PLLA block (32 wt.-% PLLA) crystallizes from a homogeneous melt, the other two (with 44 and 60% PLLA) from microphase separated structures. The microphase structure of the melt is changed as PLLA crystallizes at 122 degrees C (a temperature at which the PCL block is molten), forming spherulites regardless of composition, even with 32% PLLA. SAXS indicates that a lamellar structure with a different periodicity than that obtained in the melt forms (for melt-segregated samples). Where PCL is the majority block, PCL crystallization at 42 degrees C following PLLA crystallization leads to rearrangement of the lamellar structure, as observed by SAXS, possibly due to local melting at the interphases between domains. POM results showed that PCL crystallizes within previously formed PLLA spherulites. WAXS data indicate that the PLLA unit cell is modified by crystallization of PCL, at least for the two majority-PCL samples. The PCL minority sample did not crystallize at 42 degrees C (well below the PCL homopolymer crystallization temperature), pointing to the influence of pre-crystallization of PLLA on PCL crystallization, although it did crystallize at lower temperature. Crystallization kinetics were examined by DSC and WAXS, with good agreement in general. The crystallization rate of PLLA decreased with increasing PCL content in the copolymers. The crystallization rate of PCL decreased with increasing PLLA content. The Avrami exponents were in general depressed for both components in the block copolymers compared to the parent homopolymers. Polarized optical micrographs during isothermal crystallization of (a) homo-PLLA, (b) homo-PCL, (c) and (d) block copolymer after 30 min at 122 degrees C and after 15 min at 42 degrees C.

Relevance: 30.00%

Abstract:

A catemeric crystal structure of cyheptamide undergoes a transformation in the solid state upon heating to produce a dimer-based form whose structure has been determined from laboratory X-ray powder diffraction (XRPD) data, thereby providing the first conclusive evidence of a carbamazepine analogue crystallising in both hydrogen-bonded motifs.

Relevance: 30.00%

Abstract:

A modified radial basis function (RBF) neural network and its identification algorithm based on observational data with heterogeneous noise are introduced. The Box-Cox transformed system output is represented by the RBF neural network. To identify the model from observational data, the singular value decomposition of the full regression matrix, consisting of basis functions formed from the system input data, is first carried out, and a new fast identification method is then developed using the Gauss-Newton algorithm to derive the required Box-Cox transformation, based on a maximum likelihood estimator (MLE) for a model base spanned by the largest eigenvectors. Finally, a sparse Box-Cox transformation-based RBF neural network with good generalisation is identified, based on the derived optimal Box-Cox transformation, using an orthogonal forward regression algorithm with a pseudo-PRESS statistic for model selection. The proposed algorithm and its efficacy are demonstrated with numerical examples.
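
A greatly simplified sketch of the idea, not the paper's algorithm: the Box-Cox exponent is chosen here by a coarse maximum-likelihood grid search instead of the Gauss-Newton/MLE procedure, and the RBF weights are obtained by plain least squares rather than orthogonal forward regression.

```python
# Pick a Box-Cox exponent for the output by a coarse ML grid search, then fit a
# Gaussian RBF model to the transformed output by ordinary least squares.
import numpy as np

def boxcox(y, lam):
    return np.log(y) if abs(lam) < 1e-12 else (y**lam - 1.0) / lam

def boxcox_loglik(y, lam):
    """Profile log-likelihood of the Box-Cox exponent for a positive sample y."""
    z = boxcox(y, lam)
    return -0.5 * len(y) * np.log(np.var(z)) + (lam - 1.0) * np.sum(np.log(y))

def rbf_design(x, centres, width):
    return np.exp(-(x[:, None] - centres[None, :])**2 / (2 * width**2))

rng = np.random.default_rng(6)
x = np.linspace(0, 1, 120)
y = np.exp(1.5 * np.sin(2 * np.pi * x)) + rng.gamma(2.0, 0.1, x.size)  # positive, skewed output

lams = np.linspace(-1, 2, 31)
lam = lams[np.argmax([boxcox_loglik(y, l) for l in lams])]   # grid MLE, not Gauss-Newton
centres = np.linspace(0, 1, 10)
Phi = rbf_design(x, centres, width=0.1)
w, *_ = np.linalg.lstsq(Phi, boxcox(y, lam), rcond=None)     # least squares in transformed space
print(f"selected lambda = {lam:.2f}, first weights = {np.round(w[:3], 2)}")
```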

Relevance: 30.00%

Abstract:

The Group on Earth Observations System of Systems, GEOSS, is a co-ordinated initiative by many nations to address the needs for earth-system information expressed by the 2002 World Summit on Sustainable Development. We discuss the role of earth-system modelling and data assimilation in transforming earth-system observations into the predictive and status-assessment products required by GEOSS, across many areas of socio-economic interest. First we review recent gains in the predictive skill of operational global earth-system models, on time-scales of days to several seasons. We then discuss recent work to develop from the global predictions a diverse set of end-user applications which can meet GEOSS requirements for information of socio-economic benefit; examples include forecasts of coastal storm surges, floods in large river basins, seasonal crop yield forecasts and seasonal lead-time alerts for malaria epidemics. We note ongoing efforts to extend operational earth-system modelling and assimilation capabilities to atmospheric composition, in support of improved services for air-quality forecasts and for treaty assessment. We next sketch likely GEOSS observational requirements in the coming decades. In concluding, we reflect on the cost of earth observations relative to the modest cost of transforming the observations into information of socio-economic value.

Relevance: 30.00%

Abstract:

Variational data assimilation systems for numerical weather prediction rely on a transformation of model variables to a set of control variables that are assumed to be uncorrelated. Most implementations of this transformation are based on the assumption that the balanced part of the flow can be represented by the vorticity. However, this assumption is likely to break down in dynamical regimes characterized by low Burger number. It has recently been proposed that a variable transformation based on potential vorticity should lead to control variables that are uncorrelated over a wider range of regimes. In this paper we test the assumption that a transform based on vorticity and one based on potential vorticity produce an uncorrelated set of control variables. Using a shallow-water model we calculate the correlations between the transformed variables in the different methods. We show that the control variables resulting from a vorticity-based transformation may retain large correlations in some dynamical regimes, whereas a potential-vorticity-based transformation successfully produces a set of uncorrelated control variables. Calculations of spatial correlations show that the benefit of the potential vorticity transformation is linked to its ability to capture more accurately the balanced component of the flow.
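
The sketch below, on synthetic near-geostrophic shallow-water fields, only illustrates the kind of diagnostic involved: computing relative and potential vorticity by finite differences and measuring a sample correlation between candidate fields. It is not the paper's control-variable transform or its shallow-water experiments.

```python
# Build synthetic shallow-water fields on a periodic grid, compute relative and
# potential vorticity by centred differences, and report field correlations.
import numpy as np

n, L, f0 = 64, 1.0e6, 1.0e-4                      # grid size, domain length (m), Coriolis
dx = L / n
x = np.arange(n) * dx
X, Y = np.meshgrid(x, x, indexing="ij")

rng = np.random.default_rng(7)
psi = 1.0e4 * np.sin(2 * np.pi * X / L) * np.cos(2 * np.pi * Y / L)   # streamfunction
u = -(np.roll(psi, -1, 1) - np.roll(psi, 1, 1)) / (2 * dx)            # u = -dpsi/dy
v = (np.roll(psi, -1, 0) - np.roll(psi, 1, 0)) / (2 * dx)             # v =  dpsi/dx
h = 500.0 + (f0 / 9.81) * psi + rng.normal(0, 0.5, psi.shape)         # near-balanced depth

zeta = (np.roll(v, -1, 0) - np.roll(v, 1, 0)) / (2 * dx) \
     - (np.roll(u, -1, 1) - np.roll(u, 1, 1)) / (2 * dx)              # relative vorticity
q = (zeta + f0) / h                                                    # shallow-water PV

def corr(a, b):
    a, b = a - a.mean(), b - b.mean()
    return float(np.sum(a * b) / np.sqrt(np.sum(a * a) * np.sum(b * b)))

print("corr(zeta, h) =", round(corr(zeta, h), 3))
print("corr(q, h)    =", round(corr(q, h), 3))
```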

Relevance: 30.00%

Abstract:

The Normal Quantile Transform (NQT) has been used in many hydrological and meteorological applications in order to make the Cumulative Distribution Function (CDF) of the observed, simulated and forecast river discharge, water level or precipitation data Gaussian. It is also at the heart of the meta-Gaussian model for assessing the total predictive uncertainty of the Hydrological Uncertainty Processor (HUP) developed by Krzysztofowicz. In the field of geostatistics this transformation is better known as the Normal-Score Transform. In this paper some possible problems caused by small sample sizes when applying the NQT in flood forecasting systems will be discussed, and a novel way to solve the problem will be outlined by combining extreme value analysis and non-parametric regression methods. The method will be illustrated by examples of hydrological stream-flow forecasts.
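
A minimal sketch of the basic NQT (normal-score transform) itself, mapping a synthetic skewed streamflow sample to standard-normal scores via plotting positions; the paper's extreme-value and non-parametric extensions are not shown.

```python
# Basic Normal Quantile Transform: map each observation to the standard normal
# quantile of its plotting position.
import numpy as np
from scipy.stats import norm, rankdata

def nqt(x):
    """Transform a sample to standard-normal scores via Weibull plotting positions."""
    x = np.asarray(x, dtype=float)
    ranks = rankdata(x)                    # average ranks handle ties
    p = ranks / (len(x) + 1.0)             # Weibull plotting position, avoids p = 0 or 1
    return norm.ppf(p)

rng = np.random.default_rng(8)
discharge = rng.gamma(shape=2.0, scale=50.0, size=30)   # synthetic skewed streamflow (m^3/s)
z = nqt(discharge)
print(np.round(z[:5], 2), "mean =", round(z.mean(), 2), "std =", round(z.std(), 2))
```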