905 resultados para Web Mining, Data Mining, User Topic Model, Web User Profiles
Resumo:
Background: Expressed Sequence Tags (ESTs) are in general used to gain a first insight into gene activities from a species of interest. Subsequently, and typically based on a combination of EST and genome sequences, microarray-based expression analyses are performed for a variety of conditions. In some cases, a multitude of EST and microarray experiments are conducted for one species, covering different tissues, cell states, and cell types. Under these circumstances, the challenge arises to combine results derived from the different expression profiling strategies, with the goal to uncover novel information on the basis of the integrated datasets. Findings: Using our new analysis tool, MediPlEx (MEDIcago truncatula multiPLe EXpression analysis), expression data from EST experiments, oligonucleotide microarrays and Affymetrix GeneChips® can be combined and analyzed, leading to a novel approach to integrated transcriptome analysis. We have validated our tool via the identification of a set of well-characterized AM-specific and AM-induced marker genes, identified by MediPlEx on the basis of in silico and experimental gene expression profiles from roots colonized with AM fungi. Conclusions: MediPlEx offers an integrated analysis pipeline for different sets of expression data generated for the model legume Medicago truncatula. As expected, in silico and experimental gene expression data that cover the same biological condition correlate well. The collection of differentially expressed genes identified via MediPlEx provides a starting point for functional studies in plant mutants.
Resumo:
Observing system experiments (OSEs) are carried out over a 1-year period to quantify the impact of Argo observations on the Mercator Ocean 0.25° global ocean analysis and forecasting system. The reference simulation assimilates sea surface temperature (SST), SSALTO/DUACS (Segment Sol multi-missions dALTimetrie, d'orbitographie et de localisation précise/Data unification and Altimeter combination system) altimeter data and Argo and other in situ observations from the Coriolis data center. Two other simulations are carried out where all Argo and half of the Argo data are withheld. Assimilating Argo observations has a significant impact on analyzed and forecast temperature and salinity fields at different depths. Without Argo data assimilation, large errors occur in analyzed fields as estimated from the differences when compared with in situ observations. For example, in the 0–300 m layer RMS (root mean square) differences between analyzed fields and observations reach 0.25 psu and 1.25 °C in the western boundary currents and 0.1 psu and 0.75 °C in the open ocean. The impact of the Argo data in reducing observation–model forecast differences is also significant from the surface down to a depth of 2000 m. Differences between in situ observations and forecast fields are thus reduced by 20 % in the upper layers and by up to 40 % at a depth of 2000 m when Argo data are assimilated. At depth, the most impacted regions in the global ocean are the Mediterranean outflow, the Gulf Stream region and the Labrador Sea. A significant degradation can be observed when only half of the data are assimilated. Therefore, Argo observations matter to constrain the model solution, even for an eddy-permitting model configuration. The impact of the Argo floats' data assimilation on other model variables is briefly assessed: the improvement of the fit to Argo profiles do not lead globally to unphysical corrections on the sea surface temperature and sea surface height. The main conclusion is that the performance of the Mercator Ocean 0.25° global data assimilation system is heavily dependent on the availability of Argo data.
Resumo:
To compare the accuracy of different forecasting approaches an error measure is required. Many error measures have been proposed in the literature, however in practice there are some situations where different measures yield different decisions on forecasting approach selection and there is no agreement on which approach should be used. Generally forecasting measures represent ratios or percentages providing an overall image of how well fitted the forecasting technique is to the observations. This paper proposes a multiplicative Data Envelopment Analysis (DEA) model in order to rank several forecasting techniques. We demonstrate the proposed model by applying it to the set of yearly time series of the M3 competition. The usefulness of the proposed approach has been tested using the M3-competition where five error measures have been applied in and aggregated to a single DEA score.
Resumo:
This paper proposes a process for the classifi cation of new residential electricity customers. The current state of the art is extended by using a combination of smart metering and survey data and by using model-based feature selection for the classifi cation task. Firstly, the normalized representative consumption profi les of the population are derived through the clustering of data from households. Secondly, new customers are classifi ed using survey data and a limited amount of smart metering data. Thirdly, regression analysis and model-based feature selection results explain the importance of the variables and which are the drivers of diff erent consumption profi les, enabling the extraction of appropriate models. The results of a case study show that the use of survey data signi ficantly increases accuracy of the classifi cation task (up to 20%). Considering four consumption groups, more than half of the customers are correctly classifi ed with only one week of metering data, with more weeks the accuracy is signifi cantly improved. The use of model-based feature selection resulted in the use of a signifi cantly lower number of features allowing an easy interpretation of the derived models.
Resumo:
The scope of the thesis is to broaden the knowledge about axially loaded pipe piles, that can play as foundations for offshore wind turbines based on jacket structures. The goal of the work was pursued by interpreting experimental data on large-scale model piles and by developing numerical tools for the prediction of their monotonic response to tensile and compressive loads to failure. The availability of experimental results on large scale model piles produced in two different campaigns at Fraunhofer IWES (Hannover, Germany) represented the reference for the whole work. Data from CPTs, blow counts during installation and load-displacement curves allowed to develop considerations on the experimental results and comparison with empirical methods from literature, such as CPT-based methods and Load Transfer methods. The understanding of soil-structure interaction mechanisms has been involved in the study in order to better assess the mechanical response of the sand with the scope to help in developing predictive tools of the experiments. A lack of information on the response of Rohsand 3152 when in contact with steel was highlighted, so the necessity of better assessing its response was fulfilled with a comprehensive campaign of interface shear test. It was found how the response of the sand to ultimate conditions evolve with the roughness of the steel, which is a precious information to take account of when attempting the prediction of a pile capacity. Parallel to this topic, the work has developed a numerical modelling procedure that was validated on the available large-scale model piles at IWES. The modelling strategy is intended to build a FE model whose mechanical properties of the sand come from an interpretation of commonly available geotechnical tests. The results of the FE model were compared with other predictive tools currently used in the engineering practice.
Resumo:
The aim of this paper is to present an economical design of an X chart for a short-run production. The process mean starts equal to mu(0) (in-control, State I) and in a random time it shifts to mu(1) > mu(0) (out-of-control, State II). The monitoring procedure consists of inspecting a single item at every m produced ones. If the measurement of the quality characteristic does not meet the control limits, the process is stopped, adjusted, and additional (r - 1) items are inspected retrospectively. The probabilistic model was developed considering only shifts in the process mean. A direct search technique is applied to find the optimum parameters which minimizes the expected cost function. Numerical examples illustrate the proposed procedure. (C) 2009 Elsevier B.V. All rights reserved.
Resumo:
The assessment of groundwater conditions within an unconfined aquifer with a periodic boundary condition is of interest in many hydrological and environmental problems. A two-dimensional numerical model for density dependent variably saturated groundwater flow, SUTRA (Voss, C.I., 1984. SUTRA: a finite element simulation model for saturated-unsaturated, fluid-density dependent ground-water flow with energy transport or chemically reactive single species solute transport. US Geological Survey, National Center, Reston, VA) is modified in order to be able to simulate the groundwater flow in unconfined aquifers affected by a periodic boundary condition. The basic flow equation is changed from pressure-form to mixed-form. The model is also adjusted to handle a seepage-face boundary condition. Experiments are conducted to provide data for the groundwater response to the periodic boundary condition for aquifers with both vertical and sloping faces. The performance of the numerical model is assessed using those data. The results of pressure- and mixed-form approximations are compared and the improvement achieved through the mixed-form of the equation is demonstrated. The ability of the numerical model to simulate the water table and seepage-face is tested by modelling some published experimental data. Finally the numerical model is successfully verified against present experimental results to confirm its ability to simulate complex boundary conditions like the periodic head and the seepage-face boundary condition on the sloping face. (C) 1999 Elsevier Science B.V. All rights reserved.
Resumo:
The object of this article is to estimate demand elasticities for a basket of staple food important for providing the caloric needs of Brazilian households. These elasticities are useful in the measurement of the impact of structural reforms on poverty. A two-stage demand system was constructed, based on data from Household Expenditure Surveys (POF) produced by IBGE (The Brazilian Bureau of Statistics) in 1987/88 and 1995/96. We have used panel data to estimate the model, and have calculated income, own-price, and cross-price elasticities for eight groups of goods and services and, in the second stage, for 11 sub groups of staple food products. We estimated those elasticities for the whole sample of consumers and for two income groups.
Resumo:
This paper addresses the investment decisions considering the presence of financial constraints of 373 large Brazilian firms from 1997 to 2004, using panel data. A Bayesian econometric model was used considering ridge regression for multicollinearity problems among the variables in the model. Prior distributions are assumed for the parameters, classifying the model into random or fixed effects. We used a Bayesian approach to estimate the parameters, considering normal and Student t distributions for the error and assumed that the initial values for the lagged dependent variable are not fixed, but generated by a random process. The recursive predictive density criterion was used for model comparisons. Twenty models were tested and the results indicated that multicollinearity does influence the value of the estimated parameters. Controlling for capital intensity, financial constraints are found to be more important for capital-intensive firms, probably due to their lower profitability indexes, higher fixed costs and higher degree of property diversification.
Resumo:
Variation in larval size has been shown to be an important factor for the post-metamorphic performance of marine invertebrates but, despite its importance, few sources of this variation have been identified. For a range of taxa, offspring size is positively correlated with maternal size but the reasons for this correlation remain unclear. We halved the size of colonies in the bryozoan Bugula neritina 1 wk prior to reproduction (but during embryogenesis) to determine if larval size is a fixed or plastic trait. We manipulated colonies in such a way that the ratio of feeding zooids to reproductive zooids was constant between treatment and control colonies. We found that manipulating colony size strongly affects larval size; halved colonies produced larvae that were similar to13% smaller than those produced by intact colonies. We entered these data into a simple model based on previous work to estimate the likely post-metamorphic consequences of this reduction in larval size. The model predicted that larvae that came from manipulated colonies would suffer similar to300% higher post-metamorphic mortality and similar to50% lower fecundity as adults. Colonies that are faced with a stress appear to be trading off current offspring fitness to maximize their own long-term fitness and this may explain previous observations of compensatory growth in colonial organisms. This study demonstrates that larval size is a surprisingly dynamic trait and strong links exist between the maternal phenotype and the fitness of the offspring. The performance of settling larvae may be determined not only by their larval experience but also by the experience of their mothers.
The states, diffusion, and concentration distribution of water in radiation-formed PVA/PVP hydrogels
Resumo:
Hydrogels with various compositions of polyvinyl alcohol (PVA) and poly(1-vinyl-2-pyrrolidinone) (PVP) were prepared by irradiating mixtures of PVA and PVP in aqueous solutions with gamma-rays from Co-60 sources at room temperature. The states of water in the hydrogels were characterized using DSC and NMR T-2 relaxation measurements and the kinetics of water diffusion in the hydrogels were studied by sorption experiments and NMR imaging. The DSC endothermic peaks in the temperature range -10 to +10 degrees C implied that there are at least two kinds of freezable water present in the matrix. The difference between the total water content and the freezable water content was refer-red to as bound water, which is not freezable. The weight fraction of water at which only nonfreezable water is present in a hydrogel with F-VP = 0.19 has been estimated to be g(H2O)/g(Polymer) = 0.375. From water sorption experiments, it was demonstrated that the early stage of the diffusion of water into the hydrogels was Fickian. A curve-fit of the early-stage experimental data to the Fickian model allowed determination of the water diffusion coefficient, which was found to lie between 1.5 x 10(-11) m(2) s(-1) and 4.5 x 10(-11) m(2) s(-1), depending on the polymer composition, the cross-link density, and the temperature. It was also found that the energy barrier for diffusion of water molecules into PVA/PVP hydrogels was approximate to 24 kJ mol(-1). Additionally, the diffusion coefficients determined from NMR imaging of the volumetric swelling of the gels agreed well with the results obtained by the mass sorption method.
Resumo:
Methods Stepwise regression of annual data was applied to model incidence, calculated based on 91 cases, from lagged variables: antecedent precipitation, air temperature, soil water storage, absolute and relative air humidity, and Southern Oscillation Index (SOI). Results Multiple regression analyses resulted in a model, which explains 49% of the incidence variance, taking into account the absolute air humidity in the year of exposure, soil water storage and SOI of the previous 2 years. Conclusions The correlations may reflect enhanced fungal growth after increase in soil water storage in the longer term and greater spore release with increase in absolute air humidity in the short term.
Resumo:
Pollution by polycyclic aromatic hydrocarbons(PAHs) is widespread due to unsuitable disposal of industrial waste. They are mostly defined as priority pollutants by environmental protection authorities worldwide. Phenanthrene, a typical PAH, was selected as the target in this paper. The PAH-degrading mixed culture, named ZM, was collected from a petroleum contaminated river bed. This culture was injected into phenanthrene solutions at different concentrations to quantify the biodegradation process. Results show near-complete removal of phenanthrene in three days of biodegradation if the initial phenanthrene concentration is low. When the initial concentration is high, the removal rate is increased but 20%-40% of the phenanthrene remains at the end of the experiment. The biomass shows a peak on the third day due to the combined effects of microbial growth and decay. Another peak is evident for cases with a high initial concentration, possibly due to production of an intermediate metabolite. The pH generally decreased during biodegradation because of the production of organic acid. Two phenomenological models were designed to simulate the phenanthrene biodegradation and biomass growth. A relatively simple model that does not consider the intermediate metabolite and its inhibition of phenanthrene biodegradation cannot fit the observed data. A modified Monod model that considered an intermediate metabolite (organic acid) and its inhibiting reversal effect reasonably depicts the experimental results.
Resumo:
An important feature of some conceptual modelling grammars is the features they provide to allow database designers to show real-world things may or may not possess a particular attribute or relationship. In the entity-relationship model, for example, the fact that a thing may not possess an attribute can be represented by using a special symbol to indicate that the attribute is optional. Similarly, the fact that a thing may or may not be involved in a relationship can be represented by showing the minimum cardinality of the relationship as zero. Whether these practices should be followed, however, is a contentious issue. An alternative approach is to eliminate optional attributes and relationships from conceptual schema diagrams by using subtypes that have only mandatory attributes and relationships. In this paper, we first present a theory that led us to predict that optional attributes and relationships should be used in conceptual schema diagrams only when users of the diagrams require a surface-level understanding of the domain being represented by the diagrams. When users require a deep-level understanding, however, optional attributes and relationships should not be used because they undermine users' abilities to grasp important domain semantics. We describe three experiments which we then undertook to test our predictions. The results of the experiments support our predictions.
Resumo:
Recent research support sLocke's (1976) model of facet satisfaction in which the range of affect of objectively defined facet descriptions is moderated by subjective evaluations of facet importance (McFarlin & Rice, 1992). This study examined the utility of Locke's moderated model of face t satisfaction for the prediction of organizationally important global measures of job satisfaction. A large dataset of two groups of workers allowed testing over different time periods and across a broad range of satisfaction measures. The hypothesis derived from Locke's model, that global satisfaction would represent a linear function of facet satisfaction (i.e., facet description x facet importance), was not supported. Instead, a simple (have-want) discrepancy model (operationalized as facet description) provided the most consistent set of predictors. The results suggests that workers, when providing global measures of job satisfaction, may use cognitive heuristics to reduce the complexity of facet description x importance calculations. The implications of these data for Locke's model and directions for future research are outlined.