43 resultados para separation of variables
Resumo:
Descriptors based on Molecular Interaction Fields (MIF) are highly suitable for drug discovery, but their size (thousands of variables) often limits their application in practice. Here we describe a simple and fast computational method that extracts from a MIF a handful of highly informative points (hot spots) which summarize the most relevant information. The method was specifically developed for drug discovery, is fast, and does not require human supervision, being suitable for its application on very large series of compounds. The quality of the results has been tested by running the method on the ligand structure of a large number of ligand-receptor complexes and then comparing the position of the selected hot spots with actual atoms of the receptor. As an additional test, the hot spots obtained with the novel method were used to obtain GRIND-like molecular descriptors which were compared with the original GRIND. In both cases the results show that the novel method is highly suitable for describing ligand-receptor interactions and compares favorably with other state-of-the-art methods.
Resumo:
We offer a formulation that locates hubs on a network in a competitiveenvironment; that is, customer capture is sought, which happenswhenever the location of a new hub results in a reduction of thecurrent cost (time, distance) needed by the traffic that goes from thespecified origin to the specified destination.The formulation presented here reduces the number of variables andconstraints as compared to existing covering models. This model issuited for both air passenger and cargo transportation.In this model, each origin-destination flow can go through either oneor two hubs, and each demand point can be assigned to more than a hub,depending on the different destinations of its traffic. Links(``spokes'' have no capacity limit. Computational experience is provided.
Resumo:
I study the optimal project choice when the principal relies on the agent in charge of production for project evaluation. The principal has to choose between a safe project generating a fixed revenue and a risky project generating an uncertain revenue. The agent has private information about the production cost under each project but also about the signal regarding the profitability of the risky project. If the signal favoring the adoption of the risky project is goods news to the agent, integrating production and project evaluation tasks does not generate any loss compared to the benchmark in which the principal herself receives the signal. By contrast, if it is bad news, task integration creates an endogenous reservation utility which is type-dependent and thereby generates countervailing incentives, which can make a bias toward either project optimal. Our results can offer an explanation for why good firms can go bad and a rationale for the separation of day-to-day operating decisions from long-term strategic decisions stressed by Williamson.
Resumo:
The generalization of simple correspondence analysis, for two categorical variables, to multiple correspondence analysis where they may be three or more variables, is not straighforward, both from a mathematical and computational point of view. In this paper we detail the exact computational steps involved in performing a multiple correspondence analysis, including the special aspects of adjusting the principal inertias to correct the percentages of inertia, supplementary points and subset analysis. Furthermore, we give the algorithm for joint correspondence analysis where the cross-tabulations of all unique pairs of variables are analysed jointly. The code in the R language for every step of the computations is given, as well as the results of each computation.
Resumo:
This paper analyzes the political sustainability of the welfare state in a model where immigration policy is also endogenous. In the model, the skills of the native population are affected by immigration and skill accumulation. Moreover, immigrants affect future policies, once they gain the right to vote. The main finding is that the long-run survival of redistributive policies is linked to an immigration policy specifying both skill and quantity restrictions. In particular, in steady state the unskilled majority admits a limited inflow of unskilled immigrants in order to offset growth in the fraction of skilled voters and maintain a high degree of income redistribution.Interestingly, equilibrium immigration policy shifts from unrestricted skilled immigration,when the country is skill-scarce, to restricted unskilled immigration, as the fraction of native skilled workers increases. The analysis also suggests a new set of variables that may help explain international differences in immigration restrictions.
Resumo:
Structural equation models (SEM) are commonly used to analyze the relationship between variables some of which may be latent, such as individual ``attitude'' to and ``behavior'' concerning specific issues. A number of difficulties arise when we want to compare a large number of groups, each with large sample size, and the manifest variables are distinctly non-normally distributed. Using an specific data set, we evaluate the appropriateness of the following alternative SEM approaches: multiple group versus MIMIC models, continuous versus ordinal variables estimation methods, and normal theory versus non-normal estimation methods. The approaches are applied to the ISSP-1993 Environmental data set, with the purpose of exploring variation in the mean level of variables of ``attitude'' to and ``behavior''concerning environmental issues and their mutual relationship across countries. Issues of both theoretical and practical relevance arise in the course of this application.
Resumo:
The control and prediction of wastewater treatment plants poses an important goal: to avoid breaking the environmental balance by always keeping the system in stable operating conditions. It is known that qualitative information — coming from microscopic examinations and subjective remarks — has a deep influence on the activated sludge process. In particular, on the total amount of effluent suspended solids, one of the measures of overall plant performance. The search for an input–output model of this variable and the prediction of sudden increases (bulking episodes) is thus a central concern to ensure the fulfillment of current discharge limitations. Unfortunately, the strong interrelationbetween variables, their heterogeneity and the very high amount of missing information makes the use of traditional techniques difficult, or even impossible. Through the combined use of several methods — rough set theory and artificial neural networks, mainly — reasonable prediction models are found, which also serve to show the different importance of variables and provide insight into the process dynamics
Resumo:
Monitoring thunderstorms activity is an essential part of operational weather surveillance given their potential hazards, including lightning, hail, heavy rainfall, strong winds or even tornadoes. This study has two main objectives: firstly, the description of a methodology, based on radar and total lightning data to characterise thunderstorms in real-time; secondly, the application of this methodology to 66 thunderstorms that affected Catalonia (NE Spain) in the summer of 2006. An object-oriented tracking procedure is employed, where different observation data types generate four different types of objects (radar 1-km CAPPI reflectivity composites, radar reflectivity volumetric data, cloud-to-ground lightning data and intra-cloud lightning data). In the framework proposed, these objects are the building blocks of a higher level object, the thunderstorm. The methodology is demonstrated with a dataset of thunderstorms whose main characteristics, along the complete life cycle of the convective structures (development, maturity and dissipation), are described statistically. The development and dissipation stages present similar durations in most cases examined. On the contrary, the duration of the maturity phase is much more variable and related to the thunderstorm intensity, defined here in terms of lightning flash rate. Most of the activity of IC and CG flashes is registered in the maturity stage. In the development stage little CG flashes are observed (2% to 5%), while for the dissipation phase is possible to observe a few more CG flashes (10% to 15%). Additionally, a selection of thunderstorms is used to examine general life cycle patterns, obtained from the analysis of normalized (with respect to thunderstorm total duration and maximum value of variables considered) thunderstorm parameters. Among other findings, the study indicates that the normalized duration of the three stages of thunderstorm life cycle is similar in most thunderstorms, with the longest duration corresponding to the maturity stage (approximately 80% of the total time).
Resumo:
Using the experimental data of Paret and Tabeling [Phys. Rev. Lett. 79, 4162 (1997)] we consider in detail the dispersion of particle pairs by a two-dimensional turbulent flow and its relation to the kinematic properties of the velocity field. We show that the mean square separation of a pair of particles is governed by rather rare, extreme events and that the majority of initially close pairs are not dispersed by the flow. Another manifestation of the same effect is the fact that the dispersion of an initially dense cluster is not the result of homogeneously spreading the particles within the whole system. Instead it proceeds through a splitting into smaller but also dense clusters. The statistical nature of this effect is discussed.
Resumo:
We present a nonequlibrium approach for the study of a flexible bilayer whose two components induce distinct curvatures. In turn, the two components are interconverted by an externally promoted reaction. Phase separation of the two species in the surface results in the growth of domains characterized by different local composition and curvature modulations. This domain growth is limited by the effective mixing due to the interconversion reaction, leading to a finite characteristic domain size. In addition to these effects, first introduced in our earlier work [ Phys. Rev. E 71 051906 (2005)], the important new feature is the assumption that the reactive process actively affects the local curvature of the bilayer. Specifically, we suggest that a force energetically activated by external sources causes a modification of the shape of the membrane at the reaction site. Our results show the appearance of a rich and robust dynamical phenomenology that includes the generation of traveling and/or oscillatory patterns. Linear stability analysis, amplitude equations, and numerical simulations of the model kinetic equations confirm the occurrence of these spatiotemporal behaviors in nonequilibrium reactive bilayers.
Resumo:
Capsula and seed morphology of W. European species of Euphorbia aggr. flavicoma has been studied. A total of 1500 seeds coming from 13 taxa have been investigated under light microscope, scanning electron microscope and binocular stereoscope. Data were processed by multivariate analysis and the corresponding dendrogram is presented. At de end of the paper, a key is presented allowing to the separation of taxa down to the species level.
Resumo:
The present study discusses retention criteria for principal components analysis (PCA) applied to Likert scale items typical in psychological questionnaires. The main aim is to recommend applied researchers to restrain from relying only on the eigenvalue-than-one criterion; alternative procedures are suggested for adjusting for sampling error. An additional objective is to add evidence on the consequences of applying this rule when PCA is used with discrete variables. The experimental conditions were studied by means of Monte Carlo sampling including several sample sizes, different number of variables and answer alternatives, and four non-normal distributions. The results suggest that even when all the items and thus the underlying dimensions are independent, eigenvalues greater than one are frequent and they can explain up to 80% of the variance in data, meeting the empirical criterion. The consequences of using Kaiser"s rule are illustrated with a clinical psychology example. The size of the eigenvalues resulted to be a function of the sample size and the number of variables, which is also the case for parallel analysis as previous research shows. To enhance the application of alternative criteria, an R package was developed for deciding the number of principal components to retain by means of confidence intervals constructed about the eigenvalues corresponding to lack of relationship between discrete variables.
Resumo:
Aquest Treball Final de Grau aporta els resultats d’un estudi sobre els efectes i condicionats que suposa pel rendiment acadèmic dels alumnes de cicle superior de primària el divorci o la separació dels seus progenitors. Ens trobem davant l’augment del nombre de trencaments familiars, que ha esdevingut un fenomen clarament observable en la societat, es tracta d’un fenomen complex, en el qual entren en joc nombroses variables. I el trencament també suposa conseqüències socials, en primer lloc, pels fills/es. En una primera part ens endinsem en les aportacions i teories que han defensat diversos experts al llarg del temps sobre aquest fet. En una segona part es presenten les conclusions i els acords extrets de diverses entrevistes amb mestres d’una escola local en relació a la possible vinculació entre trencament familiar i rendiment acadèmic. I, per últim, s’acaben comparat les visions dels autors teòrics amb les aportacions i visions dels educadors professionals; per arribar a les principals conclusions que, no es pot generalitzar els efectes negatius de la ruptura, cal veure també les possibilitats positives del moment i, per últim, destacar que la manera amb la qual la família s'afronta a la ruptura té una importància crucial a l'hora de determinar l'impacte d'aquesta la ruptura en els fills.
Resumo:
In soccer, dead-ball moves are those in which the ball is returned to play from a stationary position following an interruption of play. The aim of this study was to analyse the effectiveness of one such dead-ball move, namely corner kicks, and to identify the key variables that determine the success of a shot or header following a corner, thereby enabling a model of successful corner kicks to be proposed. We recorded 554 corner kicks performed during the 2010 World Cup in South Africa and carried out a univariate, bivariate and multivariate analysis of the data. The results indicated that corners were of limited effectiveness in terms of the success of subsequent shots or headers. The analysis also revealed a series of variables that were significantly related to one another, and this enabled us to propose an explanatory model. Although this model had limited explanatory power, it nonetheless helps to understand the execution of corner kicks in practical terms.
Resumo:
Broadcast transmission mode in ad hoc networks is critical to manage multihop routing or providing medium accesscontrol (MAC)-layer fairness. In this paper, it is shown that ahigher capacity to exchange information among neighbors may beobtained through a physical-MAC cross-layer design of the broadcastprotocol exploiting signal separation principles. Coherentdetection and separation of contending nodes is possible throughtraining sequences which are selected at random from a reducedset. Guidelines for the design of this set are derived for a lowimpact on the network performance and the receiver complexity.