53 results for text vector space model
Abstract:
Factor analysis, a frequently used technique for inspecting multivariate data, is also widely used in compositional data analysis. The usual approach is to apply a centered logratio (clr) transformation to obtain the random vector $y$ of dimension $D$. The factor model is then

$$y = \Lambda f + e \tag{1}$$

with the factors $f$ of dimension $k < D$, the error term $e$, and the loadings matrix $\Lambda$. Under the usual model assumptions (see, e.g., Basilevsky, 1994), the factor analysis model (1) can be written as

$$\mathrm{Cov}(y) = \Lambda \Lambda^T + \psi \tag{2}$$

where $\psi = \mathrm{Cov}(e)$ is diagonal. The diagonal elements of $\psi$ and the loadings matrix $\Lambda$ are estimated from an estimate of $\mathrm{Cov}(y)$.

Given observed clr-transformed data $Y$ as realizations of the random vector $y$, outliers or deviations from the idealized model assumptions of factor analysis can severely affect the parameter estimation. As a way out, robust estimation of the covariance matrix of $Y$ leads to robust estimates of $\Lambda$ and $\psi$ in (2); see Pison et al. (2003). Well-known robust covariance estimators with good statistical properties, such as the MCD or the S-estimators (see, e.g., Maronna et al., 2006), rely on a full-rank data matrix $Y$, which is not the case for clr-transformed data (see, e.g., Aitchison, 1986).

The isometric logratio (ilr) transformation (Egozcue et al., 2003) solves this singularity problem. The data matrix $Y$ is transformed to a matrix $Z$ by using an orthonormal basis of lower dimension. Using the ilr-transformed data, a robust covariance matrix $C(Z)$ can be estimated. The result can be back-transformed to the clr space by

$$C(Y) = V \, C(Z) \, V^T$$

where the matrix $V$ with orthonormal columns comes from the relation between the clr and the ilr transformation. The parameters in model (2) can now be estimated (Basilevsky, 1994), and the results have a direct interpretation because the links to the original variables are preserved.

The above procedure will be applied to data from geochemistry. Our special interest is in comparing the results with those of Reimann et al. (2002) for the Kola project data.
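The clr-to-ilr-and-back step described above can be sketched in a few lines of NumPy. This is a minimal illustration under stated assumptions, not the authors' implementation: the function names `clr` and `ilr_basis`, the Helmert-type choice of basis, and the simulated Dirichlet data are all hypothetical, and the classical `np.cov` stands in for the robust estimator (such as the MCD) that the abstract calls for.

```python
import numpy as np

def clr(X):
    """Centered logratio transform of compositions stored row-wise."""
    logX = np.log(X)
    return logX - logX.mean(axis=1, keepdims=True)

def ilr_basis(D):
    """One possible D x (D-1) matrix V with orthonormal columns,
    each orthogonal to the vector of ones (Helmert-type contrasts)."""
    V = np.zeros((D, D - 1))
    for i in range(1, D):
        V[:i, i - 1] = 1.0 / i
        V[i, i - 1] = -1.0
        V[:, i - 1] *= np.sqrt(i / (i + 1.0))
    return V

# Simulated compositional data (assumption: 200 samples, D = 5 parts).
rng = np.random.default_rng(0)
D = 5
X = rng.dirichlet(5.0 * np.ones(D), size=200)

Y = clr(X)            # clr data: rows sum to zero, so Cov(Y) is singular
V = ilr_basis(D)
Z = Y @ V             # ilr coordinates: full rank, dimension D - 1

# Placeholder: a robust covariance estimator would be applied to Z here.
C_Z = np.cov(Z, rowvar=False)

# Back-transformation to the clr space: C(Y) = V C(Z) V^T
C_Y = V @ C_Z @ V.T
```

With the classical covariance used here, the back-transformed `C_Y` coincides with the direct covariance of `Y`, which is a convenient sanity check on `V`; with a robust estimator the two would generally differ.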
Abstract:
We investigate whether dimensionality reduction using a latent generative model is beneficial for the task of weakly supervised scene classification. In detail, we are given a set of labeled images of scenes (for example, coast, forest, city, river, etc.), and our objective is to classify a new image into one of these categories. Our approach consists of first discovering latent "topics" using probabilistic Latent Semantic Analysis (pLSA), a generative model from the statistical text literature here applied to a bag of visual words representation for each image, and subsequently training a multiway classifier on the topic distribution vector for each image. We compare this approach to that of representing each image by a bag of visual words vector directly and training a multiway classifier on these vectors. To this end, we introduce a novel vocabulary using dense color SIFT descriptors and then investigate the classification performance under changes in the size of the visual vocabulary, the number of latent topics learned, and the type of discriminative classifier used (k-nearest neighbor or SVM). We achieve superior classification performance to recent publications that have used a bag of visual words representation, in all cases, using the authors' own data sets and testing protocols. We also investigate the gain in adding spatial information. We show applications to image retrieval with relevance feedback and to scene classification in videos.
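The topic-discovery step can be illustrated with a small pLSA fit by expectation-maximization. This is a minimal sketch, not the authors' code: the function name `plsa`, the random toy count matrix, and the iteration count are assumptions, and the downstream classifier (k-nearest neighbor or SVM) trained on the rows of `p_z_d` is not shown.

```python
import numpy as np

def plsa(N, n_topics, n_iter=50, seed=0):
    """Fit pLSA by EM on a (documents x words) count matrix N.

    Returns P(z|d) with shape (docs, topics) and
    P(w|z) with shape (topics, words).
    """
    rng = np.random.default_rng(seed)
    n_docs, n_words = N.shape
    p_w_z = rng.random((n_topics, n_words))
    p_w_z /= p_w_z.sum(axis=1, keepdims=True)
    p_z_d = rng.random((n_docs, n_topics))
    p_z_d /= p_z_d.sum(axis=1, keepdims=True)
    eps = 1e-12  # guards against division by zero
    for _ in range(n_iter):
        # E-step: P(z|d,w) proportional to P(z|d) * P(w|z)
        joint = p_z_d[:, :, None] * p_w_z[None, :, :]
        post = joint / (joint.sum(axis=1, keepdims=True) + eps)
        # M-step: re-estimate both distributions from expected counts
        weighted = N[:, None, :] * post
        p_w_z = weighted.sum(axis=0)
        p_w_z /= p_w_z.sum(axis=1, keepdims=True) + eps
        p_z_d = weighted.sum(axis=2)
        p_z_d /= p_z_d.sum(axis=1, keepdims=True) + eps
    return p_z_d, p_w_z

# Toy data (assumption): 6 "images", a vocabulary of 8 visual words.
counts = np.random.default_rng(1).integers(1, 10, size=(6, 8)).astype(float)
p_z_d, p_w_z = plsa(counts, n_topics=2, n_iter=20)
```

Each row of `p_z_d` is the low-dimensional topic vector for one image, which would replace the raw bag-of-visual-words histogram as classifier input.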
Abstract:
The front speed of the Neolithic (farmer) spread in Europe decreased as it reached northern latitudes, where the Mesolithic (hunter-gatherer) population density was higher. Here, we describe a reaction-diffusion model with (i) an anisotropic dispersion kernel depending on the Mesolithic population density gradient and (ii) a modified population growth equation. Both effects are related to the space available for the Neolithic population. The model is able to explain the slowdown of the Neolithic front as observed from archaeological data.
Abstract:
Most integrodifference models of biological invasions are based on the nonoverlapping-generations approximation. However, the effect of multiple reproduction events (overlapping generations) on the front speed can be very important, especially for species with a long life span. Only in one-dimensional space has this approximation been relaxed previously, although almost all biological invasions take place in two dimensions. Here we present a model that takes into account the overlapping-generations effect or, more generally, the stage structure of the population, and we analyze the main differences with the corresponding nonoverlapping-generations results.
Abstract:
This paper deals with fault detection and isolation problems for nonlinear dynamic systems. Both problems are stated as constraint satisfaction problems (CSP) and solved using consistency techniques. The main contribution is the isolation method based on consistency techniques and uncertainty space refining of interval parameters. The major advantage of this method is that isolation is fast even when uncertainty in parameters, measurements, and model errors is taken into account. Interval calculations remove the dependence on the monotonicity assumption made by several observer-based approaches to fault isolation. An application to a well-known alcoholic fermentation process model is presented.
Abstract:
An implicitly parallel method for integral-block driven restricted active space self-consistent field (RASSCF) algorithms is presented. The approach is based on a model space representation of the RAS active orbitals with an efficient expansion of the model subspaces. The applicability of the method is demonstrated with a RASSCF investigation of the first two excited states of indole
Abstract:
Study carried out during a stay at the Institut de Génétique Moléculaire de Montpellier, France, between 2010 and 2012. This project evaluated the advantages of canine adenovirus type 2 (CAV2) vectors as gene transfer vectors to the central nervous system (CNS) in a non-human primate model and in a canine model of Sly syndrome (mucopolysaccharidosis type 7, MPS VII), a monogenic disease involving neurodegeneration. The first part of the project evaluated the biodistribution, efficacy, and duration of transgene expression in a non-human primate model (Microcebus murinus). The vector used was a first-generation CAV2 expressing green fluorescent protein (CAVGFP). The results presented in this report show that in non-human primates, as in other species previously tested by E.J. Kremer's team, intracerebral injection of CAV2 results in extensive transduction of the CNS, with neurons and neuronal precursors being the preferentially transduced cells. Using intracellular vesicles, the canine vectors are transported mostly from the synapses to the neuronal soma; this intracellular transport allows extensive transduction of the CNS from a single intracerebral injection of the viral vectors. The second part of the project evaluated the therapeutic use of CAV2. A helper-dependent vector expressing the β-glucuronidase gene and the green fluorescent protein gene (HD-RIGIE) was injected into the CNS of the canine model of Sly syndrome (MPS VII). Biodistribution and therapeutic efficacy were evaluated. Enzyme activity levels in affected animals injected with the therapeutic vector reached values similar to those of unaffected animals.
In addition, a reduction was observed in the amount of GAGs accumulated in the cells of affected animals treated with the therapeutic vector, demonstrating the therapeutic potential of CAV2 for diseases affecting the CNS. The results presented in this work allow us to say that CAV2 vectors are good therapeutic tools for the treatment of diseases affecting the CNS.
Abstract:
In this article we analyze the reasons, within the context of Spanish industrial relations, for trade union members' active participation in their regional union. The case of Spain is particularly interesting as the unions' main activity, collective bargaining, is a public good. The text, based on research involving a representative survey of members of a regional branch of the "Workers' Commissions" (Comisiones Obreras) trade union, provides empirical evidence that the union presence in the workplace has a significant influence on members' propensity for activism. By contrast, the alternative hypothesis based on instrumental reasons appears of little relevance in the Spanish industrial relations context.
Abstract:
We develop a setting with weak intellectual property rights, where firms' boundaries, location and knowledge spillovers are endogenous. We have two main results. The first one is that, if communication costs increase with distance, entrepreneurs concerned about information leakage benefit from locating away from the industry center: distance is an obstacle to collusive trades between members and non-members. The second result is that we identify a trade-off for the entrepreneur between owning a facility (controlling all its characteristics) and sharing a facility with a non-member (an agent not involved in production), therefore losing control over some of its characteristics. We focus on "location" as the relevant characteristic of the facility, but location can be used as a spatial metaphor for other relevant characteristics of the facility. For the entrepreneur, sharing the facility with non-members implies that the latter, as co-owners, know the location (even if they do not have access to it). The co-owners' knowledge of the location facilitates collusion with employees, which increases leakage. The model yields a benefit for new plants from spatial dispersion (locating at the periphery of the industry), particularly so for new plants of new firms. We relate this result to recent empirical findings on the dynamics of industry location.
Abstract:
The paper proposes a numerical solution method for general equilibrium models with a continuum of heterogeneous agents, which combines elements of projection and of perturbation methods. The basic idea is to solve first for the stationary solution of the model, without aggregate shocks but with fully specified idiosyncratic shocks. Afterwards one computes a first-order perturbation of the solution in the aggregate shocks. This approach makes it possible to include a high-dimensional representation of the cross-sectional distribution in the state vector. The method is applied to a model of household saving with uninsurable income risk and liquidity constraints. The model includes not only productivity shocks, but also shocks to redistributive taxation, which cause substantial short-run variation in the cross-sectional distribution of wealth. If those shocks are operative, it is shown that a solution method based on very few statistics of the distribution is not suitable, while the proposed method can solve the model with high accuracy, at least for the case of small aggregate shocks. Techniques are discussed to reduce the dimension of the state space such that higher-order perturbations are feasible. Matlab programs to solve the model can be downloaded.
Abstract:
This paper presents a general equilibrium model of money demand where the velocity of money changes in response to endogenous fluctuations in the interest rate. The parameter space can be divided into two subsets: one where velocity is constant and equal to one as in cash-in-advance models, and another one where velocity fluctuates as in Baumol (1952). Despite its simplicity in terms of parameters to calibrate, the model performs surprisingly well. In particular, it approximates the variability of money velocity observed in the U.S. for the post-war period. The model is then used to analyze the welfare costs of inflation under uncertainty. This application calculates the errors derived from computing the costs of inflation with deterministic models. It turns out that the size of this difference is small, at least for the levels of uncertainty estimated for the U.S. economy.
Abstract:
The changes taking place in universities as a result of adapting degree programmes to the so-called European Higher Education Area (EHEA), which is to become a reality in 2010, also represent a major challenge for university libraries, which are working to adapt their resources and services to the new demands of higher education. Libraries have established organizational and collaborative models that, in an environment marked by the intensive use of information technologies and by the success of search engines such as Google, should make it possible to meet challenges such as: supporting the development of the new competency-based curricula while strengthening and introducing user training in the acquisition of information skills; designing robust information systems that support and add value to the scientific and academic output of researchers and teaching staff through open repositories of information and documentation; personalizing services; and adapting spaces to an educational model centred on the student's active learning. This article summarizes the main actions and future challenges set out in detail in the report commissioned by the Associació Catalana d'Universitats Públiques (ACUP) from the library directors, as part of the preparation of the forthcoming white paper on universities.
Abstract:
A numerical study is presented of the three-dimensional Gaussian random-field Ising model at T=0 driven by an external field. Standard synchronous relaxation dynamics is employed to obtain the magnetization versus field hysteresis loops. The focus is on the analysis of the number and size distribution of the magnetization avalanches. They are classified as being nonspanning, one-dimensional-spanning, two-dimensional-spanning, or three-dimensional-spanning depending on whether or not they span the whole lattice in different space directions. Moreover, finite-size scaling analysis enables identification of two different types of nonspanning avalanches (critical and noncritical) and two different types of three-dimensional-spanning avalanches (critical and subcritical), whose numbers increase with L as a power law with different exponents. We conclude by giving a scenario for avalanche behavior in the thermodynamic limit.
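The field-driven dynamics described above can be sketched as follows. This is a minimal illustration, not the authors' code: it assumes a simple cubic lattice with periodic boundaries, coupling J = 1, Gaussian random fields of standard deviation `sigma`, and an adiabatic protocol in which the external field H is raised just enough to trigger the next flip; the function names and the values of `L` and `sigma` are hypothetical. It records avalanche sizes only and does not classify avalanches as spanning or nonspanning.

```python
import numpy as np

def local_field(s, h, H, J=1.0):
    """Local field J * (sum of 6 neighbours) + h_i + H, periodic boundaries."""
    nb = sum(np.roll(s, shift, axis) for axis in range(3) for shift in (1, -1))
    return J * nb + h + H

def rising_branch(L=6, sigma=2.5, seed=0):
    """Avalanche sizes along the increasing-field branch at T = 0."""
    rng = np.random.default_rng(seed)
    h = rng.normal(0.0, sigma, size=(L, L, L))
    s = -np.ones((L, L, L))        # start fully magnetized down
    sizes = []
    while (s < 0).any():
        # Raise H until the least stable down spin has zero local field.
        f0 = local_field(s, h, 0.0)
        H = -f0[s < 0].max()
        size = 0
        while True:
            # Synchronous step: flip every unstable spin at once.
            unstable = (s < 0) & (local_field(s, h, H) >= 0)
            if not unstable.any():
                break
            s[unstable] = 1.0
            size += int(unstable.sum())
        sizes.append(size)          # one avalanche per field increment
    return sizes

sizes = rising_branch(L=6, sigma=2.5, seed=0)
```

Because the coupling is ferromagnetic and the field only increases, a spin that has flipped up never flips back on this branch, so the avalanche sizes add up to the number of lattice sites.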
Abstract:
A new arena for the dynamics of spacetime is proposed, in which the basic quantum variable is the two-point distance on a metric space. The scaling dimension (that is, the Kolmogorov capacity) in the neighborhood of each point then defines in a natural way a local concept of dimension. We study our model in the region of parameter space in which the resulting spacetime is not too different from a smooth manifold.
Abstract:
Given that the classical referent "Oedipus in search of his identity" has always been acknowledged for Suddenly Last Summer, the author of this article, through a careful analysis of the American playwright's text, proposes reading in this case Cat on a Hot Tin Roof through the model of Sophocles' Oedipus Rex and likewise discovering in it the traditional classical irony, both from the spectator's point of view and from that of the main characters themselves, Brick and his father, both in search of their truth, a truth that is, of course, contrary to the one they expected.