Biblioteca Digital

794 resultados para Non Parametric Methodology

Hybrid moving block bootstrap for stochastic simulation of multi-site multi-season streamflows

Relevância:

80.00% 80.00%

Publicador:

Resumo:

The Hybrid approach introduced by the authors for at-site modeling of annual and periodic streamflows in earlier works is extended to simulate multi-site multi-season streamflows. It bears significance in integrated river basin planning studies. This hybrid model involves: (i) partial pre-whitening of standardized multi-season streamflows at each site using a parsimonious linear periodic model; (ii) contemporaneous resampling of the resulting residuals with an appropriate block size, using moving block bootstrap (non-parametric, NP) technique; and (iii) post-blackening the bootstrapped innovation series at each site, by adding the corresponding parametric model component for the site, to obtain generated streamflows at each of the sites. It gains significantly by effectively utilizing the merits of both parametric and NP models. It is able to reproduce various statistics, including the dependence relationships at both spatial and temporal levels without using any normalizing transformations and/or adjustment procedures. The potential of the hybrid model in reproducing a wide variety of statistics including the run characteristics, is demonstrated through an application for multi-site streamflow generation in the Upper Cauvery river basin, Southern India. (C) 2004 Elsevier B.V. All rights reserved.

First simultaneous measurement of the top quark mass in the lepton+jets and dilepton channels at CDF

Relevância:

80.00% 80.00%

Publicador:

Resumo:

We present a measurement of the mass of the top quark using data corresponding to an integrated luminosity of 1.9fb^-1 of ppbar collisions collected at sqrt{s}=1.96 TeV with the CDF II detector at Fermilab's Tevatron. This is the first measurement of the top quark mass using top-antitop pair candidate events in the lepton + jets and dilepton decay channels simultaneously. We reconstruct two observables in each channel and use a non-parametric kernel density estimation technique to derive two-dimensional probability density functions from simulated signal and background samples. The observables are the top quark mass and the invariant mass of two jets from the W decay in the lepton + jets channel, and the top quark mass and the scalar sum of transverse energy of the event in the dilepton channel. We perform a simultaneous fit for the top quark mass and the jet energy scale, which is constrained in situ by the hadronic W boson mass. Using 332 lepton + jets candidate events and 144 dilepton candidate events, we measure the top quark mass to be mtop=171.9 +/- 1.7 (stat. + JES) +/- 1.1 (syst.) GeV/c^2 = 171.9 +/- 2.0 GeV/c^2.

Methods in general model localization

Relevância:

80.00% 80.00%

Publicador:

Resumo:

The aim of this study was to evaluate and test methods which could improve local estimates of a general model fitted to a large area. In the first three studies, the intention was to divide the study area into sub-areas that were as homogeneous as possible according to the residuals of the general model, and in the fourth study, the localization was based on the local neighbourhood. According to spatial autocorrelation (SA), points closer together in space are more likely to be similar than those that are farther apart. Local indicators of SA (LISAs) test the similarity of data clusters. A LISA was calculated for every observation in the dataset, and together with the spatial position and residual of the global model, the data were segmented using two different methods: classification and regression trees (CART) and the multiresolution segmentation algorithm (MS) of the eCognition software. The general model was then re-fitted (localized) to the formed sub-areas. In kriging, the SA is modelled with a variogram, and the spatial correlation is a function of the distance (and direction) between the observation and the point of calculation. A general trend is corrected with the residual information of the neighbourhood, whose size is controlled by the number of the nearest neighbours. Nearness is measured as Euclidian distance. With all methods, the root mean square errors (RMSEs) were lower, but with the methods that segmented the study area, the deviance in single localized RMSEs was wide. Therefore, an element capable of controlling the division or localization should be included in the segmentation-localization process. Kriging, on the other hand, provided stable estimates when the number of neighbours was sufficient (over 30), thus offering the best potential for further studies. Even CART could be combined with kriging or non-parametric methods, such as most similar neighbours (MSN).

Lahopuumäärän ennustaminen ja kartoitus lentokonelaserkeilauksen avulla

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Lahopuun määrästä ja sijoittumisesta ollaan kiinnostuneita paitsi elinympäristöjen monimuotoisuuden, myös ilmakehän hiilen varastoinnin kannalta. Tutkimuksen tavoitteena oli kehittää aluepohjainen laserkeilausdataa hyödyntävä malli lahopuukohteiden paikantamiseksi ja lahopuun määrän estimoimiseksi. Samalla tutkittiin mallin selityskyvyn muuttumista mallinnettavan ruudun kokoa suurennettaessa. Tutkimusalue sijaitsi Itä-Suomessa Sonkajärvellä ja koostui pääasiassa nuorista hoidetuista talousmetsistä. Tutkimuksessa käytettiin harvapulssista laserkeilausdataa sekä kaistoittain mitattua maastodataa kuolleesta puuaineksesta. Aineisto jaettiin siten, että neljäsosa datasta oli käytössä mallinnusta varten ja loput varattiin valmiiden mallien testaamiseen. Lahopuun mallintamisessa käytettiin sekä parametrista että ei-parametrista mallinnusmenetelmää. Logistisen regression avulla erikokoisille (0,04, 0,20, 0,32, 0,52 ja 1,00 ha) ruuduille ennustettiin todennäköisyys lahopuun esiintymiselle. Muodostettujen mallien selittävät muuttujat valittiin 80 laserpiirteen ja näiden muunnoksien joukosta. Mallien selittävät muuttujat valittiin kolmessa vaiheessa. Aluksi muuttujia tarkasteltiin visuaalisesti kuvaamalla ne lahopuumäärän suhteen. Ensimmäisessä vaiheessa sopivimmiksi arvioitujen muuttujien selityskykyä testattiin mallinnuksen toisessa vaiheessa yhden muuttujan mallien avulla. Lopullisessa usean muuttujan mallissa selittävien muuttujien kriteerinä oli tilastollinen merkitsevyys 5 % riskitasolla. 0,20 hehtaarin ruutukoolle luotu malli parametrisoitiin muun kokoisille ruuduille. Logistisella regressiolla toteutetun parametrisen mallintamisen lisäksi, 0,04 ja 1,0 hehtaarin ruutukokojen aineistot luokiteltiin ei-parametrisen CART-mallinnuksen (Classification and Regression Trees) avulla. CARTmenetelmällä etsittiin aineistosta vaikeasti havaittavia epälineaarisia riippuvuuksia laserpiirteiden ja lahopuumäärän välillä. CART-luokittelu tehtiin sekä lahopuustoisuuden että lahopuutilavuuden suhteen. CART-luokituksella päästiin logistista regressiota parempiin tuloksiin ruutujen luokituksessa lahopuustoisuuden suhteen. Logistisella mallilla tehty luokitus parani ruutukoon suurentuessa 0,04 ha:sta(kappa 0,19) 0,32 ha:iin asti (kappa 0,38). 0,52 ha:n ruutukoolla luokituksen kappa-arvo kääntyi laskuun (kappa 0,32) ja laski edelleen hehtaarin ruutukokoon saakka (kappa 0,26). CART-luokitus parani ruutukoon kasvaessa. Luokitustulokset olivat logistista mallinnusta parempia sekä 0,04 ha:n (kappa 0,24) että 1,0 ha:n (kappa 0,52) ruutukoolla. CART-malleilla määritettyjen ruutukohtaisten lahopuutilavuuksien suhteellinen RMSE pieneni ruutukoon kasvaessa. 0,04 hehtaarin ruutukoolla koko aineiston lahopuumäärän suhteellinen RMSE oli 197,1 %, kun hehtaarin ruutukoolla vastaava luku oli 120,3 %. Tämän tutkimuksen tulosten perusteella voidaan todeta, että maastossa mitatun lahopuumäärän ja tutkimuksessa käytettyjen laserpiirteiden yhteys on pienellä ruutukoolla hyvin heikko, mutta vahvistuu hieman ruutukoon kasvaessa. Kun mallinnuksessa käytetty ruutukoko kasvaa, pienialaisten lahopuukeskittymien havaitseminen kuitenkin vaikeutuu. Tutkimuksessa kohteen lahopuustoisuus pystyttiin kartoittamaan kohtuullisesti suurella ruutukoolla, mutta pienialaisten kohteiden kartoittaminen ei onnistunut käytetyillä menetelmillä. Pienialaisten kohteiden paikantaminen laserkeilauksen avulla edellyttää jatkotutkimusta erityisesti tiheäpulssisen laserdatan käytöstä lahopuuinventoinneissa.

Ratkaiseeko raha? : Tulospalkkiojärjestelmään ja työympäristöön liittyvien kokemusten yhteys esimiesten työtyytyväisyyteen ja työmotivaatioon

Relevância:

80.00% 80.00%

Publicador:

Resumo:

The aim of this study was to investigate the relationship between merit pay system and work environment and foremen´s work satisfaction and work motivation. There has been a lot of investigation on rewarding. Less research has been done on previous surveys among the merit pay systems and motivation investigations. According to former surveys, rewarding systems cannot be released from its context. Therefore this survey expanded to deal with work environment. It was also essential to investigate different dimensions of extrinsic and intrinsic motivation and equity of rewarding. Investigation or work motivation and work satisfaction was challenging because both of these concepts have been investigated under quite traditional frame of reference of work motivation theories. In some surveys, the concepts have not been even separated or they have been used even as synonyms. The data were collected with the 193 foremen working in the profit centers of the different chains of the company in the field of retail trade. The questions were: Are the experiences of merit pay system and work environment related to foremen´s work satisfaction and work motivation? Are the backround variables related to foremen´s work satisfaction and work motivation? The data collection was carried out by an electronic inquiry during May 2010. 137 replied from foremen working under merit pay system. The research material was analyzed with PASW-software. Various analyzing methods were used: factor analyses, regression analyses and group of different parametric and non-parametric analyses. In contrast to theoretical framework in the factor analyses work satisfaction and work motivation clustered into the same dimension. As a main result the atmosphere, possibilities to influence and the atmosphere of leading were strongly positively related to foremen´s work satisfaction and work motivation. According to regression analyses these factors were able to explain 55 % of the foremen´s work satisfaction and work motivation. The best explanatory variable was atmosphere. Instead, the backround variables (age, sex, working years, group of profession, education) were not associated with work satisfaction and work motivation.

Microfinance, Efficiency and Agricultural Production in Bangladesh

Relevância:

80.00% 80.00%

Publicador:

Resumo:

The objectives of this study were to make a detailed and systematic empirical analysis of microfinance borrowers and non-borrowers in Bangladesh and also examine how efficiency measures are influenced by the access to agricultural microfinance. In the empirical analysis, this study used both parametric and non-parametric frontier approaches to investigate differences in efficiency estimates between microfinance borrowers and non-borrowers. This thesis, based on five articles, applied data obtained from a survey of 360 farm households from north-central and north-western regions in Bangladesh. The methods used in this investigation involve stochastic frontier (SFA) and data envelopment analysis (DEA) in addition to sample selectivity and limited dependent variable models. In article I, technical efficiency (TE) estimation and identification of its determinants were performed by applying an extended Cobb-Douglas stochastic frontier production function. The results show that farm households had a mean TE of 83% with lower TE scores for the non-borrowers of agricultural microfinance. Addressing institutional policies regarding the consolidation of individual plots into farm units, ensuring access to microfinance, extension education for the farmers with longer farming experience are suggested to improve the TE of the farmers. In article II, the objective was to assess the effects of access to microfinance on household production and cost efficiency (CE) and to determine the efficiency differences between the microfinance participating and non-participating farms. In addition, a non-discretionary DEA model was applied to capture directly the influence of microfinance on farm households production and CE. The results suggested that under both pooled DEA models and non-discretionary DEA models, farmers with access to microfinance were significantly more efficient than their non-borrowing counterparts. Results also revealed that land fragmentation, family size, household wealth, on farm-training and off farm income share are the main determinants of inefficiency after effectively correcting for sample selection bias. In article III, the TE of traditional variety (TV) and high-yielding-variety (HYV) rice producers were estimated in addition to investigating the determinants of adoption rate of HYV rice. Furthermore, the role of TE as a potential determinant to explain the differences of adoption rate of HYV rice among the farmers was assessed. The results indicated that in spite of its much higher yield potential, HYV rice production was associated with lower TE and had a greater variability in yield. It was also found that TE had a significant positive influence on the adoption rates of HYV rice. In article IV, we estimated profit efficiency (PE) and profit-loss between microfinance borrowers and non-borrowers by a sample selection framework, which provided a general framework for testing and taking into account the sample selection in the stochastic (profit) frontier function analysis. After effectively correcting for selectivity bias, the mean PE of the microfinance borrowers and non-borrowers were estimated at 68% and 52% respectively. This suggested that a considerable share of profits were lost due to profit inefficiencies in rice production. The results also demonstrated that access to microfinance contributes significantly to increasing PE and reducing profit-loss per hectare land. In article V, the effects of credit constraints on TE, allocative efficiency (AE) and CE were assessed while adequately controlling for sample selection bias. The confidence intervals were determined by the bootstrap method for both samples. The results indicated that differences in average efficiency scores of credit constrained and unconstrained farms were not statistically significant although the average efficiencies tended to be higher in the group of unconstrained farms. After effectively correcting for selectivity bias, household experience, number of dependents, off-farm income, farm size, access to on farm training and yearly savings were found to be the main determinants of inefficiencies. In general, the results of the study revealed the existence substantial technical, allocative, economic inefficiencies and also considerable profit inefficiencies. The results of the study suggested the need to streamline agricultural microfinance by the microfinance institutions (MFIs), donor agencies and government at all tiers. Moreover, formulating policies that ensure greater access to agricultural microfinance to the smallholder farmers on a sustainable basis in the study areas to enhance productivity and efficiency has been recommended. Key Words: Technical, allocative, economic efficiency, DEA, Non-discretionary DEA, selection bias, bootstrapping, microfinance, Bangladesh.

The Effect of Food-Related Lifestyle on the Choices of Consumers of Five Food Products

Relevância:

80.00% 80.00%

Publicador:

Resumo:

The purpose of this study was to find out whether food-related lifestyle guides and explains product evaluations, specifically, consumer perceptions and choice evaluations of five different food product categories: lettuce, mincemeat, savoury sauce, goat cheese, and pudding. The opinions of consumers who shop in neighbourhood stores were considered most valuable. This study applies means-end chain (MEC) theory, according to which products are seen as means by which consumers attain meaningful goals. The food-related lifestyle (FRL) instrument was created to study lifestyles that reflect these goals. Further, this research has adopted the view that the FRL functions as a script which guides consumer behaviour. Two research methods were used in this study. The first was the laddering interview, the primary aim of which was to gather information for formulating the questionnaire of the main study. The survey consisted of two separate questionnaires. The first was the FRL questionnaire modified for this study. The aim of the other questionnaire was to determine the choice criteria for buying five different categories of food products. Before these analyses could be made, several data modifications were made following MEC analysis procedures. Beside forming FRL dimensions by counting sum-scores from the FRL statements, factor analysis was run in order to elicit latent factors underlying the dimensions. The lifestyle factors found were adventurous, conscientious, enthusiastic, snacking, moderate, and uninvolved lifestyles. The association analyses were done separately for each choice of product as well as for each attribute-consequence linkage with a non-parametric Mann-Whitney U test. The testing variables were FRL dimensions and the FRL lifestyle factors. In addition, the relation between the attribute-consequence linkages and the demographic variables were analysed. Results from this study showed that the choice of product is sequential, so that consumers first categorize products into groups based on specific criteria like health or convenience. It was attested that the food-related lifestyles function as a script in food choice and that the FRL instrument can be used to predict consumer buying behaviour. Certain lifestyles were associated with the choice of each product category. The actual product choice within a product category then appeared to be a different matter. In addition, this study proposes a modification to the FRL instrument. The positive towards advertising FRL dimension was modified to examine many kinds of information search including the internet, TV, magazines, and other people. This new dimension, which was designated as being open to additional information, proved to be very robust and reliable in finding differences in consumer choice behaviour. Active additional information search was linked to adventurous and snacking food-related lifestyles. The results of this study support the previous knowledge that consumers expect to get many benefits simultaneously when they buy food products. This study brought detailed information about the benefits sought, the combination of benefits differing between products and between respondents. Household economy, pleasure and quality were emphasized with the choice of lettuce. Quality was the most significant benefit in choosing mincemeat, but health related benefits were often evaluated as well. The dominant benefits linked to savoury sauce were household economic benefits, expected pleasurable experiences, and a lift in self-respect. The choice of goat cheese appeared not to be an economic decision, self-respect, pleasure, and quality being included in the choice criteria. In choosing pudding, the respondents considered the well-being of family members, and indulged their family members or themselves.

Comparison of AM-FM Based Features For Robust Speech Recognition

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Effective feature extraction for robust speech recognition is a widely addressed topic and currently there is much effort to invoke non-stationary signal models instead of quasi-stationary signal models leading to standard features such as LPC or MFCC. Joint amplitude modulation and frequency modulation (AM-FM) is a classical non-parametric approach to non-stationary signal modeling and recently new feature sets for automatic speech recognition (ASR) have been derived based on a multi-band AM-FM representation of the signal. We consider several of these representations and compare their performances for robust speech recognition in noise, using the AURORA-2 database. We show that FEPSTRUM representation proposed is more effective than others. We also propose an improvement to FEPSTRUM based on the Teager energy operator (TEO) and show that it can selectively outperform even FEPSTRUM

A Fast Linear Separability Test by Projection of Positive Points on Subspaces

Relevância:

80.00% 80.00%

Publicador:

Resumo:

A geometric and non parametric procedure for testing if two finite set of points are linearly separable is proposed. The Linear Separability Test is equivalent to a test that determines if a strictly positive point h > 0 exists in the range of a matrix A (related to the points in the two finite sets). The algorithm proposed in the paper iteratively checks if a strictly positive point exists in a subspace by projecting a strictly positive vector with equal co-ordinates (p), on the subspace. At the end of each iteration, the subspace is reduced to a lower dimensional subspace. The test is completed within r ≤ min(n, d + 1) steps, for both linearly separable and non separable problems (r is the rank of A, n is the number of points and d is the dimension of the space containing the points). The worst case time complexity of the algorithm is O(nr3) and space complexity of the algorithm is O(nd). A small review of some of the prominent algorithms and their time complexities is included. The worst case computational complexity of our algorithm is lower than the worst case computational complexity of Simplex, Perceptron, Support Vector Machine and Convex Hull Algorithms, if d

Vegetative phenology of tropical montane forests in the Nilgiris, South India

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Spatial and temporal variation in foliar phenology plays a significant role in growth and reproduction of a plant species. Foliar phenology is strongly influenced by environmental factors such as rainfall. A study on phenology of tropical montane forests was undertaken in three different forest patches of the Nilgiri Mountains in peninsular India above 2000 meters ASL. Since August 2000, 500 trees belonging to 70 species of angiosperms were monitored for both vegetative and reproductive phenologies on a monthly basis. Climate data were collected from nearby weather stations. This paper reports results of the study from August 2000 - August 2003 on foliar phenology. Non-parametric correlations and multiple regressions were performed to analyse the influence of environmental factors such as rainfall, temperature and sunshine on foliar phenology. It was found that moisture related factors had a negative influence on the leaf initiation. Circular statistical analyses were performed to understand the seasonality in different phenophases of foliar phenology. Different phenophases of leafing were not significantly seasonal. Results are discussed and compared among three different forest patches on the Nilgiri plateau and also with other montane forest patches across the globe.

Comparative statistical aging studies on an oil-pressboard insulation model

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Accelerated aging experiments have been conducted on a representative oil-pressboard insulation model to investigate the effect of constant and sequential stresses on the PD behavior using a built-in phase resolved partial discharge analyzer. A cycle of the applied voltage starting from the zero of the positive half cycle was divided into 16 equal phase windows (Φ1 to Φ16) and partial discharge (PD) magnitude distribution in each phase was determined. Based on the experimental results, three stages of aging mechanism were identified. Gumbel's extreme value distribution of the largest element was used to model the first stage of aging process. Second and subsequent stages were modeled using two-parameter Weibull distribution. Spearman's non-parametric rank correlation test statistic and Kolmogrov-Smirnov two sample test were used to relate the aging process of each phase with the corresponding process of the full cycle. To bring out clearly the effect of stress level, its duration and test procedure on the distribution parameters and hence of the aging process, non-parametric ANOVA techniques like Kruskal-Wallis and Fisher's LSD multiple comparison tests were used. Results of the analysis show that two phases (Φ13 and Φ14) near the vicinity of the negative voltage peak were found to contribute significantly to the aging process and their aging mechanism also correlated well with that of the corresponding full cycle mechanism. Attempts have been made to relate these results with the published work of other workers

MPI-based parallel synchronous vector evaluated particle swarm optimization for multi-objective design optimization of composite structures

Relevância:

80.00% 80.00%

Publicador:

Resumo:

This paper presents a decentralized/peer-to-peer architecture-based parallel version of the vector evaluated particle swarm optimization (VEPSO) algorithm for multi-objective design optimization of laminated composite plates using message passing interface (MPI). The design optimization of laminated composite plates being a combinatorially explosive constrained non-linear optimization problem (CNOP), with many design variables and a vast solution space, warrants the use of non-parametric and heuristic optimization algorithms like PSO. Optimization requires minimizing both the weight and cost of these composite plates, simultaneously, which renders the problem multi-objective. Hence VEPSO, a multi-objective variant of the PSO algorithm, is used. Despite the use of such a heuristic, the application problem, being computationally intensive, suffers from long execution times due to sequential computation. Hence, a parallel version of the PSO algorithm for the problem has been developed to run on several nodes of an IBM P720 cluster. The proposed parallel algorithm, using MPI's collective communication directives, establishes a peer-to-peer relationship between the constituent parallel processes, deviating from the more common master-slave approach, in achieving reduction of computation time by factor of up to 10. Finally we show the effectiveness of the proposed parallel algorithm by comparing it with a serial implementation of VEPSO and a parallel implementation of the vector evaluated genetic algorithm (VEGA) for the same design problem. (c) 2012 Elsevier Ltd. All rights reserved.

Review of trend detection methods and their application to detect temperature changes in India

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Present study performs the spatial and temporal trend analysis of annual, monthly and seasonal maximum and minimum temperatures (t(max), t(min)) in India. Recent trends in annual, monthly, winter, pre-monsoon, monsoon and post-monsoon extreme temperatures (t(max), t(min)) have been analyzed for three time slots viz. 1901-2003,1948-2003 and 1970-2003. For this purpose, time series of extreme temperatures of India as a whole and seven homogeneous regions, viz. Western Himalaya (WH), Northwest (NW), Northeast (NE), North Central (NC), East coast (EC), West coast (WC) and Interior Peninsula (IP) are considered. Rigorous trend detection analysis has been exercised using variety of non-parametric methods which consider the effect of serial correlation during analysis. During the last three decades minimum temperature trend is present in All India as well as in all temperature homogeneous regions of India either at annual or at any seasonal level (winter, pre-monsoon, monsoon, post-monsoon). Results agree with the earlier observation that the trend in minimum temperature is significant in the last three decades over India (Kothawale et al., 2010). Sequential MK test reveals that most of the trend both in maximum and minimum temperature began after 1970 either in annual or seasonal levels. (C) 2012 Elsevier B.V. All rights reserved.

Narrowband signal detection techniques in shallow ocean by acoustic vector sensor array

Relevância:

80.00% 80.00%

Publicador:

Resumo:

This paper presents the formulation and performance analysis of four techniques for detection of a narrowband acoustic source in a shallow range-independent ocean using an acoustic vector sensor (AVS) array. The array signal vector is not known due to the unknown location of the source. Hence all detectors are based on a generalized likelihood ratio test (GLRT) which involves estimation of the array signal vector. One non-parametric and three parametric (model-based) signal estimators are presented. It is shown that there is a strong correlation between the detector performance and the mean-square signal estimation error. Theoretical expressions for probability of false alarm and probability of detection are derived for all the detectors, and the theoretical predictions are compared with simulation results. It is shown that the detection performance of an AVS array with a certain number of sensors is equal to or slightly better than that of a conventional acoustic pressure sensor array with thrice as many sensors.

Dynamic multi-relational Chinese restaurant process for analyzing influences on users in social media

Relevância:

80.00% 80.00%

Publicador:

Resumo:

We study the problem of analyzing influence of various factors affecting individual messages posted in social media. The problem is challenging because of various types of influences propagating through the social media network that act simultaneously on any user. Additionally, the topic composition of the influencing factors and the susceptibility of users to these influences evolve over time. This problem has not been studied before, and off-the-shelf models are unsuitable for this purpose. To capture the complex interplay of these various factors, we propose a new non-parametric model called the Dynamic Multi-Relational Chinese Restaurant Process. This accounts for the user network for data generation and also allows the parameters to evolve over time. Designing inference algorithms for this model suited for large scale social-media data is another challenge. To this end, we propose a scalable and multi-threaded inference algorithm based on online Gibbs Sampling. Extensive evaluations on large-scale Twitter and Face book data show that the extracted topics when applied to authorship and commenting prediction outperform state-of-the-art baselines. More importantly, our model produces valuable insights on topic trends and user personality trends beyond the capability of existing approaches.

«
1
2
...
10
11
12
13
14
15
16
...
52
53
»