923 resultados para Generalised Linear Models
Resumo:
La efectividad en el deporte hace referencia al impacto alcanzado por una acción llevada a cabo en condiciones habituales, estando presente en la ejecución de cualquier actividad física, referida a la capacidad para producir el efecto deseado, y está relacionada con la e$cacia, entendida como el efecto de una acción llevada a cabo en las mejores condiciones posibles, y que tiene como objetivo, lograr la meta, o conseguir el triunfo. El objetivo de este trabajo consistió en identificar la relación entre la zona y el tipo de golpe, desde la cual el tenista presenta mayor y menor efectividad en el juego. Para ello se observó a un tenista durante 12 entrenamientos con un rival de nivel equivalente, según la ATP, durante la temporada 2012-2013, registrando su situación en la cancha y el tipo de golpe de todas las devoluciones con éxito, entendido como obtención del punto o recuperación del saque. Se crearon tres criterios categóricos que constituyen un instrumento de observación para registrar el juego del tenista en la zona horizontal, y la zona vertical de la pista, además del tipo de golpe que realiza en términos de drive, revés, smash y dejada. Utilizando la técnica de regresión log-lineal, se obtuvieron resultados que indican que el jugador presenta una menor efectividad en los golpes realizados desde el lado izquierdo, y muestra una mayor efectividad en el drive y revés ejecutados desde media pista o fondo del lado derecho. La interpretación de los resultados aporta información sobre las localizaciones en la pista y los golpes, relacionados con su mayor y menor efectividad.
Resumo:
The dewatering of iron ore concentrates requires large capacity in addition to producing a cake with low moisture content. Such large processes are commonly energy intensive and means to lower the specific energy consumption are needed. Ceramic capillary action disc filters incorporate a novel filter medium enabling the harnessing of capillary action, which results in decreased energy consumption in comparison to traditional filtration technologies. As another benefit, the filter medium is mechanically and chemically more durable than, for example, filter cloths and can, thus, withstand harsh operating conditions and possible regeneration better than other types of filter media. In iron ore dewatering, the regeneration of the filter medium is done through a combination of several techniques: (1) backwashing, (2) ultrasonic cleaning, and (3) acid regeneration. Although it is commonly acknowledged that the filter medium is affected by slurry particles and extraneous compounds, published research, especially in the field of dewatering of mineral concentrates, is scarce. Whereas the regenerative effect of backwashing and ultrasound are more or less mechanical, regeneration with acids is based on chemistry. The chemistry behind the acid regeneration is, naturally, dissolution. The dissolution of iron oxide particles has been extensively studied over several decades but those studies may not necessarily be directly applicable in the regeneration of the filter medium which has undergone interactions with the slurry components. The aim of this thesis was to investigate if free particle dissolution indeed correlates with the regeneration of the filter medium. For this purpose, both free particle dissolution and dissolution of surface adhered particles were studied. The focus was on acidic dissolution of iron oxide particles and on the study of the ceramic filter medium used in the dewatering of iron ore concentrates. The free particle dissolution experiments show that the solubility of synthetic fine grained iron oxide particles in oxalic acid could be explained through linear models accounting for the effects of temperature and acid concentration, whereas the dissolution of a natural magnetite is not so easily explained by such models. In addition, the kinetic experiments performed both support and contradict the work of previous authors: the suitable kinetic model here supports previous research suggesting solid state reduction to be the reaction mechanism of hematite dissolution but the formation of a stable iron oxalate is not supported by the results of this research. Several other dissolution mechanisms have also been suggested for iron oxide dissolution in oxalic acid, indicating that the details of oxalate promoted reductive dissolution are not yet agreed and, in this respect, this research offers added value to the community. The results of the regeneration experiments with the ceramic filter media show that oxalic acid is highly effective in removing iron oxide particles from the surface of the filter medium. The dissolution of those particles did not, however, exhibit the expected behaviour, i.e. complete dissolution. The results of this thesis show that although the regeneration of the ceramic filter medium with acids incorporates the dissolution of slurry particles from the surface of the filter medium, the regeneration cannot be assessed purely based upon free particle dissolution. A steady state, dependent on temperature and on the acid concentration, was observed in the dissolution of particles from the surface even though the limit of solubility of free iron oxide particles had not been reached. Both the regeneration capacity and efficiency, with regards to the removal of iron oxide particles, was found to be temperature dependent, but was not affected by the acid concentration. This observation further suggests that the removal of the surface adhered particles does not follow the dissolution of free particles, which do exhibit a dependency on the acid concentration. In addition, changes in the permeability and in the pore structure of the filter medium were still observed after the bulk concentration of dissolved iron had reached a steady state. Consequently, the regeneration of the filter medium continued after the dissolution of particles from the surface had ceased. This observation suggests that internal changes take place at the final stages of regeneration. The regeneration process could, in theory, be divided into two, possibly overlapping, stages: (1) dissolution of surface-adhered particles, and (2) dissolution of extraneous compounds from within the pore structure. In addition to the fundamental knowledge generated during this thesis, tools to assess the effects of parameters on the regeneration of the ceramic filter medium are needed. It has become clear that the same tools used to estimate the dissolution of free particles cannot be used to estimate the regeneration of a filter medium unless only a robust characterisation of the order of regeneration efficiency is needed.
Resumo:
Tutkimuksen aiheena on yleistynyt luottamus. Väitöskirjassa tutkitaan mistä tuntemattomien kansalaisten toisiinsa kohdistama luottamus kumpuaa ja haetaan vastauksia tähän kysymykseen sekä maakohtaisen että vertailevan tutkimuksen avulla. Tutkimus koostuu yhteenvedon lisäksi viidestä tutkimusartikkelista, joissa luottamuksen syntyä tarkastellaan sekä yksilöiden mikrotason vuorovaikutuksen että maiden välisten eroavaisuuksien näkökulmasta. Yleistyneen luottamuksen synnystä on esitetty useita eri teorioita. Tässä tutkimuksessa tarkastellaan näistä kahta keskeisintä. Osa tutkijoista korostaa kansalaisyhteiskunnan ja ruohonjuuritason verkostojen roolia yleistyneen luottamuksen synnyn taustalla. Tämän hypoteesin mukaan kansalaiset, jotka viettävät aikaansa yhdistyksissä tai muissa sosiaalisissa verkostoissa, oppivat muita helpommin luottamaan paitsi täysin tuntemattomiin ihmisiin myös yhteiskunnallisiin instituutioihin (kansalaisyhteiskuntakeskeinen hypoteesi). Toiset taas painottavat yhteiskunnan julkisten instituutioiden merkitystä. Tämä hypoteesi korostaa instituutioiden reiluutta ja oikeudenmukaisuutta (instituutiokeskeinen hypoteesi). Ihmiset pystyvät luottamaan toisiinsa ja ratkaisemaan kollektiivisia ongelmiaan yhdessä silloin kun esimerkiksi poliittiset ja lainsäädännölliset instituutiot pystyvät luomaan tähän tarvittavan toimintaympäristön. Aineistoina käytetään kansallisia (Hyvinvointi- ja palvelut) sekä kansainvälisiä vertailevia kyselytutkimuksia (European Social Survey ja ISSP). Yksilö- ja makrotason analyyseja yhdistämällä selvitetään yleistynyttä luottamusta selittäviä tekijöitä sekä mekanismeja joiden kautta yleistynyt luottamus muodostuu. Väitöskirjan tulokset tukevat suurimmaksi osaksi instituutiokeskeiseen suuntaukseen sisältyviä hypoteeseja yleistyneen luottamuksen kasautumisesta. Kuitenkin myös esimerkiksi yhdistystoiminnalla havaittiin olevan joitakin yhdistysjäsenien ulkopuolelle ulottuvia myönteisiä vaikutuksia kansalaisten luottamukseen, mikä taas tukee kansalaisyhteiskuntakeskeistä hypoteesia. Tutkimuksen keskeinen tulos on, että kaiken kaikkiaan luottamus näyttäisi kukoistavan maissa, joissa kansalaiset kokevat julkiset instituutiot oikeudenmukaisina sekä reiluina, kansalaisyhteiskunnan roolin luottamuksen synnyttämisessä ollessa tälle alisteinen. Syyksi tähän on oletettu, että näissä maissa (erityisesti pohjoismaiset hyvinvointivaltiot) harjoitettu universaali hyvinvointipolitiikka ja palvelut ovat keskeisiä korkeaa yleistynyttä luottamusta selittäviä tekijöitä. Toisaalta maavertailuissa tätä yhteyttä on selitetty myös sillä, että näissä yhteiskunnassa ei ole paikannettavissa selkeää kulttuurisesti erottuvaa alaluokkaa. Tämän tutkimuksen tulokset tukevat enemmän universaalin hyvinvointivaltion oikeudenmukaisuuteen liittyviä ominaisuuksia alaluokkaistumishypoteesin sijaan. Toisaalta mikrotasolla tarkasteltuna yleistyneen luottamuksen ja hyvinvointipalvelujen välinen yhteys liittyy enemmän palveluiden riittävyyteen kuin niiden universaalisuuden asteeseen. Niin ikään maavertailuissa esimerkiksi verotuksen oikeudenmukaisena kokeminen näyttäisi olevan palvelujen saatavuutta tai niihin liittyviä oikeudenmukaisuuden kokemuksia tärkeämpi seikka yleistyneen luottamuksen kannalta.
Resumo:
Response Surface Methodology (RSM) was applied to evaluate the chromatic features and sensory acceptance of emulsions that combine Soy Protein (SP) and red Guava Juice (GJ). The parameters analyzed were: instrumental color based on the coordinates a* (redness), b* (yellowness), L* (lightness), C* (chromaticity), h* (hue angle), visual color, acceptance, and appearance. The analyses of the results showed that GJ was responsible for the high measured values of red color, hue angle, chromaticity, acceptance, and visual color, whereas SP was the variable that increased the yellowness intensity of the assays. The redness (R²adj = 74.86%, p < 0.01) and hue angle (R²adj = 80.96%, p < 0.01) were related to the independent variables by linear models, while the sensory data (color and acceptance) could not be modeled due to a high variability. The models of yellowness, lightness, and chromaticity did not present lack of fit but presented adjusted determination coefficients bellow 70%. Notwithstanding, the linear correlations between sensory and instrumental data were not significant (p > 0.05) and low Pearson coefficients were obtained. The results showed that RSM is a useful tool to develop soy-based emulsions and model some chromatic features of guava-based emulsions through RSM.
Resumo:
The industrialization of passion fruit in the form of juice produces considerable amounts of residue that could be used as food. The objective of the present study was to determine the effects of the volume of passion fruit juice added to the syrup and the cooking time on the color and texture of passion fruit albedo preserved in syrup. Multi-linear models were well fit to describe the value for a* (for the albedo) the values for b* (for the albedo and syrup), which exhibited high correlation coefficients of 98%, 84%, and 88%, respectively. The volume of passion fruit juice added and the cooking time of the albedos in the syrup, involved in the processing of passion fruit albedo preserves in syrup, significantly affected color analyses. The texture was not affected by the parameters studied. Therefore, the use of larger volumes of passion fruit juice and longer cooking time is recommended for the production of passion fruit albedo preserves in syrup to achieve the characteristic yellow color of the fruit.
Resumo:
Financial time series have a tendency of abruptly changing their behavior and maintain this behavior for several consecutive periods, and commodity futures returns are not an exception. This quality proposes that nonlinear models, as opposed to linear models, can more accurately describe returns and volatility. Markov regime switching models are able to match this behavior and have become a popular way to model financial time series. This study uses Markov regime switching model to describe the behavior of energy futures returns on a commodity level, because studies show that commodity futures are a heterogeneous asset class. The purpose of this thesis is twofold. First, determine how many regimes characterize individual energy commodities’ returns in different return frequencies. Second, study the characteristics of these regimes. We extent the previous studies on the subject in two ways: We allow for the possibility that the number of regimes may exceed two, as well as conduct the research on individual commodities rather than on commodity indices or subgroups of these indices. We use daily, weekly and monthly time series of Brent crude oil, WTI crude oil, natural gas, heating oil and gasoil futures returns over 1994–2014, where available, to carry out the study. We apply the likelihood ratio test to determine the sufficient number of regimes for each commodity and data frequency. Then the time series are modeled with Markov regime switching model to obtain the return distribution characteristics of each regime, as well as the transition probabilities of moving between regimes. The results for the number of regimes suggest that daily energy futures return series consist of three to six regimes, whereas weekly and monthly returns for all energy commodities display only two regimes. When the number of regimes exceeds two, there is a tendency for the time series of energy commodities to form groups of regimes. These groups are usually quite persistent as a whole because probability of a regime switch inside the group is high. However, individual regimes in these groups are not persistent and the process oscillates between these regimes frequently. Regimes that are not part of any group are generally persistent, but show low ergodic probability, i.e. rarely prevail in the market. This study also suggests that energy futures return series characterized with two regimes do not necessarily display persistent bull and bear regimes. In fact, for the majority of time series, bearish regime is considerably less persistent. Rahoituksen aikasarjoilla on taipumus arvaamattomasti muuttaa käyttäytymistään ja jatkaa tätä uutta käyttäytymistä useiden periodien ajan, eivätkä hyödykefutuurien tuotot tee tähän poikkeusta. Tämän ominaisuuden johdosta lineaaristen mallien sijasta epälineaariset mallit pystyvät tarkemmin kuvailemaan esimerkiksi tuottojen jakauman parametreja. Markov regiiminvaihtomallit pystyvät vangitsemaan tämän ominaisuuden ja siksi niistä on tullut suosittuja rahoituksen aikasarjojen mallintamisessa. Tämä tutkimus käyttää Markov regiiminvaihtomallia kuvaamaan yksittäisten energiafutuurien tuottojen käyttäytymistä, sillä tutkimukset osoittavat hyödykefutuurien olevan hyvin heterogeeninen omaisuusluokka. Tutkimuksen tarkoitus on selvittää, kuinka monta regiimiä tarvitaan kuvaamaan energiafutuurien tuottoja eri tuottofrekvensseillä ja mitkä ovat näiden regiimien ominaisuudet. Aiempaa tutkimusta aiheesta laajennetaan määrittämällä regiimien lukumäärä tilastotieteellisen testauksen menetelmin sekä tutkimalla energiafutuureja yksittäin; ei indeksi- tai alaindeksitasolla. Tutkimuksessa käytetään päivä-, viikko- ja kuukausiaikasarjoja Brent-raakaöljyn, WTI-raakaöljyn, maakaasun, lämmitysöljyn ja polttoöljyn tuotoista aikaväliltä 1994–2014, siltä osin kuin aineistoa on saatavilla. Likelihood ratio -testin avulla estimoidaan kaikille aikasarjoille regiimien määrä,jonka jälkeen Markov regiiminvaihtomallia hyödyntäen määritetään yksittäisten regiimientuottojakaumien ominaisuudet sekä regiimien välinen transitiomatriisi. Tulokset regiimien lukumäärän osalta osoittavat, että energiafutuurien päiväkohtaisten tuottojen aikasarjoissa regiimien lukumäärä vaihtelee kolmen ja kuuden välillä. Viikko- ja kuukausituottojen kohdalla kaikkien energiafutuurien prosesseissa regiimien lukumäärä on kaksi. Kun regiimejä on enemmän kuin kaksi, on prosessilla taipumus muodostaa regiimeistä koostuvia ryhmiä. Prosessi pysyy ryhmän sisällä yleensä pitkään, koska todennäköisyys siirtyä ryhmään kuuluvien regiimien välillä on suuri. Yksittäiset regiimit ryhmän sisällä eivät kuitenkaan ole kovin pysyviä. Näin ollen prosessi vaihtelee ryhmän sisäisten regiimien välillä tiuhaan. Regiimit, jotka eivät kuulu ryhmään, ovat yleensä pysyviä, mutta prosessi ajautuu niihin vain harvoin, sillä todennäköisyys siirtyä muista regiimeistä niihin on pieni. Tutkimuksen tulokset osoittavat myös, että prosesseissa, joita ohjaa kaksi regiimiä, nämä regiimit eivät välttämättä ole pysyvät bull- ja bear-markkinatilanteet. Tulokset osoittavat sen sijaan, että bear-markkinatilanne on energiafutuureissa selvästi vähemmän pysyvä.
Resumo:
This paper studies seemingly unrelated linear models with integrated regressors and stationary errors. By adding leads and lags of the first differences of the regressors and estimating this augmented dynamic regression model by feasible generalized least squares using the long-run covariance matrix, we obtain an efficient estimator of the cointegrating vector that has a limiting mixed normal distribution. Simulation results suggest that this new estimator compares favorably with others already proposed in the literature. We apply these new estimators to the testing of purchasing power parity (PPP) among the G-7 countries. The test based on the efficient estimates rejects the PPP hypothesis for most countries.
Resumo:
Recent work shows that a low correlation between the instruments and the included variables leads to serious inference problems. We extend the local-to-zero analysis of models with weak instruments to models with estimated instruments and regressors and with higher-order dependence between instruments and disturbances. This makes this framework applicable to linear models with expectation variables that are estimated non-parametrically. Two examples of such models are the risk-return trade-off in finance and the impact of inflation uncertainty on real economic activity. Results show that inference based on Lagrange Multiplier (LM) tests is more robust to weak instruments than Wald-based inference. Using LM confidence intervals leads us to conclude that no statistically significant risk premium is present in returns on the S&P 500 index, excess holding yields between 6-month and 3-month Treasury bills, or in yen-dollar spot returns.
Resumo:
L’Organisation mondiale de la Santé recommande aux individus de limiter leur consommation d’aliments sucrés dans le but de prévenir le développement des maladies chroniques. En santé publique, peu de recherches ont tenté d’identifier les facteurs individuels et contextuels qui peuvent influencer conjointement la consommation de ces aliments. Or, de telles connaissances seraient utiles pour guider les interventions nutritionnelles visant à en réduire la consommation. L’objectif de cette thèse est d'étudier les facteurs reliés au comportement et les contextes associés à la consommation quotidienne d’aliments sucrés chez des adultes vivant dans un milieu urbain occidental. Cette étude a été menée auprès d'une communauté moyen-orientale établie dans la Ville de Montréal. Les aliments sucrés ont été définis comme étant les glucides raffinés dont la teneur en sucres totaux dépasse 20 % de l’énergie totale. Lors de l’étape exploratoire (N = 42), un rappel de 24 heures a permis d’identifier les sources d’aliments sucrés et de déterminer l’apport quotidien en sucres totaux de cette communauté. Une étude qualitative descriptive a été privilégiée et un cadre écologique a guidé la réalisation d’entrevues semi-dirigées sur les contextes de consommation (N = 42). Une analyse de contenu employant des procédures de codage initial et focus a mené à l’élaboration d’un instrument de mesure quantitatif sur les contextes de consommation. Cet instrument a été soumis à un pré-test (N = 20), puis administré à l’échantillon principal (N = 192). Une analyse factorielle exploratoire a permis de préciser les contextes de consommation. Les facteurs individuels mesurés incluent les données sociodémographiques, les symptômes dépressifs, la maîtrise de soi, l’assoupissement de jour, les perceptions ainsi que l’hémoglobine glycosylée. La consommation quotidienne de sucres totaux a été mesurée par un questionnaire de fréquence alimentaire (N = 192). Une analyse de régression multivariée employant le modèle linéaire généralisé (distribution de type gamma et lien logarithmique) a été effectuée pour mesurer les relations entre les contextes de consommation, les facteurs individuels et la consommation de sucres totaux, en contrôlant l’âge et le sexe. L’apport quotidien en sucres totaux de l'échantillon est de 20,3 %, ce qui s’apparente aux apports des Canadiens et des Québécois. La consommation quotidienne moyenne est de 76 g/j. Les analyses qualitative et factorielle ont permis d’identifier un ensemble de 42 contextes de consommation regroupés en sept domaines (Actes et situations de grignotage, Stimuli visuels, Besoins énergétiques, Besoins émotionnels, Indulgence, Contraintes, Socialisation). La consommation quotidienne de sucres totaux est supérieure chez les hommes (B = 0,204, ES = 0,094, p = 0,03). Les facteurs positivement associés à la consommation sont le grignotage (B = 0,225, ES = 0,091, p = 0,01), la prise de dessert (B = 0,105, ES = 0,036, p = 0,001) ainsi que les symptômes dépressifs (B = 0,017, ES = 0,094, p = 0,03). L’âge (B = -0,01, ES = 0,004, p = 0,02), l’indulgence (B = -0,103, ES = 0,052, p = 0,05) et l’auto-modération (B = -0,121, ES = 0,042, p = 0,001) montrent, pour leur part, une association négative. Cette étude a privilégié une méthodologie mixte et a permis de développer une mesure innovatrice pour étudier les facteurs contextuels associés à la consommation d’aliments sucrés. Ceux-ci ont été analysés conjointement avec les facteurs individuels. Afin d'encourager les individus à réduire leur consommation de sucres totaux lorsque nécessaire, les initiatives en santé publique devraient en effet cibler les contextes de consommation de même que les facteurs individuels.
Resumo:
En écologie, dans le cadre par exemple d’études des services fournis par les écosystèmes, les modélisations descriptive, explicative et prédictive ont toutes trois leur place distincte. Certaines situations bien précises requièrent soit l’un soit l’autre de ces types de modélisation ; le bon choix s’impose afin de pouvoir faire du modèle un usage conforme aux objectifs de l’étude. Dans le cadre de ce travail, nous explorons dans un premier temps le pouvoir explicatif de l’arbre de régression multivariable (ARM). Cette méthode de modélisation est basée sur un algorithme récursif de bipartition et une méthode de rééchantillonage permettant l’élagage du modèle final, qui est un arbre, afin d’obtenir le modèle produisant les meilleures prédictions. Cette analyse asymétrique à deux tableaux permet l’obtention de groupes homogènes d’objets du tableau réponse, les divisions entre les groupes correspondant à des points de coupure des variables du tableau explicatif marquant les changements les plus abrupts de la réponse. Nous démontrons qu’afin de calculer le pouvoir explicatif de l’ARM, on doit définir un coefficient de détermination ajusté dans lequel les degrés de liberté du modèle sont estimés à l’aide d’un algorithme. Cette estimation du coefficient de détermination de la population est pratiquement non biaisée. Puisque l’ARM sous-tend des prémisses de discontinuité alors que l’analyse canonique de redondance (ACR) modélise des gradients linéaires continus, la comparaison de leur pouvoir explicatif respectif permet entre autres de distinguer quel type de patron la réponse suit en fonction des variables explicatives. La comparaison du pouvoir explicatif entre l’ACR et l’ARM a été motivée par l’utilisation extensive de l’ACR afin d’étudier la diversité bêta. Toujours dans une optique explicative, nous définissons une nouvelle procédure appelée l’arbre de régression multivariable en cascade (ARMC) qui permet de construire un modèle tout en imposant un ordre hiérarchique aux hypothèses à l’étude. Cette nouvelle procédure permet d’entreprendre l’étude de l’effet hiérarchisé de deux jeux de variables explicatives, principal et subordonné, puis de calculer leur pouvoir explicatif. L’interprétation du modèle final se fait comme dans une MANOVA hiérarchique. On peut trouver dans les résultats de cette analyse des informations supplémentaires quant aux liens qui existent entre la réponse et les variables explicatives, par exemple des interactions entres les deux jeux explicatifs qui n’étaient pas mises en évidence par l’analyse ARM usuelle. D’autre part, on étudie le pouvoir prédictif des modèles linéaires généralisés en modélisant la biomasse de différentes espèces d’arbre tropicaux en fonction de certaines de leurs mesures allométriques. Plus particulièrement, nous examinons la capacité des structures d’erreur gaussienne et gamma à fournir les prédictions les plus précises. Nous montrons que pour une espèce en particulier, le pouvoir prédictif d’un modèle faisant usage de la structure d’erreur gamma est supérieur. Cette étude s’insère dans un cadre pratique et se veut un exemple pour les gestionnaires voulant estimer précisément la capture du carbone par des plantations d’arbres tropicaux. Nos conclusions pourraient faire partie intégrante d’un programme de réduction des émissions de carbone par les changements d’utilisation des terres.
Resumo:
Les simulations et figures ont été réalisées avec le logiciel R.
Resumo:
This study is concerned with Autoregressive Moving Average (ARMA) models of time series. ARMA models form a subclass of the class of general linear models which represents stationary time series, a phenomenon encountered most often in practice by engineers, scientists and economists. It is always desirable to employ models which use parameters parsimoniously. Parsimony will be achieved by ARMA models because it has only finite number of parameters. Even though the discussion is primarily concerned with stationary time series, later we will take up the case of homogeneous non stationary time series which can be transformed to stationary time series. Time series models, obtained with the help of the present and past data is used for forecasting future values. Physical science as well as social science take benefits of forecasting models. The role of forecasting cuts across all fields of management-—finance, marketing, production, business economics, as also in signal process, communication engineering, chemical processes, electronics etc. This high applicability of time series is the motivation to this study.
Resumo:
The method of Least Squares is due to Carl Friedrich Gauss. The Gram-Schmidt orthogonalization method is of much younger date. A method for solving Least Squares Problems is developed which automatically results in the appearance of the Gram-Schmidt orthogonalizers. Given these orthogonalizers an induction-proof is available for solving Least Squares Problems.
Resumo:
We present a tree-structured architecture for supervised learning. The statistical model underlying the architecture is a hierarchical mixture model in which both the mixture coefficients and the mixture components are generalized linear models (GLIM's). Learning is treated as a maximum likelihood problem; in particular, we present an Expectation-Maximization (EM) algorithm for adjusting the parameters of the architecture. We also develop an on-line learning algorithm in which the parameters are updated incrementally. Comparative simulation results are presented in the robot dynamics domain.
Resumo:
The statistical analysis of literary style is the part of stylometry that compares measurable characteristics in a text that are rarely controlled by the author, with those in other texts. When the goal is to settle authorship questions, these characteristics should relate to the author’s style and not to the genre, epoch or editor, and they should be such that their variation between authors is larger than the variation within comparable texts from the same author. For an overview of the literature on stylometry and some of the techniques involved, see for example Mosteller and Wallace (1964, 82), Herdan (1964), Morton (1978), Holmes (1985), Oakes (1998) or Lebart, Salem and Berry (1998). Tirant lo Blanc, a chivalry book, is the main work in catalan literature and it was hailed to be “the best book of its kind in the world” by Cervantes in Don Quixote. Considered by writters like Vargas Llosa or Damaso Alonso to be the first modern novel in Europe, it has been translated several times into Spanish, Italian and French, with modern English translations by Rosenthal (1996) and La Fontaine (1993). The main body of this book was written between 1460 and 1465, but it was not printed until 1490. There is an intense and long lasting debate around its authorship sprouting from its first edition, where its introduction states that the whole book is the work of Martorell (1413?-1468), while at the end it is stated that the last one fourth of the book is by Galba (?-1490), after the death of Martorell. Some of the authors that support the theory of single authorship are Riquer (1990), Chiner (1993) and Badia (1993), while some of those supporting the double authorship are Riquer (1947), Coromines (1956) and Ferrando (1995). For an overview of this debate, see Riquer (1990). Neither of the two candidate authors left any text comparable to the one under study, and therefore discriminant analysis can not be used to help classify chapters by author. By using sample texts encompassing about ten percent of the book, and looking at word length and at the use of 44 conjunctions, prepositions and articles, Ginebra and Cabos (1998) detect heterogeneities that might indicate the existence of two authors. By analyzing the diversity of the vocabulary, Riba and Ginebra (2000) estimates that stylistic boundary to be near chapter 383. Following the lead of the extensive literature, this paper looks into word length, the use of the most frequent words and into the use of vowels in each chapter of the book. Given that the features selected are categorical, that leads to three contingency tables of ordered rows and therefore to three sequences of multinomial observations. Section 2 explores these sequences graphically, observing a clear shift in their distribution. Section 3 describes the problem of the estimation of a suden change-point in those sequences, in the following sections we propose various ways to estimate change-points in multinomial sequences; the method in section 4 involves fitting models for polytomous data, the one in Section 5 fits gamma models onto the sequence of Chi-square distances between each row profiles and the average profile, the one in Section 6 fits models onto the sequence of values taken by the first component of the correspondence analysis as well as onto sequences of other summary measures like the average word length. In Section 7 we fit models onto the marginal binomial sequences to identify the features that distinguish the chapters before and after that boundary. Most methods rely heavily on the use of generalized linear models