972 resultados para Models, statistical
Resumo:
Time series regression models are especially suitable in epidemiology for evaluating short-term effects of time-varying exposures on health. The problem is that potential for confounding in time series regression is very high. Thus, it is important that trend and seasonality are properly accounted for. Our paper reviews the statistical models commonly used in time-series regression methods, specially allowing for serial correlation, make them potentially useful for selected epidemiological purposes. In particular, we discuss the use of time-series regression for counts using a wide range Generalised Linear Models as well as Generalised Additive Models. In addition, recently critical points in using statistical software for GAM were stressed, and reanalyses of time series data on air pollution and health were performed in order to update already published. Applications are offered through an example on the relationship between asthma emergency admissions and photochemical air pollutants
Resumo:
Observations in daily practice are sometimes registered as positive values larger then a given threshold α. The sample space is in this case the interval (α,+∞), α > 0, which can be structured as a real Euclidean space in different ways. This fact opens the door to alternative statistical models depending not only on the assumed distribution function, but also on the metric which is considered as appropriate, i.e. the way differences are measured, and thus variability
Resumo:
This paper is a first draft of the principle of statistical modelling on coordinates. Several causes —which would be long to detail—have led to this situation close to the deadline for submitting papers to CODAWORK’03. The main of them is the fast development of the approach along the last months, which let appear previous drafts as obsolete. The present paper contains the essential parts of the state of the art of this approach from my point of view. I would like to acknowledge many clarifying discussions with the group of people working in this field in Girona, Barcelona, Carrick Castle, Firenze, Berlin, G¨ottingen, and Freiberg. They have given a lot of suggestions and ideas. Nevertheless, there might be still errors or unclear aspects which are exclusively my fault. I hope this contribution serves as a basis for further discussions and new developments
Resumo:
The paper discusses maintenance challenges of organisations with a huge number of devices and proposes the use of probabilistic models to assist monitoring and maintenance planning. The proposal assumes connectivity of instruments to report relevant features for monitoring. Also, the existence of enough historical registers with diagnosed breakdowns is required to make probabilistic models reliable and useful for predictive maintenance strategies based on them. Regular Markov models based on estimated failure and repair rates are proposed to calculate the availability of the instruments and Dynamic Bayesian Networks are proposed to model cause-effect relationships to trigger predictive maintenance services based on the influence between observed features and previously documented diagnostics
Resumo:
Esta tesis está dividida en dos partes: en la primera parte se presentan y estudian los procesos telegráficos, los procesos de Poisson con compensador telegráfico y los procesos telegráficos con saltos. El estudio presentado en esta primera parte incluye el cálculo de las distribuciones de cada proceso, las medias y varianzas, así como las funciones generadoras de momentos entre otras propiedades. Utilizando estas propiedades en la segunda parte se estudian los modelos de valoración de opciones basados en procesos telegráficos con saltos. En esta parte se da una descripción de cómo calcular las medidas neutrales al riesgo, se encuentra la condición de no arbitraje en este tipo de modelos y por último se calcula el precio de las opciones Europeas de compra y venta.
Resumo:
The performance of a model-based diagnosis system could be affected by several uncertainty sources, such as,model errors,uncertainty in measurements, and disturbances. This uncertainty can be handled by mean of interval models.The aim of this thesis is to propose a methodology for fault detection, isolation and identification based on interval models. The methodology includes some algorithms to obtain in an automatic way the symbolic expression of the residual generators enhancing the structural isolability of the faults, in order to design the fault detection tests. These algorithms are based on the structural model of the system. The stages of fault detection, isolation, and identification are stated as constraint satisfaction problems in continuous domains and solved by means of interval based consistency techniques. The qualitative fault isolation is enhanced by a reasoning in which the signs of the symptoms are derived from analytical redundancy relations or bond graph models of the system. An initial and empirical analysis regarding the differences between interval-based and statistical-based techniques is presented in this thesis. The performance and efficiency of the contributions are illustrated through several application examples, covering different levels of complexity.
Resumo:
En aquest estudi, la toxicitat de diversos metalls pesants i l'arsènic va ser analitzada utilitzant diferents models biològics. En la primera part d'aquest treball, el bioassaig de toxicitat Microtox, el qual està basat en la variació de l'emissió lumínica del bacteri luminiscent Vibrio fischeri, va ser utilitzat per establir les corbes dosi-resposta de diferents elements tòxics com el Zn(II), Pb(II), Cu(II), Hg(II), Ag(I), Co(II), Cd(II), Cr(VI), As(V) i As(III) en solucions aquoses. Els experiments es varen portar a terme a pH 6.0 i 7.0 per tal de mostrar que el pH pot influir en la toxicitat final mesurada d'alguns metalls degut als canvis relacionats amb la seva especiació química. Es varen trobar diferents tipus de corbes dosi-resposta depenent del metall analitzat i el pH del medi. En el cas de l'arsènic, l'efecte del pH en la toxicitat de l'arsenat i l'arsenit es va investigar utilitzant l'assaig Microtox en un rang de pHs comprès entre pH 5.0 i 9.0. Els valors d'EC50 determinats per l'As(V) disminueixen, reflectint un augment de la toxicitat, a mesura que el pH de la solució augmenta mentre que, en el cas de l'As(III), els valors d'EC50 quasi bé no varien entre pH 6.0 i 8.0 i només disminueixen a pH 9.0. HAsO42- i H2AsO3- es varen definir com les espècies més tòxiques. Així mateix, una anàlisi estadística va revelar un efecte antagònic entre les espècies químiques d'arsenat que es troben conjuntament a pH 6.0 i 7.0. D'altra banda, els resultats de dos mètodes estadístics per predir la toxicitat i les possibles interaccions entre el Co(II), Cd(II), Cu(II), Zn(II) i Pb(II) en mescles binàries equitòxiques es varen comparar amb la toxicitat observada sobre el bacteri Vibrio fischeri. L'efecte combinat d'aquests metalls va resultar ser antagònic per les mescles de Co(II)-Cd(II), Cd(II)-Zn(II), Cd(II)-Pb(II) i Cu(II)-Pb(II), sinèrgic per Co(II)-Cu(II) i Zn(II)-Pb(II) i additiu en els altres casos, revelant un patró complex de possibles interaccions. L'efecte sinèrgic de la combinació Co(II)-Cu(II) i la forta disminució de la toxicitat del Pb(II) quan es troba en presència de Cd(II) hauria de merèixer més atenció quan s'estableixen les normatives de seguretat ambiental. La sensibilitat de l'assaig Microtox també va ser determinada. Els valors d'EC20, els quals representen la toxicitat llindar mesurable, varen ser determinats per cada element individualment i es va veure que augmenten de la següent manera: Pb(II) < Ag(I) < Hg(II) Cu(II) < Zn(II) < As(V) < Cd(II) Co(II) < As(III) < Cr(VI). Aquests valors es varen comparar amb les concentracions permeses en aigues residuals industrials establertes per la normativa oficial de Catalunya (Espanya). L'assaig Microtox va resultar ser suficientment sensible per detectar els elements assajats respecte a les normes oficials referents al control de la contaminació, excepte en el cas del cadmi, mercuri, arsenat, arsenit i cromat. En la segona part d'aquest treball, com a resultats complementaris dels resultats previs obtinguts utilitzant l'assaig de toxicitat aguda Microtox, els efectes crònics del Cd(II), Cr(VI) i As(V) es varen analitzar sobre la taxa de creixement i la viabilitat en el mateix model biològic. Sorprenentment, aquests productes químics nocius varen resultar ser poc tòxics per aquest bacteri quan es mesura el seu efecte després de temps d'exposició llargs. Tot i això, en el cas del Cr(VI), l'assaig d'inhibició de la viabilitat va resultar ser més sensible que l'assaig de toxicitat aguda Microtox. Així mateix, també va ser possible observar un clar fenomen d'hormesis, especialment en el cas del Cd(II), quan s'utilitza l'assaig d'inhibició de la viabilitat. A més a més, diversos experiments es varen portar a terme per intentar explicar la manca de toxicitat de Cr(VI) mostrada pel bacteri Vibrio fischeri. La resistència mostrada per aquest bacteri podria ser atribuïda a la capacitat d'aquest bacteri de convertir el Cr(VI) a la forma menys tòxica de Cr(III). Es va trobar que aquesta capacitat de reducció depèn de la composició del medi de cultiu, de la concentració inicial de Cr(VI), del temps d'incubació i de la presència d'una font de carboni. En la tercera part d'aquest treball, la línia cel·lular humana HT29 i cultius primaris de cèl·lules sanguínies de Sparus sarba es varen utilitzar in vitro per detectar la toxicitat llindar de metalls mesurant la sobreexpressió de proteines d'estrès. Extractes de fangs precedents de diverses plantes de tractament d'aigues residuals i diferents metalls, individualment o en combinació, es varen analitzar sobre cultius cel·lulars humans per avaluar el seu efecte sobre la taxa de creixement i la capacitat d'induir la síntesi de les proteïnes Hsp72 relacionades amb l'estrès cel·lular. No es varen trobar efectes adversos significatius quan els components s'analitzen individualment. Nogensmenys, quan es troben conjuntament, es produeix un afecte advers sobre tan la taxa de creixement com en l'expressió de proteins d'estrès. D'altra banda, cèl·lules sanguínies procedents de Sparus sarba es varen exposar in vitro a diferents concentracions de cadmi, plom i crom. La proteïna d'estrès HSP70 es va sobreexpressar significativament després de l'exposició a concentracions tan febles com 0.1 M. Sota les nostres condicions de treball, no es va evidenciar una sobreexpressió de metal·lotioneïnes. Nogensmenys, les cèl·lules sanguínies de peix varen resultar ser un model biològic interessant per a ser utilitzat en anàlisis de toxicitat. Ambdós models biològics varen resultar ser molt adequats per a detectar acuradament la toxicitat produïda per metalls. En general, l'avaluació de la toxicitat basada en l'anàlisi de la sobreexpressió de proteïnes d'estrès és més sensible que l'avaluació de la toxicitat realitzada a nivell d'organisme. A partir dels resultats obtinguts, podem concloure que una bateria de bioassaigs és realment necessària per avaluar acuradament la toxicitat de metalls ja que existeixen grans variacions entre els valors de toxicitat obtinguts emprant diferents organismes i molts factors ambientals poden influir i modificar els resultats obtinguts.
Resumo:
[EU]Lan honetan semantika distribuzionalaren eta ikasketa automatikoaren erabilera aztertzen dugu itzulpen automatiko estatistikoa hobetzeko. Bide horretan, erregresio logistikoan oinarritutako ikasketa automatikoko eredu bat proposatzen dugu hitz-segiden itzulpen- probabilitatea modu dinamikoan modelatzeko. Proposatutako eredua itzulpen automatiko estatistikoko ohiko itzulpen-probabilitateen orokortze bat dela frogatzen dugu, eta testuinguruko nahiz semantika distribuzionaleko informazioa barneratzeko baliatu ezaugarri lexiko, hitz-cluster eta hitzen errepresentazio bektorialen bidez. Horretaz gain, semantika distribuzionaleko ezagutza itzulpen automatiko estatistikoan txertatzeko beste hurbilpen bat lantzen dugu: hitzen errepresentazio bektorial elebidunak erabiltzea hitz-segiden itzulpenen antzekotasuna modelatzeko. Gure esperimentuek proposatutako ereduen baliagarritasuna erakusten dute, emaitza itxaropentsuak eskuratuz oinarrizko sistema sendo baten gainean. Era berean, gure lanak ekarpen garrantzitsuak egiten ditu errepresentazio bektorialen mapaketa elebidunei eta hitzen errepresentazio bektorialetan oinarritutako hitz-segiden antzekotasun neurriei dagokienean, itzulpen automatikoaz haratago balio propio bat dutenak semantika distribuzionalaren arloan.
Resumo:
Land use and land cover changes in the Brazilian Amazon have major implications for regional and global carbon (C) cycling. Cattle pasture represents the largest single use (about 70%) of this once-forested land in most of the region. The main objective of this study was to evaluate the accuracy of the RothC and Century models at estimating soil organic C (SOC) changes under forest-to-pasture conditions in the Brazilian Amazon. We used data from 11 site-specific 'forest to pasture' chronosequences with the Century Ecosystem Model (Century 4.0) and the Rothamsted C Model (RothC 26.3). The models predicted that forest clearance and conversion to well managed pasture would cause an initial decline in soil C stocks (0-20 cm depth), followed in the majority of cases by a slow rise to levels exceeding those under native forest. One exception to this pattern was a chronosequence in Suia-Missu, which is under degraded pasture. In three other chronosequences the recovery of soil C under pasture appeared to be only to about the same level as under the previous forest. Statistical tests were applied to determine levels of agreement between simulated SOC stocks and observed stocks for all the sites within the 11 chronosequences. The models also provided reasonable estimates (coefficient of correlation = 0.8) of the microbial biomass C in the 0-10 cm soil layer for three chronosequences, when compared with available measured data. The Century model adequately predicted the magnitude and the overall trend in delta C-13 for the six chronosequences where measured 813 C data were available. This study gave independent tests of model performance, as no adjustments were made to the models to generate outputs. Our results suggest that modelling techniques can be successfully used for monitoring soil C stocks and changes, allowing both the identification of current patterns in the soil and the projection of future conditions. Results were used and discussed not only to evaluate soil C dynamics but also to indicate soil C sequestration opportunities for the Brazilian Amazon region. Moreover, modelling studies in these 'forest to pasture' systems have important applications, for example, the calculation of CO, emissions from land use change in national greenhouse gas inventories. (0 2007 Elsevier B.V. All rights reserved.
Resumo:
A physically motivated statistical model is used to diagnose variability and trends in wintertime ( October - March) Global Precipitation Climatology Project (GPCP) pentad (5-day mean) precipitation. Quasi-geostrophic theory suggests that extratropical precipitation amounts should depend multiplicatively on the pressure gradient, saturation specific humidity, and the meridional temperature gradient. This physical insight has been used to guide the development of a suitable statistical model for precipitation using a mixture of generalized linear models: a logistic model for the binary occurrence of precipitation and a Gamma distribution model for the wet day precipitation amount. The statistical model allows for the investigation of the role of each factor in determining variations and long-term trends. Saturation specific humidity q(s) has a generally negative effect on global precipitation occurrence and with the tropical wet pentad precipitation amount, but has a positive relationship with the pentad precipitation amount at mid- and high latitudes. The North Atlantic Oscillation, a proxy for the meridional temperature gradient, is also found to have a statistically significant positive effect on precipitation over much of the Atlantic region. Residual time trends in wet pentad precipitation are extremely sensitive to the choice of the wet pentad threshold because of increasing trends in low-amplitude precipitation pentads; too low a choice of threshold can lead to a spurious decreasing trend in wet pentad precipitation amounts. However, for not too small thresholds, it is found that the meridional temperature gradient is an important factor for explaining part of the long-term trend in Atlantic precipitation.
Resumo:
An extensive statistical ‘downscaling’ study is done to relate large-scale climate information from a general circulation model (GCM) to local-scale river flows in SW France for 51 gauging stations ranging from nival (snow-dominated) to pluvial (rainfall-dominated) river-systems. This study helps to select the appropriate statistical method at a given spatial and temporal scale to downscale hydrology for future climate change impact assessment of hydrological resources. The four proposed statistical downscaling models use large-scale predictors (derived from climate model outputs or reanalysis data) that characterize precipitation and evaporation processes in the hydrological cycle to estimate summary flow statistics. The four statistical models used are generalized linear (GLM) and additive (GAM) models, aggregated boosted trees (ABT) and multi-layer perceptron neural networks (ANN). These four models were each applied at two different spatial scales, namely at that of a single flow-gauging station (local downscaling) and that of a group of flow-gauging stations having the same hydrological behaviour (regional downscaling). For each statistical model and each spatial resolution, three temporal resolutions were considered, namely the daily mean flows, the summary statistics of fortnightly flows and a daily ‘integrated approach’. The results show that flow sensitivity to atmospheric factors is significantly different between nival and pluvial hydrological systems which are mainly influenced, respectively, by shortwave solar radiations and atmospheric temperature. The non-linear models (i.e. GAM, ABT and ANN) performed better than the linear GLM when simulating fortnightly flow percentiles. The aggregated boosted trees method showed higher and less variable R2 values to downscale the hydrological variability in both nival and pluvial regimes. Based on GCM cnrm-cm3 and scenarios A2 and A1B, future relative changes of fortnightly median flows were projected based on the regional downscaling approach. The results suggest a global decrease of flow in both pluvial and nival regimes, especially in spring, summer and autumn, whatever the considered scenario. The discussion considers the performance of each statistical method for downscaling flow at different spatial and temporal scales as well as the relationship between atmospheric processes and flow variability.
Resumo:
If the fundamental precepts of Farming Systems Research were to be taken literally then it would imply that for each farm 'unique' solutions should be sought. This is an unrealistic expectation, but it has led to the idea of a recommendation domain, implying creating a taxonomy of farms, in order to increase the general applicability of recommendations. Mathematical programming models are an established means of generating recommended solutions, but for such models to be effective they have to be constructed for 'truly' typical or representative situations. The multi-variate statistical techniques provide a means of creating the required typologies, particularly when an exhaustive database is available. This paper illustrates the application of this methodology in two different studies that shared the common purpose of identifying types of farming systems in their respective study areas. The issues related with the use of factor and cluster analyses for farm typification prior to building representative mathematical programming models for Chile and Pakistan are highlighted. (C) 2003 Elsevier Science Ltd. All rights reserved.
Resumo:
The proportional odds model provides a powerful tool for analysing ordered categorical data and setting sample size, although for many clinical trials its validity is questionable. The purpose of this paper is to present a new class of constrained odds models which includes the proportional odds model. The efficient score and Fisher's information are derived from the profile likelihood for the constrained odds model. These results are new even for the special case of proportional odds where the resulting statistics define the Mann-Whitney test. A strategy is described involving selecting one of these models in advance, requiring assumptions as strong as those underlying proportional odds, but allowing a choice of such models. The accuracy of the new procedure and its power are evaluated.
Resumo:
The contribution investigates the problem of estimating the size of a population, also known as the missing cases problem. Suppose a registration system is targeting to identify all cases having a certain characteristic such as a specific disease (cancer, heart disease, ...), disease related condition (HIV, heroin use, ...) or a specific behavior (driving a car without license). Every case in such a registration system has a certain notification history in that it might have been identified several times (at least once) which can be understood as a particular capture-recapture situation. Typically, cases are left out which have never been listed at any occasion, and it is this frequency one wants to estimate. In this paper modelling is concentrating on the counting distribution, e.g. the distribution of the variable that counts how often a given case has been identified by the registration system. Besides very simple models like the binomial or Poisson distribution, finite (nonparametric) mixtures of these are considered providing rather flexible modelling tools. Estimation is done using maximum likelihood by means of the EM algorithm. A case study on heroin users in Bangkok in the year 2001 is completing the contribution.