957 resultados para Multivariate statistical methods
Resumo:
International audience
Resumo:
Using water quality management programs is a necessary and inevitable way for preservation and sustainable use of water resources. One of the important issues in determining the quality of water in rivers is designing effective quality control networks, so that the measured quality variables in these stations are, as far as possible, indicative of overall changes in water quality. One of the methods to achieve this goal is increasing the number of quality monitoring stations and sampling instances. Since this will dramatically increase the annual cost of monitoring, deciding on which stations and parameters are the most important ones, along with increasing the instances of sampling, in a way that shows maximum change in the system under study can affect the future decision-making processes for optimizing the efficacy of extant monitoring network, removing or adding new stations or parameters and decreasing or increasing sampling instances. This end, the efficiency of multivariate statistical procedures was studied in this thesis. Multivariate statistical procedure, with regard to its features, can be used as a practical and useful method in recognizing and analyzing rivers’ pollution and consequently in understanding, reasoning, controlling, and correct decision-making in water quality management. This research was carried out using multivariate statistical techniques for analyzing the quality of water and monitoring the variables affecting its quality in Gharasou river, in Ardabil province in northwest of Iran. During a year, 28 physical and chemical parameters were sampled in 11 stations. The results of these measurements were analyzed by multivariate procedures such as: Cluster Analysis (CA), Principal Component Analysis (PCA), Factor Analysis (FA), and Discriminant Analysis (DA). Based on the findings from cluster analysis, principal component analysis, and factor analysis the stations were divided into three groups of highly polluted (HP), moderately polluted (MP), and less polluted (LP) stations Thus, this study illustrates the usefulness of multivariate statistical techniques for analysis and interpretation of complex data sets, and in water quality assessment, identification of pollution sources/factors and understanding spatial variations in water quality for effective river water quality management. This study also shows the effectiveness of these techniques for getting better information about the water quality and design of monitoring network for effective management of water resources. Therefore, based on the results, Gharasou river water quality monitoring program was developed and presented.
Resumo:
Abstract: Quantitative Methods (QM) is a compulsory course in the Social Science program in CEGEP. Many QM instructors assign a number of homework exercises to give students the opportunity to practice the statistical methods, which enhances their learning. However, traditional written exercises have two significant disadvantages. The first is that the feedback process is often very slow. The second disadvantage is that written exercises can generate a large amount of correcting for the instructor. WeBWorK is an open-source system that allows instructors to write exercises which students answer online. Although originally designed to write exercises for math and science students, WeBWorK programming allows for the creation of a variety of questions which can be used in the Quantitative Methods course. Because many statistical exercises generate objective and quantitative answers, the system is able to instantly assess students’ responses and tell them whether they are right or wrong. This immediate feedback has been shown to be theoretically conducive to positive learning outcomes. In addition, the system can be set up to allow students to re-try the problem if they got it wrong. This has benefits both in terms of student motivation and reinforcing learning. Through the use of a quasi-experiment, this research project measured and analysed the effects of using WeBWorK exercises in the Quantitative Methods course at Vanier College. Three specific research questions were addressed. First, we looked at whether students who did the WeBWorK exercises got better grades than students who did written exercises. Second, we looked at whether students who completed more of the WeBWorK exercises got better grades than students who completed fewer of the WeBWorK exercises. Finally, we used a self-report survey to find out what students’ perceptions and opinions were of the WeBWorK and the written exercises. For the first research question, a crossover design was used in order to compare whether the group that did WeBWorK problems during one unit would score significantly higher on that unit test than the other group that did the written problems. We found no significant difference in grades between students who did the WeBWorK exercises and students who did the written exercises. The second research question looked at whether students who completed more of the WeBWorK exercises would get significantly higher grades than students who completed fewer of the WeBWorK exercises. The straight-line relationship between number of WeBWorK exercises completed and grades was positive in both groups. However, the correlation coefficients for these two variables showed no real pattern. Our third research question was investigated by using a survey to elicit students’ perceptions and opinions regarding the WeBWorK and written exercises. Students reported no difference in the amount of effort put into completing each type of exercise. Students were also asked to rate each type of exercise along six dimensions and a composite score was calculated. Overall, students gave a significantly higher score to the written exercises, and reported that they found the written exercises were better for understanding the basic statistical concepts and for learning the basic statistical methods. However, when presented with the choice of having only written or only WeBWorK exercises, slightly more students preferred or strongly preferred having only WeBWorK exercises. The results of this research suggest that the advantages of using WeBWorK to teach Quantitative Methods are variable. The WeBWorK system offers immediate feedback, which often seems to motivate students to try again if they do not have the correct answer. However, this does not necessarily translate into better performance on the written tests and on the final exam. What has been learned is that the WeBWorK system can be used by interested instructors to enhance student learning in the Quantitative Methods course. Further research may examine more specifically how this system can be used more effectively.
Resumo:
A ocratoxina A (OTA), micotoxina encontrada em diferentes níveis e em diversas matrizes, apresenta efeitos carcinogênicos, nefrotóxicos e teratogênicos. O desenvolvimento de métodos capazes de diminuir esta contaminação a níveis permitidos pela legislação é incentivado e os processos biológicos utilizados envolvem o uso de enzimas e/ou microrganismos para degradação da OTA e são preferenciais pela especificidade, bem como pelas condições brandas para a detoxificação. O objetivo do trabalho foi estudar a ação de carboxipeptidase A nos níveis e na toxicidade de OTA, visando aplicar a técnica para detoxificar farinhas de trigo. Primeiramente foi estimado o risco de exposição à ocratoxina A pelo consumo de farinhas de trigo. Para isso foram estabelecidas condições de determinação de OTA em farinhas de trigo, empregando técnicas de estatística multivariada para definir os principais interferentes na extração de OTA pelo método de QuEChERS e detecção em CLAE-FL. O método validado permitiu a avaliação da ocorrência natural em 20 amostras de farinha de trigo, estando estas contaminadas na faixa de 0,22 a 0,85 µg.kg-1 , apresentando um valor de ingestão diária de 0,08 ngOTA.dia-1 .kgmassacorpórea -1 e uma disponibilidade de 94,4%. Em seguida foi realizada a padronização da extração de carboxipeptidase A em biomassa de Rhizopus oryzae que consistiu em agitação ultrassônica durante 30 minutos numa potencia fixa de 150 W e 40 kHz e a triagem de agentes biológicos para degradação de OTA. Para o estudo da degradação in vitro de OTA, método de extração e detecção de OTA e OTα em CLAEFL foi validado e o processo de degradação foi realizado com Rhizopus oryzae e Trichoderma reesei, obtendo-se uma redução máxima de 63,5% e 57,7%, respectivamente. A degradação apresentou uma correlação alta (R>0,9) e significativa (p<0,05) com a produção de Otα, indicando que ocorreu a produção de enzimas capazes de hidrolisar a micotoxina, por exemplo, a carboxipeptidase A. O estudo da toxicidade de OTA e seu metabólito OTα foi realizado em neutrófilos humanos, onde foi observado a ausência de efeito tóxico de OTα. Também foi determinado o mecanismo de toxicidade de OTA pelo aumento de Ca2+ intracelular pela liberação a partir das reservas internas. Esta liberação, subsequentemente, provoca uma cascata de eventos, nomeadamente: a produção de espécies reativas, depleção de ATP, perda de ΔΨm, levando à morte por necrose. Para reduzir o risco de exposição à micotoxina pela ingestão de matéria prima contaminada, carboxipeptidase A extraída de diferentes fontes foi aplicada na hidrólise de OTA em farinha de trigo para posterior determinação do conteúdo residual de OTA e OTα, empregando método validado. O estudo mostrou uma redução de OTA entre 16,8 e 78,5% e produção de OTα entre 2 a 8,2 ng.g-1 . As carboxipeptidases mais promissoras para degradação foram as provenientes de Rhizopus e Trichoderma e a carboxipeptidase comercial. Ficou demonstrado que se pode recomendar a aplicação de enzimas proteolíticas, tipo carboxipeptidase, para reduzir o risco de exposição à micotoxina quando utilizada matéria prima contaminada, por exemplo, farinha de trigo para diferentes processos. A transformação de OTA para OTα e seus efeitos na redução da toxicidade da micotoxina corroboram com esta afirmação.
Resumo:
O prognóstico da perda dentária é um dos principais problemas na prática clínica de medicina dentária. Um dos principais fatores prognósticos é a quantidade de suporte ósseo do dente, definido pela área da superfície radicular dentária intraóssea. A estimação desta grandeza tem sido realizada por diferentes metodologias de investigação com resultados heterogéneos. Neste trabalho utilizamos o método da planimetria com microtomografia para calcular a área da superfície radicular (ASR) de uma amostra de cinco dentes segundos pré-molares inferiores obtida da população portuguesa, com o objetivo final de criar um modelo estatístico para estimar a área de superfície radicular intraóssea a partir de indicadores clínicos da perda óssea. Por fim propomos um método para aplicar os resultados na prática. Os dados referentes à área da superfície radicular, comprimento total do dente (CT) e dimensão mésio-distal máxima da coroa (MDeq) serviram para estabelecer as relações estatísticas entre variáveis e definir uma distribuição normal multivariada. Por fim foi criada uma amostra de 37 observações simuladas a partir da distribuição normal multivariada definida e estatisticamente idênticas aos dados da amostra de cinco dentes. Foram ajustados cinco modelos lineares generalizados aos dados simulados. O modelo estatístico foi selecionado segundo os critérios de ajustamento, preditibilidade, potência estatística, acurácia dos parâmetros e da perda de informação, e validado pela análise gráfica de resíduos. Apoiados nos resultados propomos um método em três fases para estimação área de superfície radicular perdida/remanescente. Na primeira fase usamos o modelo estatístico para estimar a área de superfície radicular, na segunda estimamos a proporção (decis) de raiz intraóssea usando uma régua de Schei adaptada e na terceira multiplicamos o valor obtido na primeira fase por um coeficiente que representa a proporção de raiz perdida (ASRp) ou da raiz remanescente (ASRr) para o decil estimado na segunda fase. O ponto forte deste estudo foi a aplicação de metodologia estatística validada para operacionalizar dados clínicos na estimação de suporte ósseo perdido. Como pontos fracos consideramos a aplicação destes resultados apenas aos segundos pré-molares mandibulares e a falta de validação clínica.
Resumo:
This dissertation applies statistical methods to the evaluation of automatic summarization using data from the Text Analysis Conferences in 2008-2011. Several aspects of the evaluation framework itself are studied, including the statistical testing used to determine significant differences, the assessors, and the design of the experiment. In addition, a family of evaluation metrics is developed to predict the score an automatically generated summary would receive from a human judge and its results are demonstrated at the Text Analysis Conference. Finally, variations on the evaluation framework are studied and their relative merits considered. An over-arching theme of this dissertation is the application of standard statistical methods to data that does not conform to the usual testing assumptions.
Resumo:
Neste trabalho apresentamos a teoria da análise de correlação canónica, uma técnica de análise estatística multivariada para o estudo da relação, simultânea, entre dois, três ou mais grupos de variáveis. Descrevemos a natureza da correlação canónica com três ou mais variáveis, com modelos matemáticos, fazendo uma síntese dos métodos de generalização de correlação canónica nomeadamente o método Ssqcor, método Sumcor, método Ecart, método Maxvar, método Minvar, e o método de Carroll. Apresentamos uma aplicação utilizando dados provenientes do cálculo do Índice de Preços no Consumidor IPC, produzido pelo INE - STP (Instituto Nacional de Estatística de São Tomé e Príncipe), referente ao período 2010 a 2014. Estamos interessados em conhecer as correlações canónicas entre grupos de variáveis relacionadas com o cabaz de produtos pré-estabelecido para o cálculo do índice de preços no consumidor, concretamente os produtos alimentares (PA), produtos para bebidas (PB) e produtos não alimentares (PNA), constituindo assim os três grandes grupos de variáveis da nossa pesquisa.
Resumo:
Dissertação de Mestrado, Gestão Empresarial, Faculdade de Economia, Universidade do Algarve, 2015
Resumo:
Ecomorphology is a science based on the idea that morphological differences among species could be associated with distinct biological and environmental pressures suffered by them. These differences can be studied employing morphological and biometric indexes denominated Ecomorphological attributes , representing standards that express characteristics of the individual in relation to its environment, and can be interpreted as indicators of life habits or adaptations suffered due its occupation of different habitats. This work aims to contribute for the knowledge of the ecomorphology of the Brazilian marine ichthyofauna, specifically from Galinhos, located at Rio Grande do Norte state. 10 different species of fish were studied, belonging the families Gerreidae (Eucinostomus argenteus), Haemulidae (Orthopristis ruber,Pomadasyscorvinaeformis,Haemulonaurolineatum,Haemulonplumieri,Haemulonsteindachneri), Lutjanidae (Lutjanus synagris), Paralichthyidae (Syaciummicrurum), Bothidae (Bothus ocellatus) and Tetraodontidae (Sphoeroidestestudineus), which were obtained during five collections, in the period time of September/2004 to April/2005, utilizing three special nets. The ecomorphological study was performed at the laboratory. Eight to ten samples of each fish specie were measured. Fifteen morphological aspects were considered to calculate twelve ecomorphological attributes. Multivariate statistical analysis methods such as Principal Component Analysis (PCA) and Cluster Analysis were done to identify ecmorphological patterns to describe the data set obtained. As results, H.aurolineatumwas the most abundant specie found (23,03%) and S.testudineusthe less one with 0,23%. The 1st Principal component showed variation of 60,03% with influence of the ecomorphological attribute related to body morphology, while the 2nd PC with 23,25% variation had influence of the ecomorphological attribute related to oral morphology. The Cluster Analiysis promoted the identification of three distinct groups Perciformes, Pleuronectiformes and Tetraodontiformes. Based on the obtained data, considering morphological characters differences among the species studied, we suggest that all of them live at the medium (E.argenteus,O.rubber, P.corvinaeformis,H.aurolineatum,H.plumieri,H.steindachneri,L.synagris) and bottom (S.micrurum,B.ocellatus,S.testudineus) region of column water.
Resumo:
The study of random probability measures is a lively research topic that has attracted interest from different fields in recent years. In this thesis, we consider random probability measures in the context of Bayesian nonparametrics, where the law of a random probability measure is used as prior distribution, and in the context of distributional data analysis, where the goal is to perform inference given avsample from the law of a random probability measure. The contributions contained in this thesis can be subdivided according to three different topics: (i) the use of almost surely discrete repulsive random measures (i.e., whose support points are well separated) for Bayesian model-based clustering, (ii) the proposal of new laws for collections of random probability measures for Bayesian density estimation of partially exchangeable data subdivided into different groups, and (iii) the study of principal component analysis and regression models for probability distributions seen as elements of the 2-Wasserstein space. Specifically, for point (i) above we propose an efficient Markov chain Monte Carlo algorithm for posterior inference, which sidesteps the need of split-merge reversible jump moves typically associated with poor performance, we propose a model for clustering high-dimensional data by introducing a novel class of anisotropic determinantal point processes, and study the distributional properties of the repulsive measures, shedding light on important theoretical results which enable more principled prior elicitation and more efficient posterior simulation algorithms. For point (ii) above, we consider several models suitable for clustering homogeneous populations, inducing spatial dependence across groups of data, extracting the characteristic traits common to all the data-groups, and propose a novel vector autoregressive model to study of growth curves of Singaporean kids. Finally, for point (iii), we propose a novel class of projected statistical methods for distributional data analysis for measures on the real line and on the unit-circle.
Resumo:
Often in biomedical research, we deal with continuous (clustered) proportion responses ranging between zero and one quantifying the disease status of the cluster units. Interestingly, the study population might also consist of relatively disease-free as well as highly diseased subjects, contributing to proportion values in the interval [0, 1]. Regression on a variety of parametric densities with support lying in (0, 1), such as beta regression, can assess important covariate effects. However, they are deemed inappropriate due to the presence of zeros and/or ones. To evade this, we introduce a class of general proportion density, and further augment the probabilities of zero and one to this general proportion density, controlling for the clustering. Our approach is Bayesian and presents a computationally convenient framework amenable to available freeware. Bayesian case-deletion influence diagnostics based on q-divergence measures are automatic from the Markov chain Monte Carlo output. The methodology is illustrated using both simulation studies and application to a real dataset from a clinical periodontology study.
Resumo:
Conventional radiography has shown limitation in acquiring image of the ATM region, thus, computed tomography (CT) scanning has been the best option to the present date for diagnosis, surgical planning and treatment of bone lesions, owing to its specific properties. OBJECTIVE: The aim of the study was to evaluate images of simulated bone lesions at the head of the mandible by multislice CT. MATERIAL AND METHODS: Spherical lesions were made with dental spherical drills (sizes 1, 3, and 6) and were evaluated by using multislice CT (64 rows), by two observers in two different occasions, deploying two protocols: axial, coronal, and sagittal images, and parasagittal images for pole visualization (anterior, lateral, posterior, medial and superior). Acquired images were then compared with those lesions in the dry mandible (gold standard) to evaluate the specificity and sensibility of both protocols. Statistical methods included: Kappa statistics, validity test and chi-square test. Results demonstrated the advantage of associating axial, coronal, and sagittal slices with parasagittal slices for lesion detection at the head of the mandible. RESULTS: There was no statistically significant difference between the types of protocols regarding a particular localization of lesions at the poles. CONCLUSIONS: Protocols for the assessment of the head of the mandible were established to improve the visualization of alterations of each of the poles of the mandible's head. The anterior and posterior poles were better visualized in lateral-medial planes while lateral, medial and superior poles were better visualized in the anterior-posterior plane.
Resumo:
OBJECTIVE: To estimate the spatial intensity of urban violence events using wavelet-based methods and emergency room data. METHODS: Information on victims attended at the emergency room of a public hospital in the city of São Paulo, Southeastern Brazil, from January 1, 2002 to January 11, 2003 were obtained from hospital records. The spatial distribution of 3,540 events was recorded and a uniform random procedure was used to allocate records with incomplete addresses. Point processes and wavelet analysis technique were used to estimate the spatial intensity, defined as the expected number of events by unit area. RESULTS: Of all georeferenced points, 59% were accidents and 40% were assaults. There is a non-homogeneous spatial distribution of the events with high concentration in two districts and three large avenues in the southern area of the city of São Paulo. CONCLUSIONS: Hospital records combined with methodological tools to estimate intensity of events are useful to study urban violence. The wavelet analysis is useful in the computation of the expected number of events and their respective confidence bands for any sub-region and, consequently, in the specification of risk estimates that could be used in decision-making processes for public policies.
Resumo:
The spatial and temporal retention of metals has been studied in water and sediments of the Gavião River, Anagé and Tremedal Reservoirs, located in the semi-arid region, Bahia - Brazil, in order to identify trends in the fluxes of metals from the sediments to the water column. The determination of metals was made by ICP OES and ET AAS. The application of statistical methods showed that this aquatic system presents suitable conditions to move Cd2+ and Pb2+ from the water column to the sediment.
Resumo:
Structure of intertidal and subtidal benthic macrofauna in the northeastern region of Todos os Santos Bay (TSB), northeast Brazil, was investigated during a period of two years. Relationships with environmental parameters were studied through uni-and multivariate statistical analyses, and the main distributional patterns shown to be especially related to sediment type and content of organic fractions (Carbon, Nitrogen, Phosphorus), on both temporal and spatial scales. Polychaete annelids accounted for more than 70% of the total fauna and showed low densities, species richness and diversity, except for the area situated on the reef banks. These banks constitute a peculiar environment in relation to the rest of the region by having coarse sediments poor in organic matter and rich in biodetritic carbonates besides an abundant and diverse fauna. The intertidal region and the shallower area nearer to the oil refinery RLAM, with sediments composed mainly of fine sand, seem to constitute an unstable system with few highly dominant species, such as Armandia polyophthalma and Laeonereis acuta. In the other regions of TSB, where muddy bottoms predominated, densities and diversity were low, especially in the stations near the refinery. Here the lowest values of the biological indicators occurred together with the highest organic compound content. In addition, the nearest sites (stations 4 and 7) were sometimes azoic. The adjacent Caboto, considered as a control area at first, presented low density but intermediate values of species diversity, which indicates a less disturbed environment in relation to the pelitic infralittoral in front of the refinery. The results of the ordination analyses evidenced five homogeneous groups of stations (intertidal; reef banks; pelitic infralittoral; mixed sediments; Caboto) with different specific patterns, a fact which seems to be mainly related to granulometry and chemical sediment characteristics.