1000 resultados para Componentes principais, Análise de
Resumo:
Estado e sociedade brasileiros conviveram em descompasso, nos anos 80. A conseqüência imediata desse fenômeno foi o atendimento insuficiente de necessidades básicas da sociedade, nesse período, com aumento da entropia em vários subsistemas sociais brasileiros, dentre os quais o subsistema de saúde. Nesta tese, trabalhando com dados econômicos, sociais e de saúde, e construindo algumas variáveis-indicadores, confrontou-se, naquele período, necessidades da sociedade com ações do Estado, na área da saúde. Utilizando técnicas estatísticas - análise gráfica, associação estatística dos indicadores selecionados (matriz de correlação de PEARSON), análise em componentes principais, análise de agrupamento e análise de regressão linear múltipla com variáveis logaritímizadas - foi possível visualizar causas e conseqüências dessa alta entropia, caracterizada por desperdício de recursos e várias situações propensas à geração de crises nas organizações, setores e instituições do subsistema de saúde brasileiro. Propõe-se um método de alocação de recursos federais, objetivando minimizar desigualdades entre as Unidades da Federação, a partir de seus desempenhos na área de saúde.
Resumo:
Com o objetivo de mostrar uma aplicação dos modelos da família GARCH a taxas de câmbio, foram utilizadas técnicas estatísticas englobando análise multivariada de componentes principais e análise de séries temporais com modelagem de média e variância (volatilidade), primeiro e segundo momentos respectivamente. A utilização de análise de componentes principais auxilia na redução da dimensão dos dados levando a estimação de um menor número de modelos, sem contudo perder informação do conjunto original desses dados. Já o uso dos modelos GARCH justifica-se pela presença de heterocedasticidade na variância dos retornos das séries de taxas de câmbio. Com base nos modelos estimados foram simuladas novas séries diárias, via método de Monte Carlo (MC), as quais serviram de base para a estimativa de intervalos de confiança para cenários futuros de taxas de câmbio. Para a aplicação proposta foram selecionadas taxas de câmbio com maior market share de acordo com estudo do BIS, divulgado a cada três anos.
Resumo:
Produced water is characterized as one of the most common wastes generated during exploration and production of oil. This work aims to develop methodologies based on comparative statistical processes of hydrogeochemical analysis of production zones in order to minimize types of high-cost interventions to perform identification test fluids - TIF. For the study, 27 samples were collected from five different production zones were measured a total of 50 chemical species. After the chemical analysis was applied the statistical data, using the R Statistical Software, version 2.11.1. Statistical analysis was performed in three steps. In the first stage, the objective was to investigate the behavior of chemical species under study in each area of production through the descriptive graphical analysis. The second step was to identify a function that classify production zones from each sample, using discriminant analysis. In the training stage, the rate of correct classification function of discriminant analysis was 85.19%. The next stage of processing of the data used for Principal Component Analysis, by reducing the number of variables obtained from the linear combination of chemical species, try to improve the discriminant function obtained in the second stage and increase the discrimination power of the data, but the result was not satisfactory. In Profile Analysis curves were obtained for each production area, based on the characteristics of the chemical species present in each zone. With this study it was possible to develop a method using hydrochemistry and statistical analysis that can be used to distinguish the water produced in mature fields of oil, so that it is possible to identify the zone of production that is contributing to the excessive elevation of the water volume.
Resumo:
The problems of combinatory optimization have involved a large number of researchers in search of approximative solutions for them, since it is generally accepted that they are unsolvable in polynomial time. Initially, these solutions were focused on heuristics. Currently, metaheuristics are used more for this task, especially those based on evolutionary algorithms. The two main contributions of this work are: the creation of what is called an -Operon- heuristic, for the construction of the information chains necessary for the implementation of transgenetic (evolutionary) algorithms, mainly using statistical methodology - the Cluster Analysis and the Principal Component Analysis; and the utilization of statistical analyses that are adequate for the evaluation of the performance of the algorithms that are developed to solve these problems. The aim of the Operon is to construct good quality dynamic information chains to promote an -intelligent- search in the space of solutions. The Traveling Salesman Problem (TSP) is intended for applications based on a transgenetic algorithmic known as ProtoG. A strategy is also proposed for the renovation of part of the chromosome population indicated by adopting a minimum limit in the coefficient of variation of the adequation function of the individuals, with calculations based on the population. Statistical methodology is used for the evaluation of the performance of four algorithms, as follows: the proposed ProtoG, two memetic algorithms and a Simulated Annealing algorithm. Three performance analyses of these algorithms are proposed. The first is accomplished through the Logistic Regression, based on the probability of finding an optimal solution for a TSP instance by the algorithm being tested. The second is accomplished through Survival Analysis, based on a probability of the time observed for its execution until an optimal solution is achieved. The third is accomplished by means of a non-parametric Analysis of Variance, considering the Percent Error of the Solution (PES) obtained by the percentage in which the solution found exceeds the best solution available in the literature. Six experiments have been conducted applied to sixty-one instances of Euclidean TSP with sizes of up to 1,655 cities. The first two experiments deal with the adjustments of four parameters used in the ProtoG algorithm in an attempt to improve its performance. The last four have been undertaken to evaluate the performance of the ProtoG in comparison to the three algorithms adopted. For these sixty-one instances, it has been concluded on the grounds of statistical tests that there is evidence that the ProtoG performs better than these three algorithms in fifty instances. In addition, for the thirty-six instances considered in the last three trials in which the performance of the algorithms was evaluated through PES, it was observed that the PES average obtained with the ProtoG was less than 1% in almost half of these instances, having reached the greatest average for one instance of 1,173 cities, with an PES average equal to 3.52%. Therefore, the ProtoG can be considered a competitive algorithm for solving the TSP, since it is not rare in the literature find PESs averages greater than 10% to be reported for instances of this size.
Resumo:
O presente trabalho objetivou verificar a possibilidade da utilização de métodos estatísticos multivariados na caracterização das fases do desenvolvimento do mosaico sucessional de um trecho de floresta estacional semidecidual, através de variáveis estruturais. Foram alocadas parcelas de 10 m x 10 m, em que se procedeu à análise estrutural, ou seja, levantamento fitossociológico acrescido das variáveis Porcentagem de Cobertura (PC), Altura do Dossel (AD) e Cobertura por Lianas (CL). Os métodos estatísticos empregados foram Análise de Componentes Principais e Análise de Agrupamento, mais especificamente Classificação Hierárquica Ascendente. O primeiro componente principal explicou 43,96% da variância total, enquanto o segundo, 25,66%. As variáveis Área Basal (AB), Diâmetro Médio (DM) e Dominância Média (DOM) apresentaram correlações positivas entre si superiores a 0,75, podendo ser DM e DOM consideradas como um grupo de variáveis. As variáveis Número de Indivíduos (NI) e Número de Espécies (NE) apresentaram correlação 0,60, enquanto AD, CL e PC baixas correlações com as demais, indicando a importância da inclusão destas na análise. A classificação hierárquica e a partição dos grupos em quatro foram feitas considerando os dois primeiros eixos fatoriais. Os resultados indicaram dois comportamentos diferenciados: 1) valores baixos para AD e AB: Grupo 1, com valores baixos também para NI, NE e PC (fase de clareira); e Grupo 2, com valores elevados para NI e CL e baixos para DOM e DM (fase de construção); e 2) valores altos para AD e AB: Grupo 3, com valores altos também para NI, NE e PC e valor baixo para CL (fase madura); e Grupo 4, com valores elevados para DOM e DM e mais baixos para CL (fase de degradação). Os métodos estatísticos multivariados permitiram caracterizar as fases do desenvolvimento do mosaico sucessional, através das variáveis estruturais. A forma como foram estimadas as variáveis AD, CL e PC, porém, deve ser aprimorada, assim como é preciso incluir variáveis que discriminem melhor cada fase.
Resumo:
Heavy metals can cause problems of human poisoning by ingestion of contaminated food, and the environment, a negative impact on the aquatic fauna and flora. And for the presence of these metals have been used for aquatic animals biomonitoramento environment. This research was done in order to assess the environmental impact of industrial and domestic sewage dumped in estuaries potiguares, from measures of heavy metals in mullet. The methods used for these determinations are those in the literature for analysis of food and water. Collections were 20 samples of mullet in several municipality of the state of Rio Grande do Norte, from the estuaries potiguares. Were analyzed the content of humidity, ash and heavy metals. The data were subjected to two methods of exploratory analysis: analysis of the main components (PCA), which provided a multivariate interpretation, showing that the samples are grouped according to similarities in the levels of metals and analysis of hierarchical groupings (HCA), producing similar results. These tests have proved useful for the treatment of the data producing information that would hardly viewed directly in the matrix of data. The analysis of the results shows the high levels of metallic species in samples Mugil brasiliensis collected in Estuaries /Potengi, Piranhas/Açu, Guaraíra / Papeba / Arês and Curimataú
Resumo:
The objective of the present work is the construction of percentual indexes of sustainable development "Status" - PIDSD (%) aiming to order and classify the rural settlements, considering agricultural, socioeconomic and environmental aspects, in order to diagnose their sustainable reality. This way, we considered multivariate statistical procedure to establish analytical descriptors - indexes - like the principal components technique (CP). The CP technique was used in a matrix formed by 47 variables observed in 50 rural settlements, distributed in seven different regions of the state of Mato Grosso, obtained from diagnostics, provided by "Mato-Grossense" Enterprise of Research, Assistance and Rural Extension S/A - EMPAER - MT, in order to obtain the indexes used in the construction of PIDSD (%). The settlements with higher PIDSD (%) were considered "higher potential" or "higher sustainable" in relation to the analyzed variables, making the establishment of assistance strategies and cooperation possible, allowing the government and civil society in general, to improve those with worse results ("lower potential" or "lower sustainable"), and search for ways to strengthen and multiply the results of the "higher potential" settlements. Vale do Seringal settlement had the best conditions in relation to the variables, mainly those of higher weigh and was considered the one with "higher potential". São Sebastião had the worst conditions and was considered "lower potential".
Resumo:
Skeletal muscle consists of muscle fiber types that have different physiological and biochemical characteristics. Basically, the muscle fiber can be classified into type I and type II, presenting, among other features, contraction speed and sensitivity to fatigue different for each type of muscle fiber. These fibers coexist in the skeletal muscles and their relative proportions are modulated according to the muscle functionality and the stimulus that is submitted. To identify the different proportions of fiber types in the muscle composition, many studies use biopsy as standard procedure. As the surface electromyography (EMGs) allows to extract information about the recruitment of different motor units, this study is based on the assumption that it is possible to use the EMG to identify different proportions of fiber types in a muscle. The goal of this study was to identify the characteristics of the EMG signals which are able to distinguish, more precisely, different proportions of fiber types. Also was investigated the combination of characteristics using appropriate mathematical models. To achieve the proposed objective, simulated signals were developed with different proportions of motor units recruited and with different signal-to-noise ratios. Thirteen characteristics in function of time and the frequency were extracted from emulated signals. The results for each extracted feature of the signals were submitted to the clustering algorithm k-means to separate the different proportions of motor units recruited on the emulated signals. Mathematical techniques (confusion matrix and analysis of capability) were implemented to select the characteristics able to identify different proportions of muscle fiber types. As a result, the average frequency and median frequency were selected as able to distinguish, with more precision, the proportions of different muscle fiber types. Posteriorly, the features considered most able were analyzed in an associated way through principal component analysis. Were found two principal components of the signals emulated without noise (CP1 and CP2) and two principal components of the noisy signals (CP1 and CP2 ). The first principal components (CP1 and CP1 ) were identified as being able to distinguish different proportions of muscle fiber types. The selected characteristics (median frequency, mean frequency, CP1 and CP1 ) were used to analyze real EMGs signals, comparing sedentary people with physically active people who practice strength training (weight training). The results obtained with the different groups of volunteers show that the physically active people obtained higher values of mean frequency, median frequency and principal components compared with the sedentary people. Moreover, these values decreased with increasing power level for both groups, however, the decline was more accented for the group of physically active people. Based on these results, it is assumed that the volunteers of the physically active group have higher proportions of type II fibers than sedentary people. Finally, based on these results, we can conclude that the selected characteristics were able to distinguish different proportions of muscle fiber types, both for the emulated signals as to the real signals. These characteristics can be used in several studies, for example, to evaluate the progress of people with myopathy and neuromyopathy due to the physiotherapy, and also to analyze the development of athletes to improve their muscle capacity according to their sport. In both cases, the extraction of these characteristics from the surface electromyography signals provides a feedback to the physiotherapist and the coach physical, who can analyze the increase in the proportion of a given type of fiber, as desired in each case.
Resumo:
Doutoramento em Matemática e Estatística - Instituto Superior de Agronomia - UL
Resumo:
Produced water is characterized as one of the most common wastes generated during exploration and production of oil. This work aims to develop methodologies based on comparative statistical processes of hydrogeochemical analysis of production zones in order to minimize types of high-cost interventions to perform identification test fluids - TIF. For the study, 27 samples were collected from five different production zones were measured a total of 50 chemical species. After the chemical analysis was applied the statistical data, using the R Statistical Software, version 2.11.1. Statistical analysis was performed in three steps. In the first stage, the objective was to investigate the behavior of chemical species under study in each area of production through the descriptive graphical analysis. The second step was to identify a function that classify production zones from each sample, using discriminant analysis. In the training stage, the rate of correct classification function of discriminant analysis was 85.19%. The next stage of processing of the data used for Principal Component Analysis, by reducing the number of variables obtained from the linear combination of chemical species, try to improve the discriminant function obtained in the second stage and increase the discrimination power of the data, but the result was not satisfactory. In Profile Analysis curves were obtained for each production area, based on the characteristics of the chemical species present in each zone. With this study it was possible to develop a method using hydrochemistry and statistical analysis that can be used to distinguish the water produced in mature fields of oil, so that it is possible to identify the zone of production that is contributing to the excessive elevation of the water volume.
Resumo:
OBJETIVO: Existem vários critérios para a escolha do número de componentes a serem mantidos na análise de componentes principais. Esta escolha pode dar-se por critérios arbitrários (critério de Kaiser p.ex.) ou subjetivos (fatores interpretáveis). Apresenta-se o critério de simulação de Lébart e Dreyfus. MÉTODO: É gerada uma matriz de números aleatórios, sendo realizada a análise de componentes principais a partir dessa matriz. Os componentes extraídos de um conjunto de dados como este representam um limite inferior que deve ser ultrapassado para que um componente possa ser selecionado. Utiliza-se como exemplo a análise de componentes principais da escala de Hamilton para a depressão (17 itens) numa amostra de 130 pacientes. RESULTADOS E CONCLUSÕES: O método de simulação é comparado com o método de Kaiser. É mostrado que o método de simulação mantém apenas os componentes clinicamente significativos ao contrário do método de Kaiser.
Resumo:
Currently, owing to the occurrence of environmental problems, along with the need of environmental preservation, both the territory management of Hydrographic Basin and the conservation of natural resources have proven to have remarkable importance. Thus, the mean goal of the research is to raise and scrutinize social-economic and technologic data from the Mogi Guaçu River Hydrographic Basin (São Paulo, Brazil). The aim is to group municipalities with similar characteristics regarding the collected data, which may direct joint actions in the Hydrographic Basin Management. There were used both the methods of factorial analysis and automatic hierarchical classifications. Additionally, there is going to be applied a Geographical Information System to represent the outcomes of the methods aforementioned, through the evolvement of a geo-referenced database, which will allow the obtainment of information categorically distributed including theme maps of interest. The main characteristics adopted to group the municipalities were: agricultural area, sugar cane production, small farms, animal production, number of agriculture machinery and equipments and agricultural income. The methodology adopted in the Mogi Guaçu River Hydrographic Basin will be analyzed vis-à-vis its appropriateness on basin management, as well as the possibility of assisting the studies on behalf of the São Paulo Hydrographic Basin groups, to regional development.
Resumo:
Descriptive terminology and sensory profile of three varieties of brazilian varietal white wines (cultivars Riesling, Gewürztraminer and Chardonnay) were developed by a methodology based on the Quantitative Descriptive Analysis (QDA). The sensory panel consensually defined the sensory descriptors, their respective reference materials and the descriptive evaluation ballot. Ten individuals were selected as judges based on their discrimination, reproducibility and individual consensus with the sensory panel. Twelve descriptors were generated showing similarities and differences among the wine samples. Each descriptor was evaluated using a nine-centimeters non-structured scale with the intensity terms anchored at its ends. The collected data were analysed by ANOVA, Tukey test and Principal Component Analysis (PCA). The results showed a great difference within the sensory profile of Riesling and Gewürztraminer wines, whereas Chardonnay wines showed a lesser variation. PCA separated samples into two groups: a first group formed by wines higher in sweetness and fruitty flavor and aroma; and a second group of wines higher in sourness, adstringency, bitterness, alcoholic and fermented flavors.
Resumo:
Com o fenômeno da globalização, a contabilidade tem passado por modificações que se refletiram, inicialmente, no âmbito empresarial e só alguns anos mais tarde na administração pública. Há uma tendência cada vez maior e crescente de pressionar os governantes para ampliarem o campo de suas responsabilidades financeiras a partir da contabilidade voltada para a gestão das contas públicas. A responsabilidade fiscal dos governos é fundamental para fomentar o crescimento econômico e o desenvolvimento dos municípios. O presente estudo visa demonstrar em que medida os índices de análise econômico-financeira podem ser empregados na administração pública a fim de se estabelecer um ranking na gestão dos municípios catarinenses. Para atingir tal finalidade, utilizou-se de conceitos e técnicas de análise de balanço, os quais foram tratados com a aplicação da análise de componentes principais. Os resultados indicam a importância do emprego da análise de balanços na área pública como ferramenta a ser utilizada no cotidiano da administração governamental.
Resumo:
Apesar da existência de contatos entre planalto e litoral brasileiros ser atualmente quase um consenso entre os arqueólogos nacionais, não há muita certeza de como teria se dado tal contato e qual seria o fluxo entre interior e costa. O vale do Ribeira de Iguape (SP) é uma das raras regiões do Sul-Sudeste do país onde tal comunicação seria bastante facilitada devido a peculiaridades de sua geomorfologia. Neste trabalho, apresentamos os resultados de uma análise craniométrica comparativa entre 12 esqueletos provenientes de sambaquis fluviais do vale do Ribeira datados entre 6.000 e 1.200 anos AP e 225 esqueletos oriundos de diversas séries pré-históricas brasileiras do interior e do litoral. Ao contrário do que se observa no início do Holoceno nesse vale, não há qualquer afinidade biológica entre os ribeirinhos mais tardios e os paleoíndios de Lagoa Santa ou qualquer outra série interiorana. Os grupos fluviais (ambos os sexos) associam-se aos sambaquis da costa de São Paulo e do Paraná, mostrando que houve realmente um contato considerável entre a planície costeira e o planalto, ao menos no estado de São Paulo a partir da segunda metade do Holoceno.