993 resultados para multivariate methods
Resumo:
Accurate monitoring of degradation levels in soils is essential in order to understand and achieve complete degradation of petroleum hydrocarbons in contaminated soils. We aimed to develop the use of multivariate methods for the monitoring of biodegradation of diesel in soils and to determine if diesel contaminated soils could be remediated to a chemical composition similar to that of an uncontaminated soil. An incubation experiment was set up with three contrasting soil types. Each soil was exposed to diesel at varying stages of degradation and then analysed for key hydrocarbons throughout 161 days of incubation. Hydrocarbon distributions were analysed by Principal Coordinate Analysis and similar samples grouped by cluster analysis. Variation and differences between samples were determined using permutational multivariate analysis of variance. It was found that all soils followed trajectories approaching the chemical composition of the unpolluted soil. Some contaminated soils were no longer significantly different to that of uncontaminated soil after 161 days of incubation. The use of cluster analysis allows the assignment of a percentage chemical similarity of a diesel contaminated soil to an uncontaminated soil sample. This will aid in the monitoring of hydrocarbon contaminated sites and the establishment of potential endpoints for successful remediation.
Resumo:
Several Brazilian commercial gasoline physicochemical parameters, such as relative density, distillation curve (temperatures related to 10%, 50% and 90% of distilled volume, final boiling point and residue), octane numbers (motor and research octane number and anti-knock index), hydrocarbon compositions (olefins, aromatics and saturates) and anhydrous ethanol and benzene content was predicted from chromatographic profiles obtained by flame ionization detection (GC-FID) and using partial least square regression (PLS). GC-FID is a technique intensively used for fuel quality control due to its convenience, speed, accuracy and simplicity and its profiles are much easier to interpret and understand than results produced by other techniques. Another advantage is that it permits association with multivariate methods of analysis, such as PLS. The chromatogram profiles were recorded and used to deploy PLS models for each property. The standard error of prediction (SEP) has been the main parameter considered to select the "best model". Most of GC-FID-PLS results, when compared to those obtained by the Brazilian Government Petroleum, Natural Gas and Biofuels Agency - ANP Regulation 309 specification methods, were very good. In general, all PLS models developed in these work provide unbiased predictions with lows standard error of prediction and percentage average relative error (below 11.5 and 5.0, respectively). (C) 2007 Elsevier B.V. All rights reserved.
Resumo:
This work outlines the theoretical advantages of multivariate methods in biomechanical data, validates the proposed methods and outlines new clinical findings relating to knee osteoarthritis that were made possible by this approach. New techniques were based on existing multivariate approaches, Partial Least Squares (PLS) and Non-negative Matrix Factorization (NMF) and validated using existing data sets. The new techniques developed, PCA-PLS-LDA (Principal Component Analysis – Partial Least Squares – Linear Discriminant Analysis), PCA-PLS-MLR (Principal Component Analysis – Partial Least Squares –Multiple Linear Regression) and Waveform Similarity (based on NMF) were developed to address the challenging characteristics of biomechanical data, variability and correlation. As a result, these new structure-seeking technique revealed new clinical findings. The first new clinical finding relates to the relationship between pain, radiographic severity and mechanics. Simultaneous analysis of pain and radiographic severity outcomes, a first in biomechanics, revealed that the knee adduction moment’s relationship to radiographic features is mediated by pain in subjects with moderate osteoarthritis. The second clinical finding was quantifying the importance of neuromuscular patterns in brace effectiveness for patients with knee osteoarthritis. I found that brace effectiveness was more related to the patient’s unbraced neuromuscular patterns than it was to mechanics, and that these neuromuscular patterns were more complicated than simply increased overall muscle activity, as previously thought.
Resumo:
Sum: Plant biologists in fields of ecology, evolution, genetics and breeding frequently use multivariate methods. This paper illustrates Principal Component Analysis (PCA) and Gabriel's biplot as applied to microarray expression data from plant pathology experiments. Availability: An example program in the publicly distributed statistical language R is available from the web site (www.tpp.uq.edu.au) and by e-mail from the contact. Contact: scott.chapman@csiro.au.
Resumo:
The GxE interaction only became widely discussed from evolutionary studies and evaluations of the causes of behavioral changes of species cultivated in environments. In the last 60 years, several methodologies for the study of adaptability and stability of genotypes in multiple environments trials were developed in order to assist the breeder's choice regarding which genotypes are more stable and which are the most suitable for the crops in the most diverse environments. The methods that use linear regression analysis were the first to be used in a general way by breeders, followed by multivariate analysis methods and mixed models. The need to identify the genetic and environmental causes that are behind the GxE interaction led to the development of new models that include the use of covariates and which can also include both multivariate methods and mixed modeling. However, further studies are needed to identify the causes of GxE interaction as well as for the more accurate measurement of its effects on phenotypic expression of varieties in competition trials carried out in genetic breeding programs.
Resumo:
Olive oil quality grading is traditionally assessed by human sensory evaluation of positive and negative attributes (olfactory, gustatory, and final olfactorygustatory sensations). However, it is not guaranteed that trained panelist can correctly classify monovarietal extra-virgin olive oils according to olive cultivar. In this work, the potential application of human (sensory panelists) and artificial (electronic tongue) sensory evaluation of olive oils was studied aiming to discriminate eight single-cultivar extra-virgin olive oils. Linear discriminant, partial least square discriminant, and sparse partial least square discriminant analyses were evaluated. The best predictive classification was obtained using linear discriminant analysis with simulated annealing selection algorithm. A low-level data fusion approach (18 electronic tongue signals and nine sensory attributes) enabled 100 % leave-one-out cross-validation correct classification, improving the discrimination capability of the individual use of sensor profiles or sensory attributes (70 and 57 % leave-one-out correct classifications, respectively). So, human sensory evaluation and electronic tongue analysis may be used as complementary tools allowing successful monovarietal olive oil discrimination.
Resumo:
A partir de las últimas décadas se ha impulsado el desarrollo y la utilización de los Sistemas de Información Geográficos (SIG) y los Sistemas de Posicionamiento Satelital (GPS) orientados a mejorar la eficiencia productiva de distintos sistemas de cultivos extensivos en términos agronómicos, económicos y ambientales. Estas nuevas tecnologías permiten medir variabilidad espacial de propiedades del sitio como conductividad eléctrica aparente y otros atributos del terreno así como el efecto de las mismas sobre la distribución espacial de los rendimientos. Luego, es posible aplicar el manejo sitio-específico en los lotes para mejorar la eficiencia en el uso de los insumos agroquímicos, la protección del medio ambiente y la sustentabilidad de la vida rural. En la actualidad, existe una oferta amplia de recursos tecnológicos propios de la agricultura de precisión para capturar variación espacial a través de los sitios dentro del terreno. El óptimo uso del gran volumen de datos derivado de maquinarias de agricultura de precisión depende fuertemente de las capacidades para explorar la información relativa a las complejas interacciones que subyacen los resultados productivos. La covariación espacial de las propiedades del sitio y el rendimiento de los cultivos ha sido estudiada a través de modelos geoestadísticos clásicos que se basan en la teoría de variables regionalizadas. Nuevos desarrollos de modelos estadísticos contemporáneos, entre los que se destacan los modelos lineales mixtos, constituyen herramientas prometedoras para el tratamiento de datos correlacionados espacialmente. Más aún, debido a la naturaleza multivariada de las múltiples variables registradas en cada sitio, las técnicas de análisis multivariado podrían aportar valiosa información para la visualización y explotación de datos georreferenciados. La comprensión de las bases agronómicas de las complejas interacciones que se producen a la escala de lotes en producción, es hoy posible con el uso de éstas nuevas tecnologías. Los objetivos del presente proyecto son: (l) desarrollar estrategias metodológicas basadas en la complementación de técnicas de análisis multivariados y geoestadísticas, para la clasificación de sitios intralotes y el estudio de interdependencias entre variables de sitio y rendimiento; (ll) proponer modelos mixtos alternativos, basados en funciones de correlación espacial de los términos de error que permitan explorar patrones de correlación espacial de los rendimientos intralotes y las propiedades del suelo en los sitios delimitados. From the last decades the use and development of Geographical Information Systems (GIS) and Satellite Positioning Systems (GPS) is highly promoted in cropping systems. Such technologies allow measuring spatial variability of site properties including electrical conductivity and others soil features as well as their impact on the spatial variability of yields. Therefore, site-specific management could be applied to improve the efficiency in the use of agrochemicals, the environmental protection, and the sustainability of the rural life. Currently, there is a wide offer of technological resources to capture spatial variation across sites within field. However, the optimum use of data coming from the precision agriculture machineries strongly depends on the capabilities to explore the information about the complex interactions underlying the productive outputs. The covariation between spatial soil properties and yields from georeferenced data has been treated in a graphical manner or with standard geostatistical approaches. New statistical modeling capabilities from the Mixed Linear Model framework are promising to deal with correlated data such those produced by the precision agriculture. Moreover, rescuing the multivariate nature of the multiple data collected at each site, several multivariate statistical approaches could be crucial tools for data analysis with georeferenced data. Understanding the basis of complex interactions at the scale of production field is now within reach the use of these new techniques. Our main objectives are: (1) to develop new statistical strategies, based on the complementarities of geostatistics and multivariate methods, useful to classify sites within field grown with grain crops and analyze the interrelationships of several soil and yield variables, (2) to propose mixed linear models to predict yield according spatial soil variability and to build contour maps to promote a more sustainable agriculture.
Resumo:
Entre el 29 de setiembre y 3 de octubre 2011, se determinó la estructura de las comunidades, en términos de abundancia relativa, riqueza y diversidad y relación con el medio ambiente de la isla Lobos de Tierra (noreste de isla Rata, El Ñopo y La Grama). En los cálculos de diversidad específi ca se usó métodos uni y multivariados para hacer comparaciones en los lugares estudiados. En la zona mesolitoral se registró alta riqueza específi ca asociada a comunidades de fondos duros destacando el noreste de isla Rata con 58 especies. El grupo dominante fue moluscos en todas las zonas de estudio, sobresaliendo Tegula corvus y Acanthopleura echinata con mayores niveles de abundancia (296 a 412 ind.m-2). El índice de diversidad (H’) promedio por estación mostró valores >1,5 bits/ind en todas las zonas, con valores de dominancia y equitatividad <1,0. En el submareal, la riqueza fue de 124 especies. Los crustáceos y poliquetos tuvieron la mayor riqueza de especies y densidad. Los principales representantes fueron Gammarus sp. (26.607 ind.m-2), Spionidae (2.227 ind.m-2) y Diopatra rhizoicola (2.073 ind.m-2). El índice de diversidad promedio fue 2,2 bits, valor considerado de alta diversidad. La fauna íctica submareal estuvo conformada por 16 especies, destacando los géneros Auchenionchus y Labrisomus y en el intermareal se registraron 7 especies destacando Tomicodon chilensis. Se detectaron 24 especies de macroalgas: Rhodophyta (16 especies), Chlorophyta (5 especies) y Phaeophyta (3 especies), predominó Caulerpa fi liformis (submareal) y Gymnogongrus furcellatus (intermareal).
Resumo:
The correlation between the species composition of pasture communities and soil properties in Plana de Vic has been studied using two multivariate methods, Correspondence Analysis (CA) for the vegetation data and Principal Component Analysis (PCA) for the soil data. To analyse the pastures, we took 144 vegetation relevés (comprising 201 species) that have been classified into 10 phytocoenological communities elsewhere. Most of these communities are almost entirely built up by perennials, ranging from xerophilous, clearly Mediterranean, to mesophilous, related to medium-European pastures, but a few occurring in shallow soils are dominated by therophytes. As for the soil properties, we analysed texture, pH, depth, bulk density, organic matter, C/N ratio and the carbonates content of 25 samples, correspondingto representative relevés of the communities studied.
Resumo:
Colorectal cancer (CRC) is the second leading cause of cancer-related death in developed countries. Early detection of CRC leads to decreased CRC mortality. A blood-based CRC screening test is highly desirable due to limited invasiveness and high acceptance rate among patients compared to currently used fecal occult blood testing and colonoscopy. Here we describe the discovery and validation of a 29-gene panel in peripheral blood mononuclear cells (PBMC) for the detection of CRC and adenomatous polyps (AP). Blood samples were prospectively collected from a multicenter, case-control clinical study. First, we profiled 93 samples with 667 candidate and 3 reference genes by high throughput real-time PCR (OpenArray system). After analysis, 160 genes were retained and tested again on 51 additional samples. Low expressed and unstable genes were discarded resulting in a final dataset of 144 samples profiled with 140 genes. To define which genes, alone or in combinations had the highest potential to discriminate AP and/or CRC from controls, data were analyzed by a combination of univariate and multivariate methods. A list of 29 potentially discriminant genes was compiled and evaluated for its predictive accuracy by penalized logistic regression and bootstrap. This method discriminated AP >1cm and CRC from controls with a sensitivity of 59% and 75%, respectively, with 91% specificity. The behavior of the 29-gene panel was validated with a LightCycler 480 real-time PCR platform, commonly adopted by clinical laboratories. In this work we identified a 29-gene panel expressed in PBMC that can be used for developing a novel minimally-invasive test for accurate detection of AP and CRC using a standard real-time PCR platform.
Resumo:
Tässä työssä on tutkittu sellun kuivauskoneella koivusellua kuivattaessa esiintyvää ajettavuusongelmaa. Tähän ongelmaan on etsitty ratkaisua monimuuttujamenetelmien avulla. Työn kirjallisuusosassa on lyhyesti käyty läpi sellun kuivauskoneiden historia lieriöviirakoneista nykyaikaisiin kaksoisviirasovelluksiin. Lisäksi kirjallisuusosassa on käsitelty tässä työssä käytettyjen monimuuttujamenetelmien perusteet ja käyty esimerkin omaisesti läpi joitakin kemometrian sovelluksia puunjalostusteollisuudessa. Työn kokeellisessa osassa on haettu tiedonkeruujärjestelmästä massan ominaisuuksista ja kuivauskoneen ajoparametreistä koostuvaa dataa vuoden 1999 ajalta. Tästä datasta on tehty PCA-mallit vuoden 1999 jokaisen kuukauden datasta ja diskriminoiva PLS-malli ajettavuuden kannalta hyvästä ja huonosta jaksosta koostuvasta datasta. Kuivauskoneelta on koottu dataa myös ottamalla kiertovesi- ja selluarkkinäytteitä. Kiertovesinäytteiden analyysituloksista on tehty PCA-mallit ja lisäksi analyysitulokset on tarkasteltu yksimuuttujaisesti. Selluarkkinäytteistä on määritetty UV-spektrit ja niistä on tehty PCA-mallit, jotta spektreistä saataisiin mahdollisimman paljon informaatiota. Työssä on havaittu yhdeksi selitykseksi ajettavuusongelmalle kuivauskoneen kiertovesien likaantuminen. Kiertovesiä ja massarainan pintakemiaa tulisi kuitenkin tutkia laajemmassa mittakaavassa kuin tämän työn puitteissa on ollut mahdollista tutkia.
Resumo:
The uncertainty of any analytical determination depends on analysis and sampling. Uncertainty arising from sampling is usually not controlled and methods for its evaluation are still little known. Pierre Gy’s sampling theory is currently the most complete theory about samplingwhich also takes the design of the sampling equipment into account. Guides dealing with the practical issues of sampling also exist, published by international organizations such as EURACHEM, IUPAC (International Union of Pure and Applied Chemistry) and ISO (International Organization for Standardization). In this work Gy’s sampling theory was applied to several cases, including the analysis of chromite concentration estimated on SEM (Scanning Electron Microscope) images and estimation of the total uncertainty of a drug dissolution procedure. The results clearly show that Gy’s sampling theory can be utilized in both of the above-mentioned cases and that the uncertainties achieved are reliable. Variographic experiments introduced in Gy’s sampling theory are beneficially applied in analyzing the uncertainty of auto-correlated data sets such as industrial process data and environmental discharges. The periodic behaviour of these kinds of processes can be observed by variographic analysis as well as with fast Fourier transformation and auto-correlation functions. With variographic analysis, the uncertainties are estimated as a function of the sampling interval. This is advantageous when environmental data or process data are analyzed as it can be easily estimated how the sampling interval is affecting the overall uncertainty. If the sampling frequency is too high, unnecessary resources will be used. On the other hand, if a frequency is too low, the uncertainty of the determination may be unacceptably high. Variographic methods can also be utilized to estimate the uncertainty of spectral data produced by modern instruments. Since spectral data are multivariate, methods such as Principal Component Analysis (PCA) are needed when the data are analyzed. Optimization of a sampling plan increases the reliability of the analytical process which might at the end have beneficial effects on the economics of chemical analysis,
Resumo:
This study developed and validated a method for moisture determination in artisanal Minas cheese, using near-infrared spectroscopy and partial-least-squares. The model robustness was assured by broad sample diversity, real conditions of routine analysis, variable selection, outlier detection and analytical validation. The model was built from 28.5-55.5% w/w, with a root-mean-square-error-of-prediction of 1.6%. After its adoption, the method stability was confirmed over a period of two years through the development of a control chart. Besides this specific method, the present study sought to provide an example multivariate metrological methodology with potential for application in several areas, including new aspects, such as more stringent evaluation of the linearity of multivariate methods.
Resumo:
The resources of the step family and the children’s well-being The present study investigates children's well-being in stepfamilies and fac¬tors, both external and internal, that are related to the children's well-being. Of the external factors, the study focuses on factors related to the structure of the stepfamily, parents' education, socio-economic status and factors related to work, livelihood and living conditions. The internal resources include the general functioning of the family, parenthood and parenting, family support networks and issues that the stepfamilies themselves consider important. Another important resource in a stepfamily is a functioning network of human relationships, which in the present study is approached from the maternal viewpoint. Changing family relations are considered a potential threat to the children's well-being. Therefore, in addition to looking into the stepfamily's resources, the other important goal of the study is to explore other factors potentially related to the well-being of children living in stepfamilies. In view of the stepfamily's resources, it is important to explore how the functioning of the relationships network is linked with the child's well-being. The study employs survey and interview data. The survey data (n=2236) are part of national survey data on the well-being of families and children and factors impacting them which were gathered as part of ”Origins of Exclusion in Early Childhood”, a research project carried out in 2002. The data consists of 667 stepfamilies. The interview data consists of interviews with 24 parents in stepfamilies. In the study, the analyses of survey and interview data are combined. Both descriptive statistical analyses and multivariate methods are employed. Content analysis is employed in the analysis of the interview data. The results indicate that the stepfamilies’ resources in general but their external resources in particular differed from those of the nuclear and single-parent families. The level of education and the socio-economic status of the stepfamily parents were somewhat lower than those of the nuclear family parents. The differences in relation to single-parent families were primarily related to the better economic status of the stepfamilies. The analysis of internal resources showed relatively minor differences: the stepfamilies assessed themselves a somewhat better general functioning of the family than did the nuclear families. Parenting issues caused more disagreement in stepfamilies than in nuclear families. The analysis of the functioning of the human relations in stepfamilies showed that the stepfamily mothers experienced the external relationships of the family (e.g., between the child and the absent father) as significantly more problematic than the relationships within the stepfamily. Living in a stepfamily thus challenges the functioning of the relationship between the child and the absent father. As a result of the analysis of the relationships networks in the stepfamilies, three groups were formed. One group had the nuclear family as an ideal goal, another valued an extended family composed of a variety of relationships, and the third one appreciated a strong intimate relationship between the parents. In the present study, the most common group was the multi-relationship, extended type of stepfamily. In conclusion, living in a stepfamily does not seem to pose a risk to the child’s well-being, but it may influence the family’s resources and thus have an indirect effect on the child’s well-being. In view of the resources of the stepfamily, the child’s well-being was best supported by a functioning network of human relationships in the stepfamily: there was a distinct connection with the children’s problems and the non-functioning of the relationships network. According to the mothers, the internal relationships in the stepfamily seemed to be more important than the external relationships of the family. A child’s functioning relationship with the absent father can be viewed as a positive resource, supporting the child’s well-being in the stepfamily.
Resumo:
The environmental impacts of a single mine often remain local, but acidic and metal-rich acid mine drainage (AMD) from the waste materials may pose a serious threat to adjacent surface waters and their ecosystems. Testate amoebae (thecamoebian) analysis was used together with lake sediment geochemistry to study and evaluate the ecological effects of sulphidic metal mines on aquatic environments. Three different mines were included in the study: Luikonlahti Cu-mine in Kaavi, eastern Finland, Haveri Cu-Au mine in Ylöjärvi, southern Finland and Pyhäsalmi Zn-Cu-S mine in Pyhäjärvi, central Finland. Luikonlahti and Haveri are closed mines, but Pyhäsalmi is still operating. The sampling strategy was case specific, and planned to provide a representative sediment sample series to define natural background conditions, to detect spatial and temporal variations in mine impacts, to evaluate the possible recovery after the peak contamination, and to distinguish the effects of other environmental factors from the mining impacts. In the Haveri case, diatom analyses were performed alongside thecamoebian analysis to evaluate the similarities and differences between the two proxies. The results of the analyses were investigated with multivariate methods (direct and indirect ordinations, diversity and distance measure indices). Finally, the results of each case study were harmonized, pooled, and jointly analyzed to summarize the results for this dissertation. Geochemical results showed broadly similar temporal patterns in each case. Concentrations of ions in the pre-disturbance samples defined the natural baseline against which other results were compared. The beginning of the mining activities had only minor impacts on sediment geochemistry, mainly appearing as an increased clastic input into the lakes at Haveri and Pyhäsalmi. The active mining phase was followed by the metallic contamination and, subsequently, by the most recent change towards decreased but still elevated metal concentrations in the sediments. Because of the delay in the oxidation of waste material and formation of AMD, the most intense, but transient metal contamination phase occurred in the post-mining period at Luikonlahti and Haveri. At Pyhäsalmi, the highest metal contamination preceded effluent mitigation actions. Spatial gradients were observed besides the temporal evolution in both the pre-disturbance and mine-impacted samples from Luikonlahti and Pyhäsalmi. The geochemical gradients varied with distance from the main source of contaminants (dispersion and dilution) and with water depth (redox and pH). The spatial extent of the highest metal contamination associated with these mines remained rather limited. At Haveri, the metallic impact was widespread, with the upstream site in another lake basin found to be contaminated. Changes in thecamoebian assemblages corresponded well with the geochemical results. Despite some differences, the general features and ecological responses of the faunal assemblages were rather similar in each lake. Constantly abundant strains of Difflugia oblonga, Difflugia protaeiformis and centropyxids formed the core of these assemblages. Increasing proportions of Cucurbitella tricuspis towards the surface samples were found in all of the cases. The results affirmed the indicator value of some already known indicator forms, but such as C. tricuspis and higher nutrient levels, but also elicited possible new ones such as D. oblonga ‘spinosa’ and clayey substrate, high conductivity and/or alkalinity, D. protaeiformis ‘multicornis’ and pH, water hardness and the amount of clastic material and Centropyxis constricta ‘aerophila’ and high metal and S concentrations. In each case, eutrophication appeared to be the most important environmental factor, masking the effects of other variables. Faunal responses to high metal inputs in sediments remained minor, but were nevertheless detectable. Besides the trophic state of the lake, numerical methods suggested overall geochemical conditions (pH, redox) to be the most important factor at Luikonlahti, whereas the Haveri results showed the clearest connection between metals and amoebae. At Pyhäsalmi, the strongest relationships were found between Ca- and S-rich present loading, redox conditions and substrate composition. Sediment geochemistry and testate amoeba analysis proved to be a suitable combination of methods to detect and describe the aquatic mine impacts in each specific case, to evaluate recovery and to differentiate between the effects of different anthropogenic and natural environmental factors. It was also suggested that aquatic mine impacts can be significantly mitigated by careful design and after-care of the waste facilities, especially by reducing and preventing AMD. The case-specific approach is nevertheless necessary because of the unique characteristics of each mine and variations in the environmental background conditions.