852 results for improved principal components analysis (IPCA) algorithm


Relevance:

100.00% 100.00%

Publisher:

Abstract:

This thesis, entitled “Studies on Nitrifying Microorganisms in Cochin Estuary and Adjacent Coastal Waters”, reports for the first time the spatial and temporal variations in the abundance and activity of nitrifiers (ammonia-oxidizing bacteria, AOB; nitrite-oxidizing bacteria, NOB; and ammonia-oxidizing archaea, AOA) from the Cochin Estuary (CE), a monsoon-driven, nutrient-rich tropical estuary along the southwest coast of India. To this end, field observations were carried out for a period of one year (2011) in the CE. Surface (1 m below surface) and near-bottom water samples were collected from four locations (stations 1 to 3 in the estuary and station 4 in the coastal region), covering the pre-monsoon, monsoon and post-monsoon seasons. Station 1 is a low-salinity station (salinity range 0-10) with high freshwater influx, while stations 2 and 3 are intermediate-salinity stations (salinity range 10-25). Station 4 is located ~20 km away from station 3, has the least freshwater influence and is considered a high-salinity station (salinity range 25-35). Ambient physicochemical parameters such as temperature, pH, salinity, dissolved oxygen (DO), ammonium, nitrite, nitrate, phosphate and silicate of surface and bottom waters were measured using standard techniques. The abundance of Eubacteria, total Archaea, and ammonia- and nitrite-oxidizing bacteria (AOB and NOB) was quantified using fluorescent in situ hybridization (FISH) with oligonucleotide probes labelled with Cy3. The community structure of AOB and AOA was studied using PCR denaturing gradient gel electrophoresis (DGGE). PCR products were cloned and sequenced to determine approximate phylogenetic affiliations. Nitrification rates in the water samples were analyzed using the chemical inhibitors NaClO3 (inhibitor of nitrite oxidation) and ATU (inhibitor of ammonium oxidation). The contribution of AOA and AOB to the ammonia oxidation process was measured based on the recovered ammonia oxidation rate.
The contributions of AOB and AOA were analyzed after inhibiting the activities of AOB and AOA separately using specific protein inhibitors. To understand the factors influencing or controlling nitrification, various statistical tools were used, viz. Karl Pearson’s correlation (to find the relationship between environmental parameters, bacterial abundance and activity), three-way ANOVA (to find significant variation between observations), Canonical Discriminant Analysis (CDA) (to discriminate stations based on observations), multivariate statistics, principal components analysis (PCA) and a step-up multiple regression model (SMRM) (first-order interaction effects were applied to determine the biological and environmental parameters contributing significantly to the numerical abundance of nitrifiers). In the CE, nitrification is modulated by the complex interplay between different nitrifiers and environmental variables, which in turn is dictated by hydrodynamic characteristics such as freshwater discharge and seawater influx brought in by river water discharge and flushing. AOB in the CE are more adapted to varying environmental conditions than AOA, though the diversity of AOA is higher than that of AOB. The abundance and seasonality of AOB and NOB are influenced by the concentration of ammonia in the water column. AOB are the major players in modulating the ammonia oxidation process in the water column of the CE. The distribution pattern and seasonality of AOB and NOB in the CE suggest that these organisms coexist and are responsible for modulating the entire nitrification process in the estuary. This process is fuelled by cross-feeding among different nitrifiers, which in turn is dictated by nutrient levels, especially ammonia. Though nitrification modulates the increasing anthropogenic ammonia concentration, the anthropogenic inputs have to be controlled to prevent eutrophication and associated environmental changes.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

The increased use of cereal/legume crop rotation has been advocated as a strategy to increase cereal yields of subsistence farmers in West Africa, and is believed to promote changes in the rhizosphere that enhance early plant growth. In this study we investigated the microbial diversity of the rhizoplane from seedlings grown in two soils previously planted to cereal or legume from experimental plots in Gaya, Niger, and Kaboli, Togo. Soils from these legume rotation and continuous cereal plots were placed into containers and sown in a growth chamber with maize (Zea mays L.), millet (Pennisetum glaucum L.), sorghum (Sorghum bicolor L. Moench.), cowpea (Vigna unguiculata L.) or groundnut (Arachis hypogaea L.). At 7 and 14 days after sowing, 16S rDNA profiles of the eubacterial and ammonia-oxidizing communities from the rhizoplane and bulk soil were generated using denaturing gradient gel electrophoresis (DGGE). Community profiles were subjected to peak fitting analyses to quantify the DNA band positions and intensities, after which these data were compared using correspondence and principal components analysis. The data showed that cropping system had a highly significant effect on community structure (p < 0.005), irrespective of plant species or sampling time. Continuous cereal-soil grown plants had highly similar rhizoplane communities across crop species and sites, whereas communities from the rotation soil showed greater variability and clustered with respect to plant species. Analyses of the ammonia-oxidizing communities provided no evidence of any effects of plant species or management history on ammonia oxidizers in soil from Kaboli, but there were large shifts with respect to this group of bacteria in soils from Gaya. The results of these analyses show that crop rotation can cause significant shifts in rhizosphere bacterial communities.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

This study uses data from a sample survey of 200 households drawn from a mountainous commune in Vietnam’s North Central Coast region to measure and explain relative poverty. Principal components analysis is used to construct a multidimensional index of poverty outcomes from variables measuring household income and the value of domestic assets. This index of poverty is then regressed on likely causes of poverty including different forms of resource endowment and social exclusion defined by gender and ethnicity. The ordinary least squares estimates indicate that poverty is indeed influenced by ethnicity, partly through its interaction with social capital. However, poverty is most strongly affected by differences in human and social capital. Differences in the amount of livestock and high quality farmland owned also matter. Thai households are poorer than their Kinh counterparts even when endowed with the same levels of human, social, physical and natural capital considered in the study. This empirical result provides a rationale for further research on the causal relationship between ethnicity and poverty outcomes.
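The two-step procedure described above (a principal-components index, then a regression on likely causes) can be sketched as follows. This is a minimal illustration only, not the study's code: the household figures, the variable `schooling_years` and the helper names are all hypothetical, the index is taken as the first principal component of the standardized variables, and the regression uses a single explanatory variable rather than the full set considered in the study.

```python
import math

def standardize(column):
    """Return z-scores (mean 0, population standard deviation 1)."""
    n = len(column)
    mean = sum(column) / n
    sd = math.sqrt(sum((x - mean) ** 2 for x in column) / n)
    return [(x - mean) / sd for x in column]

def first_component(columns, iterations=200):
    """Loadings and scores of the first principal component.

    The leading eigenvector of the correlation matrix is found by
    power iteration, which is adequate for a small illustration.
    """
    z = [standardize(c) for c in columns]
    p, n = len(z), len(z[0])
    corr = [[sum(z[i][k] * z[j][k] for k in range(n)) / n for j in range(p)]
            for i in range(p)]
    # Slightly asymmetric start so the iteration is not stuck on a
    # non-leading eigenvector.
    v = [1.0 + 0.1 * i for i in range(p)]
    for _ in range(iterations):
        w = [sum(corr[i][j] * v[j] for j in range(p)) for i in range(p)]
        norm = math.sqrt(sum(x * x for x in w))
        v = [x / norm for x in w]
    scores = [sum(v[i] * z[i][k] for i in range(p)) for k in range(n)]
    return v, scores

def ols_slope_intercept(x, y):
    """Ordinary least squares fit of y = intercept + slope * x."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxx = sum((a - mx) ** 2 for a in x)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    slope = sxy / sxx
    return slope, my - slope * mx

# Hypothetical data for six households (not taken from the survey).
income = [120.0, 300.0, 210.0, 90.0, 400.0, 260.0]
assets = [40.0, 150.0, 100.0, 35.0, 210.0, 120.0]
schooling_years = [3.0, 9.0, 6.0, 2.0, 12.0, 8.0]

loadings, index = first_component([income, assets])
slope, intercept = ols_slope_intercept(schooling_years, index)
print("loadings:", loadings)
print("slope of index on schooling:", slope)
```

With this sign convention a higher index value means a better-off household, so a poverty index in the study's sense would be read in the opposite direction.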

Relevance:

100.00% 100.00%

Publisher:

Abstract:

We present a statistical image-based shape + structure model for Bayesian visual hull reconstruction and 3D structure inference. The 3D shape of a class of objects is represented by sets of contours from silhouette views simultaneously observed from multiple calibrated cameras. Bayesian reconstructions of new shapes are then estimated using a prior density constructed with a mixture model and probabilistic principal components analysis. We show how the use of a class-specific prior in a visual hull reconstruction can reduce the effect of segmentation errors from the silhouette extraction process. The proposed method is applied to a data set of pedestrian images, and improvements in the approximate 3D models under various noise conditions are shown. We further augment the shape model to incorporate structural features of interest; unknown structural parameters for a novel set of contours are then inferred via the Bayesian reconstruction process. Model matching and parameter inference are done entirely in the image domain and require no explicit 3D construction. Our shape model enables accurate estimation of structure despite segmentation errors or missing views in the input silhouettes, and works even with only a single input view. Using a data set of thousands of pedestrian images generated from a synthetic model, we can accurately infer the 3D locations of 19 joints on the body based on observed silhouette contours from real images.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Functional Data Analysis (FDA) deals with samples where a whole function is observed for each individual. A particular case of FDA arises when the observed functions are density functions, which are also an example of infinite-dimensional compositional data. In this work we compare several methods of dimensionality reduction for this particular type of data: functional principal components analysis (PCA), with or without a previous data transformation, and multidimensional scaling (MDS) for different inter-density distances, one of them taking into account the compositional nature of density functions. The different methods are applied to both artificial and real data (household income distributions).
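One way a distance can respect the compositional nature of densities is sketched below. The abstract does not name the transformation or the distance used, so the centred log-ratio (clr) transform and the Aitchison-type distance here are assumptions chosen for illustration, applied to densities discretized on a hypothetical grid.

```python
import math

def clr(density):
    """Centred log-ratio transform of a strictly positive discretized density."""
    logs = [math.log(v) for v in density]
    mean_log = sum(logs) / len(logs)
    return [v - mean_log for v in logs]

def aitchison_distance(f, g):
    """Euclidean distance between the clr transforms of two densities."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(clr(f), clr(g))))

def normal_pdf(x, mu, sigma):
    """Density of a normal distribution, used only to build example data."""
    z = (x - mu) / sigma
    return math.exp(-0.5 * z * z) / (sigma * math.sqrt(2.0 * math.pi))

# Hypothetical grid and densities (not the income data of the study).
grid = [i / 10 for i in range(-30, 31)]
f = [normal_pdf(x, 0.0, 1.0) for x in grid]
g = [normal_pdf(x, 0.5, 1.0) for x in grid]

print("distance between f and g:", aitchison_distance(f, g))
# Rescaling a density leaves its clr transform unchanged, so this is ~0.
print("distance between f and 2f:", aitchison_distance(f, [2.0 * v for v in f]))
```

The clr vectors can then be passed to an ordinary PCA or used as the inter-density distances in MDS; the scale invariance shown in the last line is what makes the distance compositional.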

Relevance:

100.00% 100.00%

Publisher:

Abstract:

In this paper we analyze the spread of shocks across asset markets in eight Latin American countries. First, we measure the extent of market reactions using principal components analysis. Second, we investigate the volatility of asset markets using ARCH-GARCH models as a function of the principal components retained in the first stage. Our results do not support the existence of financial contagion, but rather of interdependence in most cases, together with a slight increase in the sensitivity of markets to recent shocks.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

One possible approach to managing municipal solid waste is energy recovery, that is, incineration with recovery of energy. However, it is very important to control the incineration process properly in order to avoid, as far as possible, the release of pollutants into the atmosphere that could cause industrial pollution problems. Ensuring that both the incineration process and the flue-gas treatment are carried out under optimal conditions presupposes a good knowledge of the dependencies between the process variables. Suitable methods are needed to measure the most important variables and to process the measured values with suitable models in order to transform them into control quantities. A classical control model seems unpromising in this case because of the complexity of the processes, the lack of a quantitative description and the need to perform the calculations in real time. This can only be achieved with the help of modern data-processing techniques and computing methods, such as simulation techniques, mathematical models, knowledge-based systems and intelligent interfaces. [Ono, 1989] describes a control system based on fuzzy logic applied to the field of municipal waste incineration. At the FZK research centre in Karlsruhe, applications combining fuzzy logic with neural networks [Jaeschke, Keller, 1994] are being developed for the control of the TAMARA pilot waste incineration plant. This thesis proposes the application of a knowledge-acquisition method for the control of complex systems inspired by human behaviour. When we face an unknown situation, at first we do not know how to act, except by extrapolating from previous experiences that may be useful. By applying trial-and-error procedures, hypothesis reinforcement, etc., we gradually acquire and refine knowledge and build a mental model. An analogous method can be designed, and implemented in a computer system, using Artificial Intelligence techniques.
Thus, in a complex process we often have a set of process data that a priori does not give us information structured enough to be useful. Knowledge acquisition proceeds through a series of stages:
- We make a first selection of the variables we are interested in knowing.
- State of the system. First, we can apply classification techniques (unsupervised learning) to group the data and obtain a representation of the state of the plant. A classification can be established, but normally almost all the data fall into a single class, which corresponds to normal operation. Once this is done, and to refine the knowledge, we use classical statistical methods to look for correlations between variables (principal components analysis) and thus simplify and shorten the list of variables.
- Signal analysis. To analyse and classify the signals (for example, the furnace temperature) it is possible to use methods that describe the nonlinear behaviour of the system better, such as neural networks. A further step consists of establishing causal relationships between the variables. Analytical models help us here.
- As the final result of the process, we move on to the design of the knowledge-based system.
The main objective is to apply the method to the specific case of controlling a municipal solid waste treatment plant with energy recovery.
First, Chapter 2, Municipal solid waste, addresses the global problem of waste management, giving an overview of the different existing alternatives and of the current national and international situation. The problems of waste incineration are analysed in more detail, with special attention to those characteristics of the waste that matter most for the combustion process. Chapter 3, Description of the process, gives a general description of the incineration process and of the different elements of an incineration plant: from the reception and storage of the waste, through the different types of furnace and the requirements of codes of good combustion practice, to the combustion air system and the flue-gas system. The different flue-gas cleaning systems are also presented, and finally the ash and slag removal system. Chapter 4, The municipal solid waste treatment plant of Girona, describes the main systems of the Girona incineration plant: the waste feed, the type of furnace, the energy recovery system and the flue-gas cleaning system. It also describes the control system, the operation, the plant's operating data, the instrumentation and the variables of interest for controlling the combustion process. Chapter 5, Techniques used, provides an overview of knowledge-based systems and expert systems. The different techniques used are explained: neural networks, classification systems, qualitative models and expert systems, illustrated with some application examples. With regard to knowledge-based systems, the conditions for their applicability and the forms of knowledge representation are analysed first. Next, the different forms of reasoning are described (neural networks, expert systems and fuzzy logic) and compared with one another. An application of neural networks to the analysis of temperature time series is presented. The analysis of operating data by means of statistical techniques and classification techniques is also addressed. Another section is devoted to the different types of models, including a discussion of qualitative models. The computer-aided design system for supervisory systems, CASSD, used in this thesis is described, together with the analysis tools for obtaining qualitative information on the behaviour of the process: Abstractores and ALCMEN. An example of applying these techniques to find the relationships between temperature and operator actions is included. Finally, the main characteristics of expert systems in general are analysed, along with those of the expert system CEES 2.0, which is also part of the CASSD system used.
Chapter 6, Results, shows the results obtained by applying the different techniques: neural networks, classification, the development of the model of the combustion process, and rule generation. Within the data-analysis section, a neural network is used to classify a temperature signal. The use of the LINNEO+ method to classify the operating states of the plant is also described. In the section devoted to modelling, a combustion model is developed that serves as a basis for analysing the behaviour of the furnace in steady-state and dynamic regimes. A parameter is defined, the flame surface, related to the extent of the fire on the grate. The dynamic response of the incineration process is analysed using a linearized model. The next step is the definition of qualitative relationships between the variables, which are used in building a qualitative model. A new qualitative model is then developed, taking the analytical dynamic model as a basis. Finally, the development of the knowledge base of the expert system is addressed through rule generation.
Chapter 7, Control system of an incineration plant, analyses the objectives of a control system for an incineration plant, its design and its implementation. The basic objectives of the combustion control system, its configuration and its implementation in Matlab/Simulink are described, using the different tools developed in the previous chapter. Lastly, to show how the different methods developed in this thesis can be applied, an expert system is built to keep the furnace temperature constant by acting on the waste feed. Finally, the chapter Conclusions presents the conclusions and results of this thesis.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Diffuse reflectance spectroscopy (DRS) is increasingly being used to predict numerous soil physical, chemical and biochemical properties. However, soil properties and processes vary at different scales and, as a result, relationships between soil properties often depend on scale. In this paper we report on how the relationship between one such property, cation exchange capacity (CEC), and the DRS of the soil depends on spatial scale. We show this by means of a nested analysis of covariance of soils sampled on a balanced nested design in a 16 km × 16 km area in eastern England. We used principal components analysis on the DRS to obtain a reduced number of variables while retaining key variation. The first principal component accounted for 99.8% of the total variance, the second for 0.14%. Nested analysis of the variation in the CEC and the two principal components showed that the substantial variance components are at the > 2000 m scale. This is probably the result of differences in soil composition due to parent material. We then developed a model to predict CEC from the DRS and used partial least squares (PLS) regression to do so. Leave-one-out cross-validation results suggested a reasonable predictive capability (R² = 0.71 and RMSE = 0.048 molc kg⁻¹). However, the results from the independent validation were not as good, with R² = 0.27, RMSE = 0.056 molc kg⁻¹ and an overall correlation of 0.52. This would indicate that DRS may not be useful for predictions of CEC. When we applied the analysis of covariance between predicted and observed values we found significant scale-dependent correlations at scales of 50 and 500 m (0.82 and 0.73 respectively). DRS measurements can therefore be useful to predict CEC if predictions are required, for example, at the field scale (50 m). This study illustrates that the relationship between DRS and soil properties is scale-dependent and that this scale dependency has important consequences for the prediction of soil properties from DRS data.
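The leave-one-out statistics quoted above can be illustrated with a small sketch. It is not the authors' procedure: a one-predictor least-squares line stands in for PLS regression, the `score` and `cec` values are hypothetical, and R² is computed here as 1 − SS_res / SS_tot, which may differ from the definition used in the paper.

```python
import math

def fit_line(x, y):
    """Least-squares slope and intercept for y = intercept + slope * x."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxx = sum((a - mx) ** 2 for a in x)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    slope = sxy / sxx
    return slope, my - slope * mx

def loo_predictions(x, y):
    """Predict each observation from a model fitted to all the others."""
    predictions = []
    for i in range(len(x)):
        x_train = x[:i] + x[i + 1:]
        y_train = y[:i] + y[i + 1:]
        slope, intercept = fit_line(x_train, y_train)
        predictions.append(intercept + slope * x[i])
    return predictions

def r_squared(observed, predicted):
    """Coefficient of determination, 1 - SS_res / SS_tot."""
    mean = sum(observed) / len(observed)
    ss_res = sum((o - p) ** 2 for o, p in zip(observed, predicted))
    ss_tot = sum((o - mean) ** 2 for o in observed)
    return 1.0 - ss_res / ss_tot

def rmse(observed, predicted):
    """Root mean squared error of prediction."""
    n = len(observed)
    return math.sqrt(sum((o - p) ** 2 for o, p in zip(observed, predicted)) / n)

# Hypothetical predictor (e.g. a spectral component score) and CEC values.
score = [0.1, 0.4, 0.5, 0.9, 1.2, 1.5]
cec = [0.10, 0.14, 0.13, 0.19, 0.21, 0.26]

predicted = loo_predictions(score, cec)
print("LOO R2:", r_squared(cec, predicted))
print("LOO RMSE:", rmse(cec, predicted))
```

Because every sample is predicted by a model that never saw it, these figures are usually less optimistic than a fit on all data, though, as the abstract notes, an independent validation set can still give weaker results.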

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Pressing global environmental problems highlight the need to develop tools to measure progress towards "sustainability." However, some argue that any such attempt inevitably reflects the views of those creating such tools and only produces highly contested notions of "reality." To explore this tension, we critically assess the Environmental Sustainability Index (ESI), a well-publicized product of the World Economic Forum that is designed to measure 'sustainability' by ranking nations on league tables based on extensive databases of environmental indicators. By recreating this index, and then using statistical tools (principal components analysis) to test relations between various components of the index, we challenge ways in which countries are ranked in the ESI. Based on this analysis, we suggest (1) that the approach taken to aggregate, interpret and present the ESI creates a misleading impression that Western countries are more sustainable than the developing world; (2) that unaccounted methodological biases allowed the authors of the ESI to over-generalize the relative 'sustainability' of different countries; and (3) that this has resulted in simplistic conclusions on the relation between economic growth and environmental sustainability. This criticism should not be interpreted as a call for the abandonment of efforts to create standardized comparable data. Instead, this paper proposes that indicator selection and data collection should draw on a range of voices, including local stakeholders as well as international experts. We also propose that aggregating data into final league ranking tables is too prone to error and creates the illusion of absolute and categorical interpretations. (c) 2004 Elsevier Ltd. All rights reserved.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

The effects of irrigation and nitrogen (N) fertilizer on Hagberg falling number (HFN), specific weight (SW) and blackpoint (BP) of winter wheat (Triticum aestivum L.) were investigated. Mains water (+50 and +100 mm month⁻¹, containing 44 mg NO₃⁻ litre⁻¹ and 28 mg SO₄²⁻ litre⁻¹) was applied with trickle irrigation during winter (17 January-17 March), spring (21 March-20 May) or summer (24 May-23 July). In 1999/2000 these treatments were factorially combined with three N levels (0, 200, 400 kg N ha⁻¹), applied to cv Hereward. In 2000/01 the 400 kg N ha⁻¹ treatment was replaced with cv Malacca given 200 kg N ha⁻¹. Irrigation increased grain yield, mostly by increasing grain numbers when applied in winter and spring, and by increasing mean grain weight when applied in summer. Nitrogen increased grain numbers and SW, and reduced BP in both years. Nitrogen increased HFN in 1999/2000 and reduced HFN in 2000/01. Effects of irrigation on HFN, SW and BP were smaller and inconsistent over year and nitrogen level. Irrigation interacted with N on mean grain weight: negatively for winter and spring irrigation, and positively for summer irrigation. Ten variables derived from digital image analysis of harvested grain were included with mean grain weight in a principal components analysis. The first principal component ('size') was negatively related to HFN (in two years) and BP (one year), and positively related to SW (two years). Treatment effects on dimensions of harvested grain could not explain all of the effects on HFN, BP and SW but the results were consistent with the hypothesis that water and nutrient availability, even when they were affected early in the season, could influence final grain quality if they influenced grain numbers and size. (C) 2004 Society of Chemical Industry

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Concentrations of large numbers of endemic species have been singled out in prioritization exercises as significant areas for global biodiversity conservation. This paper describes bird and mammal endemicity in Indo-Pacific ecoregions. An ecoregion is a relatively large unit of land or water that contains a distinct assemblage of natural communities. We prioritize 133 ecoregions according to their levels of endemicity, and explain how variables such as biome type, whether the ecoregion is on an island or continental mass, montane or non-montane, correlate with the proportion of the total species assemblage that are endemic. Following an exploratory principal components analysis we classify all ecoregions according to the relationship between numbers of endemics and overall species richness. Endemicity is negatively correlated with species richness. We show that plotting the logit transformation of the endemicity of birds and mammals against log of species richness is a more effective and useful way of identifying important ecoregions than simply ordering ecoregions by the proportion of endemic species, or any other single measure. The plot, divided into 16 regions corresponding to the quartiles of the two variables, was used to identify ecoregions of high conservation value. These are the ecoregions with the highest endemicity and lowest species richness. Further analysis shows that island and montane ecoregions, regardless of their biome type, are by far the most important for endemic species.
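The classification described above (logit of endemicity plotted against log species richness, with the plot cut into 16 cells at the quartiles) might be sketched as follows. The ecoregion counts are hypothetical, the function names are illustrative, and the quartile cut points come from Python's `statistics.quantiles` with its default method, which is an assumption rather than the authors' choice.

```python
import math
import statistics

def logit(p):
    """Log-odds of a proportion strictly between 0 and 1."""
    return math.log(p / (1.0 - p))

def quartile_band(value, cuts):
    """Return 0..3 according to how many quartile cut points lie below value."""
    return sum(value > c for c in cuts)

def classify(endemics, richness):
    """Assign each ecoregion to one of the 4 x 4 cells of the plot.

    The first element of each pair is the richness band, the second the
    endemicity band. Endemic counts must be above 0 and below richness
    for the logit to be defined.
    """
    x = [math.log(r) for r in richness]
    y = [logit(e / r) for e, r in zip(endemics, richness)]
    x_cuts = statistics.quantiles(x, n=4)
    y_cuts = statistics.quantiles(y, n=4)
    return [(quartile_band(a, x_cuts), quartile_band(b, y_cuts))
            for a, b in zip(x, y)]

# Hypothetical counts for eight ecoregions (not data from the paper).
richness = [50, 120, 200, 310, 420, 560, 700, 900]
endemics = [25, 30, 20, 31, 21, 28, 14, 9]

cells = classify(endemics, richness)
for r, e, cell in zip(richness, endemics, cells):
    print(r, e, cell)
```

In this layout the cell (0, 3), lowest richness band and highest endemicity band, corresponds to the combination the abstract singles out as having high conservation value.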

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Background and Aims: The aims of this investigation were to highlight the qualitative and quantitative diversity apparent between nine diploid Fragaria species and produce interspecific populations segregating for a large number of morphological characters suitable for quantitative trait loci analysis. Methods: A qualitative comparison of eight described diploid Fragaria species was performed and measurements were taken of 23 morphological traits from 19 accessions including eight described species and one previously undescribed species. A principal components analysis was performed on 14 mathematically unrelated traits from these accessions, which partitioned the species accessions into distinct morphological groups. Interspecific crosses were performed with accessions of species that displayed significant quantitative divergence and, from these, populations that should segregate for a range of quantitative traits were raised. Key Results: Significant differences between species were observed for all 23 morphological traits quantified and three distinct groups of species accessions were observed after the principal components analysis. Interspecific crosses were performed between these groups, and F2 and backcross populations were raised that should segregate for a range of morphological characters. In addition, the study highlighted a number of distinctive morphological characters in many of the species studied. Conclusions: Diploid Fragaria species are morphologically diverse, yet remain highly interfertile, making the group an ideal model for the study of the genetic basis of phenotypic differences between species through map-based investigation using quantitative trait loci. The segregating interspecific populations raised will be ideal for such investigations and could also provide insights into the nature and extent of genome evolution within this group.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

The pig is a single-stomached omnivorous mammal and is an important model of human disease and nutrition. As such, it is necessary to establish a metabolic framework from which pathology-based variation can be compared. Here, a combination of one- and two-dimensional ¹H and ¹³C nuclear magnetic resonance spectroscopy (NMR) and high-resolution magic angle spinning (HR-MAS) NMR was used to provide a systems overview of porcine metabolism via characterisation of the urine, serum, liver and kidney metabolomes. The metabolites observed in each of these biological compartments were found to be qualitatively comparable to the metabolic signature of the same biological matrices in humans and rodents. The data were modelled using a combination of principal components analysis and Venn diagram mapping. Urine represented the most metabolically distinct biological compartment studied, with a relatively greater number of NMR-detectable metabolites present, many of which are implicated in gut-microbial co-metabolic processes. The major inter-species differences observed were in the phase II conjugation of extra-genomic metabolites; the pig was observed to conjugate p-cresol, a gut microbial metabolite of tyrosine, with glucuronide rather than sulfate as seen in man. These observations are important to note when considering the translatability of experimental data derived from porcine models.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

With no universal approach for measuring brand performance, we show how a consumer-based brand measure was developed for corporate financial services brands. Churchill's paradigm was adopted. A literature review and 20 depth interviews with experts suggested that brand loyalty, consumer satisfaction and reputation constitute the brand performance measure. Ten financial services organisations provided access to their consumers. Following a postal survey, 600 questionnaires were analysed through principal components analysis to identify the consumer-based measure. Further testing revealed this to be a valid and reliable brand performance measure.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Polycyclic aromatic hydrocarbons (PAHs) are ubiquitous environmental pollutants that frequently accumulate in soils. There is therefore a requirement to determine their levels in contaminated environments for the purposes of determining impacts on human health. PAHs are a suite of individual chemicals, and there is an ongoing debate as to the most appropriate method for assessing the risk to humans from them. Two methods predominate: the surrogate marker approach and the toxic equivalency factor. The former assumes that all chemicals in a mixture have an equivalent toxicity. The toxic equivalency approach estimates the potency of individual chemicals relative to benzo(a)pyrene (BaP), usually the most toxic. The surrogate marker approach is believed to overestimate risk and the toxic equivalency factor to underestimate risk. When analysing the risks from soils, the surrogate marker approach is preferred due to its simplicity, but there are concerns because of the potential diversity of the PAH profile across the range of impacted soils. Using two independent data sets containing soils from 274 sites across a diverse range of locations, statistical analysis was undertaken to determine the differences in the composition of carcinogenic PAH between site locations, for example, rural versus industrial. Following principal components analysis, distinct population differences were not seen between site locations in spite of large differences in the total PAH burden between individual sites. Using all data, highly significant correlations were seen between BaP and other carcinogenic PAH, with the majority of r² values > 0.8. Correlations with the European Food Standards Agency (EFSA) summed groups, that is, EFSA2, EFSA4 and EFSA8, were even higher (r² > 0.95). We therefore conclude that BaP is a suitable surrogate marker to represent mixtures of PAH in soil during risk assessments.
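The toxic equivalency idea mentioned above, weighting each compound by its potency relative to BaP and summing, can be written as a short sketch. The concentrations and the factor values below are placeholders invented for illustration; they are not taken from the paper or from any regulatory table, and the function name is hypothetical.

```python
def bap_equivalent(concentrations, factors):
    """Sum of concentration times toxic equivalency factor over all compounds.

    Both arguments are dictionaries keyed by compound name; the result is
    in the same units as the concentrations (here mg per kg of soil).
    """
    return sum(concentrations[name] * factors[name] for name in concentrations)

# Placeholder soil concentrations in mg/kg (hypothetical).
concentrations = {
    "benzo(a)pyrene": 2.0,
    "benz(a)anthracene": 3.0,
    "chrysene": 4.0,
}

# Placeholder potency factors relative to benzo(a)pyrene (hypothetical).
factors = {
    "benzo(a)pyrene": 1.0,
    "benz(a)anthracene": 0.1,
    "chrysene": 0.01,
}

total = bap_equivalent(concentrations, factors)
print("BaP-equivalent concentration (mg/kg):", total)
```

The surrogate marker approach, by contrast, tracks the measured BaP concentration alone as a stand-in for the whole mixture, which is why the strong correlations between BaP and the other carcinogenic PAHs reported above matter for its validity.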