33 resultados para Application of Data-driven Modelling in Water Sciences
Resumo:
Knowledge of the spatial distribution of hydraulic conductivity (K) within an aquifer is critical for reliable predictions of solute transport and the development of effective groundwater management and/or remediation strategies. While core analyses and hydraulic logging can provide highly detailed information, such information is inherently localized around boreholes that tend to be sparsely distributed throughout the aquifer volume. Conversely, larger-scale hydraulic experiments like pumping and tracer tests provide relatively low-resolution estimates of K in the investigated subsurface region. As a result, traditional hydrogeological measurement techniques contain a gap in terms of spatial resolution and coverage, and they are often alone inadequate for characterizing heterogeneous aquifers. Geophysical methods have the potential to bridge this gap. The recent increased interest in the application of geophysical methods to hydrogeological problems is clearly evidenced by the formation and rapid growth of the domain of hydrogeophysics over the past decade (e.g., Rubin and Hubbard, 2005).
Resumo:
Failure to detect a species in an area where it is present is a major source of error in biological surveys. We assessed whether it is possible to optimize single-visit biological monitoring surveys of highly dynamic freshwater ecosystems by framing them a priori within a particular period of time. Alternatively, we also searched for the optimal number of visits and when they should be conducted. We developed single-species occupancy models to estimate the monthly probability of detection of pond-breeding amphibians during a four-year monitoring program. Our results revealed that detection probability was species-specific and changed among sampling visits within a breeding season and also among breeding seasons. Thereby, the optimization of biological surveys with minimal survey effort (a single visit) is not feasible as it proves impossible to select a priori an adequate sampling period that remains robust across years. Alternatively, a two-survey combination at the beginning of the sampling season yielded optimal results and constituted an acceptable compromise between sampling efficacy and survey effort. Our study provides evidence of the variability and uncertainty that likely affects the efficacy of monitoring surveys, highlighting the need of repeated sampling in both ecological studies and conservation management.
Resumo:
Understanding the drivers of population divergence, speciation and species persistence is of great interest to molecular ecology, especially for species-rich radiations inhabiting the world's biodiversity hotspots. The toolbox of population genomics holds great promise for addressing these key issues, especially if genomic data are analysed within a spatially and ecologically explicit context. We have studied the earliest stages of the divergence continuum in the Restionaceae, a species-rich and ecologically important plant family of the Cape Floristic Region (CFR) of South Africa, using the widespread CFR endemic Restio capensis (L.) H.P. Linder & C.R. Hardy as an example. We studied diverging populations of this morphotaxon for plastid DNA sequences and >14 400 nuclear DNA polymorphisms from Restriction site Associated DNA (RAD) sequencing and analysed the results jointly with spatial, climatic and phytogeographic data, using a Bayesian generalized linear mixed modelling (GLMM) approach. The results indicate that population divergence across the extreme environmental mosaic of the CFR is mostly driven by isolation by environment (IBE) rather than isolation by distance (IBD) for both neutral and non-neutral markers, consistent with genome hitchhiking or coupling effects during early stages of divergence. Mixed modelling of plastid DNA and single divergent outlier loci from a Bayesian genome scan confirmed the predominant role of climate and pointed to additional drivers of divergence, such as drift and ecological agents of selection captured by phytogeographic zones. Our study demonstrates the usefulness of population genomics for disentangling the effects of IBD and IBE along the divergence continuum often found in species radiations across heterogeneous ecological landscapes.
Resumo:
OBJECTIVE: To compare the pharmacokinetic and pharmacodynamic characteristics of angiotensin II receptor antagonists as a therapeutic class. DESIGN: Population pharmacokinetic-pharmacodynamic modelling study. METHODS: The data of 14 phase I studies with 10 different drugs were analysed. A common population pharmacokinetic model (two compartments, mixed zero- and first-order absorption, two metabolite compartments) was applied to the 2685 drug and 900 metabolite concentration measurements. A standard nonlinear mixed effect modelling approach was used to estimate the drug-specific parameters and their variabilities. Similarly, a pharmacodynamic model was applied to the 7360 effect measurements, i.e. the decrease of peak blood pressure response to intravenous angiotensin challenge recorded by finger photoplethysmography. The concentration of drug and metabolite in an effect compartment was assumed to translate into receptor blockade [maximum effect (Emax) model with first-order link]. RESULTS: A general pharmacokinetic-pharmacodynamic (PK-PD) model for angiotensin antagonism in healthy individuals was successfully built up for the 10 drugs studied. Representatives of this class share different pharmacokinetic and pharmacodynamic profiles. Their effects on blood pressure are dose-dependent, but the time course of the effect varies between the drugs. CONCLUSIONS: The characterisation of PK-PD relationships for these drugs gives the opportunity to optimise therapeutic regimens and to suggest dosage adjustments in specific conditions. Such a model can be used to further refine the use of this class of drugs.
Resumo:
Self-potentials (SP) are sensitive to water fluxes and concentration gradients in both saturated and unsaturated geological media, but quantitative interpretations of SP field data may often be hindered by the superposition of different source contributions and time-varying electrode potentials. Self-potential mapping and close to two months of SP monitoring on a gravel bar were performed to investigate the origins of SP signals at a restored river section of the Thur River in northeastern Switzerland. The SP mapping and subsequent inversion of the data indicate that the SP sources are mainly located in the upper few meters in regions of soil cover rather than bare gravel. Wavelet analyses of the time-series indicate a strong, but non-linear influence of water table and water content variations, as well as rainfall intensity on the recorded SP signals. Modeling of the SP response with respect to an increase in the water table elevation and precipitation indicate that the distribution of soil properties in the vadose zone has a very strong influence. We conclude that the observed SP responses on the gravel bar are more complicated than previously proposed semi-empiric relationships between SP signals and hydraulic head or the thickness of the vadose zone. We suggest that future SP monitoring in restored river corridors should either focus on quantifying vadose zone processes by installing vertical profiles of closely spaced SP electrodes or by installing the electrodes within the river to avoid signals arising from vadose zone processes and time-varying electrochemical conditions in the vicinity of the electrodes.
Resumo:
An equation is applied for calculating the expected persistence time of an unstructured population of the white-toothed shrew Crocidura russula from Preverenges, a suburban area in western Switzerland. Population abundance data from March and November between 1977 and 1988 were fit to the logistic density dependence model to estimate mean population growth rate as a function of population density. The variance in mean growth rate was approximated with two different models. The largest estimated persistence time was less than a few decades, the smallest less than 10 years. The results are sensitive to the magnitude of variance in population growth rate. Deviations from the logistic density dependence model in November are quite well explained by weather variables but those in March are uncorrelated with weather variables. Variability in population growth rates measured in winter months may be better explained by behavioural mechanisms. Environmental variability, dispersal of juveniles and refugia within the range of the population may contribute to its long-term survival.
Resumo:
Mitochondria have a fundamental role in the transduction of energy from food into ATP. The coupling between food oxidation and ATP production is never perfect, but may nevertheless be of evolutionary significance. The 'uncoupling to survive' hypothesis suggests that 'mild' mitochondrial uncoupling evolved as a protective mechanism against the excessive production of damaging reactive oxygen species (ROS). Because resource allocation and ROS production are thought to shape animal life histories, alternative life-history trajectories might be driven by individual variation in the degree of mitochondrial uncoupling. We tested this hypothesis in a small bird species, the zebra finch (Taeniopygia guttata), by treating adults with the artificial mitochondrial uncoupler 2,4-dinitrophenol (DNP) over a 32-month period. In agreement with our expectations, the uncoupling treatment increased metabolic rate. However, we found no evidence that treated birds enjoyed lower oxidative stress levels or greater survival rates, in contrast to previous results in other taxa. In vitro experiments revealed lower sensitivity of ROS production to DNP in mitochondria isolated from skeletal muscles of zebra finch than mouse. In addition, we found significant reductions in the number of eggs laid and in the inflammatory immune response in treated birds. Altogether, our data suggest that the 'uncoupling to survive' hypothesis may not be applicable for zebra finches, presumably because of lower effects of mitochondrial uncoupling on mitochondrial ROS production in birds than in mammals. Nevertheless, mitochondrial uncoupling appeared to be a potential life-history regulator of traits such as fecundity and immunity at adulthood, even with food supplied ad libitum.
Resumo:
The eclogite facies assemblage K-feldspar-jadeite-quartz in metagranites and metapelites from the Sesia-Lanzo Zone (Western Alps, Italy) records the equilibration pressure by dilution of the reaction jadeite + quartz = albite. The metapelites show partial transformation from a pre-Alpine assemblage of garnet (Alm(63)Prp(26)Grs(10))-K-feldspar-plagioclase-biotite +/- sillimanite to the Eo-Alpine high-pressure assemblage garnet (Alm(50)Prp(14)Grs(35))-jadeite (Jd(80-97)Di(0-4)Hd(0-8)Acm(0-7))=zoisite-phengite. Plagioclase is replaced by jadeite-zoisite-kyanite-K-feldspar-quartz and biotite is replaced by garnet-phengite or omphacite-kyanite-phengite. Equilibrium was attained only in local domains in the metapelites and therefore the K-feldspar-jadeite-quartz (KJQ) barometer was applied only to the plagioclase pseudomorphs and K-feldspar domains. The albite content of K-feldspar ranges from 4 to 11 mol% in less equilibrated assemblages from Val Savenca and from 4 to 7 mol% in the partially equilibrated samples from Monte Mucrone and the equilibrated samples from Montestrutto and Tavagnasco. Thermodynamic calculations on the stability of the assemblage K-feldspar-jadeite-quartz using available mixing data for K-feldspar and pyroxene indicate pressures of 15-21 kbar (+/- 1.6-1.9 kbar) at 550 +/- 50 degrees C. This barometer yields direct pressure estimates in high-pressure rocks where pressures are seldom otherwise fixed, although it is sensitive to analytical precision and the choice of thermodynamic mixing model for K-feldspar. Moreover, the KJQ barometer is independent of the ratio P-H2O/P-T. The inferred limiting a(H2O) for the assemblage jadeite-kyanite in the metapelites from Val Savenca is low and varies from 0.2 to 0.6.
Resumo:
To study different temporal components on cancer mortality (age, period and cohort) methods of graphic representation were applied to Swiss mortality data from 1950 to 1984. Maps using continuous slopes ("contour maps") and based on eight tones of grey according to the absolute distribution of rates were used to represent the surfaces defined by the matrix of various age-specific rates. Further, progressively more complex regression surface equations were defined, on the basis of two independent variables (age/cohort) and a dependent one (each age-specific mortality rate). General patterns of trends in cancer mortality were thus identified, permitting definition of important cohort (e.g., upwards for lung and other tobacco-related neoplasms, or downwards for stomach) or period (e.g., downwards for intestines or thyroid cancers) effects, besides the major underlying age component. For most cancer sites, even the lower order (1st to 3rd) models utilised provided excellent fitting, allowing immediate identification of the residuals (e.g., high or low mortality points) as well as estimates of first-order interactions between the three factors, although the parameters of the main effects remained still undetermined. Thus, the method should be essentially used as summary guide to illustrate and understand the general patterns of age, period and cohort effects in (cancer) mortality, although they cannot conceptually solve the inherent problem of identifiability of the three components.
Resumo:
Abstract : This work is concerned with the development and application of novel unsupervised learning methods, having in mind two target applications: the analysis of forensic case data and the classification of remote sensing images. First, a method based on a symbolic optimization of the inter-sample distance measure is proposed to improve the flexibility of spectral clustering algorithms, and applied to the problem of forensic case data. This distance is optimized using a loss function related to the preservation of neighborhood structure between the input space and the space of principal components, and solutions are found using genetic programming. Results are compared to a variety of state-of--the-art clustering algorithms. Subsequently, a new large-scale clustering method based on a joint optimization of feature extraction and classification is proposed and applied to various databases, including two hyperspectral remote sensing images. The algorithm makes uses of a functional model (e.g., a neural network) for clustering which is trained by stochastic gradient descent. Results indicate that such a technique can easily scale to huge databases, can avoid the so-called out-of-sample problem, and can compete with or even outperform existing clustering algorithms on both artificial data and real remote sensing images. This is verified on small databases as well as very large problems. Résumé : Ce travail de recherche porte sur le développement et l'application de méthodes d'apprentissage dites non supervisées. Les applications visées par ces méthodes sont l'analyse de données forensiques et la classification d'images hyperspectrales en télédétection. Dans un premier temps, une méthodologie de classification non supervisée fondée sur l'optimisation symbolique d'une mesure de distance inter-échantillons est proposée. Cette mesure est obtenue en optimisant une fonction de coût reliée à la préservation de la structure de voisinage d'un point entre l'espace des variables initiales et l'espace des composantes principales. Cette méthode est appliquée à l'analyse de données forensiques et comparée à un éventail de méthodes déjà existantes. En second lieu, une méthode fondée sur une optimisation conjointe des tâches de sélection de variables et de classification est implémentée dans un réseau de neurones et appliquée à diverses bases de données, dont deux images hyperspectrales. Le réseau de neurones est entraîné à l'aide d'un algorithme de gradient stochastique, ce qui rend cette technique applicable à des images de très haute résolution. Les résultats de l'application de cette dernière montrent que l'utilisation d'une telle technique permet de classifier de très grandes bases de données sans difficulté et donne des résultats avantageusement comparables aux méthodes existantes.
Resumo:
The paper presents the Multiple Kernel Learning (MKL) approach as a modelling and data exploratory tool and applies it to the problem of wind speed mapping. Support Vector Regression (SVR) is used to predict spatial variations of the mean wind speed from terrain features (slopes, terrain curvature, directional derivatives) generated at different spatial scales. Multiple Kernel Learning is applied to learn kernels for individual features and thematic feature subsets, both in the context of feature selection and optimal parameters determination. An empirical study on real-life data confirms the usefulness of MKL as a tool that enhances the interpretability of data-driven models.
Resumo:
BACKGROUND: The annotation of protein post-translational modifications (PTMs) is an important task of UniProtKB curators and, with continuing improvements in experimental methodology, an ever greater number of articles are being published on this topic. To help curators cope with this growing body of information we have developed a system which extracts information from the scientific literature for the most frequently annotated PTMs in UniProtKB. RESULTS: The procedure uses a pattern-matching and rule-based approach to extract sentences with information on the type and site of modification. A ranked list of protein candidates for the modification is also provided. For PTM extraction, precision varies from 57% to 94%, and recall from 75% to 95%, according to the type of modification. The procedure was used to track new publications on PTMs and to recover potential supporting evidence for phosphorylation sites annotated based on the results of large scale proteomics experiments. CONCLUSIONS: The information retrieval and extraction method we have developed in this study forms the basis of a simple tool for the manual curation of protein post-translational modifications in UniProtKB/Swiss-Prot. Our work demonstrates that even simple text-mining tools can be effectively adapted for database curation tasks, providing that a thorough understanding of the working process and requirements are first obtained. This system can be accessed at http://eagl.unige.ch/PTM/.
Resumo:
Several definitions of paediatric abdominal obesity have been proposed but it is unclear whether they lead to similar results. We assessed the prevalence of abdominal obesity using five different waist circumference-based definitions and their agreement with total body fat (TBF) and abdominal fat (AF). Data from 190 girls and 162 boys (Ballabeina), and from 134 girls and 113 boys (Kinder-Sportstudie, KISS) aged 5-11 years were used. TBF was assessed by bioimpedance (Ballabeina) or dual energy X-ray absorption (KISS). On the basis of the definition used, the prevalence of abdominal obesity varied between 3.1 and 49.4% in boys, and 4.7 and 55.5% in girls (Ballabeina), and between 1.8 and 36.3% in boys and 4.5 and 37.3% in girls (KISS). Among children considered as abdominally obese by at least one definition, 32.0 (Ballabeina) and 44.7% (KISS) were considered as such by at least two (out of five possible) definitions. Using excess TBF or AF as reference, the areas under the receiver operating curve varied between 0.577 and 0.762 (Ballabeina), and 0.583 and 0.818 (KISS). We conclude that current definitions of abdominal obesity in children lead to wide prevalence estimates and should not be used until a standard definition can be proposed.
Resumo:
Automatic environmental monitoring networks enforced by wireless communication technologies provide large and ever increasing volumes of data nowadays. The use of this information in natural hazard research is an important issue. Particularly useful for risk assessment and decision making are the spatial maps of hazard-related parameters produced from point observations and available auxiliary information. The purpose of this article is to present and explore the appropriate tools to process large amounts of available data and produce predictions at fine spatial scales. These are the algorithms of machine learning, which are aimed at non-parametric robust modelling of non-linear dependencies from empirical data. The computational efficiency of the data-driven methods allows producing the prediction maps in real time which makes them superior to physical models for the operational use in risk assessment and mitigation. Particularly, this situation encounters in spatial prediction of climatic variables (topo-climatic mapping). In complex topographies of the mountainous regions, the meteorological processes are highly influenced by the relief. The article shows how these relations, possibly regionalized and non-linear, can be modelled from data using the information from digital elevation models. The particular illustration of the developed methodology concerns the mapping of temperatures (including the situations of Föhn and temperature inversion) given the measurements taken from the Swiss meteorological monitoring network. The range of the methods used in the study includes data-driven feature selection, support vector algorithms and artificial neural networks.
Resumo:
PURPOSE: The aim of this study was to develop models based on kernel regression and probability estimation in order to predict and map IRC in Switzerland by taking into account all of the following: architectural factors, spatial relationships between the measurements, as well as geological information. METHODS: We looked at about 240,000 IRC measurements carried out in about 150,000 houses. As predictor variables we included: building type, foundation type, year of construction, detector type, geographical coordinates, altitude, temperature and lithology into the kernel estimation models. We developed predictive maps as well as a map of the local probability to exceed 300 Bq/m(3). Additionally, we developed a map of a confidence index in order to estimate the reliability of the probability map. RESULTS: Our models were able to explain 28% of the variations of IRC data. All variables added information to the model. The model estimation revealed a bandwidth for each variable, making it possible to characterize the influence of each variable on the IRC estimation. Furthermore, we assessed the mapping characteristics of kernel estimation overall as well as by municipality. Overall, our model reproduces spatial IRC patterns which were already obtained earlier. On the municipal level, we could show that our model accounts well for IRC trends within municipal boundaries. Finally, we found that different building characteristics result in different IRC maps. Maps corresponding to detached houses with concrete foundations indicate systematically smaller IRC than maps corresponding to farms with earth foundation. CONCLUSIONS: IRC mapping based on kernel estimation is a powerful tool to predict and analyze IRC on a large-scale as well as on a local level. This approach enables to develop tailor-made maps for different architectural elements and measurement conditions and to account at the same time for geological information and spatial relations between IRC measurements.