964 resultados para Predictive modelling
Resumo:
In a seminal data mining article, Leo Breiman [1] argued that to develop effective predictive classification and regression models, we need to move away from the sole dependency on statistical algorithms and embrace a wider toolkit of modeling algorithms that include data mining procedures. Nevertheless, many researchers still rely solely on statistical procedures when undertaking data modeling tasks; the sole reliance on these procedures has lead to the development of irrelevant theory and questionable research conclusions ([1], p.199). We will outline initiatives that the HPC & Research Support group is undertaking to engage researchers with data mining tools and techniques; including a new range of seminars, workshops, and one-on-one consultations covering data mining algorithms, the relationship between data mining and the research cycle, and limitations and problems with these new algorithms. Organisational limitations and restrictions to these initiatives are also discussed.
Resumo:
Abstract Background The quantum increases in home Internet access and available online health information with limited control over information quality highlight the necessity of exploring decision making processes in accessing and using online information, specifically in relation to children who do not make their health decisions. Objectives To understand the processes explaining parents’ decisions to use online health information for child health care. Methods Parents (N = 391) completed an initial questionnaire assessing the theory of planned behaviour constructs of attitude, subjective norm, and perceived behavioural control, as well as perceived risk, group norm, and additional demographic factors. Two months later, 187 parents completed a follow-up questionnaire assessing their decisions to use online information for their child’s health care, specifically to 1) diagnose and/or treat their child’s suspected medical condition/illness and 2) increase understanding about a diagnosis or treatment recommended by a health professional. Results Hierarchical multiple regression showed that, for both behaviours, attitude, subjective norm, perceived behavioural control, (less) perceived risk, group norm, and (non) medical background were the significant predictors of intention. For parents’ use of online child health information, for both behaviours, intention was the sole significant predictor of behaviour. The findings explain 77% of the variance in parents’ intention to treat/diagnose a child health problem and 74% of the variance in their intentions to increase their understanding about child health concerns. Conclusions Understanding parents’ socio-cognitive processes that guide their use of online information for child health care is important given the increase in Internet usage and the sometimes-questionable quality of health information provided online. Findings highlight parents’ thirst for information; there is an urgent need for health professionals to provide parents with evidence-based child health websites in addition to general population education on how to evaluate the quality of online health information.
Resumo:
Extensive resources are allocated to managing vertebrate pests, yet spatial understanding of pest threats, and how they respond to management, is limited at the regional scale where much decision-making is undertaken. We provide regional-scale spatial models and management guidance for European rabbits (Oryctolagus cuniculus) in a 260,791 km(2) region in Australia by determining habitat suitability, habitat susceptibility and the effects of the primary rabbit management options (barrier fence, shooting and baiting and warren ripping) or changing predation or disease control levels. A participatory modelling approach was used to develop a Bayesian network which captured the main drivers of suitability and spread, which in turn was linked spatially to develop high resolution risk maps. Policy-makers, rabbit managers and technical experts were responsible for defining the questions the model needed to address, and for subsequently developing and parameterising the model. Habitat suitability was determined by conditions required for warren-building and by above-ground requirements, such as food and harbour, and habitat susceptibility by the distance from current distributions, habitat suitability, and the costs of traversing habitats of different quality. At least one-third of the region had a high probability of being highly suitable (support high rabbit densities), with the model supported by validation. Habitat susceptibility was largely restricted by the current known rabbit distribution. Warren ripping was the most effective control option as warrens were considered essential for rabbit persistence. The anticipated increase in disease resistance was predicted to increase the probability of moderately suitable habitat becoming highly suitable, but not increase the at-risk area. We demonstrate that it is possible to build spatial models to guide regional-level management of vertebrate pests which use the best available knowledge and capture fine spatial-scale processes.
Resumo:
This paper reports an approach by which laboratory based testing and numerical modelling can be combined to predict the long term performance of a range of concretes exposed to marine environments. Firstly, a critical review of the test methods for assessing the chloride penetration resistance of concrete is given. The repeatability of the different test results is also included. In addition to the test methods, a numerical simulation model is used to explore the test data further to obtain long-term chloride ingress trends. The combined use of testing and modelling is validated with the help of long-term chloride ingress data from a North Sea exposure site. In summary, the paper outlines a methodology for determining the long term performance of concrete in marine environments.
Resumo:
Dissertação de Mestrado, Estudos Integrados dos Oceanos, 25 de Março de 2013, Universidade dos Açores.
Resumo:
Experience is lacking with mineral scaling and corrosion in enhanced geothermal systems (EGS) in which surface water is circulated through hydraulically stimulated crystalline rocks. As an aid in designing EGS projects we have conducted multicomponent reactive-transport simulations to predict the likely characteristics of scales and corrosion that may form when exploiting heat from granitoid reservoir rocks at ∼200 °C and 5 km depth. The specifications of an EGS project at Basel, Switzerland, are used to constrain the model. The main water–rock reactions in the reservoir during hydraulic stimulation and the subsequent doublet operation were identified in a separate paper (Alt-Epping et al., 2013b). Here we use the computed composition of the reservoir fluid to (1) predict mineral scaling in the injection and production wells, (2) evaluate methods of chemical geothermometry and (3) identify geochemical indicators of incipient corrosion. The envisaged heat extraction scheme ensures that even if the reservoir fluid is in equilibrium with quartz, cooling of the fluid will not induce saturation with respect to amorphous silica, thus eliminating the risk of silica scaling. However, the ascending fluid attains saturation with respect to crystalline aluminosilicates such as albite, microcline and chlorite, and possibly with respect to amorphous aluminosilicates. If no silica-bearing minerals precipitate upon ascent, reservoir temperatures can be predicted by classical formulations of silica geothermometry. In contrast, Na/K concentration ratios in the production fluid reflect steady-state conditions in the reservoir rather than albite–microcline equilibrium. Thus, even though igneous orthoclase is abundant in the reservoir and albite precipitates as a secondary phase, Na/K geothermometers fail to yield accurate temperatures. Anhydrite, which is present in fractures in the Basel reservoir, is predicted to dissolve during operation. This may lead to precipitation of pyrite and, at high exposure of anhydrite to the circulating fluid, of hematite scaling in the geothermal installation. In general, incipient corrosion of the casing can be detected at the production wellhead through an increase in H2(aq) and the enhanced precipitation of Fe-bearing aluminosilicates. The appearance of magnetite in scales indicates high corrosion rates.
Resumo:
In the last two decades there have been substantial developments in the mathematical theory of inverse optimization problems, and their applications have expanded greatly. In parallel, time series analysis and forecasting have become increasingly important in various fields of research such as data mining, economics, business, engineering, medicine, politics, and many others. Despite the large uses of linear programming in forecasting models there is no a single application of inverse optimization reported in the forecasting literature when the time series data is available. Thus the goal of this paper is to introduce inverse optimization into forecasting field, and to provide a streamlined approach to time series analysis and forecasting using inverse linear programming. An application has been used to demonstrate the use of inverse forecasting developed in this study. © 2007 Elsevier Ltd. All rights reserved.
Resumo:
This thesis presents novel modelling applications for environmental geospatial data using remote sensing, GIS and statistical modelling techniques. The studied themes can be classified into four main themes: (i) to develop advanced geospatial databases. Paper (I) demonstrates the creation of a geospatial database for the Glanville fritillary butterfly (Melitaea cinxia) in the Åland Islands, south-western Finland; (ii) to analyse species diversity and distribution using GIS techniques. Paper (II) presents a diversity and geographical distribution analysis for Scopulini moths at a world-wide scale; (iii) to study spatiotemporal forest cover change. Paper (III) presents a study of exotic and indigenous tree cover change detection in Taita Hills Kenya using airborne imagery and GIS analysis techniques; (iv) to explore predictive modelling techniques using geospatial data. In Paper (IV) human population occurrence and abundance in the Taita Hills highlands was predicted using the generalized additive modelling (GAM) technique. Paper (V) presents techniques to enhance fire prediction and burned area estimation at a regional scale in East Caprivi Namibia. Paper (VI) compares eight state-of-the-art predictive modelling methods to improve fire prediction, burned area estimation and fire risk mapping in East Caprivi Namibia. The results in Paper (I) showed that geospatial data can be managed effectively using advanced relational database management systems. Metapopulation data for Melitaea cinxia butterfly was successfully combined with GPS-delimited habitat patch information and climatic data. Using the geospatial database, spatial analyses were successfully conducted at habitat patch level or at more coarse analysis scales. Moreover, this study showed it appears evident that at a large-scale spatially correlated weather conditions are one of the primary causes of spatially correlated changes in Melitaea cinxia population sizes. In Paper (II) spatiotemporal characteristics of Socupulini moths description, diversity and distribution were analysed at a world-wide scale and for the first time GIS techniques were used for Scopulini moth geographical distribution analysis. This study revealed that Scopulini moths have a cosmopolitan distribution. The majority of the species have been described from the low latitudes, sub-Saharan Africa being the hot spot of species diversity. However, the taxonomical effort has been uneven among biogeographical regions. Paper III showed that forest cover change can be analysed in great detail using modern airborne imagery techniques and historical aerial photographs. However, when spatiotemporal forest cover change is studied care has to be taken in co-registration and image interpretation when historical black and white aerial photography is used. In Paper (IV) human population distribution and abundance could be modelled with fairly good results using geospatial predictors and non-Gaussian predictive modelling techniques. Moreover, land cover layer is not necessary needed as a predictor because first and second-order image texture measurements derived from satellite imagery had more power to explain the variation in dwelling unit occurrence and abundance. Paper V showed that generalized linear model (GLM) is a suitable technique for fire occurrence prediction and for burned area estimation. GLM based burned area estimations were found to be more superior than the existing MODIS burned area product (MCD45A1). However, spatial autocorrelation of fires has to be taken into account when using the GLM technique for fire occurrence prediction. Paper VI showed that novel statistical predictive modelling techniques can be used to improve fire prediction, burned area estimation and fire risk mapping at a regional scale. However, some noticeable variation between different predictive modelling techniques for fire occurrence prediction and burned area estimation existed.
Resumo:
The Taita Hills in southeastern Kenya form the northernmost part of Africa’s Eastern Arc Mountains, which have been identified by Conservation International as one of the top ten biodiversity hotspots on Earth. As with many areas of the developing world, over recent decades the Taita Hills have experienced significant population growth leading to associated major changes in land use and land cover (LULC), as well as escalating land degradation, particularly soil erosion. Multi-temporal medium resolution multispectral optical satellite data, such as imagery from the SPOT HRV, HRVIR, and HRG sensors, provides a valuable source of information for environmental monitoring and modelling at a landscape level at local and regional scales. However, utilization of multi-temporal SPOT data in quantitative remote sensing studies requires the removal of atmospheric effects and the derivation of surface reflectance factor. Furthermore, for areas of rugged terrain, such as the Taita Hills, topographic correction is necessary to derive comparable reflectance throughout a SPOT scene. Reliable monitoring of LULC change over time and modelling of land degradation and human population distribution and abundance are of crucial importance to sustainable development, natural resource management, biodiversity conservation, and understanding and mitigating climate change and its impacts. The main purpose of this thesis was to develop and validate enhanced processing of SPOT satellite imagery for use in environmental monitoring and modelling at a landscape level, in regions of the developing world with limited ancillary data availability. The Taita Hills formed the application study site, whilst the Helsinki metropolitan region was used as a control site for validation and assessment of the applied atmospheric correction techniques, where multiangular reflectance field measurements were taken and where horizontal visibility meteorological data concurrent with image acquisition were available. The proposed historical empirical line method (HELM) for absolute atmospheric correction was found to be the only applied technique that could derive surface reflectance factor within an RMSE of < 0.02 ps in the SPOT visible and near-infrared bands; an accuracy level identified as a benchmark for successful atmospheric correction. A multi-scale segmentation/object relationship modelling (MSS/ORM) approach was applied to map LULC in the Taita Hills from the multi-temporal SPOT imagery. This object-based procedure was shown to derive significant improvements over a uni-scale maximum-likelihood technique. The derived LULC data was used in combination with low cost GIS geospatial layers describing elevation, rainfall and soil type, to model degradation in the Taita Hills in the form of potential soil loss, utilizing the simple universal soil loss equation (USLE). Furthermore, human population distribution and abundance were modelled with satisfactory results using only SPOT and GIS derived data and non-Gaussian predictive modelling techniques. The SPOT derived LULC data was found to be unnecessary as a predictor because the first and second order image texture measurements had greater power to explain variation in dwelling unit occurrence and abundance. The ability of the procedures to be implemented locally in the developing world using low-cost or freely available data and software was considered. The techniques discussed in this thesis are considered equally applicable to other medium- and high-resolution optical satellite imagery, as well the utilized SPOT data.