89 results for Data Streams Distribution
Abstract:
1. The population density and age structure of two species of heather psyllid Strophingia ericae and Strophingia cinereae, feeding on Calluna vulgaris and Erica cinerea, respectively, were sampled using standardized methods at locations throughout Britain. Locations were chosen to represent the full latitudinal and altitudinal range of the host plants.
2. The paper explains how spatial variation in thermal environment, insect life-history characteristics and physiology, and plant distribution, interact to provide the mechanisms that determine the range and abundance of Strophingia spp.
3. Strophingia ericae and S. cinereae, despite the similarity in the spatial distribution patterns of their host plants within Britain, display strongly contrasting geographical ranges and corresponding life-history strategies. Strophingia ericae is found on its host plant throughout Britain but S. cinereae is restricted to low elevation sites south of the Mersey-Humber line and occupies only part of the latitudinal and altitudinal range of its host plant. There is no evidence to suggest that S. ericae has reached its potential altitudinal or latitudinal limit in the UK, even though its host plant appears to reach its altitudinal limit.
4. There was little difference in the ability of the two Strophingia spp. to survive short-term exposure to temperatures as low as -15 degrees C, and low winter temperatures probably do not limit distribution in S. cinereae.
5. Population density of S. ericae was not related to altitude but showed a weak correlation with latitude. The spread of larval instars present at a site, measured as an index of instar homogeneity, was significantly correlated with a range of temperature-related variables, of which May mean temperature and length of growing season above 3 degrees C (calculated using the Lennon and Turner climatic model) were the most significant. Factor analysis did not improve the level of correlation significantly above that obtained for single climatic variables. The data confirmed that S. ericae has a 1 year life cycle at the lowest elevations and a 2 year life cycle at the higher elevations. However, there was no evidence, as previously suggested, for an abrupt change from a 1 to a 2 year life cycle in S. ericae with increasing altitudes or latitudes.
6. By contrast with S. ericae, S. cinereae had an obligatory 1 year life cycle, its population decreased with altitude and the index of instar homogeneity showed little correlation with single temperature variables. Moreover, it occupied only part of the range of its host plant and its spatial distribution in the UK could be predicted with 96% accuracy using selected variables in discriminant analysis.
7. The life histories of the congeneric heather psyllids reflect adaptations that allow them to exploit host plants with different distributions in climatic and thereby geographical space. Strophingia ericae has the flexible life history that enables it to exploit C. vulgaris throughout its European boreal temperate range. Strophingia cinereae has a less flexible life history and is adapted for living on an oceanic temperate host. While the geographic ranges of the two Strophingia spp. overlap within the UK, the psyllids appear to respond differently to variation in their thermal environment.
Abstract:
1. The prediction and mapping of climate in areas between climate stations is of increasing importance in ecology.
2. Four categories of model, simple interpolation, thin plate splines, multiple linear regression and mixed spline-regression, were tested for their ability to predict the spatial distribution of temperature on the British mainland. The models were tested by external cross-verification.
3. The British distribution of mean daily temperature was predicted with the greatest accuracy by using a mixed model: a thin plate spline fitted to the surface of the country, after correction of the data by a selection from 16 independent topographical variables (such as altitude, distance from the sea, slope and topographic roughness), chosen by multiple regression from a digital terrain model (DTM) of the country.
4. The next most accurate method was a pure multiple regression model using the DTM. Both regression and thin plate spline models based on a few variables (latitude, longitude and altitude) only were comparatively unsatisfactory, but some rather simple methods of surface interpolation (such as bilinear interpolation after correction to sea level) gave moderately satisfactory results. Differences between the methods seemed to be dependent largely on their ability to model the effect of the sea on land temperatures.
5. Prediction of temperature by the best methods was greater than 95% accurate in all months of the year, as shown by the correlation between the predicted and actual values. The predicted temperatures were calculated at real altitudes, not subject to sea-level correction.
6. A minimum of just over 30 temperature recording stations would generate a satisfactory surface, provided the stations were well spaced.
7. Maps of mean daily temperature, using the best overall methods are provided; further important variables, such as continentality and length of growing season, were also mapped. Many of these are believed to be the first detailed representations at real altitude.
8. The interpolated monthly temperature surfaces are available on disk.
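As an illustration of the simple method named in point 4 (bilinear interpolation after correction to sea level), the following is a minimal sketch and not the authors' implementation. It assumes a constant lapse rate of 0.0065 degrees C per metre and four stations sitting at the corners of a rectangular cell; the value, the function names and the cell layout are all assumptions made for this example.

```python
# Assumed lapse rate in degrees C per metre; the abstract does not state
# the value the authors used.
ASSUMED_LAPSE_RATE = 0.0065


def to_sea_level(temp_c, altitude_m, lapse=ASSUMED_LAPSE_RATE):
    """Reduce a station temperature to its sea-level equivalent."""
    return temp_c + lapse * altitude_m


def bilinear(x, y, x0, x1, y0, y1, q00, q10, q01, q11):
    """Bilinear interpolation inside the rectangle [x0, x1] x [y0, y1].

    q00 is the value at (x0, y0), q10 at (x1, y0),
    q01 at (x0, y1) and q11 at (x1, y1).
    """
    tx = (x - x0) / (x1 - x0)
    ty = (y - y0) / (y1 - y0)
    south = q00 * (1 - tx) + q10 * tx  # interpolate along the y0 edge
    north = q01 * (1 - tx) + q11 * tx  # interpolate along the y1 edge
    return south * (1 - ty) + north * ty


def predict_at_real_altitude(x, y, altitude_m, bounds, corners,
                             lapse=ASSUMED_LAPSE_RATE):
    """Interpolate sea-level temperatures, then restore the real altitude.

    bounds  -- (x0, x1, y0, y1) of the cell
    corners -- four (temp_c, altitude_m) pairs in the order q00, q10, q01, q11
    """
    x0, x1, y0, y1 = bounds
    sea = [to_sea_level(t, a, lapse) for t, a in corners]
    at_sea_level = bilinear(x, y, x0, x1, y0, y1, *sea)
    return at_sea_level - lapse * altitude_m
```

The final step moves the interpolated value back to the altitude of the target point, in line with the abstract's note that predictions were made at real altitudes rather than at sea level.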
Abstract:
In this paper we propose a graph stream clustering algorithm with a unified similarity measure on both structural and attribute properties of vertices, with each attribute being treated as a vertex. Unlike other approaches, ours does not require the number of clusters as an input parameter; instead, it dynamically creates new sketch-based clusters and periodically merges existing similar clusters. Experiments on two publicly available datasets reveal the advantages of our approach in detecting vertex clusters in the graph stream. We provide a detailed investigation into how parameters affect the algorithm's performance. We also provide a quantitative evaluation and comparison with a well-known offline community detection algorithm, which shows that our streaming algorithm can achieve comparable or better average cluster purity.
Abstract:
Two common scenarios in Geoforensics (definition in text) are considered: the provenance, or localization, of unknown samples and the question of sample variability at scenes of crime/alibi locations. Both have been discussed in forensic and soil science publications, but mostly within a theoretical or non-forensic context. These previous publications provide context for the two case study scenarios (one actual, one based on a range of criminal casework) that consider provenance and variability. A challenging scientific question in geoforensics is the provenance question, ‘where may this sample have come from?’, which the Tellus data can assist in answering. The question of variation between samples may be less of a challenge, yet variation between a suspect sample within a scene of crime requires detailed sampling. Variation on a larger (tens to hundreds of kilometres) scale may provide useful intelligence on where a sample came from. To summarise, databases such as Tellus and TellusBorder may be used as effective tools to assist in the search for the origin of displaced soil and sediment.
Abstract:
This study examines the firm size distribution of US banks and credit unions. A truncated lognormal distribution describes the size distribution, measured using assets data, of a large population of small, community-based commercial banks. The size distribution of a smaller but increasingly dominant cohort of large banks, which operate a high-volume low-cost retail banking model, exhibits power-law behaviour. There is a progressive increase in skewness over time, and Zipf’s Law is rejected as a descriptor of the size distribution in the upper tail. By contrast, the asset size distribution of the population of credit unions conforms closely to the lognormal distribution.
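Zipf's Law corresponds to a power-law upper tail with exponent 1, so one common way to examine such a claim is to estimate the tail exponent from the largest observations. The sketch below uses the Hill estimator for that purpose; this is an illustrative choice, as the abstract does not say which estimator or test the authors applied, and the function name and the choice of `k` are assumptions.

```python
import math


def hill_tail_exponent(sizes, k):
    """Hill estimate of the power-law tail exponent from the k largest values.

    The (k+1)-th largest observation serves as the tail threshold.
    An estimate close to 1 is what Zipf's Law would predict.
    """
    ordered = sorted(sizes, reverse=True)
    if not 0 < k < len(ordered):
        raise ValueError("k must satisfy 0 < k < len(sizes)")
    threshold = ordered[k]
    if threshold <= 0:
        raise ValueError("sizes in the tail must be positive")
    log_excess = sum(math.log(x / threshold) for x in ordered[:k])
    if log_excess == 0:
        raise ValueError("tail values are all equal to the threshold")
    return k / log_excess
```

The estimate is sensitive to `k`, so in practice it is usually reported for a range of tail sizes rather than a single one.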
Abstract:
Context. Thanks to the advent of Herschel and ALMA, new high-quality observations of molecules present in the circumstellar envelopes of asymptotic giant branch (AGB) stars are being reported that reveal large differences from the existing chemical models. New molecular data and more comprehensive models of the chemistry in circumstellar envelopes are now available.
Aims: The aims are to determine and study the important formation and destruction pathways in the envelopes of O-rich AGB stars and to provide more reliable predictions of abundances, column densities, and radial distributions for potentially detectable species with physical conditions applicable to the envelope surrounding IK Tau.
Methods: We use a large gas-phase chemical model of an AGB envelope including the effects of CO and N2 self-shielding in a spherical geometry and a newly compiled list of inner-circumstellar envelope parent species derived from detailed modeling and observations. We trace the dominant chemistry in the expanding envelope and investigate the chemistry as a probe for the physics of the AGB phase by studying variations of abundances with mass-loss rates and expansion velocities.
Results: We find a pattern of daughter molecules forming from the photodissociation products of parent species with contributions from ion-neutral abstraction and dissociative recombination. The chemistry in the outer zones differs from that in traditional PDRs in that photoionization of daughter species plays a significant role. With the proper treatment of self-shielding, the N → N2 and C+ → CO transitions are shifted outward by factors of 7 and 2, respectively, compared with earlier models. An upper limit on the abundance of CH4 as a parent species of (≲2.5 × 10^-6 with respect to H2) is found for IK Tau, and several potentially observable molecules with relatively simple chemical links to other parent species are determined. The assumed stellar mass-loss rate, in particular, has an impact on the calculated abundances of cations and the peak-abundance radius of both cations and neutrals: as the mass-loss rate increases, the peak abundance of cations generally decreases and the peak-abundance radius of all species moves outwards. The effects of varying the envelope expansion velocity and cosmic-ray ionization rate are not as significant.
Abstract:
Using new biomarker data from the 2010 pilot round of the Longitudinal Aging Study in India (LASI), we investigate education, gender, and state-level disparities in health. We find that hemoglobin level, a marker for anemia, is lower for respondents with no schooling (0.7 g/dL less in the adjusted model) compared to those with some formal education and is also lower for females than for males (2.0 g/dL less in the adjusted model). In addition, we find that about one third of respondents in our sample aged 45 or older have high C-reactive protein (CRP) levels (>3 mg/L), an indicator of inflammation and a risk factor for cardiovascular disease. We find no evidence of educational or gender differences in CRP, but there are significant state-level disparities, with Kerala residents exhibiting the lowest CRP levels (a mean of 1.96 mg/L compared to 3.28 mg/L in Rajasthan, the state with the highest CRP). We use the Blinder–Oaxaca decomposition approach to explain group-level differences, and find that state-level disparities in CRP are mainly due to heterogeneity in the association of the observed characteristics of respondents with CRP, rather than differences in the distribution of endowments across the sampled state populations.
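For readers unfamiliar with the method, a standard twofold form of the Blinder–Oaxaca decomposition can be written as follows. This is a sketch of one common variant, taking group B's coefficients as the reference; the abstract does not say which variant the authors used, and the symbols are introduced here for illustration. With separate linear regressions fitted for groups A and B, the gap in mean outcomes splits as

```latex
\bar{Y}_A - \bar{Y}_B
  = \underbrace{(\bar{X}_A - \bar{X}_B)^{\top}\hat{\beta}_B}_{\text{endowments}}
  + \underbrace{\bar{X}_A^{\top}(\hat{\beta}_A - \hat{\beta}_B)}_{\text{coefficients}}
```

where \bar{X}_g is the vector of mean observed characteristics and \hat{\beta}_g the estimated coefficient vector for group g. In these terms, the finding reported above is that the second (coefficients) term accounts for most of the state-level gap in CRP.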
Abstract:
BACKGROUND: Worldwide data for cancer survival are scarce. We aimed to initiate worldwide surveillance of cancer survival by central analysis of population-based registry data, as a metric of the effectiveness of health systems, and to inform global policy on cancer control.
METHODS: Individual tumour records were submitted by 279 population-based cancer registries in 67 countries for 25·7 million adults (age 15-99 years) and 75,000 children (age 0-14 years) diagnosed with cancer during 1995-2009 and followed up to Dec 31, 2009, or later. We looked at cancers of the stomach, colon, rectum, liver, lung, breast (women), cervix, ovary, and prostate in adults, and adult and childhood leukaemia. Standardised quality control procedures were applied; errors were corrected by the registry concerned. We estimated 5-year net survival, adjusted for background mortality in every country or region by age (single year), sex, and calendar year, and by race or ethnic origin in some countries. Estimates were age-standardised with the International Cancer Survival Standard weights.
FINDINGS: 5-year survival from colon, rectal, and breast cancers has increased steadily in most developed countries. For patients diagnosed during 2005-09, survival for colon and rectal cancer reached 60% or more in 22 countries around the world; for breast cancer, 5-year survival rose to 85% or higher in 17 countries worldwide. Liver and lung cancer remain lethal in all nations: for both cancers, 5-year survival is below 20% everywhere in Europe, in the range 15-19% in North America, and as low as 7-9% in Mongolia and Thailand. Striking rises in 5-year survival from prostate cancer have occurred in many countries: survival rose by 10-20% between 1995-99 and 2005-09 in 22 countries in South America, Asia, and Europe, but survival still varies widely around the world, from less than 60% in Bulgaria and Thailand to 95% or more in Brazil, Puerto Rico, and the USA. For cervical cancer, national estimates of 5-year survival range from less than 50% to more than 70%; regional variations are much wider, and improvements between 1995-99 and 2005-09 have generally been slight. For women diagnosed with ovarian cancer in 2005-09, 5-year survival was 40% or higher only in Ecuador, the USA, and 17 countries in Asia and Europe. 5-year survival for stomach cancer in 2005-09 was high (54-58%) in Japan and South Korea, compared with less than 40% in other countries. By contrast, 5-year survival from adult leukaemia in Japan and South Korea (18-23%) is lower than in most other countries. 5-year survival from childhood acute lymphoblastic leukaemia is less than 60% in several countries, but as high as 90% in Canada and four European countries, which suggests major deficiencies in the management of a largely curable disease.
INTERPRETATION: International comparison of survival trends reveals very wide differences that are likely to be attributable to differences in access to early diagnosis and optimum treatment. Continuous worldwide surveillance of cancer survival should become an indispensable source of information for cancer patients and researchers and a stimulus for politicians to improve health policy and health-care systems.
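Age standardisation, as mentioned in the methods, amounts to a weighted average of age-specific survival estimates using a fixed set of standard weights. The sketch below shows only that arithmetic; the age groups, survival values and weights a caller passes in are hypothetical placeholders, not the International Cancer Survival Standard weights and not figures from the study.

```python
def age_standardised_survival(survival_by_age, weights):
    """Weighted average of age-specific survival estimates.

    survival_by_age -- survival proportions, one per age group
    weights         -- standard-population weights for the same age groups
                       (normalised here, so they need not sum to 1)
    """
    if len(survival_by_age) != len(weights):
        raise ValueError("one weight is required per age group")
    total = sum(weights)
    if total <= 0:
        raise ValueError("weights must sum to a positive value")
    return sum(s * w for s, w in zip(survival_by_age, weights)) / total
```

Because every population is weighted with the same standard, differences in the standardised figure reflect differences in age-specific survival rather than in age structure.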
Abstract:
This paper presents data from the English Channel area of Britain and Northern France on the spatial distribution of Lower to early Middle Palaeolithic pre-MIS5 interglacial sites which are used to test the contention that the pattern of the richest sites is a real archaeological distribution and not of taphonomic origin. These sites show a marked concentration in the middle-lower reaches of river valleys with most being upstream of, but close to, estimated interglacial tidal limits. A plant and animal database derived from Middle-Late Pleistocene sites in the region is used to estimate the potentially edible foods and their distribution in the typically undulating landscape of the region. This is then converted into the potential availability of macronutrients (proteins, carbohydrates, fats) and selected micronutrients. The floodplain is shown to be the optimum location in the nutritional landscape (nutriscape). In addition to both absolute and seasonal macronutrient advantages the floodplains could have provided foods rich in key micronutrients, which are linked to better health, the maintenance of fertility and minimization of infant mortality. Such places may have been seen as ‘good (or healthy) places’ explaining the high number of artefacts accumulated by repeated visitation over long periods of time and possible occupation. The distribution of these sites reflects the richest aquatic and wetland successional habitats along valley floors. Such locations would have provided foods rich in a wide range of nutrients, importantly including those in short supply at these latitudes. When combined with other benefits, the high nutrient diversity made these locations the optimal niche in northwest European mixed temperate woodland environments. It is argued here that the use of these nutritionally advantageous locations as nodal or central points facilitated a healthy variant of the Palaeolithic diet which permitted habitation at the edge of these hominins’ range.
Abstract:
Current variation-aware design methodologies, tuned for worst-case scenarios, are becoming increasingly pessimistic from the perspective of power and performance. A good example of such pessimism is setting the refresh rate of DRAMs according to the worst-case access statistics, thereby resulting in very frequent refresh cycles, which are responsible for the majority of the standby power consumption of these memories. However, such a high refresh rate may not be required, either due to the extremely low probability of the actual occurrence of such a worst case, or due to the inherent error-resilient nature of many applications that can tolerate a certain number of potential failures. In this paper, we exploit and quantify the possibilities that exist in dynamic memory design by shifting to the so-called approximate computing paradigm in order to save power and enhance yield at no cost. The statistical characteristics of the retention time in dynamic memories were revealed by studying a fabricated 2kb CMOS compatible embedded DRAM (eDRAM) memory array based on gain-cells. Measurements show that up to 73% of the retention power can be saved by altering the refresh time and setting it such that a small number of failures is allowed. We show that these savings can be further increased by utilizing known circuit techniques, such as body biasing, which can help not only in extending, but also in preferably shaping the retention time distribution. Our approach is one of the first attempts to assess the data integrity and energy tradeoffs achieved in eDRAMs for utilizing them in error-resilient applications and can prove helpful in the anticipated shift to approximate computing.
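The idea of relaxing the refresh period so that a small number of cells are allowed to fail can be sketched as an order-statistic calculation on measured retention times. This is an illustration only, not the authors' procedure: it assumes a cell fails when its retention time is shorter than the refresh period, and that refresh power is inversely proportional to the refresh period. Both are simplifying assumptions, and the function names are made up for this example.

```python
def relaxed_refresh_period(retention_times, allowed_failures):
    """Longest refresh period for which at most `allowed_failures` cells fail.

    A cell is assumed to fail if its retention time is strictly shorter
    than the refresh period. With allowed_failures=0 this returns the
    worst-case (minimum) retention time.
    """
    ordered = sorted(retention_times)
    if not 0 <= allowed_failures < len(ordered):
        raise ValueError("allowed_failures must be in [0, number of cells)")
    return ordered[allowed_failures]


def refresh_power_saving(retention_times, allowed_failures):
    """Fractional refresh-power saving relative to the worst-case period.

    Assumes refresh power scales as 1 / refresh period.
    """
    worst_case = relaxed_refresh_period(retention_times, 0)
    relaxed = relaxed_refresh_period(retention_times, allowed_failures)
    return 1.0 - worst_case / relaxed
```

Under these assumptions the achievable saving depends on how far the weakest cells sit below the rest of the retention time distribution, which is why shaping that distribution matters.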
Abstract:
Conventional practice in Regional Geochemistry includes as a final step of any geochemical campaign the generation of a series of maps, to show the spatial distribution of each of the components considered. Such maps, though necessary, do not comply with the compositional, relative nature of the data, which unfortunately makes any conclusion based on them sensitive to spurious correlation problems. This is one of the reasons why these maps are never interpreted in isolation. This contribution aims at gathering a series of statistical methods to produce individual maps of multiplicative combinations of components (logcontrasts), much in the flavor of equilibrium constants, which are designed on purpose to capture certain aspects of the data.
We distinguish between supervised and unsupervised methods, where the first require an external, non-compositional variable (besides the compositional geochemical information) available in an analogous training set. This external variable can be a quantity (soil density, collocated magnetics, collocated ratio of Th/U spectral gamma counts, proportion of clay particle fraction, etc.) or a category (rock type, land use type, etc.). In the supervised methods, a regression-like model between the external variable and the geochemical composition is derived in the training set, and then this model is mapped on the whole region. This case is illustrated with the Tellus dataset, covering Northern Ireland at a density of 1 soil sample per 2 square km, where we map the presence of blanket peat and the underlying geology. The unsupervised methods considered include principal components and principal balances (Pawlowsky-Glahn et al., CoDaWork2013), i.e. logcontrasts of the data that are devised to capture very large variability or else be quasi-constant. Using the Tellus dataset again, it is found that geological features are highlighted by the quasi-constant ratios Hf/Nb and their ratio against SiO2; Rb/K2O and Zr/Na2O and the balance between these two groups of two variables; the balance of Al2O3 and TiO2 vs. MgO; or the balance of Cr, Ni and Co vs. V and Fe2O3. The largest variability appears to be related to the presence/absence of peat.
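A balance between two groups of components, such as Al2O3 and TiO2 vs. MgO, is usually computed as a normalised log-ratio of the groups' geometric means. The sketch below uses the standard isometric log-ratio balance formula; whether the authors apply exactly this normalisation is an assumption, and any concentrations passed in are the caller's own data, not values from the Tellus dataset.

```python
import math


def geometric_mean(values):
    """Geometric mean of strictly positive values."""
    if not values or any(v <= 0 for v in values):
        raise ValueError("values must be non-empty and strictly positive")
    return math.exp(sum(math.log(v) for v in values) / len(values))


def balance(numerator_parts, denominator_parts):
    """Balance (a normalised logcontrast) between two groups of parts.

    b = sqrt(r * s / (r + s)) * ln(gm(numerator) / gm(denominator)),
    where r and s are the numbers of parts in each group.
    """
    r = len(numerator_parts)
    s = len(denominator_parts)
    scale = math.sqrt(r * s / (r + s))
    ratio = geometric_mean(numerator_parts) / geometric_mean(denominator_parts)
    return scale * math.log(ratio)
```

Because only ratios enter the formula, the result is unchanged if every concentration in a sample is multiplied by the same constant, which is the property that makes such maps consistent with the relative nature of compositional data.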
Abstract:
Assessment of Human papillomavirus (HPV) prevalence and genotype distribution is important for monitoring the impact of prophylactic HPV vaccination. This study aimed to demonstrate the HPV genotypes predominating in pre-malignant and cervical cancers in Northern Ireland (NI) before the vaccination campaign takes effect. Formalin fixed paraffin embedded tissue blocks from 2,303 women aged 16-93 years throughout NI were collated between April 2011 and February 2013. HPV DNA was amplified by PCR and HPV genotyping undertaken using the Roche® linear array detection kit. In total, 1,241 out of 1,830 eligible samples (68.0%) tested positive for HPV, with the majority of these [1,181/1,830 (64.5%)] having high-risk (HR) HPV infection; 37.4% were positive for HPV-16 (n=684) and 5.1% for HPV-18 (n=93). HPV type-specific prevalence was 48.1%, 65.9%, 81.3%, 92.2%, and 64.3% among cervical intraepithelial neoplasias (CIN) Grades I-III, squamous cell carcinomas (SCC) and adenocarcinoma (AC) cases, respectively. Most SCC cases (81.3%) had only one HPV genotype detected and almost a third (32.0%) of all cervical pathologies were HPV negative including 51.9% of CIN I (n=283), 34.1% of CIN II (n=145), 18.7% of CIN III (n=146), 7.8% of SCC (n=5), and 35.7% of AC (n=5) cases. This study provides important baseline data for monitoring the effect of HPV vaccination in NI and for comparison with other UK regions. The coverage of other HR-HPV genotypes apart from 16 and 18, including HPV-45, 31, 39, and 52, and the potential for cross protection, should be taken into account when considering future polyvalent vaccines.
Abstract:
The Virtual Atomic and Molecular Data Centre (VAMDC) Consortium is a worldwide consortium which federates atomic and molecular databases through an e-science infrastructure and an organisation to support this activity. About 90% of the inter-connected databases handle data that are used for the interpretation of astronomical spectra and for modelling in many fields of astrophysics. Recently the VAMDC Consortium has connected databases from the radiation damage and the plasma communities, as well as promoting the publication of data from Indian institutes. This paper describes how the VAMDC Consortium is organised for the optimal distribution of atomic and molecular data for scientific research. It is noted that the VAMDC Consortium strongly advocates that authors of research papers using data cite the original experimental and theoretical papers as well as the relevant databases.
Abstract:
Biogas from anaerobic digestion of sewage sludge is a renewable resource with high energy content, which is formed mainly of CH4 (40-75 vol.%) and CO2 (15-60 vol.%). Other components such as water (H2O, 5-10 vol.%) and trace amounts of hydrogen sulfide and siloxanes can also be present. A CH4-rich stream can be produced by removing the CO2 and other impurities so that the upgraded bio-methane can be injected into the natural gas grid or used as a vehicle fuel. The main objective of this paper is to develop a new modeling methodology to assess the technical and economic performance of biogas upgrading processes using ionic liquids which physically absorb CO2. Three different ionic liquids, namely 1-ethyl-3-methylimidazolium bis[(trifluoromethyl)sulfonyl]imide, 1-hexyl-3-methylimidazolium bis[(trifluoromethyl)sulfonyl]imide and trihexyl(tetradecyl)phosphonium bis[(trifluoromethyl)sulfonyl]imide, are considered for CO2 capture in a pressure-swing regenerative absorption process. The simulation software Aspen Plus and Aspen Process Economic Analyzer are used to account for mass and energy balances as well as equipment cost. In all cases, the biogas upgrading plant consists of a multistage compressor for biogas compression, a packed absorption column for CO2 absorption, a flash evaporator for solvent regeneration, a centrifugal pump for solvent recirculation, a pre-absorber solvent cooler and a gas turbine for electricity recovery. The evaluated processes are compared in terms of energy efficiency, capital investment and bio-methane production costs. The overall plant efficiency ranges from 71-86% whereas the bio-methane production cost ranges from £6.26-7.76 per GJ (LHV). A sensitivity analysis is also performed to determine how several technical and economic parameters affect the bio-methane production costs.
The results of this study show that the simulation methodology developed can predict plant efficiencies and production costs of large scale CO2 capture processes using ionic liquids without having to rely on gas solubility experimental data.