828 resultados para Linear regression analysis
Resumo:
Protocols for bioassessment often relate changes in summary metrics that describe aspects of biotic assemblage structure and function to environmental stress. Biotic assessment using multimetric indices now forms the basis for setting regulatory standards for stream quality and a range of other goals related to water resource management in the USA and elsewhere. Biotic metrics are typically interpreted with reference to the expected natural state to evaluate whether a site is degraded. It is critical that natural variation in biotic metrics along environmental gradients is adequately accounted for, in order to quantify human disturbance-induced change. A common approach used in the IBI is to examine scatter plots of variation in a given metric along a single stream size surrogate and a fit a line (drawn by eye) to form the upper bound, and hence define the maximum likely value of a given metric in a site of a given environmental characteristic (termed the 'maximum species richness line' - MSRL). In this paper we examine whether the use of a single environmental descriptor and the MSRL is appropriate for defining the reference condition for a biotic metric (fish species richness) and for detecting human disturbance gradients in rivers of south-eastern Queensland, Australia. We compare the accuracy and precision of the MSRL approach based on single environmental predictors, with three regression-based prediction methods (Simple Linear Regression, Generalised Linear Modelling and Regression Tree modelling) that use (either singly or in combination) a set of landscape and local scale environmental variables as predictors of species richness. We compared the frequency of classification errors from each method against set biocriteria and contrast the ability of each method to accurately reflect human disturbance gradients at a large set of test sites. The results of this study suggest that the MSRL based upon variation in a single environmental descriptor could not accurately predict species richness at minimally disturbed sites when compared with SLR's based on equivalent environmental variables. Regression-based modelling incorporating multiple environmental variables as predictors more accurately explained natural variation in species richness than did simple models using single environmental predictors. Prediction error arising from the MSRL was substantially higher than for the regression methods and led to an increased frequency of Type I errors (incorrectly classing a site as disturbed). We suggest that problems with the MSRL arise from the inherent scoring procedure used and that it is limited to predicting variation in the dependent variable along a single environmental gradient.
Resumo:
Existing crowd counting algorithms rely on holistic, local or histogram based features to capture crowd properties. Regression is then employed to estimate the crowd size. Insufficient testing across multiple datasets has made it difficult to compare and contrast different methodologies. This paper presents an evaluation across multiple datasets to compare holistic, local and histogram based methods, and to compare various image features and regression models. A K-fold cross validation protocol is followed to evaluate the performance across five public datasets: UCSD, PETS 2009, Fudan, Mall and Grand Central datasets. Image features are categorised into five types: size, shape, edges, keypoints and textures. The regression models evaluated are: Gaussian process regression (GPR), linear regression, K nearest neighbours (KNN) and neural networks (NN). The results demonstrate that local features outperform equivalent holistic and histogram based features; optimal performance is observed using all image features except for textures; and that GPR outperforms linear, KNN and NN regression
Resumo:
After attending this presentation, attendees will gain awareness of the ontogeny of cranial maturation, specifically: (1) the fusion timings of primary ossification centers in the basicranium; and (2) the temporal pattern of closure of the anterior fontanelle, to develop new population-specific age standards for medicolegal death investigation of Australian subadults. This presentation will impact the forensic science community by demonstrating the potential of a contemporary forensic subadult Computed Tomography (CT) database of cranial scans and population data, to recalibrate existing standards for age estimation and quantify growth and development of Australian children. This research welcomes a study design applicable to all countries faced with paucity in skeletal repositories. Accurate assessment of age-at-death of skeletal remains represents a key element in forensic anthropology methodology. In Australian casework, age standards derived from American reference samples are applied in light of scarcity in documented Australian skeletal collections. Currently practitioners rely on antiquated standards, such as the Scheuer and Black1 compilation for age estimation, despite implications of secular trends and population variation. Skeletal maturation standards are population specific and should not be extrapolated from one population to another, while secular changes in skeletal dimensions and accelerated maturation underscore the importance of establishing modern standards to estimate age in modern subadults. Despite CT imaging becoming the gold standard for skeletal analysis in Australia, practitioners caution the application of forensic age standards derived from macroscopic inspection to a CT medium, suggesting a need for revised methodologies. Multi-slice CT scans of subadult crania and cervical vertebrae 1 and 2 were acquired from 350 Australian individuals (males: n=193, females: n=157) aged birth to 12 years. The CT database, projected at 920 individuals upon completion (January 2014), comprises thin-slice DICOM data (resolution: 0.5/0.3mm) of patients scanned since 2010 at major Brisbane Childrens Hospitals. DICOM datasets were subject to manual segmentation, followed by the construction of multi-planar and volume rendering cranial models, for subsequent scoring. The union of primary ossification centers of the occipital bone were scored as open, partially closed or completely closed; while the fontanelles, and vertebrae were scored in accordance with two stages. Transition analysis was applied to elucidate age at transition between union states for each center, and robust age parameters established using Bayesian statistics. In comparison to reported literature, closure of the fontanelles and contiguous sutures in Australian infants occur earlier than reported, with the anterior fontanelle transitioning from open to closed at 16.7±1.1 months. The metopic suture is closed prior to 10 weeks post-partum and completely obliterated by 6 months of age, independent of sex. Utilizing reverse engineering capabilities, an alternate method for infant age estimation based on quantification of fontanelle area and non-linear regression with variance component modeling will be presented. Closure models indicate that the greatest rate of change in anterior fontanelle area occurs prior to 5 months of age. This study complements the work of Scheuer and Black1, providing more specific age intervals for union and temporal maturity of each primary ossification center of the occipital bone. For example, dominant fusion of the sutura intra-occipitalis posterior occurs before 9 months of age, followed by persistence of a hyaline cartilage tongue posterior to the foramen magnum until 2.5 years; with obliteration at 2.9±0.1 years. Recalibrated age parameters for the atlas and axis are presented, with the anterior arch of the atlas appearing at 2.9 months in females and 6.3 months in males; while dentoneural, dentocentral and neurocentral junctions of the axis transitioned from non-union to union at 2.1±0.1 years in females and 3.7±0.1 years in males. These results are an exemplar of significant sexual dimorphism in maturation (p<0.05), with girls exhibiting union earlier than boys, justifying the need for segregated sex standards for age estimation. Studies such as this are imperative for providing updated standards for Australian forensic and pediatric practice and provide an insight into skeletal development of this population. During this presentation, the utility of novel regression models for age estimation of infants will be discussed, with emphasis on three-dimensional modeling capabilities of complex structures such as fontanelles, for the development of new age estimation methods.
Resumo:
Urban areas are growing unsustainably around the world; however, the growth patterns and their associated drivers vary between contexts. As a result, research has highlighted the need to adopt case study based approaches to stimulate the development of new theoretic understandings. Using land-cover data sets derived from Landsat images (30 m × 30 m), this research identifies both patterns and drivers of urban growth in a period (1991-2001) when a number of policy acts were enacted aimed at fostering smart growth in Brisbane, Australia. A linear multiple regression model was estimated using the proportion of lands that were converted from non-built-up (1991) to built-up usage (2001) within a suburb as a dependent variable to identify significant drivers of land-cover changes. In addition, the hot spot analysis was conducted to identify spatial biases of land-cover changes, if any. Results show that the built-up areas increased by 1.34% every year. About 19.56% of the non-built-up lands in 1991 were converted into built-up lands in 2001. This conversion pattern was significantly biased in the northernmost and southernmost suburbs in the city. This is due to the fact that, as evident from the regression analysis, these suburbs experienced a higher rate of population growth, and had the availability of habitable green field sites in relatively flat lands. The above findings suggest that the policy interventions undertaken between the periods were not as effective in promoting sustainable changes in the environment as they were aimed for.
Resumo:
In the aftermath of the global financial crisis, effective risk management (RM) and its communication to stakeholders are now considered essential components in corporate governance. However, despite the importance of RM communication, it is still unclear how and to what extent disclosures in financial reports can achieve effective communication of RM activities. The situation is hampered by the paucity of international RM Research that captures institution differences in corporate governance standards. The Australian setting provides an ideal environment in which to examine RM communication because the Australian Securities Exchange (ASX) has since 2007 recommended RM disclosures under its principle-based governance rules. The recommendations are contained in Principle 7 of the Corporate Governance Principles and recommendations (ASX CGPR). Accordingly, to assess the effectiveness of the AXS's RM governance principle, this study examines the nature and extent of RM disclosures reported by major ASX-listed firms. Using a mixed method approach (thematic content analysis and a series of regression analysis) we find widespread divergence in disclosure practices and low conformance with the Principle 7 recommendations. Certain corporate governance mechanisms appear to influence some categories of RM dislcosure but equity risk has surprisingly little explanatory power. These results suggest that the RM disclosures practices observed in the Australian setting may not be meeting the objectives of regulators and the needs of stakeholders.
Resumo:
Protein adsorption at solid-liquid interfaces is critical to many applications, including biomaterials, protein microarrays and lab-on-a-chip devices. Despite this general interest, and a large amount of research in the last half a century, protein adsorption cannot be predicted with an engineering level, design-orientated accuracy. Here we describe a Biomolecular Adsorption Database (BAD), freely available online, which archives the published protein adsorption data. Piecewise linear regression with breakpoint applied to the data in the BAD suggests that the input variables to protein adsorption, i.e., protein concentration in solution; protein descriptors derived from primary structure (number of residues, global protein hydrophobicity and range of amino acid hydrophobicity, isoelectric point); surface descriptors (contact angle); and fluid environment descriptors (pH, ionic strength), correlate well with the output variable-the protein concentration on the surface. Furthermore, neural network analysis revealed that the size of the BAD makes it sufficiently representative, with a neural network-based predictive error of 5% or less. Interestingly, a consistently better fit is obtained if the BAD is divided in two separate sub-sets representing protein adsorption on hydrophilic and hydrophobic surfaces, respectively. Based on these findings, selected entries from the BAD have been used to construct neural network-based estimation routines, which predict the amount of adsorbed protein, the thickness of the adsorbed layer and the surface tension of the protein-covered surface. While the BAD is of general interest, the prediction of the thickness and the surface tension of the protein-covered layers are of particular relevance to the design of microfluidics devices.
Resumo:
Long-term systematic population monitoring data sets are rare but are essential in identifying changes in species abundance. In contrast, community groups and natural history organizations have collected many species lists. These represent a large, untapped source of information on changes in abundance but are generally considered of little value. The major problem with using species lists to detect population changes is that the amount of effort used to obtain the list is often uncontrolled and usually unknown. It has been suggested that using the number of species on the list, the "list length," can be a measure of effort. This paper significantly extends the utility of Franklin's approach using Bayesian logistic regression. We demonstrate the value of List Length Analysis to model changes in species prevalence (i.e., the proportion of lists on which the species occurs) using bird lists collected by a local bird club over 40 years around Brisbane, southeast Queensland, Australia. We estimate the magnitude and certainty of change for 269 bird species and calculate the probabilities that there have been declines and increases of given magnitudes. List Length Analysis confirmed suspected species declines and increases. This method is an important complement to systematically designed intensive monitoring schemes and provides a means of utilizing data that may otherwise be deemed useless. The results of List Length Analysis can be used for targeting species of conservation concern for listing purposes or for more intensive monitoring. While Bayesian methods are not essential for List Length Analysis, they can offer more flexibility in interrogating the data and are able to provide a range of parameters that are easy to interpret and can facilitate conservation listing and prioritization. © 2010 by the Ecological Society of America.
Resumo:
Fluctuations in transit ridership pattern over the year have always concerned transport planners, operators and researchers. Predominantly, metrological elements have been specified to explain variability in ridership volume. However, the outcome of this research points to new direction to explain ridership fluctuation in Brisbane. It explored the relationship between daily bus ridership, seasonality and weather variables for a one-year period, 2012. Rather than segregating the entire year’s ridership into the four calendar seasons (summer, autumn, spring, and winter), this analysis distributed the yearly ridership into nine complex seasonality blocks. These represent calendar season, school/university (academic) period and their corresponding holidays, as well as other observant holidays such as Christmas. The dominance of complex seasonality over typical calendar season was established through analysis and using Multiple Linear Regression (MLR). This research identified a very strong association between complex seasonality and bus ridership. Furthermore, an expectation that Brisbane’s subtropical summer is unfavourable to transit usage was not supported by the findings of this study. A nil association of precipitation and temperature was observed in this region. Finally, this research developed a ridership estimation model, capable of predicting daily ridership within very limited error range. Following the application of this developed model, the estimated annual time series data of each suburb was analysed using Fourier Transformation to appreciate whether any cyclical effects remained, compared with the original data.
Resumo:
This paper proposes the addition of a weighted median Fisher discriminator (WMFD) projection prior to length-normalised Gaussian probabilistic linear discriminant analysis (GPLDA) modelling in order to compensate the additional session variation. In limited microphone data conditions, a linear-weighted approach is introduced to increase the influence of microphone speech dataset. The linear-weighted WMFD-projected GPLDA system shows improvements in EER and DCF values over the pooled LDA- and WMFD-projected GPLDA systems in inter-view-interview condition as WMFD projection extracts more speaker discriminant information with limited number of sessions/ speaker data, and linear-weighted GPLDA approach estimates reliable model parameters with limited microphone data.
Resumo:
This study analysed whether a significant relationship exists between the torque and muscle thickness and pennation angle of the erector spinae muscle during a maximal isometric lumbar extension with the lumbar spine in neutral position. This was a cross-sectional study in which 46 healthy adults performed three repetitions for 5 s of maximal isometric lumbar extension with rests of 90 s. During the lumbar extensions, bilateral ultrasound images of the erector spinae muscle (to measure pennation angle and muscle thickness) and torque were acquired. Reliability test analysis calculating the internal consistency (Cronbach's alpha) of the measure, correlation between pennation angle, muscle thickness and torque extensions were examined. Through a linear regression the contribution of each independent variable (muscle thickness and pennation angle) to the variation of the dependent variable (torque) was calculated. The results of the reliability test were: 0.976–0.979 (pennation angle), 0.980–0.980 (muscle thickness) and 0.994 (torque). The results show that pennation angle and muscle thickness were significantly related to each other with a range between 0.295 and 0.762. In addition, multiple regression analysis showed that the two variables considered in this study explained 68% of the variance in the torque. Pennation angle and muscle thickness have a moderate impact on the variance exerted on the torque during a maximal isometric lumbar extension with the lumbar spine in neutral position.
Resumo:
Objective: Association between ankylosing spondylitis (AS) and two genes, ERAP1 and IL23R, has recently been reported in North American and British populations. The population attributable risk fraction for ERAP1 in this study was 25%, and for IL23R, 9%. Confirmation of these findings to ERAP1 in other ethnic groups has not yet been demonstrated. We sought to test the association between single nucleotide polymorphisms (SNPs) in these genes and susceptibility to AS among a Portuguese population. We also investigated the role of these genes in clinical manifestations of AS, including age of symptom onset, the Bath Ankylosing Spondylitis Disease Activity, Metrology and Functional Indices, and the modified Stoke Ankylosing Spondylitis Spinal Score. Methods: The study was conducted on 358 AS cases and 285 ethnically matched Portuguese healthy controls. AS was defined according to the modified New York Criteria. Genotyping of IL23R and ERAP1 allelic variants was carried out with TaqMan allelic discrimination assays. Association analysis was performed using the Cochrane-Armitage and linear regression tests of genotypes as implemented in PLINK for dichotomous and quantitative variables respectively. A meta-analysis for Portuguese and previously published Spanish IL23R data was performed using the StatsDirect® Statistical tools, by fixed and random effects models. Results: A total of 14 nsSNPs markers (8 for IL23R, 5 for ERAPl, 1 for LN-PEP) were analysed. Three markers (2 for IL23R and 1 for ERAP1) showed significant single-locus disease associations, confirming that the association of these genes with AS in the Portuguese population. The strongest associated SNP in IL23R was rs1004819 (OR=1.4, p=0.0049), and in ERAP1 was rs30187 (OR=1.26, p=0.035). The population attributable risk fractions in the Portuguese population for these SNPs are 11% and 9.7% respectively. No association was seen with any SNP in LN-PEP, which flanks ERAP1 and was associated with AS in the British population. No association was seen with clinical manifestations of AS. Conclusions: These results show that IL23R and ERAP1 genes are also associated with susceptibility to AS in the Portuguese population, and that they contribute a significant proportion of the population risk for this disease.
Resumo:
A novel combined near- and mid-infrared (NIR and MIR) spectroscopic method has been researched and developed for the analysis of complex substances such as the Traditional Chinese Medicine (TCM), Illicium verum Hook. F. (IVHF), and its noxious adulterant, Iuicium lanceolatum A.C. Smith (ILACS). Three types of spectral matrix were submitted for classification with the use of the linear discriminant analysis (LDA) method. The data were pretreated with either the successive projections algorithm (SPA) or the discrete wavelet transform (DWT) method. The SPA method performed somewhat better, principally because it required less spectral features for its pretreatment model. Thus, NIR or MIR matrix as well as the combined NIR/MIR one, were pretreated by the SPA method, and then analysed by LDA. This approach enabled the prediction and classification of the IVHF, ILACS and mixed samples. The MIR spectral data produced somewhat better classification rates than the NIR data. However, the best results were obtained from the combined NIR/MIR data matrix with 95–100% correct classifications for calibration, validation and prediction. Principal component analysis (PCA) of the three types of spectral data supported the results obtained with the LDA classification method.
Resumo:
A single-generation dataset consisting of 1,730 records from a selection program for high growth rate in giant freshwater prawn (GFP, Macrobrachium rosenbergii) was used to derive prediction equations for meat weight and meat yield. Models were based on body traits [body weight, total length and abdominal width (AW)] and carcass measurements (tail weight and exoskeleton-off weight). Lengths and width were adjusted for the systematic effects of selection line, male morphotypes and female reproductive status, and for the covariables of age at slaughter within sex and body weight. Body and meat weights adjusted for the same effects (except body weight) were used to calculate meat yield (expressed as percentage of tail weight/body weight and exoskeleton-off weight/body weight). The edible meat weight and yield in this GFP population ranged from 12 to 15 g and 37 to 45 %, respectively. The simple (Pearson) correlation coefficients between body traits (body weight, total length and AW) and meat weight were moderate to very high and positive (0.75–0.94), but the correlations between body traits and meat yield were negative (−0.47 to −0.74). There were strong linear positive relationships between measurements of body traits and meat weight, whereas relationships of body traits with meat yield were moderate and negative. Step-wise multiple regression analysis showed that the best model to predict meat weight included all body traits, with a coefficient of determination (R 2) of 0.99 and a correlation between observed and predicted values of meat weight of 0.99. The corresponding figures for meat yield were 0.91 and 0.95, respectively. Body weight or length was the best predictor of meat weight, explaining 91–94 % of observed variance when it was fitted alone in the model. By contrast, tail width explained a lower proportion (69–82 %) of total variance in the single trait models. It is concluded that in practical breeding programs, improvement of meat weight can be easily made through indirect selection for body trait combinations. The improvement of meat yield, albeit being more difficult, is possible by genetic means, with 91 % of the variation in the trait explained by the body and carcass traits examined in this study.
Resumo:
A novel near-infrared spectroscopy (NIRS) method has been researched and developed for the simultaneous analyses of the chemical components and associated properties of mint (Mentha haplocalyx Briq.) tea samples. The common analytes were: total polysaccharide content, total flavonoid content, total phenolic content, and total antioxidant activity. To resolve the NIRS data matrix for such analyses, least squares support vector machines was found to be the best chemometrics method for prediction, although it was closely followed by the radial basis function/partial least squares model. Interestingly, the commonly used partial least squares was unsatisfactory in this case. Additionally, principal component analysis and hierarchical cluster analysis were able to distinguish the mint samples according to their four geographical provinces of origin, and this was further facilitated with the use of the chemometrics classification methods-K-nearest neighbors, linear discriminant analysis, and partial least squares discriminant analysis. In general, given the potential savings with sampling and analysis time as well as with the costs of special analytical reagents required for the standard individual methods, NIRS offered a very attractive alternative for the simultaneous analysis of mint samples.
Resumo:
Background This paper examines changing patterns in the utilisation and geographic access to health services in Great Britain using National Travel Survey data (1985-2012). The National Travel Survey (NTS) is a series of household surveys designed to provide data on personal travel and monitor changes in travel behaviour over time. The utilisation rate was derived using the proportion of journeys made to access health services. Geographic access was analysed by separating the concept into its accessibility and mobility dimensions. Methods Variables from the PSU, households, and individuals datasets were used as explanatory variables. Whereas, variables extracted from the journeys dataset were used as dependent variables to identify patterns of utilisation i.e. the proportion of journeys made by different groups to access health facilities in a particular journey distance or time band or by mode of transport; and geographic access to health services. A binary logistic regression analysis was conducted to identify the utilisation rate over the different time periods between different groups. This analysis shows the Odds Ratios (ORs) for different groups making a trip to utilise health services compared to their respective counterparts. Linear multiple regression analyses were conducted to then identify patterns of change in the accessibility and mobility level. Results Analysis of the data has shown that that journey distances to health facilities were signi fi cantly shorter and also gradually reduced over the period in question for Londoners, females, those without a car or on low incomes, and older people. Although rates of utilisation of health services we re Oral Abstracts / Journal of Transport & Health 2 (2015) S5 – S63 S43 signi fi cantly lower because of longer journey times. These fi ndings indicate that the rate of utilisation of health services largely depends on mobility level although previous research studies have traditionally overlooked the mobility dimension. Conclusions This fi nding, therefore, suggests the need to improve geographic access to services together with an enhanced mobility option for disadvantaged groups in order for them to have improved levels of access to health facilities. This research has also found that the volume of car trips to health services also increased steadily over the period 1985-2012 while all other modes accounted for a smaller number of trips. However, it is dif fi cult to conclude from this research whether this increase in the volume of car trips was due to a lack of alternative transport or due to an increase in the level of car-ownership.