249 resultados para multivariate regression tree
Resumo:
"Two more bodies, including a that of child discovered in a tree, were retrieved in the Lockyer Valley at the weekend, reinforcing the grisly complexity of the search for the missing."
Resumo:
To enhance the efficiency of regression parameter estimation by modeling the correlation structure of correlated binary error terms in quantile regression with repeated measurements, we propose a Gaussian pseudolikelihood approach for estimating correlation parameters and selecting the most appropriate working correlation matrix simultaneously. The induced smoothing method is applied to estimate the covariance of the regression parameter estimates, which can bypass density estimation of the errors. Extensive numerical studies indicate that the proposed method performs well in selecting an accurate correlation structure and improving regression parameter estimation efficiency. The proposed method is further illustrated by analyzing a dental dataset.
Resumo:
In the Bayesian framework a standard approach to model criticism is to compare some function of the observed data to a reference predictive distribution. The result of the comparison can be summarized in the form of a p-value, and it's well known that computation of some kinds of Bayesian predictive p-values can be challenging. The use of regression adjustment approximate Bayesian computation (ABC) methods is explored for this task. Two problems are considered. The first is the calibration of posterior predictive p-values so that they are uniformly distributed under some reference distribution for the data. Computation is difficult because the calibration process requires repeated approximation of the posterior for different data sets under the reference distribution. The second problem considered is approximation of distributions of prior predictive p-values for the purpose of choosing weakly informative priors in the case where the model checking statistic is expensive to compute. Here the computation is difficult because of the need to repeatedly sample from a prior predictive distribution for different values of a prior hyperparameter. In both these problems we argue that high accuracy in the computations is not required, which makes fast approximations such as regression adjustment ABC very useful. We illustrate our methods with several samples.
Resumo:
Objectives Directly measuring disease incidence in a population is difficult and not feasible to do routinely. We describe the development and application of a new method of estimating at a population level the number of incident genital chlamydia infections, and the corresponding incidence rates, by age and sex using routine surveillance data. Methods A Bayesian statistical approach was developed to calibrate the parameters of a decision-pathway tree against national data on numbers of notifications and tests conducted (2001-2013). Independent beta probability density functions were adopted for priors on the time-independent parameters; the shape parameters of these beta distributions were chosen to match prior estimates sourced from peer-reviewed literature or expert opinion. To best facilitate the calibration, multivariate Gaussian priors on (the logistic transforms of) the time-dependent parameters were adopted, using the Matérn covariance function to favour changes over consecutive years and across adjacent age cohorts. The model outcomes were validated by comparing them with other independent empirical epidemiological measures i.e. prevalence and incidence as reported by other studies. Results Model-based estimates suggest that the total number of people acquiring chlamydia per year in Australia has increased by ~120% over 12 years. Nationally, an estimated 356,000 people acquired chlamydia in 2013, which is 4.3 times the number of reported diagnoses. This corresponded to a chlamydia annual incidence estimate of 1.54% in 2013, increased from 0.81% in 2001 (~90% increase). Conclusions We developed a statistical method which uses routine surveillance (notifications and testing) data to produce estimates of the extent and trends in chlamydia incidence.
Resumo:
The Galilee and Eromanga basins are sub-basins of the Great Artesian Basin (GAB). In this study, a multivariate statistical approach (hierarchical cluster analysis, principal component analysis and factor analysis) is carried out to identify hydrochemical patterns and assess the processes that control hydrochemical evolution within key aquifers of the GAB in these basins. The results of the hydrochemical assessment are integrated into a 3D geological model (previously developed) to support the analysis of spatial patterns of hydrochemistry, and to identify the hydrochemical and hydrological processes that control hydrochemical variability. In this area of the GAB, the hydrochemical evolution of groundwater is dominated by evapotranspiration near the recharge area resulting in a dominance of the Na–Cl water types. This is shown conceptually using two selected cross-sections which represent discrete groundwater flow paths from the recharge areas to the deeper parts of the basins. With increasing distance from the recharge area, a shift towards a dominance of carbonate (e.g. Na–HCO3 water type) has been observed. The assessment of hydrochemical changes along groundwater flow paths highlights how aquifers are separated in some areas, and how mixing between groundwater from different aquifers occurs elsewhere controlled by geological structures, including between GAB aquifers and coal bearing strata of the Galilee Basin. The results of this study suggest that distinct hydrochemical differences can be observed within the previously defined Early Cretaceous–Jurassic aquifer sequence of the GAB. A revision of the two previously recognised hydrochemical sequences is being proposed, resulting in three hydrochemical sequences based on systematic differences in hydrochemistry, salinity and dominant hydrochemical processes. The integrated approach presented in this study which combines different complementary multivariate statistical techniques with a detailed assessment of the geological framework of these sedimentary basins, can be adopted in other complex multi-aquifer systems to assess hydrochemical evolution and its geological controls.
Resumo:
Background Helicobacter pylori (HP) is associated with chronic gastritis and gastric cancer, and more than half of the world’s population is chronically infected. The aim of this retrospective study was to investigate whether an irregular meal pattern is associated with increased risk of gastritis and HP infection. Methods The study involved 323 subjects, divided into three groups: subjects with HP infection and gastritis, with gastritis, and a control group. Subjects were interviewed on eating habits and meal timing. Multivariate logistic regression was used to compare groups. Adjusted odds ratios (OR) were derived controlling for gender, age, stress and probiotic consumption. Results Subjects who deviated from their regular meals by 2 hours or more had a significantly higher incidence of HP infection with gastritis (adjusted OR= 13.3, 95% CI 5.3–33.3, p<0.001) and gastritis (adjusted OR=6.1, 95% CI 2.5–15.0, p<0.001). Subjects who deviated their meals by 2 hours or more, twice or more per week, had an adjusted OR of 6.3 and 3.5 of acquiring HP infection with gastritis (95% CI 2.6–15.2, p<0.001) and gastritis (95% CI 1.5–8.5, p<0.001) respectively. Conclusion Frequent deviation in meal timing over a prolonged period appears associated with increased risk of developing HP infection and gastritis.
Resumo:
The effects of reductions in cell wall lignin content, manifested by RNA interference suppression of coumaroyl 3'-hydroxylase, on plant growth, water transport, gas exchange, and photosynthesis were evaluated in hybrid poplar trees (Populus alba 3 grandidentata). The growth characteristics of the reduced lignin trees were significantly impaired, resulting in smaller stems and reduced root biomass when compared to wild-type trees, as well as altered leaf morphology and architecture. The severe inhibition of cell wall lignification produced trees with a collapsed xylem phenotype, resulting in compromised vascular integrity, and displayed reduced hydraulic conductivity and a greater susceptibility to wall failure and cavitation. In the reduced lignin trees, photosynthetic carbon assimilation and stomatal conductance were also greatly reduced, however, shoot xylem pressure potential and carbon isotope discrimination were higher and water-use efficiency was lower, inconsistent with water stress. Reductions in assimilation rate could not be ascribed to increased stomatal limitation. Starch and soluble sugars analysis of leaves revealed that photosynthate was accumulating to high levels, suggesting that the trees with substantially reduced cell wall lignin were not carbon limited and that reductions in sink strength were, instead, limiting photosynthesis.
Resumo:
There is a concern that high densities of elephants in southern Africa could lead to the overall reduction of other forms of biodiversity. We present a grid-based model of elephant-savanna dynamics, which differs from previous elephant-vegetation models by accounting for woody plant demographics, tree-grass interactions, stochastic environmental variables (fire and rainfall), and spatial contagion of fire and tree recruitment. The model projects changes in height structure and spatial pattern of trees over periods of centuries. The vegetation component of the model produces long-term tree-grass coexistence, and the emergent fire frequencies match those reported for southern African savannas. Including elephants in the savanna model had the expected effect of reducing woody plant cover, mainly via increased adult tree mortality, although at an elephant density of 1.0 elephant/km2, woody plants still persisted for over a century. We tested three different scenarios in addition to our default assumptions. (1) Reducing mortality of adult trees after elephant use, mimicking a more browsing-tolerant tree species, mitigated the detrimental effect of elephants on the woody population. (2) Coupling germination success (increased seedling recruitment) to elephant browsing further increased tree persistence, and (3) a faster growing woody component allowed some woody plant persistence for at least a century at a density of 3 elephants/km2. Quantitative models of the kind presented here provide a valuable tool for exploring the consequences of management decisions involving the manipulation of elephant population densities. © 2005 by the Ecological Society of America.
Resumo:
Background Some patients visit a hospital’s emergency department (ED) for reasons other than an urgent medical condition. There is evidence that this practice may differ among patients from different backgrounds. The objective of this study was to examine the reasons why patients from a non-English speaking background (NESB) and patients with an English speaking background but not born in Australia (ESB-NBA) visit the ED, as compared to patients from English-speaking backgrounds but born in Australia (ESB-BA). Methods A cross-sectional survey was conducted at the ED of a tertiary hospital in metropolitan Brisbane, Queensland, Australia. Over a four-month period patients who were assigned an Australasian Triage Scale score of 3, 4 or 5 were surveyed. Pearson chi-square test and multivariate logistic regression analyses were performed to examine the differences between the ESB and NESB patients’ reported reasons for attending the ED. Results A total of 828 patients participated in this study. Compared to ESB-BA patients NESB patients were less likely to consider contacting a general practitioner (GP) before attending the ED (Odds Ratios (OR) 0.6 (95% Confidence Interval (CI) 0.4–0.8, p < .05) While ESB-NBA were more likely to consider contacting a GP 1.7 (1.1–2.5, p < .05). Both the NESB patients and the ESB-NBA patients were far more likely than ESB-BA patients to report that they had visited the ED either because they do not have a GP (OR 7.9, 95% CI 4.7–13.4, p < .001) and 2.2 (95% CI 1.1–4.4, p < .05) respectively and less likely to think that the ED could deal with their problem better than a GP(OR 0.5 (95% CI 0.3–0.8, p < .05) and 0.7 (0.3–0.9, p < .05) respectively. The NESB patients also thought it would take too long to make an appointment to consult a GP (OR 6.2, 95% CI 3.7–10.4, p < 0.001). Conclusions NESB patients were the least likely to consider contacting a GP before attending hospital EDs. Educational interventions may help direct NESB people to the appropriate health services and therefore reduce the burden on tertiary hospitals ED.
Resumo:
This paper presents an efficient noniterative method for distribution state estimation using conditional multivariate complex Gaussian distribution (CMCGD). In the proposed method, the mean and standard deviation (SD) of the state variables is obtained in one step considering load uncertainties, measurement errors, and load correlations. In this method, first the bus voltages, branch currents, and injection currents are represented by MCGD using direct load flow and a linear transformation. Then, the mean and SD of bus voltages, or other states, are calculated using CMCGD and estimation of variance method. The mean and SD of pseudo measurements, as well as spatial correlations between pseudo measurements, are modeled based on the historical data for different levels of load duration curve. The proposed method can handle load uncertainties without using time-consuming approaches such as Monte Carlo. Simulation results of two case studies, six-bus, and a realistic 747-bus distribution network show the effectiveness of the proposed method in terms of speed, accuracy, and quality against the conventional approach.
Resumo:
The proliferation of the web presents an unsolved problem of automatically analyzing billions of pages of natural language. We introduce a scalable algorithm that clusters hundreds of millions of web pages into hundreds of thousands of clusters. It does this on a single mid-range machine using efficient algorithms and compressed document representations. It is applied to two web-scale crawls covering tens of terabytes. ClueWeb09 and ClueWeb12 contain 500 and 733 million web pages and were clustered into 500,000 to 700,000 clusters. To the best of our knowledge, such fine grained clustering has not been previously demonstrated. Previous approaches clustered a sample that limits the maximum number of discoverable clusters. The proposed EM-tree algorithm uses the entire collection in clustering and produces several orders of magnitude more clusters than the existing algorithms. Fine grained clustering is necessary for meaningful clustering in massive collections where the number of distinct topics grows linearly with collection size. These fine-grained clusters show an improved cluster quality when assessed with two novel evaluations using ad hoc search relevance judgments and spam classifications for external validation. These evaluations solve the problem of assessing the quality of clusters where categorical labeling is unavailable and unfeasible.
Resumo:
Objective To identify the prevalence of and risk factors for inadvertent hypothermia after procedures performed with procedural sedation and analgesia in a cardiac catheterisation laboratory. Design Single-centre, prospective observational study. Setting Tertiary care private hospital in Australia. Participants A convenience sample of 399 patients undergoing elective procedures with procedural sedation and analgesia were included. Propofol infusions were used when an anaesthetist was present. Otherwise, bolus doses of either midazolam or fentanyl or a combination of these medications was used. Interventions None Measurements and main results Hypothermia was defined as a temperature <36.0° Celsius. Multivariate logistic regression was used to identify risk factors. Hypothermia was present after 23.3% (n=93; 95% confidence interval [CI] 19.2%-27.4%) of 399 procedures. Sedative regimens with the highest prevalence of hypothermia were any regimen that included propofol (n=35; 40.2%; 95% CI 29.9%-50.5%) and the use of fentanyl combined with midazolam (n=23; 20.3%; 95% CI 12.9%-27.7%). Difference in mean temperature from pre to post-procedure was -0.27°C (Standard deviation [SD] 0.45). Receiving propofol (odds ratio [OR] OR 4.6 95% CI 2.5-8.6), percutaneous coronary intervention (OR 3.2 95% CI 1.7-5.9), body mass index <25 (OR 2.5 95% CI 1.4-4.4) and being hypothermic prior to the procedure (OR 4.9; 95% CI 2.3-10.8) were independent predictors of post-procedural hypothermia. Conclusions A moderate prevalence of hypothermia was observed. The small absolute change in temperature observed may not be a clinically important amount. More research is needed to increase confidence in our estimates of hypothermia in sedated patients and its impact on clinical outcomes.
Resumo:
Monte-Carlo Tree Search (MCTS) is a heuristic to search in large trees. We apply it to argumentative puzzles where MCTS pursues the best argumentation with respect to a set of arguments to be argued. To make our ideas as widely applicable as possible, we integrate MCTS to an abstract setting for argumentation where the content of arguments is left unspecified. Experimental results show the pertinence of this integration for learning argumentations by comparing it with a basic reinforcement learning.