217 resultados para data structures
Resumo:
A combination of deductive reasoning, clustering, and inductive learning is given as an example of a hybrid system for exploratory data analysis. Visualization is replaced by a dialogue with the data.
Resumo:
This paper reports a comparative study of Australian and New Zealand leadership attributes, based on the GLOBE (Global Leadership and Organizational Behavior Effectiveness) program. Responses from 344 Australian managers and 184 New Zealand managers in three industries were analyzed using exploratory and confirmatory factor analysis. Results supported some of the etic leadership dimensions identified in the GLOBE study, but also found some emic dimensions of leadership for each country. An interesting finding of the study was that the New Zealand data fitted the Australian model, but not vice versa, suggesting asymmetric perceptions of leadership in the two countries.
Resumo:
In the context of cancer diagnosis and treatment, we consider the problem of constructing an accurate prediction rule on the basis of a relatively small number of tumor tissue samples of known type containing the expression data on very many (possibly thousands) genes. Recently, results have been presented in the literature suggesting that it is possible to construct a prediction rule from only a few genes such that it has a negligible prediction error rate. However, in these results the test error or the leave-one-out cross-validated error is calculated without allowance for the selection bias. There is no allowance because the rule is either tested on tissue samples that were used in the first instance to select the genes being used in the rule or because the cross-validation of the rule is not external to the selection process; that is, gene selection is not performed in training the rule at each stage of the cross-validation process. We describe how in practice the selection bias can be assessed and corrected for by either performing a cross-validation or applying the bootstrap external to the selection process. We recommend using 10-fold rather than leave-one-out cross-validation, and concerning the bootstrap, we suggest using the so-called. 632+ bootstrap error estimate designed to handle overfitted prediction rules. Using two published data sets, we demonstrate that when correction is made for the selection bias, the cross-validated error is no longer zero for a subset of only a few genes.
Resumo:
A hydraulic jump is the transition from a supercritical open channel flow to a subcritical regime. It is characterised by a highly turbulent flow with macro-scale vortices, some kinetic energy dissipation and a bubbly two-phase flow structure. New air-water flow measurements were performed in hydraulic jump flows for a range of inflow Froude numbers. The experiments were conducted in a large-size facility using two types of phase-detection intrusive probes: i.e., single-tip and double-tip conductivity probes. These were complemented by some measurements of free-surface fluctuations using ultrasonic displacement meters. The present study was focused on the turbulence characteristics of hydraulic jumps with partially-developed inflow conditions. The void fraction measurements showed the presence of an advective diffusion shear layer in which the void fractions profiles matched closely an analytical solution of the advective diffusion equation for air bubbles. The present results highlighted some influence of the inflow Froude number onto the air bubble entrainment process. At the largest Froude numbers, the advected air bubbles were more thoroughly dispersed vertically, and larger amount of air bubbles were detected in the turbulent shear layer. In the air-water mixing layer, the maximum void fraction and bubble count rate data showed some longitudinal decay function in the flow direction. Such trends were previously reported in the literature. The measurements of interfacial velocity and turbulence level distributions provided new information on the turbulent velocity field in the highly-aerated shear region. The present data suggested some longitudinal decay of the turbulence intensity. The velocity profiles tended to follow a wall jet flow pattern. The air–water turbulent time and length scales were deduced from some auto- and cross-correlation analyses based upon the method of CHANSON (2006,2007). The results provided the integral turbulent time and length scales of the eddy structures advecting the air bubbles in the developing shear layer. The experimental data showed that the auto-correlation time scale Txx was larger than the transverse cross-correlation time scale Txz. The integral turbulence length scale Lxz was a function of the inflow conditions, of the streamwise position (x-x1)/d1 and vertical elevation y/d1. Herein the dimensionless integral turbulent length scale Lxz/d1 was closely related to the inflow depth: i.e., Lxz/d1 = 0.2 to 0.8, with Lxz increasing towards the free-surface. The free-surface fluctuations measurements showed large turbulent fluctuations that reflected the dynamic, unsteady structure of the hydraulic jumps. A linear relationship was found between the normalized maximum free-surface fluctuation and the inflow Froude number.
Resumo:
Data mining is the process to identify valid, implicit, previously unknown, potentially useful and understandable information from large databases. It is an important step in the process of knowledge discovery in databases, (Olaru & Wehenkel, 1999). In a data mining process, input data can be structured, seme-structured, or unstructured. Data can be in text, categorical or numerical values. One of the important characteristics of data mining is its ability to deal data with large volume, distributed, time variant, noisy, and high dimensionality. A large number of data mining algorithms have been developed for different applications. For example, association rules mining can be useful for market basket problems, clustering algorithms can be used to discover trends in unsupervised learning problems, classification algorithms can be applied in decision-making problems, and sequential and time series mining algorithms can be used in predicting events, fault detection, and other supervised learning problems (Vapnik, 1999). Classification is among the most important tasks in the data mining, particularly for data mining applications into engineering fields. Together with regression, classification is mainly for predictive modelling. So far, there have been a number of classification algorithms in practice. According to (Sebastiani, 2002), the main classification algorithms can be categorized as: decision tree and rule based approach such as C4.5 (Quinlan, 1996); probability methods such as Bayesian classifier (Lewis, 1998); on-line methods such as Winnow (Littlestone, 1988) and CVFDT (Hulten 2001), neural networks methods (Rumelhart, Hinton & Wiliams, 1986); example-based methods such as k-nearest neighbors (Duda & Hart, 1973), and SVM (Cortes & Vapnik, 1995). Other important techniques for classification tasks include Associative Classification (Liu et al, 1998) and Ensemble Classification (Tumer, 1996).
Resumo:
There are many techniques for electricity market price forecasting. However, most of them are designed for expected price analysis rather than price spike forecasting. An effective method of predicting the occurrence of spikes has not yet been observed in the literature so far. In this paper, a data mining based approach is presented to give a reliable forecast of the occurrence of price spikes. Combined with the spike value prediction techniques developed by the same authors, the proposed approach aims at providing a comprehensive tool for price spike forecasting. In this paper, feature selection techniques are firstly described to identify the attributes relevant to the occurrence of spikes. A simple introduction to the classification techniques is given for completeness. Two algorithms: support vector machine and probability classifier are chosen to be the spike occurrence predictors and are discussed in details. Realistic market data are used to test the proposed model with promising results.
Resumo:
Background: This study used household survey data on the prevalence of child, parent and family variables to establish potential targets for a population-level intervention to strengthen parenting skills in the community. The goals of the intervention include decreasing child conduct problems, increasing parental self-efficacy, use of positive parenting strategies, decreasing coercive parenting and increasing help-seeking, social support and participation in positive parenting programmes. Methods: A total of 4010 parents with a child under the age of 12 years completed a statewide telephone survey on parenting. Results: One in three parents reported that their child had a behavioural or emotional problem in the previous 6 months. Furthermore, 9% of children aged 2–12 years meet criteria for oppositional defiant disorder. Parents who reported their child's behaviour to be difficult were more likely to perceive parenting as a negative experience (i.e. demanding, stressful and depressing). Parents with greatest difficulties were mothers without partners and who had low levels of confidence in their parenting roles. About 20% of parents reported being stressed and 5% reported being depressed in the 2 weeks prior to the survey. Parents with personal adjustment problems had lower levels of parenting confidence and their child was more difficult to manage. Only one in four parents had participated in a parent education programme. Conclusions: Implications for the setting of population-level goals and targets for strengthening parenting skills are discussed.
Resumo:
The solution structure of robustoxin, the lethal neurotoxin from the Sydney funnel-web spider Atrax robustus, has been determined from 2D H-1 NMR data, Robustoxin is a polypeptide of 42 residues cross-linked by four disulphide bonds, the connectivities of which were determined from NMR data and trial structure calculations to be 1-15, 8-20, 14-31 and 16-42 (a 1-4/2-6/3-7/5-8 pattern), The structure consists of a small three-stranded, anti-parallel beta-sheet and a series of interlocking gamma-turns at the C-terminus. It also contains a cystine knot, thus placing it in the inhibitor cystine knot motif family of structures, which includes the omega-conotoxins and a number of plant and animal toxins and protease inhibitors. Robustoxin contains three distinct charged patches on its surface, and an extended loop that includes several aromatic and non-polar residues, Both of these structural features may play a role in its binding to the voltage-gated sodium channel. (C) 1997 Federation of European Biochemical Societies.
Resumo:
This paper discusses a multi-layer feedforward (MLF) neural network incident detection model that was developed and evaluated using field data. In contrast to published neural network incident detection models which relied on simulated or limited field data for model development and testing, the model described in this paper was trained and tested on a real-world data set of 100 incidents. The model uses speed, flow and occupancy data measured at dual stations, averaged across all lanes and only from time interval t. The off-line performance of the model is reported under both incident and non-incident conditions. The incident detection performance of the model is reported based on a validation-test data set of 40 incidents that were independent of the 60 incidents used for training. The false alarm rates of the model are evaluated based on non-incident data that were collected from a freeway section which was video-taped for a period of 33 days. A comparative evaluation between the neural network model and the incident detection model in operation on Melbourne's freeways is also presented. The results of the comparative performance evaluation clearly demonstrate the substantial improvement in incident detection performance obtained by the neural network model. The paper also presents additional results that demonstrate how improvements in model performance can be achieved using variable decision thresholds. Finally, the model's fault-tolerance under conditions of corrupt or missing data is investigated and the impact of loop detector failure/malfunction on the performance of the trained model is evaluated and discussed. The results presented in this paper provide a comprehensive evaluation of the developed model and confirm that neural network models can provide fast and reliable incident detection on freeways. (C) 1997 Elsevier Science Ltd. All rights reserved.
Resumo:
Background: kappa-PVIIA is a 27-residue polypeptide isolated from the venom of Conus purpurascens and is the first member of a new class of conotoxins that block potassium channels. By comparison to other ion channels of eukaryotic cell membranes, voltage-sensitive potassium channels are relatively simple and methodology has been developed for mapping their interactions with small-peptide toxins, PVIIA, therefore, is a valuable new probe of potassium channel structure. This study of the solution structure and mode of channel binding of PVIIA forms the basis for mapping the interacting residues at the conotoxin-ion channel interface. Results: The three-dimensional structure of PVIIA resembles the triple-stranded beta sheet/cystine-knot motif formed by a number of toxic and inhibitory peptides. Subtle structural differences, predominantly in loops 2 and 4, are observed between PVIIA and other conotoxins with similar structural frameworks, however. Electrophysiological binding data suggest that PVIIA blocks channel currents by binding in a voltage-sensitive manner to the external vestibule and occluding the pore, Comparison of the electrostatic surface of PVIIA with that of the well-characterised potassium channel blocker charybdotoxin suggests a likely binding orientation for PVIIA, Conclusions: Although the structure of PVIIA is considerably different to that of the alpha K scorpion toxins, it has a similar mechanism of channel blockade. On the basis of a comparison of the structures of PVIIA and charybdotoxin, we suggest that Lys19 of PVIIA is the residue which is responsible for physically occluding the pore of the potassium channel.
Resumo:
Multifrequency bioimpedance analysis has the potential to provide a non-invasive technique for determining body composition in live cattle. A bioimpedance meter developed for use in clinical medicine was adapted and evaluated in 2 experiments using a total of 31 cattle. Prediction equations were obtained for total body water, extracellular body water, intracellular body water, carcass water and carcass protein. There were strong correlations between the results obtained through chemical markers and bioimpedance analysis when determined in cattle that had a wide range of liveweights and conditions. The r(2) values obtained were 0.87 and 0.91 for total body water and extracellular body water respectively. Bioimpedance also correlated with carcass water, measured by chemical analysis (r(2) = 0.72), but less well with carcass protein (r(2) = 0.46). These correlations were improved by inclusion of liveweight and sex as variables in multiple regression analysis. However, the resultant equations were poor predictors of protein and water content in the carcasses of a group of small underfed beef cattle, that had a narrow range of liveweights. In this case, although there was no statistical difference between the predicted and measured values overall, bioimpedance analysis did not detect the differences in carcass protein between the 2 groups that were apparent following chemical analysis. Further work is required to determine the sensitivity of the technique in small underfed cattle, and its potential use in heavier well fed cattle close to slaughter weight.
Resumo:
A gas product analysis has been conducted on gamma-irradiated samples of poly(lactic acid) (PLA) and poly(glycolic acid) (PGA) by means of gas chromatography. The major volatile products have been identified to be CO, CO2, CH4 and C2H6 for PLA, and CO and CO2 for PGA. In addition, the yield of evolved gases for PLA has been found to be 1.81 for CO2, 0.98 for CO, 0.026 for CH4 and 0.012 for C2H6; and that for PGA to be 1.70 for CO2 and 0.42 for CO. The new chain ends formed due to gamma-induced bond cleavage in PLA have been assigned to CH3-CH2-CO-O- and CH3-CH2-O-CO-, and the G values for formation of these chain ends were found to be 1.9 and 0.6, respectively. The G value for chain scission reported previously of 2.3 is comparable with that for the formation of the propanoic acid end group. (C) 1997 Elsevier Science Limited.
Resumo:
The otoliths and lenses of the temperate damselfish Parma microlepis (Gunther) (Pomacentridae) showed similar differences in trace-metal profile for selected locations along the coast of New South Wales, Australia. Otoliths and lenses displayed a differential ability to accumulate metals. Metal concentrations were ranked differently in the two structures (e.g. Sr > Ba > Pb > Rb > Hg in otoliths, and Hg > Sr similar or equal to Rb > Pb > Ba in lenses), and where similar metals were accumulated, they were accumulated at vastly different concentrations (e.g. Ba concentrations in otoliths are a thousand-fold greater than in lenses). Analyses of the otoliths and lenses of P. microlepis from locations close to Sydney and up to 100 kill from the city were able to distinguish amongst these locations with respect to a number of metals, namely Ba, Mn and Hg. Multivariate analyses of otolith and lens data gave similar results among locations (agreement was obtained for Ii out of 15 pair-wise comparisons), and differences were attributable to the differential ability of the two structures to accumulate metals such as Mn and Hg. Trace-metal differences between locations were found to coincide with the proximity of sewage (including industrial waste) and petroleum storage facilities to the different locations.