922 results for Data Streams Distribution
Abstract:
Water quality data are often collected at different sites over time to improve water quality management. Water quality data usually exhibit the following characteristics: non-normal distribution, presence of outliers, missing values, values below detection limits (censored), and serial dependence. It is essential to apply appropriate statistical methodology when analyzing water quality data to draw valid conclusions and hence provide useful advice in water management. In this chapter, we provide and demonstrate various statistical tools for analyzing such water quality data, and introduce how to use the statistical software R to analyze water quality data by various statistical methods. A dataset collected from the Susquehanna River Basin is used to demonstrate the statistical methods provided in this chapter. The dataset can be downloaded from the website http://www.srbc.net/programs/CBP/nutrientprogram.htm.
Abstract:
Coptotermes Wasmann (Isoptera: Rhinotermitidae) is one of the most economically important subterranean termite genera and some species are successful invaders. However, despite its important pest status, the taxonomic validity of many named Coptotermes species remains unclear. In this study, we reviewed all named species within the genus and investigated evidence supporting the validity of each named species. Species were systematically scrutinized according to the region of their original description: Southeast Asia, India, China, Africa, the Neotropics, and Australia. We estimate that of the currently 69 named species described by accepted nomenclatural rules, only 21 taxa have solid evidence for validity, 44 names have uncertain status, and the remaining species names should be synonymized or were made unavailable. Species with high degrees of invasiveness may be known under additional junior synonyms due to independent parochial descriptions. Molecular data for a vast majority of species are scarce and significant effort is needed to complete the taxonomic and phylogenetic revision of the genus. Because of the wide distribution of Coptotermes, we advocate for an integrative taxonomic effort to establish the distribution of each putative species, provide specimens and corresponding molecular data, check original descriptions and type specimens (if available), and provide evidence for a more robust phylogenetic position of each species. This study embodies both consensus and contention of those studying Coptotermes and thus pinpoints the current uncertainty of many species. This project is intended to be a roadmap for identifying those Coptotermes species names that need to be more thoroughly investigated, as an incentive to complete a necessary revision process.
Abstract:
The problem of scheduling divisible loads in distributed computing systems in the presence of processor release times is considered. The objective is to find the optimal sequence of load distribution and the optimal load fractions assigned to each processor in the system such that the processing time of the entire processing load is a minimum. This is a difficult combinatorial optimization problem, and hence a genetic algorithm approach is presented for its solution.
Abstract:
Automatic identification of software faults has enormous practical significance. This requires characterizing program execution behavior and the use of appropriate data mining techniques on the chosen representation. In this paper, we use the sequence of system calls to characterize program execution. The data mining tasks addressed are learning to map system call streams to fault labels and automatic identification of fault causes. Spectrum kernels and SVM are used for the former, while latent semantic analysis is used for the latter. The techniques are demonstrated for the intrusion dataset containing system call traces. The results show that kernel techniques are as accurate as the best available results but are faster by orders of magnitude. We also show that latent semantic indexing is capable of revealing fault-specific features.
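The spectrum kernel mentioned in the abstract above can be illustrated with a minimal sketch. It assumes a trace is a list of system call names and that the kernel is the inner product of k-mer count vectors; the function names, the default k, and the cosine normalization are illustrative assumptions, not details taken from the paper.

```python
from collections import Counter
from math import sqrt


def spectrum_features(calls, k=3):
    """Count every contiguous length-k subsequence (k-mer) in a system call trace."""
    return Counter(tuple(calls[i:i + k]) for i in range(len(calls) - k + 1))


def spectrum_kernel(trace_a, trace_b, k=3, normalize=True):
    """k-spectrum kernel: inner product of the k-mer count vectors of two traces.

    With normalize=True the value is divided by the product of the two feature
    norms (cosine normalization), an assumption made here for readability.
    """
    fa = spectrum_features(trace_a, k)
    fb = spectrum_features(trace_b, k)
    # Counter returns 0 for k-mers that are absent from the other trace.
    dot = sum(count * fb[kmer] for kmer, count in fa.items())
    if not normalize:
        return float(dot)
    norm = sqrt(sum(c * c for c in fa.values()) * sum(c * c for c in fb.values()))
    return dot / norm if norm else 0.0


# Hypothetical traces, used only to show the call pattern.
trace_a = ["open", "read", "write", "close"]
trace_b = ["open", "read", "write", "read"]
print(spectrum_kernel(trace_a, trace_b, k=2, normalize=False))
```

A matrix of such kernel values between all pairs of traces could then be passed to any SVM implementation that accepts a precomputed kernel.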
Abstract:
We present a measurement of the transverse momentum with respect to the jet axis (kT) of particles in jets produced in pp̅ collisions at √s = 1.96 TeV. Results are obtained for charged particles in a cone of 0.5 radians around the jet axis in events with dijet invariant masses between 66 and 737 GeV/c². The experimental data are compared to theoretical predictions obtained for fragmentation partons within the framework of resummed perturbative QCD using the modified leading log and next-to-modified leading log approximations. The comparison shows that trends in data are successfully described by the theoretical predictions, indicating that the perturbative QCD stage of jet fragmentation is dominant in shaping basic jet characteristics.
Abstract:
We present a measurement of the transverse momentum with respect to the jet axis ($k_{T}$) of particles in jets produced in $p\bar p$ collisions at $\sqrt{s}=1.96$ TeV. Results are obtained for charged particles within a cone of opening angle 0.5 radians around the jet axis in events with dijet invariant masses between 66 and 737 GeV/c$^{2}$. The experimental data are compared to theoretical predictions obtained for fragmentation partons within the framework of resummed perturbative QCD using the modified leading log and next-to-modified leading log approximations. The comparison shows that trends in data are successfully described by the theoretical predictions, indicating that the perturbative QCD stage of jet fragmentation is dominant in shaping basic jet characteristics.
Abstract:
The aim of the current study is to examine the influence of the channel external environment on power, and the effect of power on the distribution network structure within the People’s Republic of China. Throughout the study a dual research process was applied. The theory was constructed by elaborating the main theoretical premises of the study (the channel power theories, the political economy framework and the distribution network structure), and these marketing channel concepts were expanded with perspectives from other disciplines. The main method applied was a survey conducted among 164 Chinese retailers, complemented by interviews, photographs, observations and census data from the field. This multi-method approach made it possible not only to validate and triangulate the quantitative results, but also to uncover serendipitous findings. The theoretical contribution of the current study to the theory of marketing channel power is the different view it takes on power. First, earlier power studies have taken the producer perspective, whereas the current study also brings a distributor perspective to the discussion. Second, many power studies have dealt with strongly dependent relationships, whereas the current study examines loosely dependent relationships. Power is dependent on unequal distribution of resources rather than based on high dependency. The benefit of this view is in realising that power resources and power strategies are separate concepts. The empirical material of the current study confirmed that at least some resources were significantly related to power strategies. The study showed that the resource dimension composed of technology, know-how and knowledge, managerial freedom and reputation was significantly related to non-coercive power. Third, the notion of different outcomes of power is a contribution of this study to the channel power theory, even though it was not confirmed by the empirical results.
Fourth, it was proposed that channel external environment other than the resources would also contribute to the channel power. These propositions were partially supported thus providing only partial contribution to the channel power theory. Finally, power was equally distributed among the different types of actors. The findings from the qualitative data suggest that different types of retailers can be classified according to the meaning the actors put into their business. Some are more business oriented, for others retailing is the only way to earn a living. The findings also suggest that in some actors both retailing and wholesaling functions emerge, and this has implications for the marketing channels structure.
Abstract:
The problem of detecting an unknown transient signal in noise is considered. The SNR of the observed data is first enhanced using a wavelet domain filter. The output of the wavelet domain filter is then transformed using a Wigner-Ville transform, which separates the spectrum of the observed signal into narrow frequency bands. Each subband signal at the output of the Wigner-Ville block is subjected to wavelet based level dependent denoising (WBLDD) to suppress colored noise. A weighted sum of the absolute values of the WBLDD outputs is passed through an energy detector, whose output is used as the test statistic for the final decision. By assigning weights proportional to the energy of the corresponding subband signals, the proposed detector approximates a frequency domain matched filter. Simulation results are presented to show that the performance of the proposed detector is better than that of the wavelet packet transform based detector.
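The final stage described in the abstract above (energy-proportional weights, a weighted sum of absolute subband outputs, then an energy detector) can be sketched as follows. This is only an illustration of that last step under stated assumptions: the subband signals are taken as already denoised, the weights are normalized to sum to one, and the threshold is a free parameter. The function names are hypothetical, and the wavelet filtering, Wigner-Ville and WBLDD stages are not reproduced.

```python
def subband_weights(subbands):
    """Weights proportional to the energy of each subband (normalized to sum to 1)."""
    energies = [sum(x * x for x in band) for band in subbands]
    total = sum(energies)
    if total == 0:
        return [0.0] * len(subbands)
    return [e / total for e in energies]


def detection_statistic(subbands):
    """Energy of the weighted sum of absolute subband values."""
    weights = subband_weights(subbands)
    n_samples = len(subbands[0])
    combined = [
        sum(w * abs(band[i]) for w, band in zip(weights, subbands))
        for i in range(n_samples)
    ]
    return sum(c * c for c in combined)


def detect(subbands, threshold):
    """Declare a transient present when the statistic exceeds the threshold."""
    return detection_statistic(subbands) > threshold


# Hypothetical denoised subband outputs (two bands, two samples each).
bands = [[3.0, -3.0], [1.0, 1.0]]
print(detection_statistic(bands), detect(bands, threshold=10.0))
```

In practice the threshold would be set from a target false-alarm rate, which the abstract does not specify.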
Abstract:
We propose a novel second order cone programming formulation for designing robust classifiers which can handle uncertainty in observations. Similar formulations are also derived for designing regression functions which are robust to uncertainties in the regression setting. The proposed formulations are independent of the underlying distribution, requiring only the existence of second order moments. These formulations are then specialized to the case of missing values in observations for both classification and regression problems. Experiments show that the proposed formulations outperform imputation.
Abstract:
Deterministic models have been widely used to predict water quality in distribution systems, but their calibration requires extensive and accurate data sets for numerous parameters. In this study, alternative data-driven modeling approaches based on artificial neural networks (ANNs) were used to predict temporal variations of two important characteristics of water quality: chlorine residual and biomass concentrations. The authors considered three types of ANN algorithms. Of these, the Levenberg-Marquardt algorithm provided the best results in predicting residual chlorine and biomass with error-free and "noisy" data. The ANN models developed here can generate water quality scenarios of piped systems in real time to help utilities determine weak points of low chlorine residual and high biomass concentration and select optimum remedial strategies.
Abstract:
The torsional potential functions Vt(phi) and Vt(psi) around single bonds N--C alpha and C alpha--C, which can be used in conformational studies of oligopeptides, polypeptides and proteins, have been derived, using crystal structure data of 22 globular proteins, fitting the observed distribution in the (phi, psi)-plane with the value of Vtot(phi, psi), using the Boltzmann distribution. The averaged torsional potential functions, obtained from various amino acid residues in L-configuration, are Vt(phi) = -1.0 cos (phi + 60 degrees); Vt(psi) = -0.5 cos (psi + 60 degrees) - 1.0 cos (2 psi + 30 degrees) - 0.5 cos (3 psi + 30 degrees). The dipeptide energy maps Vtot(phi, psi) obtained using these functions, instead of the normally accepted torsional functions, were found to explain various observations, such as the absence of the left-handed alpha helix and the C7 conformation, and the relatively high density of points near the line psi = 0 degrees. These functions, derived from observational data on protein structures, will, it is hoped, explain various previously unexplained facts in polypeptide conformation.
Abstract:
The torsional potential functions Vt(φ) and Vt(ψ) around single bonds N–Cα and Cα–C, which can be used in conformational studies of oligopeptides, polypeptides and proteins, have been derived, using crystal structure data of 22 globular proteins, fitting the observed distribution in the (φ, ψ)-plane with the value of Vtot(φ, ψ), using the Boltzmann distribution. The averaged torsional potential functions, obtained from various amino acid residues in L-configuration, are Vt(φ) = – 1.0 cos (φ + 60°); Vt(ψ) = – 0.5 cos (ψ + 60°) – 1.0 cos (2ψ + 30°) – 0.5 cos (3ψ + 30°). The dipeptide energy maps Vtot(φ, ψ) obtained using these functions, instead of the normally accepted torsional functions, were found to explain various observations, such as the absence of the left-handed alpha helix and the C7 conformation, and the relatively high density of points near the line ψ = 0°. These functions, derived from observational data on protein structures, will, it is hoped, explain various previously unexplained facts in polypeptide conformation.
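The averaged torsional functions quoted in the abstract above are simple to evaluate numerically. The sketch below assumes angles are given in degrees and uses the coefficients and phase shifts exactly as printed (with the leading minus signs); the function names are illustrative, and the energy unit is whatever the source uses, which the abstract does not state.

```python
from math import cos, radians


def vt_phi(phi_deg):
    """Vt(phi) = -1.0 cos(phi + 60 deg), with phi in degrees."""
    return -1.0 * cos(radians(phi_deg + 60.0))


def vt_psi(psi_deg):
    """Vt(psi) = -0.5 cos(psi + 60) - 1.0 cos(2 psi + 30) - 0.5 cos(3 psi + 30), psi in degrees."""
    return (
        -0.5 * cos(radians(psi_deg + 60.0))
        - 1.0 * cos(radians(2.0 * psi_deg + 30.0))
        - 0.5 * cos(radians(3.0 * psi_deg + 30.0))
    )


def vt_total(phi_deg, psi_deg):
    """Torsional contribution only; the full Vtot(phi, psi) also includes non-torsional terms."""
    return vt_phi(phi_deg) + vt_psi(psi_deg)


# Scan psi in 30-degree steps to see where the torsional term is lowest.
for psi in range(-180, 181, 30):
    print(psi, round(vt_psi(psi), 3))
```

Scanning ψ this way gives a quick view of how the torsional term varies with the angle, including near the line ψ = 0° mentioned in the abstract.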
Abstract:
The blood-brain barrier (BBB) is a unique barrier that strictly regulates the entry of endogenous substrates and xenobiotics into the brain. This is due to its tight junctions and the array of transporters and metabolic enzymes that are expressed. The determination of brain concentrations in vivo is difficult, laborious and expensive, which means that there is interest in developing predictive tools of brain distribution. Predicting brain concentrations is important even in early drug development to ensure efficacy of central nervous system (CNS) targeted drugs and safety of non-CNS drugs. The literature review covers the most common current in vitro, in vivo and in silico methods of studying transport into the brain, concentrating on transporter effects. The consequences of efflux mediated by p-glycoprotein, the most widely characterized transporter expressed at the BBB, are also discussed. The aim of the experimental study was to build a pharmacokinetic (PK) model to describe p-glycoprotein substrate drug concentrations in the brain using commonly measured in vivo parameters of brain distribution. The possibility of replacing in vivo parameter values with their in vitro counterparts was also studied. All data for the study were taken from the literature. A simple 2-compartment PK model was built using the Stella™ software. Brain concentrations of morphine, loperamide and quinidine were simulated and compared with published studies. Correlation of in vitro measured efflux ratio (ER) from different studies was evaluated, in addition to studying correlation between in vitro and in vivo measured ER. A Stella™ model was also constructed to simulate an in vitro transcellular monolayer experiment, to study the sensitivity of measured ER to changes in passive permeability and Michaelis-Menten kinetic parameter values. Interspecies differences in rats and mice were investigated with regard to brain permeability and drug binding in brain tissue.
Although the PK brain model was able to capture the concentration-time profiles for all 3 compounds in both brain and plasma and performed fairly well for morphine, it underestimated brain concentrations for quinidine and overestimated them for loperamide. Because the ratio of concentrations in brain and blood is dependent on the ER, it is suggested that the variable values cited for this parameter and its inaccuracy could be one explanation for the failure of predictions. Validation of the model with more compounds is needed to draw further conclusions. In vitro ER showed variable correlation between studies, indicating variability due to experimental factors such as test concentration, but overall differences were small. Good correlation between in vitro and in vivo ER at low concentrations supports the possibility of using in vitro ER in the PK model. The in vitro simulation illustrated that in the simulation setting, efflux is significant only with low passive permeability, which highlights the fact that the cell model used to measure ER must have low enough paracellular permeability to correctly mimic the in vivo situation.
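The dependence of the brain-to-plasma concentration ratio on the ER can be illustrated with a toy two-compartment simulation. This is not the Stella™ model from the study: the rate equations, parameter names and values below are assumptions chosen for illustration only. Passive uptake into the brain is taken as ps × plasma concentration, and efflux is assumed to scale the brain-to-plasma clearance by the efflux ratio.

```python
def simulate_brain(er, ps=1.0, v_brain=1.0, k_el=0.0,
                   c_plasma0=1.0, t_end=20.0, dt=0.01):
    """Forward-Euler integration of a toy plasma/brain model.

    Assumed equations (hypothetical, for illustration):
        dCp/dt = -k_el * Cp
        dCb/dt = ps * (Cp - er * Cb) / v_brain
    Returns the final plasma and brain concentrations.
    """
    c_p, c_b = c_plasma0, 0.0
    steps = int(round(t_end / dt))
    for _ in range(steps):
        d_cp = -k_el * c_p
        d_cb = ps * (c_p - er * c_b) / v_brain
        c_p += d_cp * dt
        c_b += d_cb * dt
    return c_p, c_b


# With constant plasma concentration (k_el = 0), the brain/plasma ratio
# settles near 1 / ER in this toy model.
for er in (1.0, 2.0, 4.0):
    c_p, c_b = simulate_brain(er)
    print(er, round(c_b / c_p, 4))
```

Under these assumptions, an error in the ER carries over directly into the predicted brain concentration, which is consistent with the explanation offered in the abstract for the prediction failures.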
Abstract:
The structures of Ca0.5Ti2P3O12 and Sr0.5Ti2P3O12, low-thermal-expansion materials, have been refined by the Rietveld method using high-resolution powder X-ray diffraction (XRD) data. The assignment of space group R3̄ to NASICON-type compounds containing divalent cations is confirmed. 31P magic-angle spinning nuclear magnetic resonance (MAS NMR) data are presented as supporting data. A comparison of changes in the polyhedral network resulting from the cation distribution is made with NaTi2P3O12 and Nb2P3O12. Factors that may govern thermal expansion in this family of compounds are discussed.