932 resultados para HMM, Nosocomial Pathogens, Genotyping, Statistical Modelling, VRE
Resumo:
A formalism for modelling the dynamics of Genetic Algorithms (GAs) using methods from statistical mechanics, originally due to Prugel-Bennett and Shapiro, is reviewed, generalized and improved upon. This formalism can be used to predict the averaged trajectory of macroscopic statistics describing the GA's population. These macroscopics are chosen to average well between runs, so that fluctuations from mean behaviour can often be neglected. Where necessary, non-trivial terms are determined by assuming maximum entropy with constraints on known macroscopics. Problems of realistic size are described in compact form and finite population effects are included, often proving to be of fundamental importance. The macroscopics used here are cumulants of an appropriate quantity within the population and the mean correlation (Hamming distance) within the population. Including the correlation as an explicit macroscopic provides a significant improvement over the original formulation. The formalism is applied to a number of simple optimization problems in order to determine its predictive power and to gain insight into GA dynamics. Problems which are most amenable to analysis come from the class where alleles within the genotype contribute additively to the phenotype. This class can be treated with some generality, including problems with inhomogeneous contributions from each site, non-linear or noisy fitness measures, simple diploid representations and temporally varying fitness. The results can also be applied to a simple learning problem, generalization in a binary perceptron, and a limit is identified for which the optimal training batch size can be determined for this problem. The theory is compared to averaged results from a real GA in each case, showing excellent agreement if the maximum entropy principle holds. Some situations where this approximation brakes down are identified. In order to fully test the formalism, an attempt is made on the strong sc np-hard problem of storing random patterns in a binary perceptron. Here, the relationship between the genotype and phenotype (training error) is strongly non-linear. Mutation is modelled under the assumption that perceptron configurations are typical of perceptrons with a given training error. Unfortunately, this assumption does not provide a good approximation in general. It is conjectured that perceptron configurations would have to be constrained by other statistics in order to accurately model mutation for this problem. Issues arising from this study are discussed in conclusion and some possible areas of further research are outlined.
Resumo:
The topic of this thesis is the development of knowledge based statistical software. The shortcomings of conventional statistical packages are discussed to illustrate the need to develop software which is able to exhibit a greater degree of statistical expertise, thereby reducing the misuse of statistical methods by those not well versed in the art of statistical analysis. Some of the issues involved in the development of knowledge based software are presented and a review is given of some of the systems that have been developed so far. The majority of these have moved away from conventional architectures by adopting what can be termed an expert systems approach. The thesis then proposes an approach which is based upon the concept of semantic modelling. By representing some of the semantic meaning of data, it is conceived that a system could examine a request to apply a statistical technique and check if the use of the chosen technique was semantically sound, i.e. will the results obtained be meaningful. Current systems, in contrast, can only perform what can be considered as syntactic checks. The prototype system that has been implemented to explore the feasibility of such an approach is presented, the system has been designed as an enhanced variant of a conventional style statistical package. This involved developing a semantic data model to represent some of the statistically relevant knowledge about data and identifying sets of requirements that should be met for the application of the statistical techniques to be valid. Those areas of statistics covered in the prototype are measures of association and tests of location.
Resumo:
Efficient numerical modelling of the power, spectral and statistical properties of partially coherent quasi-CW Raman fiber laser radiation is presented. XPM between pump wave and generated Stokes wave is not important in the generation spectrum broadening and XPM term can be omitted in propagation equation what sufficiently speeds-up simulations. The time dynamics of Raman fiber laser (RFL) is stochastic exhibiting events several times more intense that the mean value on the ps timescale. However, the RFL has different statistical properties on different time scales. The probability density function of spectral power density is exponential for the generation modes located either in the spectrum centre or spectral wings while the phases are distributed uniformly. The pump wave preserves the initial Gaussian statistics during propagation in the laser cavity. Intense pulses in the pump wave are evolved under the SPM influence and are not disturbed by the dispersion. Contrarily, in the generated wave the dispersion plays a significant role that results in stochastic behavior. © 2012 Elsevier B.V. All rights reserved.
Resumo:
We would like to thank the study participants and the clinical and research staff at the Queen Elizabeth National Spinal Injury Unit, as without them this study would not have been possible. We are grateful for the funding received from Glasgow Research Partnership in Engineering for the employment of SC during data collection for this study. We would like to thank the Royal Society of Edinburgh's Scottish Crucible scheme for providing the opportunity for this collaboration to occur. We are also indebted to Maria Dumitrascuta for her time and effort in producing inter-repeatability results for the shape models.
Resumo:
We would like to thank the study participants and the clinical and research staff at the Queen Elizabeth National Spinal Injury Unit, as without them this study would not have been possible. We are grateful for the funding received from Glasgow Research Partnership in Engineering for the employment of SC during data collection for this study. We would like to thank the Royal Society of Edinburgh's Scottish Crucible scheme for providing the opportunity for this collaboration to occur. We are also indebted to Maria Dumitrascuta for her time and effort in producing inter-repeatability results for the shape models.
Resumo:
The prevalence and concentrations of Campylobacter jejuni, Salmonella spp. and enterohaemorrhagic E. coli (EHEC) were investigated in surface waters in Brisbane, Australia using quantitative PCR (qPCR) based methodologies. Water samples were collected from Brisbane City Botanic Gardens (CBG) Pond, and two urban tidal creeks (i.e., Oxley Creek and Blunder Creek). Of the 32 water samples collected, 8 (25%), 1 (3%), 9 (28%), 14 (44%), and 15 (47%) were positive for C. jejuni mapA, Salmonella invA, EHEC O157 LPS, EHEC VT1, and EHEC VT2 genes, respectively. The presence/absence of the potential pathogens did not correlate with either E. coli or enterococci concentrations as determined by binary logistic regression. In conclusion, the high prevalence, and concentrations of potential zoonotic pathogens along with the concentrations of one or more fecal indicators in surface water samples indicate a poor level of microbial quality of surface water, and could represent a significant health risk to users. The results from the current study would provide valuable information to the water quality managers in terms of minimizing the risk from pathogens in surface waters.
Resumo:
Multicarrier code division multiple access (MC-CDMA) is a very promising candidate for the multiple access scheme in fourth generation wireless communi- cation systems. During asynchronous transmission, multiple access interference (MAI) is a major challenge for MC-CDMA systems and significantly affects their performance. The main objectives of this thesis are to analyze the MAI in asyn- chronous MC-CDMA, and to develop robust techniques to reduce the MAI effect. Focus is first on the statistical analysis of MAI in asynchronous MC-CDMA. A new statistical model of MAI is developed. In the new model, the derivation of MAI can be applied to different distributions of timing offset, and the MAI power is modelled as a Gamma distributed random variable. By applying the new statistical model of MAI, a new computer simulation model is proposed. This model is based on the modelling of a multiuser system as a single user system followed by an additive noise component representing the MAI, which enables the new simulation model to significantly reduce the computation load during computer simulations. MAI reduction using slow frequency hopping (SFH) technique is the topic of the second part of the thesis. Two subsystems are considered. The first sub- system involves subcarrier frequency hopping as a group, which is referred to as GSFH/MC-CDMA. In the second subsystem, the condition of group hopping is dropped, resulting in a more general system, namely individual subcarrier frequency hopping MC-CDMA (ISFH/MC-CDMA). This research found that with the introduction of SFH, both of GSFH/MC-CDMA and ISFH/MC-CDMA sys- tems generate less MAI power than the basic MC-CDMA system during asyn- chronous transmission. Because of this, both SFH systems are shown to outper- form MC-CDMA in terms of BER. This improvement, however, is at the expense of spectral widening. In the third part of this thesis, base station polarization diversity, as another MAI reduction technique, is introduced to asynchronous MC-CDMA. The com- bined system is referred to as Pol/MC-CDMA. In this part a new optimum com- bining technique namely maximal signal-to-MAI ratio combining (MSMAIRC) is proposed to combine the signals in two base station antennas. With the applica- tion of MSMAIRC and in the absents of additive white Gaussian noise (AWGN), the resulting signal-to-MAI ratio (SMAIR) is not only maximized but also in- dependent of cross polarization discrimination (XPD) and antenna angle. In the case when AWGN is present, the performance of MSMAIRC is still affected by the XPD and antenna angle, but to a much lesser degree than the traditional maximal ratio combining (MRC). Furthermore, this research found that the BER performance for Pol/MC-CDMA can be further improved by changing the angle between the two receiving antennas. Hence the optimum antenna angles for both MSMAIRC and MRC are derived and their effects on the BER performance are compared. With the derived optimum antenna angle, the Pol/MC-CDMA system is able to obtain the lowest BER for a given XPD.
Resumo:
Modelling of interferometric signals related to tear film surface quality is considered. In the context of tear film surface quality estimation in normal healthy eyes, two clinical parameters are of interest: the build-up time, and the average interblink surface quality. The former is closely related to the signal derivative while the latter to the signal itself. Polynomial signal models, chosen for a particular set of noisy interferometric measurements, can be optimally selected, in some sense, with a range of information criteria such as AIC, MDL, Cp, and CME. Those criteria, however, do not always guarantee that the true derivative of the signal is accurately represented and they often overestimate it. Here, a practical method for judicious selection of model order in a polynomial fitting to a signal is proposed so that the derivative of the signal is adequately represented. The paper highlights the importance of context-based signal modelling in model order selection.