97 results for computational statistics
Abstract:
The maximum entropy approach to classification is well studied in applied statistics and machine learning, and almost all the methods that exist in the literature are discriminative in nature. In this paper, we introduce a maximum entropy classification method with feature selection for high-dimensional data such as text datasets that is generative in nature. To tackle the curse of dimensionality of large data sets, we employ the conditional independence assumption (Naive Bayes) and simultaneously perform feature selection by enforcing a `maximum discrimination' between the estimated class conditional densities. For two-class problems, the proposed method uses Jeffreys (J) divergence to discriminate the class conditional densities. To extend our method to the multi-class case, we propose a new approach based on a multi-distribution divergence: we replace Jeffreys divergence by Jensen-Shannon (JS) divergence to discriminate the conditional densities of multiple classes. To reduce computational complexity, we employ a modified Jensen-Shannon divergence (JS(GM)), based on the AM-GM inequality. We show that the resulting divergence is a natural generalization of Jeffreys divergence to the case of multiple distributions. As theoretical justification, we show that when one intends to select the best features in a generative maximum entropy approach, maximum discrimination using J-divergence emerges naturally in binary classification. The performance of the proposed algorithms and a comparative study are demonstrated on high-dimensional text and gene expression datasets, showing that our methods scale well to high-dimensional data.
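For reference, the two standard divergences named in this abstract can be computed for discrete distributions as in the sketch below. It assumes strictly positive probability vectors for Jeffreys divergence and uniform class weights by default for Jensen-Shannon divergence. The function names are illustrative, and the modified JS(GM) divergence is not reproduced because the abstract does not give its definition.

```python
import math


def kl(p, q):
    """Kullback-Leibler divergence KL(p || q) for discrete distributions.

    Terms with p_i == 0 contribute zero; q_i must be positive wherever p_i > 0.
    """
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)


def jeffreys(p, q):
    """Jeffreys (J) divergence: the symmetrised KL divergence.

    Assumes both distributions are strictly positive on the same support.
    """
    return kl(p, q) + kl(q, p)


def jensen_shannon(dists, weights=None):
    """Jensen-Shannon divergence of several distributions.

    Computed as the weighted average KL divergence of each distribution
    from the weighted mixture. Uniform weights are an assumption made here.
    """
    n = len(dists)
    if weights is None:
        weights = [1.0 / n] * n
    size = len(dists[0])
    mixture = [sum(w * d[i] for w, d in zip(weights, dists)) for i in range(size)]
    return sum(w * kl(d, mixture) for w, d in zip(weights, dists))
```

Unlike Jeffreys divergence, the Jensen-Shannon form accepts any number of distributions, which is what makes it a candidate for the multi-class setting described above.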
Abstract:
This paper presents a simple technique for reducing the computational effort of solving geotechnical stability problems with upper bound finite element limit analysis and linear optimization. In the proposed method, the problem domain is discretized into a number of regions, and in each region a particular order (number of sides) of the polygon is chosen to linearize the Mohr-Coulomb yield criterion. A higher-order polygon needs to be selected only in regions where the plastic strain rate is higher. This considerably reduces the computational effort required to solve the problem. Using the proposed method, the bearing capacity has been computed for smooth and rough strip footings, and the results are found to be quite satisfactory.
Abstract:
In this paper we investigate the local flame surface statistics of constant-pressure turbulent expanding flames. First, the statistics of the local length ratio are determined experimentally from high-speed planar Mie scattering images of spherically expanding flames. The length ratio on the measurement plane, at predefined equiangular sectors, is defined as the ratio of the actual flame length to the length of a circular arc of radius equal to the average radius of the flame. Assuming an isotropic distribution of such flame segments, we then convolve suitable forms of the length-ratio probability distribution functions (pdfs) to arrive at the corresponding area-ratio pdfs. It is found that both the length-ratio and area-ratio pdfs are nearly log-normal and show self-similar behavior with increasing radius. The near log-normality and rather intermittent behavior of the flame-length ratio suggest a similarity with dissipation rate quantities, which motivates multifractal analysis. (C) 2014 AIP Publishing LLC.
Abstract:
We present a detailed direct numerical simulation of statistically steady, homogeneous, isotropic, two-dimensional magnetohydrodynamic turbulence. Our study concentrates on the inverse cascade of the magnetic vector potential. We examine the dependence of the statistical properties of such turbulence on dissipation and friction coefficients. We extend earlier work significantly by calculating fluid and magnetic spectra, probability distribution functions (PDFs) of the velocity, magnetic, vorticity, current, stream-function, and magnetic-vector-potential fields, and their increments. We quantify the deviations of these PDFs from Gaussian ones by computing their flatnesses and hyperflatnesses. We also present PDFs of the Okubo-Weiss parameter, which distinguishes between vortical and extensional flow regions, and its magnetic analog. We show that the hyperflatnesses of PDFs of the increments of the stream function and the magnetic vector potential exhibit significant scale dependence and we examine the implication of this for the multiscaling of structure functions. We compare our results with those of earlier studies.
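The flatness and hyperflatness mentioned here measure how far a PDF departs from a Gaussian. A minimal sketch follows, assuming the common convention F4 = <x^4>/<x^2>^2 and F6 = <x^6>/<x^2>^3 with central moments; the abstract does not state the exact definitions used in the paper, and the function names are illustrative. Under this convention a Gaussian gives F4 = 3 and F6 = 15.

```python
import random


def central_moment(samples, order):
    """Sample central moment of the given order."""
    mean = sum(samples) / len(samples)
    return sum((x - mean) ** order for x in samples) / len(samples)


def flatness(samples):
    """Fourth moment normalised by the squared second moment."""
    return central_moment(samples, 4) / central_moment(samples, 2) ** 2


def hyperflatness(samples):
    """Sixth moment normalised by the cubed second moment."""
    return central_moment(samples, 6) / central_moment(samples, 2) ** 3


if __name__ == "__main__":
    rng = random.Random(0)
    gaussian = [rng.gauss(0.0, 1.0) for _ in range(100_000)]
    # Values close to 3 and 15 indicate near-Gaussian statistics.
    print(flatness(gaussian), hyperflatness(gaussian))
```

Applying the same functions to field increments at several separations would show the scale dependence the abstract refers to.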
Abstract:
Human Leukocyte Antigen (HLA) plays an important role in presenting foreign pathogens to our immune system, thereby eliciting early immune responses. HLA genes are highly polymorphic, giving rise to diverse antigen presentation capability. An important factor contributing to the enormous variation in individual responses to diseases is the difference in HLA profiles. The heterogeneity in allele-specific disease responses determines the overall epidemiological outcome of a disease. Here we propose an agent-based computational framework, capable of incorporating allele-specific information, to analyze disease epidemiology. This framework assumes an SIR model to estimate the average disease transmission and recovery rates. Using an epitope prediction tool, it performs sequence-based epitope detection for a given pathogen genome and derives an allele-specific disease susceptibility index that depends on the epitope detection efficiency. The resulting allele-specific disease transmission rate is then fed to the agent-based epidemiology model to analyze the disease outcome. The methodology presented here has potential use in understanding how a disease spreads and in identifying effective measures to control it.
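The SIR model that the framework assumes can be illustrated with a minimal sketch. The version below is the deterministic compartmental form integrated with explicit Euler steps, not the agent-based model of the paper; the function name, parameter names and step size are assumptions made for illustration.

```python
def simulate_sir(beta, gamma, s0, i0, r0, dt=0.1, steps=1000):
    """Integrate the classic SIR equations with explicit Euler steps.

    dS/dt = -beta * S * I / N
    dI/dt =  beta * S * I / N - gamma * I
    dR/dt =  gamma * I

    beta is the transmission rate and gamma the recovery rate.
    Returns a list of (S, I, R) tuples, one per time step.
    """
    n = s0 + i0 + r0
    s, i, r = float(s0), float(i0), float(r0)
    history = [(s, i, r)]
    for _ in range(steps):
        new_infections = beta * s * i / n * dt
        new_recoveries = gamma * i * dt
        s -= new_infections
        i += new_infections - new_recoveries
        r += new_recoveries
        history.append((s, i, r))
    return history


if __name__ == "__main__":
    trajectory = simulate_sir(beta=0.3, gamma=0.1, s0=990, i0=10, r0=0)
    print(trajectory[-1])
```

In an allele-specific setting, one could imagine running such a model with a different beta per HLA group, but how the paper maps its susceptibility index to a transmission rate is not stated in the abstract.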
Abstract:
We apply the objective method of Aldous to the problem of finding the minimum-cost edge cover of the complete graph with independent and identically distributed random edge costs. The limit, as the number of vertices goes to infinity, of the expected minimum cost for this problem is known via a combinatorial approach of Hessler and Wästlund. We provide a proof of this result using the machinery of the objective method and local weak convergence, which was used to prove the ζ(2) limit of the random assignment problem. A proof via the objective method is useful because it provides more information on the nature of the edges incident on a typical root in the minimum-cost edge cover. We further show that a belief propagation algorithm converges asymptotically to the optimal solution. This can be applied to the problem of semantic projection in computational linguistics. The belief propagation algorithm yields a near-optimal solution with lower complexity than the best known algorithms designed for optimality in worst-case settings.
Abstract:
The complex perovskite oxide SrRuO3 shows intriguing transport properties at low temperatures due to the interplay of spin, charge, and orbital degrees of freedom. One of the open questions in this system concerns the origin and nature of the low-temperature glassy state. In this paper we report measurements of higher-order statistics of resistance fluctuations in epitaxial thin films of SrRuO3 to probe this issue. We observe large low-frequency non-Gaussian resistance fluctuations over a certain temperature range. Our observations are compatible with those expected of a spin-glass system with properties described by hierarchical dynamics, rather than those of a simple ferromagnet with a large coercivity.
Abstract:
A comprehensive analysis of the crystal packing and the energetic features of a series of four biologically active molecules belonging to the family of substituted 4-(benzylideneamino)-3-(4-fluoro-3-phenoxyphenyl)-1H-1,2,4-triazole-5-(4H)-thione derivatives has been performed based on the molecular conformation and the supramolecular packing. This involves the formation of a short centrosymmetric R_2^2(8) NH...S supramolecular synthon in the solid state, along with CH...S, CH...O, CH...N, CH...F, CH...Cl, CF...FC, CCl...ClC, and CH...π intermolecular interactions and π...π stacking, which are examined to evaluate the role of noncovalent interactions in the crystal. Such synthons contribute substantially to the interaction energy (-18 to -20 kcal/mol), as obtained from the PIXEL calculation, wherein the Coulombic and polarization contributions are more significant than the dispersion contribution. The geometrical characteristics of such synthons favor short distances, and related molecules with these geometries are rare in the Cambridge Structural Database (CSD). Furthermore, their interaction energies have been compared with those present in our molecules in the solid state. The topological characteristics of the NH...S supramolecular synthon, in addition to the related weak interactions CH...N, CH...Cl, CF...FC, and CCl...ClC, have been estimated using the quantum theory of atoms in molecules (QTAIM). In addition, an analysis of the Hirshfeld surfaces and associated fingerprint plots of these four molecules has provided a platform for evaluating the contributions of the different atom...atom contacts to the packing of the molecules in solids.
Abstract:
Random changes in the alkyl substitution patterns of fluorescent dyes, e.g. BODIPYs, are often accompanied by significant changes in their photophysical properties. To understand such changes in closely related molecular systems, a comparative DFT (density functional theory) computational investigation was performed to examine how alkyl substitution controls the structural and electronic nature of BODIPY dyes. A systematic strategy was used that considers all possible constitutionally isomeric molecules in order to understand the effects of the alkyl groups on the BODIPY molecules. Four different computational methods (i.e., B3LYP/6-31G(d), B3LYP/6-311++G(d,p), ωB97X-D/6-311++G(d,p) and mPW1PW91/6-311++G(d,p)) were employed to check the agreement of the trends in the molecular properties. In line with experimental observations, it was found that alkyl substituents at the 3/5-positions of BODIPY dyes effectively participate in the stabilization as well as the planarization of such molecules. Screening of all the possible isomeric molecular systems was used to understand the individual properties and overall effects of typical alkyl substituents in controlling several basic properties of such BODIPY molecules.
Abstract:
We derive analytical expressions for the probability distribution function (PDF) of electron transport in a simple model of a quantum junction in the presence of thermal fluctuations. Our approach is based on large deviation theory combined with the generating function method. For a large number of electrons transferred, the PDF is found to decay exponentially in the tails, with different rates due to the applied bias. This asymmetry in the PDF is related to the fluctuation theorem. The statistics of fluctuations are analyzed in terms of the Fano factor. Thermal fluctuations play a quantitative role in determining the statistics of electron transfer; they tend to suppress the average current while enhancing the fluctuations in particle transfer. This gives rise to both bunching and antibunching phenomena, as determined by the Fano factor. The thermal fluctuations and shot noise compete with each other and determine the net (effective) statistics of particle transfer. An exact analytical expression is obtained for the delay time distribution. The optimal values of the delay time between successive electron transfers can be lowered below the corresponding shot noise values by tuning the thermal effects. (C) 2015 AIP Publishing LLC.
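As a small illustration of the Fano factor used above, the sketch below computes F = Var(n) / <n> from a list of transferred-particle counts. The function name and the input format are assumptions; the paper derives the factor analytically rather than from sampled counts. F = 1 corresponds to Poissonian statistics, and values above or below 1 are commonly read as bunching or antibunching respectively.

```python
def fano_factor(counts):
    """Fano factor: variance of the counts divided by their mean.

    Uses the population variance. Raises ValueError if the mean is zero,
    since the ratio is undefined in that case.
    """
    n = len(counts)
    mean = sum(counts) / n
    if mean == 0:
        raise ValueError("Fano factor is undefined for zero mean count")
    variance = sum((c - mean) ** 2 for c in counts) / n
    return variance / mean


if __name__ == "__main__":
    # Perfectly regular transfer gives zero variance, hence F = 0.
    print(fano_factor([3, 3, 3, 3]))
```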
Abstract:
It is well established that Re and Ru additions to Ni-base superalloys result in improved creep performance and phase stability. However, the role of Re and Ru and their synergetic effects are not well understood, and the first step in understanding these effects is to design alloys with controlled microstructural parameters. A computational approach was undertaken in the present work for designing model alloys with varying levels of Re and Ru. Thermodynamic and first-principles calculations were employed in a complementary manner to design a set of alloys with varying Re and Ru levels, constrained to have constant microstructural parameters, i.e., phase fractions and lattice misfit, across the alloys. Three ternary/quaternary alloys of type Ni-Al-xRe-yRu were thus designed. These compositions were subsequently cast, homogenized and aged. Experimental results suggest that while the measured volume fraction matches the predicted value in the Ru-containing alloy, the volume fraction is significantly higher than the designed value in the Re-containing alloys. This is possibly due to errors in the thermodynamic database used to predict phase fraction and composition. These errors are also reflected in the mismatch between predicted and measured values of misfit.
Abstract:
Local heterogeneity is ubiquitous in natural aqueous systems. It can be caused locally by external biomolecular subsystems such as proteins, DNA, micelles and reverse micelles, or nanoscopic materials, but can also be intrinsic to the thermodynamic nature of the aqueous solution itself (as in binary mixtures or at the gas-liquid interface). The altered dynamics of water in the presence of such diverse surfaces has attracted considerable attention in recent years. As these interfaces are quite narrow, only a few molecular layers thick, they are hard to study by conventional methods. The recent development of two-dimensional infrared (2D-IR) spectroscopy allows us to estimate the length and time scales of such dynamics fairly accurately. In this work, we present a series of studies employing 2D-IR spectroscopy to investigate (i) the heterogeneous dynamics of water inside reverse micelles of varying sizes, (ii) supercritical water near the Widom line, which is known to exhibit pronounced density fluctuations, and (iii) the collective and local polarization fluctuations of water molecules in the presence of several different proteins. The spatio-temporal correlation of confined water molecules inside reverse micelles of varying sizes is well captured through the spectral diffusion of the corresponding 2D-IR spectra. In the case of supercritical water, we also observe a strong signature of dynamic heterogeneity in the elongated shape of the 2D-IR spectra. In this case the relaxation is ultrafast. We find remarkable agreement between the different tools employed to study the relaxation of density heterogeneity. For aqueous protein solutions, we find that the calculated dielectric constant of each system consistently shows a noticeable increase compared to that of neat water.
However, the `effective' dielectric constant for successive layers shows significant variation, with the layer adjacent to the protein having a much lower value. Relaxation is also slowest at the surface. We find that the dielectric constant achieves the bulk value at distances more than 3 nm from the surface of the protein.
Abstract:
The Variational Asymptotic Method (VAM) is used for modeling a coupled non-linear electromechanical problem with applications in aircraft and Micro Aerial Vehicle (MAV) development. VAM coupled with geometrically exact kinematics forms a powerful tool for analyzing complex nonlinear phenomena, as shown previously by many in the literature [3-7] for various challenging problems such as modeling of initially twisted helicopter rotor blades, matrix crack propagation in a composite, modeling of hyperelastic plates and various multi-physics problems. The problem consists of the design and analysis of a piezocomposite laminate subjected to electrical voltage(s) that can induce direct and planar distributed shear stresses and strains in the structure. The deformations are large, and conventional beam theories are inappropriate for the analysis. The behavior of an elastic body is completely described by its energy. This energy must be integrated over the cross-sectional area to obtain the 1-D behavior, as is typical in a beam analysis. VAM can be used efficiently to approximate the 3-D strain energy as closely as possible. To perform this simplification, VAM makes use of thickness to width, width to length, width multiplied by initial twist, and strain as small parameters embedded in the problem definition, and provides a way to approach the exact solution asymptotically. In this work, the above-mentioned electromechanical problem is modeled using VAM, which breaks down the 3-D elasticity problem into two parts, namely a 2-D non-linear cross-sectional analysis and a 1-D non-linear analysis along the reference curve. The recovery relations obtained as a by-product of the cross-sectional analysis are used to obtain 3-D stresses, displacements and velocity contours.
The piezo-composite laminate chosen for the initial phase of computational modeling is made up of commercially available Macro Fiber Composites (MFCs) stacked together in an arbitrary lay-up and actuated by applied electrical voltages. The closed-form expressions for the sectional forces and moments obtained from the cross-sectional analysis show the electro-mechanical coupling and the relative contribution of the electric field in the individual layers of the piezo-composite laminate. The spatial and temporal constitutive laws obtained from the cross-sectional analysis are substituted into the 1-D fully intrinsic, geometrically exact equilibrium equations of motion and the 1-D intrinsic kinematical equations to solve for all 1-D generalized variables as functions of time and of the coordinate along the reference curve, x_1.
Abstract:
This paper lists some references that could in some way be relevant in the context of the real-time computational simulation of biological organs, the research area being defined in a very broad sense. This paper contains 198 references.
Abstract:
The problem of determining the system reliability of randomly vibrating structures arises in many application areas of engineering. In this paper we discuss approaches based on Monte Carlo simulations and laboratory testing to tackle problems of time-variant system reliability estimation. The strategy we adopt is based on applying Girsanov's transformation to the governing stochastic differential equations, which enables estimation of the probability of failure with significantly fewer samples than are needed in a direct simulation study. Notably, we show that the ideas from Girsanov's transformation based Monte Carlo simulations can be extended to laboratory testing, to assess the system reliability of engineering structures with a reduced number of samples and hence with reduced testing times. Illustrative examples include computational studies on a 10-degree-of-freedom nonlinear system model and laboratory/computational investigations on the road load response of an automotive system tested on a four-post test rig. (C) 2015 Elsevier Ltd. All rights reserved.
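The idea of Girsanov-based importance sampling can be sketched on a scalar toy problem. The example below is an illustration under stated assumptions, not the method of the paper: it simulates dX = a(X) dt + sigma dW with Euler-Maruyama, adds a constant control to the drift to push paths towards a failure threshold, and corrects each failing path with the likelihood ratio accumulated up to the first crossing. The function name, the constant control and the first-passage failure criterion are all hypothetical choices.

```python
import math
import random


def failure_probability(drift, sigma, x0, threshold, t_end, n_steps,
                        n_paths, control=0.0, seed=0):
    """Estimate P(X crosses threshold before t_end) by importance sampling.

    Paths are simulated with the extra drift `control`. Each step multiplies
    the weight by the ratio of the original to the controlled Gaussian
    transition density:
        exp(-theta * sqrt(dt) * xi - 0.5 * theta**2 * dt), theta = control / sigma.
    With control = 0 every weight is 1 and this is crude Monte Carlo.
    """
    rng = random.Random(seed)
    dt = t_end / n_steps
    sqrt_dt = math.sqrt(dt)
    theta = control / sigma
    total = 0.0
    for _path in range(n_paths):
        x = x0
        log_weight = 0.0
        for _step in range(n_steps):
            xi = rng.gauss(0.0, 1.0)
            x += (drift(x) + control) * dt + sigma * sqrt_dt * xi
            log_weight += -theta * sqrt_dt * xi - 0.5 * theta * theta * dt
            if x >= threshold:
                # Failure: add the likelihood ratio up to the crossing time.
                total += math.exp(log_weight)
                break
    return total / n_paths


if __name__ == "__main__":
    # Rare crossing of level 3 by a standard Brownian motion within unit time.
    estimate = failure_probability(lambda x: 0.0, 1.0, 0.0, 3.0, 1.0,
                                   n_steps=50, n_paths=20000, control=3.0)
    print(estimate)
```

Because the controlled paths fail often, far fewer samples are needed than with direct simulation of the rare event, which is the saving the abstract describes.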