884 results for data structures
Abstract:
A certain type of bacterial inclusion, known as a bacterial microcompartment, was recently identified and imaged through cryo-electron tomography. A 3D object reconstructed from single-axis, limited-angle tilt-series cryo-electron tomography contains missing regions, a problem known as the missing wedge problem. Because of these missing regions in the reconstructed images, analyzing the 3D structures is challenging. Existing methods overcome this problem by aligning and averaging several similarly shaped objects. These schemes work well if the objects are symmetric and several objects of nearly the same shape and size are available. Since the bacterial inclusions studied here are not symmetric, are deformed, and show a wide range of shapes and sizes, the existing approaches are not appropriate. This research develops new statistical methods for analyzing geometric properties of these bacterial inclusions, such as volume, symmetry, aspect ratio, and polyhedral structure, in the presence of missing data. These methods work with deformed, non-symmetric objects of varied shape and do not require multiple objects to handle the missing wedge problem. The developed methods and contributions include: (a) an improved method for manual image segmentation, (b) a new approach to 'complete' the segmented and reconstructed incomplete 3D images, (c) a polyhedral structural distance model to predict the polyhedral shapes of these microstructures, (d) a new shape descriptor for polyhedral shapes, named the polyhedron profile statistic, and (e) Bayes, linear discriminant analysis, and support vector machine classifiers for supervised classification of incomplete polyhedral shapes. Finally, the predicted 3D shapes of these bacterial microstructures belong to the Johnson solids family, and these shapes, along with their other geometric properties, are important for a better understanding of their chemical and biological characteristics.
Abstract:
Bayesian methods offer a flexible and convenient probabilistic learning framework to extract interpretable knowledge from complex and structured data. Such methods can characterize dependencies among multiple levels of hidden variables and share statistical strength across heterogeneous sources. In the first part of this dissertation, we develop two dependent variational inference methods for full posterior approximation in non-conjugate Bayesian models through hierarchical mixture- and copula-based variational proposals, respectively. The proposed methods move beyond the widely used factorized approximation to the posterior and provide generic applicability to a broad class of probabilistic models with minimal model-specific derivations. In the second part of this dissertation, we design probabilistic graphical models to accommodate multimodal data, describe dynamical behaviors and account for task heterogeneity. In particular, the sparse latent factor model is able to reveal common low-dimensional structures from high-dimensional data. We demonstrate the effectiveness of the proposed statistical learning methods on both synthetic and real-world data.
Abstract:
Thesis digitized by the Direction des bibliothèques of the Université de Montréal.
Abstract:
Methane (CH4) concentrations and CH4 stable carbon isotopic composition (d13CCH4) were investigated in the water column within Jaco Scar. It is one of several scars formed by massive slides resulting from the subduction of seamounts offshore Costa Rica, a process that can open up structural and stratigraphical pathways for migrating CH4. The release of large amounts of CH4 into the adjacent water column was discovered at the outcropping lowermost sedimentary sequence of the hanging wall in the northwest corner of Jaco Scar, where concentrations reached up to 1,500 nmol L-1. There CH4-rich fluids seeping from the sedimentary sequence stimulate both growth and activity of a dense chemosynthetic community. Additional point sources supplying CH4 at lower concentrations were identified in density layers above and below the main plume from light carbon isotope ratios. The injected CH4 is most likely a mixture of microbial and thermogenic CH4 as suggested by d13CCH4 values between -50 and -62 per mil Vienna Pee Dee Belemnite. This CH4 spreads along isopycnal surfaces throughout the whole area of the scar, and the concentrations decrease due to mixing with ocean water and microbial oxidation. The supply of CH4 appears to be persistent as repeatedly high CH4 concentrations were found within the scar over 6 years. The maximum CH4 concentration and average excess CH4 concentration at Jaco Scar indicate that CH4 seepage from scars might be as significant as seepage from other tectonic structures in the marine realm. Hence, taking into account the global abundance of scars, such structures might constitute a substantial, hitherto unconsidered contribution to natural CH4 sources at the seafloor.
Abstract:
The first objective of this research was to develop closed-form and numerical probabilistic methods of analysis that can be applied to otherwise conventional methods of unreinforced and geosynthetic reinforced slopes and walls. These probabilistic methods explicitly include random variability of soil and reinforcement, spatial variability of the soil, and cross-correlation between soil input parameters on probability of failure. The quantitative impact of simultaneously considering the influence of random and/or spatial variability in soil properties in combination with cross-correlation in soil properties is investigated for the first time in the research literature. Depending on the magnitude of these statistical descriptors, margins of safety based on conventional notions of safety may be very different from margins of safety expressed in terms of probability of failure (or reliability index). The thesis work also shows that intuitive notions of margin of safety using conventional factor of safety and probability of failure can be brought into alignment when cross-correlation between soil properties is considered in a rigorous manner. The second objective of this thesis work was to develop a general closed-form solution to compute the true probability of failure (or reliability index) of a simple linear limit state function with one load term and one resistance term expressed first in general probabilistic terms and then migrated to a LRFD format for the purpose of LRFD calibration. The formulation considers contributions to probability of failure due to model type, uncertainty in bias values, bias dependencies, uncertainty in estimates of nominal values for correlated and uncorrelated load and resistance terms, and average margin of safety expressed as the operational factor of safety (OFS). Bias is defined as the ratio of measured to predicted value. 
Parametric analyses were carried out to show that ignoring possible correlations between random variables can lead to conservative (safe) values of resistance factor in some cases and in other cases to non-conservative (unsafe) values. Example LRFD calibrations were carried out using different load and resistance models for the pullout internal stability limit state of steel strip and geosynthetic reinforced soil walls together with matching bias data reported in the literature.
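The effect of correlation on a linear limit state with one load term and one resistance term can be illustrated with a minimal sketch. It assumes jointly normal resistance R and load Q, which is a textbook simplification and not the formulation developed in the thesis (which works with bias statistics in an LRFD format). The function names and numeric values are hypothetical.

```python
import math


def reliability_index(mu_r, sigma_r, mu_q, sigma_q, rho=0.0):
    """Reliability index for the limit state g = R - Q.

    Assumes R and Q are jointly normal with correlation coefficient rho
    (an illustrative assumption, not the thesis formulation).
    """
    var_g = sigma_r ** 2 + sigma_q ** 2 - 2.0 * rho * sigma_r * sigma_q
    if var_g <= 0.0:
        raise ValueError("variance of g must be positive")
    return (mu_r - mu_q) / math.sqrt(var_g)


def probability_of_failure(beta):
    """P(g < 0) = Phi(-beta), with Phi the standard normal CDF."""
    return 0.5 * math.erfc(beta / math.sqrt(2.0))


# Hypothetical values: the same means and standard deviations,
# evaluated for three different load-resistance correlations.
for rho in (-0.5, 0.0, 0.5):
    beta = reliability_index(2.0, 0.4, 1.0, 0.3, rho)
    print(f"rho={rho:+.1f}  beta={beta:.3f}  Pf={probability_of_failure(beta):.2e}")
```

Under these assumptions, a positive correlation between R and Q reduces the variance of g and raises the reliability index, while a negative correlation lowers it. This is one simple way to see why ignoring correlation can err in either direction.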
Abstract:
The aim of this study is to explore the suitability of chromospheric images for magnetic modeling of active regions. We use high-resolution images (≈0.2"-0.3") from the Interferometric Bidimensional Spectrometer in the Ca II 8542 Å line, the Rapid Oscillations in the Solar Atmosphere instrument in the Hα 6563 Å line, and the Interface Region Imaging Spectrograph in the 2796 Å line, and compare non-potential magnetic field models obtained from those chromospheric images with those obtained from images of the Atmospheric Imaging Assembly in coronal (171 Å, etc.) and in chromospheric (304 Å) wavelengths. Curvi-linear structures are automatically traced in those images with the OCCULT-2 code, to which we forward-fitted magnetic field lines computed with the Vertical-current Approximation Nonlinear Force Free Field code. We find that: (1) the chromospheric images reveal crisp curvi-linear structures (fibrils, loop segments, spicules) that are extremely well-suited for constraining magnetic modeling; (2) these curvi-linear structures are field-aligned with the best-fit solution to within a median misalignment angle of μ2 ≈ 4°–7°; (3) the free energy computed from coronal data may underestimate that obtained from chromospheric data by a factor of ≈2–4; (4) the height range of chromospheric features is confined to h ≲ 4000 km, while coronal features are detected up to h = 35,000 km; and (5) the plasma-β parameter is β ≈ 10^-5 to 10^-1 for all traced features. We conclude that chromospheric images reveal important magnetic structures that are complementary to coronal images and need to be included in comprehensive magnetic field models, something that is currently not accommodated in standard NLFFF codes.
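A median misalignment angle of the kind quoted above can be sketched as follows. This is only an illustration of the general idea and not the OCCULT-2 or VCA-NLFFF implementation. The direction vectors, function names, and the choice to ignore the sign of the direction (a traced curve has no preferred orientation) are all assumptions.

```python
import math
import statistics


def misalignment_deg(u, v):
    """Angle in degrees (0 to 90) between two direction vectors.

    The absolute value of the dot product is used, so u and -u count
    as the same direction (an assumption made for this sketch).
    """
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    cosine = min(1.0, abs(dot) / (norm_u * norm_v))  # guard against rounding
    return math.degrees(math.acos(cosine))


def median_misalignment_deg(traced, model):
    """Median angle between paired traced and modeled directions."""
    return statistics.median(misalignment_deg(u, v) for u, v in zip(traced, model))


# Hypothetical tangent vectors of traced structures and of model field lines.
traced_dirs = [(1.0, 0.0), (1.0, 0.1), (0.0, 1.0)]
model_dirs = [(1.0, 0.05), (1.0, 0.0), (0.1, 1.0)]
print(median_misalignment_deg(traced_dirs, model_dirs))
```

The median is used here because it is less sensitive than the mean to a few badly fitted structures.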
Abstract:
A major weakness among loading models for pedestrians walking on flexible structures proposed in recent years is the various uncorroborated assumptions made in their development. This applies to spatio-temporal characteristics of pedestrian loading and the nature of multi-object interactions. To alleviate this problem, a framework for the determination of localised pedestrian forces on full-scale structures is presented using a wireless attitude and heading reference system (AHRS). An AHRS comprises a triad of tri-axial accelerometers, gyroscopes and magnetometers managed by a dedicated data processing unit, allowing motion in three-dimensional space to be reconstructed. A pedestrian loading model based on a single-point inertial measurement from an AHRS is derived and shown to perform well against benchmark data collected on an instrumented treadmill. Unlike other models, the current model does not take any predefined form, nor does it require any extrapolations as to the timing and amplitude of pedestrian loading. In order to assess correctly the influence of the moving pedestrian on the behaviour of a structure, an algorithm for tracking the point of application of pedestrian force is developed based on data from a single AHRS attached to a foot. A set of controlled walking tests with a single pedestrian is conducted on a real footbridge for validation purposes. A remarkably good match between the measured and simulated bridge response is found, confirming the applicability of the proposed framework.
Abstract:
Five G protein-coupled receptors (GPCRs) have been identified to be activated by free fatty acids (FFA). Among them, FFA1 (GPR40) and FFA4 (GPR120) bind long-chain fatty acids, FFA2 (GPR43) and FFA3 (GPR41) bind short-chain fatty acids and GPR84 binds medium-chain fatty acids. Free fatty acid receptors have now emerged as potential targets for the treatment of diabetes, obesity and immune diseases. The recent progress in crystallography of GPCRs has now enabled the elucidation of the structure of FFA1 and provided reliable templates for homology modelling of other FFA receptors. Analysis of the crystal structure and improved homology models, along with mutagenesis data and structure activity, highlighted an unusual arginine charge pairing interaction in FFA1-3 for receptor modulation, distinct structural features for ligand binding to FFA1 and FFA4 and an arginine of the second extracellular loop as a possible anchoring point for FFA at GPR84. Structural data will be helpful for searching novel small molecule modulators at the FFA receptors.
Abstract:
Thesis (Ph.D.)--University of Washington, 2016-08
Abstract:
In this thesis, the magnetic properties of four transition-metal oxides are presented. Their multiferroic and magnetoelectric phases have been investigated by means of different neutron scattering techniques. The materials TbMnO3 and MnWO4 belong to the group of spin-induced multiferroics. Their ferroelectric polarization can be explained by the inverse Dzyaloshinskii-Moriya interaction. Another common feature of both materials is the presence of successive magnetic transitions from a spin-density wave to a spin spiral. The features of the phase transitions have been studied in both materials, and it could be shown that diffuse magnetic scattering from the spin spiral is present even in the ordered spin-density wave phase. The excitation spectrum in the multiferroic phase of TbMnO3 was investigated in detail and a comprehensive dataset was obtained using time-of-flight spectroscopy. A spin-wave model was obtained that quantitatively describes the full dispersion. Furthermore, the polarization of the zone-center excitations was derived and agrees well with data from inelastic neutron spectroscopy and infrared spectroscopy. With the combination of spherical neutron polarimetry and poling of the sample by an electric field, it was possible to observe the chiral magnetic component of the magnetic excitations in TbMnO3 and MnWO4. The spin-wave model for TbMnO3 obtained in this thesis is able to correctly describe the dispersion of this component. The double tungstate NaFe(WO4)2 is isostructural to the multiferroic MnWO4 and develops a complex magnetic phase diagram. By the use of neutron diffraction techniques, the zero-field structure and the high-field structures in a magnetic field applied along the b-axis could be determined. The data reveal a direct transition into an incommensurate spin-spiral structure. The value of the incommensurability is driven by anharmonic modulations and shows strong hysteresis effects.
The static and dynamic properties in the magnetoelectric spin-glass phase of Ni0.42Mn0.58TiO3 were studied in detail. The spin-glass phase is composed of short-ranged MnTiO3 and NiTiO3-type order. The antiferromagnetic domains could be controlled by crossed magnetic and electric fields, which was visualized using spherical neutron polarimetry. A comprehensive dataset of the magnetic excitations in the spin-glass phase was collected. The dataset revealed correlations in the hexagonal plane which are only weakly coupled along the c-axis. The excitation spectra could be simulated by taking into account the MnTiO3-type order.
Abstract:
New morpho-bathymetric and tectono-stratigraphic data on the Naples and Salerno Gulfs, derived from bathymetric and seismic data analysis and integrated geologic interpretation, are presented here. The CUBE (Combined Uncertainty Bathymetric Estimator) method has been applied to complex morphologies, such as the Capri continental slope and the related geological structures occurring in the Salerno Gulf. The bathymetric data analysis has been carried out for marine geological maps of the whole Campania continental margin at scales ranging from 1:25,000 to 1:10,000, including focused examples in the Naples and Salerno Gulfs, Naples harbour, Capri and Ischia Islands and the Salerno Valley. Seismic data analysis has allowed the main morpho-structural lineaments recognized at a regional scale through multichannel profiles to be correlated with morphological features cropping out at the sea bottom, evident from bathymetry. The main fault systems in the area have been represented on a tectonic sketch map, including the master fault located north of the Salerno Valley half-graben. Some normal faults parallel to the master fault have been interpreted from the slope map derived from bathymetric data. A complex system of antithetic faults bounds two morpho-structural highs located 20 km south of Capri Island. Seismic interpretation shows some hints of compressional reactivation of normal faults in an extensional setting involving the whole Campania continental margin.
Abstract:
The protein lysate array is an emerging technology for quantifying the protein concentration ratios in multiple biological samples. It is gaining popularity, and has the potential to answer questions about post-translational modifications and protein pathway relationships. Statistical inference for a parametric quantification procedure has been inadequately addressed in the literature, mainly due to two challenges: the increasing dimension of the parameter space and the need to account for dependence in the data. Each chapter of this thesis addresses one of these issues. In Chapter 1, an introduction to the protein lysate array quantification is presented, followed by the motivations and goals for this thesis work. In Chapter 2, we develop a multi-step procedure for the Sigmoidal models, ensuring consistent estimation of the concentration level with full asymptotic efficiency. The results obtained in this chapter justify inferential procedures based on large-sample approximations. Simulation studies and real data analysis are used to illustrate the performance of the proposed method in finite-samples. The multi-step procedure is simpler in both theory and computation than the single-step least squares method that has been used in current practice. In Chapter 3, we introduce a new model to account for the dependence structure of the errors by a nonlinear mixed effects model. We consider a method to approximate the maximum likelihood estimator of all the parameters. Using the simulation studies on various error structures, we show that for data with non-i.i.d. errors the proposed method leads to more accurate estimates and better confidence intervals than the existing single-step least squares method.
Abstract:
The recent advent of new technologies has led to huge amounts of genomic data. With these data come new opportunities to understand biological cellular processes underlying hidden regulation mechanisms and to identify disease-related biomarkers for informative diagnostics. However, extracting biological insights from the immense amounts of genomic data is a challenging task. Therefore, effective and efficient computational techniques are needed to analyze and interpret genomic data. In this thesis, novel computational methods are proposed to address such challenges: a Bayesian mixture model, an extended Bayesian mixture model, and an Eigen-brain approach. The Bayesian mixture framework involves integration of the Bayesian network and the Gaussian mixture model. Based on the proposed framework and its conjunction with K-means clustering and principal component analysis (PCA), biological insights are derived, such as context-specific/dependent relationships and nested structures within microarray data where biological replicates are encapsulated. The Bayesian mixture framework is then extended to explore posterior distributions of network space by incorporating a Markov chain Monte Carlo (MCMC) model. The extended Bayesian mixture model summarizes the sampled network structures by extracting biologically meaningful features. Finally, an Eigen-brain approach is proposed to analyze in situ hybridization data for the identification of cell-type-specific genes, which can be useful for informative blood diagnostics. Computational results with region-based clustering reveal critical evidence of consistency with brain anatomical structure.
Abstract:
The speed with which data has moved from being scarce, expensive and valuable (thus justifying detailed and careful verification and analysis) to a situation where the streams of detailed data are almost too large to handle has caused a series of shifts. Legal systems already have severe problems keeping up with, or even in touch with, the rate at which unexpected outcomes flow from information technology. The capacity to harness massive quantities of existing data has driven Big Data applications until recently. Now real-time data flows are rising swiftly, becoming more invasive and offering monitoring potential that is eagerly sought by commerce and government alike. The ambiguities as to who owns this often remarkably intrusive personal data need to be resolved, and rapidly, but are likely to encounter rising resistance from industrial and commercial bodies who see this data flow as 'theirs'. There have been many changes in ICT that have led to stresses in the resolution of the conflicts between IP exploiters and their customers, but this one is of a different scale due to the wide potential for individual customisation of pricing, identification and the rising commercial value of integrated streams of diverse personal data. A new reconciliation between the parties involved is needed: new business models, and a shift from the current confusion over who owns what data to arrangements that better accord with community expectations. After all, the members of the community are the customers, and the emergence of information monopolies needs to be balanced by appropriate consumer/subject rights. This will be a difficult discussion, but one that is needed to realise the great benefits to all that are clearly available if these issues can be positively resolved. The customers need to make these data flows contestable in some form. These big data flows are only going to grow and become ever more instructive.
A better balance is necessary. For the first time, these changes are directly affecting the governance of democracies, as the very effective micro-targeting tools deployed in recent elections have shown. Yet the data gathered is not available to the subjects. This is not a survivable social model. The Private Data Commons needs our help. Businesses and governments exploit big data without regard for issues of legality, data quality, disparate data meanings, and process quality. This often results in poor decisions, with individuals bearing the greatest risk. The threats harbored by big data extend far beyond the individual, however, and call for new legal structures, business processes, and concepts such as a Private Data Commons. This Web extra is the audio part of a video in which author Marcus Wigan expands on his article "Big Data's Big Unintended Consequences" and discusses these issues.
Abstract:
International audience