416 results for STRUCTURAL INFORMATION
Abstract:
Models of word meaning, built from a corpus of text, have demonstrated success in emulating human performance on a number of cognitive tasks. Many of these models use geometric representations of words to store semantic associations between words. Often word order information is not captured in these models. The lack of structural information used by these models has been raised as a weakness when performing cognitive tasks. This paper presents an efficient tensor-based approach to modelling word meaning that builds on recent attempts to encode word order information, while providing flexible methods for extracting task-specific semantic information.
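To illustrate why missing word order matters, here is a minimal sketch. It is not the paper's tensor model: the function names and the two toy sentences are assumptions made for illustration. It compares an order-blind co-occurrence count with one that also records whether a neighbour appeared before or after the target word.

```python
from collections import Counter


def symmetric_counts(tokens, window=1):
    """Order-blind co-occurrence: count (word, neighbour) pairs inside the window."""
    counts = Counter()
    for i, word in enumerate(tokens):
        lo, hi = max(0, i - window), min(len(tokens), i + window + 1)
        for j in range(lo, hi):
            if j != i:
                counts[(word, tokens[j])] += 1
    return counts


def ordered_counts(tokens, window=1):
    """Order-aware co-occurrence: also record on which side the neighbour occurred."""
    counts = Counter()
    for i, word in enumerate(tokens):
        lo, hi = max(0, i - window), min(len(tokens), i + window + 1)
        for j in range(lo, hi):
            if j != i:
                side = "before" if j < i else "after"
                counts[(word, tokens[j], side)] += 1
    return counts


# Hypothetical toy sentences with the same words in opposite order.
sentence_a = "dog bites man".split()
sentence_b = "man bites dog".split()

same_without_order = symmetric_counts(sentence_a) == symmetric_counts(sentence_b)
same_with_order = ordered_counts(sentence_a) == ordered_counts(sentence_b)
```

With the order-blind counts the two sentences are indistinguishable, whereas the extra "side" index separates them. That extra index is the kind of structural information a higher-order (tensor) representation can carry.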
Abstract:
As business process management technology matures, organisations acquire more and more business process models. The resulting collections can consist of hundreds, even thousands of models and their management poses real challenges. One of these challenges concerns model retrieval where support should be provided for the formulation and efficient execution of business process model queries. As queries based on only structural information cannot deal with all querying requirements in practice, there should be support for queries that require knowledge of process model semantics. In this paper we formally define a process model query language that is based on semantic relationships between tasks. This query language is independent of the particular process modelling notation used, but we will demonstrate how it can be used in the context of Petri nets by showing how the semantic relationships can be determined for these nets in such a way that state space explosion is avoided as much as possible. An experiment with three large process model repositories shows that queries expressed in our language can be evaluated efficiently.
Abstract:
This paper develops and evaluates an enhanced corpus-based approach for semantic processing. Corpus-based models that build representations of words directly from text do not require pre-existing linguistic knowledge, and have demonstrated psychologically relevant performance on a number of cognitive tasks. However, they have been criticised in the past for not incorporating sufficient structural information. Using ideas underpinning recent attempts to overcome this weakness, we develop an enhanced tensor encoding model to build representations of word meaning for semantic processing. Our enhanced model demonstrates superior performance when compared to a robust baseline model on a number of semantic processing tasks.
Abstract:
As business process management technology matures, organisations acquire more and more business process models. The management of the resulting collections of process models poses real challenges. One of these challenges concerns model retrieval where support should be provided for the formulation and efficient execution of business process model queries. As queries based on only structural information cannot deal with all querying requirements in practice, there should be support for queries that require knowledge of process model semantics. In this paper we formally define a process model query language that is based on semantic relationships between tasks in process models and is independent of any particular process modelling notation.
Abstract:
The assembly of retroviruses such as HIV-1 is driven by oligomerization of their major structural protein, Gag. Gag is a multidomain polyprotein including three conserved folded domains: MA (matrix), CA (capsid) and NC (nucleocapsid)(1). Assembly of an infectious virion proceeds in two stages(2). In the first stage, Gag oligomerization into a hexameric protein lattice leads to the formation of an incomplete, roughly spherical protein shell that buds through the plasma membrane of the infected cell to release an enveloped immature virus particle. In the second stage, cleavage of Gag by the viral protease leads to rearrangement of the particle interior, converting the non-infectious immature virus particle into a mature infectious virion. The immature Gag shell acts as the pivotal intermediate in assembly and is a potential target for anti-retroviral drugs both in inhibiting virus assembly and in disrupting virus maturation(3). However, detailed structural information on the immature Gag shell has not previously been available. For this reason it is unclear what protein conformations and interfaces mediate the interactions between domains and therefore the assembly of retrovirus particles, and what structural transitions are associated with retrovirus maturation. Here we solve the structure of the immature retroviral Gag shell from Mason-Pfizer monkey virus by combining cryo-electron microscopy and tomography. The 8-angstrom resolution structure permits the derivation of a pseudo-atomic model of CA in the immature retrovirus, which defines the protein interfaces mediating retrovirus assembly. 
We show that transition of an immature retrovirus into its mature infectious form involves marked rotations and translations of CA domains, that the roles of the amino-terminal and carboxy-terminal domains of CA in assembling the immature and mature hexameric lattices are exchanged, and that the CA interactions that stabilize the immature and mature viruses are almost completely distinct.
Abstract:
Several major human pathogens, including the filoviruses, paramyxoviruses, and rhabdoviruses, package their single-stranded RNA genomes within helical nucleocapsids, which bud through the plasma membrane of the infected cell to release enveloped virions. The virions are often heterogeneous in shape, which makes it difficult to study their structure and assembly mechanisms. We have applied cryo-electron tomography and sub-tomogram averaging methods to derive structures of Marburg virus, a highly pathogenic filovirus, both after release and during assembly within infected cells. The data demonstrate the potential of cryo-electron tomography methods to derive detailed structural information for intermediate steps in biological pathways within intact cells. We describe the location and arrangement of the viral proteins within the virion. We show that the N-terminal domain of the nucleoprotein contains the minimal assembly determinants for a helical nucleocapsid with variable number of proteins per turn. Lobes protruding from alternate interfaces between each nucleoprotein are formed by the C-terminal domain of the nucleoprotein, together with viral proteins VP24 and VP35. Each nucleoprotein packages six RNA bases. The nucleocapsid interacts in an unusual, flexible "Velcro-like" manner with the viral matrix protein VP40. Determination of the structures of assembly intermediates showed that the nucleocapsid has a defined orientation during transport and budding. Together the data show striking architectural homology between the nucleocapsid helix of rhabdoviruses and filoviruses, but unexpected, fundamental differences in the mechanisms by which the nucleocapsids are then assembled together with matrix proteins and initiate membrane envelopment to release infectious virions, suggesting that the viruses have evolved different solutions to these conserved assembly steps.
Abstract:
The detailed characterization of protein N-glycosylation is very demanding given the many different glycoforms and structural isomers that can exist on glycoproteins. Here we report a fast and sensitive method for the extensive structure elucidation of reducing-end labeled N-glycan mixtures using a combination of capillary normal-phase HPLC coupled off-line to matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF-MS) and TOF/TOF-MS/MS. Using this method, isobaric N-glycans released from honey bee phospholipase A2 and Arabidopsis thaliana glycoproteins were separated by normal-phase chromatography and subsequently identified by key fragment ions in the MALDI-TOF/TOF tandem mass spectra. In addition, linkage and branching information were provided by abundant cross-ring and "elimination" fragment ions in the MALDI-CID spectra that gave extensive structural information. Furthermore, the fragmentation characteristics of N-glycans reductively aminated with 2-aminobenzoic acid and 2-aminobenzamide were compared. The identification of N-glycans containing 3-linked core fucose was facilitated by distinctive ions present only in the MALDI-CID spectra of 2-aminobenzoic acid-labeled oligosaccharides. To our knowledge, this is the first MS/MS-based technique that allows confident identification of N-glycans containing 3-linked core fucose, which is a major allergenic determinant on insect and plant glycoproteins.
Abstract:
The use of Wireless Sensor Networks (WSNs) for Structural Health Monitoring (SHM) has become a promising approach due to many advantages such as low cost and fast, flexible deployment. However, inherent technical issues such as data synchronization error and data loss have prevented these distinct systems from being extensively used. Recently, several SHM-oriented WSNs have been proposed and are believed to be able to overcome a large number of technical uncertainties. Nevertheless, there is limited research examining the effects of uncertainties in generic WSN platforms and verifying the capability of SHM-oriented WSNs, particularly on demanding SHM applications like modal analysis and damage identification of real civil structures. This article first reviews the major technical uncertainties of both generic and SHM-oriented WSN platforms and the efforts of the SHM research community to cope with them. Then, the effects of the most inherent WSN uncertainty on the first level of a common Output-only Modal-based Damage Identification (OMDI) approach are intensively investigated. Experimental accelerations collected by a wired sensory system on a benchmark civil structure are initially used as clean data before being contaminated with different levels of data pollutants to simulate practical uncertainties in both WSN platforms. Statistical analyses are comprehensively employed in order to uncover the distribution pattern of the uncertainty influence on the OMDI approach. The result of this research shows that uncertainties of generic WSNs can seriously affect level 1 OMDI methods utilizing mode shapes. It also proves that SHM-oriented WSNs can substantially lessen this impact and obtain true structural information without resorting to costly computational solutions.
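The contamination step described above, where clean wired data is polluted to mimic WSN uncertainties, could be sketched as follows. This is only an illustration under stated assumptions: the function `contaminate`, its parameters, the zero-padding of the delayed channel and the use of `None` for lost samples are all hypothetical choices, not the article's procedure.

```python
import random


def contaminate(samples, loss_rate, sync_shift, seed=0):
    """Return a polluted copy of one acceleration channel.

    sync_shift: synchronization error, modelled here as a delay of that many
                samples (assumes 0 <= sync_shift <= len(samples)).
    loss_rate:  probability that any given sample is lost (replaced by None).
    """
    rng = random.Random(seed)  # fixed seed so a contamination level is repeatable
    n = len(samples)
    # Delay the channel and pad the start with zeros so the length is unchanged.
    delayed = [0.0] * sync_shift + list(samples[: n - sync_shift])
    # Drop samples independently with probability loss_rate.
    return [None if rng.random() < loss_rate else value for value in delayed]


# Hypothetical clean record: ten samples of a ramp signal.
clean = [float(i) for i in range(10)]
shifted_only = contaminate(clean, loss_rate=0.0, sync_shift=2)
all_lost = contaminate(clean, loss_rate=1.0, sync_shift=0)
```

Sweeping `loss_rate` and `sync_shift` over a grid and repeating with different seeds would give the kind of sample on which statistical analysis of the downstream damage-identification results can be run.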
Abstract:
The μO-conotoxins are an intriguing class of conotoxins targeting various voltage-dependent sodium channels and molluscan calcium channels. In the current study, we have shown MrVIA and MrVIB to be the first known peptidic inhibitors of the transient tetrodotoxin-resistant (TTX-R) Na+ current in rat dorsal root ganglion neurons, in addition to inhibiting tetrodotoxin-sensitive Na+ currents. Human TTX-R sodium channels are a therapeutic target for indications such as pain, highlighting the importance of the μO-conotoxins as potential leads for drug development. Furthermore, we have used NMR spectroscopy to provide the first structural information on this class of conotoxins. MrVIA and MrVIB are hydrophobic peptides that aggregate in aqueous solution but were solubilized in 50% acetonitrile/water. The three-dimensional structure of MrVIB consists of a small β-sheet and a cystine knot arrangement of the three disulfide bonds. It contains four backbone “loops” between successive cysteine residues that are exposed to the solvent to varying degrees. The largest of these, loop 2, is the most disordered part of the molecule, most likely due to flexibility in solution. This disorder is the most striking difference between the structures of MrVIB and the known δ- and ω-conotoxins, which along with the μO-conotoxins are members of the O superfamily. Loop 2 of ω-conotoxins has previously been shown to contain residues critical for binding to voltage-gated calcium channels, and it is interesting to speculate that the flexibility observed in MrVIB may accommodate binding to both sodium and molluscan calcium channels.
Abstract:
Modern lipidomics relies heavily on mass spectrometry for the structural characterization and quantification of lipids of biological origins. Structural information is gained by tandem mass spectrometry (MS/MS) whereby lipid ions are fragmented to elucidate lipid class, fatty acid chain length, and degree of unsaturation. Unfortunately, however, in most cases double bond position cannot be assigned based on MS/MS data alone and thus significant structural diversity is hidden from such analyses. For this reason, we have developed two online methods for determining double bond position within unsaturated lipids: ozone electrospray ionization mass spectrometry (OzESI-MS) and ozone-induced dissociation (OzID). Both techniques utilize ozone to cleave carbon-carbon double bonds, producing chemically induced fragment ions that locate the position(s) of unsaturation.
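As a worked illustration of how ozonolysis fragments can locate a double bond, the sketch below computes the neutral losses expected when ozone cleaves a double bond at position n-x. The abstract gives no numbers; the function name and the simplifying assumption that the chain segment between the cleaved bond and the methyl terminus is saturated are illustrative. Cleavage removes the terminal C_xH_2x segment and leaves either one oxygen (aldehyde ion) or two oxygens (Criegee ion) on the charged fragment.

```python
# Monoisotopic atomic masses in Da.
MASS_C = 12.0
MASS_H = 1.00782503
MASS_O = 15.99491462


def ozonolysis_neutral_losses(n_minus):
    """Neutral losses (aldehyde, Criegee) in Da for a double bond at position n-x.

    Assumes the segment between the cleaved double bond and the methyl end
    is fully saturated, so the removed piece is C_xH_2x with x = n_minus.
    """
    removed_segment = n_minus * MASS_C + 2 * n_minus * MASS_H  # C_xH_2x
    aldehyde_loss = removed_segment - MASS_O       # one O added to the ion
    criegee_loss = removed_segment - 2 * MASS_O    # two O added to the ion
    return aldehyde_loss, criegee_loss


# Example: n-9 and n-7 double bonds give clearly different losses.
loss_n9 = ozonolysis_neutral_losses(9)
loss_n7 = ozonolysis_neutral_losses(7)
```

Under these assumptions an n-9 double bond gives losses of about 110.15 and 94.15 Da, and an n-7 double bond about 82.11 and 66.12 Da, so the pair of losses observed identifies the position of unsaturation.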
Abstract:
RATIONALE: Both traditional electron ionization and electrospray ionization tandem mass spectrometry have demonstrated limitations in the unambiguous identification of fatty acids. In the former case, high electron energies lead to extensive dissociation of the radical cations from which little specific structural information can be obtained. In the latter, conventional collision-induced dissociation (CID) of even-electron ions provides little intra-chain fragmentation and thus few structural diagnostics. New approaches that harness the desirable features of both methods, namely radical-driven dissociation with discrete energy deposition, are thus required. METHODS: Herein we describe the derivatization of a structurally diverse suite of fatty acids as 4-iodobenzyl esters (FAIBE). Electrospray ionization of these derivatives in the presence of sodium acetate yields abundant [M+Na]+ ions that can be mass-selected and subjected to laser irradiation (λ = 266 nm) on a modified linear ion-trap mass spectrometer. RESULTS: Photodissociation (PD) of the FAIBE derivatives yields abundant radical cations by loss of atomic iodine, and in several cases selective dissociation of activated carbon-carbon bonds (e.g., at allylic positions) is also observed. Subsequent CID of the [M+Na-I]•+ radical cations yields radical-directed dissociation (RDD) mass spectra that reveal extensive carbon-carbon bond dissociation without scrambling of molecular information. CONCLUSIONS: Both PD and RDD spectra obtained from derivatized fatty acids provide a wealth of structural information including the position(s) of unsaturation, chain-branching and hydroxylation. The structural information obtained by this approach, in particular the ability to rapidly differentiate isomeric lipids, represents a useful addition to the lipidomics tool box. Copyright (c) 2013 John Wiley & Sons, Ltd.
Abstract:
Phospholipids are the key structural component of cell membranes, and recent advances in electrospray ionization mass spectrometry provide for the fast and efficient analysis of these compounds in biological extracts (1-3). The application of electrospray ionization tandem mass spectrometry (ESI-MS/MS) to phospholipid analysis has demonstrated several key advantages over the more traditional chromatographic methods, including speed and greater structural information (4). For example, the ESI-MS/MS spectrum of a typical phospholipid, particularly in negative ion mode, readily identifies the carbon chain length and the degree of unsaturation of each of the fatty acids esterified to the parent molecule (5). A critical limitation of conventional ESI-MS/MS analysis, however, is the inability to uniquely identify the position of double bonds within the fatty acid chains. This is especially problematic given the importance of double bond position in determining the biological function of lipid classes (6). Previous attempts to identify double bond position in intact phospholipids using mass spectrometry employ either MS3 or offline chemical derivatization (7-11). The former method requires specialized instrumentation and is rarely applied, while the latter methods suffer from complications inherent in sample handling prior to analysis. In this communication we outline a novel on-line approach for the identification of double bond position in intact phospholipids. In our method, the double bond(s) present in unsaturated phospholipids are cleaved by ozonolysis within the ion source of a conventional ESI mass spectrometer to give two chemically induced fragment ions that may be used to unambiguously assign the position of the double bond. This is achieved by using oxygen as the electrospray nebulizing gas in combination with high electrospray voltages to initiate the formation of an ozone-producing.
Abstract:
Computational models in physiology often integrate functional and structural information from a large range of spatio-temporal scales from the ionic to the whole organ level. Their sophistication raises both expectations and scepticism concerning how computational methods can improve our understanding of living organisms and also how they can reduce, replace and refine animal experiments. A fundamental requirement to fulfil these expectations and achieve the full potential of computational physiology is a clear understanding of what models represent and how they can be validated. The present study aims at informing strategies for validation by elucidating the complex interrelations between experiments, models and simulations in cardiac electrophysiology. We describe the processes, data and knowledge involved in the construction of whole ventricular multiscale models of cardiac electrophysiology. Our analysis reveals that models, simulations, and experiments are intertwined, in an assemblage that is a system itself, namely the model-simulation-experiment (MSE) system. Validation must therefore take into account the complex interplay between models, simulations and experiments. Key points for developing strategies for validation are: 1) understanding sources of bio-variability is crucial to the comparison between simulation and experimental results; 2) robustness of techniques and tools is a pre-requisite to conducting physiological investigations using the MSE system; 3) definition and adoption of standards facilitates interoperability of experiments, models and simulations; 4) physiological validation must be understood as an iterative process that defines the specific aspects of electrophysiology the MSE system targets, and is driven by advancements in experimental and computational methods and the combination of both.
Abstract:
Identifying unusual or anomalous patterns in an underlying dataset is an important but challenging task in many applications. The focus of the unsupervised anomaly detection literature has mostly been on vectorised data. However, many applications are more naturally described using higher-order tensor representations. Approaches that vectorise tensorial data can destroy the structural information encoded in the high-dimensional space, and lead to the problem of the curse of dimensionality. In this paper we present the first unsupervised tensorial anomaly detection method, along with a randomised version of our method. Our anomaly detection method, the One-class Support Tensor Machine (1STM), is a generalisation of conventional one-class Support Vector Machines to higher-order spaces. 1STM preserves the multiway structure of tensor data, while achieving significant improvement in accuracy and efficiency over conventional vectorised methods. We then leverage the theory of nonlinear random projections to propose the Randomised 1STM (R1STM). Our empirical analysis on several real and synthetic datasets shows that our R1STM algorithm delivers comparable or better accuracy to a state-of-the-art deep learning method and traditional kernelised approaches for anomaly detection, while being approximately 100 times faster in training and testing.
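The abstract does not say which nonlinear random projection R1STM uses. As one well-known example of the idea, the sketch below uses random Fourier features, which map inputs into a randomised feature space where a plain dot product approximates an RBF kernel. This is an assumption chosen for illustration, not the authors' method; the function names, `gamma`, and the test points are hypothetical.

```python
import math
import random


def make_random_fourier_map(dim, n_features, gamma, seed=0):
    """Build a map z(x) such that z(x) . z(y) approximates exp(-gamma * ||x - y||^2)."""
    rng = random.Random(seed)
    # For this kernel the random frequencies follow N(0, 2 * gamma) per coordinate.
    sigma = math.sqrt(2.0 * gamma)
    weights = [[rng.gauss(0.0, sigma) for _ in range(dim)] for _ in range(n_features)]
    offsets = [rng.uniform(0.0, 2.0 * math.pi) for _ in range(n_features)]
    scale = math.sqrt(2.0 / n_features)

    def project(x):
        return [
            scale * math.cos(sum(w_i * x_i for w_i, x_i in zip(w, x)) + b)
            for w, b in zip(weights, offsets)
        ]

    return project


def rbf_kernel(x, y, gamma):
    """Exact RBF kernel value, used here only to check the approximation."""
    return math.exp(-gamma * sum((a - b) ** 2 for a, b in zip(x, y)))


gamma = 0.5
project = make_random_fourier_map(dim=3, n_features=5000, gamma=gamma)
x, y = [0.1, 0.2, 0.3], [0.4, 0.1, -0.2]
approx = sum(a * b for a, b in zip(project(x), project(y)))
exact = rbf_kernel(x, y, gamma)
```

Because the projected data can be handled by a linear model, training avoids building a full kernel matrix. This is the general reason randomised variants of kernel methods can be much faster than their kernelised counterparts.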
Abstract:
The insulin receptor (IR), the insulin-like growth factor 1 receptor (IGF1R) and the insulin receptor-related receptor (IRR) are covalently-linked homodimers made up of several structural domains. The molecular mechanism of ligand binding to the ectodomain of these receptors and the resulting activation of their tyrosine kinase domain is still not well understood. We have carried out an amino acid residue conservation analysis in order to reconstruct the phylogeny of the IR Family. We have confirmed the location of ligand binding site 1 of the IGF1R and IR. Importantly, we have also predicted the likely location of the insulin binding site 2 on the surface of the fibronectin type III domains of the IR. An evolutionary conserved surface on the second leucine-rich domain that may interact with the ligand could not be detected. We suggest a possible mechanical trigger of the activation of the IR that involves a slight ‘twist’ rotation of the last two fibronectin type III domains in order to face the likely location of insulin. Finally, a strong selective pressure was found amongst the IRR orthologous sequences, suggesting that this orphan receptor has a yet unknown physiological role which may be conserved from amphibians to mammals.