887 results for Ontologies Representing the same Conceptualisation
Abstract:
OntoTag - A Linguistic and Ontological Annotation Model Suitable for the Semantic Web
1. INTRODUCTION. LINGUISTIC TOOLS AND ANNOTATIONS: THEIR LIGHTS AND SHADOWS
Computational Linguistics is already a consolidated research area. It builds upon the results of two other major areas, namely Linguistics and Computer Science and Engineering, and it aims at developing computational models of human language (or natural language, as it is termed in this area). Possibly, its best-known applications are the different tools developed so far for processing human language, such as machine translation systems and speech recognizers or dictation programs.
These tools for processing human language are commonly referred to as linguistic tools. Apart from the examples mentioned above, there are also other types of linguistic tools that perhaps are not so well-known, but on which most of the other applications of Computational Linguistics are built. These other types of linguistic tools comprise POS taggers, natural language parsers and semantic taggers, amongst others. All of them can be termed linguistic annotation tools.
Linguistic annotation tools are important assets. In fact, POS and semantic taggers (and, to a lesser extent, also natural language parsers) have become critical resources for the computer applications that process natural language. Hence, any computer application that has to analyse a text automatically and ‘intelligently’ will include at least a module for POS tagging. The more an application needs to ‘understand’ the meaning of the text it processes, the more linguistic tools and/or modules it will incorporate and integrate.
However, linguistic annotation tools still have some limitations, which can be summarised as follows:
1. Normally, they perform annotations only at a certain linguistic level (that is, Morphology, Syntax, Semantics, etc.).
2. They usually introduce a certain rate of errors and ambiguities when tagging. This error rate ranges from 10 percent up to 50 percent of the units annotated for unrestricted, general texts.
3. Their annotations are most frequently formulated in terms of an annotation schema designed and implemented ad hoc.
A priori, it seems that the interoperation and the integration of several linguistic tools into an appropriate software architecture could most likely solve the limitations stated in (1). Besides, integrating several linguistic annotation tools and making them interoperate could also minimise the limitation stated in (2). Nevertheless, in the latter case, all these tools should produce annotations for a common level, which would have to be combined in order to correct their corresponding errors and inaccuracies. Yet, the limitation stated in (3) prevents both types of integration and interoperation from being easily achieved.
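The idea of combining annotations produced for a common level can be illustrated with a minimal sketch. It assumes that the tools already share one tagset and one tokenisation (precisely what limitation (3) prevents in practice) and it uses simple majority voting, which is an illustrative choice and not necessarily the combination strategy of the model described here. The function name `combine_tags` and the sample tagger outputs are hypothetical.

```python
from collections import Counter


def combine_tags(annotations):
    """Combine per-token tags from several taggers by majority vote.

    `annotations` is a list of tag sequences, one per tagger, all aligned
    to the same tokens. Ties are left unresolved and reported as None.
    """
    if len({len(seq) for seq in annotations}) > 1:
        raise ValueError("all taggers must annotate the same tokens")
    combined = []
    for votes in zip(*annotations):
        ranked = Counter(votes).most_common()
        top_tag, top_count = ranked[0]
        # A tie exists when the runner-up has as many votes as the winner.
        tie = len(ranked) > 1 and ranked[1][1] == top_count
        combined.append(None if tie else top_tag)
    return combined


# Hypothetical outputs of three POS taggers for "the old man the boats".
tagger_a = ["DET", "ADJ", "NOUN", "DET", "NOUN"]
tagger_b = ["DET", "NOUN", "VERB", "DET", "NOUN"]
tagger_c = ["DET", "ADJ", "VERB", "DET", "NOUN"]
combined = combine_tags([tagger_a, tagger_b, tagger_c])
```

In this sketch a single tagger's error on a token is outvoted by the other two, while unresolved ties are surfaced as `None` instead of being guessed.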
In addition, most high-level annotation tools rely on other lower-level annotation tools and their outputs to generate their own. For example, sense-tagging tools (operating at the semantic level) often use POS taggers (operating at a lower level, i.e., the morphosyntactic one) to identify the grammatical category of the word or lexical unit they are annotating. Accordingly, if a faulty or inaccurate low-level annotation tool is to be used by another higher-level one in its process, the errors and inaccuracies of the former should be minimised in advance. Otherwise, these errors and inaccuracies would be transferred to (and even magnified in) the annotations of the high-level annotation tool.
Therefore, it would be quite useful to find a way to
(i) correct or, at least, reduce the errors and the inaccuracies of lower-level linguistic tools;
(ii) unify the annotation schemas of different linguistic annotation tools or, more generally speaking, make these tools (as well as their annotations) interoperate.
Clearly, solving (i) and (ii) should ease the automatic annotation of web pages by means of linguistic tools, and their transformation into Semantic Web pages (Berners-Lee, Hendler and Lassila, 2001). Yet, as stated above, (ii) is a type of interoperability problem. There again, ontologies (Gruber, 1993; Borst, 1997) have been successfully applied thus far to solve several interoperability problems. Hence, ontologies should also help solve the aforementioned problems and limitations of linguistic annotation tools.
Thus, to summarise, the main aim of the present work was to somehow combine these separate approaches, mechanisms and tools for annotation from Linguistics and Ontological Engineering (and the Semantic Web) in a sort of hybrid (linguistic and ontological) annotation model, suitable for both areas. This hybrid (semantic) annotation model should (a) benefit from the advances, models, techniques, mechanisms and tools of these two areas; (b) minimise (and even solve, when possible) some of the problems found in each of them; and (c) be suitable for the Semantic Web. The concrete goals that helped attain this aim are presented in the following section.
2. GOALS OF THE PRESENT WORK
As mentioned above, the main goal of this work was to specify a hybrid (that is, linguistically-motivated and ontology-based) model of annotation suitable for the Semantic Web (i.e. it had to produce a semantic annotation of web page contents). This entailed that the tags included in the annotations of the model had to (1) represent linguistic concepts (or linguistic categories, as they are termed in ISO/DCR (2008)), in order for this model to be linguistically-motivated; (2) be ontological terms (i.e., use an ontological vocabulary), in order for the model to be ontology-based; and (3) be structured (linked) as a collection of ontology-based
Abstract:
The tsunami deposits of the valley of Agaete (Pérez-Torrado et al., 2006), north-western Gran Canaria, attributed to the Güímar flank collapse in Tenerife, have been revisited and new data are presented here. Besides the occurrences reported by Pérez-Torrado et al. (2006), a new outcrop was found and named “La Ruina” (at 28º 05’ 47,41” N; 15º 41’ 52,04” W; 71 m asl). The above-mentioned authors suggested the possibility that more than one marine conglomerate deposit could be present in the outcrops of “Llanos de Turmán” and “Berrazales”. At “La Gasolinera” and “La Aldea 1” the conglomerates are formed by a single layer representing one depositional event; at “La Aldea 2”, the conglomerates are composed of two layers in direct contact with each other, but evidence of a time hiatus between them was not found. Although the hypothesis of stacking of two depositional units within the same episode versus deposition of two distinct layers at different moments in time is debatable at the present state of knowledge, the first possibility is favoured. The field evidence at “Llanos de Turmán” and “Berrazales” unquestionably shows that terrestrial sediments (colluvia; paleosols) are present and separate two marine conglomerate deposits, indicating that at least two distinct tsunami inundations are needed to explain the stratigraphy. However, at the new “La Ruina” outcrop, besides the two deposits mentioned above, a third and older marine conglomerate was found, clearly separated in time from the ones cited above. The existence of marine conglomerates emplaced at different moments is evidenced by the occurrence of intercalated paleosols, colluvia and other subaerial materials, implying significant time intervals between the emplacement of marine conglomeratic layers. A number of gastropod opercula from the tsunamiites were sent for U-Th dating to try to further constrain the age span of these deposits.
The field evidence presented above shows that the emplacement of the deposits is related to at least three tsunami events. The lateral correlation between different outcrops is difficult due to the variable number of deposits in each outcrop, lateral discontinuity and variability, and the compositional and textural similarity between distinct tsunami sediments. The occurrence of three Pleistocene tsunami deposits in the same area points to a relatively high frequency of tsunamis (generated by landslides, surface-rupturing earthquakes, fast entry of voluminous volcanic deposits into the sea or large submarine eruptions). It is possible that this recurrence of tsunami inundations may reflect multiple-phased landslides responsible for the mega-landslide scars prominent in the geomorphology of the neighbouring island of Tenerife. This is a contribution from project “Estabilidad de los edificios volcánicos en Canarias: análisis de los factores geológicos, geomecánicos y paleoclimáticos. Aplicación a los flancos N y S de la isla de Tenerife” financed by MCT, Spain.
Abstract:
The use of semantic and Linked Data technologies for Enterprise Application Integration (EAI) has been increasing in recent years. Linked Data and Semantic Web technologies such as the Resource Description Framework (RDF) data model provide several key advantages over the current de facto Web Service and XML-based integration approaches. The flexibility provided by representing the data in a more versatile RDF model using ontologies enables avoiding complex schema transformations and makes data more accessible using Web standards, preventing the formation of data silos. These three benefits represent an edge for Linked Data-based EAI. However, work still has to be performed so that these technologies can cope with the particularities of EAI scenarios in different respects, such as data control, ownership, consistency, or accuracy. The first part of the paper provides an introduction to Enterprise Application Integration using Linked Data and the requirements imposed by EAI on Linked Data technologies, focusing on one of the problems that arise in this scenario, the coreference problem, and presents a coreference service that supports the use of Linked Data in EAI systems. The proposed solution introduces the use of a context that aggregates a set of related identities and mappings from the identities to different resources that reside in distinct applications and provide different views or aspects of the same entity. A detailed architecture of the Coreference Service is presented, explaining how it can be used to manage the contexts, identities, resources, and applications to which they relate. The paper shows how the proposed service can be utilized in an EAI scenario using an example involving a dashboard that integrates data from different systems and the proposed workflow for registering and resolving identities.
As most enterprise applications are driven by business processes and involve legacy data, the proposed approach can be easily incorporated into enterprise applications.
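The context described in this abstract (identities grouped together, each mapped to resources living in different applications) can be pictured with a minimal in-memory sketch. This is only an illustration under stated assumptions: the class name `Context`, its methods `register` and `resolve`, and the example URIs are hypothetical and do not reproduce the Coreference Service's actual architecture or API.

```python
from dataclasses import dataclass, field


@dataclass
class Context:
    """A context groups identities and their per-application resources."""
    name: str
    # identity -> {application name -> resource URI in that application}
    mappings: dict = field(default_factory=dict)

    def register(self, identity, application, resource):
        """Record that `resource` in `application` is a view of `identity`."""
        self.mappings.setdefault(identity, {})[application] = resource

    def resolve(self, identity, application):
        """Return the resource for `identity` in `application`, or None."""
        return self.mappings.get(identity, {}).get(application)


# Hypothetical identities and URIs, for illustration only.
ctx = Context("customers")
ctx.register("customer-42", "crm", "http://crm.example.org/contacts/7")
ctx.register("customer-42", "billing", "http://billing.example.org/accounts/A19")
crm_view = ctx.resolve("customer-42", "crm")
```

A dashboard like the one mentioned in the abstract could, under these assumptions, call `resolve` once per source system to collect the different views of one entity.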
Abstract:
UML is widely accepted as the standard for representing the various software artifacts generated by a development process. For this reason, there have been attempts to use this language to represent the software architecture of systems as well. Unfortunately, these attempts have ended in the same representations (boxes and lines) already criticized by the software architecture community. In this work we propose an extension to the UML metamodel that is able to represent the syntax and semantics of the C3 architectural style. This style is derived from C2. The modifications to define C3 are described in section 4. This proposal is innovative regarding UML extensions for software architectures, since previous proposals were based on lightweight extensions to the UML metamodel, while we propose a heavyweight extension of the metamodel. On the other hand, this proposal is less ambitious than previous proposals, since we do not aim to represent every architectural style in UML, but only one: C3.
Abstract:
What are the limits and modulators of neural precision? We address this question in the most regular biological oscillator known, the electric organ command nucleus in the brainstem of wave-type gymnotiform fish. These fish produce an oscillating electric field, the electric organ discharge (EOD), used in electrolocation and communication. We show here that the EOD precision, measured by the coefficient of variation (CV = SD/mean period) is as low as 2 × 10⁻⁴ in five species representing three families that range widely in species and individual mean EOD frequencies (70–1,250 Hz). Intracellular recording in the pacemaker nucleus (Pn), which commands the EOD cycle by cycle, revealed that individual Pn neurons of the same species also display an extremely low CV (CV = 6 × 10⁻⁴, 0.8 μs SD). Although the EOD CV can remain at its minimum for hours, it varies with novel environmental conditions, during communication, and spontaneously. Spontaneous changes occur as abrupt steps (250 ms), oscillations (3–5 Hz), or slow ramps (10–30 s). Several findings suggest that these changes are under active control and depend on behavioral state: mean EOD frequency and CV can change independently; CV often decreases in response to behavioral stimuli; and lesions of one of the two inputs to the Pn had more influence on CV than lesions of the other input.
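The precision measure used in this abstract, CV = SD/mean period, can be sketched in a few lines. The function name and the use of the population standard deviation are assumptions made for illustration; the abstract does not say which SD estimator was used. The last lines only rearrange the same formula to show what mean period is implied by the reported Pn values (CV = 6 × 10⁻⁴, SD = 0.8 μs).

```python
from statistics import mean, pstdev


def coefficient_of_variation(periods):
    """Return SD / mean for a sequence of cycle periods (any time unit)."""
    return pstdev(periods) / mean(periods)


# Toy check with made-up periods: mean 2, population SD 1, so CV = 0.5.
toy_cv = coefficient_of_variation([1.0, 3.0])

# Rearranging CV = SD / mean period for the reported Pn neuron values.
reported_sd_s = 0.8e-6   # 0.8 microseconds, from the abstract
reported_cv = 6e-4       # from the abstract
implied_mean_period_s = reported_sd_s / reported_cv  # roughly 1.33 ms
```

An implied mean period of about 1.33 ms corresponds to a frequency of about 750 Hz, which lies inside the 70–1,250 Hz range quoted in the abstract.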
Abstract:
Major histocompatibility complex (MHC) class II molecules displayed clustered patterns at the surfaces of T (HUT-102B2) and B (JY) lymphoma cells characterized by interreceptor distances in the micrometer range as detected by scanning force microscopy of immunogold-labeled antigens. Electron microscopy revealed that a fraction of the MHC class II molecules was also heteroclustered with MHC class I antigens at the same hierarchical level as described by the scanning force microscopy data, after specifically and sequentially labeling the antigens with 30- and 15-nm immunogold beads. On JY cells the estimated fraction of co-clustered HLA II was 0.61, whereas that of the HLA I was 0.24. Clusterization of the antigens was detected by the deviation of their spatial distribution from the Poissonian distribution representing the random case. Fluorescence resonance energy transfer measurements also confirmed partial co-clustering of the HLA class I and II molecules at another hierarchical level characterized by the 2- to 10-nm Förster distance range and providing fine details of the molecular organization of receptors. The larger-scale topological organization of the MHC class I and II antigens may reflect underlying membrane lipid domains and may fulfill significant functions in cell-to-cell contacts and signal transduction.
Abstract:
Motifs of neural circuitry seem surprisingly conserved over different areas of neocortex or of paleocortex, while performing quite different sensory processing tasks. This apparent paradox may be resolved by the fact that seemingly different problems in sensory information processing are related by transformations (changes of variables) that convert one problem into another. The same basic algorithm that is appropriate to the recognition of a known odor quality, independent of the strength of the odor, can be used to recognize a vocalization (e.g., a spoken syllable), independent of whether it is spoken quickly or slowly. To convert one problem into the other, a new representation of time sequences is needed. The time that has elapsed since a recent event must be represented in neural activity. The electrophysiological hallmarks of cells that are involved in generating such a representation of time are discussed. The anatomical relationships between olfactory and auditory pathways suggest relevant experiments. The neurophysiological mechanism for the psychophysical logarithmic encoding of time duration would be of direct use for interconverting olfactory and auditory processing problems. Such reuse of old algorithms in new settings and representations is related to the way that evolution develops new biochemistry.
Abstract:
A single gene (mas) encodes the multifunctional enzyme that catalyzes the synthesis of very long chain multiple methyl branched fatty acids called mycocerosic acids that are present only in slow-growing pathogenic mycobacteria and are thought to be important for pathogenesis. To achieve a targeted disruption of mas, an internal 2-kb segment of this gene was replaced with approximately the same size hygromycin-resistance gene (hyg), such that hyg was flanked by 4.7- and 1.4-kb segments of mas. Transformation of Mycobacterium bovis BCG with this construct in a plasmid that cannot replicate in mycobacteria yielded hygromycin-resistant transformants. Screening of 38 such transformants by PCR revealed several transformants representing homologous recombination with single crossover and one with double crossover. With primers representing the hyg termini and those representing the mycobacterial genome segments outside that used to make the transformation construct, the double-crossover mutant yielded PCR products expected from either side of hyg. Gene replacement was further confirmed by the absence of the vector and the 2-kb segment of mas replaced by hyg from the genome of the mutant. Thin-layer and radio-gas chromatographic analyses of the lipids derived from [1-¹⁴C]propionate showed that the mutant was incapable of synthesizing mycocerosic acids and mycosides. Thus, homologous recombination with double crossover was achieved in a slow-growing mycobacterium with an intron-containing RecA. The resulting mas-disrupted mutant should allow testing of the postulated roles of mycosides in pathogenesis.
Abstract:
Chloroplast DNA restriction-site variation was surveyed among 40 accessions representing all 11 species of giant senecios (Dendrosenecio, Asteraceae) at all but one known location, plus three outgroup species. Remarkably little variation (only 9 variable sites out of roughly 1000 sites examined) was found among the 40 giant senecio accessions, yet as a group they differ significantly (at 18 sites) from Cineraria deltoidea, the closest known relative. This pattern indicates that the giant senecios underwent a recent dramatic radiation in eastern Africa and evolved from a relatively isolated lineage within the Senecioneae. Biogeographic interpretation of the molecular phylogeny suggests that the giant senecios originated high on Mt. Kilimanjaro, with subsequent dispersion to the Aberdares, Mt. Kenya, and the Cherangani Hills, followed by dispersion westward to the Ruwenzori Mountains, and then south to the Virunga Mountains, Mt. Kahuzi, and Mt. Muhi, but with dispersion back to Mt. Elgon. Geographic radiation was an important antecedent to the diversification in eastern Africa, which primarily involved repeated altitudinal radiation, both up and down the mountains, leading to morphological parallelism in both directions. In general, the plants on a given mountain are more closely related to each other than they are to plants on other mountains, and plants on nearby mountains are more closely related to each other than they are to plants on more distant mountains. The individual steps of the geographic radiation have occurred at various altitudes, some clearly the result of intermountain dispersal. The molecular evidence suggests that two species are extant ancestors to other species on the same or nearby mountains.
Abstract:
The regions surrounding the catalytic amino acids previously identified in a few "retaining" O-glycosyl hydrolases (EC 3.2.1) have been analyzed by hydrophobic cluster analysis and have been used to define sequence motifs. These motifs have been found in more than 150 glycosyl hydrolase sequences representing at least eight established protein families that act on a large variety of substrates. This allows the localization and the precise role of the catalytic residues (nucleophile and acid catalyst) to be predicted for each of these enzymes, including several lysosomal glycosidases. An identical arrangement of the catalytic nucleophile was also found for S-glycosyl hydrolases (myrosinases; EC 3.2.3.1) for which the acid catalyst is lacking. A (beta/alpha)8 barrel structure has been reported for two of the eight families of proteins that have been grouped. It is suggested that the six other families also share this fold at their catalytic domain. These enzymes illustrate how evolutionary events led to a wide diversification of substrate specificity with a similar disposition of identical catalytic residues onto the same ancestral (beta/alpha)8 barrel structure.
Abstract:
This paper introduces the Sm4RIA Extension for OIDE, which implements the Sm4RIA approach in OIDE (OOH4RIA Integrated Development Environment). The application, based on the Eclipse framework, supports the design of the Sm4RIA models as well as the model-to-model and model-to-text transformation processes that facilitate the generation of Semantic Rich Internet Applications, i.e., RIA applications capable of sharing data as Linked Data and consuming external data from other sources in the same manner. Moreover, the application implements mechanisms for the creation of RIA interfaces from ontologies and the automatic generation of administration interfaces for a previously designed application.
Abstract:
This layer is a georeferenced raster image of the historic paper map entitled: Kaarte van alle de dykpligtige en eenige waalpligtige landen behorende onder het Hoogreemraadschap van den Zeeburg en Diemerdyk, J. Wandelaar, delin. et sculpsit. It was published in 1749. Scale [ca. 1:6,000]. This layer is image 1 of 3 total images of the three-sheet source map, representing the northern portion of the map. Covers the region east of Amsterdam, the Netherlands, including portions of Gemeente Amsterdam, Gemeente Diemen, Gemeente Muiden, and Gemeente Weesp. Map in Dutch. The image inside the map neatline is georeferenced to the surface of the earth and fit to the RD_New (Rijksdriehoekstelsel), GCS Amersfoort coordinate system. All map collar and inset information is also available as part of the raster image, including any inset maps, profiles, statistical tables, directories, text, illustrations, index maps, legends, or other information associated with the principal map. This map shows features such as drainage, canals, cities and other human settlements, administrative boundaries, roads, property boundaries with names of landowners, selected buildings and built-up areas, fortification, dikes, dams, windmills, shoreline features, and more. Relief shown by hachures. Depths shown by soundings. This layer is part of a selection of digitally scanned and georeferenced historic maps from the Harvard Map Collection. These maps typically portray both natural and manmade features. The selection represents a range of originators, ground condition dates, scales, and map purposes.
Abstract:
This layer is a georeferenced raster image of the historic paper map entitled: Kaarte van alle de dykpligtige en eenige waalpligtige landen behorende onder het Hoogreemraadschap van den Zeeburg en Diemerdyk, J. Wandelaar, delin. et sculpsit. It was published in 1749. Scale [ca. 1:6,000]. This layer is image 2 of 3 total images of the three-sheet source map, representing the central portion of the map. Covers the region east of Amsterdam, the Netherlands, including portions of Gemeente Amsterdam, Gemeente Diemen, Gemeente Muiden, and Gemeente Weesp. Map in Dutch. The image inside the map neatline is georeferenced to the surface of the earth and fit to the RD_New (Rijksdriehoekstelsel), GCS Amersfoort coordinate system. All map collar and inset information is also available as part of the raster image, including any inset maps, profiles, statistical tables, directories, text, illustrations, index maps, legends, or other information associated with the principal map. This map shows features such as drainage, canals, cities and other human settlements, administrative boundaries, roads, property boundaries with names of landowners, selected buildings and built-up areas, fortification, dikes, dams, windmills, shoreline features, and more. Relief shown by hachures. Depths shown by soundings. This layer is part of a selection of digitally scanned and georeferenced historic maps from the Harvard Map Collection. These maps typically portray both natural and manmade features. The selection represents a range of originators, ground condition dates, scales, and map purposes.
Abstract:
This layer is a georeferenced raster image of the historic paper map entitled: Kaarte van alle de dykpligtige en eenige waalpligtige landen behorende onder het Hoogreemraadschap van den Zeeburg en Diemerdyk, J. Wandelaar, delin. et sculpsit. It was published in 1749. Scale [ca. 1:6,000]. This layer is image 3 of 3 total images of the three-sheet source map, representing the southern portion of the map. Covers the region east of Amsterdam, the Netherlands, including portions of Gemeente Amsterdam, Gemeente Diemen, Gemeente Muiden, and Gemeente Weesp. Map in Dutch. The image inside the map neatline is georeferenced to the surface of the earth and fit to the RD_New (Rijksdriehoekstelsel), GCS Amersfoort coordinate system. All map collar and inset information is also available as part of the raster image, including any inset maps, profiles, statistical tables, directories, text, illustrations, index maps, legends, or other information associated with the principal map. This map shows features such as drainage, canals, cities and other human settlements, administrative boundaries, roads, property boundaries with names of landowners, selected buildings and built-up areas, fortification, dikes, dams, windmills, shoreline features, and more. Relief shown by hachures. Depths shown by soundings. This layer is part of a selection of digitally scanned and georeferenced historic maps from the Harvard Map Collection. These maps typically portray both natural and manmade features. The selection represents a range of originators, ground condition dates, scales, and map purposes.
Abstract:
Mutual relations in the area of sports, which in contemporary international contacts often not only reflect the true nature of political relations but sometimes even affect them, can be a valuable contribution to the analysis of this conflict’s nature. Why did the Transnistrian government, despite the use of anti-Moldovan rhetoric, agree to Transnistrian athletes representing Moldova during the Olympics and in other international competitions? Why does it accept the presence of sports teams from both banks of the Dniester playing in the same leagues? Why does Transnistria, despite being much smaller, predominate in many sports? How is it that Sheriff Tiraspol, the flagship football club of the business and political circles controlling Transnistria, managed to win the Moldovan championship ten times in a row and is the main source of players for Moldova’s national team? Does sport really ‘know no borders’ or perhaps the border on the Dniester is different than seems at first sight?