985 resultados para Semantic features matrix
Resumo:
An implementation of Sem-ODB—a database management system based on the Semantic Binary Model is presented. A metaschema of Sem-ODB database as well as the top-level architecture of the database engine is defined. A new benchmarking technique is proposed which allows databases built on different database models to compete fairly. This technique is applied to show that Sem-ODB has excellent efficiency comparing to a relational database on a certain class of database applications. A new semantic benchmark is designed which allows evaluation of the performance of the features characteristic of semantic database applications. An application used in the benchmark represents a class of problems requiring databases with sparse data, complex inheritances and many-to-many relations. Such databases can be naturally accommodated by semantic model. A fixed predefined implementation is not enforced allowing the database designer to choose the most efficient structures available in the DBMS tested. The results of the benchmark are analyzed. ^ A new high-level querying model for semantic databases is defined. It is proven adequate to serve as an efficient native semantic database interface, and has several advantages over the existing interfaces. It is optimizable and parallelizable, supports the definition of semantic userviews and the interoperability of semantic databases with other data sources such as World Wide Web, relational, and object-oriented databases. The query is structured as a semantic database schema graph with interlinking conditionals. The query result is a mini-database, accessible in the same way as the original database. The paradigm supports and utilizes the rich semantics and inherent ergonomics of semantic databases. ^ The analysis and high-level design of a system that exploits the superiority of the Semantic Database Model to other data models in expressive power and ease of use to allow uniform access to heterogeneous data sources such as semantic databases, relational databases, web sites, ASCII files, and others via a common query interface is presented. The Sem-ODB engine is used to control all the data sources combined under a unified semantic schema. A particular application of the system to provide an ODBC interface to the WWW as a data source is discussed. ^
Resumo:
The research presented in this dissertation is comprised of several parts which jointly attain the goal of Semantic Distributed Database Management with Applications to Internet Dissemination of Environmental Data. ^ Part of the research into more effective and efficient data management has been pursued through enhancements to the Semantic Binary Object-Oriented database (Sem-ODB) such as more effective load balancing techniques for the database engine, and the use of Sem-ODB as a tool for integrating structured and unstructured heterogeneous data sources. Another part of the research in data management has pursued methods for optimizing queries in distributed databases through the intelligent use of network bandwidth; this has applications in networks that provide varying levels of Quality of Service or throughput. ^ The application of the Semantic Binary database model as a tool for relational database modeling has also been pursued. This has resulted in database applications that are used by researchers at the Everglades National Park to store environmental data and to remotely-sensed imagery. ^ The areas of research described above have contributed to the creation TerraFly, which provides for the dissemination of geospatial data via the Internet. TerraFly research presented herein ranges from the development of TerraFly's back-end database and interfaces, through the features that are presented to the public (such as the ability to provide autopilot scripts and on-demand data about a point), to applications of TerraFly in the areas of hazard mitigation, recreation, and aviation. ^
Resumo:
Over the past five years, XML has been embraced by both the research and industrial community due to its promising prospects as a new data representation and exchange format on the Internet. The widespread popularity of XML creates an increasing need to store XML data in persistent storage systems and to enable sophisticated XML queries over the data. The currently available approaches to addressing the XML storage and retrieval issue have the limitations of either being not mature enough (e.g. native approaches) or causing inflexibility, a lot of fragmentation and excessive join operations (e.g. non-native approaches such as the relational database approach). ^ In this dissertation, I studied the issue of storing and retrieving XML data using the Semantic Binary Object-Oriented Database System (Sem-ODB) to leverage the advanced Sem-ODB technology with the emerging XML data model. First, a meta-schema based approach was implemented to address the data model mismatch issue that is inherent in the non-native approaches. The meta-schema based approach captures the meta-data of both Document Type Definitions (DTDs) and Sem-ODB Semantic Schemas, thus enables a dynamic and flexible mapping scheme. Second, a formal framework was presented to ensure precise and concise mappings. In this framework, both schemas and the conversions between them are formally defined and described. Third, after major features of an XML query language, XQuery, were analyzed, a high-level XQuery to Semantic SQL (Sem-SQL) query translation scheme was described. This translation scheme takes advantage of the navigation-oriented query paradigm of the Sem-SQL, thus avoids the excessive join problem of relational approaches. Finally, the modeling capability of the Semantic Binary Object-Oriented Data Model (Sem-ODM) was explored from the perspective of conceptually modeling an XML Schema using a Semantic Schema. ^ It was revealed that the advanced features of the Sem-ODB, such as multi-valued attributes, surrogates, the navigation-oriented query paradigm, among others, are indeed beneficial in coping with the XML storage and retrieval issue using a non-XML approach. Furthermore, extensions to the Sem-ODB to make it work more effectively with XML data were also proposed. ^
Resumo:
Sociolinguists have documented the substrate influence of various languages on the formation of dialects in numerous ethnic-regional setting throughout the United States. This literature shows that while phonological and grammatical influences from other languages may be instantiated as durable dialect features, lexical phenomena often fade over time as ethnolinguistic communities assimilate with contiguous dialect groups. In preliminary investigations of emerging Miami Latino English, we have observed that lexical forms based on Spanish lexical forms are not only ubiquitous among the speech of the first generation Cuban Americans but also of the second. Examples, observed in field work, casual observation, and studied formally in an experimental context include the following: “get down from the car,” which derives from the Spanish equivalent, bajar del carro instead of “get out of the car”. The translation task administered to thirty-one participants showed a variety lexical phenomena are still maintained at equal or higher frequencies.
Resumo:
Spectral unmixing (SU) is a technique to characterize mixed pixels of the hyperspectral images measured by remote sensors. Most of the existing spectral unmixing algorithms are developed using the linear mixing models. Since the number of endmembers/materials present at each mixed pixel is normally scanty compared with the number of total endmembers (the dimension of spectral library), the problem becomes sparse. This thesis introduces sparse hyperspectral unmixing methods for the linear mixing model through two different scenarios. In the first scenario, the library of spectral signatures is assumed to be known and the main problem is to find the minimum number of endmembers under a reasonable small approximation error. Mathematically, the corresponding problem is called the $\ell_0$-norm problem which is NP-hard problem. Our main study for the first part of thesis is to find more accurate and reliable approximations of $\ell_0$-norm term and propose sparse unmixing methods via such approximations. The resulting methods are shown considerable improvements to reconstruct the fractional abundances of endmembers in comparison with state-of-the-art methods such as having lower reconstruction errors. In the second part of the thesis, the first scenario (i.e., dictionary-aided semiblind unmixing scheme) will be generalized as the blind unmixing scenario that the library of spectral signatures is also estimated. We apply the nonnegative matrix factorization (NMF) method for proposing new unmixing methods due to its noticeable supports such as considering the nonnegativity constraints of two decomposed matrices. Furthermore, we introduce new cost functions through some statistical and physical features of spectral signatures of materials (SSoM) and hyperspectral pixels such as the collaborative property of hyperspectral pixels and the mathematical representation of the concentrated energy of SSoM for the first few subbands. Finally, we introduce sparse unmixing methods for the blind scenario and evaluate the efficiency of the proposed methods via simulations over synthetic and real hyperspectral data sets. The results illustrate considerable enhancements to estimate the spectral library of materials and their fractional abundances such as smaller values of spectral angle distance (SAD) and abundance angle distance (AAD) as well.
Resumo:
Humans have a high ability to extract visual data information acquired by sight. Trought a learning process, which starts at birth and continues throughout life, image interpretation becomes almost instinctively. At a glance, one can easily describe a scene with reasonable precision, naming its main components. Usually, this is done by extracting low-level features such as edges, shapes and textures, and associanting them to high level meanings. In this way, a semantic description of the scene is done. An example of this, is the human capacity to recognize and describe other people physical and behavioral characteristics, or biometrics. Soft-biometrics also represents inherent characteristics of human body and behaviour, but do not allow unique person identification. Computer vision area aims to develop methods capable of performing visual interpretation with performance similar to humans. This thesis aims to propose computer vison methods which allows high level information extraction from images in the form of soft biometrics. This problem is approached in two ways, unsupervised and supervised learning methods. The first seeks to group images via an automatic feature extraction learning , using both convolution techniques, evolutionary computing and clustering. In this approach employed images contains faces and people. Second approach employs convolutional neural networks, which have the ability to operate on raw images, learning both feature extraction and classification processes. Here, images are classified according to gender and clothes, divided into upper and lower parts of human body. First approach, when tested with different image datasets obtained an accuracy of approximately 80% for faces and non-faces and 70% for people and non-person. The second tested using images and videos, obtained an accuracy of about 70% for gender, 80% to the upper clothes and 90% to lower clothes. The results of these case studies, show that proposed methods are promising, allowing the realization of automatic high level information image annotation. This opens possibilities for development of applications in diverse areas such as content-based image and video search and automatica video survaillance, reducing human effort in the task of manual annotation and monitoring.
Resumo:
The trees, hedgerows and woods are current configuration of the tree network in several ecological regions of the world. In Trás–os–Montes region, Northeast of Portugal, they are a traditional component of Terra fria landscape and they could be seen in several forms: scatter trees, fencerows, small woodlots, riparian buffer strips, among others. The extensive livestock systems in this region are based on a set of circuits across the landscape. In this practice, flocks interacts with these structures using them for different functions inducing an influence on the itineraries. Our purpose will be focused on the woody features of landscape regarding their configurations, abundance and spacial distribution; in order to examine how the grazing systems depends on the currency of these formations; particularly how species flocks behaviors are related on. Depending on spatial data, The investigation attain to compare the tree network within the agriculture matrix, to the grazed territory crossed by flocks. From the other side, the importance of spatial data on interpreting the issue by suggesting different parameter that may influence the circuits. The recognition of the pressure exerciced by the occurence of the woody structures on the grazed circuits is possible. We believe that the role of these woody structures features in supporting the tradicional silvopastoral systems has been sufficiently strong for change their distribution pattern.
Resumo:
Global land cover maps play an important role in the understanding of the Earth's ecosystem dynamic. Several global land cover maps have been produced recently namely, Global Land Cover Share (GLC-Share) and GlobeLand30. These datasets are very useful sources of land cover information and potential users and producers are many times interested in comparing these datasets. However these global land cover maps are produced based on different techniques and using different classification schemes making their interoperability in a standardized way a challenge. The Environmental Information and Observation Network (EIONET) Action Group on Land Monitoring in Europe (EAGLE) concept was developed in order to translate the differences in the classification schemes into a standardized format which allows a comparison between class definitions. This is done by elaborating an EAGLE matrix for each classification scheme, where a bar code is assigned to each class definition that compose a certain land cover class. Ahlqvist (2005) developed an overlap metric to cope with semantic uncertainty of geographical concepts, providing this way a measure of how geographical concepts are more related to each other. In this paper, the comparison of global land cover datasets is done by translating each land cover legend into the EAGLE bar coding for the Land Cover Components of the EAGLE matrix. The bar coding values assigned to each class definition are transformed in a fuzzy function that is used to compute the overlap metric proposed by Ahlqvist (2005) and overlap matrices between land cover legends are elaborated. The overlap matrices allow the semantic comparison between the classification schemes of each global land cover map. The proposed methodology is tested on a case study where the overlap metric proposed by Ahlqvist (2005) is computed in the comparison of two global land cover maps for Continental Portugal. The study resulted with the overlap spatial distribution among the two global land cover maps, Globeland30 and GLC-Share. These results shows that Globeland30 product overlap with a degree of 77% with GLC-Share product in Continental Portugal.
Resumo:
In this paper, the problem of semantic place categorization in mobile robotics is addressed by considering a time-based probabilistic approach called dynamic Bayesian mixture model (DBMM), which is an improved variation of the dynamic Bayesian network. More specifically, multi-class semantic classification is performed by a DBMM composed of a mixture of heterogeneous base classifiers, using geometrical features computed from 2D laserscanner data, where the sensor is mounted on-board a moving robot operating indoors. Besides its capability to combine different probabilistic classifiers, the DBMM approach also incorporates time-based (dynamic) inferences in the form of previous class-conditional probabilities and priors. Extensive experiments were carried out on publicly available benchmark datasets, highlighting the influence of the number of time-slices and the effect of additive smoothing on the classification performance of the proposed approach. Reported results, under different scenarios and conditions, show the effectiveness and competitive performance of the DBMM.
Resumo:
Colorectal cancer (CRC) is the third most common cancer in the UK with 41,000 new cases diagnosed in 2011. Despite undergoing potentially curative resection, a significant amount of patients develop recurrence. Biomarkers that aid prognostication or identify patients who are suitable for adjuvant treatments are needed. The TNM staging system does a reasonably good job at offering prognostic information to the treating clinician, but it could be better and identifying methods of improving its accuracy are needed. Tumour progression is based on a complex relationship between tumour behaviour and the hosts’ inflammatory responses. Sustained tumour cell proliferation, evading growth suppressors, resisting apoptosis, replicative immortality, sustained angiogenesis, invasion & metastasis, avoiding immune destruction, deregulated cellular energetics, tumour promoting inflammation and genomic instability & mutation have been identified as hallmarks. These hallmarks are malignant behaviors are what makes the cell cancerous and the more extreme the behaviour the more aggressive the cancer the more likely the risk of a poor outcome. There are two primary genomic instability pathways: Microsatellite Instability (MSI) and Chromosomal Instability (CI) also referred to as Microsatellite Stability (MSS). Tumours arising by these pathways have a predilection for specific anatomical, histological and molecular biological features. It is possible that aberrant molecular expression of genes/proteins that promote malignant behaviors may also act as prognostic and predictive biomarkers, which may offer superior prognostic information to classical prognostic features. Cancer related inflammation has been described as a 7th hallmark of cancer. Despite the systemic inflammatory response (SIR) being associated with more aggressive malignant disease, infiltration by immune cells, particularly CD8+ lymphocytes, at the advancing edge of the tumour have been associated with improved outcome and tumour MSI. It remains unknown if the SIR is associated with tumour MSI and this requires further study. The mechanisms by which colorectal cancer cells locally invade through the bowel remain uncertain, but connective tissue degradation by matrix metalloproteinases (MMPs) such as MMP-9 have been implicated. MMP-9 has been found in the cancer cells, stromal cells and patient circulation. Although tumoural MMP-9 has been associated with poor survival, reports are conflicting and contain relatively small sample sizes. Furthermore, the influence of high serum MMP-9 on survival remains unknown. Src family kinases (SFKs) have been implicated in many adverse cancer cell behaviors. SFKs comprise 9 family members BLK, C-SRC, FGR, FYN, HCK, LCK, LYN, YES, YRK. C-SRC has been the most investigated of all SFKs, but the role of other SFKs in cellular behaviors and their prognostic value remains largely unknown. The development of Src inhibitors, such as Dasatinib, has identified SFKs as a potential therapeutic target for patients at higher risk of poor survival. Unfortunately, clinical trials so far have not been promising but this may reflect inadequate patient selection and SFKs may act as useful prognostic and predictive biomarkers. In chapter 3, the association between cancer related inflammation, tumour MSI, clinicopathological factors and survival was tested in two independent cohorts. A training cohort consisting of n=182 patients and a validation cohort of n=677 patients. MSI tumours were associated with a raised CRP (p=0.003). Hypoalbuminaemia was independently associated with poor overall survival in TNM stage II cancer (HR 3.04 (95% CI 1.44 – 6.43);p=0.004), poor recurrence free survival in TNM stage III cancer (HR 1.86 (95% 1.03 – 3.36);p=0.040) and poor overall survival in CI colorectal cancer (HR 1.49 (95% CI 1.06 – 2.10);p=0.022). Interestingly, MSI tumours were associated with poor overall survival in TNM stage III cancer (HR 2.20 (95% CI 1.10 – 4.37);p=0.025). In chapter 4, the role of MMP-9 in colorectal cancer progression and survival was examined. MMP-9 in the tissue was assessed using IHC and serum expression quantified using ELISA. Serum MMP-9 was associated with cancer cell expression (Spearman’s Correlation Coefficient (SCC) 0.393, p<0.001)) and stromal expression (SCC 0.319, p=0.002). Serum MMP-9 was associated with poor recurrence-free (HR 3.37 (95% CI 1.20 – 9.48);p=0.021) and overall survival (HR 3.16 (95% CI 1.22 – 8.15);p=0.018), but tumour MMP-9 was not survival or MSI status. In chapter 5, the role of SFK expression and activation in colorectal cancer progression and survival was studied. On PCR analysis, although LYN, C-SRC and YES were the most highly expressed, FGR and HCK had higher expression profiles as tumours progressed. Using IHC, raised cytoplasmic FAK (tyr 861) was independently associated with poor recurrence free survival in all cancers (HR 1.48 (95% CI 1.02 – 2.16);p=0.040) and CI cancers (HR 1.50 (95% CI 1.02 – 2.21);p=0.040). However, raised cytoplasmic HCK (HR 2.04 (95% CI 1.11 – 3.76);p=0.022) was independently associated with poor recurrence-free survival in TNM stage II cancers. T84 and HT29 cell lines were used to examine the cellular effects of Dasatinib. Cell viability was assessed using WST-1 assay and apoptosis assessed using an ELISA cell death detection assay. Dasatinib increased T84 tumour cell apoptosis in a dose dependent manner and resulted in reduced expression of nuclear (p=0.008) and cytoplasmic (p=0.016) FAK (tyr 861) expression and increased nuclear FGR expression (p=0.004). The results of this thesis confirm that colorectal cancer is a complex disease that represents several subtypes of cancer based on molecular biological behaviors. This thesis concentrated on features of the disease related to inflammation in terms of genetic and molecular characterisation. MSI cancers are closely associated with systemic inflammation but despite this observation, they retain their relatively improved survival. MMP-9 is a feature of tissue remodeling during inflammation and is also associated with degradation of connective tissue, advanced T-stage and poor outcome when measured in the serum. The lack of stromal quantification due to TMA use rather than full sections makes the value of tumoural MMP-9 immunoreactivity in the prognostication and its association with MSI unknown and requires further study. Finally, SFK activation was also associated with SIR, however, only cytoplasmic HCK was independently associated with poor survival in patients with TNM stage II disease, the group of patients where identifying a novel biomarker is most needed. There is still some way to go before these biomarkers are translated into clinical practice and future work needs to focus on obtaining a reliable and robust scientific technique with validation in an adequately powered independent cohort.
Resumo:
Building Information Modelling is changing the design and construction field ever since it entered the market. It took just some time to show its capabilities, it takes some time to be mastered before it could be used expressing all its best features. Since it was conceived to be adopted from the earliest stage of design to get the maximum from the decisional project, it still struggles to adapt to existing buildings. In fact, there is a branch of this methodology that is dedicated to what has been already made that is called Historic BIM or HBIM. This study aims to make clear what are BIM and HBIM, both from a theoretical point of view and in practice, applying from scratch the state of the art to a case study. It had been chosen the fortress of San Felice sul Panaro, a marvellous building with a thousand years of history in its bricks, that suffered violent earthquakes, but it is still standing. By means of this example, it will be shown which are the limits that could be encountered when applying BIM methodology to existing heritage, moreover will be pointed out all the new features that a simple 2D design could not achieve.
Resumo:
Hypertensive patients exhibit higher cardiovascular risk and reduced lung function compared with the general population. Whether this association stems from the coexistence of two highly prevalent diseases or from direct or indirect links of pathophysiological mechanisms is presently unclear. This study investigated the association between lung function and carotid features in non-smoking hypertensive subjects with supposed normal lung function. Hypertensive patients (n = 67) were cross-sectionally evaluated by clinical, hemodynamic, laboratory, and carotid ultrasound analysis. Forced vital capacity, forced expired volume in 1 second and in 6 seconds, and lung age were estimated by spirometry. Subjects with ventilatory abnormalities according to current guidelines were excluded. Regression analysis adjusted for age and prior smoking history showed that lung age and the percentage of predicted spirometric parameters associated with common carotid intima-media thickness, diameter, and stiffness. Further analyses, adjusted for additional potential confounders, revealed that lung age was the spirometric parameter exhibiting the most significant regression coefficients with carotid features. Conversely, plasma C-reactive protein and matrix-metalloproteinases-2/9 levels did not influence this relationship. The present findings point toward lung age as a potential marker of vascular remodeling and indicate that lung and vascular remodeling might share common pathophysiological mechanisms in hypertensive subjects.
Resumo:
Congenital muscular dystrophy with laminin α2 chain deficiency (MDC1A) is one of the most severe forms of muscular disease and is characterized by severe muscle weakness and delayed motor milestones. The genetic basis of MDC1A is well known, yet the secondary mechanisms ultimately leading to muscle degeneration and subsequent connective tissue infiltration are not fully understood. In order to obtain new insights into the molecular mechanisms underlying MDC1A, we performed a comparative proteomic analysis of affected muscles (diaphragm and gastrocnemius) from laminin α2 chain-deficient dy(3K)/dy(3K) mice, using multidimensional protein identification technology combined with tandem mass tags. Out of the approximately 700 identified proteins, 113 and 101 proteins, respectively, were differentially expressed in the diseased gastrocnemius and diaphragm muscles compared with normal muscles. A large portion of these proteins are involved in different metabolic processes, bind calcium, or are expressed in the extracellular matrix. Our findings suggest that metabolic alterations and calcium dysregulation could be novel mechanisms that underlie MDC1A and might be targets that should be explored for therapy. Also, detailed knowledge of the composition of fibrotic tissue, rich in extracellular matrix proteins, in laminin α2 chain-deficient muscle might help in the design of future anti-fibrotic treatments. All MS data have been deposited in the ProteomeXchange with identifier PXD000978 (http://proteomecentral.proteomexchange.org/dataset/PXD000978).
Resumo:
Subjects with spinal cord injury (SCI) exhibit impaired left ventricular (LV) diastolic function, which has been reported to be attenuated by regular physical activity. This study investigated the relationship between circulating matrix metalloproteinases (MMPs) and tissue inhibitors of MMPs (TIMPs) and echocardiographic parameters in SCI subjects and the role of physical activity in this regard. Forty-two men with SCI [19 sedentary (S-SCI) and 23 physically-active (PA-SCI)] were evaluated by clinical, anthropometric, laboratory, and echocardiographic analysis. Plasmatic pro-MMP-2, MMP-2, MMP-8, pro-MMP-9, MMP-9, TIMP-1 and TIMP-2 levels were determined by enzyme-linked immunosorbent assay and zymography. PA-SCI subjects presented lower pro-MMP-2 and pro-MMP-2/TIMP-2 levels and improved markers of LV diastolic function (lower E/Em and higher Em and E/A values) than S-SCI ones. Bivariate analysis showed that pro-MMP-2 correlated inversely with Em and directly with E/Em, while MMP-9 correlated directly with LV mass index and LV end-diastolic diameter in the whole sample. Following multiple regression analysis, pro-MMP-2, but not physical activity, remained associated with Em, while MMP-9 was associated with LV mass index in the whole sample. These findings suggest differing roles for MMPs in LV structure and function regulation and an interaction among pro-MMP-2, diastolic function and physical activity in SCI subjects.
Resumo:
Corynebacterium species (spp.) are among the most frequently isolated pathogens associated with subclinical mastitis in dairy cows. However, simple, fast, and reliable methods for the identification of species of the genus Corynebacterium are not currently available. This study aimed to evaluate the usefulness of matrix-assisted laser desorption ionization/mass spectrometry (MALDI-TOF MS) for identifying Corynebacterium spp. isolated from the mammary glands of dairy cows. Corynebacterium spp. were isolated from milk samples via microbiological culture (n=180) and were analyzed by MALDI-TOF MS and 16S rRNA gene sequencing. Using MALDI-TOF MS methodology, 161 Corynebacterium spp. isolates (89.4%) were correctly identified at the species level, whereas 12 isolates (6.7%) were identified at the genus level. Most isolates that were identified at the species level with 16 S rRNA gene sequencing were identified as Corynebacterium bovis (n=156; 86.7%) were also identified as C. bovis with MALDI-TOF MS. Five Corynebacterium spp. isolates (2.8%) were not correctly identified at the species level with MALDI-TOF MS and 2 isolates (1.1%) were considered unidentified because despite having MALDI-TOF MS scores >2, only the genus level was correctly identified. Therefore, MALDI-TOF MS could serve as an alternative method for species-level diagnoses of bovine intramammary infections caused by Corynebacterium spp.