315 resultados para Structural similarity
em Queensland University of Technology - ePrints Archive
Resumo:
This paper proposes a novel Hybrid Clustering approach for XML documents (HCX) that first determines the structural similarity in the form of frequent subtrees and then uses these frequent subtrees to represent the constrained content of the XML documents in order to determine the content similarity. The empirical analysis reveals that the proposed method is scalable and accurate.
Resumo:
XML document clustering is essential for many document handling applications such as information storage, retrieval, integration and transformation. An XML clustering algorithm should process both the structural and the content information of XML documents in order to improve the accuracy and meaning of the clustering solution. However, the inclusion of both kinds of information in the clustering process results in a huge overhead for the underlying clustering algorithm because of the high dimensionality of the data. This paper introduces a novel approach that first determines the structural similarity in the form of frequent subtrees and then uses these frequent subtrees to represent the constrained content of the XML documents in order to determine the content similarity. The proposed method reduces the high dimensionality of input data by using only the structure-constrained content. The empirical analysis reveals that the proposed method can effectively cluster even very large XML datasets and outperform other existing methods.
Resumo:
The detection and potential treatment of oxidative stress in biological systems has been explored using isoindoline-based nitroxide radicals. A novel tetraethyl-fluorescein nitroxide was synthesised for its use as a profluorescent probe for redox processes in biological systems. This tetraethyl system, as well as a tetramethyl-fluorescein nitroxide, were shown to be sensitive and selective probes for superoxide in vitro. The redox environment of cellular systems was also explored using the tetramethylfluorescein species based on its reduction to the hydroxylamine. Flow cytometry was employed to assess the extent of nitroxide reduction, reflecting the overall cellular redox environment. Treatment of normal fibroblasts with rotenone and 2-deoxyglucose resulted in an oxidising cellular environment as shown by the lack of reduction of the fluorescein-nitroxide system. Assessment of the tetraethyl-fluorescein nitroxide system in the same way demonstrated its enhanced resistance to reduction and offers the potential to detect and image biologically relevant reactive oxygen species directly. Importantly, these profluorescent nitroxide compounds were shown to be more effective than the more widely used and commercially available probes for reactive oxygen species such as 2’,7’-dichlorodihydrofluorescein diacetate. Fluorescence imaging of the tetramethyl-fluorescein nitroxide and a number of other rhodamine-nitroxide derivatives was undertaken, revealing the differential cellular localisation of these systems and thus their potential for the detection of redox changes in specific cellular compartments. As well as developing novel methods for the detection of oxidative stress, a number of novel isoindoline nitroxides were synthesised for their potential application as small-molecule antioxidants. These compounds incorporated known pharmacophores into the isoindoline-nitroxide structure in an attempt to increase their efficacy in biological systems. A primary and a secondary amine nitroxide were synthesised which incorporated the phenethylamine backbone of the sympathomimetic amine class of drugs. Initial assessment of the novel primary amine derivative indicated a protective effect comparable to that of 5-carboxy-1,1,3,3- tetramethylisoindolin-2-yloxyl. Methoxy-substituted nitroxides were also synthesised as potential antioxidants for their structural similarity to some amphetamine type stimulants. A copper-catalysed methodology provided access to both the mono- and di-substituted methoxy-nitroxides. Deprotection of the ethers in these compounds using boron tribromide successfully produced a phenolnitroxide, however the catechol moiety in the disubstituted derivative appeared to undergo reaction with the nitroxide to produce quinone-like degradation products. A novel fluoran-nitroxide was also synthesised from the methoxy-substituted nitroxide, providing a pH-sensitive spin probe. An amino-acid precursor containing a nitroxide moiety was also synthesised for its application as a dual-action antioxidant. N-Acetyl protection of the nitroxide radical was necessary prior to the Erlenmeyer reaction with N-acetyl glycine. Hydrolysis and reduction of the azlactone intermediate produced a novel amino acid precursor with significant potential as an effective antioxidant.
Resumo:
With the growing number of XML documents on theWeb it becomes essential to effectively organise these XML documents in order to retrieve useful information from them. A possible solution is to apply clustering on the XML documents to discover knowledge that promotes effective data management, information retrieval and query processing. However, many issues arise in discovering knowledge from these types of semi-structured documents due to their heterogeneity and structural irregularity. Most of the existing research on clustering techniques focuses only on one feature of the XML documents, this being either their structure or their content due to scalability and complexity problems. The knowledge gained in the form of clusters based on the structure or the content is not suitable for reallife datasets. It therefore becomes essential to include both the structure and content of XML documents in order to improve the accuracy and meaning of the clustering solution. However, the inclusion of both these kinds of information in the clustering process results in a huge overhead for the underlying clustering algorithm because of the high dimensionality of the data. The overall objective of this thesis is to address these issues by: (1) proposing methods to utilise frequent pattern mining techniques to reduce the dimension; (2) developing models to effectively combine the structure and content of XML documents; and (3) utilising the proposed models in clustering. This research first determines the structural similarity in the form of frequent subtrees and then uses these frequent subtrees to represent the constrained content of the XML documents in order to determine the content similarity. A clustering framework with two types of models, implicit and explicit, is developed. The implicit model uses a Vector Space Model (VSM) to combine the structure and the content information. The explicit model uses a higher order model, namely a 3- order Tensor Space Model (TSM), to explicitly combine the structure and the content information. This thesis also proposes a novel incremental technique to decompose largesized tensor models to utilise the decomposed solution for clustering the XML documents. The proposed framework and its components were extensively evaluated on several real-life datasets exhibiting extreme characteristics to understand the usefulness of the proposed framework in real-life situations. Additionally, this research evaluates the outcome of the clustering process on the collection selection problem in the information retrieval on the Wikipedia dataset. The experimental results demonstrate that the proposed frequent pattern mining and clustering methods outperform the related state-of-the-art approaches. In particular, the proposed framework of utilising frequent structures for constraining the content shows an improvement in accuracy over content-only and structure-only clustering results. The scalability evaluation experiments conducted on large scaled datasets clearly show the strengths of the proposed methods over state-of-the-art methods. In particular, this thesis work contributes to effectively combining the structure and the content of XML documents for clustering, in order to improve the accuracy of the clustering solution. In addition, it also contributes by addressing the research gaps in frequent pattern mining to generate efficient and concise frequent subtrees with various node relationships that could be used in clustering.
Resumo:
Twin studies offer the opportunity to determine the relative contribution of genes versus environment in traits of interest. Here, we investigate the extent to which variance in brain structure is reduced in monozygous twins with identical genetic make-up. We investigate whether using twins as compared to a control population reduces variability in a number of common magnetic resonance (MR) structural measures, and we investigate the location of areas under major genetic influences. This is fundamental to understanding the benefit of using twins in studies where structure is the phenotype of interest. Twenty-three pairs of healthy MZ twins were compared to matched control pairs. Volume, T2 and diffusion MR imaging were performed as well as spectroscopy (MRS). Images were compared using (i) global measures of standard deviation and effect size, (ii) voxel-based analysis of similarity and (iii) intra-pair correlation. Global measures indicated a consistent increase in structural similarity in twins. The voxel-based and correlation analyses indicated a widespread pattern of increased similarity in twin pairs, particularly in frontal and temporal regions. The areas of increased similarity were most widespread for the diffusion trace and least widespread for T2. MRS showed consistent reduction in metabolite variation that was significant in the temporal lobe N-acetylaspartate (NAA). This study has shown the distribution and magnitude of reduced variability in brain volume, diffusion, T2 and metabolites in twins. The data suggest that evaluation of twins discordant for disease is indeed a valid way to attribute genetic or environmental influences to observed abnormalities in patients since evidence is provided for the underlying assumption of decreased variability in twins.
Resumo:
Process-aware information systems (PAISs) can be configured using a reference process model, which is typically obtained via expert interviews. Over time, however, contextual factors and system requirements may cause the operational process to start deviating from this reference model. While a reference model should ideally be updated to remain aligned with such changes, this is a costly and often neglected activity. We present a new process mining technique that automatically improves the reference model on the basis of the observed behavior as recorded in the event logs of a PAIS. We discuss how to balance the four basic quality dimensions for process mining (fitness, precision, simplicity and generalization) and a new dimension, namely the structural similarity between the reference model and the discovered model. We demonstrate the applicability of this technique using a real-life scenario from a Dutch municipality.
Resumo:
Background As the increasing adoption of information technology continues to offer better distant medical services, the distribution of, and remote access to digital medical images over public networks continues to grow significantly. Such use of medical images raises serious concerns for their continuous security protection, which digital watermarking has shown great potential to address. Methods We present a content-independent embedding scheme for medical image watermarking. We observe that the perceptual content of medical images varies widely with their modalities. Recent medical image watermarking schemes are image-content dependent and thus they may suffer from inconsistent embedding capacity and visual artefacts. To attain the image content-independent embedding property, we generalise RONI (region of non-interest, to the medical professionals) selection process and use it for embedding by utilising RONI’s least significant bit-planes. The proposed scheme thus avoids the need for RONI segmentation that incurs capacity and computational overheads. Results Our experimental results demonstrate that the proposed embedding scheme performs consistently over a dataset of 370 medical images including their 7 different modalities. Experimental results also verify how the state-of-the-art reversible schemes can have an inconsistent performance for different modalities of medical images. Our scheme has MSSIM (Mean Structural SIMilarity) larger than 0.999 with a deterministically adaptable embedding capacity. Conclusions Our proposed image-content independent embedding scheme is modality-wise consistent, and maintains a good image quality of RONI while keeping all other pixels in the image untouched. Thus, with an appropriate watermarking framework (i.e., with the considerations of watermark generation, embedding and detection functions), our proposed scheme can be viable for the multi-modality medical image applications and distant medical services such as teleradiology and eHealth.
Resumo:
Ignoring an object slows subsequent naming responses to it, a phenomenon known as negative priming (NP). A central issue in NP research concerns the level of representation at which the effect occurs. As object naming is typically considered to involve access to abstract semantic representations, Tipper 1985 proposed that the NP effect occurred at this level of processing, and other researchers supported this proposal by demonstrating a similar result with categorically related objects (e.g., Allport et al., 1985; Murray, 1995), an effect referred to as semantic NP. However, objects within categories share more physical or structural features than objects from different categories. Consequently, the NP effect observed with categorically related objects might occur at a structural rather than semantic level of representation. We used event related fMRI interleaving overt object naming and image acquisition to demonstrate for the first time that the semantic NP effect activates the left posterior-mid fusiform and insular-opercular cortices. Moreover, both naming latencies and left posterior-mid fusiform cortex responses were influenced by the structural similarity of prime-probe object pairings in the categorically related condition, increasing with the number of shared features. None of the cerebral regions activated in a previous fMRI study of the identity NP effect (de Zubicaray et al., 2006) showed similar activation during semantic NP, including the left anterolateral temporal cortex, a region considered critical for semantic processing. The results suggest that the identity and semantic NP effects differ with respect to their neural mechanisms, and the label "semantic NP" might be a misnomer. We conclude that the effect is most likely the result of competition between structurally similar category exemplars that determines the efficiency of object name retrieval.
Resumo:
Two monoclonal antibodies (mAb) CB268 and CII-C1 to type II collagen (CII) react with precisely the same conformational epitope constituted by the residues ARGLT on the three chains of the CII triple helix. The antibodies share structural similarity, with most differences in the complementarity determining region 3 of the heavy chain (HCDR3). The fine reactivity of these mAbs was investigated by screening two nonameric phage-displayed random peptide libraries. For each mAb, there were phage clones (phagotopes) that reacted strongly by ELISA only with the selecting mAb, and inhibited binding to CII only for that mAb, not the alternate mAb. Nonetheless, a synthetic peptide RRLPFGSQM corresponding to an insert from a highly reactive CII-C1-selected phagotope, which was unreactive (and non-inhibitory) with CB268, inhibited the reactivity of CB268 with CII. Most phage-displayed peptides contained a motif in the first part of the molecule that consisted of two basic residues adjacent to at least one hydrophobic residue (e.g. RRL or LRR), but the second portion of the peptides differed for the two mAbs. We predict that conserved CDR sequences interact with the basic-basic-hydrophobic motif, whereas non-conserved amino acids in the binding sites (especially HCDR3) interact with unique peptide sequences and limit cross-reactivity. The observation that two mAbs can react identically with a single epitope on one antigen (CII), but show no cross-reactivity when tested against a second (phagotope) indicates that microorganisms could exhibit mimics capable of initiating autoimmunity without this being evident from conventional assays.
Resumo:
Identifying the design features that impact construction is essential to developing cost effective and constructible designs. The similarity of building components is a critical design feature that affects method selection, productivity, and ultimately construction cost and schedule performance. However, there is limited understanding of what constitutes similarity in the design of building components and limited computer-based support to identify this feature in a building product model. This paper contributes a feature-based framework for representing and reasoning about component similarity that builds on ontological modelling, model-based reasoning and cluster analysis techniques. It describes the ontology we developed to characterize component similarity in terms of the component attributes, the direction, and the degree of variation. It also describes the generic reasoning process we formalized to identify component similarity in a standard product model based on practitioners' varied preferences. The generic reasoning process evaluates the geometric, topological, and symbolic similarities between components, creates groupings of similar components, and quantifies the degree of similarity. We implemented this reasoning process in a prototype cost estimating application, which creates and maintains cost estimates based on a building product model. Validation studies of the prototype system provide evidence that the framework is general and enables a more accurate and efficient cost estimating process.
Resumo:
The purpose of this research was to develop and test a multicausal model of the individual characteristics associated with academic success in first-year Australian university students. This model comprised the constructs of: previous academic performance, achievement motivation, self-regulatory learning strategies, and personality traits, with end-of-semester grades the dependent variable of interest. The study involved the distribution of a questionnaire, which assessed motivation, self-regulatory learning strategies and personality traits, to 1193 students at the start of their first year at university. Students' academic records were accessed at the end of their first year of study to ascertain their first and second semester grades. This study established that previous high academic performance, use of self-regulatory learning strategies, and being introverted and agreeable, were indicators of academic success in the first semester of university study. Achievement motivation and the personality trait of conscientiousness were indirectly related to first semester grades, through the influence they had on the students' use of self-regulatory learning strategies. First semester grades were predictive of second semester grades. This research provides valuable information for both educators and students about the factors intrinsic to the individual that are associated with successful performance in the first year at university.