906 resultados para Graphic heterogeneity
Resumo:
This exhibition catalogue essay provides an introduction to psychedelic culture during the postwar period. It describes the early use of LSD in psychiatric circles and its conception as a psychotomimetic substance. It then considers its use by literary figures such as Aldous Huxley and followers of the Beat Generation. Timothy Leary's role as an LSD philosopher is also explained as is the rise of the Hippies and the ensuing counterculture. This culture produced a range of cultural forms such as music, fashion, graphic design and other visual arts that were informed by hallucinations experienced under the influence of LSD. It concludes with a description of the end of the Hippie movement in the 1970s.
Resumo:
This article presents a two-stage analytical framework that integrates ecological crop (animal) growth and economic frontier production models to analyse the productive efficiency of crop (animal) production systems. The ecological crop (animal) growth model estimates "potential" output levels given the genetic characteristics of crops (animals) and the physical conditions of locations where the crops (animals) are grown (reared). The economic frontier production model estimates "best practice" production levels, taking into account economic, institutional and social factors that cause farm and spatial heterogeneity. In the first stage, both ecological crop growth and economic frontier production models are estimated to calculate three measures of productive efficiency: (1) technical efficiency, as the ratio of actual to "best practice" output levels; (2) agronomic efficiency, as the ratio of actual to "potential" output levels; and (3) agro-economic efficiency, as the ratio of "best practice" to "potential" output levels. Also in the first stage, the economic frontier production model identifies factors that determine technical efficiency. In the second stage, agro-economic efficiency is analysed econometrically in relation to economic, institutional and social factors that cause farm and spatial heterogeneity. The proposed framework has several important advantages in comparison with existing proposals. Firstly, it allows the systematic incorporation of all physical, economic, institutional and social factors that cause farm and spatial heterogeneity in analysing the productive performance of crop and animal production systems. Secondly, the location-specific physical factors are not modelled symmetrically as other economic inputs of production. Thirdly, climate change and technological advancements in crop and animal sciences can be modelled in a "forward-looking" manner. Fourthly, knowledge in agronomy and data from experimental studies can be utilised for socio-economic policy analysis. The proposed framework can be easily applied in empirical studies due to the current availability of ecological crop (animal) growth models, farm or secondary data, and econometric software packages. The article highlights several directions of empirical studies that researchers may pursue in the future.
Resumo:
This study explores the relationship between new venture team composition and new venture persistence and performance over time. We examine the team characteristics of a 5-year panel study of 202 new venture teams and new venture performance. Our study makes two contributions. First, we extend earlier research concerning homophily theories of the prevalence of homogeneous teams. Using structural event analysis we demonstrate that team members’ start-up experience is important in this context. Second, we attempt to reconcile conflicting evidence concerning the influence of team homogeneity on performance by considering the element of time. We hypothesize that higher team homogeneity is positively related to short term outcomes, but is less effective in the longer term. Our results confirm a difference over time. We find that more homogeneous teams are less likely to be higher performing in the long term. However, we find no relationship between team homogeneity and short-term performance outcomes.
Resumo:
There is continuing debate regarding the psychometric properties of self-report measures of behaviour, particularly in road safety research. Practical considerations often preclude the use of objective assessments, leading to reliance on self-report measures. Acknowledging that such measures are likely to remain commonly used, this pilot project sought not to argue whether self-report measures should continue to be used, but to explore factors associated with how individuals respond to self-reported speeding measures. This paper reports preliminary findings from a qualitative study (focus groups and in-depth interviews) conducted with licensed drivers to explore the operational utility of self-reported speeding behaviour measures. Drawing upon concepts from the Theory of Planned Behaviour (TPB; Ajzen, 1991) and Agency Theory (Bandura, 2001), we identified four dimensions of self-reported speeding: including timeframe, speed zone, degree over the speed limit and, overall frequency of the behaviour, and examined participants’ perceptions of the operational utility of these factors. Issues related to comprehensibility, perceived accuracy, response format and layout were also explored. Results indicated that: heterogeneity in the timeframe of behavioural reflections suggests a need to provide a set timeframe for participants to consider when thinking about their previous speeding behaviour; response categories and formats should be carefully considered to ensure the most accurate representations of the frequency and degree of speeding are captured; the need to clearly articulate “low-level” speeding on self-report measures; and, that self-reports of speeding behaviour are typically context-irrelevant unless stipulated in the question. Limitations and directions for further research are discussed.
Resumo:
Partition of heavy metals between particulate and dissolve fraction of stormwater primarily depends on the adsorption characteristics of solids particles. Moreover, the bioavailability of heavy metals is also influenced by the adsorption behaviour of solids. However, due to the lack of fundamental knowledge in relation to the heavy metals adsorption processes of road deposited solids, the effectiveness of stormwater management strategies can be limited. The research study focused on the investigation of the physical and chemical parameters of solids on urban road surfaces and, more specifically, on heavy metal adsorption to solids. Due to the complex nature of heavy metal interaction with solids, a substantial database was generated through a series of field investigations and laboratory experiments. The study sites for the build-up pollutant sample collection were selected from four urbanised suburbs located in a major river catchment. Sixteen road sites were selected from these suburbs and represented typical industrial, commercial and residential land uses. Build-up pollutants were collected using a wet and dry vacuum collection technique which was specially designed to improve fine particle collection. Roadside soil samples were also collected from each suburb for comparison with the road surface solids. The collected build-up solids samples were separated into four particle size ranges and tested for a range of physical and chemical parameters. The solids build-up on road surfaces contained a high fraction (70%) of particles smaller than 150ìm, which are favourable for heavy metal adsorption. These solids particles predominantly consist of soil derived minerals which included quartz, albite, microcline, muscovite and chlorite. Additionally, a high percentage of amorphous content was also identified in road deposited solids. In comparing the mineralogical data of surrounding soil and road deposited solids, it was found that about 30% of the solids consisted of particles generated from traffic related activities on road surfaces. Significant difference in mineralogical composition was noted in different particle sizes of build-up solids. Fine solids particles (<150ìm) consisted of a clayey matrix and high amorphous content (in the region of 40%) while coarse particles (>150ìm) consisted of a sandy matrix at all study sites, with about 60% quartz content. Due to these differences in mineralogical components, particles larger than and smaller than 150ìm had significant differences in their specific surface area (SSA) and effective cation exchange capacity (ECEC). These parameters, in turn, exert a significant influence on heavy metal adsorption. Consequently, heavy metal content in >150ìm particles was lower than in the case of fine particles. The particle size range <75ìm had the highest heavy metal content, corresponding with its high clay forming minerals, high organic matter and low quartz content which increased the SSA, ECEC and the presence of Fe, Al and Mn oxides. The clay forming minerals, high organic matter and Fe, Al and Mn oxides create distinct groups of charge sites on solids surfaces and exhibit different adsorption mechanisms and bond strength, between heavy metal elements and charge sites. Therefore, the predominance of these factors in different particle sizes leads to different heavy metal adsorption characteristics. Heavy metals show preference for association with clay forming minerals in fine solids particles, whilst in coarse particles heavy metals preferentially associate with organic matter. Although heavy metal adsorption to amorphous material is very low, the heavy metals embedded in traffic related materials have a potential impact on stormwater quality.Adsorption of heavy metals is not confined to an individual type of charge site in solids, whereas specific heavy metal elements show preference for adsorption to several different types of charge sites in solids. This is attributed to the dearth of preferred binding sites and the inability to reach the preferred binding sites due to competition between different heavy metal species. This confirms that heavy metal adsorption is significantly influenced by the physical and chemical parameters of solids that lead to a heterogeneity of surface charge sites. The research study highlighted the importance of removal of solids particles from stormwater runoff before they enter into receiving waters to reduce the potential risk posed by the bioavailability of heavy metals. The bioavailability of heavy metals not only results from the easily mobile fraction bound to the solids particles, but can also occur as a result of the dissolution of other forms of bonds by chemical changes in stormwater or microbial activity. Due to the diversity in the composition of the different particle sizes of solids and the characteristics and amount of charge sites on the particle surfaces, investigations using bulk solids are not adequate to gain an understanding of the heavy metal adsorption processes of solids particles. Therefore, the investigation of different particle size ranges is recommended for enhancing stormwater quality management practices.
Resumo:
Background In contrast to pluripotent embryonic stem cells, adult stem cells have been considered to be multipotent, being somewhat more restricted in their differentiation capacity and only giving rise to cell types related to their tissue of origin. Several studies, however, have reported that bone marrow-derived mesenchymal stromal cells (MSCs) are capable of transdifferentiating to neural cell types, effectively crossing normal lineage restriction boundaries. Such reports have been based on the detection of neural-related proteins by the differentiated MSCs. In order to assess the potential of human adult MSCs to undergo true differentiation to a neural lineage and to determine the degree of homogeneity between donor samples, we have used RT-PCR and immunocytochemistry to investigate the basal expression of a range of neural related mRNAs and proteins in populations of non-differentiated MSCs obtained from 4 donors. Results The expression analysis revealed that several of the commonly used marker genes from other studies like nestin, Enolase2 and microtubule associated protein 1b (MAP1b) are already expressed by undifferentiated human MSCs. Furthermore, mRNA for some of the neural-related transcription factors, e.g. Engrailed-1 and Nurr1 were also strongly expressed. However, several other neural-related mRNAs (e.g. DRD2, enolase2, NFL and MBP) could be identified, but not in all donor samples. Similarly, synaptic vesicle-related mRNA, STX1A could only be detected in 2 of the 4 undifferentiated donor hMSC samples. More significantly, each donor sample revealed a unique expression pattern, demonstrating a significant variation of marker expression. Conclusion The present study highlights the existence of an inter-donor variability of expression of neural-related markers in human MSC samples that has not previously been described. This donor-related heterogeneity might influence the reproducibility of transdifferentiation protocols as well as contributing to the ongoing controversy about differentiation capacities of MSCs. Therefore, further studies need to consider the differences between donor samples prior to any treatment as well as the possibility of harvesting donor cells that may be inappropriate for transplantation strategies.
Resumo:
The use of adherent monolayer cultures have produced many insights into melanoma cell growth and differentiation, but often novel therapeutics demonstrated to act on these cells are not active in vivo. It is imperative that new methods of growing melanoma cells that reflect growth in vivo are investigated. To this end, a range of human melanoma cell lines passaged as adherent cultures or induced to form melanoma spheres (melanospheres) in stem cell media have been studied to compare cellular characteristics and protein expression. Melanoma spheres and tumours grown from cell lines as mouse xenografts had increased heterogeneity when compared with adherent cells and 3D-spheroids in agar (aggregates). Furthermore, cells within the melanoma spheres and mouse xenografts each displayed a high level of reciprocal BRN2 or MITF expression, which matched more closely the pattern seen in human melanoma tumours in situ, rather than the propensity for co-expression of these important melanocytic transcription factors seen in adherent cells and 3D-spheroids. Notably, when the levels of the BRN2 and MITF proteins were each independently repressed using siRNA treatment of adherent melanoma cells, members of the NOTCH pathway responded by decreasing or increasing expression, respectively. This links BRN2 as an activator, and conversely, MITF as a repressor of the NOTCH pathway in melanoma cells. Loss of the BRN2-MITF axis in antisense-ablated cell lines decreased the melanoma sphere-forming capability, cell adhesion during 3D-spheroid formation and invasion through a collagen matrix. Combined, this evidence suggests that the melanoma sphere-culture system induces subpopulations of cells that may more accurately portray the in vivo disease, than the growth as adherent melanoma cells.
Resumo:
Exhibited at The Fashioning the Future Awards Showcase exhibition Fashioning the Future Awards is the leading international cross-disciplinary platform for celebrating innovative initiatives towards fashion design for sustainability, its development and communication. The 2011 awards are a showcase for exceptional work that celebrates ‘Unique’ ways to create our futures. Fashioning the Future is designed and coordinated by the Centre for Sustainable Fashion at London College of Fashion. Unique Enterprise Award The Unique Enterprise Award was offered for the consideration of the opportunities that arise from the necessity to solve the issues around water, waste, wellbeing, energy, equality and biodiversity. Winner Alice Payne According to Alice Payne there is no one-size-fits-all approach to creating a sustainable fashion system. Existing companies will need to evolve, change the way they design and produce garments, offer services rather than products, and engage with the end user to consider the end of life and future lives of their garments. The ThinkLifecycle content management system (CMS) acts as a bridge between existing industry practices and new, redirected practice in which sustainability is at the forefront of commercial thinking. Its chief aim is to embed lifecycle thinking within a company at a daily, operational level.
Resumo:
The Learning by Design Workshop Program 2010, a part of the Queensland Government Unlimited: Designing for the Asia Pacific Event Program, was a one-day professional development design thinking workshop run on October 9, 2011 at The Edge, State Library of Queensland for self-selected public and private secondary school teachers from the subject areas of Visual Art, Graphics and Industrial Technology and Design. Participants were drawn from a database of Brisbane and regional Queensland schools from the goDesign and Living City Workshop Programs. It aimed to generate leadership within schools for design-led education and creative thinking and give teachers a rare opportunity to work with professional designers to generate future strategies for design-based learning. Teachers were introduced to the concept of design thinking in education by international keynote speakers CJ Lim (Studio 8 Architects) and Jeb Brugmann (The Next Practice), national speaker Oliver Freeman (NevilleFreeman Agency) and three Queensland speakers, Alexander Loterztain, David Williams and Keith Holledge. Inspired by the Unlimited showcase exhibition Make Change: Design Thinking in Action and ‘Idea Starters’/teaching resources provided, teachers worked with a professional designer (from a discipline of architecture, interior design, industrial design, urban design, graphic design or landscape architecture) in ten random teams, to generate optimistic ideas for the Ideal City of tomorrow, each considering a theme – Food, Water, Transport, Ageing, Growth, Employment, Shelter, Health, Education and Energy. They then discussed how this process could be best activated and expanded on to build interest and knowledge in design thinking in the classroom. Assisted by illustrators, the teams prepared a visual presentation of their ideas and process from art materials provided. The workshop culminated in a video-taped interactive design charette to the larger group, which is intended to be utilised as a toolkit and praxis for teachers as part of the State Library of Queensland Design Minds Website Project.
Resumo:
The Generation Workshop Program 2010, a part of the Queensland Government Unlimited: Designing for the Asia Pacific Event Program, consisted of two one-day intensive design thinking workshops run on October 7-8, 2011 at The Edge, State Library of Queensland, for 100 senior secondary students and 20 secondary teachers self-selected from the subject areas of Visual Art, Graphics and Industrial Technology and Design. Participants were drawn from a database of Brisbane and regional Queensland private and public schools from the goDesign and Living City Workshop Programs. The workshop aimed to facilitate awareness in young people of the role of design in society and the value of design thinking skills in solving complex problems facing the Asia Pacific Region, and to inspire the generation of strategies for our future cities. It also aimed to encourage the collaboration of professional designers with secondary schools to inspire post-secondary pathways and idea generation for education. Inspired by international and national speakers Bunker Roy (Barefoot College) and Hael Kobayashi (Associate Producer on "Happy Feet" film for Australia's Animal Logic), the Unlimited showcase exhibition Make Change: Design Thinking in Action and ‘Idea Starters’/teaching resources provided, students worked with a teacher in ten random teams, to generate optimistic strategies for the Ideal City of tomorrow, each considering a theme – Food, Water, Transport, Ageing, Growth, Employment, Shelter, Health, Education and Energy. Each team of 6 was led by a professional designer (from the discipline of architecture, interior design, industrial design, urban design, graphic design or landscape architecture) who was a catalyst for driving the student creative thinking process. Assisted by illustrators, the teams prepared a visual presentation of their idea from art materials provided. The workshop culminated in a video-taped interactive design chatter to the larger group, which will be utilised as a toolkit and praxis for teachers as part of the State Library of Queensland Design Minds Project. Photos of student design work were published on the Unlimited website.
Resumo:
The annual YODEX (Young Designers Exhibition) in Taipei as the largest student design show in Asia presents a substantial opportunity as a profiling event for QUT. In 2011 an interactive and highly engaging QUT exhibition ensured direct communication with participants and first hand exposure to innovative design approaches.
Resumo:
An interactive installation with full body interface, digital projection, multi-touch sensitive screen surfaces, interactive 3D gaming software, motorised dioramas, 4.1 spatial sound & new furniture forms - investigating the cultural dimensions of sustainability through the lens of 'time'. “Time is change, time is finitude. Humans are a finite species. Every decision we make today brings that end closer, or alternatively pushes it further away. Nothing can be neutral”. Tony Fry DETAILS: Finitude (Mallee:Time) is a major new media/sculptural hybrid work premiered in 2011 in version 1 at the Ka-rama Motel for the Mildura Palimpsest #8 ('Collaborators and Saboteurs'). Each participant/viewer lies comfortably on their back on the double bed of Room 22. Directly above them, supported by a wooden structure, not unlike a house frame, is a semi-transparent Perspex screen that displays projected 3D imagery and is simultaneously sensitive to the lightest of finger touches. Depending upon the ever changing qualities of the projected image on this screen the participant can see through its surface to a series of physical dioramas suspended above, lit by subtle LED spotlighting. This diorama consists of a slowly rotating series of physical environments, which also include several animatronic components, allowing the realtime composition of whimsical ‘landscapes’ of both 'real' and 'virtual' media. Through subtle, non-didactic touch-sensitive interactivity the participant then has influence over both the 3D graphic imagery, the physical movements of the diorama and the 4.1 immersive soundscape, creating an uncanny blend of physical and virtual media. Five speakers positioned around the room deliver a rich interactive soundscape that responds both audibly and physically to interactions. VERSION 1, CONTEXT/THEORY: Finitude (Mallee: Time) is Version 1 of a series of presentations during 2012-14. This version has been inspired through a series of recent visits and residencies in the SW Victoria Mallee country. Further drawing on recent writings by post colonial author Paul Carter, the work is envisaged as an evolving ‘personal topography’ of place-discovery. By contrasting and melding readily available generalisations of the Mallee regions’ rational surfaces, climatic maps and ecological systems with what Carter calls “a fine capillary system of interconnected words, places, memories and sensations” generated through my own idiosyncratic research processes, Finitude (Mallee Time) invokes a “dark writing” of place through outside eyes - an approach that avoids concentration upon what 'everyone else knows', to instead imagine and develop a sense how things might be. This basis in re-imagining and re-invention becomes the vehicle for the work’s more fundamental intention - as a meditative re-imagination of 'time' (and region) as finite resources: Towards this end, every object, process and idea in the work is re-thought as having its own ‘time component’ or ‘residue’ that becomes deposited into our 'collective future'. Thought this way Finitude (Mallee Time) suggests the poverty of predominant images of time as ‘mechanism’ to instead envisage time as a plastic cyclical medium that we can each choose to ‘give to’ or ‘take away from’ our future. Put another way - time has become finitude.
Resumo:
Complex networks have been studied extensively due to their relevance to many real-world systems such as the world-wide web, the internet, biological and social systems. During the past two decades, studies of such networks in different fields have produced many significant results concerning their structures, topological properties, and dynamics. Three well-known properties of complex networks are scale-free degree distribution, small-world effect and self-similarity. The search for additional meaningful properties and the relationships among these properties is an active area of current research. This thesis investigates a newer aspect of complex networks, namely their multifractality, which is an extension of the concept of selfsimilarity. The first part of the thesis aims to confirm that the study of properties of complex networks can be expanded to a wider field including more complex weighted networks. Those real networks that have been shown to possess the self-similarity property in the existing literature are all unweighted networks. We use the proteinprotein interaction (PPI) networks as a key example to show that their weighted networks inherit the self-similarity from the original unweighted networks. Firstly, we confirm that the random sequential box-covering algorithm is an effective tool to compute the fractal dimension of complex networks. This is demonstrated on the Homo sapiens and E. coli PPI networks as well as their skeletons. Our results verify that the fractal dimension of the skeleton is smaller than that of the original network due to the shortest distance between nodes is larger in the skeleton, hence for a fixed box-size more boxes will be needed to cover the skeleton. Then we adopt the iterative scoring method to generate weighted PPI networks of five species, namely Homo sapiens, E. coli, yeast, C. elegans and Arabidopsis Thaliana. By using the random sequential box-covering algorithm, we calculate the fractal dimensions for both the original unweighted PPI networks and the generated weighted networks. The results show that self-similarity is still present in generated weighted PPI networks. This implication will be useful for our treatment of the networks in the third part of the thesis. The second part of the thesis aims to explore the multifractal behavior of different complex networks. Fractals such as the Cantor set, the Koch curve and the Sierspinski gasket are homogeneous since these fractals consist of a geometrical figure which repeats on an ever-reduced scale. Fractal analysis is a useful method for their study. However, real-world fractals are not homogeneous; there is rarely an identical motif repeated on all scales. Their singularity may vary on different subsets; implying that these objects are multifractal. Multifractal analysis is a useful way to systematically characterize the spatial heterogeneity of both theoretical and experimental fractal patterns. However, the tools for multifractal analysis of objects in Euclidean space are not suitable for complex networks. In this thesis, we propose a new box covering algorithm for multifractal analysis of complex networks. This algorithm is demonstrated in the computation of the generalized fractal dimensions of some theoretical networks, namely scale-free networks, small-world networks, random networks, and a kind of real networks, namely PPI networks of different species. Our main finding is the existence of multifractality in scale-free networks and PPI networks, while the multifractal behaviour is not confirmed for small-world networks and random networks. As another application, we generate gene interactions networks for patients and healthy people using the correlation coefficients between microarrays of different genes. Our results confirm the existence of multifractality in gene interactions networks. This multifractal analysis then provides a potentially useful tool for gene clustering and identification. The third part of the thesis aims to investigate the topological properties of networks constructed from time series. Characterizing complicated dynamics from time series is a fundamental problem of continuing interest in a wide variety of fields. Recent works indicate that complex network theory can be a powerful tool to analyse time series. Many existing methods for transforming time series into complex networks share a common feature: they define the connectivity of a complex network by the mutual proximity of different parts (e.g., individual states, state vectors, or cycles) of a single trajectory. In this thesis, we propose a new method to construct networks of time series: we define nodes by vectors of a certain length in the time series, and weight of edges between any two nodes by the Euclidean distance between the corresponding two vectors. We apply this method to build networks for fractional Brownian motions, whose long-range dependence is characterised by their Hurst exponent. We verify the validity of this method by showing that time series with stronger correlation, hence larger Hurst exponent, tend to have smaller fractal dimension, hence smoother sample paths. We then construct networks via the technique of horizontal visibility graph (HVG), which has been widely used recently. We confirm a known linear relationship between the Hurst exponent of fractional Brownian motion and the fractal dimension of the corresponding HVG network. In the first application, we apply our newly developed box-covering algorithm to calculate the generalized fractal dimensions of the HVG networks of fractional Brownian motions as well as those for binomial cascades and five bacterial genomes. The results confirm the monoscaling of fractional Brownian motion and the multifractality of the rest. As an additional application, we discuss the resilience of networks constructed from time series via two different approaches: visibility graph and horizontal visibility graph. Our finding is that the degree distribution of VG networks of fractional Brownian motions is scale-free (i.e., having a power law) meaning that one needs to destroy a large percentage of nodes before the network collapses into isolated parts; while for HVG networks of fractional Brownian motions, the degree distribution has exponential tails, implying that HVG networks would not survive the same kind of attack.
Resumo:
With the growing number of XML documents on theWeb it becomes essential to effectively organise these XML documents in order to retrieve useful information from them. A possible solution is to apply clustering on the XML documents to discover knowledge that promotes effective data management, information retrieval and query processing. However, many issues arise in discovering knowledge from these types of semi-structured documents due to their heterogeneity and structural irregularity. Most of the existing research on clustering techniques focuses only on one feature of the XML documents, this being either their structure or their content due to scalability and complexity problems. The knowledge gained in the form of clusters based on the structure or the content is not suitable for reallife datasets. It therefore becomes essential to include both the structure and content of XML documents in order to improve the accuracy and meaning of the clustering solution. However, the inclusion of both these kinds of information in the clustering process results in a huge overhead for the underlying clustering algorithm because of the high dimensionality of the data. The overall objective of this thesis is to address these issues by: (1) proposing methods to utilise frequent pattern mining techniques to reduce the dimension; (2) developing models to effectively combine the structure and content of XML documents; and (3) utilising the proposed models in clustering. This research first determines the structural similarity in the form of frequent subtrees and then uses these frequent subtrees to represent the constrained content of the XML documents in order to determine the content similarity. A clustering framework with two types of models, implicit and explicit, is developed. The implicit model uses a Vector Space Model (VSM) to combine the structure and the content information. The explicit model uses a higher order model, namely a 3- order Tensor Space Model (TSM), to explicitly combine the structure and the content information. This thesis also proposes a novel incremental technique to decompose largesized tensor models to utilise the decomposed solution for clustering the XML documents. The proposed framework and its components were extensively evaluated on several real-life datasets exhibiting extreme characteristics to understand the usefulness of the proposed framework in real-life situations. Additionally, this research evaluates the outcome of the clustering process on the collection selection problem in the information retrieval on the Wikipedia dataset. The experimental results demonstrate that the proposed frequent pattern mining and clustering methods outperform the related state-of-the-art approaches. In particular, the proposed framework of utilising frequent structures for constraining the content shows an improvement in accuracy over content-only and structure-only clustering results. The scalability evaluation experiments conducted on large scaled datasets clearly show the strengths of the proposed methods over state-of-the-art methods. In particular, this thesis work contributes to effectively combining the structure and the content of XML documents for clustering, in order to improve the accuracy of the clustering solution. In addition, it also contributes by addressing the research gaps in frequent pattern mining to generate efficient and concise frequent subtrees with various node relationships that could be used in clustering.