110 results for Museum conservation methods.
Abstract:
In this thesis, we develop theory and methods for computational data analysis. The problems in data analysis are approached from three perspectives: statistical learning theory, the Bayesian framework, and the information-theoretic minimum description length (MDL) principle. Contributions in statistical learning theory address the possibility of generalization to unseen cases, and regression analysis with partially observed data, with an application to mobile device positioning. In the second part of the thesis, we discuss so-called Bayesian network classifiers and show that they are closely related to logistic regression models. In the final part, we apply the MDL principle to tracing the history of old manuscripts and to noise reduction in digital signals.
Abstract:
In this thesis we present and evaluate two pattern-matching-based methods for answer extraction in textual question answering systems. A textual question answering system is a system that seeks answers to natural language questions from unstructured text. Textual question answering is an important research problem: as the amount of natural language text in digital format keeps growing, the need for novel methods for pinpointing important knowledge in vast textual databases becomes more and more urgent. We concentrate on developing methods for the automatic creation of answer extraction patterns. A new type of extraction pattern is also developed. The pattern-matching-based approach chosen is interesting because of its language and application independence. The answer extraction methods are developed in the framework of our own question answering system. Publicly available datasets in English are used as training and evaluation data for the methods. The techniques developed are based on the well-known methods of sequence alignment and hierarchical clustering. The similarity metric used is based on edit distance. The main conclusion of the research is that answer extraction patterns consisting of the most important words of the question and of the following information extracted from the answer context (plain words, part-of-speech tags, punctuation marks and capitalization patterns) can be used in the answer extraction module of a question answering system. This type of pattern and the two new methods for generating answer extraction patterns provide average results when compared to those produced by other systems using the same dataset. However, most answer extraction methods in the question answering systems tested with the same dataset are both hand-crafted and based on a system-specific and fine-grained question classification. The new methods developed in this thesis require no manual creation of answer extraction patterns.
As a source of knowledge, they require a dataset of sample questions and answers, as well as a set of text documents that contain answers to most of the questions. The question classification used in the training data is a standard one and provided already in the publicly available data.
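The abstract says only that the similarity metric is "based on edit distance" and gives no formula. As a minimal sketch under that assumption, the following Python code computes the standard Levenshtein distance over token sequences and turns it into a normalized similarity. The function names and the normalization by the longer sequence length are illustrative choices, not details taken from the thesis.

```python
def edit_distance(a, b):
    """Levenshtein distance between two sequences (strings or token lists)."""
    # prev[j] holds the distance between the first i-1 items of a
    # and the first j items of b.
    prev = list(range(len(b) + 1))
    for i, x in enumerate(a, start=1):
        curr = [i]  # distance from the first i items of a to an empty prefix
        for j, y in enumerate(b, start=1):
            substitution = prev[j - 1] + (0 if x == y else 1)
            deletion = prev[j] + 1
            insertion = curr[j - 1] + 1
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1]


def similarity(a, b):
    """Assumed normalization: 1.0 for identical sequences, 0.0 for disjoint ones."""
    longest = max(len(a), len(b))
    if longest == 0:
        return 1.0
    return 1.0 - edit_distance(a, b) / longest


# Two hypothetical answer contexts, already split into tokens.
context_1 = "was born in 1879 .".split()
context_2 = "was born in 1912 .".split()
score = similarity(context_1, context_2)
```

A pairwise similarity of this kind is the sort of input that hierarchical clustering needs, which may be why an edit-distance-based metric fits the approach described above.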
Abstract:
The Minimum Description Length (MDL) principle is a general, well-founded theoretical formalization of statistical modeling. The most important notion of MDL is the stochastic complexity, which can be interpreted as the shortest description length of a given sample of data relative to a model class. The exact definition of the stochastic complexity has gone through several evolutionary steps. The latest instantiation is based on the so-called Normalized Maximum Likelihood (NML) distribution, which has been shown to possess several important theoretical properties. However, applications of this modern version of MDL have been quite rare because of computational complexity problems: for discrete data, the definition of NML involves an exponential sum, and in the case of continuous data, a multi-dimensional integral that is usually infeasible to evaluate or even approximate accurately. In this doctoral dissertation, we present mathematical techniques for computing NML efficiently for some model families involving discrete data. We also show how these techniques can be used to apply MDL in two practical applications: histogram density estimation and clustering of multi-dimensional data.
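To make the "exponential sum" concrete, here is a small Python sketch for the simplest discrete case, the Bernoulli model class. This is an illustrative textbook example and not one of the dissertation's techniques. The NML normalizer sums the maximized likelihood over all 2^n binary sequences of length n; grouping sequences by their number of ones reduces this to n + 1 terms. The function names are assumptions made for this sketch.

```python
import math


def bernoulli_max_likelihood(k, n):
    """Maximized likelihood of a binary sequence with k ones out of n."""
    if n == 0:
        return 1.0
    p = k / n  # maximum likelihood estimate of the Bernoulli parameter
    # Python evaluates 0.0 ** 0 as 1.0, which is the convention needed here.
    return (p ** k) * ((1 - p) ** (n - k))


def bernoulli_nml_normalizer(n):
    """Sum of maximized likelihoods over all 2**n sequences, grouped by count."""
    return sum(
        math.comb(n, k) * bernoulli_max_likelihood(k, n) for k in range(n + 1)
    )


def stochastic_complexity(bits):
    """NML code length in bits: -log2 of the normalized maximum likelihood."""
    n, k = len(bits), sum(bits)
    return -math.log2(bernoulli_max_likelihood(k, n)) + math.log2(
        bernoulli_nml_normalizer(n)
    )


length_in_bits = stochastic_complexity([1, 1, 0, 1, 1, 1, 0, 1])
```

For richer model families no such simple grouping is available in general, which is the computational obstacle the abstract refers to.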
Abstract:
Matrix decompositions, where a given matrix is represented as a product of two other matrices, are regularly used in data mining. Most matrix decompositions have their roots in linear algebra, but the needs of data mining are not always those of linear algebra. In data mining one needs to have results that are interpretable -- and what is considered interpretable in data mining can be very different from what is considered interpretable in linear algebra. The purpose of this thesis is to study matrix decompositions that directly address the issue of interpretability. An example is a decomposition of binary matrices where the factor matrices are assumed to be binary and the matrix multiplication is Boolean. The restriction to binary factor matrices increases interpretability -- factor matrices are of the same type as the original matrix -- and allows the use of Boolean matrix multiplication, which is often more intuitive than normal matrix multiplication with binary matrices. Several other decomposition methods are also described, and the computational complexity of computing them is studied together with the hardness of approximating the related optimization problems. Based on these studies, algorithms for constructing the decompositions are proposed. Constructing the decompositions turns out to be computationally hard, and the proposed algorithms are mostly based on various heuristics. Nevertheless, the algorithms are shown to be capable of finding good results in empirical experiments conducted with both synthetic and real-world data.
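The difference between Boolean and normal matrix multiplication can be shown with a short Python sketch. It only illustrates the two products and a simple reconstruction error; it does not implement any of the thesis's decomposition algorithms, and the function names and example matrices are made up for illustration.

```python
def boolean_product(B, C):
    """Boolean matrix product: entry (i, j) is the OR over k of B[i][k] AND C[k][j]."""
    inner, cols = len(C), len(C[0])
    return [
        [int(any(row[k] and C[k][j] for k in range(inner))) for j in range(cols)]
        for row in B
    ]


def integer_product(B, C):
    """Ordinary matrix product over the integers, for comparison."""
    inner, cols = len(C), len(C[0])
    return [
        [sum(row[k] * C[k][j] for k in range(inner)) for j in range(cols)]
        for row in B
    ]


def reconstruction_error(A, B, C):
    """Number of entries where A differs from the Boolean product of B and C."""
    approx = boolean_product(B, C)
    return sum(
        a != p for row_a, row_p in zip(A, approx) for a, p in zip(row_a, row_p)
    )


# Hypothetical binary factors: rows of C act as patterns, rows of B select them.
B = [[1, 0], [1, 1], [0, 1]]
C = [[1, 1, 0], [0, 1, 1]]
boolean_result = boolean_product(B, C)
integer_result = integer_product(B, C)
```

In this example the second row selects both patterns. The Boolean product keeps the result binary (`[1, 1, 1]`), while the ordinary product yields `[1, 2, 1]`, which is no longer a binary matrix.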
Abstract:
Ubiquitous computing is about making computers and computerized artefacts a pervasive part of our everyday lives, bringing more and more activities into the realm of information. The computationalization and informationalization of everyday activities increases not only our reach, efficiency and capabilities but also the amount and kinds of data gathered about us and our activities. In this thesis, I explore how information systems can be constructed so that they handle this personal data in a reasonable manner. The thesis provides two kinds of results: on the one hand, tools and methods for both the construction and the evaluation of ubiquitous and mobile systems; on the other hand, an evaluation of the privacy aspects of a ubiquitous social awareness system. The work emphasises real-world experiments as the most important way to study privacy. Additionally, the state of current information systems as regards data protection is studied. The tools and methods in this thesis consist of three distinct contributions. An algorithm for positioning in cellular networks is proposed that does not require the location information to be revealed beyond the user's terminal. A prototyping platform for the creation of context-aware ubiquitous applications, called ContextPhone, is described and released as open source. Finally, a set of methodological findings for the use of smartphones in social scientific field research is reported. A central contribution of this thesis is the set of pragmatic tools that allow other researchers to carry out experiments. The evaluation of the ubiquitous social awareness application ContextContacts covers both the usage of the system in general and an analysis of privacy implications. The usage of the system is analyzed, on the basis of several long-term field studies, in the light of how users make inferences about others from real-time contextual cues mediated by the system.
The analysis of privacy implications draws together the social psychological theory of self-presentation and research in privacy for ubiquitous computing, deriving a set of design guidelines for such systems. The main findings from these studies can be summarized as follows. The fact that ubiquitous computing systems gather more data about users can be used not only to study the use of such systems in an effort to create better systems but also, more generally, to study previously unstudied phenomena, such as the dynamic change of social networks. Systems that let people create new ways of presenting themselves to others can be fun for the users, but self-presentation requires several thoughtful design decisions that allow the manipulation of the image mediated by the system. Finally, the growing amount of computational resources available to the users can be used to allow them to use the data themselves, rather than just being passive subjects of data gathering.
Abstract:
Free and open source software development is an alternative to traditional software engineering as an approach to the development of complex software systems. It is a way of developing software based on geographically distributed teams of volunteers without an apparent central plan or traditional mechanisms of coordination. The purpose of this thesis is to summarize the current knowledge about free and open source software development and to explore the ways in which further understanding of it could be gained. The results of research in the field as well as the research methods are introduced and discussed. Adapting software process metrics to the context of free and open source software development is also illustrated, and the possibilities of using them as tools to validate other research are discussed.
Abstract:
This Ph.D. thesis, "Participation or Further Exclusion? Contestations over Forest Conservation and Control in the East Usambara Mountains, Tanzania", describes and analyses the shift in the prevailing discourse of forest and biodiversity conservation policies and strategies towards more participatory approaches in Tanzania, and the changes in the practices of resource control. I explore the scope for and limits to the different actors and groups who are considered to form the community to participate in resource control, in a specific historical and socio-economic context. I analyse whether, how and to what extent the targets of such participatory conservation interventions have been able to affect the formal rules and practices of resource control, and explore their different responses and discursive and other strategies in relation to conservation efforts. I approach the problematic through exploring certain participatory conservation interventions and related negotiations between the local farmers, government officials and the external actors in the case of two protected forest reserves in the southern part of the East Usambaras, Tanzania. The study area belongs to the Eastern Arc Mountains, which are valued globally and nationally for their high level of biodiversity and number of endemic and near-endemic species. The theoretical approach draws from theorising on power, participation and conservation in the anthropology of development and post-structuralist political ecology. The material was collected in three stages between 2003 and 2008 by using an ethnographic approach. I interviewed and observed the actors and their resource use and control practices at the local level, including the representatives of the villagers living close to the protected forests and the conservation agency, but also followed the selected processes and engaged with the non-local agencies involved in the conservation efforts in the East Usambaras.
In addition, the more recent processes of change and the actors' strategies in resource control were contextualised against the social and environmental history of the study area and the evolution of institutions of natural resource control. My findings indicate that the discourse of participation that has emerged in global conservation policy debate within the past three decades, and is being institutionalised in the national policies in many countries, including Tanzania, has shaped the practices of forest conservation in the East Usambaras, although in a fragmented and uneven way. An instrumental interpretation of participation, in which it is to serve the goals of improving the control of the forest and making it more acceptable and efficient, has prevailed among the governmental actors and conservation organisations. Yet, there is variation between the different projects and actors promoting participatory conservation regarding the goals and means of participation, e.g. to what extent the local people are to be involved in decision-making. The actors representing communities also have their diverse agendas, understandings and experiences regarding the rationality, outcomes and benefits of being involved in forest control, making the practices of control fluid. The elements of the exclusive conservation thinking and practices co-exist with the more recent participatory processes, and continue to shape the understandings and strategies of the actors involved in resource control. The ideas and narratives of the different discourses are reproduced and selectively used by the parties involved. The idea of forest conservation is not resisted as such by most of the actors at the local level; quite the opposite. However, the strict regulations and rules governing access to resources, such as valuable timber species, continue to be disputed by many.
Furthermore, the history of control, such as past injustices related to conservation and unfulfilled promises, undermines the participation of certain social groups in resource control and benefit sharing. This also creates controversies in the practices of conservation, and fuels conflicts regarding the establishment of new protected areas. In spite of this, the fact that the representatives of the communities have been invited to the arenas where information is shared, and principles and conditions of forest control and benefit sharing are discussed and partly decided upon, has created expectations among the participants, and opened up opportunities for some of the local actors to enhance their own, and sometimes wider, interests in relation to resource control and the related benefits. The local actors' experiences of previous government and other interventions strongly affect how they position themselves in relation to conservation interventions, and their responses and strategies. However, my findings also suggest, in a similar way to research conducted in some other protected areas, that the benefits of participation in conservation and resource control tend to accrue unevenly between different groups of local people, e.g. due to unequal access to information and differences in their initial resources and social position.
Abstract:
Forestry has influenced forest-dwelling organisms for centuries in Fennoscandia. For example, in Finland ca. 30% of the threatened species are threatened because of forestry. Nowadays forest management recommendations include practices aimed at maintaining biodiversity in harvesting, such as green-tree retention. However, the effects of these practices have been little studied. In variable retention, different numbers of trees are retained, varying from green-tree retention (at least a few live standing trees in clear-cuts) to thinning (only individual trees removed). I examined the responses of ground-dwelling spiders and carabid beetles to green-tree retention (with small and large tree groups), gap felling and thinning aimed at an uneven age structure of trees. The impacts of these harvesting methods were compared to those of clear-cutting and uncut controls. I aimed to test the hypothesis that retaining more trees positively affects populations of those species of spiders and carabids that were present before harvesting. The data come from two studies. First, spiders were collected with pitfall traps in south-central Finland in 1995 (pre-treatment) and 1998 (post-treatment) in order to examine the effects of clear-cutting, green-tree retention (with 0.01-0.02-ha sized tree groups), gap felling (with three 0.16-ha sized openings in a 1-ha stand), thinning aimed at an uneven age structure of trees, and uncut control. Second, spiders and carabids were caught with pitfall traps in eastern Finland in 1998-2001 (pre-treatment and three post-treatment years) in eleven 0.09-0.55-ha sized retention-tree groups and clear-cuts adjacent to them. Original spider and carabid assemblages were better maintained after harvests that retained more trees. Thinning maintained forest spiders well. However, gap felling and large retention-tree groups maintained some forest spider and carabid species in the short term, but negatively affected some species over time.
In contrast, the use of small retention-tree groups was associated with negative effects on forest spider populations. Studies are needed on the long-term effects of variable retention on terrestrial invertebrates, especially studies directed at defining appropriate retention patch size and at the importance of the structural diversity provided by variable retention for invertebrate populations. However, the aims of variable retention should be specified first. For example, are retention-tree groups planned to constitute life-boats or stepping-stones, or to create structural diversity? Does it suffice that some species are maintained, or do we want to preserve the most sensitive ones, and how are these best defined? Moreover, the ecological benefits and economic costs of modified logging methods should be compared to other approaches aimed at maintaining biodiversity.
Abstract:
During the last 10-15 years interest in mouse behavioural analysis has evolved considerably. The driving force is the development of molecular biological techniques that allow manipulation of the mouse genome by changing the expression of genes. Therefore, with some limitations it is possible to study how genes participate in the regulation of physiological functions and to create models explaining the genetic contribution to various pathological conditions. The first aim of our study was to establish a framework for behavioural phenotyping of genetically modified mice. We established a comprehensive battery of tests for the initial screening of mutant mice. These included tests for exploratory and locomotor activity, emotional behaviour, sensory functions, and cognitive performance. Our interest was in the behavioural patterns of common background strains used for genetic manipulations in mice. Additionally, we studied the behavioural effects of sex differences, test history, and individual housing. Our findings highlight the importance of careful consideration of genetic background for the analysis of mutant mice. It was evident that some backgrounds may mask or modify the behavioural phenotype of mutants and thereby lead to false positive or negative findings. Moreover, there is no universal strain that is equally suitable for all tests, and using different backgrounds allows one to address possible phenotype-modifying factors. We discovered that previous experience affected performance in several tasks. The most sensitive traits were exploratory and emotional behaviour, as well as motor and nociceptive functions. Therefore, it may be essential to repeat some of the tests in naïve animals to confirm the phenotype. Social isolation for a long period had strong effects on exploratory behaviour, but also on learning and memory.
All experiments revealed significant interactions between strain and environmental factors (test history or housing condition), indicating genotype-dependent effects of environmental manipulations. Several mutant line analyses utilize this information. For example, we studied mice overexpressing as well as those lacking the extracellular matrix protein heparin-binding growth-associated molecule (HB-GAM), and mice lacking N-syndecan (a receptor for HB-GAM). All mutant mice appeared to be fertile and healthy, without any apparent neurological or sensory defects. The lack of HB-GAM and N-syndecan, however, significantly reduced the learning capacity of the mice. On the other hand, overexpression of HB-GAM resulted in facilitated learning. Moreover, HB-GAM knockout mice displayed higher anxiety-like behaviour, whereas anxiety was reduced in HB-GAM overexpressing mice. Changes in hippocampal plasticity accompanied the behavioural phenotypes. We conclude that HB-GAM and N-syndecan are involved in the modulation of synaptic plasticity in the hippocampus and play a role in the regulation of anxiety- and learning-related behaviour.
Abstract:
During the past decades agricultural intensification has caused dramatic population declines in a wide range of taxa related to farmland habitats, including farmland birds. In this thesis, I studied how boreal farmland landscape characteristics and agricultural land use affect the abundance and diversity of farmland birds using extensive field data collected by territory mapping of breeding farmland birds in various parts of Finland. My results show that the area and openness of agricultural areas are key determinants of farmland bird abundance and distribution. A landscape composition with enough open farmland combined with key habitats such as farmyards and wetland is likely to provide essential prerequisites for the occurrence of a rich farmland avifauna. In Finland, the majority of large areas suitable for open habitat specialists are located in southern and western parts of the country. However, the diversity of the species with an unfavourable conservation status in Europe (SPECs) had notable hotspot areas in northern and north-western agricultural areas. I found that in boreal agroecosystems farmland birds favour fields with springtime vegetative cover, especially agricultural grasslands and set-asides. Hence, in the spring cereal dominated Finnish agroecosystems it is the absence of field vegetation that may limit populations of many farmland bird species. It is likely that the decrease of crops providing vegetative cover in the spring, such as permanent grasslands, cultivated grass, and autumn-sown cereals, has greatly contributed to the declines of Finnish farmland birds. Grass crops have persistently declined in Finland as a consequence of specialization in crop production and the large-scale decline in livestock husbandry. Small-scale non-crop habitats, especially ditches and ditch margins, are also important for many bird species in the Finnish agroecosystems, but have dramatically declined during the last decades. 
A major problem for farmland bird conservation in Finland is the conflict between landscape structure and agricultural management. Areas with mixed and cattle farming are virtually absent from the large agricultural plains of southern and south-western Finland, where the landscape structure is more likely to be favourable for rich farmland bird assemblages. On the other hand, mixed and cattle farming is still rather frequent in northern and central parts of the country, where the landscape structure is not suitable for many farmland specialist birds requiring open landscapes. My results provide useful guidelines for farmland bird conservation, and imply that considerable attention needs to be paid to landscape factors when selecting areas for various conservational management actions, such as agri-environment schemes. Actions promoting the abundance of set-asides, grass crops, and ditches would markedly benefit Finnish farmland bird populations. Organic farming may benefit farmland birds, but it is not clear how general its beneficial effect is in boreal agroecosystems. The most urgent action aiming to preserve farmland biodiversity would be to support re-introducing and sustaining cattle farming by environmental subsidies. This would be especially beneficial in the southern parts of Finland, where the landscape characteristics and abundance of agricultural areas are most suitable for farmland birds and where cattle farming is currently rare.
Abstract:
Ongoing habitat loss and fragmentation threaten much of the biodiversity that we know today. As such, conservation efforts are required if we want to protect biodiversity. Conservation budgets are typically tight, making the cost-effective selection of protected areas difficult. Therefore, reserve design methods have been developed to identify sets of sites that together represent the species of conservation interest in a cost-effective manner. To be able to select reserve networks, data on species distributions are needed. Such data are often incomplete, but species habitat distribution models (SHDMs) can be used to link the occurrence of the species at the surveyed sites to the environmental conditions at these locations (e.g. climatic, vegetation and soil conditions). The probability of the species occurring at unvisited locations is then predicted by the model, based on the environmental conditions of those sites. The spatial configuration of reserve networks is important, because habitat loss around reserves can influence the persistence of species inside the network. Since species differ in their requirements for network configuration, the spatial cohesion of networks needs to be species-specific. A way to account for species-specific requirements is to use spatial variables in SHDMs. Spatial SHDMs allow the evaluation of the effect of reserve network configuration on the probability of occurrence of the species inside the network. Even though reserves are important for conservation, they are not the only option available to conservation planners. To enhance or maintain habitat quality, restoration or maintenance measures are sometimes required. As a result, the number of conservation options per site increases. Currently available reserve selection tools do not, however, offer the ability to handle multiple, alternative options per site.
This thesis extends the existing methodology for reserve design by offering methods to identify cost-effective conservation planning solutions when multiple, alternative conservation options are available per site. Although restoration and maintenance measures are beneficial to certain species, they can be harmful to other species with different requirements. This introduces trade-offs between species when identifying which conservation action is best applied to which site. The thesis describes how the strength of such trade-offs can be identified, which is useful for assessing the consequences of conservation decisions regarding species priorities and budget. Furthermore, the results of the thesis indicate that spatial SHDMs can be successfully used to account for species-specific requirements for spatial cohesion, in the reserve selection (single-option) context as well as in the multi-option context. Accounting for the spatial requirements of multiple species and allowing for several conservation options is, however, complicated, due to trade-offs in species requirements. It is also shown that spatial SHDMs can be successfully used for gaining information on factors that drive a species' spatial distribution. Such information is valuable to conservation planning, as better knowledge of species requirements facilitates the design of networks for species persistence. The methods and results described in this thesis aim to improve species' probabilities of persistence by taking better account of species' habitat and spatial requirements. Many real-world conservation planning problems are characterised by a variety of conservation options related to protection, restoration and maintenance of habitat. Planning tools therefore need to be able to incorporate multiple conservation options per site, in order to continue the search for cost-effective conservation planning solutions. Simultaneously, the spatial requirements of species need to be considered.
The methods described in this thesis offer a starting point for combining these two relevant aspects of conservation planning.
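For orientation, the classic single-option reserve selection problem mentioned above (choose sites that together represent all target species at low cost) can be sketched in Python as a greedy heuristic. This is a generic textbook-style sketch and not the method developed in the thesis: it ignores spatial cohesion and multiple conservation options per site. The site names, costs and species lists are hypothetical.

```python
def greedy_reserve_selection(sites, targets):
    """Pick sites greedily by newly covered target species per unit cost.

    sites: dict mapping site name -> (cost, set of species present); costs > 0.
    targets: iterable of species that should be represented.
    Returns (chosen site names in selection order, species left uncovered).
    """
    uncovered = set(targets)
    chosen = []
    while uncovered:
        best_site, best_ratio = None, 0.0
        for name, (cost, species) in sites.items():
            if name in chosen:
                continue
            gain = len(species & uncovered)  # target species this site would add
            if gain and gain / cost > best_ratio:
                best_site, best_ratio = name, gain / cost
        if best_site is None:
            break  # no remaining site covers any uncovered species
        chosen.append(best_site)
        uncovered -= sites[best_site][1]
    return chosen, uncovered


# Hypothetical candidate sites: (acquisition cost, species recorded there).
candidate_sites = {
    "A": (4.0, {"lynx", "owl", "newt"}),
    "B": (1.0, {"owl"}),
    "C": (3.0, {"lynx", "newt"}),
    "D": (5.0, {"moth"}),
}
selected, missing = greedy_reserve_selection(
    candidate_sites, {"lynx", "owl", "newt", "moth"}
)
```

A greedy rule like this is fast but not guaranteed to find the cheapest network. Handling several alternative actions per site, as the thesis proposes, would require each site to carry more than one (cost, benefit) entry.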