9 resultados para text analytic approaches
em Doria (National Library of Finland DSpace Services) - National Library of Finland, Finland
Resumo:
Artikkeli pohjautuu kirjoittajan väitöstutkimukseen The problem of equivalence in translating texts in international reading literacy studies : a text analytic study of three English and Finnish texts used in the PISA 2000 reading texts (Jyväskylän yliopisto 2007).
Resumo:
Biomedical research is currently facing a new type of challenge: an excess of information, both in terms of raw data from experiments and in the number of scientific publications describing their results. Mirroring the focus on data mining techniques to address the issues of structured data, there has recently been great interest in the development and application of text mining techniques to make more effective use of the knowledge contained in biomedical scientific publications, accessible only in the form of natural human language. This thesis describes research done in the broader scope of projects aiming to develop methods, tools and techniques for text mining tasks in general and for the biomedical domain in particular. The work described here involves more specifically the goal of extracting information from statements concerning relations of biomedical entities, such as protein-protein interactions. The approach taken is one using full parsing—syntactic analysis of the entire structure of sentences—and machine learning, aiming to develop reliable methods that can further be generalized to apply also to other domains. The five papers at the core of this thesis describe research on a number of distinct but related topics in text mining. In the first of these studies, we assessed the applicability of two popular general English parsers to biomedical text mining and, finding their performance limited, identified several specific challenges to accurate parsing of domain text. In a follow-up study focusing on parsing issues related to specialized domain terminology, we evaluated three lexical adaptation methods. We found that the accurate resolution of unknown words can considerably improve parsing performance and introduced a domain-adapted parser that reduced the error rate of theoriginal by 10% while also roughly halving parsing time. To establish the relative merits of parsers that differ in the applied formalisms and the representation given to their syntactic analyses, we have also developed evaluation methodology, considering different approaches to establishing comparable dependency-based evaluation results. We introduced a methodology for creating highly accurate conversions between different parse representations, demonstrating the feasibility of unification of idiverse syntactic schemes under a shared, application-oriented representation. In addition to allowing formalism-neutral evaluation, we argue that such unification can also increase the value of parsers for domain text mining. As a further step in this direction, we analysed the characteristics of publicly available biomedical corpora annotated for protein-protein interactions and created tools for converting them into a shared form, thus contributing also to the unification of text mining resources. The introduced unified corpora allowed us to perform a task-oriented comparative evaluation of biomedical text mining corpora. This evaluation established clear limits on the comparability of results for text mining methods evaluated on different resources, prompting further efforts toward standardization. To support this and other research, we have also designed and annotated BioInfer, the first domain corpus of its size combining annotation of syntax and biomedical entities with a detailed annotation of their relationships. The corpus represents a major design and development effort of the research group, with manual annotation that identifies over 6000 entities, 2500 relationships and 28,000 syntactic dependencies in 1100 sentences. In addition to combining these key annotations for a single set of sentences, BioInfer was also the first domain resource to introduce a representation of entity relations that is supported by ontologies and able to capture complex, structured relationships. Part I of this thesis presents a summary of this research in the broader context of a text mining system, and Part II contains reprints of the five included publications.
Resumo:
My presupposition, that learning at some level deals with life praxis, is expressed in four metaphors: space, time, fable and figure. Relations between learning,knowledge building and meaning making are linked to the concept of personal knowledge. I present a two part study of learning as text in a drama pedagogical rooted reading where learning is framed as the ongoing event, and knowledge, as the product of previous processes, is framed as culturally formed utterances. A frame analysis model is constructed as a topological guide for relations between the two concepts learning and knowledge. It visualises an aesthetic understanding, rooted in drama pedagogical comprehension. Insight and perception are linked in an inner relationship that is neither external nor identical. This understanding expresses the movement "in between" connecting asymmetrical and nonlinear features of human endeavour and societal issues. The performability of bodily and oral participation in the learning event in a socio-cultural setting is analysed as a dialogised text. In an ethnographical case study I have gathered material with an interest for the particular. The empirical material is based on three problem based learning situations in a Polytechnic setting. The act of transformation in the polyphony of the event is considered as a turning point in the narrative employment. Negotiation and figuration in the situation form patterns of the space for improvisation (flow) and tensions at the boundaries (thresholds) which imply the logical structure of transformation. Learning as a dialogised text of "yes" and "no", of structure and play for the improvised, interrelate in that movement. It is related to both the syntagmic and the paradigmatic forms of thinking. In the philosophical study, forms of understanding are linked to the logical structure of transformation as a cultural issue. The classical rhetorical concepts of Logos, Pathos, Ethos and Mythos are connected to the multidimensional rationality of the human being. In the Aristotelian form of knowledge, phronesis,a logic structure of inquiry is recognised. The shifting of perspectives between approaches, the construction of knowledge as context and the human project of meaning making as a subtext, illuminates multiple layers of the learning text. In an argumentation that post-modern apprehension of knowledge, emphasising contextual and situational values, has an empowering impact on learning, I find pedagogical benefits. The dialogical perspective has opened lenses that manage to hold in aesthetic doubling the individual action of inquiry and the stage with its cultural tools in a three dimensional reading.
Resumo:
Choice of industrial development options and the relevant allocation of the research funds become more and more difficult because of the increasing R&D costs and pressure for shorter development period. Forecast of the research progress is based on the analysis of the publications activity in the field of interest as well as on the dynamics of its change. Moreover, allocation of funds is hindered by exponential growth in the number of publications and patents. Thematic clusters become more and more difficult to identify, and their evolution hard to follow. The existing approaches of research field structuring and identification of its development are very limited. They do not identify the thematic clusters with adequate precision while the identified trends are often ambiguous. Therefore, there is a clear need to develop methods and tools, which are able to identify developing fields of research. The main objective of this Thesis is to develop tools and methods helping in the identification of the promising research topics in the field of separation processes. Two structuring methods as well as three approaches for identification of the development trends have been proposed. The proposed methods have been applied to the analysis of the research on distillation and filtration. The results show that the developed methods are universal and could be used to study of the various fields of research. The identified thematic clusters and the forecasted trends of their development have been confirmed in almost all tested cases. It proves the universality of the proposed methods. The results allow for identification of the fast-growing scientific fields as well as the topics characterized by stagnant or diminishing research activity.
Resumo:
The present thesis deals with the reception of the celebrity profiles of the MTV3 news current affairs program "45 minuuttia". The target group of the research consists of young adults between the ages from 25 to 34 living in the Helsinki metropolitan area. The research is qualitative and it studies the target group's opinions of the program profiles by the means of a survey and a group interview. In the group interview, the interviewees were also presented sample clips of the program. The results of the survey were analyzed mainly quantitatively. The purpose of the survey results was to map the viewing habits of the target group. In analyzing the results of the group, the interview methods of feedback research and cultural audience research were used. The current thesis was a commissioned research, the purpose of which was to study how interested the young adults are in the program. In addition, the aim was also to find out possible new trends and approaches for the producer of the program. In contrary to the expectations, the reason for the low interest in the profiles seemed to be due to the approach taken to the topics, e.g. the profiled person. By the approach I mean how the person is visually presented or how the person is verbally described. Many people presented in the profiles were regarded as interesting, but at the same time, the way the stories were told was hoped to be more versatile. In addition, a more contemporary visual approach was hoped for. The results also verified the claim that the young adults in general are not interested in the current affair programmes. In addition, these results suggested that in order to obtain more precise information of the viewing preferences of the audience, a more thorough study should be conducted.