929 results for Chinese bug textual data
Abstract:
To meet the increasing demands of complex inter-organizational processes and the demand for continuous innovation and internationalization, new forms of organisation are evidently being adopted, fostering more intensive collaboration processes and sharing of resources, in what can be called collaborative networks (Camarinha-Matos, 2006:03). Information and knowledge are crucial resources in collaborative networks, and their management is a fundamental process to optimize. Knowledge organisation and collaboration systems are thus important instruments for the success of collaborative networks of organisations, and they have been researched over the last decade in the areas of computer science, information science, management sciences, terminology and linguistics. Nevertheless, research in this area has not given much attention to multilingual contexts of collaboration, which pose specific and challenging problems. It is then clear that access to and representation of knowledge will happen more and more in a multilingual setting, which implies overcoming the difficulties inherent to the presence of multiple languages through processes such as the localization of ontologies. Although localization, like other processes that involve multilingualism, is a rather well-developed practice whose methodologies and tools are fruitfully employed by the language industry in the development and adaptation of multilingual content, it has not yet been sufficiently explored as an element of support to the development of knowledge representations, in particular ontologies, expressed in more than one language. Multilingual knowledge representation is then an open research area calling for cross-contributions from knowledge engineering, terminology, ontology engineering, cognitive sciences, computational linguistics, natural language processing, and management sciences.
This workshop brought together researchers interested in multilingual knowledge representation, in a multidisciplinary environment, to debate the possibilities of cross-fertilization between knowledge engineering, terminology, ontology engineering, cognitive sciences, computational linguistics, natural language processing, and management sciences applied to contexts where multilingualism continuously creates new and demanding challenges to current knowledge representation methods and techniques. In this workshop six papers dealing with different approaches to multilingual knowledge representation are presented, most of them describing tools, approaches and results obtained in the development of ongoing projects. In the first paper, Andrés Domínguez Burgos, Koen Kerremans and Rita Temmerman present a software module that is part of a workbench for terminological and ontological mining: Termontospider, a wiki crawler that aims to traverse Wikipedia optimally in search of domain-specific texts from which to extract terminological and ontological information. The crawler is part of a tool suite for automatically developing multilingual termontological databases, i.e. ontologically underpinned multilingual terminological databases. In this paper the authors describe the basic principles behind the crawler and summarize the research setting in which the tool is currently being tested. In the second paper, Fumiko Kano presents work comparing four feature-based similarity measures derived from cognitive sciences. The purpose of the comparative analysis presented by the author is to identify the potentially most effective model that can be applied for mapping independent ontologies in a culturally influenced domain. To that end, datasets based on standardized pre-defined feature dimensions and values, which are obtainable from the UNESCO Institute for Statistics (UIS), have been used for the comparative analysis of the similarity measures.
The purpose of the comparison is to verify the similarity measures against the objectively developed datasets. According to the author, the results demonstrate that the Bayesian Model of Generalization provides the most effective cognitive model for identifying the most similar corresponding concepts existing for a targeted socio-cultural community. In another presentation, Thierry Declerck, Hans-Ulrich Krieger and Dagmar Gromann present ongoing work and propose an approach to the automatic extraction of information from multilingual financial Web resources, in order to provide candidate terms for building ontology elements or instances of ontology concepts. The authors present an approach complementary to the direct localization/translation of ontology labels: terminologies are acquired by accessing and harvesting the multilingual Web presences of structured information providers in the field of finance. This leads to the detection of candidate terms in various multilingual sources in the financial domain that can be used not only as labels of ontology classes and properties but also for the possible generation of (multilingual) domain ontologies themselves. In the next paper, Manuel Silva, António Lucas Soares and Rute Costa claim that, despite the availability of tools, resources and techniques aimed at the construction of ontological artifacts, developing a shared conceptualization of a given reality still raises questions about the principles and methods that support the initial phases of conceptualization. These questions become, according to the authors, more complex when the conceptualization occurs in a multilingual setting.
To tackle these issues the authors present a collaborative platform, conceptME, where terminological and knowledge representation processes support domain experts throughout a conceptualization framework, allowing the inclusion of multilingual data as a way to promote knowledge sharing, enhance conceptualization and support a multilingual ontology specification. In another presentation, Frieda Steurs and Hendrik J. Kockaert present TermWise, a large project dealing with legal terminology and phraseology for the Belgian public services, i.e. the translation office of the ministry of justice. The project aims at developing an advanced tool that includes expert knowledge in the algorithms that extract specialized language from textual data (legal documents), and its outcome is a knowledge database containing Dutch/French equivalents for legal concepts, enriched with the phraseology related to the terms under discussion. Finally, Deborah Grbac, Luca Losito, Andrea Sada and Paolo Sirito report on the preliminary results of a pilot project currently ongoing at UCSC Central Library, where they propose to adapt the model used by translators working within European Union Institutions to subject librarians employed in large and multilingual Academic Institutions. The authors are using User Experience (UX) Analysis in order to provide subject librarians with visual support, by means of “ontology tables” depicting conceptual linking and connections of words with concepts presented according to their semantic and linguistic meaning. The organizers hope that the selection of papers presented here will be of interest to a broad audience, and will be a starting point for further discussion and cooperation.
Abstract:
This final degree project aims to meet an outreach need in the world of caving by providing the caving community with a web-based application for cooperatively publishing cave reports that incorporate textual data, photographs, cave surveys and location maps. The project was developed as a Final Degree Project in the J2EE area of the Technical Engineering degree in Management Informatics. Its implementation is therefore based on Object-Oriented Programming with the Java language and J2EE technologies.
Abstract:
Bringing health professionals closer to those who need their services has been a topic of interest for health psychology since it began as a discipline. In the context of the information and knowledge society, a new setting has emerged in which these two groups act, and it needs to be understood. Various proposals have been launched to this end, but the one presented by the Psicologia de la Salut i Xarxa (PSINET) research group at the Universitat Oberta de Catalunya aims to foster the creation of virtual meeting spaces for the two groups (health professionals and users of health services). Establishing digital platforms of health services for the citizens of the 21st century first requires understanding the reality of the different groups involved in the relationship between health and the network. The objective of this study focuses on the first group and on discovering what exists about health on the Internet. To that end, following an exhaustive Internet search methodology, health websites in Catalan and Spanish were collected, and a textual data analysis was carried out on the information contained in the Catalan-language websites. This analysis made it possible to identify and describe the prototypical health website found on the network at the time of the study.
Abstract:
TEXTABLE is a new open-source visual programming tool for the analysis of textual data. The implications of the software's design are discussed from the point of view of interoperability and flexibility, as is the question of its suitability for teaching use. A brief introduction to the principles of visual programming for textual data analysis is also provided.
Abstract:
In this thesis we study the field of opinion mining by giving a comprehensive review of the available research on this topic. Using this knowledge, we also present a case study of a multilevel opinion mining system for a student organization's sales management system. We describe the field of opinion mining by discussing its historical roots, its motivations and applications, as well as the different scientific approaches that have been used to solve the challenging problem of mining opinions. To deal with this huge subfield of natural language processing, we first give an abstraction of the problem of opinion mining and describe the theoretical frameworks that are available for dealing with appraisal language. Then we discuss the relation between opinion mining and computational linguistics, which provides a crucial pre-processing step for the accuracy of the subsequent steps of opinion mining. The second part of our thesis deals with the semantics of opinions, where we describe the different ways used to collect lists of opinion words as well as the methods and techniques available for extracting knowledge from opinions present in unstructured textual data. In the part about collecting lists of opinion words we describe manual, semi-manual and automatic ways to do so and give a review of the available lists that are used as gold standards in opinion mining research. For the methods and techniques of opinion mining we divide the task into three levels: the document, sentence and feature levels. The techniques presented at the document and sentence levels are divided into supervised and unsupervised approaches that are used to determine the subjectivity and polarity of texts and sentences at these levels of analysis. At the feature level we give a description of the techniques available for finding the opinion targets, the polarity of the opinions about these opinion targets and the opinion holders.
Also at the feature level we discuss the various ways to summarize and visualize the results of this level of analysis. In the third part of our thesis we present a case study of a sales management system that uses free-form text and that can benefit from an opinion mining system. Using the knowledge gathered in the review of this field we provide a theoretical multi-level opinion mining system (MLOM) that can perform most of the tasks needed from an opinion mining system. Based on the previous research we suggest that such a system could support many of the laborious market research tasks done by the sales force that uses this sales management system, improving their insight into their partners and thereby increasing the quality of their sales services and their overall results.
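The sentence and document levels described in this abstract can be illustrated with a minimal, unsupervised lexicon-based sketch. It is not the MLOM system from the thesis: the word list `LEXICON`, the negation set `NEGATIONS` and both function names are hypothetical and chosen only for illustration.

```python
import re

# Tiny hypothetical opinion-word list; real systems rely on much larger lexicons.
LEXICON = {"good": 1, "great": 1, "helpful": 1, "bad": -1, "slow": -1, "poor": -1}
# Hypothetical negation words; a negation flips the next opinion word found.
NEGATIONS = {"not", "never", "no"}


def sentence_polarity(sentence):
    """Sum the polarity of the opinion words in one sentence."""
    score = 0
    negate = False
    for word in re.findall(r"[a-z']+", sentence.lower()):
        if word in NEGATIONS:
            negate = True
            continue
        if word in LEXICON:
            score += -LEXICON[word] if negate else LEXICON[word]
            negate = False  # the negation has been consumed
    return score


def document_polarity(text):
    """Split a text into sentences and add up their polarity scores."""
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    return sum(sentence_polarity(s) for s in sentences)
```

A positive score suggests a favourable text, a negative score an unfavourable one, and zero either neutrality or an absence of known opinion words. Feature-level analysis (opinion targets and holders) would need considerably more machinery than this sketch shows.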
Abstract:
Introduction. This study presents the results of implementing the portfolio over four consecutive years. The plan includes three phases (initiation, development and consolidation). The sample comprises 480 students in the first year of nursing at the University of Girona. The objective is to evaluate the effectiveness of the instrument and to achieve its construction through a self-regulated process. Subjects and methods. The proposed methodology is based on sequential triangulation between methods: two research strategies, quantitative and qualitative, are used to study the same empirical unit. Study 1: quantitative, descriptive, longitudinal and prospective. Paired data for continuous variables that follow a normal distribution are analysed with the Student-Fisher t-test. The correlation between two numerical variables is assessed with the Pearson correlation coefficient. Study 2: qualitative, using discussion groups and topics. Atlas.ti is used for the textual data analysis. Results. The final score of students who prepare the portfolio is higher (7.78) than that of students who do not (7) (p ≤ 0.001). A significant correlation exists between the portfolio score and the final score (p ≤ 0.001). The trend study shows a greater sensitivity of the assessment instrument. Conclusion. The final design of the portfolio is mixed and flexible, encourages student reflection and strengthens reflection on the continuum of learning.
Abstract:
Most frequently reported Chinese renal biopsy data have originated from southeastern China. The present study analyzed the renal biopsy data from northeastern China. The records of 1550 consecutive native patients who were diagnosed with primary glomerular diseases (PGD) after renal biopsy at our hospital during 2005-2009 were used. These patients were divided into four age groups for stratified analysis: <15, 15-44, 45-59, and ≥60 years old. Among PGD, minimal change disease (MCD) was the most common histologically diagnosed disease (30.7%), followed by IgA nephropathy (IgAN), mesangial proliferative glomerulonephritis (MsPGN), membranous nephropathy (MN), membranoproliferative glomerulonephritis (MPGN), focal segmental glomerulosclerosis (FSGS), and endocapillary proliferative glomerulonephritis (EnPGN). MCD was the disease most frequently observed (43.7%) in the <15-year-old group. MsPGN was the most common disease in the elderly group (38.1%). MsPGN was more prevalent in females (27.8%), whereas MCD was more prevalent in males (35.3%). Primary glomerular diseases constituted the most commonly encountered group of diseases with a high prevalence of MCD, which predominantly affected males and young adults. The prevalence of MCD was high in northeastern China. Further study is necessary to expand the epidemiologic data available for renal disease in China.
Abstract:
Finnish companies cross listing in the United States is an exceptional phenomenon. This study examines the cross listing decision, cross listing choice and cross listing process with associated challenges and critical factors. The aim is to create an in-depth understanding of the cross listing process and the required financial information. Based on that, the aim is to establish the process phases together with the challenges and critical factors that ought to be considered before establishing the process, and re-evaluated and further considered at points in time during the process. The empirical part of this study is conducted as a qualitative study. The research data was collected through two approaches: the interview approach and the textual data approach. The interviews were conducted with Finnish practitioners in the field of accounting and finance. The textual data came from publicly available publications on this phenomenon by two BIG5 accounting companies worldwide. The results of this study demonstrate that the benefits of cross listing in the U.S. are better growth opportunities, a reduced cost of capital and the production of higher-quality financial information. In the decision-making process companies should assess whether the benefits exceed the increased costs, the pressure for performance, the uncertainty of market recognition and the requirements placed on management. The exchange listing is seen as the most favourable cross listing choice for Finnish companies. The establishment of processes for producing reliable, transparent and timely financial information was seen as both highly critical and very challenging. The critical success factors relating to the cross listing phases are assessment and planning as well as the right mix of experience and expertise. Timing plays an important role in the process. The results mainly corroborate the literature concerning the cross listing decision and choice.
This study contributes to the literature on the cross listing process by offering a useful model for the phases of the cross listing process.
Abstract:
This thesis research was a qualitative case study of a single class of Interdisciplinary Studies: Introduction to Engineering taught in a secondary school. The study endeavoured to explore students' experiences in and perceptions of the course, and to investigate the viability of engineering as an interdisciplinary theme at the secondary school level. Data were collected in the form of student questionnaires, the researcher's observations and reflections, and artefacts representative of students' work. Data analysis was performed by coding textual data and classifying text segments into common themes. The themes that emerged from the data were aligned with facets of interdisciplinary study, including making connections, project-based learning, and student engagement and affective outcomes. The findings of the study showed that students were positive about their experiences in the course, and enjoyed its project-driven nature. Content from mathematics, physics, and technological design was easily integrated under the umbrella of engineering. Students felt that the opportunities to develop problem-solving and teamwork skills were two of the most important aspects of the course and could be relevant not only for engineering, but for other disciplines or their day-to-day lives after secondary school. The study concluded that engineering education in secondary school can be a worthwhile experience for a variety of students and not just those intending postsecondary study in engineering. This has implications for the inclusion of engineering in the secondary school curriculum and can inform the practice of curriculum planners at the school, school board, and provincial levels. Suggested directions for further research include classroom-based action research in the areas of technological education, engineering education in secondary school, and interdisciplinary education.
Abstract:
The graphics were produced with the Alceste software.
Abstract:
Situated in the fields of Computer-Assisted Reading and Analysis of Texts (LATAO), Electronic Document Management (GÉD), information visualization and, in part, anthropology, this exploratory research tests a descriptive text-mining methodology for thematically mapping a corpus of anthropological texts. More precisely, we wish to test the hierarchical agglomerative clustering method (CHA) for extracting and analysing the themes found in abstracts of master's theses and doctoral dissertations awarded from 1985 to 2009 (1,240 abstracts) by the anthropology departments of the Université de Montréal and the Université Laval, as well as the history department of the Université Laval (for the archaeological and ethnological abstracts). In the first part of the thesis, we present our theoretical framework: we explain what text mining is, its origins, its applications and its methodological steps, and we conclude with a review of the main publications. The second part is devoted to the methodological framework, where we cover the different steps through which the project was conducted: data collection, linguistic filtering and automatic classification, to name only a few. Finally, in the last part, we present the results of our research, dwelling in particular on two experiments. We also discuss thematic navigation and conceptual approaches to thematization, for example, in anthropology, the culture/biology dichotomy. We end with the limits of this project and avenues of interest for future research.
Abstract:
Aircraft Maintenance, Repair and Overhaul (MRO) feedback commonly includes an engineer’s complex text-based inspection report. Capturing and normalizing the content of these textual descriptions is vital to cost and quality benchmarking, and provides information to facilitate continuous improvement of MRO process and analytics. As data analysis and mining tools require highly normalized data, raw textual data is inadequate. This paper offers a text-mining solution to efficiently analyse bulk textual feedback data. Despite replacement of the same parts and/or sub-parts, the actual service cost for the same repair is often distinctly different from that of similar previous jobs. Regular expression algorithms were combined with an aircraft MRO glossary dictionary in order to help provide additional information concerning the reason for cost variation. Professional terms and conventions were included within the dictionary to avoid ambiguity and improve the outcome. Testing results show that most descriptive inspection reports can be appropriately interpreted, allowing extraction of highly normalized data. This additional normalized data strongly supports data analysis and data mining, whilst also increasing the accuracy of future quotation costing. This solution has been effectively used by a large aircraft MRO agency with positive results.
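The combination of regular expressions and a glossary dictionary mentioned in this abstract could look roughly like the sketch below. It is only an illustration under stated assumptions: the glossary entries, the part-number pattern and the function name `normalize_report` are hypothetical and are not taken from the paper.

```python
import re

# Hypothetical glossary: shorthand seen in inspection reports -> normalized term.
GLOSSARY = {
    "corr": "corrosion",
    "crk": "crack",
    "repl": "replaced",
    "insp": "inspected",
}

# Hypothetical part-number convention: "PN" or "P/N", optional colon or spaces,
# then an identifier that starts with a digit (e.g. "PN 12345-6").
PART_NUMBER = re.compile(r"\bP/?N[:\s]*(\d[A-Z0-9-]*)", re.IGNORECASE)


def normalize_report(text):
    """Extract a normalized record from one free-text inspection report."""
    tokens = re.findall(r"[a-z]+", text.lower())
    # Keep each glossary term once, in a stable (sorted) order.
    terms = sorted({GLOSSARY[t] for t in tokens if t in GLOSSARY})
    parts = [p.upper() for p in PART_NUMBER.findall(text)]
    return {"part_numbers": parts, "terms": terms}
```

For instance, `normalize_report("Insp. blade, found crk and corr. Repl PN 12345-6.")` yields a record with one part number and four normalized terms, which is the kind of structured row that downstream analysis tools can consume.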
Abstract:
Dominant paradigms of causal explanation for why and how Western liberal-democracies go to war in the post-Cold War era remain versions of the 'liberal peace' or 'democratic peace' thesis. Yet such explanations have been shown to rest upon deeply problematic epistemological and methodological assumptions. Of equal importance, however, is the failure of these dominant paradigms to account for the 'neoliberal revolution' that has gripped Western liberal-democracies since the 1970s. The transition from liberalism to neoliberalism remains neglected in analyses of the contemporary Western security constellation. Arguing that neoliberalism can be understood simultaneously through the Marxian concept of ideology and the Foucauldian concept of governmentality – that is, as a complementary set of 'ways of seeing' and 'ways of being' – the thesis goes on to analyse British security in policy and practice, considering it as an instantiation of a wider neoliberal way of war. In so doing, the thesis draws upon, but also challenges and develops, established critical discourse analytic methods, incorporating within its purview not only the textual data that is usually considered by discourse analysts, but also material practices of security. This analysis finds that contemporary British security policy is predicated on a neoliberal social ontology, morphology and morality – an ideology or 'way of seeing' – focused on the notion of a globalised 'network-market', and is aimed at rendering circulations through this network-market amenable to neoliberal techniques of government. It is further argued that security practices shaped by this ideology imperfectly and unevenly achieve the realisation of neoliberal 'ways of being' – especially modes of governing self and other or the 'conduct of conduct' – and the re-articulation of subjectivities in line with neoliberal principles of individualism, risk, responsibility and flexibility. 
The policy and practice of contemporary British 'security' is thus recontextualised as a component of a broader 'neoliberal way of war'.
Abstract:
We introduce a problem called maximum common characters in blocks (MCCB), which arises in applications of approximate string comparison, particularly in the unification of possibly erroneous textual data coming from different sources. We show that this problem is NP-complete, but can nevertheless be solved satisfactorily using integer linear programming for instances of practical interest. Two integer linear formulations are proposed and compared in terms of their linear relaxations. We also compare the results of the approximate matching with other known measures such as the Levenshtein (edit) distance. (C) 2008 Elsevier B.V. All rights reserved.
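For reference, the Levenshtein (edit) distance that this abstract uses as a comparison measure can be computed with the standard dynamic programme sketched below. This is the well-known baseline only, not the MCCB formulation or the integer linear programs proposed in the paper; the function name is an arbitrary choice.

```python
def levenshtein(a, b):
    """Minimum number of single-character insertions, deletions and
    substitutions needed to turn string a into string b."""
    # prev[j] holds the distance between the processed prefix of a and b[:j].
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]  # turning a[:i] into the empty string takes i deletions
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(
                prev[j] + 1,         # delete ca
                curr[j - 1] + 1,     # insert cb
                prev[j - 1] + cost,  # substitute ca by cb, or keep a match
            ))
        prev = curr
    return prev[-1]
```

The routine keeps only two rows of the table, so it uses memory proportional to the length of `b` and time proportional to the product of the two lengths.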