26 resultados para Online handwriting recognition
em Helda - Digital Repository of University of Helsinki
Resumo:
Tämä pro gradu -tutkielma vertailee korpuksen avulla erisnimien kvantitatiivista jakautumista luokkiin kahdessa saksalaisessa verkkolehdessä. Työn tavoitteena on selvittää, kuinka erisnimiä voidaan luokitella ja mitä eroja niiden avulla on havaittavissa lehtien raportoinnissa. Laajempana kehyksenä toimii kysymys siitä, voidaanko erisnimiä hyödyntäen hahmottaa lehtien sisältöjä. Korpus on kerätty Frankfurter Allgemeine Zeitungin ja Süddeutsche Zeitungin verkkolehtien http: //www.faz.net (FAZ) ja http://www.sueddeutsche.de (SZ) artikkeleista ajalta 2.11.2004-8.11.2004. Valitut sivustot edustavat Saksan arvostetuimpien päivittäisten, koko maan kattavien sanomaleh- tien verkkojulkaisuja. Näistä FAZ:ia pidetään konservatiivisena ja SZ:ia liberaalina lehtenä. Kumpikin korpus käsittelee USA:n presidentinvaaleja syksyllä 2004 ja sisältää hieman alle 30 000 sanaa noin 40 lehtiartikkelista. Aihesidonnaisen korpuksen valinta perustuu erityisesti siihen, että tutkimuksen päämääränä on saada erisnimien avulla selville, miltä osin FAZ ja SZ eroavat toisistaan käsitellessään samaa aihetta. Teoriaosassa käydään läpi saksalaisten verkkolehtien taustaa, työhön liittyviä tekstilingvistisiä teo- rioita sekä erisnimien erikoispiirteitä. Siinä käsitellään myös kolmea aiempaa, saksankielisen eris- nimitutkimuksen luokittelua ja yhtä englanninkielistä, kieliteknologian luokittelua. Näissä havaitut puutteet motivoivat yhdistelemään ja muuttamaan olemassa olevia luokitteluja tätä työtä varten. Uusi luokittelu sisältää neljä yläluokkaa (olentojen, maantieteelliset, instituutioden ja asioiden ni- met), jotka kaikki kattavat kahdesta yhdeksään alaluokkaa. Kummankin korpuksen erisnimet luo- kitellaan tämän perusteella. Kvantitatiivinen analyysi keskittyy ylä- ja alaluokkien vertailuun lehtien välillä. Lisäksi se kattaa sekä kummankin aineiston että pääluokkien frekventimpien sanojen tarkastelun. Vaikka FAZ ja SZ käyttivätkin pääosin samoja erisnimiä raportoinnissaan, voidaan lehtien välillä osoittaa selkeitä eroja alaluokkien kohdalla ja vähäisiä eroja erisnimien jakautumisessa yläluokkiin. chi2 -testin näytti kuitenkin, että erisnimien jakautuminen yläluokkiin on lehtisidonnaista. Siksi voidaan väittää, että muun muassa valittu media vaikuttaa erisnimivalintoihin. Erisnimien frekvenssit antavat ymmärtää, että SZ raportoisi monipuolisemmin kuin FAZ, joka käyttää erisnimiä keskitetymmin. SZ:in aineiston erisnimiä yhdistää eurooppalainen näkökulma vaaleihin, kun taas FAZ pyrkii tuomaan esille tapahtumia USA:n eri osavaltioissa. Niin lehdissä mainitut henkilöiden kuin instituutioden nimet tukevat tätä väitetettä. SZ korostaa maantieteellisesti kaupunkien merkitystä, FAZ osavaltioiden. Saadut tulokset osoittavat, että tämänkaltaisen erisnimitutkimuksen soveltaminen lehtiteksteihin on mahdollista. Luokitellut erisnimet heijastavat osittain käsiteltyjen aineistojen sisältöä ja paljastavat raportoinnin painopisteistä.
Resumo:
The paradigm of computational vision hypothesizes that any visual function -- such as the recognition of your grandparent -- can be replicated by computational processing of the visual input. What are these computations that the brain performs? What should or could they be? Working on the latter question, this dissertation takes the statistical approach, where the suitable computations are attempted to be learned from the natural visual data itself. In particular, we empirically study the computational processing that emerges from the statistical properties of the visual world and the constraints and objectives specified for the learning process. This thesis consists of an introduction and 7 peer-reviewed publications, where the purpose of the introduction is to illustrate the area of study to a reader who is not familiar with computational vision research. In the scope of the introduction, we will briefly overview the primary challenges to visual processing, as well as recall some of the current opinions on visual processing in the early visual systems of animals. Next, we describe the methodology we have used in our research, and discuss the presented results. We have included some additional remarks, speculations and conclusions to this discussion that were not featured in the original publications. We present the following results in the publications of this thesis. First, we empirically demonstrate that luminance and contrast are strongly dependent in natural images, contradicting previous theories suggesting that luminance and contrast were processed separately in natural systems due to their independence in the visual data. Second, we show that simple cell -like receptive fields of the primary visual cortex can be learned in the nonlinear contrast domain by maximization of independence. Further, we provide first-time reports of the emergence of conjunctive (corner-detecting) and subtractive (opponent orientation) processing due to nonlinear projection pursuit with simple objective functions related to sparseness and response energy optimization. Then, we show that attempting to extract independent components of nonlinear histogram statistics of a biologically plausible representation leads to projection directions that appear to differentiate between visual contexts. Such processing might be applicable for priming, \ie the selection and tuning of later visual processing. We continue by showing that a different kind of thresholded low-frequency priming can be learned and used to make object detection faster with little loss in accuracy. Finally, we show that in a computational object detection setting, nonlinearly gain-controlled visual features of medium complexity can be acquired sequentially as images are encountered and discarded. We present two online algorithms to perform this feature selection, and propose the idea that for artificial systems, some processing mechanisms could be selectable from the environment without optimizing the mechanisms themselves. In summary, this thesis explores learning visual processing on several levels. The learning can be understood as interplay of input data, model structures, learning objectives, and estimation algorithms. The presented work adds to the growing body of evidence showing that statistical methods can be used to acquire intuitively meaningful visual processing mechanisms. The work also presents some predictions and ideas regarding biological visual processing.
Resumo:
Online content services can greatly benefit from personalisation features that enable delivery of content that is suited to each user's specific interests. This thesis presents a system that applies text analysis and user modeling techniques in an online news service for the purpose of personalisation and user interest analysis. The system creates a detailed thematic profile for each content item and observes user's actions towards content items to learn user's preferences. A handcrafted taxonomy of concepts, or ontology, is used in profile formation to extract relevant concepts from the text. User preference learning is automatic and there is no need for explicit preference settings or ratings from the user. Learned user profiles are segmented into interest groups using clustering techniques with the objective of providing a source of information for the service provider. Some theoretical background for chosen techniques is presented while the main focus is in finding practical solutions to some of the current information needs, which are not optimally served with traditional techniques.
Resumo:
This study is part of an ongoing collaborative bipolar research project, the Jorvi Bipolar Study (JoBS). The JoBS is run by the Department of Mental Health and Alcohol Research of the National Public Health Institute, Helsinki, and the Department of Psychiatry, Jorvi Hospital, Helsinki University Central Hospital (HUCH), Espoo, Finland. It is a prospective, naturalistic cohort study of secondary level care psychiatric in- and outpatients with a new episode of bipolar disorder (BD). The second report also included 269 major depressive disorder (MDD) patients from the Vantaa Depression Study (VDS). The VDS was carried out in collaboration with the Department of Psychiatry of the Peijas Medical Care District. Using the Mood Disorder Questionnaire (MDQ), all in- and outpatients at the Department of Psychiatry at Jorvi Hospital who currently had a possible new phase of DSM-IV BD were sought. Altogether, 1630 psychiatric patients were screened, and 490 were interviewed using a semistructured interview (SCID-I/P). The patients included in the cohort (n=191) had at intake a current phase of BD. The patients were evaluated at intake and at 6- and 18-month interviews. Based on this study, BD is poorly recognized even in psychiatric settings. Of the BD patients with acute worsening of illness, 39% had never been correctly diagnosed. The classic presentations of BD with hospitalizations, manic episodes, and psychotic symptoms lead clinicians to correct diagnosis of BD I in psychiatric care. Time of follow-up elapsed in psychiatric care, but none of the clinical features, seemed to explain correct diagnosis of BD II, suggesting reliance on cross- sectional presentation of illness. Even though BD II was clearly less often correctly diagnosed than BD I, few other differences between the two types of BD were detected. BD I and II patients appeared to differ little in terms of clinical picture or comorbidity, and the prevalence of psychiatric comorbidity was strongly related to the current illness phase in both types. At the same time, the difference in outcome was clear. BD II patients spent about 40% more time depressed than BD I patients. Patterns of psychiatric comorbidity of BD and MDD differed somewhat qualitatively. Overall, MDD patients were likely to have more anxiety disorders and cluster A personality disorders, and bipolar patients to have more cluster B personality disorders. The adverse consequences of missing or delayed diagnosis are potentially serious. Thus, these findings strongly support the value of screening for BD in psychiatric settings, especially among the major depressive patients. Nevertheless, the diagnosis must be based on a clinical interview and follow-up of mood. Comorbidity, present in 59% of bipolar patients in a current phase, needs concomitant evaluation, follow-up, and treatment. To improve outcome in BD, treatment of bipolar depression is a major challenge for clinicians.
Resumo:
Marja Heinonen s dissertation Verkkomedian käyttö ja tutkiminen. Iltalehti Online 1995-2001 describes the usage of new internet based news service Iltalehti Online during its first years of existence, 1995-2001. The study focuses on the content of the service and users attitudes towards the new media and its contents. Heinonen has also analyzed and described the research methods that can be used in the research of any new media phenomenon when there is no historical perspective to do the research. Heinonen has created a process model for the research of net medium, which is based on a multidimensional approach. She has chosen an iterative research method inspired by Sudweeks and Simoff s CEDA-methodology in which qualitative and quantitative methods take turns both creating results and new research questions. The dissertation discusses and describes the possibilities of combining several research methods in the study of online news media. On general level it discusses the methodological possibilities of researching a completely new media form when there is no historical perspective. The result of these discussions is in favour for the multidimensional methods. The empiric research was built around three cases of Iltalehti Online among its users: log analysis 1996-1999, interviews 1999 and clustering 2000-2001. Even though the results of different cases were somewhat conflicting here are the central results from the analysis of Iltalehti Online 1995-2001: - Reading was strongly determined by the gender. - The structure of Iltalehti Online guided the reading strongly. - People did not make a clear distinction in content between news and entertainment. - Users created new habits in their everyday life during the first years of using Iltalehti Online. These habits were categorized as follows: - break between everyday routines - established habit - new practice within the rhythm of the day - In the clustering of the users sports, culture and celebrities were the most distinguishing contents. Users did not move across these borders as much as within them. The dissertation gives contribution to the development of multidimensional research methods in the field of emerging phenomena in media field. It is also a unique description of a phase of development in media history through an unique research material. There is no such information (logs + demographics) available of any other Finnish online news media. Either from the first years or today.
Resumo:
Recent evidence from adult pronoun comprehension suggests that semantic factors such as verb transitivity affect referent salience and thereby anap- hora resolution. We tested whether the same semantic factors influence pronoun comprehension in young children. In a visual world study, 3-year- olds heard stories that began with a sentence containing either a high or a low transitivity verb. Looking behaviour to pictures depicting the subject and object of this sentence was recorded as children listened to a subsequent sentence containing a pronoun. Children showed a stronger preference to look to the subject as opposed to the object antecedent in the low transitivity condition. In addition there were general preferences (1) to look to the subject in both conditions and (2) to look more at both potential antecedents in the high transitivity condition. This suggests that children, like adults, are affected by semantic factors, specifically semantic prominence, when interpreting anaphoric pronouns.
Resumo:
The paper explores the effect of customer satisfaction with online supporting services on loyalty to providers of an offline core service. Supporting services are provided to customers before, during, or after the purchase of a tangible or intangible core product, and have the purpose of enhancing or facilitating the use of this product. The internet has the potential to dominate all other marketing channels when it comes to the interactive and personalised communication that is considered quintessential for supporting services. Our study shows that the quality of online supporting services powerfully affects satisfaction with the provider and customer loyalty through its effect on online value and enjoyment. Managerial implications are provided.