11 resultados para Image recognition

em Helda - Digital Repository of University of Helsinki


Relevância:

20.00% 20.00%

Publicador:

Resumo:

Euphrase Kezilahabi on tansanialainen kirjailija, joka ensimmäisenä julkaisi swahilinkielisen vapaalla mitalla kirjoitetun runokokoelman. Perinteisessä swahilirunoudessa tiukat muotosäännöt ovat tärkeitä, ja teos synnytti kiivasta keskustelua. Runoteokset Kichomi ( Viilto , Kipu , 1974) ja Karibu Ndani ( Tervetuloa sisään , 1988) sekä Kezilahabin muu tuotanto voidaan nähdä uuden sukupolven taiteena. Kezilahabi on arvostettu runoilija, mutta hänen runojaan ei aiemmin ole käännetty englanniksi (yksittäisiä säkeitä lukuunottamatta), eikä juurikaan tutkittu yksityiskohtaisesti. Yleiskuvaan pyrkivissä lausunnoissa Kezilahabin runouden on hyvin usein määritelty olevan poliittista. Monet Kezilahabin runoista ottavatkin kantaa yhteiskunnallisiin kysymyksiin, mutta niiden pohdinta on kuitenkin runoissa vain yksi taso. Sen lisäksi Kezilahabin lyriikassa on paljon muuta ennen kartoittamatonta tämä tutkimus keskittyy veden kuvaan (the image of water). Kezilahabi vietti lapsuutensa saarella Victoria-järven keskellä, ja hänen vesikuvastonsa on rikasta. Tutkimuskysymyksenä on, mitä veden kuva runoteoksissa Kichomi ja Karibu Ndani esittää. Runojen analysoinnissa ja tulkinnassa on tarkasteltu myös sitä, miten äänteellinen taso osallistuu kuvien luomiseen. Tutkimuksen määritelmä kuvasta pohjautuu osittain Hugh Kennerin näkemykseen, jonka mukaan oleellista kuvassa on kirjaimellinen taso. Kennerin lähtökohtaan on yhdistetty John Shoptawin teoriaa, joka korostaa runon äänteellisen puolen tärkeyttä merkityksen muodostumisessa. Foneemien analyysissä vaikutteena on ollut Reuven Tsurin teoria. Analyysiosio osoittaa, että veden kuva edustaa ja käsittelee teoksissa lukuisia teemoja: elämää, kuolemaa, fyysistä vetovoimaa, runoutta, mielikuvitusta ja (ali)tajuntaa sekä moraalia. Veden kuvan tutkimuksen pohjalta on nähtävissä, että Kezilahabin filosofia asettuu elävä/kuollut- ja elämä/kuolema dikotomioiden ulkopuolelle.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

This folk linguistic and human geographic study deals with dialect awareness, dialect use and place attachment. The study discusses theoretical and methodological issues current in sociolinguistics suggesting that the study of attitudes should be regarded as a core area in the study of variation and change. Furthermore, it is suggested that instead of putting effort into improving mental mapping methodology (adopted into folk linguistics from behavioural geography of the 1960 s), the more up-to-date thinking of space in geography should be adopted. The region and the dialect are treated as perceptual constructs in the study. The dialect perceptions of high school seniors in the Finnish Tornio Valley are examined trough a triangulation method involving a questionnaire, interviews and dialect recognition test as the research methods. The h in non-initial syllables (e.g. lähethä(ä)n, saunhaan ~ sauhnaan let s go into sauna ) turns out, expectedly, as the most salient feature in the dialect awareness of the locals and in terms of local identity construction. This feature is no longer heard in most of the present dialects of Finnish but is still thriving in the Tornio Valley in the cross-border dialect area. The metathetic variant (saunhaan > sauhnaan into sauna , käymhään > käyhmään to go ) is a characteristic feature of the Tornio Valley dialect. However, individual differences have long been found in the use of the h. This study challenges the essentialist variationist view of social categories (gender) by analysing variation from a quantitative but emic and human geographic point of view. The study shows that the variation of the h is statistically significantly patterned in terms of the degree of feeling of insideness vs. outsideness. New light is shed on the gender differences found in earlier sociolinguistic studies: differences in dialect use between and inside gender groups are illuminated by the fact that, in this case, it is young women who are generally less attached to the local community than young men, but this does not hold for all the individuals. The ideological motivation for preservation of the h seems to be based on the imagined community of Tornio Valley covering both the Swedish and the Finnish valley area. The general image of the dialect area and it s speakers, the shared cognitive dialect boundaries of the locals and the particularly deep level of awaress of the linguistic variation of the h are notable resources of the Tornio valley identity. Hyperdialectic forms analogical to the most frequently attested metathetic forms are found in the interview data, predicting that in this dialect the h will be maintained also in the future.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The earliest stages of human cortical visual processing can be conceived as extraction of local stimulus features. However, more complex visual functions, such as object recognition, require integration of multiple features. Recently, neural processes underlying feature integration in the visual system have been under intensive study. A specialized mid-level stage preceding the object recognition stage has been proposed to account for the processing of contours, surfaces and shapes as well as configuration. This thesis consists of four experimental, psychophysical studies on human visual feature integration. In two studies, classification image a recently developed psychophysical reverse correlation method was used. In this method visual noise is added to near-threshold stimuli. By investigating the relationship between random features in the noise and observer s perceptual decision in each trial, it is possible to estimate what features of the stimuli are critical for the task. The method allows visualizing the critical features that are used in a psychophysical task directly as a spatial correlation map, yielding an effective "behavioral receptive field". Visual context is known to modulate the perception of stimulus features. Some of these interactions are quite complex, and it is not known whether they reflect early or late stages of perceptual processing. The first study investigated the mechanisms of collinear facilitation, where nearby collinear Gabor flankers increase the detectability of a central Gabor. The behavioral receptive field of the mechanism mediating the detection of the central Gabor stimulus was measured by the classification image method. The results show that collinear flankers increase the extent of the behavioral receptive field for the central Gabor, in the direction of the flankers. The increased sensitivity at the ends of the receptive field suggests a low-level explanation for the facilitation. The second study investigated how visual features are integrated into percepts of surface brightness. A novel variant of the classification image method with brightness matching task was used. Many theories assume that perceived brightness is based on the analysis of luminance border features. Here, for the first time this assumption was directly tested. The classification images show that the perceived brightness of both an illusory Craik-O Brien-Cornsweet stimulus and a real uniform step stimulus depends solely on the border. Moreover, the spatial tuning of the features remains almost constant when the stimulus size is changed, suggesting that brightness perception is based on the output of a single spatial frequency channel. The third and fourth studies investigated global form integration in random-dot Glass patterns. In these patterns, a global form can be immediately perceived, if even a small proportion of random dots are paired to dipoles according to a geometrical rule. In the third study the discrimination of orientation structure in highly coherent concentric and Cartesian (straight) Glass patterns was measured. The results showed that the global form was more efficiently discriminated in concentric patterns. The fourth study investigated how form detectability depends on the global regularity of the Glass pattern. The local structure was either Cartesian or curved. It was shown that randomizing the local orientation deteriorated the performance only with the curved pattern. The results give support for the idea that curved and Cartesian patterns are processed in at least partially separate neural systems.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

In visual object detection and recognition, classifiers have two interesting characteristics: accuracy and speed. Accuracy depends on the complexity of the image features and classifier decision surfaces. Speed depends on the hardware and the computational effort required to use the features and decision surfaces. When attempts to increase accuracy lead to increases in complexity and effort, it is necessary to ask how much are we willing to pay for increased accuracy. For example, if increased computational effort implies quickly diminishing returns in accuracy, then those designing inexpensive surveillance applications cannot aim for maximum accuracy at any cost. It becomes necessary to find trade-offs between accuracy and effort. We study efficient classification of images depicting real-world objects and scenes. Classification is efficient when a classifier can be controlled so that the desired trade-off between accuracy and effort (speed) is achieved and unnecessary computations are avoided on a per input basis. A framework is proposed for understanding and modeling efficient classification of images. Classification is modeled as a tree-like process. In designing the framework, it is important to recognize what is essential and to avoid structures that are narrow in applicability. Earlier frameworks are lacking in this regard. The overall contribution is two-fold. First, the framework is presented, subjected to experiments, and shown to be satisfactory. Second, certain unconventional approaches are experimented with. This allows the separation of the essential from the conventional. To determine if the framework is satisfactory, three categories of questions are identified: trade-off optimization, classifier tree organization, and rules for delegation and confidence modeling. Questions and problems related to each category are addressed and empirical results are presented. For example, related to trade-off optimization, we address the problem of computational bottlenecks that limit the range of trade-offs. We also ask if accuracy versus effort trade-offs can be controlled after training. For another example, regarding classifier tree organization, we first consider the task of organizing a tree in a problem-specific manner. We then ask if problem-specific organization is necessary.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The usual task in music information retrieval (MIR) is to find occurrences of a monophonic query pattern within a music database, which can contain both monophonic and polyphonic content. The so-called query-by-humming systems are a famous instance of content-based MIR. In such a system, the user's hummed query is converted into symbolic form to perform search operations in a similarly encoded database. The symbolic representation (e.g., textual, MIDI or vector data) is typically a quantized and simplified version of the sampled audio data, yielding to faster search algorithms and space requirements that can be met in real-life situations. In this thesis, we investigate geometric approaches to MIR. We first study some musicological properties often needed in MIR algorithms, and then give a literature review on traditional (e.g., string-matching-based) MIR algorithms and novel techniques based on geometry. We also introduce some concepts from digital image processing, namely the mathematical morphology, which we will use to develop and implement four algorithms for geometric music retrieval. The symbolic representation in the case of our algorithms is a binary 2-D image. We use various morphological pre- and post-processing operations on the query and the database images to perform template matching / pattern recognition for the images. The algorithms are basically extensions to classic image correlation and hit-or-miss transformation techniques used widely in template matching applications. They aim to be a future extension to the retrieval engine of C-BRAHMS, which is a research project of the Department of Computer Science at University of Helsinki.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The study examines various uses of computer technology in acquisition of information for visually impaired people. For this study 29 visually impaired persons took part in a survey about their experiences concerning acquisition of infomation and use of computers, especially with a screen magnification program, a speech synthesizer and a braille display. According to the responses, the evolution of computer technology offers an important possibility for visually impaired people to cope with everyday activities and interacting with the environment. Nevertheless, the functionality of assistive technology needs further development to become more usable and versatile. Since the challenges of independent observation of environment were emphasized in the survey, the study led into developing a portable text vision system called Tekstinäkö. Contrary to typical stand-alone applications, Tekstinäkö system was constructed by combining devices and programs that are readily available on consumer market. As the system operates, pictures are taken by a digital camera and instantly transmitted to a text recognition program in a laptop computer that talks out loud the text using a speech synthesizer. Visually impaired test users described that even unsure interpretations of the texts in the environment given by Tekstinäkö system are at least a welcome addition to complete perception of the environment. It became clear that even with a modest development work it is possible to bring new, useful and valuable methods to everyday life of disabled people. Unconventional production process of the system appeared to be efficient as well. Achieved results and the proposed working model offer one suggestion for giving enough attention to easily overlooked needs of the people with special abilities. ACM Computing Classification System (1998): K.4.2 Social Issues: Assistive technologies for persons with disabilities I.4.9 Image processing and computer vision: Applications Keywords: Visually impaired, computer-assisted, information, acquisition, assistive technology, computer, screen magnification program, speech synthesizer, braille display, survey, testing, text recognition, camera, text, perception, picture, environment, trasportation, guidance, independence, vision, disabled, blind, speech, synthesizer, braille, software engineering, programming, program, system, freeware, shareware, open source, Tekstinäkö, text vision, TopOCR, Autohotkey, computer engineering, computer science

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The thesis consists of five international congress papers and a summary with an introduction. The overarching aim of the studies and the summary is to examine the inner coherency of the theological and anthropological thinking of Gregory of Nyssa (331-395). To the issue is applied an "apophatic approach" with a "Christological focus". It is suggested that the coherency is to be found from the Christological concept of unity between "true God" and "true man" in the one person of Jesus Christ. Gregory is among the first to make a full recognition of two natures of Christ, and to use this recognition systematically in his writings. The aim of the studies is pursued by the method of "identification", a combination of the modern critical "problematic method" and Gregory's own aphairetic method of "following" (akolouthia). The preoccupation with issues relating to the so-called Hellenization of Christianity in the patristic era was strong in the twentieth-century Gregory scholarship. The most discussed questions have been the Greek influence in his thought and his philosophical sources. In the five articles of the thesis it is examined how Gregory's thinking stands in its own right. The manifestly apophatic character of his theological thinking is made a part of the method of examining his thought according to the principles of his own method of following. The basic issue concerning the relation of theology and anthropology is discussed in the contexts of his central Trinitarian, anhtropological, Christological and eschatological sources. In the summary the Christocentric integration of Gregory's thinking is discussed also in relation to the issue of the alledged Hellenization. The main conclusion of the thesis concerns the concept of theology in Gregory. It is not indebted to the classical concept of theology as metaphysics or human speculation of God. Instead, it is founded to the traditional Judeo-Christian idea of God who speaks with his people face to face. In Gregory, theologia connotes the oikonomia of God's self-revelation. It may be regarded as the state of constant expression of love between the Creator and his created image. In theology, the human person becomes an image of the Word by which the Father expresses his love to "man" whom he loves as his own Son. Eventually the whole humankind, as one, gives the divine Word a physical - audible and sensible - Body. Humankind then becomes what theology is. The whole humanity expresses divine love by manifesting Christ in words and deeds, singing in one voice to the glory of the Father, the Son and the Holy Spirit.