46 resultados para Visual Tracking
Resumo:
One of the greatest conundrums to the contemporary science is the relation between consciousness and brain activity, and one of the specifi c questions is how neural activity can generate vivid subjective experiences. Studies focusing on visual consciousness have become essential in solving the empirical questions of consciousness. Th e main aim of this thesis is to clarify the relation between visual consciousness and the neural and electrophysiological processes of the brain. By applying electroencephalography and functional magnetic resonance image-guided transcranial magnetic stimulation (TMS), we investigated the links between conscious perception and attention, the temporal evolution of visual consciousness during stimulus processing, the causal roles of primary visual cortex (V1), visual area 2 (V2) and lateral occipital cortex (LO) in the generation of visual consciousness and also the methodological issues concerning the accuracy of targeting TMS to V1. Th e results showed that the fi rst eff ects of visual consciousness on electrophysiological responses (about 140 ms aft er the stimulus-onset) appeared earlier than the eff ects of selective attention, and also in the unattended condition, suggesting that visual consciousness and selective attention are two independent phenomena which have distinct underlying neural mechanisms. In addition, while it is well known that V1 is necessary for visual awareness, the results of the present thesis suggest that also the abutting visual area V2 is a prerequisite for conscious perception. In our studies, the activation in V2 was necessary for the conscious perception of change in contrast for a shorter period of time than in the case of more detailed conscious perception. We also found that TMS in LO suppressed the conscious perception of object shape when TMS was delivered in two distinct time windows, the latter corresponding with the timing of the ERPs related to the conscious perception of coherent object shape. Th e result supports the view that LO is crucial in conscious perception of object coherency and is likely to be directly involved in the generation of visual consciousness. Furthermore, we found that visual sensations, or phosphenes, elicited by the TMS of V1 were brighter than identically induced phosphenes arising from V2. Th ese fi ndings demonstrate that V1 contributes more to the generation of the sensation of brightness than does V2. Th e results also suggest that top-down activation from V2 to V1 is probably associated with phosphene generation. The results of the methodological study imply that when a commonly used landmark (2 cm above the inion) is used in targeting TMS to V1, the TMS-induced electric fi eld is likely to be highest in dorsal V2. When V1 was targeted according to the individual retinotopic data, the electric fi eld was highest in V1 only in half of the participants. Th is result suggests that if the objective is to study the role of V1 with TMS methodology, at least functional maps of V1 and V2 should be applied with computational model of the TMS-induced electric fi eld in V1 and V2. Finally, the results of this thesis imply that diff erent features of attention contribute diff erently to visual consciousness, and thus, the theoretical model which is built up of the relationship between visual consciousness and attention should acknowledge these diff erences. Future studies should also explore the possibility that visual consciousness consists of several processing stages, each of which have their distinct underlying neural mechanisms.
Resumo:
Identification of low-dimensional structures and main sources of variation from multivariate data are fundamental tasks in data analysis. Many methods aimed at these tasks involve solution of an optimization problem. Thus, the objective of this thesis is to develop computationally efficient and theoretically justified methods for solving such problems. Most of the thesis is based on a statistical model, where ridges of the density estimated from the data are considered as relevant features. Finding ridges, that are generalized maxima, necessitates development of advanced optimization methods. An efficient and convergent trust region Newton method for projecting a point onto a ridge of the underlying density is developed for this purpose. The method is utilized in a differential equation-based approach for tracing ridges and computing projection coordinates along them. The density estimation is done nonparametrically by using Gaussian kernels. This allows application of ridge-based methods with only mild assumptions on the underlying structure of the data. The statistical model and the ridge finding methods are adapted to two different applications. The first one is extraction of curvilinear structures from noisy data mixed with background clutter. The second one is a novel nonlinear generalization of principal component analysis (PCA) and its extension to time series data. The methods have a wide range of potential applications, where most of the earlier approaches are inadequate. Examples include identification of faults from seismic data and identification of filaments from cosmological data. Applicability of the nonlinear PCA to climate analysis and reconstruction of periodic patterns from noisy time series data are also demonstrated. Other contributions of the thesis include development of an efficient semidefinite optimization method for embedding graphs into the Euclidean space. The method produces structure-preserving embeddings that maximize interpoint distances. It is primarily developed for dimensionality reduction, but has also potential applications in graph theory and various areas of physics, chemistry and engineering. Asymptotic behaviour of ridges and maxima of Gaussian kernel densities is also investigated when the kernel bandwidth approaches infinity. The results are applied to the nonlinear PCA and to finding significant maxima of such densities, which is a typical problem in visual object tracking.
Resumo:
The number of persons with visual impairment in Tanzania is estimated to over 1.6 million. About half a million of these persons are children aged 7-13. Only about 1% of these children are enrolled in schools. The special schools and units are too few and in most cases they are far away from the children’s homes. More and more regular schools are enrolling children with visual impairment, but the schools lack financial resources, tactile teaching materials and trained special education teachers. Children with visual impairment enrolled in regular schools seldom get enough support and often fail in examinations. The general aim of this study was to contribute to increased knowledge and understanding about how teachers can change their teaching practices and thus facilitate the learning of children with visual impairment included in regular classrooms as they participate in an action research project. The project was conducted in a primary school in a poor rural region with a high frequency of blindness and visual impairment. The school was poorly resourced and the average number of pupils per class was 90. The teachers who participated in the collaborative action research project were the 14 teachers who taught blind or visually impaired pupils in grades 4 and 6, in total 6 pupils. The action research project was conducted during a period of 6 months and was carried out in five cycles. The teachers were actively involved in all the project activities; identifying challenges, planning solutions, producing teaching materials, reflecting on outcomes, collaborating and evaluating. Empirical data was collected with questionnaires, interviews, observations and focus group discussions. The findings of the study show that the teachers managed to change their teaching practices through systematic reflection, analysis and collaboration. The teachers produced a variety of tactile teaching materials, which facilitated the learning of the pupils with visual impairment. The pupils learned better and felt more included in the regular classes. The teachers gained new knowledge and skills. They grew professionally and started to collaborate with each other. The study contributes to new knowledge of how collaborative action research can be conducted in the area of special education in a Tanzanian school context. The study has also relevance to the planning of school-based professional development programs and teacher education programs in Tanzania and in other low-income countries. The results also point at strategies which can promote inclusion of children with disabilities in regular schools.
Resumo:
I den första delen av den här avhandlingen presenteras en bildens genealogi. Den skildrar hur begreppen för bilden, seendet och jaget utvecklades i relation till varandra i en specifik vetenskaplig och filosofisk kontext. Berättelsen sträcker sig från den tidiga renässansen och det perspektivistiska måleriet, till fotografiets födelse och positivismen. Den här utvecklingen medförde en form av reduktionism i vilken jagets roll – betydelsen av den mänskliga psykologin, vårt omdöme, vår uppmärksamhet och vår vilja – blev förbisedd. Inom den här tanketraditionen uppstod en förskjutning, från en förståelse av bilden som en representation av det tredimensionella rummet på en tvådimensionell yta, till en uppfattning om bilden som en genomskinlig ruta, ett fönster ut mot världen. Idén om avbildningen som en neutral ”blick från ingenstans” kom att förstärka en skeptisk hållning till kommunikation, dialog och vittnesmål och därmed även undergräva vår tillit till varandra och följaktligen vår tillit till oss själva. I den andra delen erbjuder författaren ett alternativ till den tanketradition som behandlas i den första delen. Det som blev förbisett i uppfattningen om en blick från ingenstans var att bilden är ett hjälpmedel då vi bearbetar vårt synfält. Bilden hjälper oss att dela vår syn på saker. Genom den här uppgiften av att dela blir bilden riktningsgivande i våra försök att orientera oss i världen. Jag kan stå bredvid en annan människa och se vad hon ser, men jag vet inte nödvändigtvis hur hon uppfattar det vi ser. Bilden lägger till ett led i det här förhållandet eftersom den inte enbart visar vad den andra ser. När bilden fungerar som den skall visar den också hur den andra ser och på det här sättet blir bilden verksam. Den föreliggande avhandlingen kombinerar epistemologi med vetenskapshistoria och visuella kulturstudier, men dess huvudintresse är filosofiskt. Den befattar sig med filosofiska missförstånd angående avbildning som en mimetisk konstform, kunskap som domesticering och varseblivning som mottagning av data. ------------------------------------------------------ Tämän väitöskirjan ensimmäinen osa selvittää kuvakäsitteen genealogiaa. Se havainnollistaa miten kuvan, näkemisen ja minän käsitteet kehittyivät suhteessa toisiinsa. Kertomus ulottuu varhaisesta renessanssista ja perspektivistisestä maalaustaiteesta, positivismin aikakauteen ja valokuvan syntyyn. Tämä kehitys toi mukanaan reduktionismin jossa minän rooli – ihmisen psykologian merkitys, meidän arviointikyky, meidän huomiokyky sekä meidän tahtomme – vaipui unohduksiin. Ajatusmaailmassa tapahtui siirtymä, kuvan merkitys vaihtui käsityksestä jossa se on kolmiulotteisen tilan representaatio kaksiulotteisella pinnalla, käsitykseen jossa kuva on läpinäkyvä ruutu, ikkuna kohti maailmaa. Ajatus kuvasta neutraalin näkökulman kantajana vahvisti skeptistä suhtautumista kommunikaatiota, dialogisuutta ja subjektiivisuutta kohtaan. Tämä skeptisyys ilmentyi myös vahvana epäluottamuksena ihmiskeskeisyyttä ja toiseutta kohtaan. Toisessa osassa tekijä tarjoaa vaihtoehdon tälle skeptiselle ajatusmaailmalle jota tarkastellaan ensimmäisessä osassa. Kuva on myös väline joka auttaa meitä jäsentämään meidän näkökenttäämme. Se auttaa meitä jakamaan meidän käsityksiä toistemme kanssa. Tämä näkemisen jakamisen käytäntö on kuvan keskeinen tehtävä. Voin seistä toisen ihmisen vieressä ja nähdä samat asiat kuin hän, mutta en välttämättä ymmärrä miten hän näkee nämä asiat. Kuva lisää jotain olennaista tähän suhteeseen. Kun kuva toimii niin kun sen kuuluu toimia, se näyttää myös miten toinen näkee, tällä tavalla kuvasta tulee välittäjä. Tämä väitöskirja yhdistää epistemologiaa, tieteen historiaa ja visuaalisen kulttuurin tutkimusta, mutta sen pääasiallinen tavoite on filosofinen. Se käsittelee filosofisia väärinkäsityksiä koskien kuvan eideettisyyttä.
Resumo:
Companies require information in order to gain an improved understanding of their customers. Data concerning customers, their interests and behavior are collected through different loyalty programs. The amount of data stored in company data bases has increased exponentially over the years and become difficult to handle. This research area is the subject of much current interest, not only in academia but also in practice, as is shown by several magazines and blogs that are covering topics on how to get to know your customers, Big Data, information visualization, and data warehousing. In this Ph.D. thesis, the Self-Organizing Map and two extensions of it – the Weighted Self-Organizing Map (WSOM) and the Self-Organizing Time Map (SOTM) – are used as data mining methods for extracting information from large amounts of customer data. The thesis focuses on how data mining methods can be used to model and analyze customer data in order to gain an overview of the customer base, as well as, for analyzing niche-markets. The thesis uses real world customer data to create models for customer profiling. Evaluation of the built models is performed by CRM experts from the retailing industry. The experts considered the information gained with help of the models to be valuable and useful for decision making and for making strategic planning for the future.
Resumo:
The importance of package design as a marketing tool is growing as the competition in retail environment increases. However, there is a lack of studies on how each element of package design affects consumer decisions in different countries. The objective of this thesis is to study the role of package design to Japanese consumers. The research was conducted through an experiment with a sample of 37 Japanese female participants. They were divided into two groups and were given different tasks: one group had to choose a chocolate for themselves, and the other for a group of friends. The participants were presented with 15 different Finnish chocolate boxes to choose from. The qualitative data was gathered through observation and semi-structured interviews. In addition, data from questionnaires was quantified and all the data was triangulated. The empirical results suggest that visual elements strongly affect the decision making of Japanese consumers. Image was the most important element which acted as both, a visual and an informational aspect in the experiment. Informational elements on the other hand have little effect, especially when the context is written in a foreign language. However, informational elements affected participants who were choosing chocolates for a group of friends. A unique finding was the importance of kawaii (cuteness) to Japanese consumers.
Resumo:
Since the times preceding the Second World War the subject of aircraft tracking has been a core interest to both military and non-military aviation. During subsequent years both technology and configuration of the radars allowed the users to deploy it in numerous fields, such as over-the-horizon radar, ballistic missile early warning systems or forward scatter fences. The latter one was arranged in a bistatic configuration. The bistatic radar has continuously re-emerged over the last eighty years for its intriguing capabilities and challenging configuration and formulation. The bistatic radar arrangement is used as the basis of all the analyzes presented in this work. The aircraft tracking method of VHF Doppler-only information, developed in the first part of this study, is solely based on Doppler frequency readings in relation to time instances of their appearance. The corresponding inverse problem is solved by utilising a multistatic radar scenario with two receivers and one transmitter and using their frequency readings as a base for aircraft trajectory estimation. The quality of the resulting trajectory is then compared with ground-truth information based on ADS-B data. The second part of the study deals with the developement of a method for instantaneous Doppler curve extraction from within a VHF time-frequency representation of the transmitted signal, with a three receivers and one transmitter configuration, based on a priori knowledge of the probability density function of the first order derivative of the Doppler shift, and on a system of blocks for identifying, classifying and predicting the Doppler signal. The extraction capabilities of this set-up are tested with a recorded TV signal and simulated synthetic spectrograms. Further analyzes are devoted to more comprehensive testing of the capabilities of the extraction method. Besides testing the method, the classification of aircraft is performed on the extracted Bistatic Radar Cross Section profiles and the correlation between them for different types of aircraft. In order to properly estimate the profiles, the ADS-B aircraft location information is adjusted based on extracted Doppler frequency and then used for Bistatic Radar Cross Section estimation. The classification is based on seven types of aircraft grouped by their size into three classes.
Resumo:
This thesis investigates the matter of race in the context of Finnish language acquisition among adult migrants in Finland. Here matter denotes both the materiality of race and how race comes to matter. Drawing primarily on an auto/ethno/graphic account of learning the Finnish language as a participant in the Finnish for foreigners classes, this thesis problematises the ontology and epistemology of race, i.e., what race is, how it is known, and what an engagement with race entails. Taking cues from the bodily practices of learning the Finnish trill or the rolling r, this study proposes a notion of “trilling race” and argues for an onto-epistemological dis/continuity that marks race’s arrival. The notion of dis/continuity reworks the distinction between continuity and discontinuity, and asks about the how of the arrival of any identity, the where, and the when. In so doing, an analysis of “trilling race” engages with one of the major problematics that has exercised much critical attention, namely: how to read race differently. That is, to rethink the conundrum of the need to counter “representational weight” (Puar 2007, 191) of race on the one hand, and to account for the racialised lived realities on the other. The link between a study of the phenomenon of host country language acquisition and an examination of the question of race is not as obvious as it might seem. For example, what does the argument that the process of language learning is racialised actually imply? Does it mean that race, as a process of racialisation or an ongoing configuration of sets of power relations, exerts force from an outside on the otherwise neutral process of learning the host country language? Or does it mean that race, as an identity category, presents as among the analytical perspectives, along with gender and class for instance, of the phenomenon of host country language acquisition? With these questions in mind, and to foreground the examination of the question of race in the context of Finnish language acquisition among adult migrants, this thesis opens with a discussion of the art installation Finnexia by Lisa Erdman. Finnexia is a fictitious drug said to facilitate Finnish language learning through accelerating the cognitive learning process and reducing the anxiety of speaking the Finnish language. Not only does the Finnexia installation make visible the ways in which the lack of skill in Finnish is fgured as the threshold – a border that separates the inside from the outside – to integration, but also, and importantly, it raises questions about the nature of difference, and the process of differentiation that separates the individual from the social, fact from fiction, nature from culture. These puzzles animate much of the analysis in this dissertation. These concerns continue to be addressed in the rest of part one. Whereas chapter two offers a reconsideration of the ambiguities of ethnisme/ethnicity and race, chapter three dilates on the methodological implications of a conception of the dis/continuity of race. Part two focuses on the matter of race and examines the political economy of visual-aural encounters, whereas part three shifts the focus and rethinks the possibilities and limitations of transforming racialised and normative constraints. Taking up these particular problematics, this thesis as a whole argues that race trills itself: its identity/difference is simultaneously made possible and impossible.
Resumo:
In this paper, we review the advances of monocular model-based tracking for last ten years period until 2014. In 2005, Lepetit, et. al, [19] reviewed the status of monocular model based rigid body tracking. Since then, direct 3D tracking has become quite popular research area, but monocular model-based tracking should still not be forgotten. We mainly focus on tracking, which could be applied to aug- mented reality, but also some other applications are covered. Given the wide subject area this paper tries to give a broad view on the research that has been conducted, giving the reader an introduction to the different disciplines that are tightly related to model-based tracking. The work has been conducted by searching through well known academic search databases in a systematic manner, and by selecting certain publications for closer examination. We analyze the results by dividing the found papers into different categories by their way of implementation. The issues which have not yet been solved are discussed. We also discuss on emerging model-based methods such as fusing different types of features and region-based pose estimation which could show the way for future research in this subject.
Resumo:
Simplification of highly detailed CAD models is an important step when CAD models are visualized or by other means utilized in augmented reality applications. Without simplification, CAD models may cause severe processing and storage is- sues especially in mobile devices. In addition, simplified models may have other advantages like better visual clarity or improved reliability when used for visual pose tracking. The geometry of CAD models is invariably presented in form of a 3D mesh. In this paper, we survey mesh simplification algorithms in general and focus especially to algorithms that can be used to simplify CAD models. We test some commonly known algorithms with real world CAD data and characterize some new CAD related simplification algorithms that have not been surveyed in previous mesh simplification reviews.
Resumo:
Advancements in information technology have made it possible for organizations to gather and store vast amounts of data of their customers. Information stored in databases can be highly valuable for organizations. However, analyzing large databases has proven to be difficult in practice. For companies in the retail industry, customer intelligence can be used to identify profitable customers, their characteristics, and behavior. By clustering customers into homogeneous groups, companies can more effectively manage their customer base and target profitable customer segments. This thesis will study the use of the self-organizing map (SOM) as a method for analyzing large customer datasets, clustering customers, and discovering information about customer behavior. Aim of the thesis is to find out whether the SOM could be a practical tool for retail companies to analyze their customer data.
Resumo:
Kandidaatintyö tehtiin osana PulpVision-tutkimusprojektia, jonka tarkoituksena on kehittää kuvapohjaisia laskenta- ja luokittelumetodeja sellun laaduntarkkailuun paperin valmistuksessa. Tämän tutkimusprojektin osana on aiemmin kehitetty metodi, jolla etsittiin kaarevia rakenteita kuvista, ja tätä metodia hyödynnettiin kuitujen etsintään kuvista. Tätä metodia käytettiin lähtökohtana kandidaatintyölle. Työn tarkoituksena oli tutkia, voidaanko erilaisista kuitukuvista laskettujen piirteiden avulla tunnistaa kuvassa olevien kuitujen laji. Näissä kuitukuvissa oli kuituja neljästä eri puulajista ja yhdestä kasvista. Nämä lajit olivat akasia, koivu, mänty, eukalyptus ja vehnä. Jokaisesta lajista valittiin 100 kuitukuvaa ja nämä kuvat jaettiin kahteen ryhmään, joista ensimmäistä käytettiin opetusryhmänä ja toista testausryhmänä. Opetusryhmän avulla jokaiselle kuitulajille laskettiin näitä kuvaavia piirteitä, joiden avulla pyrittiin tunnistamaan testausryhmän kuvissa olevat kuitulajit. Nämä kuvat oli tuottanut CEMIS-Oulu (Center for Measurement and Information Systems), joka on mittaustekniikkaan keskittynyt yksikkö Oulun yliopistossa. Yksittäiselle opetusryhmän kuitukuvalle laskettiin keskiarvot ja keskihajonnat kolmesta eri piirteestä, jotka olivat pituus, leveys ja kaarevuus. Lisäksi laskettiin, kuinka monta kuitua kuvasta löydettiin. Näiden piirteiden eri yhdistelmien avulla testattiin tunnistamisen tarkkuutta käyttämällä k:n lähimmän naapurin menetelmää ja Naiivi Bayes -luokitinta testausryhmän kuville. Testeistä saatiin lupaavia tuloksia muun muassa pituuden ja leveyden keskiarvoja käytettäessä saavutettiin jopa noin 98 %:n tarkkuus molemmilla algoritmeilla. Tunnistuksessa kuitujen keskimäärinen pituus vaikutti olevan kuitukuvia parhaiten kuvaava piirre. Käytettyjen algoritmien välillä ei ollut suurta vaihtelua tarkkuudessa. Testeissä saatujen tulosten perusteella voidaan todeta, että kuitukuvien tunnistaminen on mahdollista. Testien perusteella kuitukuvista tarvitsee laskea vain kaksi piirrettä, joilla kuidut voidaan tunnistaa tarkasti. Käytetyt lajittelualgoritmit olivat hyvin yksinkertaisia, mutta ne toimivat testeissä hyvin.
Resumo:
Verkkopalveluiden ylläpitovaiheessa halutaan varmistua, etteivät palveluun tehdyt muutokset aiheuta verkkopalvelussa virhetilanteita ja palvelu toimii moitteetta. Muutoksen hyväksyntätestaus voidaan tehdä regressiotestauksena vertaamalla palvelun tilaa ennen ja jälkeen muutoksen. Sisältöpainotteisessa verkkopalvelussa testaaminen keskittyy loppukäyttäjälle esitetyn sivun semanttiseen sekä visuaaliseen oikeellisuuteen sekä erilaisiin toiminnallisiin testeihin. Työssä tarkastellaan etenkin suositulla WordPress-julkaisujärjestelmällä toteutettujen verkkopalveluiden ylläpitoa. Keskeisenä osana julkaisujärjestelmillä toteutettujen verkkopalveluiden ylläpitoa on julkaisujärjestelmän ja sitä täydentävien lisäosien päivittämistä ajantasaisiin versioihin. Nämä päivitykset paitsi tuovat uusia ominaisuuksia verkkopalvelun kehittäjille, myös paikkaavat järjestelmän tietoturvahaavoittuvuuksia sekä korjaavat aiemmissa versioissa esiintyneitä virheitä. Tässä työssä kehitettiin kohdeyrityksen aiempia verkkopalveluiden ylläpitoprosesseja niissä tunnistettujen kehityskohteiden perusteella. Uudistettu kokonaisuus jakautuu kahteen kokonaisuuteen: päivitystarpeen seurantaan sekä päivitysten tekemiseen. Päivitystarpeen seurantaa varten kehitettiin uusi työkalu helpottamaan kokonaiskuvan hahmottamista. Päivitysten tekemisen osalta työssä keskityttiin automatisoidun regressiotestauksen kehittämiseen, missä tärkeimpänä testauskeinona käytetään verkkopalvelusta tallennettujen kuvankaappausten vertailuun perustuvaa visuaalista testausta. Uusien ylläpitoprosesseille määriteltiin myös seurannan kohteet uudistuksen onnistumisen ja jatkokehityksen arviointia varten.
Resumo:
Convolutional Neural Networks (CNN) have become the state-of-the-art methods on many large scale visual recognition tasks. For a lot of practical applications, CNN architectures have a restrictive requirement: A huge amount of labeled data are needed for training. The idea of generative pretraining is to obtain initial weights of the network by training the network in a completely unsupervised way and then fine-tune the weights for the task at hand using supervised learning. In this thesis, a general introduction to Deep Neural Networks and algorithms are given and these methods are applied to classification tasks of handwritten digits and natural images for developing unsupervised feature learning. The goal of this thesis is to find out if the effect of pretraining is damped by recent practical advances in optimization and regularization of CNN. The experimental results show that pretraining is still a substantial regularizer, however, not a necessary step in training Convolutional Neural Networks with rectified activations. On handwritten digits, the proposed pretraining model achieved a classification accuracy comparable to the state-of-the-art methods.