36 resultados para visual objects
Resumo:
Paper presented in ISA RC23 meeting, Gothenburg July 16th 2010
Resumo:
Diabetes is a rapidly increasing worldwide problem which is characterised by defective metabolism of glucose that causes long-term dysfunction and failure of various organs. The most common complication of diabetes is diabetic retinopathy (DR), which is one of the primary causes of blindness and visual impairment in adults. The rapid increase of diabetes pushes the limits of the current DR screening capabilities for which the digital imaging of the eye fundus (retinal imaging), and automatic or semi-automatic image analysis algorithms provide a potential solution. In this work, the use of colour in the detection of diabetic retinopathy is statistically studied using a supervised algorithm based on one-class classification and Gaussian mixture model estimation. The presented algorithm distinguishes a certain diabetic lesion type from all other possible objects in eye fundus images by only estimating the probability density function of that certain lesion type. For the training and ground truth estimation, the algorithm combines manual annotations of several experts for which the best practices were experimentally selected. By assessing the algorithm’s performance while conducting experiments with the colour space selection, both illuminance and colour correction, and background class information, the use of colour in the detection of diabetic retinopathy was quantitatively evaluated. Another contribution of this work is the benchmarking framework for eye fundus image analysis algorithms needed for the development of the automatic DR detection algorithms. The benchmarking framework provides guidelines on how to construct a benchmarking database that comprises true patient images, ground truth, and an evaluation protocol. The evaluation is based on the standard receiver operating characteristics analysis and it follows the medical practice in the decision making providing protocols for image- and pixel-based evaluations. During the work, two public medical image databases with ground truth were published: DIARETDB0 and DIARETDB1. The framework, DR databases and the final algorithm, are made public in the web to set the baseline results for automatic detection of diabetic retinopathy. Although deviating from the general context of the thesis, a simple and effective optic disc localisation method is presented. The optic disc localisation is discussed, since normal eye fundus structures are fundamental in the characterisation of DR.
Resumo:
Local features are used in many computer vision tasks including visual object categorization, content-based image retrieval and object recognition to mention a few. Local features are points, blobs or regions in images that are extracted using a local feature detector. To make use of extracted local features the localized interest points are described using a local feature descriptor. A descriptor histogram vector is a compact representation of an image and can be used for searching and matching images in databases. In this thesis the performance of local feature detectors and descriptors is evaluated for object class detection task. Features are extracted from image samples belonging to several object classes. Matching features are then searched using random image pairs of a same class. The goal of this thesis is to find out what are the best detector and descriptor methods for such task in terms of detector repeatability and descriptor matching rate.
Resumo:
The large and growing number of digital images is making manual image search laborious. Only a fraction of the images contain metadata that can be used to search for a particular type of image. Thus, the main research question of this thesis is whether it is possible to learn visual object categories directly from images. Computers process images as long lists of pixels that do not have a clear connection to high-level semantics which could be used in the image search. There are various methods introduced in the literature to extract low-level image features and also approaches to connect these low-level features with high-level semantics. One of these approaches is called Bag-of-Features which is studied in the thesis. In the Bag-of-Features approach, the images are described using a visual codebook. The codebook is built from the descriptions of the image patches using clustering. The images are described by matching descriptions of image patches with the visual codebook and computing the number of matches for each code. In this thesis, unsupervised visual object categorisation using the Bag-of-Features approach is studied. The goal is to find groups of similar images, e.g., images that contain an object from the same category. The standard Bag-of-Features approach is improved by using spatial information and visual saliency. It was found that the performance of the visual object categorisation can be improved by using spatial information of local features to verify the matches. However, this process is computationally heavy, and thus, the number of images must be limited in the spatial matching, for example, by using the Bag-of-Features method as in this study. Different approaches for saliency detection are studied and a new method based on the Hessian-Affine local feature detector is proposed. The new method achieves comparable results with current state-of-the-art. The visual object categorisation performance was improved by using foreground segmentation based on saliency information, especially when the background could be considered as clutter.
Resumo:
This thesis was part of lean adaptation project started at Outotec Lappeenranta factory in early 2013. The purpose of this thesis was to develop and propose lean tools that could be used in daily management, visual management and continuous improvement. This thesis was “outsiders” view, and as such, did not study the current processes deeply. As result of this thesis, two different Daily Management -boards were designed, one for parallel processes and one for sequential processes. In addition, methods of doing continuous improvement and daily task accountability were framed and standard work for the leaders outlined. The tools presented in this thesis are general tools which support work in lean environment. They are visual and, if used correctly, they provide a basis from which continuous improvement can be done. Lean philosophy emphasizes the deep understanding of the current situation and it would be against the lean principles to blindly implement anything developed “on the outside”. The tools presented should be reviewed and modified further by the people working on the factory floor.
Resumo:
Ett ämne som väckt intresse både inom industrin och forskningen är hantering av kundförhållanden (CRM, eng. Customer Relationship Management), dvs. en kundorienterad affärsstrategi där företagen från att ha varit produktorienterade väljer att bli mera kundcentrerade. Numera kan kundernas beteende och aktiviteter lätt registreras och sparas med hjälp av integrerade affärssystem (ERP, eng. Enterprise Resource Planning) och datalager (DW, eng. Data Warehousing). Kunder med olika preferenser och köpbeteende skapar sin egen ”signatur” i synnerhet via användningen av kundkort, vilket möjliggör mångsidig modellering av kundernas köpbeteende. För att få en översikt av kundernas köpbeteende och deras lönsamhet, används ofta kundsegmentering som en metod för att indela kunderna i grupper utgående från deras likheter. De mest använda metoderna för kundsegmentering är analytiska modeller konstruerade för en viss tidsperiod. Dessa modeller beaktar inte att kundernas beteende kan förändras med tiden. I föreliggande avhandling skapas en holistisk översikt av kundernas karaktär och köpbeteende som utöver de konventionella segmenteringsmodellerna även beaktar dynamiken i köpbeteendet. Dynamiken i en kundsegmenteringsmodell innefattar förändringar i segmentens struktur och innehåll, samt förändringen av individuella kunders tillhörighet i ett segment (s.k migrationsanalyser). Vardera förändringen modelleras, analyseras och exemplifieras med visuella datautvinningstekniker, främst med självorganiserande kartor (SOM, eng. Self-Organizing Maps) och självorganiserande tidskartor (SOTM), en vidareutveckling av SOM. Visualiseringen anteciperas underlätta tolkningen av identifierade mönster och göra processen med kunskapsöverföring mellan den som gör analysen och beslutsfattaren smidigare. Asiakkuudenhallinta (CRM) eli organisaation muuttaminen tuotepainotteisesta asiakaskeskeiseksi on herättänyt mielenkiintoa niin yliopisto- kuin yritysmaailmassakin. Asiakkaiden käyttäytymistä ja toimintaa pystytään nykyään helposti tallentamaan ja varastoimaan toiminnanohjausjärjestelmien ja tietovarastojen avulla; asiakkaat jättävät jatkuvasti piirteistään ja ostokäyttäytymisestään kertovia tietojälkiä, joita voidaan analysoida. On tavallista, että asiakkaat poikkeavat toisistaan eri tavoin, ja heidän mieltymyksensä kuten myös ostokäyttäytymisensä saattavat olla hyvinkin erilaisia. Asiakaskäyttäytymisen monimuotoisuuteen ja tuottavuuteen paneuduttaessa käytetäänkin laajalti asiakassegmentointia eli asiakkaiden jakamista ryhmiin samankaltaisuuden perusteella. Perinteiset asiakassegmentoinnin ratkaisut ovat usein yksittäisiä analyyttisia malleja, jotka on tehty tietyn aikajakson perusteella. Tämän vuoksi ne monesti jättävät huomioimatta sen, että asiakkaiden käyttäytyminen saattaa ajan kuluessa muuttua. Tässä väitöskirjassa pyritäänkin tarjoamaan holistinen kuva asiakkaiden ominaisuuksista ja ostokäyttäytymisestä tarkastelemalla kahta muutosvoimaa tiettyyn aikarajaukseen perustuvien perinteisten segmentointimallien lisäksi. Nämä kaksi asiakassegmentointimallin dynamiikkaa ovat muutokset segmenttien rakenteessa ja muutokset yksittäisten asiakkaiden kuulumisessa ryhmään. Ensimmäistä dynamiikkaa lähestytään ajallisen asiakassegmentoinnin avulla, jossa visualisoidaan ajan kuluessa tapahtuvat muutokset segmenttien rakenteissa ja profiileissa. Toista dynamiikkaa taas lähestytään käyttäen nk. segmenttisiirtymien analyysia, jossa visuaalisin keinoin tunnistetaan samantyyppisesti segmentistä toiseen vaihtavat asiakkaat. Visualisoinnin tehtävänä on tukea havaittujen kaavojen tulkitsemista sekä helpottaa tiedonsiirtoa analysoijan ja päättäjien välillä. Visuaalisia tiedonlouhintamenetelmiä, kuten itseorganisoivia karttoja ja niiden laajennuksia, käytetään osoittamaan näiden menetelmien hyödyllisyys sekä asiakkuudenhallinnassa yleisesti että erityisesti asiakassegmentoinnissa.
Resumo:
The recent emergence of low-cost RGB-D sensors has brought new opportunities for robotics by providing affordable devices that can provide synchronized images with both color and depth information. In this thesis, recent work on pose estimation utilizing RGBD sensors is reviewed. Also, a pose recognition system for rigid objects using RGB-D data is implemented. The implementation uses half-edge primitives extracted from the RGB-D images for pose estimation. The system is based on the probabilistic object representation framework by Detry et al., which utilizes Nonparametric Belief Propagation for pose inference. Experiments are performed on household objects to evaluate the performance and robustness of the system.
Resumo:
Presentation at Open Repositories 2014, Helsinki, Finland, June 9-13, 2014
Resumo:
Poster at Open Repositories 2014, Helsinki, Finland, June 9-13, 2014
Resumo:
This thesis presents a framework for segmentation of clustered overlapping convex objects. The proposed approach is based on a three-step framework in which the tasks of seed point extraction, contour evidence extraction, and contour estimation are addressed. The state-of-art techniques for each step were studied and evaluated using synthetic and real microscopic image data. According to obtained evaluation results, a method combining the best performers in each step was presented. In the proposed method, Fast Radial Symmetry transform, edge-to-marker association algorithm and ellipse fitting are employed for seed point extraction, contour evidence extraction and contour estimation respectively. Using synthetic and real image data, the proposed method was evaluated and compared with two competing methods and the results showed a promising improvement over the competing methods, with high segmentation and size distribution estimation accuracy.
Resumo:
One of the greatest conundrums to the contemporary science is the relation between consciousness and brain activity, and one of the specifi c questions is how neural activity can generate vivid subjective experiences. Studies focusing on visual consciousness have become essential in solving the empirical questions of consciousness. Th e main aim of this thesis is to clarify the relation between visual consciousness and the neural and electrophysiological processes of the brain. By applying electroencephalography and functional magnetic resonance image-guided transcranial magnetic stimulation (TMS), we investigated the links between conscious perception and attention, the temporal evolution of visual consciousness during stimulus processing, the causal roles of primary visual cortex (V1), visual area 2 (V2) and lateral occipital cortex (LO) in the generation of visual consciousness and also the methodological issues concerning the accuracy of targeting TMS to V1. Th e results showed that the fi rst eff ects of visual consciousness on electrophysiological responses (about 140 ms aft er the stimulus-onset) appeared earlier than the eff ects of selective attention, and also in the unattended condition, suggesting that visual consciousness and selective attention are two independent phenomena which have distinct underlying neural mechanisms. In addition, while it is well known that V1 is necessary for visual awareness, the results of the present thesis suggest that also the abutting visual area V2 is a prerequisite for conscious perception. In our studies, the activation in V2 was necessary for the conscious perception of change in contrast for a shorter period of time than in the case of more detailed conscious perception. We also found that TMS in LO suppressed the conscious perception of object shape when TMS was delivered in two distinct time windows, the latter corresponding with the timing of the ERPs related to the conscious perception of coherent object shape. Th e result supports the view that LO is crucial in conscious perception of object coherency and is likely to be directly involved in the generation of visual consciousness. Furthermore, we found that visual sensations, or phosphenes, elicited by the TMS of V1 were brighter than identically induced phosphenes arising from V2. Th ese fi ndings demonstrate that V1 contributes more to the generation of the sensation of brightness than does V2. Th e results also suggest that top-down activation from V2 to V1 is probably associated with phosphene generation. The results of the methodological study imply that when a commonly used landmark (2 cm above the inion) is used in targeting TMS to V1, the TMS-induced electric fi eld is likely to be highest in dorsal V2. When V1 was targeted according to the individual retinotopic data, the electric fi eld was highest in V1 only in half of the participants. Th is result suggests that if the objective is to study the role of V1 with TMS methodology, at least functional maps of V1 and V2 should be applied with computational model of the TMS-induced electric fi eld in V1 and V2. Finally, the results of this thesis imply that diff erent features of attention contribute diff erently to visual consciousness, and thus, the theoretical model which is built up of the relationship between visual consciousness and attention should acknowledge these diff erences. Future studies should also explore the possibility that visual consciousness consists of several processing stages, each of which have their distinct underlying neural mechanisms.
Resumo:
The number of persons with visual impairment in Tanzania is estimated to over 1.6 million. About half a million of these persons are children aged 7-13. Only about 1% of these children are enrolled in schools. The special schools and units are too few and in most cases they are far away from the children’s homes. More and more regular schools are enrolling children with visual impairment, but the schools lack financial resources, tactile teaching materials and trained special education teachers. Children with visual impairment enrolled in regular schools seldom get enough support and often fail in examinations. The general aim of this study was to contribute to increased knowledge and understanding about how teachers can change their teaching practices and thus facilitate the learning of children with visual impairment included in regular classrooms as they participate in an action research project. The project was conducted in a primary school in a poor rural region with a high frequency of blindness and visual impairment. The school was poorly resourced and the average number of pupils per class was 90. The teachers who participated in the collaborative action research project were the 14 teachers who taught blind or visually impaired pupils in grades 4 and 6, in total 6 pupils. The action research project was conducted during a period of 6 months and was carried out in five cycles. The teachers were actively involved in all the project activities; identifying challenges, planning solutions, producing teaching materials, reflecting on outcomes, collaborating and evaluating. Empirical data was collected with questionnaires, interviews, observations and focus group discussions. The findings of the study show that the teachers managed to change their teaching practices through systematic reflection, analysis and collaboration. The teachers produced a variety of tactile teaching materials, which facilitated the learning of the pupils with visual impairment. The pupils learned better and felt more included in the regular classes. The teachers gained new knowledge and skills. They grew professionally and started to collaborate with each other. The study contributes to new knowledge of how collaborative action research can be conducted in the area of special education in a Tanzanian school context. The study has also relevance to the planning of school-based professional development programs and teacher education programs in Tanzania and in other low-income countries. The results also point at strategies which can promote inclusion of children with disabilities in regular schools.
Resumo:
I den första delen av den här avhandlingen presenteras en bildens genealogi. Den skildrar hur begreppen för bilden, seendet och jaget utvecklades i relation till varandra i en specifik vetenskaplig och filosofisk kontext. Berättelsen sträcker sig från den tidiga renässansen och det perspektivistiska måleriet, till fotografiets födelse och positivismen. Den här utvecklingen medförde en form av reduktionism i vilken jagets roll – betydelsen av den mänskliga psykologin, vårt omdöme, vår uppmärksamhet och vår vilja – blev förbisedd. Inom den här tanketraditionen uppstod en förskjutning, från en förståelse av bilden som en representation av det tredimensionella rummet på en tvådimensionell yta, till en uppfattning om bilden som en genomskinlig ruta, ett fönster ut mot världen. Idén om avbildningen som en neutral ”blick från ingenstans” kom att förstärka en skeptisk hållning till kommunikation, dialog och vittnesmål och därmed även undergräva vår tillit till varandra och följaktligen vår tillit till oss själva. I den andra delen erbjuder författaren ett alternativ till den tanketradition som behandlas i den första delen. Det som blev förbisett i uppfattningen om en blick från ingenstans var att bilden är ett hjälpmedel då vi bearbetar vårt synfält. Bilden hjälper oss att dela vår syn på saker. Genom den här uppgiften av att dela blir bilden riktningsgivande i våra försök att orientera oss i världen. Jag kan stå bredvid en annan människa och se vad hon ser, men jag vet inte nödvändigtvis hur hon uppfattar det vi ser. Bilden lägger till ett led i det här förhållandet eftersom den inte enbart visar vad den andra ser. När bilden fungerar som den skall visar den också hur den andra ser och på det här sättet blir bilden verksam. Den föreliggande avhandlingen kombinerar epistemologi med vetenskapshistoria och visuella kulturstudier, men dess huvudintresse är filosofiskt. Den befattar sig med filosofiska missförstånd angående avbildning som en mimetisk konstform, kunskap som domesticering och varseblivning som mottagning av data. ------------------------------------------------------ Tämän väitöskirjan ensimmäinen osa selvittää kuvakäsitteen genealogiaa. Se havainnollistaa miten kuvan, näkemisen ja minän käsitteet kehittyivät suhteessa toisiinsa. Kertomus ulottuu varhaisesta renessanssista ja perspektivistisestä maalaustaiteesta, positivismin aikakauteen ja valokuvan syntyyn. Tämä kehitys toi mukanaan reduktionismin jossa minän rooli – ihmisen psykologian merkitys, meidän arviointikyky, meidän huomiokyky sekä meidän tahtomme – vaipui unohduksiin. Ajatusmaailmassa tapahtui siirtymä, kuvan merkitys vaihtui käsityksestä jossa se on kolmiulotteisen tilan representaatio kaksiulotteisella pinnalla, käsitykseen jossa kuva on läpinäkyvä ruutu, ikkuna kohti maailmaa. Ajatus kuvasta neutraalin näkökulman kantajana vahvisti skeptistä suhtautumista kommunikaatiota, dialogisuutta ja subjektiivisuutta kohtaan. Tämä skeptisyys ilmentyi myös vahvana epäluottamuksena ihmiskeskeisyyttä ja toiseutta kohtaan. Toisessa osassa tekijä tarjoaa vaihtoehdon tälle skeptiselle ajatusmaailmalle jota tarkastellaan ensimmäisessä osassa. Kuva on myös väline joka auttaa meitä jäsentämään meidän näkökenttäämme. Se auttaa meitä jakamaan meidän käsityksiä toistemme kanssa. Tämä näkemisen jakamisen käytäntö on kuvan keskeinen tehtävä. Voin seistä toisen ihmisen vieressä ja nähdä samat asiat kuin hän, mutta en välttämättä ymmärrä miten hän näkee nämä asiat. Kuva lisää jotain olennaista tähän suhteeseen. Kun kuva toimii niin kun sen kuuluu toimia, se näyttää myös miten toinen näkee, tällä tavalla kuvasta tulee välittäjä. Tämä väitöskirja yhdistää epistemologiaa, tieteen historiaa ja visuaalisen kulttuurin tutkimusta, mutta sen pääasiallinen tavoite on filosofinen. Se käsittelee filosofisia väärinkäsityksiä koskien kuvan eideettisyyttä.
Resumo:
Companies require information in order to gain an improved understanding of their customers. Data concerning customers, their interests and behavior are collected through different loyalty programs. The amount of data stored in company data bases has increased exponentially over the years and become difficult to handle. This research area is the subject of much current interest, not only in academia but also in practice, as is shown by several magazines and blogs that are covering topics on how to get to know your customers, Big Data, information visualization, and data warehousing. In this Ph.D. thesis, the Self-Organizing Map and two extensions of it – the Weighted Self-Organizing Map (WSOM) and the Self-Organizing Time Map (SOTM) – are used as data mining methods for extracting information from large amounts of customer data. The thesis focuses on how data mining methods can be used to model and analyze customer data in order to gain an overview of the customer base, as well as, for analyzing niche-markets. The thesis uses real world customer data to create models for customer profiling. Evaluation of the built models is performed by CRM experts from the retailing industry. The experts considered the information gained with help of the models to be valuable and useful for decision making and for making strategic planning for the future.
Resumo:
The importance of package design as a marketing tool is growing as the competition in retail environment increases. However, there is a lack of studies on how each element of package design affects consumer decisions in different countries. The objective of this thesis is to study the role of package design to Japanese consumers. The research was conducted through an experiment with a sample of 37 Japanese female participants. They were divided into two groups and were given different tasks: one group had to choose a chocolate for themselves, and the other for a group of friends. The participants were presented with 15 different Finnish chocolate boxes to choose from. The qualitative data was gathered through observation and semi-structured interviews. In addition, data from questionnaires was quantified and all the data was triangulated. The empirical results suggest that visual elements strongly affect the decision making of Japanese consumers. Image was the most important element which acted as both, a visual and an informational aspect in the experiment. Informational elements on the other hand have little effect, especially when the context is written in a foreign language. However, informational elements affected participants who were choosing chocolates for a group of friends. A unique finding was the importance of kawaii (cuteness) to Japanese consumers.