Classificazione di oggetti in immagini attraverso il modello Bag of Visual Words


Autoria(s): Annovi, Andrea
Contribuinte(s)

Mattoccia, Stefano

Data(s)

17/03/2014

Resumo

Generic object recognition is an important function of the human visual system and everybody finds it highly useful in their everyday life. For an artificial vision system it is a really hard, complex and challenging task because instances of the same object category can generate very different images, depending of different variables such as illumination conditions, the pose of an object, the viewpoint of the camera, partial occlusions, and unrelated background clutter. The purpose of this thesis is to develop a system that is able to classify objects in 2D images based on the context, and identify to which category the object belongs to. Given an image, the system can classify it and decide the correct categorie of the object. Furthermore the objective of this thesis is also to test the performance and the precision of different supervised Machine Learning algorithms in this specific task of object image categorization. Through different experiments the implemented application reveals good categorization performances despite the difficulty of the problem. However this project is open to future improvement; it is possible to implement new algorithms that has not been invented yet or using other techniques to extract features to make the system more reliable. This application can be installed inside an embedded system and after trained (performed outside the system), so it can become able to classify objects in a real-time. The information given from a 3D stereocamera, developed inside the department of Computer Engineering of the University of Bologna, can be used to improve the accuracy of the classification task. The idea is to segment a single object in a scene using the depth given from a stereocamera and in this way make the classification more accurate.

Formato

application/pdf

Identificador

http://amslaurea.unibo.it/6855/1/Andrea_Annovi_tesi.pdf

Annovi, Andrea (2014) Classificazione di oggetti in immagini attraverso il modello Bag of Visual Words. [Laurea magistrale], Università di Bologna, Corso di Studio in Ingegneria informatica [LM-DM270] <http://amslaurea.unibo.it/view/cds/CDS0937/>

Relação

http://amslaurea.unibo.it/6855/

Direitos

info:eu-repo/semantics/openAccess

Palavras-Chave #Bag of Visual Words, Classificazione, Riconoscimento oggetti, Classificatore #scuola :: 843884 :: Ingegneria e Architettura #cds :: 0937 :: Ingegneria informatica [LM-DM270] #sessione :: terza
Tipo

PeerReviewed