873 resultados para Audio-visual Speech Recognition, Visual Feature Extraction, Free-parts, Monolithic, ROI
Resumo:
Se presentan los resultados de una investigación desarrollada mediante la administración de un cuestionario sobre usos y evaluaciones que hacen chicos y chicas de diversos medios audiovisuales (televisión, ordenador, consola, CD-Roms educativos, Internet y juegos para ordenador y para consola). Los resultados nos muestran cómo los progenitores sobreestiman sistemáticamente las informaciones que sus hijos e hijas tienen acerca de cualquier medio audiovisual. En términos generales, los videojuegos son el medio con más concordancias negativas, el ordenador el que tiene más concordancias positivas y la televisión el que acumula más discrepancias entre generaciones.
Resumo:
La presentació del patrimoni en els museus i altres espais afins (exposicions, parcs arqueològics, centres d"interpretació patrimonial, etcètera) s"està beneficiant en els últims anys dels avenços a nivell didàctic que ofereixen les noves tecnologies. Però, sovint es fa palès que aquestes estratègies comunicatives no impliquen necessàriament una òptima assimilació del discurs històric i museogràfic per part del públic. La modernització dels espais patrimonials amb la incorporació d"audiovisuals i sistemes informàtics multimèdia no serveix de gran cosa si es planteja com un mer recurs passiu de delectació o com una concessió a la creixent implantació social de les noves tecnologies. És per això que es fa imprescindible plantejar des d"un punt de vista didàctic i comunicatiu com aquests recursos poden construir i/o enriquir el discurs a l"entorn del patrimoni (històric, arqueològic, artístic, etcètera) per a una museografia veritablement comprensiva i, més enllà dels objectes, educadora en valors.
Resumo:
Euroopan unioni on tiukentanut teiden laitteiden ja tukirakenteiden törmäysturvallisuusvaatimuksia. Uuden standardoinnin tarkoituksena on lieventää ajoneuvon kuljettajan ja matkustajan vammojen vakavuutta ajoneuvon törmätessä tielaitteiden pysyviin rakenteisiin. Käytännössä rakenteiden tulee hidastaa ajoneuvon nopeutta hallitusti eri törmäysnopeuksilla, jolloin matkustajaan kohdistuvat kiihtyvyydet eivät aiheuta vakavaa loukkaantumisriskiä. Vuonna 2005 Mikkelin ammattikorkeakoulun YTI-tutkimuskeskus ja Tehomet Oy kehittivät ensimmäisen version törmäysystävällisestä valaisinpylväästä. Tässä diplomityössä tavoitteena oli kehittää aikaisemmin tehdystä versiosta helpommin valmistettava versio sekä parantaa pylvään törmäyskäyttäytymistä. Valmistusmenetelmistä valittiin pultruusio, kuitukelaus, alipaineinjektio ja RTM. Menetelmille suunniteltiin soveltuvat rakenteet ja laskettiin rakenteiden valmistuskustannukset. Pultruusiolla, alipaineinjektiolla ja RTM:11ä valmistettiin koe-erä esitörmäyskokeita varten. Esitörmäyskokeiden jälkeen valittiin valmistusmenetelmäksi RTM. TKK/Tielaboratorion virallisissa testeissä kehitetylle pylväälle myönnettiin HE2-turvaluokitus. Hanketta jatketaan kehittämällä valmistusprosessia tehokkaammaksi uudistamalla muottitekniikkaa sekä ottamalla käyttöön lujiteaihiot. Tavoitteena on käynnistää tuotanto keväällä 2008. Kehitetty pylväs esitellään kansainvälisillä "Sähkö, Tele, Valo- ja AV 2008"-messuilla Jyväskylän Paviljongissa 6.-8.2.2008.
Resumo:
El patrimonio audiovisual de archivos, bibliotecas y museos se encuentra en peligro debido al deterioro de las grabaciones en soportes magnéticos. Se presentan las alteraciones que pueden sufrir las cintas de vídeo, así como las dificultades que representa la obsolescencia de los aparatos necesarios para su lectura. Se considera que la mejor vía de preservación de las cintas de vídeo es su digitalización. Este paso no es fácil dada la complejidad de los formatos de archivos de vídeo digital, incluyendo los contenedores multimedia y los estándares de compresión de vídeo y de audio. Por ello la elección técnica tiene que estar estrechamente ligada a las necesidades de cada servicio. Esta aproximación se ilustra con el caso de los fondos videográficos de las televisiones locales.
Resumo:
El patrimonio fotográfico sobre soportes plásticos, tanto si se trata de acetatos como de nitratos está expuesto a un proceso de degradación por hidrólisis ácida que es autocatalítico. Aunque existe bibliografía extensa sobre los procedimientos de conservación y preservación de este tipo de material, no abunda aquella que analice la situación real y actual de las colecciones custodiadas por los centros documentales. El presente trabajo tiene por objetivo dar a conocer el estado de conservación de este tipo de patrimonio en Cataluña, así como las condiciones en las que se produce dicha conservación. De los resultados de infiere que los procedimientos adecuados de conservación solamente se han aplicado recientemente y que las instalaciones con control medioambiental están infrautilizadas. Aunque cerca de un 85 % del material todavía no ha alcanzado un estado de catálisis autoinducida deben tomarse medidas rápidas de preservación puesto que una tercera parte del material estudiado se encuntra cerca de ello.
Resumo:
Changes in the angle of illumination incident upon a 3D surface texture can significantly alter its appearance, implying variations in the image texture. These texture variations produce displacements of class members in the feature space, increasing the failure rates of texture classifiers. To avoid this problem, a model-based texture recognition system which classifies textures seen from different distances and under different illumination directions is presented in this paper. The system works on the basis of a surface model obtained by means of 4-source colour photometric stereo, used to generate 2D image textures under different illumination directions. The recognition system combines coocurrence matrices for feature extraction with a Nearest Neighbour classifier. Moreover, the recognition allows one to guess the approximate direction of the illumination used to capture the test image
Resumo:
The teaching apprenticeship established by CAPES for post-graduation scholarship beholders has been discussed and the criterion adopted for the implementation in the post-graduation in Inorganic Chemistry Program presented. A teaching plan for the new subject is proposed, based on the experience gained through a first group. An instrument for evaluation of the student's performance has been developed and analyzed. Aspects like knowledge, clearness, enthusiasm, confidence, good manage on the audio-visual resources, class length are mentioned by degree of importance and the major difficulties faced and pointed out by the students.
Resumo:
This thesis deals with distance transforms which are a fundamental issue in image processing and computer vision. In this thesis, two new distance transforms for gray level images are presented. As a new application for distance transforms, they are applied to gray level image compression. The new distance transforms are both new extensions of the well known distance transform algorithm developed by Rosenfeld, Pfaltz and Lay. With some modification their algorithm which calculates a distance transform on binary images with a chosen kernel has been made to calculate a chessboard like distance transform with integer numbers (DTOCS) and a real value distance transform (EDTOCS) on gray level images. Both distance transforms, the DTOCS and EDTOCS, require only two passes over the graylevel image and are extremely simple to implement. Only two image buffers are needed: The original gray level image and the binary image which defines the region(s) of calculation. No other image buffers are needed even if more than one iteration round is performed. For large neighborhoods and complicated images the two pass distance algorithm has to be applied to the image more than once, typically 3 10 times. Different types of kernels can be adopted. It is important to notice that no other existing transform calculates the same kind of distance map as the DTOCS. All the other gray weighted distance function, GRAYMAT etc. algorithms find the minimum path joining two points by the smallest sum of gray levels or weighting the distance values directly by the gray levels in some manner. The DTOCS does not weight them that way. The DTOCS gives a weighted version of the chessboard distance map. The weights are not constant, but gray value differences of the original image. The difference between the DTOCS map and other distance transforms for gray level images is shown. The difference between the DTOCS and EDTOCS is that the EDTOCS calculates these gray level differences in a different way. It propagates local Euclidean distances inside a kernel. Analytical derivations of some results concerning the DTOCS and the EDTOCS are presented. Commonly distance transforms are used for feature extraction in pattern recognition and learning. Their use in image compression is very rare. This thesis introduces a new application area for distance transforms. Three new image compression algorithms based on the DTOCS and one based on the EDTOCS are presented. Control points, i.e. points that are considered fundamental for the reconstruction of the image, are selected from the gray level image using the DTOCS and the EDTOCS. The first group of methods select the maximas of the distance image to new control points and the second group of methods compare the DTOCS distance to binary image chessboard distance. The effect of applying threshold masks of different sizes along the threshold boundaries is studied. The time complexity of the compression algorithms is analyzed both analytically and experimentally. It is shown that the time complexity of the algorithms is independent of the number of control points, i.e. the compression ratio. Also a new morphological image decompression scheme is presented, the 8 kernels' method. Several decompressed images are presented. The best results are obtained using the Delaunay triangulation. The obtained image quality equals that of the DCT images with a 4 x 4
Resumo:
The results presented in this paper are from a research using a questionnaire about activities and evaluations of boys and girls in relation to different audio-visual media (television, computer, videoconsole, educative CD-Roms, Internet and computer and console games). Results show us that children information about any audio-visual media is systematically overestimated by parents. Generally, the media with more negative concordances is video-games, the one with more positive concordances is the computer and the one with more discrepancies between generations is TV
Resumo:
The present dissertation examined reading development during elementary school years by means of eye movement tracking. Three different but related issues in this field were assessed. First of all, the development of parafoveal processing skills in reading was investigated. Second, it was assessed whether and to what extent sublexical units such as syllables and morphemes are used in processing Finnish words and whether the use of these sublexical units changes as a function of reading proficiency. Finally, the developmental trend in the speed of visual information extraction during reading was examined. With regard to parafoveal processing skills, it was shown that 2nd graders extract letter identity information approx. 5 characters to the right of fixation, 4th graders approx. 7 characters to the right of fixation, and 6th graders and adults approx. 9 characters to the right of fixation. Furthermore, it was shown that all age groups extract more parafoveal information within compound words than across adjectivenoun pairs of similar length. In compounds, parafoveal word information can be extracted in parallel with foveal word information, if the compound in question is of high frequency. With regard to the use of sublexical units in Finnish word processing, it was shown that less proficient 2nd graders use both syllables and morphemes in the course of lexical access. More proficient 2nd graders as well as older readers seem to process words more holistically. Finally, it was shown that 60 ms is enough for 4th graders and adults to extract visual information from both 4-letter and 8-letter words, whereas 2nd graders clearly needed more than 60 ms to extract all information from 8- letter words for processing to proceed smoothly. The present dissertation demonstrates that Finnish 2nd graders develop their reading skills rapidly and are already at an adult level in some aspects of reading. This is not to say that there are no differences between less proficient (e.g., 2nd graders) and more proficient readers (e.g., adults) but in some respects it seems that the visual system used in extracting information from the text is matured by the 2nd grade. Furthermore, the present dissertation demonstrates that the allocation of attention in reading depends much on textual properties such as word frequency and whether words are spatially unified (as in compounds) or not. This flexibility of the attentional system naturally needs to be captured in word processing models. Finally, individual differences within age groups are quite substantial but it seems that by the end of the 2nd grade practically all Finnish children have reached a reasonable level of reading proficiency.
Resumo:
David Smithin esitys Europeana työpajassa 20.11.2012 Helsingissä.
Resumo:
Steganografian tarkoituksena on salaisen viestin piilottaminen muun informaation sekaan. Tutkielmassa perehdytään kirjallisuuden pohjalta steganografiaan ja kuvien digitaaliseen vesileimaamiseen. Tutkielmaan kuuluu myös kokeellinen osuus. Siinä esitellään vesileimattujen kuvien tunnistamiseen kehitetty testausjärjestelmä ja testiajojen tulokset. Testiajoissa kuvasarjoja on vesileimattu valituilla vesileimausmenetelmillä parametreja vaihdellen. Tunnistettaville kuville tehdään piirreirrotus. Erotellut piirteet annetaan parametreina luokittimelle, joka tekee lopullisen tunnistamispäätöksen. Tutkimuksessa saatiin toteutettua toimiva ohjelmisto vesileiman lisäämiseen ja vesileimattujen kuvien tunnistamiseen kuvajoukosta. Tulosten perusteella, sopivalla piirreirrottimella ja tukivektorikoneluokittimella päästään yli 95 prosentin tunnistamistarkkuuteen.
Resumo:
Presentation at Open Repositories 2014, Helsinki, Finland, June 9-13, 2014
Resumo:
Feature extraction is the part of pattern recognition, where the sensor data is transformed into a more suitable form for the machine to interpret. The purpose of this step is also to reduce the amount of information passed to the next stages of the system, and to preserve the essential information in the view of discriminating the data into different classes. For instance, in the case of image analysis the actual image intensities are vulnerable to various environmental effects, such as lighting changes and the feature extraction can be used as means for detecting features, which are invariant to certain types of illumination changes. Finally, classification tries to make decisions based on the previously transformed data. The main focus of this thesis is on developing new methods for the embedded feature extraction based on local non-parametric image descriptors. Also, feature analysis is carried out for the selected image features. Low-level Local Binary Pattern (LBP) based features are in a main role in the analysis. In the embedded domain, the pattern recognition system must usually meet strict performance constraints, such as high speed, compact size and low power consumption. The characteristics of the final system can be seen as a trade-off between these metrics, which is largely affected by the decisions made during the implementation phase. The implementation alternatives of the LBP based feature extraction are explored in the embedded domain in the context of focal-plane vision processors. In particular, the thesis demonstrates the LBP extraction with MIPA4k massively parallel focal-plane processor IC. Also higher level processing is incorporated to this framework, by means of a framework for implementing a single chip face recognition system. Furthermore, a new method for determining optical flow based on LBPs, designed in particular to the embedded domain is presented. Inspired by some of the principles observed through the feature analysis of the Local Binary Patterns, an extension to the well known non-parametric rank transform is proposed, and its performance is evaluated in face recognition experiments with a standard dataset. Finally, an a priori model where the LBPs are seen as combinations of n-tuples is also presented
Resumo:
This paper explores the cognitive functions of the Reality Status Evaluation (RSE) system in our experiences of narrative mediated messages (NMM) (fictional, narrative, audio-visual one-way input and moving picture messages), such as fictional TV programs and films. We regard reality in mediated experiences as a special mental and emotional construction and a multi-dimensional concept. We argue that viewers' reality sense in NMM is influenced by many factors with "real - on" as the default value. Some of these factors function as primary mental processes, including the content realism factors of those messages such as Factuality (F), Social Realism (SR), Life Relevance (LR), and Perceptual Realism - involvement (PR), which would have direct impacts on reality evaluations. Other factors, such as Narrative Meaning (NM), Emotional Responses, and personality trait Absorption (AB), will influence the reality evaluations directly or through the mediations of these main dimensions. I designed a questionnaire to study this theoretical construction. I developed items to form scales and sub-scales measuring viewers' subjective experiences of reality evaluations and these factors. Pertinent statistical techniques, such as internal consistency and factorial analysis, were employed to make revisions and improve the quality of the questionnaire. In the formal experiment, after viewing two short films, which were selected as high or low narrative structure messages from previous experiments, participants were required to answer the questionnaire, Absorption questionnaire, and SAM (Self-Assessment Manikin, measuring immediate emotional responses). Results were analyzed using the EQS, structural equation modeling (SEM), and discussed in terms oflatent relations among these subjective factors in mediated experience. The present results supported most of my theoretical hypotheses. In NMM, three main jactors, or dimensions, could be extracted in viewers' subjective reality evaluations: Social Realism (combining with Factuality), Life Relevance and Perceptual Realism. I designed two ways to assess viewers' understanding of na"ative meanings in mediated messages, questionnaire (NM-Q) and rating (NM-R) measurement, and its significant influences on reality evaluations was supported in the final EQS models. Particularly in high story stnlcture messages, the effect of Narrative Meaning (NM) can rarely be explained by only these dimensions of reality evaluations. Also, Empathy seems to playa more important role in RSE of low story structure messages. Also, I focused on two other factors that were pertinent to RSE in NMM, the personality trait Absorption, and Emotional Responses (including two dimensions: Valence and Intensity). Final model results partly supported my theoretical hypotheses about the relationships among Absorption (AB), Social Realism (SR) and Life Relevance (LR); and the immediate impact of Emotional Responses on Perceptual Realism cPR).