997 results for Audiovisual documents


Relevance: 20.00%

Abstract:

The following topics were dealt with: document analysis and recognition; multimedia document processing; character recognition; document image processing; cheque processing; form processing; music processing; document segmentation; electronic documents; character classification; handwritten character recognition; information retrieval; postal automation; font recognition; Indian language OCR; handwriting recognition; performance evaluation; graphics recognition; oriental character recognition; and word recognition.

Relevance: 20.00%

Abstract:

We propose a set of metrics that evaluate the uniformity, sharpness, continuity, noise, stroke width variance, pulse width ratio, transient pixel density, entropy, and variance of components to quantify the quality of a document image. The measures are intended for use with any optical character recognition (OCR) engine to estimate a priori the expected performance of the OCR. The suggested measures have been evaluated on many document images in different scripts. The quality of each document image is manually annotated by users to create a ground truth. The idea is to correlate the values of the measures with the user-annotated data. If the calculated measure matches the annotated description, the metric is accepted; otherwise it is rejected. Of the metrics proposed, some are accepted and the rest are rejected. We have defined metrics that are easy to estimate. The metrics proposed in this paper are based on feedback from home-grown OCR engines for Indic (Tamil and Kannada) languages. The metrics are independent of the script and depend only on the quality and age of the paper and the printing. Experiments and results for each proposed metric are discussed. Actual recognition of the printed text is not performed to evaluate the proposed metrics. Sometimes a document image containing broken characters is rated as a good document image by the evaluated metrics, which remains one of the unsolved challenges. The proposed measures work on grayscale document images and fail to provide reliable information on binarized document images.
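The accept-or-reject step described in this abstract can be illustrated with a minimal sketch. It computes one of the listed measures (gray-level entropy) and correlates a metric's values with user annotations. The abstract does not say how the match is decided, so the Pearson correlation, the `threshold` value, and all function names here are assumptions made for illustration only.

```python
import math
from collections import Counter


def gray_entropy(image):
    """Shannon entropy (in bits) of the gray-level histogram.

    `image` is a 2-D list of integer gray levels.
    """
    pixels = [p for row in image for p in row]
    n = len(pixels)
    counts = Counter(pixels)
    return -sum((c / n) * math.log2(c / n) for c in counts.values())


def pearson(xs, ys):
    """Pearson correlation coefficient; returns 0.0 if either series is constant."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = math.sqrt(sum((x - mx) ** 2 for x in xs))
    sy = math.sqrt(sum((y - my) ** 2 for y in ys))
    if sx == 0 or sy == 0:
        return 0.0
    return cov / (sx * sy)


def accept_metric(metric_values, annotated_scores, threshold=0.7):
    """Accept a metric if it tracks the annotations strongly enough.

    The threshold is a hypothetical choice, not a value from the abstract.
    """
    return abs(pearson(metric_values, annotated_scores)) >= threshold
```

Here `metric_values` holds one metric value per document image and `annotated_scores` holds the corresponding user ratings; a metric that varies with the ratings in either direction is kept.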

Relevance: 20.00%

Abstract:

When a document corpus is very large, we often need to reduce the number of features. However, conventional Non-negative Matrix Factorization (NMF) cannot be applied to a billion-by-million matrix, because the matrix may not fit in memory. Here we present a novel Online NMF algorithm. Using Online NMF, we reduce the original high-dimensional space to a low-dimensional space. We then cluster all the documents in the reduced dimension using the k-means algorithm. We show experimentally that good performance can be achieved by processing small subsets of documents. The proposed method outperforms existing algorithms.
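The abstract gives no details of its Online NMF algorithm, so the following is only a generic sketch of the idea of factorizing a document-term matrix one small batch at a time. It uses standard multiplicative updates with running sufficient statistics, which is a swapped-in technique and not the authors' method. The function names, `inner_iters`, and the random initialization are assumptions.

```python
import random


def matmul(a, b):
    """Multiply two matrices stored as lists of rows."""
    cols = list(zip(*b))
    return [[sum(x * y for x, y in zip(row, col)) for col in cols] for row in a]


def transpose(a):
    return [list(col) for col in zip(*a)]


def add(a, b):
    """Element-wise sum of two equally shaped matrices."""
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]


def mult_update(target, numer, denom, eps=1e-9):
    """Element-wise multiplicative update: target * numer / denom."""
    return [[t * n / (d + eps) for t, n, d in zip(tr, nr, dr)]
            for tr, nr, dr in zip(target, numer, denom)]


def online_nmf(batches, n_terms, k, inner_iters=20, seed=0):
    """Approximate each batch V (docs x terms) as H (docs x k) times W (k x terms).

    Only one batch and two small accumulators are held at a time, so the
    full matrix never has to fit in memory.
    """
    rng = random.Random(seed)
    W = [[rng.random() + 0.1 for _ in range(n_terms)] for _ in range(k)]
    A = [[0.0] * k for _ in range(k)]        # running sum of H^T H
    B = [[0.0] * n_terms for _ in range(k)]  # running sum of H^T V
    codes = []
    for V in batches:
        # Fit the low-dimensional codes of this batch with W held fixed.
        H = [[rng.random() + 0.1 for _ in range(k)] for _ in V]
        Wt = transpose(W)
        WWt = matmul(W, Wt)
        VWt = matmul(V, Wt)
        for _ in range(inner_iters):
            H = mult_update(H, VWt, matmul(H, WWt))
        # Fold the batch into the statistics, then refresh W once.
        Ht = transpose(H)
        A = add(A, matmul(Ht, H))
        B = add(B, matmul(Ht, V))
        W = mult_update(W, B, matmul(A, W))
        codes.extend(H)
    return W, codes
```

The returned `codes` are the k-dimensional document representations, which could then be passed to any k-means implementation. Note that codes from early batches were fitted against an earlier version of `W`; a second pass would be needed to make them consistent.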

Relevance: 20.00%

Abstract:

The broader goal of the research described here is to automatically acquire diagnostic knowledge from documents in the domain of manual and mechanical assembly of aircraft structures. These documents are treated as a discourse used by experts to communicate with others. It therefore becomes possible to use discourse analysis to enable machine understanding of the text. The research challenge addressed in the paper is to identify documents or sections of documents that are potential sources of knowledge. In a subsequent step, domain knowledge will be extracted from these segments. The segmentation task requires partitioning the document into relevant segments and understanding the context of each segment. In discourse analysis, the division of a discourse into segments is achieved through indicative clauses called cue phrases, which signal changes in the discourse context. However, such language may not be used in formal documents. Hence the use of a domain-specific ontology and an assembly process model is proposed to segregate chunks of the text based on a local context. Elements of the ontology or model and their related terms serve as indicators of the current context of a segment and of changes in context between segments. Local contexts are aggregated over increasingly larger segments to determine whether the document (or portions of it) pertains to the topic of interest, namely assembly. Knowledge acquired through such processes enables acquisition and reuse of knowledge during any part of the lifecycle of a product.
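A toy sketch of the segmentation idea may help: each sentence is assigned a local context from the ontology terms it contains, and a new segment starts when that context changes. The `ONTOLOGY` dictionary, its terms, and the function names are hypothetical; the abstract does not describe the actual ontology or the assembly process model.

```python
import re

# Hypothetical miniature ontology: concept -> related terms.
ONTOLOGY = {
    "assembly": {"rivet", "fastener", "drill", "torque", "joint"},
    "inspection": {"inspect", "gauge", "tolerance", "defect"},
}


def local_context(sentence):
    """Return the concept with the most matching terms, or None if none match."""
    words = set(re.findall(r"[a-z]+", sentence.lower()))
    scores = {concept: len(words & terms) for concept, terms in ONTOLOGY.items()}
    best = max(scores, key=scores.get)
    return best if scores[best] > 0 else None


def segment(sentences):
    """Group consecutive sentences that share a local context.

    A sentence with no ontology terms inherits the context of the
    segment that precedes it.
    """
    segments = []
    for s in sentences:
        ctx = local_context(s)
        if segments and (ctx is None or ctx == segments[-1][0]):
            segments[-1][1].append(s)
        else:
            segments.append((ctx, [s]))
    return segments
```

Segments labelled `"assembly"` would be the candidates for the later knowledge-extraction step; aggregating labels over larger spans, as the abstract describes, is left out of this sketch.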

Relevance: 20.00%

Abstract:

In optical character recognition of very old books, recognition accuracy drops mainly because characters merge or break. In this paper, we propose the first algorithm to segment merged Kannada characters by using a hypothesis to select the positions to be cut. The method searches for the best possible positions to segment by taking into account the support vector machine classifier's recognition score and the validity of the aspect ratio (width-to-height ratio) of the segments between every pair of cut positions. The hypothesis for selecting the cut positions is based on the fact that a concave surface exists above and below the touching portion. These concave surfaces are found by tracing the valleys in the top contour of the image and doing the same for the image rotated upside-down. The cut positions are then derived as closely matching valleys of the original and the rotated images. The proposed segmentation algorithm works across different font styles, shapes, and sizes, and performs better than the existing segmentation based on the vertical projection profile. The algorithm has been tested on 1125 different word images from an old Kannada book, each containing multiple merged characters. It achieves 89.6% correct segmentation, and the character recognition accuracy on merged words is 91.2%. A few merge points are still missed because no matched valley exists, owing to the specific shapes of the characters meeting at those merges.
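The valley-matching hypothesis can be sketched on a binary word image as follows. This is a simplified illustration under several assumptions: the image is a 2-D list with 1 for ink, the image is flipped vertically instead of rotated so that column indices stay aligned, and the matching tolerance `tol` is a hypothetical parameter. The classifier score and aspect-ratio checks that the abstract uses to choose among candidates are not modelled.

```python
def top_contour(img):
    """For each column, the row of the first ink pixel from the top.

    Columns with no ink get the image height.
    """
    h, w = len(img), len(img[0])
    return [next((r for r in range(h) if img[r][c]), h) for c in range(w)]


def valleys(contour):
    """Centre columns of runs that lie deeper than both neighbouring columns."""
    out = []
    n = len(contour)
    c = 1
    while c < n - 1:
        end = c
        while end + 1 < n and contour[end + 1] == contour[c]:
            end += 1
        left_higher = contour[c - 1] < contour[c]
        right_higher = end + 1 < n and contour[end + 1] < contour[c]
        if left_higher and right_higher:
            out.append((c + end) // 2)
        c = end + 1
    return out


def cut_candidates(img, tol=1):
    """Columns where a top valley and a bottom valley nearly coincide."""
    top = valleys(top_contour(img))
    bottom = valleys(top_contour(img[::-1]))  # vertical flip of the image
    cuts = []
    for t in top:
        near = [b for b in bottom if abs(b - t) <= tol]
        if near:
            b = min(near, key=lambda v: abs(v - t))
            cuts.append((t + b) // 2)
    return cuts
```

For two blobs joined by a thin bridge, both contours dip at the bridge column, so that column is returned as a candidate cut. A merge without a concavity on one side yields no matched valley, which corresponds to the missed cases the abstract mentions.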

Relevance: 20.00%

Abstract:

Announces the opening of registration for the public competitive examination to fill the positions of Accountant, Administrative Assistant, Legislative Security Agent, Machine Operator, and Audiovisual Operator at the Câmara dos Deputados.

Relevance: 20.00%

Abstract:

Announces that one candidate was omitted from the results of the Specific Knowledge, Portuguese Language, and Legislation test of the public competitive examination for Audiovisual Operator, Notice No. 4/92.

Relevance: 20.00%

Abstract:

In accordance with Notices No. 1/1992 and No. 10/1992, informs candidates of the date of the Practical Test for the functional category of Audiovisual Operator.

Relevance: 20.00%

Abstract:

Announces the opening of registration for the public competitive examination to fill positions of Audiovisual Operator in the Legislative Support Activities Group of the Câmara dos Deputados.

Relevance: 20.00%

Abstract:

In accordance with Notice No. 1/1992, announces the results of the practical test and the final results, with the overall ranking, of the public competitive examination for Audiovisual Operator.

Relevance: 20.00%

Abstract:

Investigates how regulation can promote the production and distribution of nationally produced television programmes, films, and other audiovisual content. The study identifies the current regulatory tools and the different policies adopted to promote national content. It also points to new issues arising from the substantial transformation that the media have undergone in recent years. The audiovisual sector today is characterized by an abundance of television channels and telecommunications services and by ongoing digital convergence. This new landscape affects the efficiency and rationale of content regulation. Focusing on the United Kingdom, France, and Brazil, this comparative study investigates the regulatory, political, socio-cultural, economic, technological, and market changes in communication services over recent decades, and how these developments have influenced the supply of national audiovisual content. The analysis starts in the 1980s, when broadcasting began to be gradually liberalized in several countries, and ends in the first decade of the new millennium, when information technology, telecommunications, and broadcasting converge to offer interconnected, complementary, and supplementary services.

Relevance: 20.00%

Abstract:

[EN] The paper analyses a very interesting documentary film about the ancient history of Spain made from a Francoist, and more specifically Falangist, point of view (“Nueva Visión de la Historia”). It was based on a book about the history of Spain written by Antonio Almagro, a member of Falange, the fascist group in Spain, and was intended as a formative instrument for young people in the late forties or early fifties. Only the episodes dealing with Ancient Spain are known (probably the only ones shot), 25 minutes in all, and they are an outstanding example of a hyper-nationalistic and hyper-Catholic perspective on Spanish history. The film also presents a very sympathetic image of José Antonio, the leader of Falange, but, remarkably enough, not of Franco. It also includes a “theoretical” introduction about the notion of History.

Relevance: 20.00%

Abstract:

Eguíluz, Federico; Merino, Raquel; Olsen, Vickie; Pajares, Eterio; Santamaría, José Miguel (eds.)