793 resultados para Video summarization
Resumo:
La medida de calidad de vídeo sigue siendo necesaria para definir los criterios que caracterizan una señal que cumpla los requisitos de visionado impuestos por el usuario. Las nuevas tecnologías, como el vídeo 3D estereoscópico o formatos más allá de la alta definición, imponen nuevos criterios que deben ser analizadas para obtener la mayor satisfacción posible del usuario. Entre los problemas detectados durante el desarrollo de esta tesis doctoral se han determinado fenómenos que afectan a distintas fases de la cadena de producción audiovisual y tipo de contenido variado. En primer lugar, el proceso de generación de contenidos debe encontrarse controlado mediante parámetros que eviten que se produzca el disconfort visual y, consecuentemente, fatiga visual, especialmente en lo relativo a contenidos de 3D estereoscópico, tanto de animación como de acción real. Por otro lado, la medida de calidad relativa a la fase de compresión de vídeo emplea métricas que en ocasiones no se encuentran adaptadas a la percepción del usuario. El empleo de modelos psicovisuales y diagramas de atención visual permitirían ponderar las áreas de la imagen de manera que se preste mayor importancia a los píxeles que el usuario enfocará con mayor probabilidad. Estos dos bloques se relacionan a través de la definición del término saliencia. Saliencia es la capacidad del sistema visual para caracterizar una imagen visualizada ponderando las áreas que más atractivas resultan al ojo humano. La saliencia en generación de contenidos estereoscópicos se refiere principalmente a la profundidad simulada mediante la ilusión óptica, medida en términos de distancia del objeto virtual al ojo humano. Sin embargo, en vídeo bidimensional, la saliencia no se basa en la profundidad, sino en otros elementos adicionales, como el movimiento, el nivel de detalle, la posición de los píxeles o la aparición de caras, que serán los factores básicos que compondrán el modelo de atención visual desarrollado. Con el objetivo de detectar las características de una secuencia de vídeo estereoscópico que, con mayor probabilidad, pueden generar disconfort visual, se consultó la extensa literatura relativa a este tema y se realizaron unas pruebas subjetivas preliminares con usuarios. De esta forma, se llegó a la conclusión de que se producía disconfort en los casos en que se producía un cambio abrupto en la distribución de profundidades simuladas de la imagen, aparte de otras degradaciones como la denominada “violación de ventana”. A través de nuevas pruebas subjetivas centradas en analizar estos efectos con diferentes distribuciones de profundidades, se trataron de concretar los parámetros que definían esta imagen. Los resultados de las pruebas demuestran que los cambios abruptos en imágenes se producen en entornos con movimientos y disparidades negativas elevadas que producen interferencias en los procesos de acomodación y vergencia del ojo humano, así como una necesidad en el aumento de los tiempos de enfoque del cristalino. En la mejora de las métricas de calidad a través de modelos que se adaptan al sistema visual humano, se realizaron también pruebas subjetivas que ayudaron a determinar la importancia de cada uno de los factores a la hora de enmascarar una determinada degradación. Los resultados demuestran una ligera mejora en los resultados obtenidos al aplicar máscaras de ponderación y atención visual, los cuales aproximan los parámetros de calidad objetiva a la respuesta del ojo humano. ABSTRACT Video quality assessment is still a necessary tool for defining the criteria to characterize a signal with the viewing requirements imposed by the final user. New technologies, such as 3D stereoscopic video and formats of HD and beyond HD oblige to develop new analysis of video features for obtaining the highest user’s satisfaction. Among the problems detected during the process of this doctoral thesis, it has been determined that some phenomena affect to different phases in the audiovisual production chain, apart from the type of content. On first instance, the generation of contents process should be enough controlled through parameters that avoid the occurrence of visual discomfort in observer’s eye, and consequently, visual fatigue. It is especially necessary controlling sequences of stereoscopic 3D, with both animation and live-action contents. On the other hand, video quality assessment, related to compression processes, should be improved because some objective metrics are adapted to user’s perception. The use of psychovisual models and visual attention diagrams allow the weighting of image regions of interest, giving more importance to the areas which the user will focus most probably. These two work fields are related together through the definition of the term saliency. Saliency is the capacity of human visual system for characterizing an image, highlighting the areas which result more attractive to the human eye. Saliency in generation of 3DTV contents refers mainly to the simulated depth of the optic illusion, i.e. the distance from the virtual object to the human eye. On the other hand, saliency is not based on virtual depth, but on other features, such as motion, level of detail, position of pixels in the frame or face detection, which are the basic features that are part of the developed visual attention model, as demonstrated with tests. Extensive literature involving visual comfort assessment was looked up, and the development of new preliminary subjective assessment with users was performed, in order to detect the features that increase the probability of discomfort to occur. With this methodology, the conclusions drawn confirmed that one common source of visual discomfort was when an abrupt change of disparity happened in video transitions, apart from other degradations, such as window violation. New quality assessment was performed to quantify the distribution of disparities over different sequences. The results confirmed that abrupt changes in negative parallax environment produce accommodation-vergence mismatches derived from the increasing time for human crystalline to focus the virtual objects. On the other side, for developing metrics that adapt to human visual system, additional subjective tests were developed to determine the importance of each factor, which masks a concrete distortion. Results demonstrated slight improvement after applying visual attention to objective metrics. This process of weighing pixels approximates the quality results to human eye’s response.
Resumo:
The usage of HTTP adaptive streaming (HAS) has become widely spread in multimedia services. Because it allows the service providers to improve the network resource utilization and user׳s Quality of Experience (QoE). Using this technology, the video playback interruption is reduced since the network and server status in addition to capability of user device, all are taken into account by HAS client to adapt the quality to the current condition. Adaptation can be done using different strategies. In order to provide optimal QoE, the perceptual impact of adaptation strategies from point of view of the user should be studied. However, the time-varying video quality due to the adaptation which usually takes place in a long interval introduces a new type of impairment making the subjective evaluation of adaptive streaming system challenging. The contribution of this paper is two-fold: first, it investigates the testing methodology to evaluate HAS QoE by comparing the subjective experimental outcomes obtained from ACR standardized method and a semi-continuous method developed to evaluate the long sequences. In addition, influence of using audiovisual stimuli to evaluate the video-related impairment is inquired. Second, impact of some of the adaptation technical factors including the quality switching amplitude and chunk size in combination with high range of commercial content type is investigated. The results of this study provide a good insight toward achieving appropriate testing method to evaluate HAS QoE, in addition to designing switching strategies with optimal visual quality.
Resumo:
Acknowledgements We would like to thank Erik Rexstad and Rob Williams for useful reviews of this manuscript. The collection of visual and acoustic data was funded by the UK Department of Energy & Climate Change, the Scottish Government, Collaborative Offshore Wind Research into the Environment (COWRIE) and Oil & Gas UK. Digital aerial surveys were funded by Moray Offshore Renewables Ltd and additional funding for analysis of the combined datasets was provided by Marine Scotland. Collaboration between the University of Aberdeen and Marine Scotland was supported by MarCRF. We thank colleagues at the University of Aberdeen, Moray First Marine, NERI, Hi-Def Aerial Surveying Ltd and Ravenair for essential support in the field, particularly Tim Barton, Bill Ruck, Rasmus Nielson and Dave Rutter. Thanks also to Andy Webb, David Borchers, Len Thomas, Kelly McLeod, David L. Miller, Dinara Sadykova and Thomas Cornulier for advice on survey design and statistical approache. Data Accessibility Data are available from the Dryad Digital Repository: http://dx.doi.org/10.5061/dryad.cf04g
Resumo:
Toxoplasma gondii is a member of the phylum Apicomplexa, a diverse group of intracellular parasites that share a unique form of gliding motility. Gliding is substrate dependent and occurs without apparent changes in cell shape and in the absence of traditional locomotory organelles. Here, we demonstrate that gliding is characterized by three distinct forms of motility: circular gliding, upright twirling, and helical rotation. Circular gliding commences while the crescent-shaped parasite lies on its right side, from where it moves in a counterclockwise manner at a rate of ∼1.5 μm/s. Twirling occurs when the parasite rights itself vertically, remaining attached to the substrate by its posterior end and spinning clockwise. Helical gliding is similar to twirling except that it occurs while the parasite is positioned horizontally, resulting in forward movement that follows the path of a corkscrew. The parasite begins lying on its left side (where the convex side is defined as dorsal) and initiates a clockwise revolution along the long axis of the crescent-shaped body. Time-lapse video analyses indicated that helical gliding is a biphasic process. During the first 180o of the turn, the parasite moves forward one body length at a rate of ∼1–3 μm/s. In the second phase, the parasite flips onto its left side, in the process undergoing little net forward motion. All three forms of motility were disrupted by inhibitors of actin filaments (cytochalasin D) and myosin ATPase (butanedione monoxime), indicating that they rely on an actinomyosin motor in the parasite. Gliding motility likely provides the force for active penetration of the host cell and may participate in dissemination within the host and thus is of both fundamental and practical interest.
Resumo:
A bexiga neurogênica é uma disfunção vesical decorrente principalmente da lesão medular. O cateterismo vesical intermitente é o tratamento mais indicado na atualidade, deve ser realizado de 4 a 6 vezes ao dia, durante toda a vida, visando a proteção do trato urinário superior e a aquisição da continência urinária. Na reabilitação desses indivíduos, a autocateterização vesical é um desafio enfrentado na busca pela autonomia, privacidade, inserção social e participação. Os vídeos educativos são utilizados para o aprendizado do autocateterismo em vários países, por serem de fácil utilização e acesso via internet. Apesar disso, não existem vídeos realizados para o contexto brasileiro, levando em consideração os cateteres urinários e a técnica utilizada no Brasil. Este estudo teve como objetivo desenvolver e validar um vídeo educativo para a realização do autocateterismo vesical intermitente limpo. Trata-se de um estudo quantitativo, desenvolvido em duas fases: 1ª fase com a avaliação de vídeos educativos públicos direcionados para a aprendizagem do autocateterismo vesical intermitente com a técnica limpa; e 2ª Fase, com o desenvolvimento e validação de um vídeo educativo voltado para aprendizagem do autocateterismo. O levantamento dos vídeos utilizou um site de compartilhamento de vídeos utilizando o descritor \"autocateterismo\". Os vídeos foram avaliados por três juízes da área de saúde. O processo de desenvolvimento e validação do roteiro do vídeo educativo utilizou questionários previamente ratificados. Participaram dessas etapas, respectivamente, 18 e 17 juízes experts em reabilitação e/ou no ensino em saúde. O levantamento mostrou que apenas 3,5% (172) do total de vídeos disponíveis no site pesquisado eram voltados para o aprendizado do autocateterismo no contexto brasileiro. Seis vídeos eram específicos para o autocateterismo, dos quais quatro tinham informações desatualizadas ou incorretas, apenas dois atingiram a pontuação aceitável. Na validação do roteiro observou-se um predomínio de participantes do sexo feminino (94,44%), com idade de 30 a 60 anos, dos quais 72,22% possuíam mestrado e 50% atuavam há mais de cinco anos na área de reabilitação. O roteiro foi considerado validado com 96,29% das respostas dos juízes \"concordo\" ou \"concordo totalmente\" nas questões referentes ao quesito objetivo, 91,09% para quesito conteúdo, 98,12% em relação ao quesito relevância, 75% quanto ao quesito ambiente, 71,11% no quesito linguagem verbal e 92,70% referente à inclusão de tópicos. A produção do vídeo contou com uso de tecnologia 3D e apoio de uma equipe técnica especializada. No que se refere à validação do conteúdo do vídeo educativo, o conteúdo do vídeo foi considerado validado com 100% dos juízes que responderam \"concordo\" ou \"concordo totalmente\" nas questões referentes à funcionalidade, 86,27% referentes à usabilidade, 97,06% no quesito eficiência, 100% para técnica audiovisual, 94,11% quanto ao ambiente e 97,05% procedimento. O vídeo educativo foi avaliado positivamente tanto pela qualidade das informações quanto pela didática do ensino, mostrando a relevância da validação de materiais educativos. A expectativa é disseminar o vídeo educativo em diferentes centros de reabilitação e Universidades, visando propagar e tornar o conhecimento sobre a temática mais acessível à sociedade e aos profissionais de saúde, em especial os de reabilitação. Além de incentivar e embasar metodologicamente o desenvolvimento de outros vídeos educativos na área da saúde
Resumo:
In 2014, eight academic libraries in the state of New York collaborated on a group patron driven acquisition (PDA) pilot program with Kanopy, a video streaming service for libraries. The institutions, despite vast differences in size and profile, each launched Kanopy’s streaming solution on their campuses under a program where they would jointly contribute to and acquire films based on group usage. The pilot ran for seven months and led to some fascinating insights into the differences in demand for film across campuses, the possibility of PDA as a model for library acquisition, and the feasibility of a group approach to acquisition. This paper presents the background to, results of, and reflections on the pilot program from the three unique perspectives of the consortium, the vendor, and one of the libraries involved, providing a holistic view of the success of the pilot and the lessons learned.
Resumo:
The Elliptical Scanning Algorithm is an effective method to individually detect and label the projected rings. It consecutively defines an elliptical annulus of one pixel wide which grows pixel by pixel and sweeps the image, from centre to periphery, until it detects and labels each whole ring. In a way, it works like a snake-annealing algorithm. Active contour models (snakes) are energy-minimising curves that deform to fit image features. Elliptical Scanning Algorithm changes its geometry in order to label reflected rings.
Resumo:
The exponential increase of subjective, user-generated content since the birth of the Social Web, has led to the necessity of developing automatic text processing systems able to extract, process and present relevant knowledge. In this paper, we tackle the Opinion Retrieval, Mining and Summarization task, by proposing a unified framework, composed of three crucial components (information retrieval, opinion mining and text summarization) that allow the retrieval, classification and summarization of subjective information. An extensive analysis is conducted, where different configurations of the framework are suggested and analyzed, in order to determine which is the best one, and under which conditions. The evaluation carried out and the results obtained show the appropriateness of the individual components, as well as the framework as a whole. By achieving an improvement over 10% compared to the state-of-the-art approaches in the context of blogs, we can conclude that subjective text can be efficiently dealt with by means of our proposed framework.
Resumo:
We propose an original method to geoposition an audio/video stream with multiple emitters that are at the same time receivers of the mixed signal. The achieved method is suitable for those comes where a list of positions within a designated area is encoded with a degree of precision adjusted to the visualization capabilities; and is also easily extensible to support new requirements. This method extends a previously proposed protocol, without incurring in any performance penalty.
Resumo:
In this paper, we propose an original method to geoposition an audio/video stream with multiple emitters that are at the same time receivers of the mixed signal. The obtained method is suitable when a list of positions within a known area is encoded with precision tailored to the visualization capabilities of the target device. Nevertheless, it is easily adaptable to new precision requirements, as well as parameterized data precision. This method extends a previously proposed protocol, without incurring in any performance penalty.