129 resultados para video summarization


Relevância:

20.00% 20.00%

Publicador:

Resumo:

This paper describes the work being conducted in the baseline rail level crossing project, supported by the Australian rail industry and the Cooperative Research Centre for Rail Innovation. The paper discusses the limitations of near-miss data for analysis obtained using current level crossing occurrence reporting practices. The project is addressing these limitations through the development of a data collection and analysis system with an underlying level crossing accident causation model. An overview of the methodology and improved data recording process are described. The paper concludes with a brief discussion of benefits this project is expected to provide the Australian rail industry.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

This paper examines the use of short video tutorials in a post-graduate accounting subject, as a means of helping students transition from dependent to more independent learners. Five short (three to five minute) video tutorials were introduced in an effort to shift the reliance for learning from the lecturer to the student. Students’ usage of video tutorials, comments by students, and reliance on teaching staff for individual assistance were monitored over three semesters from 2008 to 2009. Interviews with students were then conducted in late 2009 to more comprehensively evaluate the use and benefits of video tutorials. Findings reveal preliminary but positive outcomes in terms of both more efficient teaching and more effective learning.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Distributed Wireless Smart Camera (DWSC) network is a special type of Wireless Sensor Network (WSN) that processes captured images in a distributed manner. While image processing on DWSCs sees a great potential for growth, with its applications possessing a vast practical application domain such as security surveillance and health care, it suffers from tremendous constraints. In addition to the limitations of conventional WSNs, image processing on DWSCs requires more computational power, bandwidth and energy that presents significant challenges for large scale deployments. This dissertation has developed a number of algorithms that are highly scalable, portable, energy efficient and performance efficient, with considerations of practical constraints imposed by the hardware and the nature of WSN. More specifically, these algorithms tackle the problems of multi-object tracking and localisation in distributed wireless smart camera net- works and optimal camera configuration determination. Addressing the first problem of multi-object tracking and localisation requires solving a large array of sub-problems. The sub-problems that are discussed in this dissertation are calibration of internal parameters, multi-camera calibration for localisation and object handover for tracking. These topics have been covered extensively in computer vision literatures, however new algorithms must be invented to accommodate the various constraints introduced and required by the DWSC platform. A technique has been developed for the automatic calibration of low-cost cameras which are assumed to be restricted in their freedom of movement to either pan or tilt movements. Camera internal parameters, including focal length, principal point, lens distortion parameter and the angle and axis of rotation, can be recovered from a minimum set of two images of the camera, provided that the axis of rotation between the two images goes through the camera's optical centre and is parallel to either the vertical (panning) or horizontal (tilting) axis of the image. For object localisation, a novel approach has been developed for the calibration of a network of non-overlapping DWSCs in terms of their ground plane homographies, which can then be used for localising objects. In the proposed approach, a robot travels through the camera network while updating its position in a global coordinate frame, which it broadcasts to the cameras. The cameras use this, along with the image plane location of the robot, to compute a mapping from their image planes to the global coordinate frame. This is combined with an occupancy map generated by the robot during the mapping process to localised objects moving within the network. In addition, to deal with the problem of object handover between DWSCs of non-overlapping fields of view, a highly-scalable, distributed protocol has been designed. Cameras that follow the proposed protocol transmit object descriptions to a selected set of neighbours that are determined using a predictive forwarding strategy. The received descriptions are then matched at the subsequent camera on the object's path using a probability maximisation process with locally generated descriptions. The second problem of camera placement emerges naturally when these pervasive devices are put into real use. The locations, orientations, lens types etc. of the cameras must be chosen in a way that the utility of the network is maximised (e.g. maximum coverage) while user requirements are met. To deal with this, a statistical formulation of the problem of determining optimal camera configurations has been introduced and a Trans-Dimensional Simulated Annealing (TDSA) algorithm has been proposed to effectively solve the problem.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Mobile video, as an emerging market and a promising research field, has attracted much attention from both industry and researchers. Considering the quality of user-experience as the crux of mobile video services, this chapter aims to provide a guide to user-centered studies of mobile video quality. This will benefit future research in better understanding user needs and experiences, designing effective research, and providing solid solutions to improve the quality of mobile video. This chapter is organized in three main parts: (1) a review of recent user studies from the perspectives of research focuses, user study methods, and data analysis methods; (2) an example of conducting a user study of mobile video research, together with the discussion on a series of relative issues, such as participants, materials and devices, study procedure, and analysis results, and; (3) a conclusion with an open discussion about challenges and opportunities in mobile video related research, and associated potential future improvements.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The aim of this project was to gain the voice of the early adolescent (aged between 11 and 13 years) about the things that are genuinely important to them in their lives. Eight participants were asked to record a private video diary entry each night for one week. A number of thematic topics were identified including: their experiences and perspectives on school curriculum and assessment, opinions about schooling structures, and importance of friendship and family. Giving young adolescents the opportunity to voice their opinions has been valuable in gaining insight to the relative impacts of teaching and learning approaches in their school contexts and the issues they consider as the most important in their lives.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The balance between player competence and the challenge presented by a task has been acknowledged as a major factor in providing optimal experience in video games. While Dynamic Difficulty Adjustment (DDA) presents methods for adjusting difficulty in real-time during singleplayer games, little research has explored its application in competitive multiplayer games where challenge is dictated by the competence of human opponents. By conducting a formal review of 180 existing competitive multiplayer games, it was found that a large number of modern games are utilizing DDA techniques to balance challenge between human opponents. From this data, we propose a preliminary framework for classifying Multiplayer Dynamic Difficulty Adjustment (mDDA) instances.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

This paper presents an investigation into event detection in crowded scenes, where the event of interest co-occurs with other activities and only binary labels at the clip level are available. The proposed approach incorporates a fast feature descriptor from the MPEG domain, and a novel multiple instance learning (MIL) algorithm using sparse approximation and random sensing. MPEG motion vectors are used to build particle trajectories that represent the motion of objects in uniform video clips, and the MPEG DCT coefficients are used to compute a foreground map to remove background particles. Trajectories are transformed into the Fourier domain, and the Fourier representations are quantized into visual words using the K-Means algorithm. The proposed MIL algorithm models the scene as a linear combination of independent events, where each event is a distribution of visual words. Experimental results show that the proposed approaches achieve promising results for event detection compared to the state-of-the-art.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Non-rigid face alignment is a very important task in a large range of applications but the existing tracking based non-rigid face alignment methods are either inaccurate or requiring person-specific model. This dissertation has developed simultaneous alignment algorithms that overcome these constraints and provide alignment with high accuracy, efficiency, robustness to varying image condition, and requirement of only generic model.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Importance Older men are at risk of dying of melanoma. Objective To assess attendance at and clinical outcomes of clinical skin examinations (CSEs) in older men exposed to a video-based behavioral intervention. Design, Setting, and Participants This was a behavioral randomized clinical trial of a video-based intervention in men aged at least 50 years. Between June 1 and August 31, 2008, men were recruited, completed baseline telephone interviews, and were than randomized to receive either a video-based intervention (n = 469) or brochures only (n = 461; overall response rate, 37.1%) and were again interviewed 7 months later (n = 870; 93.5% retention). Interventions Video on skin self-examination and skin awareness and written informational materials. The control group received written materials only. Main Outcomes and Measures Participants who reported a CSE were asked for the type of CSE (skin spot, partial body, or whole body), who initiated it, whether the physician noted any suspicious lesions, and, if so, how lesions were managed. Physicians completed a case report form that included the type of CSE, who initiated it, the number of suspicious lesions detected, how lesions were managed (excision, nonsurgical treatment, monitoring, or referral), and pathology reports after lesion excision or biopsy. Results Overall, 540 of 870 men (62.1%) self-reported a CSE since receiving intervention materials, and 321 of 540 (59.4%) consented for their physician to provide medical information (received for 266 of 321 [82.9%]). Attendance of any CSE was similar between groups (intervention group, 246 of 436 [56.4%]; control group, 229 of 434 [52.8%]), but men in the intervention group were more likely to self-report a whole-body CSE (154 of 436 [35.3%] vs 118 of 434 [27.2%] for control group; P = .01). Two melanomas, 29 squamous cell carcinomas, and 38 basal cell carcinomas were diagnosed, with a higher proportion of malignant lesions in the intervention group (60.0% vs 40.0% for controls; P = .03). Baseline attitudes, behaviors, and skin cancer history were associated with higher odds of CSE and skin cancer diagnosis. Conclusions and Relevance A video-based intervention may increase whole-body CSE and skin cancer diagnosis in older men. Trial Registration: anzctr.org.au Identifier: ACTRN12608000384358

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Clustering identities in a broadcast video is a useful task to aid in video annotation and retrieval. Quality based frame selection is a crucial task in video face clustering, to both improve the clustering performance and reduce the computational cost. We present a frame work that selects the highest quality frames available in a video to cluster the face. This frame selection technique is based on low level and high level features (face symmetry, sharpness, contrast and brightness) to select the highest quality facial images available in a face sequence for clustering. We also consider the temporal distribution of the faces to ensure that selected faces are taken at times distributed throughout the sequence. Normalized feature scores are fused and frames with high quality scores are used in a Local Gabor Binary Pattern Histogram Sequence based face clustering system. We present a news video database to evaluate the clustering system performance. Experiments on the newly created news database show that the proposed method selects the best quality face images in the video sequence, resulting in improved clustering performance.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

This thesis introduces improved techniques towards automatically estimating the pose of humans from video. It examines a complete workflow to estimating pose, from the segmentation of the raw video stream to extract silhouettes, to using the silhouettes in order to determine the relative orientation of parts of the human body. The proposed segmentation algorithms have improved performance and reduced complexity, while the pose estimation shows superior accuracy during difficult cases of self occlusion.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Letting the patron choose ebooks has been a successful experience. Why not apply the same purchase model to other formats? This showcase outlines Queensland University of Technology’s experience with a trial of patron driven acquisition (PDA) for online video. The trial commencing in August 2012 provided access to over 700 online videos licensed from Kanopy across a number of discipline areas. As online video publishing is still in the early stages of development, and as the trial is only in the very early stages, it is too early to draw any firm conclusions about the likely suitability of this model for online video selection and acquisition. However, the trial provides some interesting initial comparisons with ebook PDA and existing online video purchase models and prompts further consideration of PDA as a method for online video selection and licensing.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

This paper explores the potential for online video as a mechanism to transform the ways students learn, as measured by research, user experience and usage following surveys and trials of patron-driven acquisition collaboratively undertaken by Queensland University of Technology, La Trobe University and Kanopy.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

While video is recognised as an important medium for teaching and learning in the digital age, many video resources are not as effective as they might be, because they do not adequately exploit the strengths of the medium. Presented here are some case studies of video learning resources produced for various courses in a university environment. This ongoing project attempts to identify pedagogic strategies for the use of video; learning situations in which video has the most efficacy; and what production techniques can be employed to make effective video learning resources.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Efficient and effective feature detection and representation is an important consideration when processing videos, and a large number of applications such as motion analysis, 3D scene understanding, tracking etc. depend on this. Amongst several feature description methods, local features are becoming increasingly popular for representing videos because of their simplicity and efficiency. While they achieve state-of-the-art performance with low computational complexity, their performance is still too limited for real world applications. Furthermore, rapid increases in the uptake of mobile devices has increased the demand for algorithms that can run with reduced memory and computational requirements. In this paper we propose a semi binary based feature detectordescriptor based on the BRISK detector, which can detect and represent videos with significantly reduced computational requirements, while achieving comparable performance to the state of the art spatio-temporal feature descriptors. First, the BRISK feature detector is applied on a frame by frame basis to detect interest points, then the detected key points are compared against consecutive frames for significant motion. Key points with significant motion are encoded with the BRISK descriptor in the spatial domain and Motion Boundary Histogram in the temporal domain. This descriptor is not only lightweight but also has lower memory requirements because of the binary nature of the BRISK descriptor, allowing the possibility of applications using hand held devices.We evaluate the combination of detectordescriptor performance in the context of action classification with a standard, popular bag-of-features with SVM framework. Experiments are carried out on two popular datasets with varying complexity and we demonstrate comparable performance with other descriptors with reduced computational complexity.