910 resultados para foreground background segmentation
Resumo:
Texture analysis and textural cues have been applied for image classification, segmentation and pattern recognition. Dominant texture descriptors include directionality, coarseness, line-likeness etc. In this dissertation a class of textures known as particulate textures are defined, which are predominantly coarse or blob-like. The set of features that characterise particulate textures are different from those that characterise classical textures. These features are micro-texture, macro-texture, size, shape and compaction. Classical texture analysis techniques do not adequately capture particulate texture features. This gap is identified and new methods for analysing particulate textures are proposed. The levels of complexity in particulate textures are also presented ranging from the simplest images where blob-like particles are easily isolated from their back- ground to the more complex images where the particles and the background are not easily separable or the particles are occluded. Simple particulate images can be analysed for particle shapes and sizes. Complex particulate texture images, on the other hand, often permit only the estimation of particle dimensions. Real life applications of particulate textures are reviewed, including applications to sedimentology, granulometry and road surface texture analysis. A new framework for computation of particulate shape is proposed. A granulometric approach for particle size estimation based on edge detection is developed which can be adapted to the gray level of the images by varying its parameters. This study binds visual texture analysis and road surface macrotexture in a theoretical framework, thus making it possible to apply monocular imaging techniques to road surface texture analysis. Results from the application of the developed algorithm to road surface macro-texture, are compared with results based on Fourier spectra, the auto- correlation function and wavelet decomposition, indicating the superior performance of the proposed technique. The influence of image acquisition conditions such as illumination and camera angle on the results was systematically analysed. Experimental data was collected from over 5km of road in Brisbane and the estimated coarseness along the road was compared with laser profilometer measurements. Coefficient of determination R2 exceeding 0.9 was obtained when correlating the proposed imaging technique with the state of the art Sensor Measured Texture Depth (SMTD) obtained using laser profilometers.
Resumo:
Object segmentation is one of the fundamental steps for a number of robotic applications such as manipulation, object detection, and obstacle avoidance. This paper proposes a visual method for incorporating colour and depth information from sequential multiview stereo images to segment objects of interest from complex and cluttered environments. Rather than segmenting objects using information from a single frame in the sequence, we incorporate information from neighbouring views to increase the reliability of the information and improve the overall segmentation result. Specifically, dense depth information of a scene is computed using multiple view stereo. Depths from neighbouring views are reprojected into the reference frame to be segmented compensating for imperfect depth computations for individual frames. The multiple depth layers are then combined with color information from the reference frame to create a Markov random field to model the segmentation problem. Finally, graphcut optimisation is employed to infer pixels belonging to the object to be segmented. The segmentation accuracy is evaluated over images from an outdoor video sequence demonstrating the viability for automatic object segmentation for mobile robots using monocular cameras as a primary sensor.
Resumo:
In this paper, we argue that second language (L2) reading research, which has been informed by studies involving first language (L1) alphabetic English reading, may be less relevant to L2 readers with non-alphabetic reading backgrounds, such as Chinese readers with an L1 logographic (Chinese character) learning history. We provide both neuroanatomical and behavioural evidence from Chinese language reading studies to support our claims. The paper concludes with an argument outlining the need for a universal L2 reading model which can adequately account for readers with diverse L1 orthographic language learning histories.
Resumo:
This paper presents the findings of an indepth study into the effects and success of marketing segmentation, target marketing and fundraising. Organisations are constantly seeking new ways and more efficient means to raise funds so that they can fulfill their objectives. These organisations review and evaluate their resources to gain competitive advantage and increased fundraising success...
Resumo:
Modelling activities in crowded scenes is very challenging as object tracking is not robust in complicated scenes and optical flow does not capture long range motion. We propose a novel approach to analyse activities in crowded scenes using a “bag of particle trajectories”. Particle trajectories are extracted from foreground regions within short video clips using particle video, which estimates long range motion in contrast to optical flow which is only concerned with inter-frame motion. Our applications include temporal video segmentation and anomaly detection, and we perform our evaluation on several real-world datasets containing complicated scenes. We show that our approaches achieve state-of-the-art performance for both tasks.
Resumo:
Humanitarian entrants remain invisible in existing populations datasets, and this has significant implications for health care and health policy. We suggest adding 'year of arrival' to population datasets; enabling the combination of 'country of birth' and 'year of arrival' to be used as a proxy for refugee status.
Resumo:
This paper examines whether recent innovation in market design can address persistent problems of housing choice and affordability in the inner and middle suburbs of Australian cities. Australia's ageing middle suburbs are the result of a low density and highly car-dependent garden city greenfield approach to planning that failed to consider possible future resource or environmental constraints on urban development (Newton et al., 2011). Described as 'greyfield' sites in contrast to greenfield (signalling the change from rural to urban land use) and 'brownfield' (being the transformation of former industrial use to mixed use, including housing), intensification of development in such areas is expected to deliver positive social, economic and environmental outcomes (Trubka et al., 2008; Gurran et al., 2006; Newton et al., 2011; Goodman et al., 2010). Yet despite broad policy consensus progress remains elusive (Major Cities Unit, 2010). In this paper we argue that the application of market design theory, specifically through the internet-based coordination of market information, offers a new policy approach and practical measures to address these problems.
Resumo:
The Australian Research Council (ARC) Innovative Products Project aims to facilitate project innovation by exploring means to improve the diffusion of innovative products to road and bridge projects. It adopts a highly novel approach to achieve this end, developing three different ways of viewing the problem: (1) as a relational governance issue, (2) as an absorptive capacity issue and (3) as a knowlege intermediation issue. This report presents teh results of teh first phase of a three phase fieldwork program.
Resumo:
This paper presents an efficient face detection method suitable for real-time surveillance applications. Improved efficiency is achieved by constraining the search window of an AdaBoost face detector to pre-selected regions. Firstly, the proposed method takes a sparse grid of sample pixels from the image to reduce whole image scan time. A fusion of foreground segmentation and skin colour segmentation is then used to select candidate face regions. Finally, a classifier-based face detector is applied only to selected regions to verify the presence of a face (the Viola-Jones detector is used in this paper). The proposed system is evaluated using 640 x 480 pixels test images and compared with other relevant methods. Experimental results show that the proposed method reduces the detection time to 42 ms, where the Viola-Jones detector alone requires 565 ms (on a desktop processor). This improvement makes the face detector suitable for real-time applications. Furthermore, the proposed method requires 50% of the computation time of the best competing method, while reducing the false positive rate by 3.2% and maintaining the same hit rate.
Resumo:
Fishtown is a series of mediated animated works which embody artistic conceptions of ambience and explore the interplay between foreground and background. The series draws upon a representation of natural patterns and rhythms in the ambient environment and is produced using a hybrid style of animation process that incorporates motion capture, dynamics and keyframe animation to construct a biomemtic peripheral rhythm. The display of the work is a crucial part of the project, and contributes a considerable amount to the reception of the work. Based on the ambient conceptions defined by Cage, Eno and Bizzocchi, ambient animation should incorporate some form of ambient display. As Eno (1978) states, it should be as ignorable as it is interesting. The ultimate intention is to place the work outside the gallery setting, to provide a more neutral ambient setting for the viewing of the work, and therefore the use of an ambient display is necessary if the work is to be situated in an ambient setting. Craig Walsh is a contemporary artist producing work for large scale projections in ambient settings. Completing Walsh's masterclass in 2011 (Tanawha Arts and Ecology Centre) has been an important factor in arriving at a strategy for the display of the Fishtown series. The most recent work in the Fishtown series was developed during a residency at the Crane Arts studios in Philadelphia USA in August 2012, and is comprised of a screen based animated work, utilizing large scale digital projection. Documentation of this work can be found at the Crane Arts Residency Website: http://cranearts.qcagriffith.com/crane-arts-residency-chris-denaro
Resumo:
In this paper, we propose an approach which attempts to solve the problem of surveillance event detection, assuming that we know the definition of the events. To facilitate the discussion, we first define two concepts. The event of interest refers to the event that the user requests the system to detect; and the background activities are any other events in the video corpus. This is an unsolved problem due to many factors as listed below: 1) Occlusions and clustering: The surveillance scenes which are of significant interest at locations such as airports, railway stations, shopping centers are often crowded, where occlusions and clustering of people are frequently encountered. This significantly affects the feature extraction step, and for instance, trajectories generated by object tracking algorithms are usually not robust under such a situation. 2) The requirement for real time detection: The system should process the video fast enough in both of the feature extraction and the detection step to facilitate real time operation. 3) Massive size of the training data set: Suppose there is an event that lasts for 1 minute in a video with a frame rate of 25fps, the number of frames for this events is 60X25 = 1500. If we want to have a training data set with many positive instances of the event, the video is likely to be very large in size (i.e. hundreds of thousands of frames or more). How to handle such a large data set is a problem frequently encountered in this application. 4) Difficulty in separating the event of interest from background activities: The events of interest often co-exist with a set of background activities. Temporal groundtruth typically very ambiguous, as it does not distinguish the event of interest from a wide range of co-existing background activities. However, it is not practical to annotate the locations of the events in large amounts of video data. This problem becomes more serious in the detection of multi-agent interactions, since the location of these events can often not be constrained to within a bounding box. 5) Challenges in determining the temporal boundaries of the events: An event can occur at any arbitrary time with an arbitrary duration. The temporal segmentation of events is difficult and ambiguous, and also affected by other factors such as occlusions.
Resumo:
In this paper we focus on one facet of Asia literacy and examine the potential of intercultural understanding through two films about Asians in Australia, as the basis for exploring Asia and Australia’s engagement with Asia 'inside' and not through the more accepted mode of 'outside' the nation. In doing so we foreground how teachers’ critical and imaginative curriculum work can realise some of the promises of the framing document for the current national curriculum project, the Melbourne Declaration (MCEECDYA, 2008). In particular, we focus on opportunities for young people to develop an Asia-related cultural literacy that goes beyond instrumental notions of engagement with Asia and explore the evolving nature of contemporary Australian society; a society that continues to develop in response to regional flows and interactions with people and cultures. To this end we engage with the notion of “diasporic hybridity” as a dynamic cultural space through selected films and literature, about Asia in Australia, in particular, Bondi Tsunami (Lucas, 2004) and Footy Legends (Do, 2006) and selected prose works. Our paper introduces the policy background of the Australian Curriculum and suggests multimodal, English classroom applications for the films and literature under study.