919 resultados para optical character recognition system


Relevância:

30.00% 30.00%

Publicador:

Resumo:

Emergence has the potential to effect complex, creative or open-ended interactions and novel game-play. We report on research into an emergent interactive system. This investigates emergent user behaviors and experience through the creation and evaluation of an interactive system. The system is +-NOW, an augmented reality, tangible, interactive art system. The paper briefly describes the qualities of emergence and +-NOW before focusing on its evaluation. This was a qualitative study with 30 participants conducted in context. Data analysis followed Grounded Theory Methods. Coding schemes, induced from data and external literature are presented. Findings show that emergence occurred in over half of the participants. The nature of these emergent behaviors is discussed along with examples from the data. Other findings indicate that participants found interaction with the work satisfactory. Design strategies for facilitating satisfactory experience despite the often unpredictable character of emergence, are briefly reviewed and potential application areas for emergence are discussed.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Emergence is discussed in the context of a practice-based study of interactive art and a new taxonomy of emergence is proposed. The interactive art system ‘plus minus now’ is described and its relationship to emergence is discussed. ‘Plus minus now’ uses a novel method for instantiating emergent shapes. A preliminary investigation of this art system has been conducted and reveals the creation of temporal compositions by a participant. These temporal compositions and the emergent shapes are described using the taxonomy of emergence. Characteristics of emergent interactions and the implications of designing for them are discussed.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

This paper investigates the use of mel-frequency deltaphase (MFDP) features in comparison to, and in fusion with, traditional mel-frequency cepstral coefficient (MFCC) features within joint factor analysis (JFA) speaker verification. MFCC features, commonly used in speaker recognition systems, are derived purely from the magnitude spectrum, with the phase spectrum completely discarded. In this paper, we investigate if features derived from the phase spectrum can provide additional speaker discriminant information to the traditional MFCC approach in a JFA based speaker verification system. Results are presented which provide a comparison of MFCC-only, MFDPonly and score fusion of the two approaches within a JFA speaker verification approach. Based upon the results presented using the NIST 2008 Speaker Recognition Evaluation (SRE) dataset, we believe that, while MFDP features alone cannot compete with MFCC features, MFDP can provide complementary information that result in improved speaker verification performance when both approaches are combined in score fusion, particularly in the case of shorter utterances.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Waste management and minimisation is considered to be an important issue for achieving sustainability in the construction industry. Retrofit projects generate less waste than demolitions and new builds, but they possess unique features and require waste management approaches that are different to traditional new builds. With the increasing demand for more energy efficient and environmentally sustainable office spaces, the office building retrofit market is growing in capital cities around Australia with a high level of refurbishment needed for existing aging properties. Restricted site space and uncertain delivery process in these projects make it a major challenge to manage waste effectively. The labour-intensive nature of retrofit projects creates the need for the involvement of small and medium enterprises (SMEs) as subcontractors in on-site works. SMEs are familiar with on-site waste generation but are not as actively motivated and engaged in waste management activities as the stakeholders in other construction projects in the industry. SMEs’ responsibilities for waste management in office building retrofit projects need to be identified and adapted to the work delivery processes and the waste management system supported by project stakeholders. The existing literature provides an understanding of how to manage construction waste that is already generated and how to increase the waste recovery rate for office building retrofit projects. However, previous research has not developed theories or practical solutions that can guide project stakeholders to understand the specific waste generation process and effectively plan for and manage waste in ongoing project works. No appropriate method has been established for the potential role and capability of SMEs to manage and minimise waste from their subcontracting works. This research probes into the characteristics of office building retrofit project delivery with the aim to develop specific tools to manage waste and incorporate SMEs in this process in an appropriate and effective way. Based on an extensive literature review, the research firstly developed a questionnaire survey to identify the critical factors of on-site waste generation in office building retrofit projects. Semi-structured interviews were then utilised to validate the critical waste factors and establish the interrelationships between the factors. The interviews served another important function of identifying the current problems of waste management in the industry and the performance of SMEs in this area. Interviewees’ opinions on remedies to the problems were also collected. On the foundation of the findings from the questionnaire survey and semi-structured interviews, two waste planning and management strategies were identified for the dismantling phase and fit-out phase of office building retrofit projects, respectively. Two models were then established to organize SMEs’ waste management activities, including a work process-based integrated waste planning model for the dismantling phase and a system dynamics model for the fit-out phase. In order to apply the models in real practice, procedures were developed to guide SMEs’ work flow in on-site waste planning and management. In addition, a collaboration framework was established for SMEs and other project stakeholders for effective waste planning and management. Furthermore, an organisational engagement strategy was developed to improve SME waste management practices. Three case studies were conducted to validate and finalise the research deliverables. This research extends the current literature that mostly covers waste management plans in new build projects, by presenting the knowledge and understanding of addressing waste problems in retrofit projects. It provides practical tools and guidance for industry practitioners to effectively manage the waste generation processes in office building retrofit projects. It can also promote industry-level recognition of the role of SMEs and their performance in on-site waste management.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

An optical system which performs the multiplication of binary numbers is described and proof-of-principle experiments are performed. The simultaneous generation of all partial products, optical regrouping of bit products, and optical carry look-ahead addition are novel features of the proposed scheme which takes advantage of the parallel operations capability of optical computers. The proposed processor uses liquid crystal light valves (LCLVs). By space-sharing the LCLVs one such system could function as an array of multipliers. Together with the optical carry look-ahead adders described, this would constitute an optical matrix-vector multiplier.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

This paper investigates advanced channel compensation techniques for the purpose of improving i-vector speaker verification performance in the presence of high intersession variability using the NIST 2008 and 2010 SRE corpora. The performance of four channel compensation techniques: (a) weighted maximum margin criterion (WMMC), (b) source-normalized WMMC (SN-WMMC), (c) weighted linear discriminant analysis (WLDA), and; (d) source-normalized WLDA (SN-WLDA) have been investigated. We show that, by extracting the discriminatory information between pairs of speakers as well as capturing the source variation information in the development i-vector space, the SN-WLDA based cosine similarity scoring (CSS) i-vector system is shown to provide over 20% improvement in EER for NIST 2008 interview and microphone verification and over 10% improvement in EER for NIST 2008 telephone verification, when compared to SN-LDA based CSS i-vector system. Further, score-level fusion techniques are analyzed to combine the best channel compensation approaches, to provide over 8% improvement in DCF over the best single approach, (SN-WLDA), for NIST 2008 interview/ telephone enrolment-verification condition. Finally, we demonstrate that the improvements found in the context of CSS also generalize to state-of-the-art GPLDA with up to 14% relative improvement in EER for NIST SRE 2010 interview and microphone verification and over 7% relative improvement in EER for NIST SRE 2010 telephone verification.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Robust hashing is an emerging field that can be used to hash certain data types in applications unsuitable for traditional cryptographic hashing methods. Traditional hashing functions have been used extensively for data/message integrity, data/message authentication, efficient file identification and password verification. These applications are possible because the hashing process is compressive, allowing for efficient comparisons in the hash domain but non-invertible meaning hashes can be used without revealing the original data. These techniques were developed with deterministic (non-changing) inputs such as files and passwords. For such data types a 1-bit or one character change can be significant, as a result the hashing process is sensitive to any change in the input. Unfortunately, there are certain applications where input data are not perfectly deterministic and minor changes cannot be avoided. Digital images and biometric features are two types of data where such changes exist but do not alter the meaning or appearance of the input. For such data types cryptographic hash functions cannot be usefully applied. In light of this, robust hashing has been developed as an alternative to cryptographic hashing and is designed to be robust to minor changes in the input. Although similar in name, robust hashing is fundamentally different from cryptographic hashing. Current robust hashing techniques are not based on cryptographic methods, but instead on pattern recognition techniques. Modern robust hashing algorithms consist of feature extraction followed by a randomization stage that introduces non-invertibility and compression, followed by quantization and binary encoding to produce a binary hash output. In order to preserve robustness of the extracted features, most randomization methods are linear and this is detrimental to the security aspects required of hash functions. Furthermore, the quantization and encoding stages used to binarize real-valued features requires the learning of appropriate quantization thresholds. How these thresholds are learnt has an important effect on hashing accuracy and the mere presence of such thresholds are a source of information leakage that can reduce hashing security. This dissertation outlines a systematic investigation of the quantization and encoding stages of robust hash functions. While existing literature has focused on the importance of quantization scheme, this research is the first to emphasise the importance of the quantizer training on both hashing accuracy and hashing security. The quantizer training process is presented in a statistical framework which allows a theoretical analysis of the effects of quantizer training on hashing performance. This is experimentally verified using a number of baseline robust image hashing algorithms over a large database of real world images. This dissertation also proposes a new randomization method for robust image hashing based on Higher Order Spectra (HOS) and Radon projections. The method is non-linear and this is an essential requirement for non-invertibility. The method is also designed to produce features more suited for quantization and encoding. The system can operate without the need for quantizer training, is more easily encoded and displays improved hashing performance when compared to existing robust image hashing algorithms. The dissertation also shows how the HOS method can be adapted to work with biometric features obtained from 2D and 3D face images.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Experimentally, hydrogen-free diamond-like carbon (DLC) films were assembled by means of pulsed laser deposition (PLD), where energetic small-carbon-clusters were deposited on the substrate. In this paper, the chemisorption of energetic C2 and C10 clusters on diamond (001)-( 2×1) surface was investigated by molecular dynamics simulation. The influence of cluster size and the impact energy on the structure character of the deposited clusters is mainly addressed. The impact energy was varied from a few tens eV to 100 eV. The chemisorption of C10 was found to occur only when its incident energy is above a threshold value ( E th). While, the C2 cluster was easily to adsorb on the surface even at much lower incident energy. With increasing the impact energy, the structures of the deposited C2 and C10 are different from the free clusters. Finally, the growth of films synthesized by energetic C2 and C10 clusters were simulated. The statistics indicate the C2 cluster has high probability of adsorption and films assembled of C2 present slightly higher SP3 fraction than that of C10-films, especially at higher impact energy and lower substrate temperature. Our result supports the experimental findings. Moreover, the simulation underlines the deposition mechanism at atomic scale.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

A significant amount of speech is typically required for speaker verification system development and evaluation, especially in the presence of large intersession variability. This paper introduces a source and utterance duration normalized linear discriminant analysis (SUN-LDA) approaches to compensate session variability in short-utterance i-vector speaker verification systems. Two variations of SUN-LDA are proposed where normalization techniques are used to capture source variation from both short and full-length development i-vectors, one based upon pooling (SUN-LDA-pooled) and the other on concatenation (SUN-LDA-concat) across the duration and source-dependent session variation. Both the SUN-LDA-pooled and SUN-LDA-concat techniques are shown to provide improvement over traditional LDA on NIST 08 truncated 10sec-10sec evaluation conditions, with the highest improvement obtained with the SUN-LDA-concat technique achieving a relative improvement of 8% in EER for mis-matched conditions and over 3% for matched conditions over traditional LDA approaches.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Near work may play an important role in the development of myopia in the younger population. The prevalence of myopia has also been found to be higher in occupations that involve substantial near work tasks, for example in microscopists and textile workers. When nearwork is performed, it typically involves accommodation, convergence and downward gaze. A number of previous studies have examined the effects of accommodation and convergence on changes in the optics and biometrics of the eye in primary gaze. However, little is known about the influence of accommodation on the eye in downward gaze. This thesis is primarily concerned with investigating the changes in the eye during near work in downward gaze under natural viewing conditions. To measure wavefront aberrations in downward gaze under natural viewing conditions, we modified a commercial Shack-Hartmann wavefront sensor by adding a relay lens system to allow on-axis ocular aberration measurements in primary gaze and downward gaze, with binocular fixation. Measurements with the modified wavefront sensor in primary and downward gaze were validated against a conventional aberrometer using both a model eye and in 9 human subjects. We then conducted an experiment to investigate changes in ocular aberrations associated with accommodation in downward gaze over 10 mins in groups of both myopes (n = 14) and emmetropes (n =12) using the modified Shack-Hartmann wavefront sensor. During the distance accommodation task, small but significant changes in refractive power (myopic shift) and higher order aberrations were observed in downward gaze compared to primary gaze. Accommodation caused greater changes in higher order aberrations (in particular coma and spherical aberration) in downward gaze than primary gaze, and there was evidence that the changes in certain aberrations with accommodation over time were different in downward gaze compared to primary gaze. There were no obvious systematic differences in higher order aberrations between refractive error groups during accommodation or downward gaze for fixed pupils. However, myopes exhibited a significantly greater change in higher order aberrations (in particular spherical aberration) than emmetropes for natural pupils after 10 mins of a near task (5 D accommodation) in downward gaze. These findings indicated that ocular aberrations change from primary to downward gaze, particularly with accommodation. To understand the mechanism underlying these changes in greater detail, we then extended this work to examine the characteristics of the corneal optics, internal optics, anterior biometrics and axial length of the eye during a near task, in downward gaze, over 10 mins. Twenty young adult subjects (10 emmetropes and 10 myopes) participated in this study. To measure corneal topography and ocular biometrics in downward gaze, a rotating Scheimpflug camera and an optical biometer were inclined on a custom built, height and tilt adjustable table. We found that both corneal optics and internal optics change with downward gaze, resulting in a myopic shift (~0.10 D) in the spherical power of the eye. The changes in corneal optics appear to be due to eyelid pressure on the anterior surface of the cornea, whereas the changes in the internal optics (an increase in axial length and a decrease in anterior chamber depth) may be associated with movement of the crystalline lens, under the action of gravity, and the influence of altered biomechanical forces from the extraocular muscles on the globe with downward gaze. Changes in axial length with accommodation were significantly greater in downward gaze than primary gaze (p < 0.05), indicating an increased effect of the mechanical forces from the ciliary muscle and extraocular muscles. A subsequent study was conducted to investigate the changes in anterior biometrics, axial length and choroidal thickness in nine cardinal gaze directions under the actions of the extraocular muscles. Ocular biometry measurements were obtained from 30 young adults (10 emmetropes, 10 low myopes and 10 moderate myopes) through a rotating prism with 15° deviation, along the foveal axis, using a non-contact optical biometer in each of nine different cardinal directions of gaze, over 5 mins. There was a significant influence of gaze angle and time on axial length (both p < 0.001), with the greatest axial elongation (+18 ± 8 μm) occurring with infero-nasal gaze (p < 0.001) and a slight decrease in axial length in superior gaze (−12 ± 17 μm) compared with primary gaze (p < 0.001). There was a significant correlation between refractive error (spherical equivalent refraction) and the mean change in axial length in the infero-nasal gaze direction (Pearson's R2 = 0.71, p < 0.001). To further investigate the relative effect of gravity and extraocular muscle force on the axial length, we measured axial length in 15° and 25° downward gaze with the biometer inclined on a tilting table that allowed gaze shifts to occur with either full head turn but no eye turn (reflects the effect of gravity), or full eye turn with no head turn (reflects the effect of extraocular muscle forces). We observed a significant axial elongation in 15° and 25° downward gaze in the full eye turn condition. However, axial length did not change significantly in downward gaze over 5 mins (p > 0.05) in the full head turn condition. The elongation of the axial length in downward gaze appears to be due to the influence of the extraocular muscles, since the effect was not present when head turn was used instead of eye turn. The findings of these experiments collectively show the dynamic characteristics of the optics and biometrics of the eye in downward gaze during a near task, over time. These were small but significant differences between myopic and emmetropic eyes in both the optical and biomechanical changes associated with shifts of gaze direction. These differences between myopes and emmetropes could arise as a consequence of excessive eye growth associated with myopia. However the potentially additive effects of repeated or long lasting near work activities employing infero-nasal gaze could also act to promote elongation of the eye due to optical and/or biomechanical stimuli.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

A people-to-people matching system (or a match-making system) refers to a system in which users join with the objective of meeting other users with the common need. Some real-world examples of these systems are employer-employee (in job search networks), mentor-student (in university social networks), consume-to-consumer (in marketplaces) and male-female (in an online dating network). The network underlying in these systems consists of two groups of users, and the relationships between users need to be captured for developing an efficient match-making system. Most of the existing studies utilize information either about each of the users in isolation or their interaction separately, and develop recommender systems using the one form of information only. It is imperative to understand the linkages among the users in the network and use them in developing a match-making system. This study utilizes several social network analysis methods such as graph theory, small world phenomenon, centrality analysis, density analysis to gain insight into the entities and their relationships present in this network. This paper also proposes a new type of graph called “attributed bipartite graph”. By using these analyses and the proposed type of graph, an efficient hybrid recommender system is developed which generates recommendation for new users as well as shows improvement in accuracy over the baseline methods.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Raven and Song Scope are two automated sound anal-ysis tools based on machine learning technique for en-vironmental monitoring. Many research works have been conducted upon them, however, no or rare explo-ration mentions about the performance and comparison between them. This paper investigates the comparisons from six aspects: theory, software interface, ease of use, detection targets, detection accuracy, and potential application. Through deep exploration one critical gap is identified that there is a lack of approach to detect both syllables and call structures, since Raven only aims to detect syllables while Song Scope targets call structures. Therefore, a Timed Probabilistic Automata (TPA) system is proposed which separates syllables first and clusters them into complex structures after.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

A fundamental part of many authentication protocols which authenticate a party to a human involves the human recognizing or otherwise processing a message received from the party. Examples include typical implementations of Verified by Visa in which a message, previously stored by the human at a bank, is sent by the bank to the human to authenticate the bank to the human; or the expectation that humans will recognize or verify an extended validation certificate in a HTTPS context. This paper presents general definitions and building blocks for the modelling and analysis of human recognition in authentication protocols, allowing the creation of proofs for protocols which include humans. We cover both generalized trawling and human-specific targeted attacks. As examples of the range of uses of our construction, we use the model presented in this paper to prove the security of a mutual authentication login protocol and a human-assisted device pairing protocol.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

In this paper we present a method for autonomously tuning the threshold between learning and recognizing a place in the world, based on both how the rodent brain is thought to process and calibrate multisensory data and the pivoting movement behaviour that rodents perform in doing so. The approach makes no assumptions about the number and type of sensors, the robot platform, or the environment, relying only on the ability of a robot to perform two revolutions on the spot. In addition, it self-assesses the quality of the tuning process in order to identify situations in which tuning may have failed. We demonstrate the autonomous movement-driven threshold tuning on a Pioneer 3DX robot in eight locations spread over an office environment and a building car park, and then evaluate the mapping capability of the system on journeys through these environments. The system is able to pick a place recognition threshold that enables successful environment mapping in six of the eight locations while also autonomously flagging the tuning failure in the remaining two locations. We discuss how the method, in combination with parallel work on autonomous weighting of individual sensors, moves the parameter dependent RatSLAM system significantly closer to sensor, platform and environment agnostic operation.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Whole-image descriptors such as GIST have been used successfully for persistent place recognition when combined with temporal filtering or sequential filtering techniques. However, whole-image descriptor localization systems often apply a heuristic rather than a probabilistic approach to place recognition, requiring substantial environmental-specific tuning prior to deployment. In this paper we present a novel online solution that uses statistical approaches to calculate place recognition likelihoods for whole-image descriptors, without requiring either environmental tuning or pre-training. Using a real world benchmark dataset, we show that this method creates distributions appropriate to a specific environment in an online manner. Our method performs comparably to FAB-MAP in raw place recognition performance, and integrates into a state of the art probabilistic mapping system to provide superior performance to whole-image methods that are not based on true probability distributions. The method provides a principled means for combining the powerful change-invariant properties of whole-image descriptors with probabilistic back-end mapping systems without the need for prior training or system tuning.