919 resultados para optical character recognition system
Resumo:
Theoretical investigations have been carried out to analyze and compare the link power budget and power dissipation of non-return-to-zero (NRZ), pulse amplitude modulation-4 (PAM-4), carrierless amplitude and phase modulation-16 (CAP-16) and 16-quadrature amplitude modulation-orthogonal frequency division multiplexing (16-QAM-OFDM) systems for data center interconnect scenarios. It is shown that for multimode fiber (MMF) links, NRZ modulation schemes with electronic equalization offer the best link power budget margins with the least power dissipation for short transmission distances up to 200 m; while OOFDM is the only scheme which can support a distance of 300 m albeit with power dissipation as high as 4 times that of NRZ. For short single mode fiber (SMF) links, all the modulation schemes offer similar link power budget margins for fiber lengths up to 15 km, but NRZ and PAM-4 are preferable due to their system simplicity and low power consumption. For lengths of up to 30 km, CAP-16 and OOFDM are required although the schemes consume 2 and 4 times as much power respectively compared to that of NRZ. OOFDM alone allows link operation up to 35 km distances. © 1983-2012 IEEE.
Resumo:
Optical motion capture systems suffer from marker occlusions resulting in loss of useful information. This paper addresses the problem of real-time joint localisation of legged skeletons in the presence of such missing data. The data is assumed to be labelled 3d marker positions from a motion capture system. An integrated framework is presented which predicts the occluded marker positions using a Variable Turn Model within an Unscented Kalman filter. Inferred information from neighbouring markers is used as observation states; these constraints are efficient, simple, and real-time implementable. This work also takes advantage of the common case that missing markers are still visible to a single camera, by combining predictions with under-determined positions, resulting in more accurate predictions. An Inverse Kinematics technique is then applied ensuring that the bone lengths remain constant over time; the system can thereby maintain a continuous data-flow. The marker and Centre of Rotation (CoR) positions can be calculated with high accuracy even in cases where markers are occluded for a long period of time. Our methodology is tested against some of the most popular methods for marker prediction and the results confirm that our approach outperforms these methods in estimating both marker and CoR positions. © 2012 Springer-Verlag.
Resumo:
State-of-the-art large vocabulary continuous speech recognition (LVCSR) systems often combine outputs from multiple sub-systems that may even be developed at different sites. Cross system adaptation, in which model adaptation is performed using the outputs from another sub-system, can be used as an alternative to hypothesis level combination schemes such as ROVER. Normally cross adaptation is only performed on the acoustic models. However, there are many other levels in LVCSR systems' modelling hierarchy where complimentary features may be exploited, for example, the sub-word and the word level, to further improve cross adaptation based system combination. It is thus interesting to also cross adapt language models (LMs) to capture these additional useful features. In this paper cross adaptation is applied to three forms of language models, a multi-level LM that models both syllable and word sequences, a word level neural network LM, and the linear combination of the two. Significant error rate reductions of 4.0-7.1% relative were obtained over ROVER and acoustic model only cross adaptation when combining a range of Chinese LVCSR sub-systems used in the 2010 and 2011 DARPA GALE evaluations. © 2012 Elsevier Ltd. All rights reserved.
Resumo:
This paper proposed a non-intrusive method of measuring the optical beam profile at the surface of the liquid crystal on silicon (LCOS) device in an optical fiber switch. This method is based on blazed grating and can be employed in situ (on-line) for two-dimensional beam profiling in the LCOS-based optical fiber switches without introducing additional components or rearranging the system. The measured beam radius was in excellent agreement with that measured by the knife-edge technique. © 2013 Elsevier Ltd.
Resumo:
We demonstrate an uncooled WDM system using standard WDM components and receiver signal processing, with a different number of receivers to transmitters, to allow wide temperature drift of the transmitter lasers. A 100 Gb/s 8-wavelength demonstrator has been developed, which proves the feasibility of the approach over 25 km of SMF. © 2012 OSA.
Resumo:
For the first time, simulations have analysed the feasibility of 100Gb/s CAP and OFDM systems over SMF links using 18.6GHz directly modulated lasers. We have shown that CAP-16/16- QAM-OFDM and CAP-64/64-QAM-OFDM over a single channel can successfully support transmission over 2km SMF, with power dissipation of ∼2 times that of a 4×25Gb/s NRZ system. © 2012 OSA.
Resumo:
Optical interconnects are increasingly considered for use in high-performance electronic systems. Multimode polymer waveguides are a promising technology for the formation of optical backplanes as they enable cost-effective integration of optical links onto standard printed circuit boards. In this paper, we present a 40 Gb/s optical backplane demonstrator based on the use of polymer multimode waveguides and a regenerative shared bus architecture. The system allows bus extension by cascading multiple polymeric bus modules through 3R regenerator units enabling the connection of an arbitrary number of electrical cards onto the bus. The proof-ofprinciple demonstrator reported here is formed with low-cost, commercially-available active devices and electronic components mounted on conventional FR4 substrates and achieves error-free 4×10 Gb/s optical interconnection between any two card interfaces on the bus. © 2013 IEEE.
Resumo:
The tunable liquid crystal (LC) lens designed for a holographic projection system is demonstrated. By using a single patterned electrode LC lens, a solid lens and an encoded Fresnel lens on the LCoS panel, we can maintain the image size of the holographic projector with different wavelengths (λ:674nm, 532nm and 445nm) . The zoom ratio of the holographic projection system depends on the lens power of the solid lens and the tunable lens power of the LC lens. The optical zoom function can help to solve the image size mismatching problem of the holographic projection system. © 2013 SPIE.
Resumo:
We demonstrate a new type of transistors, the electrical/optical "dual-function redox-potential transistors", which is solution processable and environmentally stable. This device consists of vertically staked electrodes that act as gate, emitter and collector. It can perform as a normal transistor, whilst one electrode which is sensitised by dye enables to generate photocurrent when illuminated. Solution processable oxide-nanoparticles were used to form various functional layers, which allow an electrolyte to penetrate through and, consequently, the current between emitter and collector can be controlled by the gate potential modulated distribution of ions. The result here shows that the device performs with high ON-current under low driving voltage (<1â€...V), while the transistor performance can readily be controlled by photo-illumination. Such device with combined optical and electrical functionalities allows single device to perform the tasks that are usually done by a circuit/system with multiple optical and electrical components, and it is promising for various applications.
Resumo:
We present a system for keyword search on Cantonese conversational telephony audio, collected for the IARPA Babel program, that achieves good performance by combining postings lists produced by diverse speech recognition systems from three different research groups. We describe the keyword search task, the data on which the work was done, four different speech recognition systems, and our approach to system combination for keyword search. We show that the combination of four systems outperforms the best single system by 7%, achieving an actual term-weighted value of 0.517. © 2013 IEEE.
Resumo:
Adaptation to speaker and environment changes is an essential part of current automatic speech recognition (ASR) systems. In recent years the use of multi-layer percpetrons (MLPs) has become increasingly common in ASR systems. A standard approach to handling speaker differences when using MLPs is to apply a global speaker-specific constrained MLLR (CMLLR) transform to the features prior to training or using the MLP. This paper considers the situation when there are both speaker and channel, communication link, differences in the data. A more powerful transform, front-end CMLLR (FE-CMLLR), is applied to the inputs to the MLP to represent the channel differences. Though global, these FE-CMLLR transforms vary from time-instance to time-instance. Experiments on a channel distorted dialect Arabic conversational speech recognition task indicates the usefulness of adapting MLP features using both CMLLR and FE-CMLLR transforms. © 2013 IEEE.
Resumo:
This paper presents a novel method of using experimentally observed optical phenomena to reverse-engineer a model of the carbon nanofiber-addressed liquid crystal microlens array (C-MLA) using Zemax. It presents the first images of the optical profile for the C-MLA along the optic axis. The first working optical models of the C-MLA have been developed by matching the simulation results to the experimental results. This approach bypasses the need to know the exact carbon nanofiber-liquid crystal interaction and can be easily adapted to other systems where the nature of an optical device is unknown. Results show that the C-MLA behaves like a simple lensing system at 0.060-0.276 V/μm. In this lensing mode the C-MLA is successfully modeled as a reflective convex lens array intersecting with a flat reflective plane. The C-MLA at these field strengths exhibits characteristics of mostly spherical or low order aspheric arrays, with some aspects of high power aspherics. It also exhibits properties associated with varying lens apertures and strengths, which concur with previously theorized models based on E-field patterns. This work uniquely provides evidence demonstrating an apparent "rippling" of the liquid crystal texture at low field strengths, which were successfully reproduced using rippled Gaussian-like lens profiles. © 2014 Published by Elsevier B.V.
Resumo:
We demonstrate an uncooled WDM system using standard WDM components and receiver signal processing, with a different number of receivers to transmitters, to allow wide temperature drift of the transmitter lasers. A 100 Gb/s 8-wavelength demonstrator has been developed, which proves the feasibility of the approach over 25 km of SMF. © 2012 Optical Society of America.
Resumo:
The first multi-channel optical backplane demonstrator using on-board multimode polymer waveguides and a scalable shared-bus regenerative architecture is reported. The system allows bus extension by cascading multiple polymeric bus modules, and enables error-free 4×10 Gb/s interconnection between any two card interfaces on the bus.
Resumo:
Large margin criteria and discriminative models are two effective improvements for HMM-based speech recognition. This paper proposed a large margin trained log linear model with kernels for CSR. To avoid explicitly computing in the high dimensional feature space and to achieve the nonlinear decision boundaries, a kernel based training and decoding framework is proposed in this work. To make the system robust to noise a kernel adaptation scheme is also presented. Previous work in this area is extended in two directions. First, most kernels for CSR focus on measuring the similarity between two observation sequences. The proposed joint kernels defined a similarity between two observation-label sequence pairs on the sentence level. Second, this paper addresses how to efficiently employ kernels in large margin training and decoding with lattices. To the best of our knowledge, this is the first attempt at using large margin kernel-based log linear models for CSR. The model is evaluated on a noise corrupted continuous digit task: AURORA 2.0. © 2013 IEEE.