977 results for multi band
Abstract:
This paper describes a semi-automatic tool for annotation of multi-script text from natural scene images. To our knowledge, this is the first tool that deals with multi-script text or arbitrary orientation. The procedure involves manual seed selection followed by a region growing process to segment each word present in the image. The threshold for region growing can be varied by the user so as to ensure pixel-accurate character segmentation. The text present in the image is tagged word-by-word. A virtual keyboard interface has also been designed for entering the ground truth in ten Indic scripts, besides English. The keyboard interface can easily be generated for any script, thereby expanding the scope of the toolkit. Optionally, each segmented word can further be labeled into its constituent characters/symbols. Polygonal masks are used to split or merge the segmented words into valid characters/symbols. The ground truth is represented by a pixel-level segmented image and a '.txt' file that contains information about the number of words in the image, word bounding boxes, script and ground truth Unicode. The toolkit, developed using MATLAB, can be used to generate ground truth and annotation for any generic document image. Thus, it is useful for researchers in the document image processing community for evaluating the performance of document analysis and recognition techniques. The multi-script annotation toolkit (MAST) is available for free download.
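The seed-plus-threshold idea in this abstract can be illustrated with a minimal sketch. It is not the toolkit's MATLAB implementation: the abstract does not state the growing criterion, so the sketch assumes a grayscale image, 4-connectivity, and acceptance of a pixel when its intensity differs from the seed's by at most `threshold`. The function name `region_grow` is hypothetical.

```python
from collections import deque

import numpy as np


def region_grow(image, seed, threshold):
    """Grow a region from `seed` (row, col) over 4-connected neighbours.

    Assumption: a pixel joins the region when its intensity differs from
    the seed intensity by at most `threshold`.
    """
    image = np.asarray(image, dtype=float)
    rows, cols = image.shape
    mask = np.zeros((rows, cols), dtype=bool)
    seed_value = image[seed]
    mask[seed] = True
    queue = deque([seed])
    while queue:
        r, c = queue.popleft()
        # Visit the four direct neighbours of the current pixel.
        for dr, dc in ((-1, 0), (1, 0), (0, -1), (0, 1)):
            nr, nc = r + dr, c + dc
            if 0 <= nr < rows and 0 <= nc < cols and not mask[nr, nc]:
                if abs(image[nr, nc] - seed_value) <= threshold:
                    mask[nr, nc] = True
                    queue.append((nr, nc))
    return mask


# Toy image: dark "stroke" pixels (10) on a bright background (200).
toy = [[10, 10, 200],
       [10, 200, 200],
       [10, 10, 10]]
stroke = region_grow(toy, seed=(0, 0), threshold=20)
```

Raising or lowering `threshold` changes how far the region spreads, which is the user-adjustable behaviour the abstract describes.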
Abstract:
This paper describes a new method of color text localization from generic scene images containing text of different scripts and with arbitrary orientations. A representative set of colors is first identified using the edge information to initiate an unsupervised clustering algorithm. Text components are identified from each color layer using a combination of a support vector machine and a neural network classifier trained on a set of low-level features derived from the geometric, boundary, stroke and gradient information. Experiments on camera-captured images that contain variable fonts, size, color, irregular layout, non-uniform illumination and multiple scripts illustrate the robustness of the method. The proposed method yields precision and recall of 0.8 and 0.86 respectively on a database of 100 images. The method is also compared with others in the literature using the ICDAR 2003 robust reading competition dataset.
Abstract:
A new multi-sensor image registration technique is proposed based on detecting the feature corner points using a modified Harris Corner Detector (HCD). These feature points are matched using multi-objective optimization (distance condition and angle criterion) based on Discrete Particle Swarm Optimization (DPSO). This optimization process is more efficient as it considers both the distance and angle criteria to incorporate multi-objective switching in the fitness function. It helps in picking three corresponding corner points detected in the sensed and base images; the sensed image is then aligned with the base image using an affine transformation. Further, the results show that the new approach can provide a new dimension in solving multi-sensor image registration problems. From the obtained results, the performance of image registration is evaluated and it is concluded that the proposed approach is efficient.
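The final alignment step relies on the fact that three non-collinear point correspondences determine a 2D affine transformation. The sketch below shows only that step, assuming the three matched corners are already known; it does not reproduce the corner detection or the DPSO matching. The function names `affine_from_three_points` and `apply_affine` are hypothetical.

```python
import numpy as np


def affine_from_three_points(src, dst):
    """Return the 2x3 affine matrix M mapping three src points onto dst.

    Assumption: the three source points are not collinear, so the
    3x3 linear system has a unique solution.
    """
    src = np.asarray(src, dtype=float)
    dst = np.asarray(dst, dtype=float)
    # Homogeneous source coordinates: each row is [x, y, 1].
    A = np.hstack([src, np.ones((3, 1))])
    # Solve A @ X = dst for the 3x2 matrix X, then transpose to 2x3.
    return np.linalg.solve(A, dst).T


def apply_affine(M, pts):
    """Apply the 2x3 affine matrix M to an array of (x, y) points."""
    pts = np.asarray(pts, dtype=float)
    return pts @ M[:, :2].T + M[:, 2]


# Toy example: the sensed image is the base image shifted by (2, 3).
sensed_pts = [(0, 0), (1, 0), (0, 1)]
base_pts = [(2, 3), (3, 3), (2, 4)]
M = affine_from_three_points(sensed_pts, base_pts)
mapped = apply_affine(M, [(1, 1)])
```

With more than three matches, a least-squares fit (for example `np.linalg.lstsq`) would be the usual alternative; the abstract itself mentions only three points.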
Abstract:
The nontrivial electronic topology of a topological insulator is thus far known to display signatures in a robust metallic state at the surface. Here, we establish vibrational anomalies in Raman spectra of the bulk that signify changes in electronic topology: an E_g^2 phonon softens unusually and its linewidth exhibits an asymmetric peak at the pressure-induced electronic topological transition (ETT) in Sb2Se3 crystal. Our first-principles calculations confirm the electronic transition from band to topological insulating state with reversal of parity of electronic bands passing through a metallic state at the ETT, but do not capture the phonon anomalies, which involve breakdown of the adiabatic approximation due to strongly coupled dynamics of phonons and electrons. Treating this within a four-band model of topological insulators, we elucidate how nonadiabatic renormalization of phonons constitutes readily measurable bulk signatures of an ETT, which will facilitate efforts to develop topological insulators by modifying a band insulator. DOI: 10.1103/PhysRevLett.110.107401
Abstract:
Supramolecular chemistry is an emerging tool for devising materials that can perform specified functions. The self-assembly of facially amphiphilic bile acid molecules has been extensively utilized for the development of functional soft materials. Supramolecular hydrogels derived from the bile acid backbone act as useful templates for the intercalation of multiple components. Based on this, synthesis of gel-nanoparticle hybrid materials, photoluminescent coating materials, development of a new enzyme assay technique, etc. were achieved in the author's laboratory. The present account highlights some of these achievements.
Abstract:
The role of the computer has evolved from modeling and analyzing concepts (ideas) to generating concepts. Research into methods for supporting conceptual design using automated synthesis has attracted much attention in the past decades. To find out how designers synthesize solution concepts for multi-state mechanical devices, ten experimental studies were conducted. Observations from these empirical studies would be used as the basis to develop knowledge involved in the multi-state design synthesis process. In this paper, we propose a computational representation for expressing the multi-state design task and for enumerating multi-state behaviors of kinematic pairs and mechanisms. This computational representation would be used to formulate computational methods for the synthesis process to develop a system for supporting design synthesis of multiple state mechanical devices by generating a comprehensive variety of solution alternatives.
Abstract:
In this paper, we address a physics-based closed-form model for the energy band gap (E_g) and the transport electron effective mass in relaxed and strained [100] and [110] oriented rectangular Silicon Nanowire (SiNW). Our proposed analytical model along the [100] and [110] directions is based on the k.p formalism of the conduction band energy dispersion relation through an appropriate rotation of the Hamiltonian of the electrons in the bulk crystal along the [001] direction, followed by the inclusion of a 4 x 4 Luttinger Hamiltonian for the description of the valence band structure. Using this, we demonstrate the variation in E_g and the transport electron effective mass as a function of the cross-sectional dimensions in a relaxed [100] and [110] oriented SiNW. The behaviour of these two parameters in [100] oriented SiNW has further been studied with the inclusion of a uniaxial strain along the transport direction and a biaxial strain, which is assumed to be decomposed from a hydrostatic deformation along [001] with the former one. In addition, the energy band gap and the effective mass of a strained [110] oriented SiNW have also been formulated. We compare our analytical model with data extracted from simulations based on the nearest-neighbour empirical tight-binding sp3d5s* method and find that it agrees well over a wide range of device dimensions and applied strain. © 2012 Elsevier Ltd. All rights reserved.
Abstract:
Bulk texture measurement of multi-axially forged body-centered cubic interstitial-free steel, performed in this study using x-ray and neutron diffraction, indicated the presence of a strong {101}⟨111⟩ single texture component. Viscoplastic self-consistent simulations could successfully predict the formation of this texture component by incorporating the complicated strain path followed during this process and assuming the activity of the {101}⟨111⟩ slip system. In addition, a first-order estimate of mechanical properties in terms of highly anisotropic yield locus and Lankford parameter was also obtained from the simulations.
Abstract:
We investigate the direct band-to-band tunneling (BTBT) in a reverse-biased molybdenum disulfide (MoS2) nanoribbon p-n junction by analyzing the complex band structure obtained from the semiempirical extended Hückel method under relaxed and strained conditions. It is demonstrated that direct BTBT is improbable in a relaxed monolayer nanoribbon; however, with the application of a certain uniaxial tensile strain, the material becomes favorable for it. On the other hand, the relaxed bilayer nanoribbon is suitable for direct BTBT but becomes unfavorable when the applied uniaxial tensile or compressive strain goes beyond a certain limit. Using the Wentzel-Kramers-Brillouin approximation, we evaluate the tunneling probability to estimate the tunneling current for a small applied reverse bias. The reasonably high tunneling current in the MoS2 nanoribbons suggests that they may offer an advantage over graphene nanoribbons in future tunnel field-effect transistor applications.
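For orientation, the textbook form of the Wentzel-Kramers-Brillouin tunneling probability is given below. The abstract does not state the exact expression the authors used, so the symbols are notation assumed here: κ(x) is the magnitude of the imaginary wavevector inside the band gap (the quantity a complex band structure provides), and x_1, x_2 are the classical turning points bounding the tunneling region.

```latex
T_{\mathrm{WKB}} \approx \exp\!\left( -2 \int_{x_1}^{x_2} \kappa(x)\, \mathrm{d}x \right)
```

A smaller κ across the gap or a shorter tunneling path gives a larger probability, which is why strain-induced changes to the complex band structure matter for the tunneling current.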
Abstract:
In this paper, we propose a new sub-band approach to estimate the glottal activity. The method is based on the spectral harmonicity and the sub-band temporal properties of voiced speech. We propose a method to represent the glottal excitation signal using the sub-band temporal envelope. Instants of maximum glottal excitation, or Glottal Closure Instants (GCI), are extracted from the estimated glottal excitation pattern and the result is compared with a standard GCI computation method, DYPSA [1]. The performance of the algorithm is also compared on noisy signals, and it is shown that the proposed method's GCI estimates vary less under noisy conditions than those of DYPSA. The algorithm is evaluated on the CMU-ARCTIC database.
Abstract:
This work proposes a boosting-based transfer learning approach for head-pose classification from multiple, low-resolution views. Head-pose classification performance is adversely affected when the source (training) and target (test) data arise from different distributions (due to changes in face appearance, lighting, etc.). Under such conditions, we employ Xferboost, a Logitboost-based transfer learning framework that integrates knowledge from a few labeled target samples with the source model to effectively minimize misclassifications on the target data. Experiments confirm that the Xferboost framework can improve classification performance by up to 6% when knowledge is transferred between the CLEAR and FBK four-view head-pose datasets.
Abstract:
Multi-view head-pose estimation in low-resolution, dynamic scenes is difficult due to blurred facial appearance and perspective changes as targets move around freely in the environment. Under these conditions, acquiring sufficient training examples to learn the dynamic relationship between position, face appearance and head-pose can be very expensive. Instead, a transfer learning approach is proposed in this work. Upon learning a weighted-distance function from many examples where the target position is fixed, we adapt these weights to the scenario where target positions are varying. The adaptation framework incorporates reliability of the different face regions for pose estimation under positional variation, by transforming the target appearance to a canonical appearance corresponding to a reference scene location. Experimental results confirm the effectiveness of the proposed approach, which outperforms the state of the art by 9.5% under relevant conditions. To aid further research on this topic, we also make DPOSE, a dynamic, multi-view head-pose dataset with ground truth, publicly available with this paper.
Abstract:
Closed-form expressions for the propagation characteristics of coupled microstrip lines with a symmetrical aperture in the ground plane are derived. Expressions for the regular microstrip coupled lines have been modified using physical insights to incorporate the effect of the aperture. The accuracy of these expressions has been verified by full-wave simulations and compared with conformal mapping analysis. These expressions are accurate within 5% for a substrate whose thickness varies from 0.2 to 1.6 mm and whose permittivity is in the range of 2 to 10. The design of a broadband filter based on planar multi-conductor coupled lines with an aperture in the ground plane is demonstrated in this paper using the proposed expressions, illustrating their practical use.
Abstract:
The impact of global warming on daily rainfall is examined using atmospheric variables from five General Circulation Models (GCMs) and a stochastic downscaling model. Daily rainfall at eleven raingauges over the Malaprabha catchment of India and National Centers for Environmental Prediction (NCEP) reanalysis data at grid points over the catchment for a continuous time period 1971-2000 (current climate) are used to calibrate the downscaling model. The downscaled rainfall simulations obtained using GCM atmospheric variables corresponding to the IPCC-SRES (Intergovernmental Panel on Climate Change - Special Report on Emission Scenarios) A2 emission scenario for the same period are used to validate the results. Following this, future downscaled rainfall projections are constructed and examined for two 20-year time slices, viz. 2055 (i.e. 2046-2065) and 2090 (i.e. 2081-2100). The model results show reasonable skill in simulating the rainfall over the study region for the current climate. The downscaled rainfall projections indicate no significant changes in the rainfall regime in this catchment in the future. More specifically, a 2% decrease by 2055 and a 5% decrease by 2090 in monsoon (JJAS) rainfall compared to the current climate (1971-2000) are noticed under global warming conditions. Also, pre-monsoon (JFMAM) rainfall is projected to increase by 2% in 2055 and 6% in 2090, and post-monsoon (OND) rainfall by 2% in 2055 and 12% in 2090, over the region. On an annual basis, slight decreases of 1% and 2% are noted for 2055 and 2090, respectively.
Abstract:
Even though satellite observations are the most effective means to gather global information in a short span of time, the challenges in this field still remain over continental landmass, despite most of the aerosol sources being land-based. This is a hurdle in global and regional aerosol climate forcing assessment. Retrieval of aerosol properties over land is complicated due to irregular terrain characteristics and the high and largely uncertain surface reflection, which acts as 'noise' to the much smaller amount of radiation scattered by aerosols, which is the 'signal'. In this paper, we describe a satellite sensor, the 'Aerosol Satellite (AEROSAT)', which is capable of retrieving aerosols over land with much more accuracy and reduced dependence on models. The sensor, utilizing a set of multi-spectral and multi-angle measurements of polarized components of radiation reflected from the Earth's surface, along with measurements of thermal infrared broadband radiance, results in a large reduction of the 'noise' component (compared to the 'signal'). A conceptual engineering model of AEROSAT has been designed, developed and used to measure the land-surface features in the visible spectral band. Analysing the received signals using a polarization radiative transfer approach, we demonstrate the superiority of this method. It is expected that satellites carrying sensors following the AEROSAT concept would be 'self-sufficient', obtaining all the relevant information required for aerosol retrieval from their own measurements.