27 resultados para Multimodal översättningsanalys
Resumo:
A novel approach that can more effectively use the structural information provided by the traditional imaging modalities in multimodal diffuse optical tomographic imaging is introduced. This approach is based on a prior image-constrained-l(1) minimization scheme and has been motivated by the recent progress in the sparse image reconstruction techniques. It is shown that the proposed framework is more effective in terms of localizing the tumor region and recovering the optical property values both in numerical and gelatin phantom cases compared to the traditional methods that use structural information. (C) 2012 Optical Society of America
Resumo:
Adaptive Gaussian Mixture Models (GMM) have been one of the most popular and successful approaches to perform foreground segmentation on multimodal background scenes. However, the good accuracy of the GMM algorithm comes at a high computational cost. An improved GMM technique was proposed by Zivkovic to reduce computational cost by minimizing the number of modes adaptively. In this paper, we propose a modification to his adaptive GMM algorithm that further reduces execution time by replacing expensive floating point computations with low cost integer operations. To maintain accuracy, we derive a heuristic that computes periodic floating point updates for the GMM weight parameter using the value of an integer counter. Experiments show speedups in the range of 1.33 - 1.44 on standard video datasets where a large fraction of pixels are multimodal.
Resumo:
In this paper, we present a machine learning approach for subject independent human action recognition using depth camera, emphasizing the importance of depth in recognition of actions. The proposed approach uses the flow information of all 3 dimensions to classify an action. In our approach, we have obtained the 2-D optical flow and used it along with the depth image to obtain the depth flow (Z motion vectors). The obtained flow captures the dynamics of the actions in space time. Feature vectors are obtained by averaging the 3-D motion over a grid laid over the silhouette in a hierarchical fashion. These hierarchical fine to coarse windows capture the motion dynamics of the object at various scales. The extracted features are used to train a Meta-cognitive Radial Basis Function Network (McRBFN) that uses a Projection Based Learning (PBL) algorithm, referred to as PBL-McRBFN, henceforth. PBL-McRBFN begins with zero hidden neurons and builds the network based on the best human learning strategy, namely, self-regulated learning in a meta-cognitive environment. When a sample is used for learning, PBLMcRBFN uses the sample overlapping conditions, and a projection based learning algorithm to estimate the parameters of the network. The performance of PBL-McRBFN is compared to that of a Support Vector Machine (SVM) and Extreme Learning Machine (ELM) classifiers with representation of every person and action in the training and testing datasets. Performance study shows that PBL-McRBFN outperforms these classifiers in recognizing actions in 3-D. Further, a subject-independent study is conducted by leave-one-subject-out strategy and its generalization performance is tested. It is observed from the subject-independent study that McRBFN is capable of generalizing actions accurately. The performance of the proposed approach is benchmarked with Video Analytics Lab (VAL) dataset and Berkeley Multimodal Human Action Database (MHAD). (C) 2013 Elsevier Ltd. All rights reserved.
Resumo:
This paper explains the algorithm of Modified Roaming Optimization (MRO) for capturing the multiple optima for multimodal functions. There are some similarities between the Roaming Optimization (RO) and MRO algorithms, but the MRO algorithm is created to overcome the problems facing while applying the RO to the problems possessing large number of solutions. The MRO mainly uses the concept of density to overcome the challenges posed by RO. The algorithm is tested with standard test functions and also discussions are made to improve the efficacy of the MRO algorithm. This paper also gives the results of MRO applied for solving Inverse Kinematics (IK) problem for SCARA and PUMA robots.
Resumo:
A droplet introduced in an external convective flow field exhibits significant multimodal shape oscillations depending upon the intensity of the aerodynamic forcing. In this paper, a theoretical model describing the temporal evolution of normal modes of the droplet shape is developed. The fluid is assumed to be weakly viscous and Newtonian. The convective flow velocity, which is assumed to be incompressible and inviscid, is incorporated in the model through the normal stress condition at the droplet surface and the equation of motion governing the dynamics of each mode is derived. The coupling between the external flow and the droplet is approximated to be a one-way process, i.e., the external flow perturbations effect the droplet shape oscillations and the droplet oscillation itself does not influence the external flow characteristics. The shape oscillations of the droplet with different fluid properties under different unsteady flow fields were simulated. For a pulsatile external flow, the frequency spectra of the normal modes of the droplet revealed a dominant response at the resonant frequency, in addition to the driving frequency and the corresponding harmonics. At driving frequencies sufficiently different from the resonant frequency of the prolate-oblate oscillation mode of the droplet, the oscillations are stable. But at resonance the oscillation amplitude grows in time leading to breakup depending upon the fluid viscosity. A line vortex advecting past the droplet, simulated as an isotropic jump in the far field velocity, leads to the resonant excitation of the droplet shape modes if and only if the time taken by the vortex to cross the droplet is less than the resonant period of the P-2 mode of the droplet. A train of two vortices interacting with the droplet is also analysed. It shows clearly that the time instant of introduction of the second vortex with respect to the droplet shape oscillation cycle is crucial in determining the amplitude of oscillation. (C) 2014 AIP Publishing LLC.
Resumo:
USC-TIMIT is an extensive database of multimodal speech production data, developed to complement existing resources available to the speech research community and with the intention of being continuously refined and augmented. The database currently includes real-time magnetic resonance imaging data from five male and five female speakers of American English. Electromagnetic articulography data have also been presently collected from four of these speakers. The two modalities were recorded in two independent sessions while the subjects produced the same 460 sentence corpus used previously in the MOCHA-TIMIT database. In both cases the audio signal was recorded and synchronized with the articulatory data. The database and companion software are freely available to the research community. (C) 2014 Acoustical Society of America.
Resumo:
Injection of liquid fuel in cross flowing air has been a strategy for future aircraft engines in order to control the emissions. In this context, breakup of a pressure swirl spray in gaseous cross-flow is investigated experimentally. The atomizer discharges a conical swirling sheet of liquid that interacts with cross-flowing air. This complex interaction and the resulting spray structures at various flow conditions are studied through flow visualization using still as well as high speed photography. Experiments are performed over a wide range of aerodynamic Weber number (2-300) and liquid-to-air momentum flux ratio (5-150). Various breakup regimes exhibiting different breakup processes are mapped on a parameter space based on flow conditions. This map shows significant variations from breakup regime map for a plain liquid jet in cross-flow. It is observed that the breakup of leeward side of the sheet is dominated by bag breakup and the windward side of the sheet undergoes breakup through surface waves. Similarities and differences between bag breakup present in plain liquid jet in cross-flow and swirl spray in cross-flow are explained. Multimodal drop size distribution from bag breakup, frequency of bag breakup, wavelength of surface waves and trajectory of spray in cross-flow are measured by analyzing the spray images and parametric study of their variations is also presented. (C) 2014 Elsevier Ltd. All rights reserved.
Resumo:
Magnetic Resonance Imaging (MRI) is a widely used non-invasive medical tool for detection and diagnosis of cancer. In recent years, MRI has witnessed significant contributions from nanotechnology to incorporate advanced features such as multimodality of nanoparticles, therapeutic delivery, specific targeting and the optical detectability for molecular imaging. Accurate composition, right scheme of surface chemistry and properly designed structure is essential for achieving desired properties of nanomaterials such as non-fouling surface, high imaging contrast, chemical stability, target specificity and/or multimodality. This review provides an overview of the recent progress in theranostic nanomaterials in imaging and the development of nanomaterial based magnetic resonance imaging of cancer. In particular, targeted theranostics is a promising approach along with its targeting strategy in cancer treatment using MRI and multimodal imaging. We also discuss recent advances in integrin mediated targeted MRI of cancer.
Resumo:
Plants emit volatile organic compounds (VOCs) from most parts of their anatomy. Conventionally, the volatiles of leaves, flowers, fruits and seeds have been investigated separately. This review presents an integrated perspective of volatiles produced by fruits and seeds in the context of selection on the whole plant. It suggests that fruit and seed volatiles may only be understood in the light of the chemistry of the whole plant. Fleshy fruit may be viewed as an ecological arena within which several evolutionary games are being played involving fruit VOCs. Fruit odour and colour may be correlated and interact via multimodal signalling in influencing visits by frugivores. The hypothesis of volatile crypsis in the evolution of hard seeds as protection against volatile diffusion and perception by seed predators is reviewed. Current views on the role of volatiles in ant dispersal of seeds or myrmecochory are summarised, especially the suggestion that ants are being manipulated by plants in the form of a sensory trap while providing this service. Plant VOC production is presented as an emergent phenotype that could result from multiple selection pressures acting on various plant parts; the ``plant'' phenotype and VOC profile may receive significant contributions from symbionts within the plant. Viewing the plant as a holobiont would benefit an understanding of the emergent plant phenotype.
Resumo:
Action recognition plays an important role in various applications, including smart homes and personal assistive robotics. In this paper, we propose an algorithm for recognizing human actions using motion capture action data. Motion capture data provides accurate three dimensional positions of joints which constitute the human skeleton. We model the movement of the skeletal joints temporally in order to classify the action. The skeleton in each frame of an action sequence is represented as a 129 dimensional vector, of which each component is a 31) angle made by each joint with a fixed point on the skeleton. Finally, the video is represented as a histogram over a codebook obtained from all action sequences. Along with this, the temporal variance of the skeletal joints is used as additional feature. The actions are classified using Meta-Cognitive Radial Basis Function Network (McRBFN) and its Projection Based Learning (PBL) algorithm. We achieve over 97% recognition accuracy on the widely used Berkeley Multimodal Human Action Database (MHAD).
Resumo:
Isospectral beams have identical free vibration frequency spectrum for a specific boundary condition. The problem of finding non-uniform beams which are isospectral to a given uniform beam, with fixed-free boundary condition, leads to a multimodal optimization problem. The first Q natural frequencies of the given uniform Euler-Bernoulli beam are determined using analytical solution. The first Q natural frequencies of a non-uniform beam are obtained with the help of finite element modeling. In order to obtain the non-uniform beams isospectral to a given uniform beam, an error function is designed, which calculates the difference between the spectra of the given uniform beam and the non-uniform beam. In our study, this error function is minimized using electromagnetism inspired optimization technique, a population based iterative algorithm inspired by the attraction-repulsion physics of electromagnetism. Numerical results show the existence of the isospectral non-uniform beams for a given uniform beam, which occur as local minima. Non-uniform beams isospectral to a damaged beam, are also explored using the proposed methodology to illustrate the fact that accurate structural damage identification is difficult by just frequency measurements. (C) 2012 Elsevier B.V. All rights reserved.
Resumo:
Polyurethane foams with multimodal cell distribution exhibit superior mechanical and thermal properties. A technique for generating bimodal bubble size distribution exists in the literature, but it uses supercritical conditions. In the present work, an alternative based on milder operating conditions is proposed. It is a modification of reaction injection molding (RIM), using reactants already seeded with bubbles. The number density of the seeds determines if two nucleating events can occur. A bimodal bubble size distribution is obtained when this happens A mathematical model is used to test this hypothesis by simulating water blown free rise polyurethane foams. The effects of initial concentration of bubbles, temperature of the reactants, and the weight fraction of water are studied. The study reveals that for certain concentrations of initial number of bubbles, when initial temperature and weight fraction of water are high, it is possible to obtain a second nucleation event, leading to bimodal bubble size distribution.