983 results for Adaptive Image Binarization
Abstract:
The work is intended to study the following important aspects of document image processing and to develop new methods: (1) segmentation of document images using an adaptive interval-valued neuro-fuzzy method; (2) improvement of the segmentation procedure using the Simulated Annealing technique; (3) development of optimized compression algorithms using a Genetic Algorithm and a parallel Genetic Algorithm; (4) feature extraction from document images; (5) development of interval-valued (IV) fuzzy rules. The work also supports feature extraction and foreground/background identification. The proposed work incorporates evolutionary and hybrid methods for the segmentation and compression of document images, and includes a study of the different neural networks used in image processing and of developments in the area of fuzzy logic.
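As a rough illustration of how Simulated Annealing could drive a segmentation or binarization step of the kind listed above, here is a minimal sketch that anneals a single global threshold against an Otsu-style criterion; the energy function, neighbourhood move and cooling schedule are illustrative assumptions, not the thesis's method:

```python
import numpy as np

def between_class_variance(hist, t):
    """Otsu-style criterion: higher means a better foreground/background split."""
    p = hist / hist.sum()
    w0, w1 = p[:t].sum(), p[t:].sum()
    if w0 == 0 or w1 == 0:
        return 0.0
    bins = np.arange(len(p))
    mu0 = (bins[:t] * p[:t]).sum() / w0
    mu1 = (bins[t:] * p[t:]).sum() / w1
    return w0 * w1 * (mu0 - mu1) ** 2

def anneal_threshold(image, steps=2000, temp0=50.0, cooling=0.995, seed=0):
    """Simulated annealing over one global threshold (assumes 8-bit grayscale)."""
    rng = np.random.default_rng(seed)
    hist, _ = np.histogram(image, bins=256, range=(0, 256))
    t = 128
    best_t, best_e = t, -between_class_variance(hist, t)
    e, temp = best_e, temp0
    for _ in range(steps):
        cand = int(np.clip(t + rng.integers(-8, 9), 1, 255))  # neighbouring threshold
        e_cand = -between_class_variance(hist, cand)
        # accept improvements always, worse moves with Boltzmann probability
        if e_cand < e or rng.random() < np.exp((e - e_cand) / temp):
            t, e = cand, e_cand
            if e < best_e:
                best_t, best_e = t, e
        temp *= cooling
    return best_t  # binarize with: image > best_t
```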
Abstract:
In this paper, a new directionally adaptive, learning-based, single-image super-resolution method using a multiple-direction wavelet transform, called Directionlets, is presented. The method uses directionlets to effectively capture directional features and to extract edge information along different directions from a set of available high-resolution images. This information is used as the training set for super-resolving a low-resolution input image: the Directionlet coefficients at finer scales of its high-resolution counterpart are learned locally from this training set, and the inverse Directionlet transform recovers the super-resolved high-resolution image. Simulation results show that the proposed approach outperforms standard interpolation techniques such as cubic-spline interpolation, as well as standard wavelet-based learning, both visually and in terms of mean squared error (MSE). The method also gives good results on aliased images.
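Directionlets are not available in common libraries, so the following sketch illustrates the learning idea with a separable wavelet (PyWavelets) as a stand-in: the detail coefficients of a high-resolution training image are looked up by nearest-neighbour patch matching against the low-resolution input, which is treated as the approximation band of the unknown high-resolution image. The patch size, matching rule and gain factor are illustrative assumptions, not the paper's algorithm:

```python
import numpy as np
import pywt

PATCH = 5  # illustrative patch size

def _patches(img):
    """All PATCH x PATCH patches of `img`, flattened, with their centre coordinates."""
    h, w = img.shape
    r = PATCH // 2
    coords = [(i, j) for i in range(r, h - r) for j in range(r, w - r)]
    feats = np.array([img[i - r:i + r + 1, j - r:j + r + 1].ravel() for i, j in coords])
    return feats, coords

def learn_details(lr, hr_train, wavelet="db2"):
    """Predict the fine-scale detail bands of `lr` by nearest-neighbour patch
    matching against a high-resolution training image, then invert the transform."""
    cA_t, details_t = pywt.dwt2(hr_train, wavelet)
    feats_t, coords_t = _patches(cA_t)
    # Treat the LR input as the approximation band of the unknown HR image
    # (the factor 2 approximates the gain of an orthonormal analysis filter);
    # shapes are assumed compatible with the training decomposition.
    cA_q = 2.0 * np.asarray(lr, float)[:cA_t.shape[0], :cA_t.shape[1]]
    feats_q, coords_q = _patches(cA_q)
    pred = [np.zeros_like(cA_q) for _ in range(3)]   # predicted cH, cV, cD bands
    for q, (i, j) in zip(feats_q, coords_q):
        k = int(np.argmin(((feats_t - q) ** 2).sum(axis=1)))  # closest training patch
        ti, tj = coords_t[k]
        for b in range(3):
            pred[b][i, j] = details_t[b][ti, tj]
    return pywt.idwt2((cA_q, tuple(pred)), wavelet)  # super-resolved estimate
```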
Abstract:
The thesis explores the area of still image compression. Image compression techniques can be broadly classified into lossless and lossy compression. The most common lossy compression techniques are based on transform coding, vector quantization and fractals. Transform coding is the simplest of these and generally employs reversible transforms such as the DCT and DWT. The Mapped Real Transform (MRT) is an evolving integer transform based on real additions alone. The present research work aims at developing new image compression techniques based on the MRT. Most transform coding techniques employ fixed-block-size image segmentation, usually 8×8. Hence, fixed-block-size transform coding is implemented using the MRT, and its merits and demerits are analyzed for both 8×8 and 4×4 blocks. The N² unique MRT coefficients for each block are computed using templates. Considering the merits and demerits of fixed-block-size transform coding, a hybrid form of these techniques is implemented to improve compression performance, and the hybrid coder is found to perform better than the fixed-block-size coders. Thus, if the block size is made adaptive, the performance can be improved further. In adaptive block size coding, the block size may vary from the size of the image down to 2×2, so computing the MRT using templates becomes impractical due to memory requirements. An adaptive transform coder based on the Unique MRT (UMRT), a compact form of the MRT, is therefore implemented to obtain better performance in terms of PSNR and HVS measures. The suitability of the MRT for vector quantization of images is then investigated, and a UMRT-based Classified Vector Quantization (CVQ) is implemented, in which the edges in the images are identified and classified using a UMRT-based criterion. Based on these experiments, a new technique named "MRT-based Adaptive Transform Coder with Classified Vector Quantization (MATC-CVQ)" is developed. Its performance is evaluated and compared against existing techniques, including standard JPEG and Shapiro's well-known Embedded Zerotree Wavelet (EZW) coder; the proposed technique is found to give better performance for the majority of images.
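Since the MRT/UMRT is not part of standard toolkits, the sketch below uses the block DCT as a stand-in to make the fixed-block-size transform coding pipeline the thesis starts from concrete; the keep-K coefficient rule is an illustrative surrogate for the actual quantization scheme:

```python
import numpy as np
from scipy.fftpack import dct, idct

def dct2(b):  return dct(dct(b, axis=0, norm="ortho"), axis=1, norm="ortho")
def idct2(b): return idct(idct(b, axis=0, norm="ortho"), axis=1, norm="ortho")

def block_code(img, block=8, keep=10):
    """Fixed-block-size transform coding: keep the `keep` largest-magnitude
    coefficients per block and zero the rest (edge blocks smaller than `block`
    are skipped in this sketch)."""
    h, w = img.shape
    out = np.zeros_like(img, dtype=float)
    for i in range(0, h - h % block, block):
        for j in range(0, w - w % block, block):
            c = dct2(img[i:i + block, j:j + block].astype(float))
            thr = np.sort(np.abs(c).ravel())[-keep]
            c[np.abs(c) < thr] = 0.0
            out[i:i + block, j:j + block] = idct2(c)
    return out

def psnr(a, b, peak=255.0):
    mse = np.mean((a.astype(float) - b.astype(float)) ** 2)
    return 10 * np.log10(peak ** 2 / mse)
```

Comparing psnr(img, block_code(img, block=8)) with the same call for block=4 reproduces, in spirit, the 8×8 versus 4×4 analysis mentioned above.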
Abstract:
Self-organizing neural networks have been implemented in a wide range of application areas such as speech processing, image processing, optimization and robotics. Recent variations of the basic model proposed by the authors enable it to order state space using a subset of the input vector and to apply a local adaptation procedure that does not rely on a predefined test duration limit. Both of these variations have been incorporated into a new feature map architecture that forms an integral part of a Hybrid Learning System (HLS) based on a genetics-based classifier system. Problems are represented within HLS as objects characterized by environmental features. Objects controlled by the system have preset targets set against a subset of their features. The system's objective is to achieve these targets by evolving a behavioural repertoire that efficiently explores and exploits the problem environment. Feature maps encode two types of knowledge within HLS: long-term memory traces of useful regularities within the environment, and classifier performance data calibrated against an object's feature states and targets. Self-organization of these networks constitutes non-genetic (experience-driven) learning within HLS. This paper presents a description of the HLS architecture and an analysis of the modified feature map implementing associative memory. Initial results are presented that demonstrate the behaviour of the system on a simple control task.
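For readers unfamiliar with the underlying network, a plain Kohonen self-organizing map (without the HLS-specific extensions described above) can be sketched as follows; the grid size, learning rate and neighbourhood schedules are illustrative:

```python
import numpy as np

def train_som(data, grid=(10, 10), epochs=20, lr0=0.5, sigma0=3.0, seed=0):
    """Plain Kohonen self-organizing map: move the winning node and its grid
    neighbours toward each input vector, shrinking learning rate and radius."""
    rng = np.random.default_rng(seed)
    n_nodes = grid[0] * grid[1]
    weights = rng.random((n_nodes, data.shape[1]))
    coords = np.array([(i, j) for i in range(grid[0]) for j in range(grid[1])], float)
    n_steps = epochs * len(data)
    step = 0
    for _ in range(epochs):
        for x in rng.permutation(data):
            frac = step / n_steps
            lr = lr0 * (1 - frac)                       # decaying learning rate
            sigma = sigma0 * (1 - frac) + 1e-3          # shrinking neighbourhood radius
            winner = np.argmin(((weights - x) ** 2).sum(axis=1))
            d2 = ((coords - coords[winner]) ** 2).sum(axis=1)
            h = np.exp(-d2 / (2 * sigma ** 2))          # neighbourhood function
            weights += lr * h[:, None] * (x - weights)  # pull nodes toward the input
            step += 1
    return weights.reshape(grid[0], grid[1], -1)
```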
Abstract:
The tap-length, i.e., the number of taps, is an important structural parameter of the linear MMSE adaptive filter. Although the optimum tap-length that balances performance and complexity varies with the scenario, most current adaptive filters fix the tap-length at some compromise value, making them inefficient to implement, especially in time-varying scenarios. A novel gradient-search-based variable tap-length algorithm is proposed, using the concept of the pseudo-fractional tap-length, and it is shown that the new algorithm converges to the optimum tap-length in the mean. Results of computer simulations are provided to verify the analysis.
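A minimal sketch in the spirit of the pseudo-fractional tap-length idea is given below: an LMS filter compares the error of its full L-tap output with that of its first L - delta taps and nudges a fractional length accordingly. The constants (leakage, step sizes, re-quantization rule) are illustrative assumptions, not the paper's exact algorithm:

```python
import numpy as np

def vt_lms(x, d, mu=0.01, L0=8, L_max=64, delta=4, alpha=0.01, gamma=0.05):
    """LMS with a pseudo-fractional tap-length: the error of the full L-tap
    filter is compared with that of its first (L - delta) taps, and the
    structural tap-length grows or shrinks accordingly."""
    w = np.zeros(L_max)              # over-allocate; only the first L taps are active
    L, lf = L0, float(L0)            # integer tap-length and its fractional shadow
    lengths, errors = [], []
    for n in range(L_max, len(x)):
        u = x[n - L_max + 1:n + 1][::-1]                # most recent sample first
        e_full = d[n] - w[:L] @ u[:L]                   # error using all L taps
        e_part = d[n] - w[:L - delta] @ u[:L - delta]   # error using L - delta taps
        w[:L] += mu * e_full * u[:L]                    # ordinary LMS update
        # Leaky fractional-length update: grow when dropping delta taps hurts.
        lf = (lf - alpha) + gamma * (e_part ** 2 - e_full ** 2)
        lf = min(max(lf, delta + 1), L_max)
        if abs(L - lf) >= 1:                            # re-quantize the active length
            L = int(round(lf))
        lengths.append(L)
        errors.append(e_full)
    return w, np.array(lengths), np.array(errors)
```

In a system-identification test where the unknown response has, say, 20 significant taps, the recorded lengths should settle near that value.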
Abstract:
Biological processes are very complex mechanisms, most of them being accompanied by, or manifested as, signals that reflect their essential characteristics and qualities. The development of diagnostic techniques based on signal and image acquisition from the human body is commonly regarded as one of the driving factors behind the advances in medicine and the biosciences recorded in the recent past. The instruments used for biological signal and image recording, like any other acquisition system, are affected by non-idealities which, to different degrees, negatively impact the accuracy of the recording. This work discusses how these effects can be attenuated, and ideally removed, with particular attention to ultrasound imaging and extracellular recordings. Original algorithms developed during the Ph.D. research activity are examined and compared to those in the literature tackling the same problems; results are drawn from comparative tests on both synthetic and in-vivo acquisitions, evaluating standard metrics in the respective fields of application. All the developed algorithms share an adaptive approach to signal analysis, meaning that their behavior is not determined by designer choices alone but is also driven by the characteristics of the input signal. Performance comparisons following the state of the art in image quality assessment, contrast gain estimation and resolution gain quantification, as well as visual inspection, highlight very good results for the proposed ultrasound image deconvolution and restoration algorithms: axial resolution up to 5 times better than that of algorithms in the literature is achievable. Concerning extracellular recordings, the results of the proposed denoising technique, compared to other signal processing algorithms, show an improvement over the state of the art of almost 4 dB.
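The thesis's algorithms are not reproduced here, but a generic frequency-domain Wiener deconvolution makes the ultrasound restoration problem concrete; the pulse estimate and the noise-to-signal ratio are assumptions that a truly adaptive scheme would derive from the data:

```python
import numpy as np

def wiener_deconvolve(rf_line, psf, nsr=0.05):
    """Frequency-domain Wiener deconvolution of a 1-D RF line by a known pulse.
    `nsr` is an assumed noise-to-signal power ratio (a fixed tuning constant
    here; an adaptive scheme would estimate it from the signal itself)."""
    n = len(rf_line)
    H = np.fft.rfft(psf, n)                  # transfer function of the imaging pulse
    Y = np.fft.rfft(rf_line)
    G = np.conj(H) / (np.abs(H) ** 2 + nsr)  # Wiener inverse filter
    return np.fft.irfft(G * Y, n)
```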
Abstract:
In recent years, due to the rapid convergence of multimedia services, the Internet and wireless communications, there has been a growing trend toward heterogeneity (in terms of channel bandwidths, mobility levels of terminals and end-user quality-of-service (QoS) requirements) in emerging integrated wired/wireless networks. Moreover, in today's systems a multitude of users coexists within the same network, each with its own QoS requirement and bandwidth availability. In this framework, embedded source coding, which allows partial decoding at various resolutions, is an appealing technique for multimedia transmission. This dissertation covers my Ph.D. research, mainly devoted to the study of embedded multimedia bitstreams in heterogeneous networks, developed at the University of Bologna, advised by Prof. O. Andrisano and Prof. A. Conti, and at the University of California, San Diego (UCSD), where I spent eighteen months as a visiting scholar, advised by Prof. L. B. Milstein and Prof. P. C. Cosman. In order to improve multimedia transmission quality over wireless channels, joint source and channel coding optimization is investigated in a 2D time-frequency resource block for an OFDM system. We show that knowing the order of diversity in the time and/or frequency domain can assist image (video) coding in selecting optimal channel code rates (source and channel code rates). Adaptive modulation techniques, aimed at maximizing the spectral efficiency, are then investigated as another possible means of improving multimedia transmission. For both slow and fast adaptive modulation, the effects of imperfect channel estimation are evaluated, showing that the fast technique, optimal in ideal systems, may be outperformed by slow adaptive modulation when a realistic test case is considered. Finally, the effects of co-channel interference and of approximated bit error probability (BEP) expressions are evaluated for adaptive modulation techniques, providing new decision-region concepts and showing how the widely used BEP approximations lead to a substantial loss in overall performance.
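As a toy illustration of rate selection in adaptive modulation, the sketch below picks the largest M-QAM constellation meeting a target bit error probability, using a common exponential BEP approximation; both the approximation and the constellation set are assumptions, not the dissertation's system model:

```python
import numpy as np

def pick_mqam_order(snr_db, target_bep=1e-3, orders=(4, 16, 64, 256)):
    """Pick the largest M-QAM constellation whose approximate BEP stays below
    the target at the given SNR. Uses the common approximation
    BEP ~= 0.2 * exp(-1.5 * snr / (M - 1))."""
    snr = 10 ** (snr_db / 10)
    best = None
    for m in orders:
        bep = 0.2 * np.exp(-1.5 * snr / (m - 1))
        if bep <= target_bep:
            best = m
    return best  # None means even the smallest constellation misses the target

# Schematically: fast adaptation calls pick_mqam_order() on the instantaneous
# (estimated) SNR of each resource block, while slow adaptation applies it once
# to the average SNR of a frame, which is more robust to imperfect channel
# estimates.
```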
Abstract:
A new generation of high-definition computed tomography (HDCT) 64-slice devices, complemented by a new iterative image reconstruction algorithm (adaptive statistical iterative reconstruction), offers substantially higher resolution than standard-definition CT (SDCT) scanners. Because higher resolution confers higher noise, we compared image quality and radiation dose of coronary computed tomography angiography (CCTA) from HDCT versus SDCT. Consecutive patients (n = 93) underwent HDCT and were compared to 93 patients who had previously undergone CCTA with SDCT, matched for heart rate (HR), HR variability and body mass index (BMI). Tube voltage and current were adapted to the patient's BMI, using identical protocols in both groups. The image quality of all CCTA scans was evaluated by two independent readers in all coronary segments using a 4-point scale (1, excellent image quality; 2, blurring of the vessel wall; 3, image with artefacts but evaluable; 4, non-evaluable). Effective radiation dose was calculated as the DLP multiplied by a conversion factor (0.014 mSv/(mGy·cm)). The mean image quality score for HDCT versus SDCT was comparable (2.02 ± 0.68 vs. 2.00 ± 0.76). Mean effective radiation dose did not differ significantly between HDCT (1.7 ± 0.6 mSv, range 1.0-3.7 mSv) and SDCT (1.9 ± 0.8 mSv, range 0.8-5.5 mSv; P = n.s.). HDCT scanners therefore allow low-dose 64-slice CCTA scanning with higher resolution than SDCT while maintaining image quality and an equally low radiation dose. Whether this will translate into higher accuracy of HDCT for CAD detection remains to be evaluated.
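The dose calculation quoted above reduces to a one-line conversion; the example DLP below is illustrative rather than taken from the study:

```python
def effective_dose_msv(dlp_mgy_cm, k=0.014):
    """Effective dose (mSv) from the dose-length product, E = k * DLP,
    with the conversion factor k = 0.014 mSv/(mGy*cm) used in the study."""
    return k * dlp_mgy_cm

# e.g. a DLP of 120 mGy*cm gives 0.014 * 120 = 1.68 mSv, on the order of the
# ~1.7 mSv mean reported for the HDCT group.
```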
Abstract:
For decades, Distance Transforms have proven useful for many image processing applications and, more recently, they have started to be used in computer graphics environments. The goal of this paper is to propose a new technique based on Distance Transforms for detecting mesh elements that are close to the object's external contour (from a given point of view), and for using this information to weight the approximation error that will be tolerated during the mesh simplification process. The results are evaluated in two ways: visually, and using an objective metric that measures the geometrical difference between two polygonal meshes.
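A minimal sketch of the underlying idea, using SciPy's Euclidean distance transform on a rendered silhouette mask: pixels close to the external contour get a high weight, which can then scale the error tolerated when simplifying the corresponding mesh elements. The exponential falloff is an illustrative choice, not the paper's weighting function:

```python
import numpy as np
from scipy.ndimage import distance_transform_edt

def contour_weights(silhouette, falloff=20.0):
    """Per-pixel weight close to 1 near the object's external contour (as seen
    from the current viewpoint), decaying with distance from it.
    `silhouette`: boolean mask of the rendered object; `falloff` is in pixels."""
    sil = np.asarray(silhouette, bool)
    inside = distance_transform_edt(sil)      # distance to the background
    outside = distance_transform_edt(~sil)    # distance to the object
    dist_to_contour = np.where(sil, inside, outside)
    return np.exp(-dist_to_contour / falloff)
```

During simplification, the tolerated error of a mesh element could then be divided by the weight at its projected position, so silhouette elements are preserved while interior elements may be simplified more aggressively.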
Abstract:
Purpose: The rapid distal falloff of a proton beam allows sparing of normal tissues distal to the target. However, proton beams aimed directly towards critical structures are avoided because of concerns about range uncertainties, such as CT number conversion and anatomical variations. We propose to eliminate range uncertainty and enable prostate treatment with a single anterior beam by detecting the proton range at the prostate-rectal interface and adaptively adjusting the range in vivo and in real time. Materials and Methods: A prototype device, consisting of an endorectal liquid scintillation detector and dual inverted Lucite wedges for range compensation, was designed to test the feasibility and accuracy of the technique. A liquid-scintillation-filled volume was fitted with an optical fiber and placed inside the rectum of an anthropomorphic pelvic phantom. The photodiode current signal was recorded as a function of the proton beam's distal depth, and the spatial resolution of the technique was calculated by relating the variance in detecting proton spills to the maximum penetration depth. The relative water-equivalent thickness of the wedges was measured in a water phantom and prospectively tested to determine the accuracy of range corrections. Treatment simulation studies were performed to test the potential dosimetric benefit in sparing the rectum. Results: The spatial resolution of the detector in phantom measurements was 0.5 mm. The precision of the range correction was 0.04 mm. The residual margin needed to ensure CTV coverage was 1.1 mm, and the composite distal margin for 95% treatment confidence was 2.4 mm. Planning studies based on a previously estimated 2 mm margin (90% treatment confidence) for 27 patients showed rectal sparing of up to 51% at 70 Gy and 57% at 40 Gy relative to IMRT and bilateral proton treatment. Conclusion: We demonstrated the feasibility of our design. Use of this technique allows proton treatment with a single anterior beam, significantly reducing the rectal dose.
Abstract:
New-onset impairment of ocular motility causes incomitant strabismus, i.e., a gaze-dependent ocular misalignment. This ocular misalignment causes retinal disparity, that is, a deviation of the spatial position of an image on the retinas of the two eyes, which is a trigger for a vergence eye movement that results in ocular realignment. If the vergence movement fails, the eyes remain misaligned, resulting in double vision. Adaptive processes in response to such incomitant vergence stimuli are poorly understood. In this study, we investigated the physiological oculomotor response of saccadic and vergence eye movements in healthy individuals after shifting gaze from a viewing position without image disparity into a field of view with increased image disparity, i.e., under conditions mimicking incomitance. Repetitive saccadic eye movements into a visual field with increased stimulus disparity led to a rapid modification of the oculomotor response: (a) saccades showed immediate disconjugacy (p < 0.001), resulting in decreased retinal image disparity at the end of a saccade; (b) vergence kinetics improved over time (p < 0.001). This modified oculomotor response enables a more prompt restoration of ocular alignment in new-onset incomitance.
Abstract:
Monte Carlo integration is firmly established as the basis for most practical realistic image synthesis algorithms because of its flexibility and generality. However, the visual quality of rendered images often suffers from estimator variance, which appears as visually distracting noise. Adaptive sampling and reconstruction algorithms reduce variance by controlling the sampling density and aggregating samples in a reconstruction step, possibly over large image regions. In this paper we survey recent advances in this area. We distinguish between “a priori” methods that analyze the light transport equations and derive sampling rates and reconstruction filters from this analysis, and “a posteriori” methods that apply statistical techniques to sets of samples to drive the adaptive sampling and reconstruction process. They typically estimate the errors of several reconstruction filters, and select the best filter locally to minimize error. We discuss advantages and disadvantages of recent state-of-the-art techniques, and provide visual and quantitative comparisons. Some of these techniques are proving useful in real-world applications, and we aim to provide an overview for practitioners and researchers to assess these approaches. In addition, we discuss directions for potential further improvements.
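A toy "a posteriori" sketch (not any specific published method) illustrates the loop described above: pilot samples estimate per-pixel variance, the remaining budget is allocated where the variance is high, and a simple filter reconstructs the final values. The pixel sampler `sample_pixel(i, rng)` is a hypothetical user-provided callable:

```python
import numpy as np

def adaptive_render(sample_pixel, width, pilot=8, budget=64 * 128, seed=0):
    """Toy 'a posteriori' adaptive sampler for a 1-D row of pixels.
    `sample_pixel(i, rng)` returns one Monte Carlo radiance sample for pixel i."""
    rng = np.random.default_rng(seed)
    means = np.zeros(width)
    variances = np.zeros(width)
    # 1) pilot pass: estimate per-pixel mean and variance
    for i in range(width):
        s = np.array([sample_pixel(i, rng) for _ in range(pilot)])
        means[i], variances[i] = s.mean(), s.var(ddof=1)
    # 2) distribute the remaining budget proportionally to estimated variance
    extra = budget - pilot * width
    alloc = np.floor(extra * variances / max(variances.sum(), 1e-12)).astype(int)
    for i in range(width):
        if alloc[i] > 0:
            s = np.array([sample_pixel(i, rng) for _ in range(alloc[i])])
            n_old, n_new = pilot, alloc[i]
            means[i] = (n_old * means[i] + n_new * s.mean()) / (n_old + n_new)
    # 3) crude reconstruction: small box filter over neighbouring pixels
    kernel = np.ones(3) / 3
    return np.convolve(means, kernel, mode="same")
```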
Abstract:
Introduction. Investigations into the shortcomings of current intracavitary brachytherapy (ICBT) technology have led us to design an Anatomically Adaptive Applicator (A3). The goal of this work was to design and characterize the imaging and dosimetric capabilities of this device. The A3 design incorporates a single shield that can both rotate and translate within the colpostat. We hypothesized that this feature, coupled with specific A3 component construction materials and imaging techniques, would facilitate artifact-free CT and MR image acquisition. In addition, by shaping the delivered dose distribution via the A3 movable shield, the dose delivered to the rectum would be lower than for equivalent treatments using current state-of-the-art ICBT applicators. Method and materials. A method was developed to facilitate an artifact-free CT imaging protocol using a "step-and-shoot" technique: pausing the scanner midway through the scan and moving the A3 shield out of the path of the beam. The A3 CT imaging capabilities were demonstrated by acquiring images of a phantom that positioned the A3 and FW applicators in a clinically applicable geometry. Artifact-free MR imaging was achieved by using MRI-compatible ovoid components and pulse sequences that minimize susceptibility artifacts. Artifacts were qualitatively compared in a clinical setup. For the dosimetric study, Monte Carlo (MC) models of the A3 and FW (shielded and unshielded) applicators were validated. These models were incorporated into an MC model of one cervical cancer patient ICBT insertion, using 192Ir (mHDR v2 source). The A3 shield's rotation and translation were adjusted for each dwell position to minimize dose to the rectum. Superposition of the rectal dose for all A3 dwell sources (4 per ovoid) was used to obtain a comparison with equivalent FW treatments. Rectal dose-volume histograms (absolute and HDR/PDR biologically effective dose (BED)) and the BED to 2 cc (BED2cc) were determined for all applicators and compared. Results. Using the "step-and-shoot" CT scanning method and MR-compliant materials with optimized pulse sequences, images of the A3 were nearly artifact-free for both modalities. The A3 reduced BED2cc by 18.5% and 7.2% for a PDR treatment, and by 22.4% and 8.7% for an HDR treatment, compared to treatments delivered using the uFW and sFW applicators, respectively. Conclusions. The novel design of the A3 facilitated nearly artifact-free image quality for both CT and MR clinical imaging protocols. The design also provided a reduction in BED to the rectum compared to equivalent ICBT treatments delivered using current state-of-the-art applicators.