970 resultados para Cross-lingual document retrieval
Resumo:
In this paper, we present a new feature-based approach for mosaicing of camera-captured document images. A novel block-based scheme is employed to ensure that corners can be reliably detected over a wide range of images. 2-D discrete cosine transform is computed for image blocks defined around each of the detected corners and a small subset of the coefficients is used as a feature vector A 2-pass feature matching is performed to establish point correspondences from which the homography relating the input images could be computed. The algorithm is tested on a number of complex document images casually taken from a hand-held camera yielding convincing results.
Resumo:
Skew correction of complex document images is a difficult task. We propose an edge-based connected component approach for robust skew correction of documents with complex layout and content. The algorithm essentially consists of two steps - an 'initialization' step to determine the image orientation from the centroids of the connected components and a 'search' step to find the actual skew of the image. During initialization, we choose two different sets of points regularly spaced across the the image, one from the left to right and the other from top to bottom. The image orientation is determined from the slope between the two succesive nearest neighbors of each of the points in the chosen set. The search step finds succesive nearest neighbors that satisfy the parameters obtained in the initialization step. The final skew is determined from the slopes obtained in the 'search' step. Unlike other connected component based methods, the proposed method does not require any binarization step that generally precedes connected component analysis. The method works well for scanned documents with complex layout of any skew with a precision of 0.5 degrees.
Resumo:
The document images that are fed into an Optical Character Recognition system, might be skewed. This could be due to improper feeding of the document into the scanner or may be due to a faulty scanner. In this paper, we propose a skew detection and correction method for document images. We make use of the inherent randomness in the Horizontal Projection profiles of a text block image, as the skew of the image varies. The proposed algorithm has proved to be very robust and time efficient. The entire process takes less than a second on a 2.4 GHz Pentium IV PC.
Resumo:
Prior to embarking on further study into the subject of relevance it is essential to consider why the concept of relevance has remained inconclusive, despite extensive research and its centrality to the discipline of information science. The approach taken in this paper is to reconstruct the science of information retrieval from first principles including the problem statement, role, scope and objective. This framework for document selection is put forward as a straw man for comparison with the historical relevance models. The paper examines five influential relevance models over the past 50 years. Each is examined with respect to its treatment of relevance and compared with the first principles model to identify contributions and deficiencies. The major conclusion drawn is that relevance is a significantly overloaded concept which is both confusing and detrimental to the science.
Resumo:
The ultrafast vibrational phase relaxation of O–H stretch in bulk water is investigated in molecular dynamics simulations. The dephasing time (T2) of the O–H stretch in bulk water calculated from the frequency fluctuation time correlation function (Cω(t)) is in the range of 70–80 femtosecond (fs), which is comparable to the characteristic timescale obtained from the vibrational echo peak shift measurements using infrared photon echo [W.P. de Boeij, M.S. Pshenichnikov, D.A. Wiersma, Ann. Rev. Phys. Chem. 49 (1998) 99]. The ultrafast decay of Cω(t) is found to be responsible for the ultrashort T2 in bulk water. Careful analysis reveals the following two interesting reasons for the ultrafast decay of Cω(t). (A) The large amplitude angular jumps of water molecules (within 30–40 fs time duration) provide a large scale contribution to the mean square vibrational frequency fluctuation and gives rise to the rapid spectral diffusion on 100 fs time scale. (B) The projected force, due to all the atoms of the solvent molecules on the oxygen (FO(t)) and hydrogen (FH(t)) atom of the O–H bond exhibit a large negative cross-correlation (NCC). We further find that this NCC is partly responsible for a weak, non-Arrhenius temperature dependence of the dephasing rate.
Resumo:
A model for total cross-sections incorporating QCD jet cross-sections and soft gluon resummation is described and compared with present data on pp and pp cross-sections. Predictions for LHC are presented for different parameter sets. It is shown that they differ according to the small x-behaviour of available parton density functions.
Resumo:
The increased availability of image capturing devices has enabled collections of digital images to rapidly expand in both size and diversity. This has created a constantly growing need for efficient and effective image browsing, searching, and retrieval tools. Pseudo-relevance feedback (PRF) has proven to be an effective mechanism for improving retrieval accuracy. An original, simple yet effective rank-based PRF mechanism (RB-PRF) that takes into account the initial rank order of each image to improve retrieval accuracy is proposed. This RB-PRF mechanism innovates by making use of binary image signatures to improve retrieval precision by promoting images similar to highly ranked images and demoting images similar to lower ranked images. Empirical evaluations based on standard benchmarks, namely Wang, Oliva & Torralba, and Corel datasets demonstrate the effectiveness of the proposed RB-PRF mechanism in image retrieval.
Resumo:
We discuss the infrared limit for soft gluon k(t)-resummation and relate it to physical observables such as the intrinsic transverse momentum and the high energy limit of total cross-sections.
Resumo:
Objective: To identify key stakeholder preferences and priorities when considering a national healthcare-associated infection (HAI) surveillance programme through the use of a discrete choice experiment (DCE). Setting: Australia does not have a national HAI surveillance programme. An online web-based DCE was developed and made available to participants in Australia. Participants: A sample of 184 purposively selected healthcare workers based on their senior leadership role in infection prevention in Australia. Primary and secondary outcomes: A DCE requiring respondents to select 1 HAI surveillance programme over another based on 5 different characteristics (or attributes) in repeated hypothetical scenarios. Data were analysed using a mixed logit model to evaluate preferences and identify the relative importance of each attribute. Results: A total of 122 participants completed the survey (response rate 66%) over a 5-week period. Excluding 22 who mismatched a duplicate choice scenario, analysis was conducted on 100 responses. The key findings included: 72% of stakeholders exhibited a preference for a surveillance programme with continuous mandatory core components (mean coefficient 0.640 (p<0.01)), 65% for a standard surveillance protocol where patient-level data are collected on infected and non-infected patients (mean coefficient 0.641 (p<0.01)), and 92% for hospital-level data that are publicly reported on a website and not associated with financial penalties (mean coefficient 1.663 (p<0.01)). Conclusions: The use of the DCE has provided a unique insight to key stakeholder priorities when considering a national HAI surveillance programme. The application of a DCE offers a meaningful method to explore and quantify preferences in this setting.
Resumo:
In this brief, we present a new circuit technique to generate the sigmoid neuron activation function (NAF) and its derivative (DNAF). The circuit makes use of transistor asymmetry in cross-coupled differential pair to obtain the derivative. The asymmetry is introduced through external control signal, as and when required. This results in the efficient utilization of the hard-ware by realizing NAF and DNAF using the same building blocks. The operation of the circuit is presented in the subthreshold region for ultra low-power applications. The proposed circuit has been experimentally prototyped and characterized as a proof of concept on the 1.5-mum AMI technology.
Resumo:
The conformational stability of Plasmodium falciparum triosephosphate isomerase (TIMWT) enzyme has been investigated in urea and guanidinium chloride (GdmCl) solutions using circular dichroism, fluorescence, and size-exclusion chromatography. The dimeric enzyme is remarkably stable in urea solutions. It retains considerable secondary, tertiary, and quaternary structure even in 8 M urea. In contrast, the unfolding transition is complete by 2.4 M GdmCl. Although the secondary as well as the tertiary interactions melt before the perturbation of the quaternary structure, these studies imply that the dissociation of the dimer into monomers ultimately leads to the collapse of the structure, suggesting that the interfacial interactions play a major role in determining multimeric protein stability. The C-m(urea)/C-m(GdmCl) ratio (where C-m is the concentration of the denaturant required at the transition midpoint) is unusually high for triosephosphate isomerase as compared to other monomeric and dimeric proteins. A disulfide crosslinked mutant protein (Y74C) engineered to form two disulfide cross-links across the interface (13-74') and (13'-74) is dramatically destablized in urea. The unfolding transition is complete by 6 M urea and involves a novel mechanism of dimer dissociation through intramolecular thiol-disulfide exchange.