937 resultados para Text analysis
Resumo:
We present a technique for irreversible watermarking approach robust to affine transform attacks in camera, biomedical and satellite images stored in the form of monochrome bitmap images. The watermarking approach is based on image normalisation in which both watermark embedding and extraction are carried out with respect to an image normalised to meet a set of predefined moment criteria. The normalisation procedure is invariant to affine transform attacks. The result of watermarking scheme is suitable for public watermarking applications, where the original image is not available for watermark extraction. Here, direct-sequence code division multiple access approach is used to embed multibit text information in DCT and DWT transform domains. The proposed watermarking schemes are robust against various types of attacks such as Gaussian noise, shearing, scaling, rotation, flipping, affine transform, signal processing and JPEG compression. Performance analysis results are measured using image processing metrics.
Resumo:
The toplogical features of a sporadic trifurcated C-H center dot center dot center dot O interaction region, where an oxygen atom acts as an acceptor of three weak hydrogen bonds, has been investigated by experimental and theoretical charge density analysis of ferulic acid. The interaction energy of the asymmetric molecular dimer formed by the trifurcated C-H center dot center dot center dot O motif, based on the multipolar model, is shown to be greater than the corresponding asymmetric O-H center dot center dot center dot O dimer in this crystal structure. Further, the hydrogen bond energies associated with these interaction motifs have been estimated from the local kinetic and potential energy densities at the bond critical points. The trends suggest that the interaction energy of the trifurcated C-H center dot center dot center dot O region is comparable to that of a single O-H center dot center dot center dot O hydrogen bond.
Resumo:
This paper describes a semi-automatic tool for annotation of multi-script text from natural scene images. To our knowledge, this is the maiden tool that deals with multi-script text or arbitrary orientation. The procedure involves manual seed selection followed by a region growing process to segment each word present in the image. The threshold for region growing can be varied by the user so as to ensure pixel-accurate character segmentation. The text present in the image is tagged word-by-word. A virtual keyboard interface has also been designed for entering the ground truth in ten Indic scripts, besides English. The keyboard interface can easily be generated for any script, thereby expanding the scope of the toolkit. Optionally, each segmented word can further be labeled into its constituent characters/symbols. Polygonal masks are used to split or merge the segmented words into valid characters/symbols. The ground truth is represented by a pixel-level segmented image and a '.txt' file that contains information about the number of words in the image, word bounding boxes, script and ground truth Unicode. The toolkit, developed using MATLAB, can be used to generate ground truth and annotation for any generic document image. Thus, it is useful for researchers in the document image processing community for evaluating the performance of document analysis and recognition techniques. The multi-script annotation toolokit (MAST) is available for free download.
Resumo:
Scenic word images undergo degradations due to motion blur, uneven illumination, shadows and defocussing, which lead to difficulty in segmentation. As a result, the recognition results reported on the scenic word image datasets of ICDAR have been low. We introduce a novel technique, where we choose the middle row of the image as a sub-image and segment it first. Then, the labels from this segmented sub-image are used to propagate labels to other pixels in the image. This approach, which is unique and distinct from the existing methods, results in improved segmentation. Bayesian classification and Max-flow methods have been independently used for label propagation. This midline based approach limits the impact of degradations that happens to the image. The segmented text image is recognized using the trial version of Omnipage OCR. We have tested our method on ICDAR 2003 and ICDAR 2011 datasets. Our word recognition results of 64.5% and 71.6% are better than those of methods in the literature and also methods that competed in the Robust reading competition. Our method makes an implicit assumption that degradation is not present in the middle row.
Resumo:
The practice of Ayurveda, the traditional medicine of India, is based on the concept of three major constitutional types (Vata, Pitta and Kapha) defined as ``Prakriti''. To the best of our knowledge, no study has convincingly correlated genomic variations with the classification of Prakriti. In the present study, we performed genome-wide SNP (single nucleotide polymorphism) analysis (Affymetrix, 6.0) of 262 well-classified male individuals (after screening 3416 subjects) belonging to three Prakritis. We found 52 SNPs (p <= 1 x 10(-5)) were significantly different between Prakritis, without any confounding effect of stratification, after 10(6) permutations. Principal component analysis (PCA) of these SNPs classified 262 individuals into their respective groups (Vata, Pitta and Kapha) irrespective of their ancestry, which represent its power in categorization. We further validated our finding with 297 Indian population samples with known ancestry. Subsequently, we found that PGM1 correlates with phenotype of Pitta as described in the ancient text of Caraka Samhita, suggesting that the phenotypic classification of India's traditional medicine has a genetic basis; and its Prakriti-based practice in vogue for many centuries resonates with personalized medicine.
Resumo:
Computer Assisted Assessment (CAA) has been existing for several years now. While some forms of CAA do not require sophisticated text understanding (e.g., multiple choice questions), there are also student answers that consist of free text and require analysis of text in the answer. Research towards the latter till date has concentrated on two main sub-tasks: (i) grading of essays, which is done mainly by checking the style, correctness of grammar, and coherence of the essay and (ii) assessment of short free-text answers. In this paper, we present a structured view of relevant research in automated assessment techniques for short free-text answers. We review papers spanning the last 15 years of research with emphasis on recent papers. Our main objectives are two folds. First we present the survey in a structured way by segregating information on dataset, problem formulation, techniques, and evaluation measures. Second we present a discussion on some of the potential future directions in this domain which we hope would be helpful for researchers.
Resumo:
Contributed to: "Measuring the Changes": 13th FIG International Symposium on Deformation Measurements and Analysis; 4th IAG Symposium on Geodesy for Geotechnical and Structural Enginering (Lisbon, Portugal, May 12-15, 2008).
Resumo:
This dataset provides raw data of chemical analyses made during studies on seasonal variations of some major ions in the stream water of the catchment of Lake Windermere in Cumbria. Measurements of sodium, calcium, potassium, magnesium, chloride ions and pH were taken at 37 stations in the catchment between 1975 and 1978.
Resumo:
Compared with structured data sources that are usually stored and analyzed in spreadsheets, relational databases, and single data tables, unstructured construction data sources such as text documents, site images, web pages, and project schedules have been less intensively studied due to additional challenges in data preparation, representation, and analysis. In this paper, our vision for data management and mining addressing such challenges are presented, together with related research results from previous work, as well as our recent developments of data mining on text-based, web-based, image-based, and network-based construction databases.
Resumo:
Compared with construction data sources that are usually stored and analyzed in spreadsheets and single data tables, data sources with more complicated structures, such as text documents, site images, web pages, and project schedules have been less intensively studied due to additional challenges in data preparation, representation, and analysis. In this paper, our definition and vision for advanced data analysis addressing such challenges are presented, together with related research results from previous work, as well as our recent developments of data analysis on text-based, image-based, web-based, and network-based construction sources. It is shown in this paper that particular data preparation, representation, and analysis operations should be identified, and integrated with careful problem investigations and scientific validation measures in order to provide general frameworks in support of information search and knowledge discovery from such information-abundant data sources.
Resumo:
This paper reports on an extensive analysis of the electroluminescence characteristics of InGaN-based LEDs with color-coded structure, i.e., with a triple quantum well structure in which each quantum well has a different indium content. The analysis is based on combined electroluminescence measurements and two-dimensional simulations, carried out at different current and temperature levels. Results indicate that (i) the efficiency of each of the quantum wells strongly depends on device operating conditions (current and temperature); (ii) at low current and temperature levels, only the quantum well closer to the p-side has a significant emission; (iii) emission from the other quantum wells is favored at high current levels. The role of carrier injection, hole mobility, carrier density and non-radiative recombination in determining the relative intensity of the quantum wells is discussed in the text. © 2013 The Japan Society of Applied Physics.
Resumo:
The Bifurcation Interpreter is a computer program that autonomously explores the steady-state orbits of one-parameter families of periodically- driven oscillators. To report its findings, the Interpreter generates schematic diagrams and English text descriptions similar to those appearing in the science and engineering research literature. Given a system of equations as input, the Interpreter uses symbolic algebra to automatically generate numerical procedures that simulate the system. The Interpreter incorporates knowledge about dynamical systems theory, which it uses to guide the simulations, to interpret the results, and to minimize the effects of numerical error.
Resumo:
The recognition and protection of constitutional rights is a fundamental precept. In Ireland, the right to marry is provided for in the equality provisions of Article 40 of the Irish Constitution (1937). However, lesbians and gay men are denied the right to marry in Ireland. The ‘last word’ on this issue came into being in the High Court in 2006, when Katherine Zappone and Ann Louise Gilligan sought, but failed, to have their Canadian marriage recognised in Ireland. My thesis centres on this constitutional court ruling. So as to contextualise the pursuit of marriage equality in Ireland, I provide details of the Irish trajectory vis-à-vis relationship and family recognition for same-sex couples. In Chapter One, I discuss the methodological orientation of my research, which derives from a critical perspective. Chapter Two denotes my theorisation of the principle of equality and the concept of difference. In Chapter Three, I discuss the history of the institution of marriage in the West with its legislative underpinning. Marriage also has a constitutional underpinning in Ireland, which derives from Article 41 of our Constitution. In Chapter Four, I discuss ways in which marriage and family were conceptualised in Ireland, by looking at historical controversies surrounding the legalisation of contraception and divorce. Chapter Five denotes a Critical Discourse Analysis of the High Court ruling in Zappone and Gilligan. In Chapter Six, I critique text from three genres of discourse, i.e. ‘Letters to the Editor’ regarding same-sex marriage in Ireland, communication from legislators vis-à-vis the 2004 legislative impediment to same-sex marriage in Ireland, and parliamentary debates surrounding the 2010 enactment of civil partnership legislation in Ireland. I conclude my research by reflecting on my methodological and theoretical considerations with a view to answering my research questions. Author’s Update: Following the outcome of the 2015 constitutional referendum vis-à-vis Article 41, marriage equality has been realised in Ireland.
Resumo:
We have performed a retrospective analysis to evaluate the impact of age, using a 70 year cutoff, on the safety and efficacy of pegylated liposomal doxorubicin (Caelyx) given at 60 mg/m(2) every 6 weeks (treatment A) or 50 mg/m(2) every 4 weeks (treatment B) to 136 metastatic breast cancer patients in two EORTC trials, of whom 65 were 70 years of age or older. No difference in terms of toxicity was observed between younger and older patients treated with the 4-week schedule, while a higher incidence of hematological toxicity, anorexia, asthenia, and stomatitis was observed in older patients when the 6-week schedule was used. Antitumor activity was not affected by age. In the older cohort of patients, no dependence was found between the incidence of grade 3-4 toxicity or antitumor activity and patients' baseline performance status, number and severity of comorbidities, or number of concomitant medications. The higher therapeutic index of Caelyx 50 mg/m(2) every 4 weeks makes it, of the two dose schedules investigated, the preferred regimen in the elderly.
Resumo:
Liver metastases have long been known to indicate an unfavourable disease course in breast cancer (BC). However, a small subset of patients with liver metastases alone who were treated with pre-taxane chemotherapy regimens was reported to have longer survival compared with patients with liver and metastases at other sites. In the present study, we examined the clinical outcome of breast cancer patients with liver metastases alone in the context of two phase III European Organisation for Research and Treatment of Cancer (EORTC) trials which compared the efficacy of doxorubicin (A) versus paclitaxel (T) (trial 10923) and of AC (cyclophosphamide) versus AT (trial 10961), given as first-line chemotherapy in metastatic BC patients. The median follow-up for the patients with liver metastases was 90.5 months in trial 10923 and 56.6 months in trial 10961. Patients with liver metastases alone comprised 18% of all patients with liver metastases, in both the 10923 and 10961 trials. The median survival of patients with liver metastases alone and liver plus other sites of metastases were 22.7 and 14.2 months (log rank test, P=0.002) in trial 10923 and 27.1 and 16.8 months (log rank test, P=0.19) in trial 10961. The median TTP (time to progression) for patients with liver metastases alone was also longer compared with the liver plus other sites of metastases group in both trials: 10.2 versus 8.8 months (log rank test, P=0.02) in trial 10923 and 8.3 versus 6.7 months (log rank test, P=0.37) in trial 10961. Most patients with liver metastases alone have progression of their disease in their liver again (96 and 60% of patients in trials 10923 and 10961, respectively). Given the high prevalence of breast cancer, improved detection of liver metastases, encouraging survival achieved with currently available cytotoxic agents and the fact that a significant portion of patients with liver metastases alone have progression of their tumour in the liver again, a more aggressive multimodality treatment approach through prospective clinical trials seems worth exploring in this specific subset of women.