875 resultados para character features
Resumo:
Author identification is the problem of identifying the author of an anonymous text or text whose authorship is in doubt from a given set of authors. The works by different authors are strongly distinguished by quantifiable features of the text. This paper deals with the attempts made on identifying the most likely author of a text in Malayalam from a list of authors. Malayalam is a Dravidian language with agglutinative nature and not much successful tools have been developed to extract syntactic & semantic features of texts in this language. We have done a detailed study on the various stylometric features that can be used to form an authors profile and have found that the frequencies of word collocations can be used to clearly distinguish an author in a highly inflectious language such as Malayalam. In our work we try to extract the word level and character level features present in the text for characterizing the style of an author. Our first step was towards creating a profile for each of the candidate authors whose texts were available with us, first from word n-gram frequencies and then by using variable length character n-gram frequencies. Profiles of the set of authors under consideration thus formed, was then compared with the features extracted from anonymous text, to suggest the most likely author.
Resumo:
Esta tesina tiene el propósito de analizar las características de algunos de lospersonajes principales de La Sombra del Viento de Carlos Ruiz Zafón desde un punto de vistade género. El estudio se basa en diferentes teorías de género que pretenden funcionar comoherramientas para poder destacar las diferencias entre las descripciones femeninas y lasmasculinas que aparecen en la obra. Primero, definimos y concretamos el término género conla ayuda de las teorías de Yvonne Hirdman. En segundo lugar, presentamos la teoría deldualismo, de acuerdo con la cual Lena Gemzöe hace una división entre las cualidadesmasculinas y femeninas. El objetivo de nuestro estudio ha sido hacer un análisis de lascaracterísticas de algunos de los personajes principales de para demostrar si existenconstrucciones de identidad de género desde una perspectiva dualista. Como resultado denuestro estudio podemos afirmar que Zafón refuerza la división entre las cualidadesmasculinas y femeninas. Los personajes masculinos son descritos como fuertes, valientes,lógicos, intelectuales e independientes. Paralelamente, las mujeres son descritas como débiles,cobardes, intuitivas y dependientes. Además, consideramos que Zafón da a todos lospersonajes masculinos mayor espacio, estatus y protagonismo en el desarrollo de la historia.En todo momento, queda claro que Zafón crea de forma inconsciente el orden de género yrefuerza así las diferencias sexuales.
Resumo:
Mode of access: Internet.
Resumo:
Mapania belongs to Mapanioideae, a quite controversial subfamily in Cyperaceae due to the existence of unusual characters in both reproductive and vegetative organs. The genus is represented by seven species in Northern Brazil but taxonomic valuable information related to the leaf organs is still unknown. The present study aimed the anatomical description of the leaf organs (either basal leaves or cataphylls and involucral bracts) of three representative Brazilian species of Mapania. Samples of cataphylls, basal leaves and involucral bracts were sectioned and stained for observations under light microscopy. The involucral bracts provide the most elucidative characters (ten) to distinguish the three species The basal leaves provides six distinguishing characters and are useful to M. macrophylla and M. pycnostachya, as they are absent in M. sylvatica. Mesophyll arrangement in the involucral bracts supports the circumscription of M. macrophylla and M. pycnostachya in M. sect. Pycnocephala and of M. sylvatica in M. sect. Mapania. Some features as thin-walled epidermal cells, stomata level and aerenchyma were considered to be adaptive to the humid environment in which the species occur. The translucent cells are here considered as aerenchyma precursors and a supportive function is assumed for the bulliform cells on the basal leaves and involucral bracts. No silica bodies were found which confirm it as a diagnostic character of Mapania among Hypolytreae genera.
Resumo:
On-line handwriting recognition has been a frontier area of research for the last few decades under the purview of pattern recognition. Word processing turns to be a vexing experience even if it is with the assistance of an alphanumeric keyboard in Indian languages. A natural solution for this problem is offered through online character recognition. There is abundant literature on the handwriting recognition of western, Chinese and Japanese scripts, but there are very few related to the recognition of Indic script such as Malayalam. This paper presents an efficient Online Handwritten character Recognition System for Malayalam Characters (OHR-M) using K-NN algorithm. It would help in recognizing Malayalam text entered using pen-like devices. A novel feature extraction method, a combination of time domain features and dynamic representation of writing direction along with its curvature is used for recognizing Malayalam characters. This writer independent system gives an excellent accuracy of 98.125% with recognition time of 15-30 milliseconds
Resumo:
This paper presents a novel approach to recognize Grantha, an ancient script in South India and converting it to Malayalam, a prevalent language in South India using online character recognition mechanism. The motivation behind this work owes its credit to (i) developing a mechanism to recognize Grantha script in this modern world and (ii) affirming the strong connection among Grantha and Malayalam. A framework for the recognition of Grantha script using online character recognition is designed and implemented. The features extracted from the Grantha script comprises mainly of time-domain features based on writing direction and curvature. The recognized characters are mapped to corresponding Malayalam characters. The framework was tested on a bed of medium length manuscripts containing 9-12 sample lines and printed pages of a book titled Soundarya Lahari writtenin Grantha by Sri Adi Shankara to recognize the words and sentences. The manuscript recognition rates with the system are for Grantha as 92.11%, Old Malayalam 90.82% and for new Malayalam script 89.56%. The recognition rates of pages of the printed book are for Grantha as 96.16%, Old Malayalam script 95.22% and new Malayalam script as 92.32% respectively. These results show the efficiency of the developed system
Resumo:
Optical Character Recognition plays an important role in Digital Image Processing and Pattern Recognition. Even though ambient study had been performed on foreign languages like Chinese and Japanese, effort on Indian script is still immature. OCR in Malayalam language is more complex as it is enriched with largest number of characters among all Indian languages. The challenge of recognition of characters is even high in handwritten domain, due to the varying writing style of each individual. In this paper we propose a system for recognition of offline handwritten Malayalam vowels. The proposed method uses Chain code and Image Centroid for the purpose of extracting features and a two layer feed forward network with scaled conjugate gradient for classification
Resumo:
The Amharic language is the Official language of over 70 million people mainly in Ethiopia. An extensive literature survey and the government report reveal no single Amharic character recognition is found in the country. The Amharic script has 33 basic characters each with seven orders giving 310 distinct characters, including numbers and punctuation symbols. The characters are visually similar; there is a typeface, but no capitalization. Beside this there is no any standard font to use the language in the computer but they use different fonts developed by different stakeholders without keeping a standard on their own way and interest and this create a problem of incompatibility between different fonts and documents.This project is to investigate the reason why Amharic optical character recognition is not addressed by local and international researchers and developers and finally to develop Amharic optical character recognition uses the features and facilities of Microsoft windows Vista or 7 using Unicode standard.
Resumo:
Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq)
Resumo:
Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq)
Resumo:
Appendices: I. A historical view of the translations and different editions of the Icelandic scriptures.--II. Poems of thanks from Iceland ... by S. J. Thorlakson.--III. An inquiry into the origin, progress, nature, and characteristic features of Icelandic poetry.
Resumo:
The purpose of this guide is to assist investigators conducting geologic hazard assessments with the understanding, detection, and characterization of surface features related to subsidence from underground coal mining. Subsidence related to underground coal mining can present serious problems to new and/or existing infrastructure, utilities, and facilities. For example, heavy equipment driving over the ground surface during construction processes may punch into voids created by sinkholes or cracks, resulting in injury to persons and property. Abandoned underground mines also may be full of water, and if punctured, can flood nearby areas. Furthermore, the integrity of rigid structures such as buildings, dams and bridges may be compromised if mining subsidence results in differential movement at the ground surface. Subsidence of the ground surface is a phenomenon associated with the removal of material at depth, and may occur coincident with mining, gradually over time, or sometimes suddenly, long after mining operations have ceased (Gray and Bruhn, 1984). The spatial limits of underground coal mines may extend for great distances beyond the surface operations of a mine, in some cases more than 10 miles for an individual mine. When conducting geologic hazard assessments, several remote investigation methods can be used to observe surface features related to underground mining subsidence. LiDAR-derived DEMs are generally the most useful method available for identifying these features because the bare earth surface can be viewed. However, due to limitations in the availability of LiDAR data, other methods often need to be considered when investigating surface features related to underground coal mining subsidence, such as Google Earth and aerial imagery. Mine maps, when available, can be viewed in tandem with these datasets, potentially improving the confidence of any possible mining subsidence-related features observed remotely. However, maps for both active and abandoned mines may be incomplete or unavailable. Therefore, it is important to be able to recognize possible surface features related to underground mining subsidence. This guide provides examples of surface subsidence features related to the two principal underground coal mining methods used in the United States: longwall mining and room and pillar mining. The depth and type of mining, geologic conditions, hydrologic conditions, and time are all factors that may influence the type of features that manifest at the surface. This guide provides investigators a basic understanding about the size, character and conditions of various surface features that occur as a result of underground mining subsidence.
Resumo:
Many attempts have been made to overcome problems involved in character recognition which have resulted in the manufacture of character reading machines. An investigation into a new approach to character recognition is described. Features for recognition are Fourier coefficients. These are generated optically by convolving characters with periodic gratings. The development of hardware to enable automatic measurement of contrast and position of periodic shadows produced by the convolution is described. Fourier coefficients of character sets were measured, many of which are tabulated. Their analysis revealed that a few low frequency sampling points could be selected to recognise sets of numerals. Limited treatment is given to show the effect of type face variations on the values of coefficients which culminated in the location of six sampling frequencies used as features to recognise numerals in two type fonts. Finally, the construction of two character recognition machines is compared and contrasted. The first is a pilot plant based on a test bed optical Fourier analyser, while the second is a more streamlined machine d(3signed for high speed reading. Reasons to indicate that the latter machine would be the most suitable to adapt for industrial and commercial applications are discussed.
Resumo:
The morphological criteria for identification of intercalated duct lesions (IDLs) of salivary glands have been defined recently. It has been hypothesised that IDL could be a precursor of basal cell adenoma (BCA). BCAs show a variety of histological patterns, and the tubular variant is the one that presents the strongest resemblance with IDLs. The aim of this study was to analyse the morphological and immunohistochemical profiles of IDLs and BCAs classified into tubular and non-tubular subtypes, to determine whether or not IDL and tubular BCA represent distinct entities. Eight IDLs, nine tubular BCAs and 19 non-tubular BCAs were studied. All tubular BCAs contained IDL-like areas, which represented 20-70% of the tumour. In non-tubular BCA, IDL-like areas were occasional and small (<5%). One patient presented IDLs, tubular BCAs and IDL/tubular BCA combined lesions. Luminal ductal cells of IDLs and tubular BCAs exhibited positivity for CK7, lysozyme, S100 and DOG1. In the non-tubular BCA group, few luminal cells exhibited such an immunoprofile; they were mainly CK14-positive. Basal/myoepithelial cells of IDLs, tubular BCAs and non-tubular BCAs were positive for CK14, calponin, α-SMA and p63; they were more numerous in BCA lesions. IDL, tubular BCA and non-tubular BCA form a continuum of lesions in which IDLs are related closely to tubular BCA. In both, the immunoprofile of luminal and myoepithelial cells recapitulates the normal intercalated duct. The difference between the adenoma-like subset of IDLs and tubular BCA rests mainly on the larger numbers of myoepithelial cells in the latter. Our findings indicate that at least some BCAs can arise via IDLs.
Resumo:
Brazilian epidemiological studies on rheumatoid arthritis are scarce, mainly in the northeast; thus many data currently available originate from the international literature. To describe demographic, clinical and serological characteristics of patients with rheumatoid arthritis (RA) followed-up by the same physician, in state of Piauí, Brazil. Data were collected between August 2010 and March 2013, in three health services of Piauí that provided health care in Rheumatology: a university-affiliated hospital, a public outpatient clinic and a private clinic. The numbers represent mean ± SD or percentage: 47.5±11.03 years-old non-Caucasian woman, non-smoker (59.2%), low educational level, mean disease duration of 7.7 years ± 7.6, and major extra-articular manifestations were rheumatoid nodules (19.4%) and sicca syndrome (46.9%). Features of rheumatoid arthritis obtained in this study are similar to those found in some national and international studies, but we observed higher female preponderance and illiteracy rate, in addition to a moderately severe erosive disease on average, with frequent sicca and other extra-articular manifestations.