956 results for Semen characters


Relevance:

10.00% 10.00%

Publisher:

Abstract:

This paper describes the efforts at MILE lab, IISc, to create a 100,000-word database each in Kannada and Tamil for the design and development of online handwriting recognition. The data has been collected from over 600 users in order to capture the variations in writing style. We describe features of the scripts and how the number of symbols was reduced so that recognizers can be trained effectively. The list of words includes all the characters, Kannada and Indo-Arabic numerals, punctuation marks and other symbols. A semi-automated tool for the annotation of data from stroke to word level is used. It segments each word into stroke groups and also acts as a validation mechanism for segmentation. The tool displays the strokes, stroke groups and aksharas of a word and hence can be used to study the various styles of writing and delayed strokes, and to assign quality tags to the words. The tool is currently being used for annotating Tamil and Kannada data. The output is stored in a standard XML format.

Relevance:

10.00% 10.00%

Publisher:

Abstract:

We present a fractal coding method to recognize online handwritten Tamil characters and propose a novel technique to reduce the time taken for coding and decoding. This technique exploits the redundancy in data, thereby achieving better compression and lower memory usage. It also reduces the encoding time and causes little distortion during reconstruction. Experiments have been conducted to use these fractal codes to classify the online handwritten Tamil characters from the IWFHR 2006 competition dataset. In one approach, we use the fractal coding and decoding process. A recognition accuracy of 90% has been achieved by using DTW for distortion evaluation during the classification and encoding processes, as compared to 78% using a nearest neighbor classifier. In other experiments, we use the fractal code, fractal dimensions and features derived from fractal codes as features in separate classifiers. While the fractal code is successful as a feature, the other two features are not able to capture the wide within-class variations.
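The abstract above uses DTW (dynamic time warping) to measure distortion. For readers unfamiliar with it, here is a minimal sketch of the standard DTW distance together with a nearest-prototype decision. It assumes one-dimensional numeric sequences and an absolute-difference local cost; the abstract does not state the features or cost used in the paper, and the names `dtw_distance` and `classify` are hypothetical.

```python
def dtw_distance(a, b):
    """Standard dynamic time warping distance between two numeric sequences.

    Assumption: the local cost is the absolute difference between points;
    the abstract does not say which cost the authors use.
    """
    n, m = len(a), len(b)
    inf = float("inf")
    # cost[i][j] = minimal cumulative cost of aligning a[:i] with b[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = abs(a[i - 1] - b[j - 1])
            # extend the cheapest of the three allowed predecessor alignments
            cost[i][j] = d + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])
    return cost[n][m]


def classify(sample, prototypes):
    """Return the label of the prototype closest to `sample` under DTW.

    `prototypes` is a list of (label, sequence) pairs (hypothetical layout).
    """
    return min(prototypes, key=lambda p: dtw_distance(sample, p[1]))[0]
```

Because DTW allows one point to align with several, a sequence and a time-stretched copy of it can have distance zero, which is the property that makes it attractive for handwriting written at varying speeds.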

Relevance:

10.00% 10.00%

Publisher:

Abstract:

In this paper, we study different methods for prototype selection for recognizing handwritten characters of Tamil script. In the first method, cumulative pairwise distances of the training samples of a given class are used to select prototypes. In the second method, cumulative distance to allographs of different orientation is used as a criterion to decide if the sample is representative of the group. The latter method is presumed to offset the possible orientation effect. This method still uses a fixed number of prototypes for each of the classes. Finally, a prototype set growing algorithm is proposed, with a view to better modelling the differences in complexity of different character classes. The proposed algorithms are tested and compared for both writer-independent and writer-adaptation scenarios.
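The first method above can be illustrated with a small sketch. It assumes that the most representative samples are those with the smallest cumulative distance to all other samples of the class, and that samples are equal-length feature vectors compared with Euclidean distance. Neither assumption is confirmed by the abstract, and `select_prototypes` is a hypothetical name.

```python
import math


def euclidean(u, v):
    """Euclidean distance between two equal-length feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(u, v)))


def select_prototypes(samples, k, dist=euclidean):
    """Pick the k samples of one class with the smallest cumulative distance.

    Assumption: a low sum of distances to all other samples of the class
    marks a sample as representative; the paper's exact criterion may differ.
    """
    totals = [sum(dist(s, t) for t in samples) for s in samples]
    order = sorted(range(len(samples)), key=lambda i: totals[i])
    return [samples[i] for i in order[:k]]
```

With the samples `(0.0,), (1.0,), (2.0,), (3.0,), (10.0,)` and `k=1`, the sketch picks `(2.0,)`, and the outlier `(10.0,)` is the last candidate to be chosen as `k` grows.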

Relevance:

10.00% 10.00%

Publisher:

Abstract:

This paper presents the preliminary analysis of Kannada WordNet and the set of relevant computational tools. Although the design has been inspired by the famous English WordNet and, to a certain extent, by the Hindi WordNet, the unique features of Kannada WordNet are graded antonyms and meronymy relationships, nominal as well as verbal compoundings, complex verb constructions and an efficient underlying database design (designed to handle storage and display of Kannada Unicode characters). Kannada WordNet would not only add to the sparse collection of machine-readable Kannada dictionaries, but would also give new insights into the Kannada vocabulary. It provides a sufficient interface for applications involved in Kannada machine translation, spell checking and semantic analysis.

Relevance:

10.00% 10.00%

Publisher:

Abstract:

A new species of the shrub frog genus Raorchestes Biju, Souche, Dubois, Dutta and Bossuyt is described as Raorchestes kakachi sp. nov. from the Agastyamalai hill region in the southern Western Ghats, India. The small-sized Raorchestes (male: 24.7–25.8 mm, n = 3 and female: 24.3–34.1 mm, n = 3) is distinguished from all other known congeners by the following suite of characters. Snout oval in dorsal view; tympanum indistinct; head wider than long; moderate webbing in feet; colour on dorsum varying from ivory to brown, blotches of dark brown on flanks, brown mottling on throat reducing towards vent; inner and outer surface of thigh, inner surface of shank and inner surface of tarsus with a distinct dark brown horizontal band which extends up to the first three toes on the upper surface. A detailed description, advertisement call features, ecology, natural history notes and comparison with closely related species are provided for the new species.

Relevance:

10.00% 10.00%

Publisher:

Abstract:

The Indian region is presently the second region after the Neotropics in terms of diversity of phalangopsid crickets. Yet their study is impeded by the lack of necessary taxonomic tools for taxon identification. In the present paper, all generic diagnoses are clarified, using morphological and genitalic characters; female genitalia are described and illustrated for all genera with known females. New taxa are described from southern India: Kempiola flavipunctatus Desutter-Grandcolas n. sp., Opiliosina meridionalis Desutter-Grandcolas n. gen., n. sp., Phalangopsina bolivari Desutter-Grandcolas n. sp., P. chopardi Desutter-Grandcolas n. sp., P. gravelyi Desutter-Grandcolas n. sp., and Speluncasina Desutter-Grandcolas n. gen. The list of phalangopsid crickets from the Indian Region is updated, and a key to phalangopsid genera is proposed. A lectotype and a paralectotype are designated to fix the name of Phalangopsina dubia (Bolivar, 1900). Opilionacris annandalei Chopard, 1928, previously transferred to the African genus Phaeophilacris Walker, 1871, is transferred to the genus Speluncasina Desutter-Grandcolas n. gen., while Larandopsis jharnae Bhowmik, 1981 and L. newguineae Bhowmik, 1981 described from New Guinea are transferred to the eneopterine genus Lebinthus Stal, 1877. Finally, Luzaropsis confusa Chopard, 1969 is removed from its synonymy with L. ferruginea Walker, 1871.

Relevance:

10.00% 10.00%

Publisher:

Abstract:

A palindrome is a sequence of characters that reads the same forwards and backwards. Since the discovery of palindromic peptide sequences two decades ago, little effort has been made to understand their structural, functional and evolutionary significance. Therefore, an algorithm has been developed to identify all perfect palindromes (excluding the palindromic subset and tandem repeats) in a single protein sequence. The proposed algorithm does not impose any restriction on the number of residues in the input sequence. This avant-garde algorithm will aid in the identification of palindromic peptide sequences of varying lengths in a single protein sequence.
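The abstract does not give the algorithm itself, so the following is only a sketch of one way to find perfect palindromes in a residue string, using the common expand-around-center technique. It makes two labelled assumptions: "palindromic subset" is read as a shorter palindrome nested around the same center as a longer one, and "tandem repeats" as runs of a single repeated residue. The name `find_palindromes` and the `min_len` parameter are hypothetical.

```python
def find_palindromes(seq, min_len=3):
    """Return (start, palindrome) pairs for maximal perfect palindromes.

    Assumptions (not from the paper): palindromes nested around the same
    center as a longer one are skipped by expanding each center fully, and
    runs of one repeated residue (e.g. "AAAA") count as tandem repeats and
    are dropped. Start positions are 0-based.
    """
    results = []
    n = len(seq)
    # 2n - 1 centers: every residue (odd length) and every gap (even length)
    for center in range(2 * n - 1):
        left = center // 2
        right = left + center % 2
        while left >= 0 and right < n and seq[left] == seq[right]:
            left -= 1
            right += 1
        start, end = left + 1, right  # seq[start:end] is maximal for this center
        pal = seq[start:end]
        if len(pal) >= min_len and len(set(pal)) > 1:
            results.append((start, pal))
    return results
```

For example, `find_palindromes("MKAYAKL")` reports the single palindrome `KAYAK` starting at position 1. The quadratic worst case is usually acceptable for single protein sequences, and no limit on sequence length is imposed.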

Relevance:

10.00% 10.00%

Publisher:

Abstract:

This paper describes a semi-automatic tool for annotation of multi-script text from natural scene images. To our knowledge, this is the first tool that deals with multi-script text of arbitrary orientation. The procedure involves manual seed selection followed by a region growing process to segment each word present in the image. The threshold for region growing can be varied by the user so as to ensure pixel-accurate character segmentation. The text present in the image is tagged word by word. A virtual keyboard interface has also been designed for entering the ground truth in ten Indic scripts, besides English. The keyboard interface can easily be generated for any script, thereby expanding the scope of the toolkit. Optionally, each segmented word can further be labeled into its constituent characters/symbols. Polygonal masks are used to split or merge the segmented words into valid characters/symbols. The ground truth is represented by a pixel-level segmented image and a '.txt' file that contains information about the number of words in the image, word bounding boxes, script and ground truth Unicode. The toolkit, developed using MATLAB, can be used to generate ground truth and annotation for any generic document image. Thus, it is useful for researchers in the document image processing community for evaluating the performance of document analysis and recognition techniques. The multi-script annotation toolkit (MAST) is available for free download.
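Seeded region growing with a user-adjustable threshold, as described above, can be sketched as follows. This is not the MAST implementation (which is in MATLAB); it assumes a grayscale image stored as a list of rows, 4-connectivity, and a similarity test against the seed intensity, none of which the abstract specifies. The name `region_grow` is hypothetical.

```python
from collections import deque


def region_grow(image, seed, threshold):
    """Grow a region from `seed` (row, col) over a grayscale image.

    Assumptions (not from the paper): a pixel joins the region when its
    intensity differs from the seed intensity by at most `threshold`, and
    neighbours are the four pixels sharing an edge.
    """
    rows, cols = len(image), len(image[0])
    seed_value = image[seed[0]][seed[1]]
    region = {seed}
    queue = deque([seed])
    while queue:
        r, c = queue.popleft()
        for nr, nc in ((r - 1, c), (r + 1, c), (r, c - 1), (r, c + 1)):
            inside = 0 <= nr < rows and 0 <= nc < cols
            if inside and (nr, nc) not in region:
                if abs(image[nr][nc] - seed_value) <= threshold:
                    region.add((nr, nc))
                    queue.append((nr, nc))
    return region
```

Raising the threshold enlarges the region, which mirrors how a user might tune it until the segmented word is pixel-accurate. Pixels with a similar intensity that are not connected to the seed are left out.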

Relevance:

10.00% 10.00%

Publisher:

Abstract:

We propose a set of metrics that evaluate the uniformity, sharpness, continuity, noise, stroke width variance, pulse width ratio, transient pixel density, entropy and variance of components to quantify the quality of a document image. The measures are intended to be used in any optical character recognition (OCR) engine to estimate a priori the expected performance of the OCR. The suggested measures have been evaluated on many document images in different scripts. The quality of a document image is manually annotated by users to create a ground truth. The idea is to correlate the values of the measures with the user-annotated data. If the calculated measure matches the annotated description, then the metric is accepted; otherwise it is rejected. Of the metrics proposed, some are accepted and the rest are rejected. We have defined metrics that are easy to estimate. The metrics proposed in this paper are based on the feedback of home-grown OCR engines for Indic (Tamil and Kannada) languages. The metrics are independent of the scripts, and depend only on the quality and age of the paper and the printing. Experiments and results for each proposed metric are discussed. Actual recognition of the printed text is not performed to evaluate the proposed metrics. Sometimes, a document image containing broken characters is rated as a good document image by the evaluated metrics, which remains an unsolved challenge. The proposed measures work on grayscale document images and fail to provide reliable information on binarized document images.
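The accept-or-reject step described above could be sketched as a correlation test between a metric and the user annotations. The abstract does not say how the match is measured, so the use of the Pearson coefficient, the 0.7 cut-off, and the names `pearson` and `accept_metric` are all assumptions made for illustration.

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length numeric lists."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sd_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    sd_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    if sd_x == 0 or sd_y == 0:
        return 0.0  # a constant series carries no correlation information
    return cov / (sd_x * sd_y)


def accept_metric(metric_values, user_scores, min_corr=0.7):
    """Accept a metric if it tracks the user annotations closely enough.

    Assumption: `min_corr` is an illustrative cut-off, not from the paper.
    """
    return abs(pearson(metric_values, user_scores)) >= min_corr
```

The absolute value is taken so that a metric which falls as quality rises (noise, for instance) can still be accepted.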

Relevance:

10.00% 10.00%

Publisher:

Abstract:

Recent work on molecular phylogenetics of Scolopendridae from the Western Ghats, Peninsular India, has suggested the presence of six cryptic species of the otostigmine Digitipes Attems, 1930, together with three species described in previous taxonomic work by Jangi and Dass (1984). Digitipes is the correct generic attribution for a monophyletic group of Indian species, these being united with three species from tropical Africa (including the type) that share a distomedial process on the ultimate leg femur of males that is otherwise unknown in Otostigminae. Second maxillary characters previously used in the diagnosis of Digitipes are dismissed because Indian species do not possess the putatively diagnostic character states. Two new species from the Western Ghats that correspond to groupings identified based on monophyly, sequence divergence and coalescent analysis using molecular data are diagnosed based on distinct morphological characters. They are D. jangii and D. periyarensis n. spp. Three species named by Jangi and Dass (Digitipes barnabasi, D. coonoorensis and D. indicus) are revised based on new collections; D. indicus is a junior subjective synonym of Arthrorhabdus jonesii Verhoeff, 1938, the combination becoming Digitipes jonesii (Verhoeff, 1938) n. comb. The presence of Arthrorhabdus in India is accordingly refuted. Three putative species delimited by molecular and ecological data remain cryptic from the perspective of diagnostic morphological characters and are presently retained in D. barnabasi, D. jangii and D. jonesii. A molecularly-delimited species that resolved as sister group to a well-supported clade of Indian Digitipes is identified as Otostigmus ruficeps Pocock, 1890, originally described from a single specimen and revised herein. One Indian species originally assigned to Digitipes, D. gravelyi, deviates from confidently-assigned Digitipes with respect to several characters and is reassigned to Otostigmus, as O. gravelyi (Jangi and Dass, 1984) n. comb.

Relevance:

10.00% 10.00%

Publisher:

Abstract:

A new species of montane toad of the genus Duttaphrynus is described from Nagaland state in Northeast India. The new species is diagnosable based on the following combination of characters: absence of preorbital, postorbital and orbitotympanic ridges, elongated and broad parotoid gland, first finger longer than second and presence of a mid-dorsal line. The tympanum is hidden under a skin fold (in the male) or absent (in the female). The species is compared with its congeners from India and Indo-China. We propose to consider Duttaphrynus wokhaensis a junior synonym of Duttaphrynus melanostictus.

Relevance:

10.00% 10.00%

Publisher:

Abstract:

Sepsophis punctatus Beddome 1870, the only species of a monotypic genus, was described based on a single specimen from the Eastern Ghats of India. We rediscovered the species based on specimens from the states of Odisha and Andhra Pradesh, India, after a gap of 137 years, including four specimens from close to the type locality. The holotype was studied in detail, and we present additional morphological characters of the species with details on natural history, habitat and diet. The morphological characters of the holotype, along with two additional specimens collected by Beddome, are compared with the specimens collected by us. We also briefly discuss the distribution of other members of the subfamily Scincinae and their evolutionary affinities.

Relevance:

10.00% 10.00%

Publisher:

Abstract:

A new species of lygosomatine scincid lizard is described from the sacred forests of Mawphlang, in Meghalaya, northeastern India. Sphenomorphus apalpebratus sp. nov. possesses a spectacle or brille, an unusual feature within the Scincidae, and a first for the paraphyletic genus Sphenomorphus. The new species is compared with other members of the genus to which it is here assigned, as well as to members of the lygosomatine genera Lipinia and Scincella from mainland India, the Andaman and Nicobar Islands, and south-east Asia, to which it also bears resemblance. The new taxon is diagnosable in exhibiting the following combination of characters: small body size (SVL to 42.0 mm); moveable eyelids absent; auricular opening scaleless, situated in a shallow depression; dorsal scales show a line of demarcation along posterior edge of ventral pes; midbody scale rows 27-28; longitudinal scale rows between parietals and base of tail 62-64; lamellae under toe IV 8-9; supraoculars five; supralabials 5-6; infralabials 4-5; subcaudals 92; and dorsum golden brown, except at dorsal margin of lateral line, which is lighter, with four faintly spotted lines, two along each side of vertebral row of scales, that extend to tail base. The new species differs from its congeners in the lack of moveable eyelids, a character shared with several distantly related scincid genera.

Relevance:

10.00% 10.00%

Publisher:

Abstract:

Reproductive modes are diverse and unique in anurans. Such diverse reproductive modes are attributed to selective pressures of evolution, ecology and environment. Globally, forty different reproductive modes in anurans have been described to date. The genus Nyctibatrachus has been recently revised and belongs to an ancient lineage of frog families in the Western Ghats of India. Species of this genus are known to exhibit mountain-associated clade endemism and novel breeding behaviours. The purpose of this study is to present unique reproductive behaviour, oviposition and parental care in a new species, Nyctibatrachus kumbara sp. nov., which is described in the paper. Nyctibatrachus kumbara sp. nov. is a medium-sized, stream-dwelling frog. It is distinct from its congeners based on a suite of morphological characters and is substantially divergent in DNA sequences of the mitochondrial 16S rRNA gene. Males exhibit parental care by mud packing the egg clutch. Such parental care has so far not been described from any other frog species worldwide. Besides this, we emphasize that three co-occurring congeneric species of Nyctibatrachus from the study site, namely N. jog, N. kempholeyensis and Nyctibatrachus kumbara sp. nov., differ in breeding behaviour, which could represent a case of reproductive character displacement. These three species are distinct in their size, call pattern, reproductive behaviour, maximum number of eggs in a clutch, oviposition and parental care, as was evident from the statistical analysis. The study sheds light on the reproductive behaviour of Nyctibatrachus kumbara sp. nov. and associated species to understand the evolution and adaptation of reproductive modes of anurans in general, and of Nyctibatrachus from the Western Ghats in particular.

Relevance:

10.00% 10.00%

Publisher:

Abstract:

In this article, we aim to reduce the error rate of the online Tamil symbol recognition system by employing multiple experts to reevaluate certain decisions of the primary support vector machine classifier. Motivated by the relatively high percentage of occurrence of base consonants in the script, a reevaluation technique has been proposed to correct any ambiguities arising in the base consonants. Secondly, a dynamic time-warping method is proposed to automatically extract the discriminative regions for each set of confused characters. Class-specific features derived from these regions aid in reducing the degree of confusion. Thirdly, statistics of specific features are proposed for resolving any confusions in vowel modifiers. The reevaluation approaches are tested on two databases: (a) the isolated Tamil symbols in the IWFHR test set, and (b) the symbols segmented from a set of 10,000 Tamil words. The recognition rate of the isolated test symbols of the IWFHR database improves by 1.9%. For the word database, the incorporation of the reevaluation step improves the symbol recognition rate by 3.5 percentage points (from 88.4% to 91.9%). This, in turn, boosts the word recognition rate by 11.9 percentage points (from 65.0% to 76.9%). The reduction in the word error rate has been achieved using a generic approach, without the incorporation of language models.