15 resultados para Tamil language

em Cochin University of Science


Relevância:

20.00% 20.00%

Publicador:

Resumo:

This work is aimed at building an adaptable frame-based system for processing Dravidian languages. There are about 17 languages in this family and they are spoken by the people of South India.Karaka relations are one of the most important features of Indian languages. They are the semabtuco-syntactic relations between verbs and other related constituents in a sentence. The karaka relations and surface case endings are analyzed for meaning extraction. This approach is comparable with the borad class of case based grammars.The efficiency of this approach is put into test in two applications. One is machine translation and the other is a natural language interface (NLI) for information retrieval from databases. The system mainly consists of a morphological analyzer, local word grouper, a parser for the source language and a sentence generator for the target language. This work make contributios like, it gives an elegant account of the relation between vibhakthi and karaka roles in Dravidian languages. This mapping is elegant and compact. The same basic thing also explains simple and complex sentence in these languages. This suggests that the solution is not just ad hoc but has a deeper underlying unity. This methodology could be extended to other free word order languages. Since the frame designed for meaning representation is general, they are adaptable to other languages coming in this group and to other applications.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

. The cotton mill industry is one of the important medium and large-scale industries in the State of Kerala. Due to the widespread development of the handloom industry in the State, there is an environment conducive to the growth of cotton spinning mills which produce yarn, the raw material required by the handloom industry. New spin— ing mills are being commissioned. But the performance of the existing cotton spinning and weaving mills in the State is not quite satisfactory. Hence an analysis has been carried out into the profitability and financial position of the industry in Kerala. The objective of the study is to make a financial analysis of the industry covering various aspects such as cost structure, productivity, asset structure, financial structure and working capital management.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

This is a Named Entity Based Question Answering System for Malayalam Language. Although a vast amount of information is available today in digital form, no effective information access mechanism exists to provide humans with convenient information access. Information Retrieval and Question Answering systems are the two mechanisms available now for information access. Information systems typically return a long list of documents in response to a user’s query which are to be skimmed by the user to determine whether they contain an answer. But a Question Answering System allows the user to state his/her information need as a natural language question and receives most appropriate answer in a word or a sentence or a paragraph. This system is based on Named Entity Tagging and Question Classification. Document tagging extracts useful information from the documents which will be used in finding the answer to the question. Question Classification extracts useful information from the question to determine the type of the question and the way in which the question is to be answered. Various Machine Learning methods are used to tag the documents. Rule-Based Approach is used for Question Classification. Malayalam belongs to the Dravidian family of languages and is one of the four major languages of this family. It is one of the 22 Scheduled Languages of India with official language status in the state of Kerala. It is spoken by 40 million people. Malayalam is a morphologically rich agglutinative language and relatively of free word order. Also Malayalam has a productive morphology that allows the creation of complex words which are often highly ambiguous. Document tagging tools such as Parts-of-Speech Tagger, Phrase Chunker, Named Entity Tagger, and Compound Word Splitter are developed as a part of this research work. No such tools were available for Malayalam language. Finite State Transducer, High Order Conditional Random Field, Artificial Immunity System Principles, and Support Vector Machines are the techniques used for the design of these document preprocessing tools. This research work describes how the Named Entity is used to represent the documents. Single sentence questions are used to test the system. Overall Precision and Recall obtained are 88.5% and 85.9% respectively. This work can be extended in several directions. The coverage of non-factoid questions can be increased and also it can be extended to include open domain applications. Reference Resolution and Word Sense Disambiguation techniques are suggested as the future enhancements

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Low-lying coastal areas are more vulnerable to the impacts of climate change as they are highly prone for inundation to SLR (Sea-Level Rise). This study presents an appraisal of the impacts of SLR on the coastal natural resources and its dependent social communities in the low-lying area of VellareColeroon estuarine region of the Tamil Nadu coast, India. Digital Elevation Model (DEM) derived from SRTM 90M (Shuttle Radar Topographic Mission) data, along with GIS (Geographic Information System) techniques are used to identify an area of inundation in the study site. The vulnerability of coastal areas in Vellar-Coleroon estuarine region of Tamil Nadu coast to inundation was calculated based on the projected SLR scenarios of 0.5 m and 1 m. The results demonstrated that about 1570 ha of the LULC (Land use and Land cover) of the study area would be permanently inundated to 0.5 m and 2407 ha for 1 m SLR and has also resulted in the loss of three major coastal natural resources like coastal agriculture, mangroves and aquaculture. It has been identified that six hamlets of the social communities who depend on these resources are at high-risk and vulnerable to 0.5 m SLR and 12 hamlets for 1 m SLR. From the study, it has been emphasized that mainstreaming adaptation options to SLR should be embedded within a coastal zone management and planning effort, which includes all coastal natural resources (ecosystem-based adaptation), and its dependent social communities (community-based adaptation) involved through capacity building

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Handwritten character recognition is always a frontier area of research in the field of pattern recognition and image processing and there is a large demand for OCR on hand written documents. Even though, sufficient studies have performed in foreign scripts like Chinese, Japanese and Arabic characters, only a very few work can be traced for handwritten character recognition of Indian scripts especially for the South Indian scripts. This paper provides an overview of offline handwritten character recognition in South Indian Scripts, namely Malayalam, Tamil, Kannada and Telungu

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Malayalam is one of the 22 scheduled languages in India with more than 130 million speakers. This paper presents a report on the development of a speaker independent, continuous transcription system for Malayalam. The system employs Hidden Markov Model (HMM) for acoustic modeling and Mel Frequency Cepstral Coefficient (MFCC) for feature extraction. It is trained with 21 male and female speakers in the age group ranging from 20 to 40 years. The system obtained a word recognition accuracy of 87.4% and a sentence recognition accuracy of 84%, when tested with a set of continuous speech data.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

A connected digit speech recognition is important in many applications such as automated banking system, catalogue-dialing, automatic data entry, automated banking system, etc. This paper presents an optimum speaker-independent connected digit recognizer forMalayalam language. The system employs Perceptual Linear Predictive (PLP) cepstral coefficient for speech parameterization and continuous density Hidden Markov Model (HMM) in the recognition process. Viterbi algorithm is used for decoding. The training data base has the utterance of 21 speakers from the age group of 20 to 40 years and the sound is recorded in the normal office environment where each speaker is asked to read 20 set of continuous digits. The system obtained an accuracy of 99.5 % with the unseen data.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The span of writer identification extends to broad domes like digital rights administration, forensic expert decisionmaking systems, and document analysis systems and so on. As the success rate of a writer identification scheme is highly dependent on the features extracted from the documents, the phase of feature extraction and therefore selection is highly significant for writer identification schemes. In this paper, the writer identification in Malayalam language is sought for by utilizing feature extraction technique such as Scale Invariant Features Transform (SIFT).The schemes are tested on a test bed of 280 writers and performance evaluated

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Concentration levels of Cr, Ni, Zn, Pb and Cu in relation to those of the nutrients - total phosphates, exchangeable nitrates, total organic carbon, etc. have been investigated in the sediments of Nagapattinam beach after the 2004 tsunami. The maximum values in the study area were 3204, 75, 71, 57 and 18.5 ug g-l for Cr, Ni, Zn, Pb and Cu respectively; Cd was below detectable level. All the trace elements were relatively high in the near-shore sediments and the distribution pattern of the metals in the study area was in the order: Cr > Ni > Zn > Pb > Cu. The present study shows that the tsunami has brought the clayey sediments from the sea-bottom that were settled for years together in inland areas as well as from the offshore sediments. The event has changed the chemical composition of the beach sediments and is threatening fishing grounds even in trace concentrations