19 resultados para Scheduled caste
Resumo:
This is a Named Entity Based Question Answering System for Malayalam Language. Although a vast amount of information is available today in digital form, no effective information access mechanism exists to provide humans with convenient information access. Information Retrieval and Question Answering systems are the two mechanisms available now for information access. Information systems typically return a long list of documents in response to a user’s query which are to be skimmed by the user to determine whether they contain an answer. But a Question Answering System allows the user to state his/her information need as a natural language question and receives most appropriate answer in a word or a sentence or a paragraph. This system is based on Named Entity Tagging and Question Classification. Document tagging extracts useful information from the documents which will be used in finding the answer to the question. Question Classification extracts useful information from the question to determine the type of the question and the way in which the question is to be answered. Various Machine Learning methods are used to tag the documents. Rule-Based Approach is used for Question Classification. Malayalam belongs to the Dravidian family of languages and is one of the four major languages of this family. It is one of the 22 Scheduled Languages of India with official language status in the state of Kerala. It is spoken by 40 million people. Malayalam is a morphologically rich agglutinative language and relatively of free word order. Also Malayalam has a productive morphology that allows the creation of complex words which are often highly ambiguous. Document tagging tools such as Parts-of-Speech Tagger, Phrase Chunker, Named Entity Tagger, and Compound Word Splitter are developed as a part of this research work. No such tools were available for Malayalam language. Finite State Transducer, High Order Conditional Random Field, Artificial Immunity System Principles, and Support Vector Machines are the techniques used for the design of these document preprocessing tools. This research work describes how the Named Entity is used to represent the documents. Single sentence questions are used to test the system. Overall Precision and Recall obtained are 88.5% and 85.9% respectively. This work can be extended in several directions. The coverage of non-factoid questions can be increased and also it can be extended to include open domain applications. Reference Resolution and Word Sense Disambiguation techniques are suggested as the future enhancements
Resumo:
The customary laws of Union Territory of Lakshadweep islands are a challenge for judicial institution as well as administrative machinery. With the peculiarities of socio-legal institutions, Lakshadweep system stands apart from the mainstream of legal systems in India. How far do the charismatic modernisation trends flowing into the Lakshadweep society affect the people already protected by the uncodified laws of the past? Many are the issues at this stage. This study analyses them. It examines the growth, evolution and development of the legal system in the islands vis-a-vis the administrative mechanism imposed by the mainland ethos and culture.
Resumo:
Malayalam is one of the 22 scheduled languages in India with more than 130 million speakers. This paper presents a report on the development of a speaker independent, continuous transcription system for Malayalam. The system employs Hidden Markov Model (HMM) for acoustic modeling and Mel Frequency Cepstral Coefficient (MFCC) for feature extraction. It is trained with 21 male and female speakers in the age group ranging from 20 to 40 years. The system obtained a word recognition accuracy of 87.4% and a sentence recognition accuracy of 84%, when tested with a set of continuous speech data.
Resumo:
The study is focused on education of tribes particularly the problem of high dropout rate existing among the tribal students at school level. Scheduled Tribe is one of the marginalized communities experiencing high level of educational deprivation. The analysis of the study shows that the extent of deprivation existing among STs of Kerala is much higher compared to that of other communities. The present study covered tribes of three tribal predominant districts of Kerala such as Idukki, Palakkad and Wayanad. Out of the 35 tribal communities in the State, 17 of them are concentrated in these districts. Tribes concentrated in Idukki include Muthuvans, Malai Arayan, Uraly, Mannan and Hill Pulaya. The present study analyzed dropouts situation in tribal areas of Kerala by conducting Field Survey among dropout and non-dropout students at school level. High dropouts among STs persist due to many problems which are of structural in nature. Important problems faced by the tribal students that have been analyzed, this can be classified as economic, social, cultural and institutional. It is found that there exists high correlation between Income and expenditure of the family with the well-being of individuals. Significant economic factors are poverty and financial indebtedness of the family. Some of the common cultural factors of tribes are Nature of Habitation, Difference in Dialect and Medium of Instruction etc. Social factors analyzed in the study are illiteracy of parents, migration of family, family environment, motivation by parents, activities engaged in for helping the family and students’ lack of interest in studies. The analysis showed that all these factors except migration of the family, are affecting the education of tribal students. Apart from social, economic and cultural factors, there are a few institutional factors which will also influence the education of tribal students. Institutional factors analyzed in the study include students’ absenteeism, irregularity of teachers, attitude of non-tribal teachers and non-tribal students, infrastructure facilities and accessibility to school. The study found irregularity of students and accessibility to school as significant factors which determine the dropout of the students.