987 resultados para HTML (Language for Labelling Documents)


Relevância:

30.00% 30.00%

Publicador:

Resumo:

UANL

Relevância:

30.00% 30.00%

Publicador:

Resumo:

L'ensemble de mon travail a été réalisé grâce a l'utilisation de logiciel libre.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Thèse numérisée par la Division de la gestion de documents et des archives de l'Université de Montréal.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Thèse numérisée par la Division de la gestion de documents et des archives de l'Université de Montréal

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Thèse numérisée par la Division de la gestion de documents et des archives de l'Université de Montréal

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Mémoire numérisé par la Division de la gestion de documents et des archives de l'Université de Montréal

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Mémoire numérisé par la Division de la gestion de documents et des archives de l'Université de Montréal

Relevância:

30.00% 30.00%

Publicador:

Resumo:

This is a Named Entity Based Question Answering System for Malayalam Language. Although a vast amount of information is available today in digital form, no effective information access mechanism exists to provide humans with convenient information access. Information Retrieval and Question Answering systems are the two mechanisms available now for information access. Information systems typically return a long list of documents in response to a user’s query which are to be skimmed by the user to determine whether they contain an answer. But a Question Answering System allows the user to state his/her information need as a natural language question and receives most appropriate answer in a word or a sentence or a paragraph. This system is based on Named Entity Tagging and Question Classification. Document tagging extracts useful information from the documents which will be used in finding the answer to the question. Question Classification extracts useful information from the question to determine the type of the question and the way in which the question is to be answered. Various Machine Learning methods are used to tag the documents. Rule-Based Approach is used for Question Classification. Malayalam belongs to the Dravidian family of languages and is one of the four major languages of this family. It is one of the 22 Scheduled Languages of India with official language status in the state of Kerala. It is spoken by 40 million people. Malayalam is a morphologically rich agglutinative language and relatively of free word order. Also Malayalam has a productive morphology that allows the creation of complex words which are often highly ambiguous. Document tagging tools such as Parts-of-Speech Tagger, Phrase Chunker, Named Entity Tagger, and Compound Word Splitter are developed as a part of this research work. No such tools were available for Malayalam language. Finite State Transducer, High Order Conditional Random Field, Artificial Immunity System Principles, and Support Vector Machines are the techniques used for the design of these document preprocessing tools. This research work describes how the Named Entity is used to represent the documents. Single sentence questions are used to test the system. Overall Precision and Recall obtained are 88.5% and 85.9% respectively. This work can be extended in several directions. The coverage of non-factoid questions can be increased and also it can be extended to include open domain applications. Reference Resolution and Word Sense Disambiguation techniques are suggested as the future enhancements

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The span of writer identification extends to broad domes like digital rights administration, forensic expert decisionmaking systems, and document analysis systems and so on. As the success rate of a writer identification scheme is highly dependent on the features extracted from the documents, the phase of feature extraction and therefore selection is highly significant for writer identification schemes. In this paper, the writer identification in Malayalam language is sought for by utilizing feature extraction technique such as Scale Invariant Features Transform (SIFT).The schemes are tested on a test bed of 280 writers and performance evaluated

Relevância:

30.00% 30.00%

Publicador:

Resumo:

El llibre Hebreu: llengua, literatura i gramàtica conté una síntesi breu de la història de la llengua hebrea, un resum de la història de la literatura escrita en hebreu i una descripció panoràmica dels trets gramaticals més significatius de la llengua hebrea contemporània. Aquestes pàgines han estat pensades per a servir de complement als cursos d'hebreu de la Universitat de Girona

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Manual pensat per a servir de guia per als alumnes dels curs Hebreu Bíblic de la Universitat de Girona. Hebreu Bíblic conté una descripció de la gramàtica fonamental de la llengua hebrea bíblica en 54 lliçons. Cada lliçó presenta sols els continguts essencials de la gramàtica, que en el treball de classe és ampliat i matisat. La segona part del manual consta de 24 vocabularis i exercicis formats exclusivament per frases tretes del text de la Bíblia

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Amb aquesta publicació es pretén, d'una banda, homogeneïtzar els documents administratius que es produeixen a la UdG, tant des del punt de vista formal com textual, per contribuir a la imatge de qualitat de la UdG, i, d'altra banda, oferir un recurs a les persones de la UdG que han de redactar documents administratius, amb la intenció que produeixin textos tan eficients com sigui possible. Els diferents documents administratius es presenten classificats en tres apartats: textos dels òrgans de govern unipersonals, textos dels òrgans col·legiats i textos comuns a tots dos òrgans. De cada document se'n fa una descripció acurada, se'n presenta l'estructura i se'n donen alguns exemples. Un dels trets més importants de la publicació és que ha estat elaborada per tres serveis de la Universitat: el Servei de Llengües Modernes, la Unitat de Gestió Documental, Arxiu i Registre i l'Assessoria Jurídica, els quals hi han aportat els seus punts de vista respectius, propis de les disciplines de què tracten