3 resultados para Hierarchy of text classifiers
em Central European University - Research Support Scheme
Resumo:
Mr. Kubon's project was inspired by the growing need for an automatic, syntactic analyser (parser) of Czech, which could be used in the syntactic processing of large amounts of texts. Mr. Kubon notes that such a tool would be very useful, especially in the field of corpus linguistics, where creating a large-scale "tree bank" (a collection of syntactic representations of natural language sentences) is a very important step towards the investigation of the properties of a given language. The work involved in syntactically parsing a whole corpus in order to get a representative set of syntactic structures would be almost inconceivable without the help of some kind of robust (semi)automatic parser. The need for the automatic natural language parser to be robust increases with the size of the linguistic data in the corpus or in any other kind of text which is going to be parsed. Practical experience shows that apart from syntactically correct sentences, there are many sentences which contain a "real" grammatical error. These sentences may be corrected in small-scale texts, but not generally in the whole corpus. In order to be able to complete the overall project, it was necessary to address a number of smaller problems. These were; 1. the adaptation of a suitable formalism able to describe the formal grammar of the system; 2. the definition of the structure of the system's dictionary containing all relevant lexico-syntactic information, and the development of a formal grammar able to robustly parse Czech sentences from the test suite; 3. filling the syntactic dictionary with sample data allowing the system to be tested and debugged during its development (about 1000 words); 4. the development of a set of sample sentences containing a reasonable amount of grammatical and ungrammatical phenomena covering some of the most typical syntactic constructions being used in Czech. Number 3, building a formal grammar, was the main task of the project. The grammar is of course far from complete (Mr. Kubon notes that it is debatable whether any formal grammar describing a natural language may ever be complete), but it covers the most frequent syntactic phenomena, allowing for the representation of a syntactic structure of simple clauses and also the structure of certain types of complex sentences. The stress was not so much on building a wide coverage grammar, but on the description and demonstration of a method. This method uses a similar approach as that of grammar-based grammar checking. The problem of reconstructing the "correct" form of the syntactic representation of a sentence is closely related to the problem of localisation and identification of syntactic errors. Without a precise knowledge of the nature and location of syntactic errors it is not possible to build a reliable estimation of a "correct" syntactic tree. The incremental way of building the grammar used in this project is also an important methodological issue. Experience from previous projects showed that building a grammar by creating a huge block of metarules is more complicated than the incremental method, which begins with the metarules covering most common syntactic phenomena first, and adds less important ones later, especially from the point of view of testing and debugging the grammar. The sample of the syntactic dictionary containing lexico-syntactical information (task 4) now has slightly more than 1000 lexical items representing all classes of words. During the creation of the dictionary it turned out that the task of assigning complete and correct lexico-syntactic information to verbs is a very complicated and time-consuming process which would itself be worth a separate project. The final task undertaken in this project was the development of a method allowing effective testing and debugging of the grammar during the process of its development. The problem of the consistency of new and modified rules of the formal grammar with the rules already existing is one of the crucial problems of every project aiming at the development of a large-scale formal grammar of a natural language. This method allows for the detection of any discrepancy or inconsistency of the grammar with respect to a test-bed of sentences containing all syntactic phenomena covered by the grammar. This is not only the first robust parser of Czech, but also one of the first robust parsers of a Slavic language. Since Slavic languages display a wide range of common features, it is reasonable to claim that this system may serve as a pattern for similar systems in other languages. To transfer the system into any other language it is only necessary to revise the grammar and to change the data contained in the dictionary (but not necessarily the structure of primary lexico-syntactic information). The formalism and methods used in this project can be used in other Slavic languages without substantial changes.
Resumo:
Marina Katnic-Bakarsic. Linguistic Stylistics The practical, i.e. educational, objective of this research was to produce lectures on linguistic stylistics for the students of Sarajevo University, while the theoretical one was to produce a monograph on the subject. This monograph, which can also be used as a university textbook, includes twenty-nine chapters, an index of topics, a bibliography and a list of sources. The theoretical postulates are followed by examples from texts in various functional styles in Bosnian, Croatian, Serbian, and in some cases Russian or English. Linguo-stylistic problems were investigated from both the structuralist and post-structuralist points of view. Linguistic stylistics is therefore understood as a discipline which studies expressive, stylistically marked language units on all language levels, functional-stylistic language variation and various aspects of intertextuality and metatext. The author introduces a notion of stylistic competence. The stylistic competence of a speaker is directly proportional to his/her knowledge of different varieties of language (i.e. subcodes) and to the successful switching from one subcode to another. Stylistic creativity is a special segment of stylistic competence as a feature of individual style. A new classification of functional styles has also been introduced. This includes six primary styles (scientific, colloquial, administrative, publicistic, journalistic and literary-artistic) and five secondary styles (oratorical, the style of advertisements and commercials, that of comics, that of essays and that of screenplays). A special place is given to the analysis of the style of hypertext and hypermedia, which can be understood only within the post-structuralist theory of text deconstruction and intertextuality. The project also analysed some new topics, including reregistration in literary texts, gender and style of dialogue, and citations as metatextual signals and their role in different types of text. The results therefore offer a new approach to the study of linguistic stylistics both in Bosnia and Herzegovina and in the field in general.
Resumo:
Mr. Korosenyi begins by analysing the particular relationship holding between politics and administration in different countries. Within Europe three major patterns have emerged in the 20th century. Firstly there is the politically neutral British Civil Service, secondly the German and French state bureaucracies, which traditionally are supposed to embody the "common good", and thirdly there is the patronage system of the so-called consociate democracies, e.g. Austria. In general Mr. Korosenyi believes that, though politics do not penetrate into the Hungarian administration to the extent they do in Belgium and Austria, nevertheless, there is a stronger fusion than there is in the traditional British pattern. He is particularly interested in this relationship with regard to its effect on democratic institution building and the stabilisation of the new regime in Hungary, now the old "nomenklatura" system has been abolished. The structure of the Hungarian government was a result of the constitutional amendments of 1989 and 1990. Analysing this period, it becomes clear that for all the political actors who initiated and supported the democratic transition to democracy, the underlying assumption was a radical depoliticisation of the administration in order to maintain its stability. The political leadership of the executive is a cabinet government. The government is structured along ministries, each headed by a politician, i.e. the minister, who is a member of the cabinet. The minister's political secretary is not a cabinet member, but he or she is a politician, usually a member of the parliament. The head of the administration of the ministry is the administrative state secretary, who is a civil servant. He or she usually has four deputies, also civil servants. Naturally it is assumed that there should be a clear separation between politicians and civil servants. However in practice, the borders can be blurred, giving rise to a hybrid known as the "political civil servant". Mr. Korosenyi analyses the different faces of these hybrids. They are civil servants for the following reasons. They need special educational qualifications, working experience, a civil service exam etc., they are not allowed to do anything which is incompatible with their impartial role, and they cannot occupy political office nor may they appear in the name of any political party. On the other hand, the accepted political dimension to their function is revealed by the following facts. The state secretary (a civil servant) may participate in cabinet meetings instead of the minister. The state secretary is employed by the minister. A state secretary or any of their deputies can be dismissed at any time by the minister or the prime minister. In practice then, ministers appoint to these senior administrative positions civil servants whose personal and political loyaties are strong. To the second level of political patronage in ministries belong the ministerial cabinet, press office and public relation office. The ministerial cabinet includes the private advisors and members of the personal staff of the minister. The press office and the PR office, if they exist, are not adjusted to the administrative hierarchy of the ministry, but under the direct control of the minister. In the beginning of the 1990s, such offices were exceptions; in the second half of the 90s they are accepted and to be found in most ministries. Mr. Korosenyi's work, a 92-page manuscript of a book in Hungarian, marks the first piece of literature within the field of political science which analyses the structure of the Hungarian government in the 1990s and the relationship between the political leadership and the public administration.