3 results for Serial-correlation common features
in Central European University - Research Support Scheme
Abstract:
Mr. Kubon's project was inspired by the growing need for an automatic syntactic analyser (parser) of Czech, which could be used in the syntactic processing of large amounts of text. Mr. Kubon notes that such a tool would be very useful, especially in the field of corpus linguistics, where creating a large-scale "tree bank" (a collection of syntactic representations of natural language sentences) is a very important step towards the investigation of the properties of a given language. The work involved in syntactically parsing a whole corpus in order to get a representative set of syntactic structures would be almost inconceivable without the help of some kind of robust (semi)automatic parser. The need for the automatic natural language parser to be robust increases with the size of the linguistic data in the corpus or in any other kind of text which is going to be parsed. Practical experience shows that apart from syntactically correct sentences, there are many sentences which contain a "real" grammatical error. These sentences may be corrected in small-scale texts, but not generally in the whole corpus. In order to be able to complete the overall project, it was necessary to address a number of smaller problems. These were: 1. the adaptation of a suitable formalism able to describe the formal grammar of the system; 2. the definition of the structure of the system's dictionary containing all relevant lexico-syntactic information; 3. the development of a formal grammar able to robustly parse Czech sentences from the test suite; 4. filling the syntactic dictionary with sample data allowing the system to be tested and debugged during its development (about 1000 words); 5. the development of a set of sample sentences containing a reasonable amount of grammatical and ungrammatical phenomena covering some of the most typical syntactic constructions being used in Czech. Number 3, building a formal grammar, was the main task of the project.
The grammar is of course far from complete (Mr. Kubon notes that it is debatable whether any formal grammar describing a natural language may ever be complete), but it covers the most frequent syntactic phenomena, allowing for the representation of the syntactic structure of simple clauses and also the structure of certain types of complex sentences. The stress was not so much on building a wide-coverage grammar as on the description and demonstration of a method. This method uses an approach similar to that of grammar-based grammar checking. The problem of reconstructing the "correct" form of the syntactic representation of a sentence is closely related to the problem of localisation and identification of syntactic errors. Without precise knowledge of the nature and location of syntactic errors it is not possible to build a reliable estimate of a "correct" syntactic tree. The incremental way of building the grammar used in this project is also an important methodological issue. Experience from previous projects showed that building a grammar by creating a huge block of metarules is more complicated, especially from the point of view of testing and debugging the grammar, than the incremental method, which begins with the metarules covering the most common syntactic phenomena and adds less important ones later. The sample of the syntactic dictionary containing lexico-syntactic information (task 4) now has slightly more than 1000 lexical items representing all classes of words. During the creation of the dictionary it turned out that assigning complete and correct lexico-syntactic information to verbs is a very complicated and time-consuming process which would itself be worth a separate project. The final task undertaken in this project was the development of a method allowing effective testing and debugging of the grammar during the process of its development.
The problem of the consistency of new and modified rules of the formal grammar with the rules already existing is one of the crucial problems of every project aiming at the development of a large-scale formal grammar of a natural language. This method allows for the detection of any discrepancy or inconsistency of the grammar with respect to a test-bed of sentences containing all syntactic phenomena covered by the grammar. This is not only the first robust parser of Czech, but also one of the first robust parsers of a Slavic language. Since Slavic languages display a wide range of common features, it is reasonable to claim that this system may serve as a pattern for similar systems in other languages. To transfer the system into any other language it is only necessary to revise the grammar and to change the data contained in the dictionary (but not necessarily the structure of primary lexico-syntactic information). The formalism and methods used in this project can be used in other Slavic languages without substantial changes.
Abstract:
The original aim of this project was to describe and analyse the higher education acts currently in force in five Central and Eastern European countries, trying to understand the dependence of higher education on historical traditions, national peculiarities and all-European tendencies. The description and comparison of the main aspects of higher education was supplemented by a study of the possibilities of transferring experience in the field between the five countries and possible solutions to implementing foreign structural and functional models. Questions covered included the role of the state in the management of higher education, the structures of the higher education systems and the organisation of institutions, academic autonomy and the classifications of academic teaching staff, the main trends in the recent development of research, academic degrees, the accreditation of higher education institutions, and the financing of higher education. Popov found that it was almost impossible to understand the dependence of higher education on historical traditions and national peculiarities purely through a study of the relevant legislation. Education traditions in these countries have twice been broken, once with the start of communism (1917 in Russia and 1944-45 in Bulgaria, Hungary, Poland and Romania) and for a second time at the beginning of the 1990s. The most recent higher education acts in all five countries studied have abandoned many of their historical and national traditions, following instead all-European trends as determined by Western Europe, and the project included a study of the dependence on these trends. There were also difficulties in comparing some aspects of higher education, since comparability depended on how far a given aspect had different or common features in the different countries and to what extent its application was comparable.
While many possible areas for transferring experience between the five countries were identified, Popov concentrated on those where he felt that there was a real practical possibility of application in view of national academic differences. He concluded by defining some of the challenges facing each country in the field of higher education and by making some predictions as to the developments in the different countries.
Abstract:
The collapse of the Soviet Union at the beginning of the 1990s also meant the end of the idea of a common Soviet identity incarnated in the "Soviet man" and the new "historic community of the Soviet people". While this idea still lives on in the generations of the 1920s to 1940s, the younger generations tend to prefer identification with family, profession, ethnic group or religion. Ms. Alexakhina set out to investigate different interethnic interaction strategies in the multi-ethnic context of the Russian Federation, with an emphasis on analysing the role of cultural and ethno-demographic characteristics of minority ethnic groups. The project aimed to identify those specific patterns of interaction dynamics that have emerged in response to the political and economic transformation currently under way. The basic supposition was that the size and growth of an ethnic population are defined not only by demographic features such as fertility, mortality and net migration, but are also dependent on processes of interethnic interaction and ethnic transition. The central hypothesis of the project was that the multi-ethnic and multi-cultural composition of Russia is apparently manifesting itself in the ethnic minority groups in various forms, but particularly in the form of ethnic revival and/or assimilation. The results of these complex phenomena are manifested as changes in ethnic attachments (national re-identification) and language behaviour (multi-lingualism, language transition and loss of the mother tongue). The stress of the political and economic crisis has stimulated significant changes in ethnographic, social and cultural characteristics of interethnic dynamics such as the rate of national re-identification, language behaviour, migration activity and the spread of mixed marriages, among both those minorities with a long history of settlement in Russia and those that were annexed during the Soviet period.
Patterns of language behaviour and the spread of mixed marriages were taken as the main indicators of the directions of interethnic interaction described as assimilation, ethnic revival and cultural pluralism. The first stage of the research involved a statistical analysis of census data from 1959 to 1994 in order to analyse the changing demographic composition of the largest ethnic groups of the Russian Federation. Until 1989 interethnic interaction in Soviet society was distinguished by the process of russification, but the political and economic transformation has stimulated the process of ethnic revival, leading to an apparent fall in the size of the Russian population due to ethnic re-identification by members of other ethnic groups who had previously identified themselves as Russian. Cross-classification of nationalities by demographic, social and cultural indicators has shown that the most important determinants of the nature of interethnic interaction are cultural factors such as religion and language affiliation. The analysis of the dynamics of language shift through the study of bilingualism and the domains of language usage for different demographic groups revealed a strong correlation between recognition of Russian as a mother tongue among some non-Russian ethnic groups and the declining size of these groups. The main conclusion from this macro-analysis of census data was the hypothesis that social and political factors are of growing importance for ethnic succession, and that ethnic identity is no longer a stable characteristic but has become dynamic in nature. In order to verify this hypothesis Ms. Alexakhina conducted a survey in four regions showing different patterns of interethnic interaction: the Karelian Republic, Buryatiya, the Nenezkii Autonomous Region and Tatarstan. These represented the west, east, north and south of the Russian Federation.
Samples for the survey were prepared on the basis of census lists so as to exclude mono-Russian families in favour of mixed and ethnic-minority families. The survey confirmed the significant growth in the importance of ethnic affiliation in the everyday lives of people in the Federation following the de-centralisation of the political and economic spheres. Language was shown to be a key symbol of the consciousness of national distinction, confirmed by the fact that the process of russification has been reversed by the active mastering of the languages of titular nationalities. The results also confirmed that individual ethnic identity has ceased to be a fixed personal characteristic of one's cultural and genetic belonging, and people's social adaptation to the current political, social and economic conditions is also demonstrated in changes in individual ethnic self-identification. In general terms, the dynamic nature of national identity means that ethnic identity is at present acquiring the special features of overall social identity, for which the frequent change of priorities is an inherent feature of a person's life cycle. These are mainly linked with a multi-ethnic environment and high individual social mobility. From her results Ms. Alexakhina concludes that the development of national languages and multi-lingualism, together with the preservation of Russian as a state language, seems to be the most promising path to peaceful coexistence and the development of the national cultures of different ethnic groups within the Russian Federation.