998 resultados para corpus analysis
Resumo:
This paper describes the methodology used to compile a corpus called MorphoQuantics that contains a comprehensive set of 17,943 complex word types extracted from the spoken component of the British National Corpus (BNC). The categorisation of these complex words was derived primarily from the classification of Prefixes, Suffixes and Combining Forms proposed by Stein (2007). The MorphoQuantics corpus has been made available on a website of the same name; it lists 554 word-initial and 281 word-final morphemes in English, their etymology and meaning, and records the type and token frequencies of all the associated complex words containing these morphemes from the spoken element of the BNC, together with their Part of Speech. The results show that, although the number of word-initial affixes is nearly double that of word-final affixes, the relative number of each observed in the BNC is very similar; however, word-final affixes are more productive in that, on average, the frequency with which they attach to different bases is three times that of word-initial affixes. Finally, this paper considers how linguists, psycholinguists and psychologists may use MorphoQuantics to support their empirical work in first and second language acquisition, and clinical and educational research.
Resumo:
Esta tesis doctoral, que es la culminación de mis estudios de doctorado impartidos por el Departamento de Lingüística Aplicada a la Ciencia y a la Tecnología de la Universidad Politécnica de Madrid, aborda el análisis del uso de la matización (hedging) en el lenguaje legal inglés siguiendo los postulados y principios de la análisis crítica de género (Bhatia, 2004) y empleando las herramientas de análisis de córpora WordSmith Tools versión 6 (Scott, 2014). Como refleja el título, el estudio se centra en la descripción y en el análisis contrastivo de las variedades léxico-sintácticas de los matizadores del discurso (hedges) y las estrategias discursivas que con ellos se llevan a cabo, además de las funciones que éstas desempeñan en un corpus de sentencias del Tribunal Supremo de EE. UU., y de artículos jurídicos de investigación americanos, relacionando, en la medida posible, éstas con los rasgos determinantes de los dos géneros, desde una perspectiva socio-cognitiva. El elemento innovador que ofrece es que, a pesar de los numerosos estudios que se han podido realizar sobre los matizadores del discurso en el inglés general (Lakoff, 1973; Hübler, 1983; Clemen, 1997; Markkanen and Schröder, 1997; Mauranen, 1997; Fetzer 2010; y Finnegan, 2010 entre otros) académico (Crompton, 1997; Meyer, 1997; Skelton, 1997; Martín Butragueňo, 2003) científico (Hyland, 1996a, 1996c, 1998c, 2007; Grabe and Kaplan, 1997; Salager-Meyer, 1997 Varttala, 2001) médico (Prince, 1982; Salager-Meyer, 1994; Skelton, 1997), y, en menor medida el inglés legal (Toska, 2012), no existe ningún tipo de investigación que vincule los distintos usos de la matización a las características genéricas de las comunicaciones profesionales. Dentro del lenguaje legal, la matización confirma su dependencia tanto de las expectativas a macro-nivel de la comunidad de discurso, como de las intenciones a micro-nivel del escritor de la comunicación, variando en función de los propósitos comunicativos del género ya sean éstos educativos, pedagógicos, interpersonales u operativos. El estudio pone de relieve el uso predominante de los verbos modales epistémicos y de los verbos léxicos como matizadores del discurso, estos últimos divididos en cuatro tipos (Hyland 1998c; Palmer 1986, 1990, 2001) especulativos, citativos, deductivos y sensoriales. La realización léxico-sintáctica del matizador puede señalar una de cuatro estrategias discursivas particulares (Namsaraev, 1997; Salager-Meyer, 1994), la indeterminación, la despersonalización, la subjectivisación, o la matización camuflada (camouflage hedging), cuya incidencia y función varia según género. La identificación y cuantificación de los distintos matizadores y estrategias empleados en los diferentes géneros del discurso legal puede tener implicaciones pedagógicos para los estudiantes de derecho no nativos que tienen que demostrar una competencia adecuada en su uso y procesamiento. ABSTRACT This doctoral thesis, which represents the culmination of my doctoral studies undertaken in the Department of Linguistics Applied to Science and Technology of the Universidad Politécnica de Madrid, focusses on the analysis of hedging in legal English following the principles of Critical Genre Analysis (Bhatia, 2004), and using WordSmith Tools version 6 (Scott, 2014) corpus analysis tools. As the title suggests, this study centers on the description and contrastive analysis of lexico-grammatical realizations of hedges and the discourse strategies which they can indicate, as well as the functions they can carry out, in a corpus of U.S. Supreme Court opinions and American law review articles. The study relates realization, incidence and function of hedging to the predominant generic characteristics of the two genres from a socio-cognitive perspective. While there have been numerous studies on hedging in general English (Lakoff, 1973; Hübler, 1983; Clemen, 1997; Markkanen and Schröder, 1997; Mauranen, 1997; Fetzer 2010; and Finnegan, 2010 among others) academic English (Crompton, 1997; Meyer, 1997; Skelton, 1997; Martín Butragueňo, 2003) scientific English (Hyland, 1996a, 1996c, 1998c, 2007; Grabe and Kaplan, 1997; Salager-Meyer, 1997 Varttala, 2001) medical English (Prince, 1982; Salager-Meyer, 1994; Skelton, 1997), and, to a lesser degree, legal English (Toska, 2012), this study is innovative in that it links the different realizations and functions of hedging to the generic characteristics of a particular professional communication. Within legal English, hedging has been found to depend on not only the macro-level expectations of the discourse community for a specific genre, but also on the micro-level intentions of the author of a communication, varying according to the educational, pedagogical, interpersonal or operative purposes the genre may have. The study highlights the predominance of epistemic modal verbs and lexical verbs as hedges, dividing the latter into four types (Hyland, 1998c; Palmer, 1986, 1990, 2001): speculative, quotative, deductive and sensorial. Lexical-grammatical realizations of hedges can signal one of four discourse strategies (Namsaraev, 1997; Salager-Meyer, 1994), indetermination, depersonalization, subjectivization and camouflage hedging, as well as fulfill a variety of functions. The identification and quantification of the different hedges and hedging strategies and functions in the two genres may have pedagogical implications for non-native law students who must demonstrate adequate competence in the production and interpretation of hedged discourse.
Resumo:
This article considers the issue of low levels of motivation for foreign language learning in England by exploring how language learning is conceptualised by different key voices in that country through the examination of written data: policy documents and reports on the UK's language needs, curriculum documents, and press articles. The extent to which this conceptualisation has changed over time is explored, through the consideration of documents from two time points, before and after a change in government in the UK. The study uses corpus analysis methods in this exploration. The picture that emerges is a complex one regarding how the 'problems' and 'solutions' surrounding language learning in that context are presented in public discourse. This, we conclude, has implications for the likely success of measures adopted to increase language learning uptake in that context.
Resumo:
The geographical proximity and socioeconomic dependence on the United States brought about a deep rooted anglicization of the Cuban Spanish lexis and social strata, especially throughout the Neocolonial period (1902–1959). This study is based on the revision of a renowned newspaper of that time, Diario de la Marina, and the corresponding elaboration of a corpus of English-induced loanwords. Diario de la Marina particularly targeted upper social class, and only crónicas sociales (society pages’ columns) and print advertising were revised because of their fully descriptive texts, which encoded the ruling class ideology and consumerism. The findings show that there existed a high number of lexical and cultural anglicisms in the sociolect in question, and that the sociolinguistic anglicization was openly embraced by the upper socioeconomic stratum, entailing a differentiating sign of sophistication and social stratification. Likewise, a number of the anglicisms collected, particularly those related with social events, are unused in contemporary Cuban Spanish, which suggests a major semantic shifting in this sociolect after 1959.
Resumo:
Esta pesquisa tematizou o ensino inicial da leitura na escola primária, no estado do Espírito Santo, nos anos de 1960, com o objetivo de analisar princípios que fundamentam o “método global” e sua apropriação no campo da política educacional, para justificar a representação desse método como eficaz para a alfabetização de crianças. Para tanto, nos debruçamos sobre manuais didáticos elaborados para o ensino inicial da leitura, que propõem o “método global” por meio de contos e historietas, que circularam em escolas capixabas. Desse modo a questão central que norteou esta investigação foi: Que apropriações foram feitas de princípios que fundamentavam o “método global” pelas professoras autoras de manuais didáticos (de contos e historietas) que circularam no Estado do Espírito Santo, nos anos de 1960? Os manuais didáticos que integram o corpus documental da análise compreendem: as mais belas histórias: pré-livro, parte do mestre [196?]; as mais belas histórias: pré-livro (1964) e as mais belas histórias: pré-livro, bloco de atividades [196?], de autoria Lúcia Casasanta. O livro de Lilí: método global, manual da professora (1940) e o livro de Lilí: cartilha (1961), de Anita Fonseca e o circo do Carequinha, manual do professor (1969), de Maria Serafina de Freitas. Além dessas fontes privilegiamos outras como revista pedagógica, correspondências oficiais, ata de reunião pedagógica, jornal, orientações/prescrições para prática pedagógica. Consideramos na análise o esquema conceitual apresentado por Roger Chartier: circulação, representação, apropriação e práticas culturais e, ainda o conceito de cultura escolar de Dominique Julia. Compreende-se que foram feitas apropriações inventivas, dos princípios teóricos formulados por Jean-Ovide Decroly, pelas autoras dos manuais didáticos e estes puseram em circulação a representação de método e de ensino da leitura que foi apropriada e legitimada pela política educacional capixaba. Entende-se que a proposta do “método global” não provocou significativas modificações na condição de passividade do aluno no processo de aprendizagem da leitura, tendo em vista a naturalização dos processos de desenvolvimento da criança e permanência de procedimentos mecanicista e reducionista da língua.
Resumo:
This thesis investigates epistemic indefinites (EIs), elements noteworthy for their grammaticalized ignorance implicature, i.e. inability to provide further information about the identity of the expression's referent. This work contributes to the effort of finding a unified account of the cross-linguistic repertoire of EIs. It comprises a corpus survey and a semantic analysis of Slovak voľa- and -si, EI items not studied until now. First, the following hypothesis was tested: the semantic/syntactic functions expressed by an indefinite will fall into contiguous areas on an implicational map (Haspelmath 1997). The results of the corpus analysis revealed that the map does not entirely capture the Slovak EIs' functional distribution and interpretations. Secondly, the semantic analysis was developed within the alternatives-and-exhaustification framework (Chierchia 2013). I show that some of the EIs' behavior can be explained as a consequence of an assumed sensitivity to parameters proposed by Chierchia. I situate voľa- and -si with respect to the framework’s typology and offer a critical assessment of this theoretical perspective.
Resumo:
This paper presents an Italian to CatalanRBMT system automatically built bycombining the linguistic data of theexisting pairs Spanish-Catalan andSpanish-Italian. A lightweight manualpostprocessing is carried out in order tofix inconsistencies in the automaticallyderived dictionaries and to add very frequentwords that are missing accordingto a corpus analysis. The system isevaluated on the KDE4 corpus and outperformsGoogle Translate by approximatelyten absolute points in terms ofboth TER and GTM.
Resumo:
La version intégrale de cette thèse est disponible uniquement pour consultation individuelle à la Bibliothèque de musique de l'Université de Montréal (www.bib.umontreal.ca/MU).
Resumo:
This thesis summarizes the results on the studies on a syntax based approach for translation between Malayalam, one of Dravidian languages and English and also on the development of the major modules in building a prototype machine translation system from Malayalam to English. The development of the system is a pioneering effort in Malayalam language unattempted by previous researchers. The computational models chosen for the system is first of its kind for Malayalam language. An in depth study has been carried out in the design of the computational models and data structures needed for different modules: morphological analyzer , a parser, a syntactic structure transfer module and target language sentence generator required for the prototype system. The generation of list of part of speech tags, chunk tags and the hierarchical dependencies among the chunks required for the translation process also has been done. In the development process, the major goals are: (a) accuracy of translation (b) speed and (c) space. Accuracy-wise, smart tools for handling transfer grammar and translation standards including equivalent words, expressions, phrases and styles in the target language are to be developed. The grammar should be optimized with a view to obtaining a single correct parse and hence a single translated output. Speed-wise, innovative use of corpus analysis, efficient parsing algorithm, design of efficient Data Structure and run-time frequency-based rearrangement of the grammar which substantially reduces the parsing and generation time are required. The space requirement also has to be minimised
Resumo:
O presente trabalho teve o objetivo de identificar os elementos da cultura organizacional do Tribunal de Justiça do Estado da Bahia (TJBA) à luz das contribuições de autores do pensamento social brasileiro - representados neste estudo por Sergio Buarque de Holanda, Raymundo Faoro, Darcy Ribeiro e Celso Furtado. O tema cultura tem recebido, nas últimas décadas, relativa atenção na área de estudos organizacionais. Apesar da atenção recebida, é possível perceber lacunas na produção acadêmica sobre o tema quanto à atenção dispensada ao contexto nacional e às instituições do setor público, especialmente no que se refere ao Poder Judiciário. Nesse sentido, o presente trabalho analisou a produção acadêmica nacional sobre cultura organizacional brasileira nos últimos vinte anos e buscou identificar possibilidades de utilização dos autores do pensamento social brasileiro numa abordagem de cultura organizacional que considerasse o contexto nacional. Na etapa de coleta de dados, foram realizadas trinta e sete entrevistas semiestruturadas com servidores do TJBA, no período de fevereiro a julho de 2013. A análise do corpus de dados foi feita pelo método de análise de conteúdo, com auxilio do software ATLAS.ti. A análise dos dados apontou para possibilidades de utilização das ideias dos autores do pensamento social brasileiro no estudo da cultura organizacional, principalmente na identificação da influência dos traços culturais nos valores e práticas sociais dos servidores. A análise apontou também para uma ausência de elementos baseados em valores gerencialistas na cultura organizacional desta instituição, buscando provocar uma reflexão sobre o alcance e a adequação do modelo de reforma adotado no Judiciário. Como contribuição, esta tese buscou realizar um mapeamento da cultura organizacional do TJBA, gerando uma contribuição empírica para um aprofundamento do conhecimento do Poder Judiciário sob o aspecto de universo organizacional e para os estudos que utilizam o pensamento social brasileiro, que em sua maioria têm se tratado de ensaios teóricos.
Resumo:
This paper intents to investigate the repercussions of the Pró-Letramento - Alphabetization and Language Course Program in the practices and conceptions of alphabetization of learning teachers, according to their own perspectives. The program, part of the National Formation Network of the Education Ministry, in partnership with public universities since 2006, is destined to aid the formation of teachers acting in the first three years of public basic education, with the goal of qualifying them to work with alphabetization and improve quality of learning processes and results. This investigation adopted the qualitative research paradigm as well as the Case Study methodology, being our empirical field a Belém do Brejo do Cruz/PB public school. The subjects are five Program graduation female teachers, including the tutor-trainer and four graduated teachers that already teach in the first three years of basic education. The data, gathered with documental analysis, individual and collective semi-structured interview, and non-participant observation, were analyzed according with Speech Analysis principals. Based on those principals, we intersected teachers enunciations, observation sessions registries and the Program s propositions and built interpretations based on theorization taken as fundaments of the investigation, among which we highlight: studies on a criticalreflexive perspective of a teacher s formation; continued formation as a permanent development process; the principals of historical-cultural approach on alphabetization processes and development with the centrality of language; alphabetization in a interactionist approach. The analysis focused on the Program s repercussions: 1) in the teachers conceptions about: 1.1) learning; 1.2) Alphabetization and literacy; and in the practices and conceptions related to: 1.3) alphabetization in a literacy perspective and 1.4) appropriation of the writing system. The corpus analysis evidenced relations of continuity and discontinuity, approach and distancing between the teachers conceptions and the Program s propositions, as well as conceptions/significance of their speeches and related or observed practices. Observing teachers elaborations evidences the repercussion of the Program s formation, whilst also showing gaps and mismatches in their appropriation process in concepts/assumptions as well as teaching propositions. These mismatches involve interaction relationships between teachers and students, with their possibilities and limitations surrounding the Program s knowledge objects complexity, also linking to the social, economical, political, and cultural conditions that involve both the implementation of the Program in each context and the conditions in which alphabetization in public schools are developed, demanding permanent and accompanied formation processes, investments to improve work conditions and valuing teaching
Resumo:
This paper presents results concerning studies conducted during the master's research in the area of Applied Linguistics of the Graduate Program in Language Studies - PPgEL, linked to the Federal University of Rio Grande do Norte. We approached this study the question of authorship, which has gained ground increasingly important in academic research, because the act of authoring, in view of the amount of circulating discourses socially, becomes increasingly questionable. We understand authorship as related to a positioning of the subject than to take responsibility, in the Bakhtinian sense, by its text, it makes their views, ideologies, beliefs and values, from ownership and restructuring of the speech of others. This research is methodologically inserted in a qualitative, interpretive in nature and has a corpus analysis of the ten productions written memoir genre of readings produced by students in 9th grade in elementary school. Research aimed to analyze the students' written productions held in the school environment, to identify these texts marks or evidence of authorship and investigate how the restructuring of ownership and voices of others in the process of authoring. We adopt a concept of language from bakhitinianas concepts and understand the text in a vision Geraldiana. The analyzes in this study showed that in most texts there is the strong presence of voices of others, either implicitly and/or explicit. That is, students, to build their sayings, if embraced the voices of others in various formats, in particular styles, namely, by what Bakhtin names as linear style and pictorial style. Some proved his statement highlighting their positions through direct quotes from authors of books, family, teachers, showing us that their voices are filled with voices of others with whom they approve, deny, analyze, think and reflect. We therefore conclude that students were constituted as subjects of their texts, authors, since they leave marked the voices of others so presentified and restructured voices of others who do not actually sounded like voices of others, but as present voices from a discursive restructuring producer
Resumo:
Starting from the premise that we live in the society of spectacle, as proclaimed by Guy Debbord, and, in this context, the media feeds itself off of this spectacularization and constructs a culture of images and production of goods, providing templates from which the subject can identify himself/herself as being male or female, successful or unsuccessful, powerful or powerless. In other words, the culture conveyed by the media produces material for the creation of identities through which individuals insert and recognize themselves in contemporary society. Observing the election campaigns, we can see clearly that this profusion of identities is fairly explored in the advertising propaganda used by the candidates, particularly in the propaganda broadcasted on the Free Electoral Time on TV. Instigated by the explicit relation between the media and politics within the society of the spectacle, this study aims to investigate the main identities that emerge in the discursive practices of the media in the election campaigns of 2010 for president of the Republic and governor of the State of Rio Grande do Norte that had as protagonists the candidates at that moment Dilma Rousseff (PT) for president and Rosalba Ciarline (DEM) for governor. To do so, we based ourselves on the theory of Bakhtin Circle, which considers the statement as a unit of verbal communication and conceives language as a dialogical phenomena and a discursive practice and also in the conceptions of dialogical relationships, social voices and chronotope formulated by the previous mentioned theory. Still in the theoretical field, we have established an interconnection with the theories coming from the Cultural Studies (Hall, Woodward) about the identity, which conceives it as multiple, fragmented, non-fixed, so that, the subject assumes different identities, not always coherent, at different times, depending on the context in which they are approached. The research is situated in the frames of Applied Linguistics, which considers language as the center of its studies and settles on the border of an open number of areas of knowledge expanding its possibilities of investigation by means of the interdisciplinary. Our corpus consists in 20 electoral propaganda videos aired on TV during the Free Election Time in 2010 campaign; among these, 14 videos are Dilma Rousseff s propaganda and 06 videos are Rosalba Ciarline s propaganda. We seek for the purpose of the analysis to identify the identities which emerge from the discourses about the candidates in propaganda videos broadcasted in the referred campaign, as well as realize the dialogical relations established in these discourses and even if the identity construction of these subjects is located in the same axiological axis. The corpus analysis revealed that the multiple cultural identities of the candidates campaigning emerge in the discourses circulating in the electoral propaganda aired on TV such as: the identities of pioneer woman, competent, sensitive, mother, grandmother, religious. And, yet, those are changeable as the electoral demands, in other words, the need to obtain support and votes, outline a fluid identity construction about the candidate to the position in question
Resumo:
This research has as main proposal to verify how the virtue of generosity, considered as one of the important aspects of moral development on children, shows itself in children between four and six years old in institutional welcome condition. Toward this aim, knowledge in Human Development Psychology, specifically Jean Piaget and his contemporary collaborators, were associated to philosophic knowledge, with emphasis in Kant and Aristotle. The research took place in an institutional welcome service in the town of Natal-RN, between February and May of 2009, witch is responsible to offer care to children from zero till six years old. As procedure were used: the rulebased game Memory Game ; the childhood story One note fight ; and observations of the institution s everyday. The corpus analysis was done by the Thematic Content Analysis method. In general, in the daily observations, children showed themselves more co-operative and generous than in the executed procedures, mainly when the matter subject was other´s well-being. The duty, Kantian perspective, was overcome by the Aristotelian sumo bem, classified as generous act of afford to other what is not posted as right, however guaranteed by solidarity
Resumo:
Pós-graduação em Comunicação - FAAC