954 resultados para Linguistic phenomena
Resumo:
In Natural Language Processing (NLP) symbolic systems, several linguistic phenomena, for instance, the thematic role relationships between sentence constituents, such as AGENT, PATIENT, and LOCATION, can be accounted for by the employment of a rule-based grammar. Another approach to NLP concerns the use of the connectionist model, which has the benefits of learning, generalization and fault tolerance, among others. A third option merges the two previous approaches into a hybrid one: a symbolic thematic theory is used to supply the connectionist network with initial knowledge. Inspired on neuroscience, it is proposed a symbolic-connectionist hybrid system called BIO theta PRED (BIOlogically plausible thematic (theta) symbolic-connectionist PREDictor), designed to reveal the thematic grid assigned to a sentence. Its connectionist architecture comprises, as input, a featural representation of the words (based on the verb/noun WordNet classification and on the classical semantic microfeature representation), and, as output, the thematic grid assigned to the sentence. BIO theta PRED is designed to ""predict"" thematic (semantic) roles assigned to words in a sentence context, employing biologically inspired training algorithm and architecture, and adopting a psycholinguistic view of thematic theory.
Resumo:
Neste artigo procuramos reflectir sobre a função dos corpora na observação e análise de fenómenos de uma língua natural bem como na criação de novos recursos de exploração linguísticos que as tecnologias de informação têm vindo a potenciar e a tornar mais eficaz.
Resumo:
The automatic acquisition of lexical associations from corpora is a crucial issue for Natural Language Processing. A lexical association is a recurrent combination of words that co-occur together more often than expected by chance in a given domain. In fact, lexical associations define linguistic phenomena such as idiomes, collocations or compound words. Due to the fact that the sense of a lexical association is not compositionnal, their identification is fundamental for the realization of analysis and synthesis that take into account all the subtilities of the language. In this report, we introduce a new statistically-based architecture that extracts from naturally occurring texts contiguous and non contiguous. For that purpose, three new concepts have been defined : the positional N-gram models, the Mutual Expectation and the GenLocalMaxs algorithm. Thus, the initial text is fisrtly transformed in a set of positionnal N-grams i.e ordered vectors of simple lexical units. Then, an association measure, the Mutual Expectation, evaluates the degree of cohesion of each positional N-grams based on the identification of local maximum values of Mutual Expectation. Great efforts have also been carried out to evaluate our metodology. For that purpose, we have proposed the normalisation of five well-known association measures and shown that both the Mutual Expectation and the GenLocalMaxs algorithm evidence significant improvements comparing to existent metodologies.
Resumo:
Dissertação de mestrado em Estudos Interculturais Português / Chinês: Tradução, Formação e Comunicação Empresarial
Resumo:
This work deals with coming and going verbs in three typologically distinct languages: a Romance one (Spanish), a Germanic one (German) and a Slavic one (Polish) within Fillmore`s (1966, 1971, 1975, 1982, 1983) framework. On the grounds of data description it is shown that visible linguistic phenomena, such as deixis or Aktionsart related clearly to the coming and going verbs, can be transcended, resorting to the more abstract notion called by the author, following Winston (1987) and Speas and Tenny (2003), “viewpoint”.
Resumo:
L'objectiu d'aquest article és defensar la responsabilitat del component lèxic pel que fa a la presència (o l'absència) del clític en la construcció anticausativa. Es presenten proves que justifiquen que les argumentacions sobre aquest fenomen des de la semàntica o l'estructura eventual, que són les més prolíferes en la bibliografia, no són vàlides per a tots els verbs que participen en l'estructura sintàctica esmentada. Es considera que ni la sintaxi ni la morfologia tampoc poden donar llum dels fets i es proposa que l'explicació de la manca d'uniformitat pel que fa a l'aparició del pronom ha de trobar-se a partir de fenòmens que no es poden discernir en observar només l'estat actual de la llengua.
Resumo:
Aquest estudi és una aproximació crítica a la dicotomia establerta per Saussure entre la lingüística interna i la lingüística externa i l'aplicació que se n'ha fet als tractats historiogràfics de la llengua anglesa. Pel caràcter ontològic de la investigació diacrònica, aquestes consideracions saussurianes posen les bases empíriques per teoritzar sobre el canvi lingüístic en la llengua anglesa. D'altra banda, la distinció més o menys explícita entre la història interna de la llengua anglesa i l'externa és un fet comunament acceptat i utilitzat pels tractats historiogràfics consultats. A l'hora de dur a terme la investigació diacrònica, però, la història interna necessita, per a la delimitació cronològica, les aportacions de l'externa; així, encara que els condicionants sociològics es considerin marginals, els factors externs adquireixen importància no només pel fet d'incloure el context social on s'emmarquen els fenòmens lingüístics, sinó també per les consideracions de tipus sociolingüístic que en resulten, les quals aporten noves línies de recerca historiogràfica, basada en els models de variacions en el registre escrit de la llengua anglesa.
Resumo:
The ERP repetition priming paradigm has been shown to be sensitive to the processing differences between regular and irregular verb forms in English and German. The purpose of the present study is to extend this research to a language with a different inflectional system, Spanish. The design (delayed visual repetition priming) was adopted from our previous study on English, and the specific linguistic phenomena we examined are priming relations between different kinds of stem (or root) forms. There were two experimental conditions: In the first condition, the prime and the target shared the same stem form, e.g., "ando-andar" [I walk-to walk], whereas in the second condition, the prime contained a marked (alternated) stem, e.g., "duermo-dormir" [I sleep-to sleep]. A reduced N400 was found for unmarked (nonalternated) stems in the primed condition, whereas marked stems showed no such effect. Moreover, control conditions demonstrated that the surface form properties (i.e., the different degree of phonetic and orthographic overlap between primes and targets) do not explain the observed priming difference. The ERP priming effect for verb forms with unmarked stems in Spanish is parallel to that found for regularly inflected verb forms in English and German. We argue that effective priming is possible because prime target pairs such as "ando-andar" access the same lexical entry for their stems. By contrast, verb forms with alternated stems (e.g., "duermo") constitute separate lexical entries, and are therefore less powerful primes for their corresponding base forms.
Resumo:
En aquest article s'intenta aportar una visió descriptiva del funcionament d'haber, ser i estar en construccions 'perifràstiques' del castellà medieval. Quant a les perífrasis 'AUX+ infinitiu', s'observa que no s'ajusten a una anàlisi del tipus 'SV que selecciona un SV'. D'altra banda, s'aporta evidència que afavoreix una visió de la derivació dels futurs i condicionals analítics on el verb [-finit] s'excorpora de l'auxiliar funcional per traslladar-se a CO. Finalment, s'estableix que les construccions 'haber/ser/estar+participi' poden ser analitzades com a verbs lèxics que subcategoritzen una oració reduïda el predicat de la qual és el participi. Això permet relacionar l'avantposició de participi amb la dels SA i la dels arguments interns. Els aspectes bàsics d'aquest canvi sintàctic que duu del castellà medieval i preclàssic a l'espanyol actual són: (a) la categoria lèxica SV dels verbs en qüestió es reanalitza com una categoria funcional SAsp i aquests verbs esdevenen auxiliars; (b) hi ha un canvi de subcabgorització, ja que aquests verbs deixen de subcategoritzar una oració reduïda per passar a subcategoritzar un SVmàx, i (c) la pèrdua de la projecció màxima SCONC1 comporta la desparició dels efectes de la llei Tobler-Mussafia, de la possibilitat d'avantposar el participi i també de la concordança de participi en els perfets compostos.
Resumo:
Studies about discursive genres peak at a necessity to understand how these genres work in a society that is more and more submissive to the technology of informatics such as ours. In acknowledging that the virtual context of the Internet provides the manifestation and development of new genres of the discourse, we perceived that the online journal, or blog, as commonly known, is responsible for a variety of linguistic phenomena that would normally take severaI years to consolidate. Since its appearance in 1997 blogs rush as a virtual version of the personal diary and in a short time due to communicative demands suffers several changes, making new categories of blog genres emerge. Facing such phenomena, this work intends at first to characterize blogs as a genre that exercises a social action, evidencing its formal, structural and pragmatic characteristics from the notion of recurrence and rhetoric in a discursive-semiotic perspective. The methodological postulates adopted by this research are considered of qualitative basis in the sense that they are not restricted to looking at the discursive events as a product, but mainly they take into consideration a group of situational, cultural and ideologic factors that are present in the constitution of genre.
Resumo:
Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES)
Resumo:
Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES)
Resumo:
This paper is part of a larger project that is being developed at our university, whose goal is to enable students of Literature and Translation to detect and analyze linguistic phenomena based on electronic corpora, consisting of original texts and texts translated, which show the general language, literary language and specialized language. One of its aspects is the analysis of medical abstracts and their translation process. Thus, this undergraduate research project, we seek to observe the technical-scientific English language by performing translations of abstracts of scientific articles of Neurology and Oncology.
Resumo:
This article presents an investigation of four linguistic phenomena of the Portuguese in different synchronicities. These research are about construction, description and analysis of corpora for the creation of databases that support linguistic description.
Resumo:
Pós-graduação em Letras - FCLAR