12 resultados para Corpus Christi, Battle of, Argentina, 1536.
em Aston University Research Archive
Resumo:
Almost everyone who has an email account receives from time to time unwanted emails. These emails can be jokes from friends or commercial product offers from unknown people. In this paper we focus on these unwanted messages which try to promote a product or service, or to offer some “hot” business opportunities. These messages are called junk emails. Several methods to filter junk emails were proposed, but none considers the linguistic characteristics of junk emails. In this paper, we investigate the linguistic features of a corpus of junk emails, and try to decide if they constitute a distinct genre. Our corpus of junk emails was build from the messages received by the authors over a period of time. Initially, the corpus consisted of 1563, but after eliminating the duplications automatically we kept only 673 files, totalising just over 373,000 tokens. In order to decide if the junk emails constitute a different genre, a comparison with a corpus of leaflets extracted from BNC and with the whole BNC corpus is carried out. Several characteristics at the lexical and grammatical levels were identified.
Resumo:
University students encounter difficulties with academic English because of its vocabulary, phraseology, and variability, and also because academic English differs in many respects from general English, the language which they have experienced before starting their university studies. Although students have been provided with many dictionaries that contain some helpful information on words used in academic English, these dictionaries remain focused on the uses of words in general English. There is therefore a gap in the dictionary market for a dictionary for university students, and this thesis provides a proposal for such a dictionary (called the Dictionary of Academic English; DOAE) in the form of a model which depicts how the dictionary should be designed, compiled, and offered to students. The model draws on state-of-the-art techniques in lexicography, dictionary-use research, and corpus linguistics. The model demanded the creation of a completely new corpus of academic language (Corpus of Academic Journal Articles; CAJA). The main advantages of the corpus are its large size (83.5 million words) and balance. Having access to a large corpus of academic language was essential for a corpus-driven approach to data analysis. A good corpus balance in terms of domains enabled a detailed domain-labelling of senses, patterns, collocates, etc. in the dictionary database, which was then used to tailor the output according to the needs of different types of student. The model proposes an online dictionary that is designed as an online dictionary from the outset. The proposed dictionary is revolutionary in the way it addresses the needs of different types of student. It presents students with a dynamic dictionary whose contents can be customised according to the user's native language, subject of study, variant spelling preferences, and/or visual preferences (e.g. black and white).
Resumo:
This study uses a purpose-built corpus to explore the linguistic legacy of Britain’s maritime history found in the form of hundreds of specialised ‘Maritime Expressions’ (MEs), such as TAKEN ABACK, ANCHOR and ALOOF, that permeate modern English. Selecting just those expressions commencing with ’A’, it analyses 61 MEs in detail and describes the processes by which these technical expressions, from a highly specialised occupational discourse community, have made their way into modern English. The Maritime Text Corpus (MTC) comprises 8.8 million words, encompassing a range of text types and registers, selected to provide a cross-section of ‘maritime’ writing. It is analysed using WordSmith analytical software (Scott, 2010), with the 100 million-word British National Corpus (BNC) as a reference corpus. Using the MTC, a list of keywords of specific salience within the maritime discourse has been compiled and, using frequency data, concordances and collocations, these MEs are described in detail and their use and form in the MTC and the BNC is compared. The study examines the transformation from ME to figurative use in the general discourse, in terms of form and metaphoricity. MEs are classified according to their metaphorical strength and their transference from maritime usage into new registers and domains such as those of business, politics, sports and reportage etc. A revised model of metaphoricity is developed and a new category of figurative expression, the ‘resonator’, is proposed. Additionally, developing the work of Lakov and Johnson, Kovesces and others on Conceptual Metaphor Theory (CMT), a number of Maritime Conceptual Metaphors are identified and their cultural significance is discussed.
Resumo:
Research in social psychology has shown that public attitudes towards feminism are mostly based on stereotypical views linking feminism with leftist politics and lesbian orientation. It is claimed that such attitudes are due to the negative and sexualised media construction of feminism. Studies concerned with the media representation of feminism seem to confirm this tendency. While most of this research provides significant insights into the representation of feminism, the findings are often based on a small sample of texts. Also, most of the research was conducted in an Anglo-American setting. This study attempts to address some of the shortcomings of previous work by examining the discourse of feminism in a large corpus of German and British newspaper data. It does so by employing the tools of Corpus Linguistics. By investigating the collocation profiles of the search term feminism, we provide evidence of salient discourse patterns surrounding feminism in two different cultural contexts. © The Author(s) 2012.
Resumo:
In this paper, I concentrate on court cases with litigants in person (lay people who act on their own behalf in legal proceedings without a counsel or solicitor) and discuss the challenges of building a corpus of courtroom discourse where it is crucial to distinguish between speakers due to their distinct institutional roles. The corpus incorporates seven sub-corpora of verbatim transcripts from different court cases with litigants in person and comprises over eleven-million tokens. The focus of this paper is on the interplay between the legal and lay discourse types and how judges project their institutional roles through well-initiated turns directed at litigants in person and counsels. As a versatile discourse marker, well provides a good opportunity to explore how judges have to adapt their roles to ensure lay litigants in person receive the necessary support and that their lack of competence does not impede on the fairness of the proceedings. Given the breadth and importance of the topic of litigation in person, I discuss how the tools and approaches of corpus linguistics can be helpful in this multi-disciplinary area where multiple functions and uses of individual linguistic features need to be explored in depth.
Resumo:
Automatic Term Recognition (ATR) is a fundamental processing step preceding more complex tasks such as semantic search and ontology learning. From a large number of methodologies available in the literature only a few are able to handle both single and multi-word terms. In this paper we present a comparison of five such algorithms and propose a combined approach using a voting mechanism. We evaluated the six approaches using two different corpora and show how the voting algorithm performs best on one corpus (a collection of texts from Wikipedia) and less well using the Genia corpus (a standard life science corpus). This indicates that choice and design of corpus has a major impact on the evaluation of term recognition algorithms. Our experiments also showed that single-word terms can be equally important and occupy a fairly large proportion in certain domains. As a result, algorithms that ignore single-word terms may cause problems to tasks built on top of ATR. Effective ATR systems also need to take into account both the unstructured text and the structured aspects and this means information extraction techniques need to be integrated into the term recognition process.
Resumo:
The density of axons in the optic nerve, olfactory tract and corpus callosum was quantified in non-demented elderly subjects and in Alzheimer’s disease (AD) using an image analysis system. In each fibre tract, there was significant reduction in the density of axons in AD compared with non-demented subjects, the greatest reductions being observed in the olfactory tract and corpus callosum. Axonal loss in the optic nerve and olfactory tract was mainly of axons with smaller myelinated cross-sectional areas. In the corpus callosum, a reduction in the number of ‘thin’ and ‘thick’ fibres was observed in AD, but there was a proportionally greater loss of the ‘thick’ fibres. The data suggest significant degeneration of white matter fibre tracts in AD involving the smaller axons in the two sensory nerves and both large and small axons in the corpus callosum. Loss of axons in AD could reflect an associated white matter disorder and/or be secondary to neuronal degeneration.
Resumo:
The present study examines the effect of the goodness of view on the minimal exposure time required to recognize depth-rotated objects. In a previous study, Verfaillie and Boutsen (1995) derived scales of goodness of view, using a new corpus of images of depth-rotated objects. In the present experiment, a subset of this corpus (five views of 56 objects) is used to determine the recognition exposure time for each view, by increasing exposure time across successive presentations until the object is recognized. The results indicate that, for two thirds of the objects, good views are recognized more frequently and have lower recognition exposure times than bad views.
Resumo:
The Andean forearc of northern Chile comprises four morphotectonic units, which include from east to west: 1) The Cordillera de la Costa: composed of Jurassic granites and andesites, thought to represent a volcanic arc, the Mejillones terrane, an accreted allochthonous terrane, and the Lower Cretaceous Coloso basin, which formed through forearc extension along the suture between the Mejillones terrane and the Jurassic arc. Palaeomagnetic studies of the above units have identified approximately 29+/-11 degrees of clockwise rotation. Rotation is due to extension (caused by subduction roll back and slab pull), at an angle to the direction of absolute motion of the South American Plate. 2) The Central Depression: a large arid basin containing isolated fault-bounded blocks of pre-Mesozoic metamorphosed igneous rocks, Triassic sediments and volcanics, and Jurassic carbonates, deposited in a. back-arc basin setting. The isolated blocks formed through extension along previous thrust faults, these originated through compression of the back-arc basin due to accretion of the Jurassic volcanic arc. 3) The Precordillera.: composed of Permian-Triassic rift-related sediments and volcanics, Jurassic continental sediments synchronous with back-arc basin sedimentation, and Cretaceous and Oligo-Miocene continental sediments deposited in foreland basins. Palaeomagnetism has identified clockwise rotation in rocks ranging in age from Jurassic-Miocene. Rotation in the Precordillera. affected larger structural blocks than in the Cordillera de la Costa. 4) The Salar Depression: a. series of arid continental basins developed on continental crust. These basins nay have originated in the Triassic, when rifting of the South American craton is thought to have taken place. In conclusion, palaeomagnetic and geological evidence is consistent with the view that the north Chilean forearc was largely under an extensional stress regime. However, the presence of extensive compressional structures in Palaeocene and older rocks in the forearc together with the currently active foreland thrust belt of Argentina. indicate that throughout the evolution of the Andean Orogen, a delicate balance between compressional and extensional tectonic regimes has existed.
Resumo:
The present thesis investigates mode related aspects in biology lecture discourse and attempts to identify the position of this variety along the spontaneous spoken versus planned written language continuum. Nine lectures (of 43,000 words) consisting of three sets of three lectures each, given by the three lecturers at Aston University, make up the corpus. The indeterminacy of the results obtained from the investigation of grammatical complexity as measured in subordination motivates the need to take the analysis beyond sentence level to the study of mode related aspects in the use of sentence-initial connectives, sub-topic shifting and paraphrase. It is found that biology lecture discourse combines features typical of speech and writing at sentence as well as discourse level: thus, subordination is more used than co-ordination, but one degree complexity sentence is favoured; some sentence initial connectives are only found in uses typical of spoken language but sub-topic shift signalling (generally introduced by a connective) typical of planned written language is a major feature of the lectures; syntactic and lexical revision and repetition, interrupted structures are found in the sub-topic shift signalling utterance and paraphrase, but the text is also amenable to analysis into sentence like units. On the other hand, it is also found that: (1) while there are some differences in the use of a given feature, inter-speaker variation is on the whole not significant; (2) mode related aspects are often motivated by the didactic function of the variety; and (3) the structuring of the text follows a sequencing whose boundaries are marked by sub-topic shifting and the summary paraphrase. This study enables us to draw four theoretical conclusions: (1) mode related aspects cannot be approached as a simple dichotomy since a combination of aspects of both speech and writing are found in a given feature. It is necessary to go to the level of textual features to identify mode related aspects; (2) homogeneity is dominant in this sample of lectures which suggests that there is a high level of standardization in this variety; (3) the didactic function of the variety is manifested in some mode related aspects; (4) the features studied play a role in the structuring of the text.
Resumo:
Introduction: Resveratrol (RVT) found in red wine protects against erectile dysfunction and relaxes penile tissue (corpus cavernosum) via a nitric oxide (NO) independent pathway. However, the mechanism remains to be elucidated. Hydrogen sulfide (H2S) is a potent vasodilator and neuromodulator generated in corpus cavernosum. Aims: We investigated whether RVT caused the relaxation of mice corpus cavernosum (MCC) through H2S. Methods: H2S formation is measured by methylene blue assay and vascular reactivity experiments have been performed by DMT strip myograph in CD1 MCC strips. Main Outcome Measures: Endothelial NO synthase (eNOS) inhibitor Nω-Nitro-L-arginine (L-NNA, 0.1mM) or H2S inhibitor aminooxyacetic acid (AOAA, 2mM) which inhibits both cystathionine-β-synthase (CBS) and cystathionine-gamma-lyase (CSE) enzyme or combination of AOAA with PAG (CSE inhibitor) has been used in the presence/absence of RVT (0.1mM, 30min) to elucidate the role of NO or H2S pathways on the effects of RVT in MCC. Concentration-dependent relaxations to RVT, L-cysteine, sodium hydrogen sulfide (NaHS) and acetylcholine (ACh) were studied. Results: Exposure of murine corpus cavernosum to RVT increased both basal and L-cysteine-stimulated H2S formation. Both of these effects were reversed by AOAA but not by L-NNA. RVT caused concentration-dependent relaxation of MCC and that RVT-induced relaxation was significantly inhibited by AOAA or AOAA+PAG but not by L-NNA. L-cysteine caused concentration-dependent relaxations, which are inhibited by AOAA or AOAA+PAG significantly. Incubation of MCC with RVT significantly increased L-cysteine-induced relaxation, and this effect was inhibited by AOAA+PAG. However, RVT did not alter the effect of exogenous H2S (NaHS) or ACh-induced relaxations. Conclusions: These results demonstrate that RVT-induced relaxation is at least partly dependent on H2S formation and acts independent of eNOS pathway. In phosphodiesterase 5 inhibitor (PDE-5i) nonresponder population, combination therapy with RVT may reverse erectile dysfunction via stimulating endogenous H2S formation. Yetik-Anacak G, Dereli MV, Sevin G, Ozzayim O, Erac Y, and Ahmed A. Resveratrol stimulates hydrogen sulfide (H2S) formation to relax murine corpus cavernosum.
Resumo:
In this article I argue that the study of the linguistic aspects of epistemology has become unhelpfully focused on the corpus-based study of hedging and that a corpus-driven approach can help to improve upon this. Through focusing on a corpus of texts from one discourse community (that of genetics) and identifying frequent tri-lexical clusters containing highly frequent lexical items identified as keywords, I undertake an inductive analysis identifying patterns of epistemic significance. Several of these patterns are shown to be hedging devices and the whole corpus frequencies of the most salient of these, candidate and putative, are then compared to the whole corpus frequencies for comparable wordforms and clusters of epistemic significance. Finally I interviewed a ‘friendly geneticist’ in order to check my interpretation of some of the terms used and to get an expert interpretation of the overall findings. In summary I argue that the highly unexpected patterns of hedging found in genetics demonstrate the value of adopting a corpus-driven approach and constitute an advance in our current understanding of how to approach the relationship between language and epistemology.