914 resultados para Plays on words.
Resumo:
Les documents publiés par des entreprises, tels les communiqués de presse, contiennent une foule d’informations sur diverses activités des entreprises. C’est une source précieuse pour des analyses en intelligence d’affaire. Cependant, il est nécessaire de développer des outils pour permettre d’exploiter cette source automatiquement, étant donné son grand volume. Ce mémoire décrit un travail qui s’inscrit dans un volet d’intelligence d’affaire, à savoir la détection de relations d’affaire entre les entreprises décrites dans des communiqués de presse. Dans ce mémoire, nous proposons une approche basée sur la classification. Les méthodes de classifications existantes ne nous permettent pas d’obtenir une performance satisfaisante. Ceci est notamment dû à deux problèmes : la représentation du texte par tous les mots, qui n’aide pas nécessairement à spécifier une relation d’affaire, et le déséquilibre entre les classes. Pour traiter le premier problème, nous proposons une approche de représentation basée sur des mots pivots c’est-à-dire les noms d’entreprises concernées, afin de mieux cerner des mots susceptibles de les décrire. Pour le deuxième problème, nous proposons une classification à deux étapes. Cette méthode s’avère plus appropriée que les méthodes traditionnelles de ré-échantillonnage. Nous avons testé nos approches sur une collection de communiqués de presse dans le domaine automobile. Nos expérimentations montrent que les approches proposées peuvent améliorer la performance de classification. Notamment, la représentation du document basée sur les mots pivots nous permet de mieux centrer sur les mots utiles pour la détection de relations d’affaire. La classification en deux étapes apporte une solution efficace au problème de déséquilibre entre les classes. Ce travail montre que la détection automatique des relations d’affaire est une tâche faisable. Le résultat de cette détection pourrait être utilisé dans une analyse d’intelligence d’affaire.
Resumo:
Les moteurs de recherche font partie de notre vie quotidienne. Actuellement, plus d’un tiers de la population mondiale utilise l’Internet. Les moteurs de recherche leur permettent de trouver rapidement les informations ou les produits qu'ils veulent. La recherche d'information (IR) est le fondement de moteurs de recherche modernes. Les approches traditionnelles de recherche d'information supposent que les termes d'indexation sont indépendants. Pourtant, les termes qui apparaissent dans le même contexte sont souvent dépendants. L’absence de la prise en compte de ces dépendances est une des causes de l’introduction de bruit dans le résultat (résultat non pertinents). Certaines études ont proposé d’intégrer certains types de dépendance, tels que la proximité, la cooccurrence, la contiguïté et de la dépendance grammaticale. Dans la plupart des cas, les modèles de dépendance sont construits séparément et ensuite combinés avec le modèle traditionnel de mots avec une importance constante. Par conséquent, ils ne peuvent pas capturer correctement la dépendance variable et la force de dépendance. Par exemple, la dépendance entre les mots adjacents "Black Friday" est plus importante que celle entre les mots "road constructions". Dans cette thèse, nous étudions différentes approches pour capturer les relations des termes et de leurs forces de dépendance. Nous avons proposé des méthodes suivantes: ─ Nous réexaminons l'approche de combinaison en utilisant différentes unités d'indexation pour la RI monolingue en chinois et la RI translinguistique entre anglais et chinois. En plus d’utiliser des mots, nous étudions la possibilité d'utiliser bi-gramme et uni-gramme comme unité de traduction pour le chinois. Plusieurs modèles de traduction sont construits pour traduire des mots anglais en uni-grammes, bi-grammes et mots chinois avec un corpus parallèle. Une requête en anglais est ensuite traduite de plusieurs façons, et un score classement est produit avec chaque traduction. Le score final de classement combine tous ces types de traduction. Nous considérons la dépendance entre les termes en utilisant la théorie d’évidence de Dempster-Shafer. Une occurrence d'un fragment de texte (de plusieurs mots) dans un document est considérée comme représentant l'ensemble de tous les termes constituants. La probabilité est assignée à un tel ensemble de termes plutôt qu’a chaque terme individuel. Au moment d’évaluation de requête, cette probabilité est redistribuée aux termes de la requête si ces derniers sont différents. Cette approche nous permet d'intégrer les relations de dépendance entre les termes. Nous proposons un modèle discriminant pour intégrer les différentes types de dépendance selon leur force et leur utilité pour la RI. Notamment, nous considérons la dépendance de contiguïté et de cooccurrence à de différentes distances, c’est-à-dire les bi-grammes et les paires de termes dans une fenêtre de 2, 4, 8 et 16 mots. Le poids d’un bi-gramme ou d’une paire de termes dépendants est déterminé selon un ensemble des caractères, en utilisant la régression SVM. Toutes les méthodes proposées sont évaluées sur plusieurs collections en anglais et/ou chinois, et les résultats expérimentaux montrent que ces méthodes produisent des améliorations substantielles sur l'état de l'art.
Resumo:
Automatic keyword or keyphrase extraction is concerned with assigning keyphrases to documents based on words from within the document. Previous studies have shown that in a significant number of cases author-supplied keywords are not appropriate for the document to which they are attached. This can either be because they represent what the author believes the paper is about not what it actually is, or because they include keyphrases which are more classificatory than explanatory e.g., “University of Poppleton” instead of “Knowledge Discovery in Databases”. Thus, there is a need for a system that can generate appropriate and diverse range of keyphrases that reflect the document. This paper proposes a solution that examines the synonyms of words and phrases in the document to find the underlying themes, and presents these as appropriate keyphrases. The primary method explores taking n-grams of the source document phrases, and examining the synonyms of these, while the secondary considers grouping outputs by their synonyms. The experiments undertaken show the primary method produces good results and that the secondary method produces both good results and potential for future work.
Resumo:
The current research agenda for construction process improvement is heavily influenced by the rhetoric of business process re-engineering (BPR). In contrast to the wider literature on BPR, there is little evidence of critical thought within the construction management research community. A postmodernist interpretation is advocated whereby the reality of management practice is defined by the dominant management discourse. The persuasiveness of BPR rhetoric is analysed with particular reference to the way in which it plays on the insecurity of modern managers. Despite the lip service given to ‘empowerment’ and ‘teamwork’, the dominant theme of the re-engineering movement is that of technocratic totalitarianism. From a critical perspective, it is suggested that BPR is imposed on construction organizations to ensure continued control by the industry's dominant power groups. Whilst industry leaders are fond of calling for ‘attitudinal and cultural improvement’, the language of the accepted research agenda continually reinforces the industry's dominant culture of ‘control and command’. Therefore, current research directions in process improvement perpetuate existing attitudes rather than facilitating cultural change. The concept of lean construction is seen to be the latest manifestation of this phenomenon.
Resumo:
Automatic keyword or keyphrase extraction is concerned with assigning keyphrases to documents based on words from within the document. Previous studies have shown that in a significant number of cases author-supplied keywords are not appropriate for the document to which they are attached. This can either be because they represent what the author believes a paper is about not what it actually is, or because they include keyphrases which are more classificatory than explanatory e.g., “University of Poppleton” instead of “Knowledge Discovery in Databases”. Thus, there is a need for a system that can generate an appropriate and diverse range of keyphrases that reflect the document. This paper proposes two possible solutions that examine the synonyms of words and phrases in the document to find the underlying themes, and presents these as appropriate keyphrases. Using three different freely available thesauri, the work undertaken examines two different methods of producing keywords and compares the outcomes across multiple strands in the timeline. The primary method explores taking n-grams of the source document phrases, and examining the synonyms of these, while the secondary considers grouping outputs by their synonyms. The experiments undertaken show the primary method produces good results and that the secondary method produces both good results and potential for future work. In addition, the different qualities of the thesauri are examined and it is concluded that the more entries in a thesaurus, the better it is likely to perform. The age of the thesaurus or the size of each entry does not correlate to performance.
Resumo:
The Plaut, McClelland, Seidenberg and Patterson (1996) connectionist model of reading was evaluated at two points early in its training against reading data collected from British children on two occasions during their first year of literacy instruction. First, the network’s non-word reading was poor relative to word reading when compared with the children. Second, the network made more non-lexical than lexical errors, the opposite pattern to the children. Three adaptations were made to the training of the network to bring it closer to the learning environment of a child: an incremental training regime was adopted; the network was trained on grapheme– phoneme correspondences; and a training corpus based on words found in children’s early reading materials was used. The modifications caused a sharp improvement in non-word reading, relative to word reading, resulting in a near perfect match to the children’s data on this measure. The modified network, however, continued to make predominantly non-lexical errors, although evidence from a small-scale implementation of the full triangle framework suggests that this limitation stems from the lack of a semantic pathway. Taken together, these results suggest that, when properly trained, connectionist models of word reading can offer insights into key aspects of reading development in children.
Resumo:
The present study examined the effects of word length on children’s eye movement behaviour when other variables were carefully controlled. Importantly, the results showed that word length influenced children’s reading times and fixation positions on words. Furthermore, children exhibited stronger word length effects than adults in gaze durations and refixations. Adults and children generally did not differ in initial landing positions, but did differ in refixation behaviour. Overall, the results indicated that while adults and children show similar effects of word length for early measures of eye movement behaviour, differences emerge in later measures.
Resumo:
Significant interindividual variations in health outcome may be caused by the inheritance of variant polymorphic genes, such as CYP2D6 and CYP2E1 for activation, and GSTM1 and GSTT1 for detoxification of chemicals. However. mechanistic studies linking the inheritance of predisposing genes with genotoxic effects towards cancer have yet to be systematically conducted. We have studied 54 lung cancer patients and 50 matched normal controls, who have been cigarette smokers, to elucidate the role of polymorphic genes in cancer. Our data indicates that the inheritance of unfavorable CYP2D6, CYP2E1, and GSTT1 genes is strongly correlated with the smoking-related lung cancer. For heavy cigarette smokers (> 30 pack-years), the smoking habit is the strongest predictor of lung cancer risk irrespective of the inheritance of unfavorable metabolizing genes. For moderate to light smokers (< 30 pack-years), the genetic predisposition plays on important role For the risk (odds ratio = 3.46; 95% CL = 0.46-40.2). Using a subgroup of the study population, we observed that cigarette smokers having the defective GST genes have significantly more chromosome aberrations as determined by the fluorescence-in-situ-hybridization (FISH) technique than smokers with the normal GST genes (P < 0.001). In conclusion, our study provides data to indicate that individuals who have inherited unfavorable metabolizing genes have increased body burden of toxicants to cause increased genetic damage and to have increased risk for cancer. Studies like ours can be used to understand the basis for interindividual variations in cancer outcome, to identify high risk individuals and to assess health risk. (C) 1997 Wiley Liss, Inc.
Resumo:
Studies related with the flight activity of aphids captured with traps can be useful to predict the appearance of alate forms in the field. It is also useful to understand the population dynamics of those insects. This work was carried out at the Faculdade de Ciências Agrárias e Veterinárias, Universidade Estadual Paulista, Jaboticabal, State of São Paulo, during the period from July 1997 to December 1999, aiming to determine the population curve of alate Brevicoryne brassicae (L.) in the region of Jaboticabal and the influence meteorological factors plays on it. The population survey was carried out in four places, and the alate aphids were sampled by yellow water traps. The influence of air temperature, relative humidity, rainfall and sunshine brightness was verified using stepwise regression. The first captures of alate B. brassicae in the traps began at the end of June. In the region of Jaboticabal, the population curve of alate B. brassicae showed the highest frequency of flights during September. The appearance of alate forms in kale fields at initial stage of development was favored by maximum and minimum temperatures of 26.4°C and 13.5°C, respectively; relative humidity of 75.1%, and reduced rainfall.
Resumo:
Pós-graduação em Medicina Veterinária - FCAV
Resumo:
The legendary Don Juan character inhabits the popular imaginary with such a vibrancy of a myth, despite its slight recognition as such by the scientific literature. The purpose of this article is to verify if Don Juan’s play can be understood, scientifically, as a modern myth, as well as understand its mythical nature and its relation to the modern man. A structural and historical analysis of the two first plays on Don Juan has guided us: the one written by Tirso de Molina in 1630, and the one by Molière, written in 1665. The studies allow us to say that Don Juan is a typical modern myth, once it is constituted by narratives that expose a world looming under the signs of individualism, hedonism, arrogance and audacity forged under the appeals of sexual desire.
Resumo:
Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES)
Resumo:
O presente estudo descreve o contexto das brincadeiras de pré-escolares da Ilha do Combu, em Belém-Pará, destacando aspectos do ambiente físico e social, as principais atividades dos adultos e peculiaridades das brincadeiras. Participaram da pesquisa treze crianças de uma turma de educação infantil, onze meninos e duas meninas, entre quatro e cinco anos de idade. As crianças e seus familiares foram entrevistados. As crianças foram observadas brincando em suas casas. Foram analisados os temas, os parceiros, os locais, os objetos e os significados produzidos nas brincadeiras. Apesar de estarem em contato com o contexto urbano, as crianças mostraram-se vinculadas, principalmente, ao contexto ribeirinho.
Resumo:
We present an experimental and numerical study examining the dynamics of a gravity-driven contact line of a thin viscous film traveling down the outside of a vertical cylinder of radius R. Experiments on cylinders with radii ranging between 0.159 and 3.81 cm show that the contact line is unstable to a fingering pattern for two fluids with differing viscosities, surface tensions, and wetting properties. The dynamics of the contact line is studied and results are compared to previous studies of inclined plane experiments in order to understand the influence substrate curvature plays on the fingering pattern. A lubrication model is derived for the film height in the limit that ε = H/R≪1, where H is the upstream film thickness, and in terms of a Bond number ρgR3/(γH), and the linear stability of the contact line is analyzed using traveling wave solutions. Curvature controls the capillary ridge height of the traveling wave and the range of unstable wavelength when ε = O(10-1), whereas the shape and stability of the contact line converge to the behavior one observes on a vertical plane when ε ≤ O(10-2). The most unstable wave mode, cutoff wave mode for neutral stability, and maximum growth rate scale as 0.45 where = ρgR2/γ ≥ 1.3, and the contact line is unstable to fingering when ≥ 0.56. Using the experimental data to extrapolate outside the range of validity of the thin film model, we estimate the contact line is stable when <0.56. Agreement is excellent between the model and the experimental data for the wave number (i.e., number of fingers) and wavelength of the fingering pattern that forms along the contact line.