899 resultados para Topics Extraction
Resumo:
2000 Mathematics Subject Classification: 62H30
Resumo:
传统的主题抽取方法单纯依靠分析网页内容的来自动获取网页主题,其分析结果并不十分精确.在WWW上,网页之间通过超链接来互相联系,而链接关系紧密的网页趋向于属于同一主题.基于这一思想,本文提出了一种利用Web链接结构信息来对主题抽取结果进行求精的方法,其通过所链接网页对本网页的影响来修正本网页的主题权值.本文还通过一个实际应用例子,分析了这一方法的特点.
Resumo:
Guaranteeing the quality of extracted features that describe relevant knowledge to users or topics is a challenge because of the large number of extracted features. Most popular existing term-based feature selection methods suffer from noisy feature extraction, which is irrelevant to the user needs (noisy). One popular method is to extract phrases or n-grams to describe the relevant knowledge. However, extracted n-grams and phrases usually contain a lot of noise. This paper proposes a method for reducing the noise in n-grams. The method first extracts more specific features (terms) to remove noisy features. The method then uses an extended random set to accurately weight n-grams based on their distribution in the documents and their terms distribution in n-grams. The proposed approach not only reduces the number of extracted n-grams but also improves the performance. The experimental results on Reuters Corpus Volume 1 (RCV1) data collection and TREC topics show that the proposed method significantly outperforms the state-of-art methods underpinned by Okapi BM25, tf*idf and Rocchio.
Resumo:
AN ENGINEERING Workshop was held from 21 to 24 November 2006 in Veracruz, Mexico. Forty delegates from 12 countries attended the workshop on theory and practice of milling and diffusion extraction. This report provides a general overview of activities undertaken during that workshop which consisted of five technical sessions over two days with presentations and discussions plus two days of field and factory visits. Topics covered during the technical sessions included: power transmissions, cane preparation, diffusers, mills, and a comparison of milling and diffusion.
Resumo:
This research was directed towards the investigation and development of an aryne route to the syntheses of aporphi ne and dibenzopyrrocolinium (dibenzoindolizinium) alkaloids and to the stability of the latter under the conditions used for aryne formation. The work c an be divided into three main sections . i) - Synthesis of Glaucine 6-Bromo-3,4-dimethoxyphenylacetic acid, prepared by the action of bromine i n acetic acid on3,4-dimethoxyphenylacetic a cid, was converted into its acid chloride by t he action of thionyl chloride. This on treatment with 3,4- dimethoxyphenylethylamine pr ovided N-(3, 4-dimethoxyphenylethyl)- 2-(2-bromo-4,S-dimethoxyphenyl)-acetamide which on dehydration with phosphoryl chloride (Bischler Napieralski reaction) in dry benzene afforded l -(2-bromo-4,S-dimethoxybenzyl)- 3,4-dihydro-6,7-dimethoxyisoquinoline, isolated as hydrochl oride. A new method o f destroying the excess of phosphoryl chloride was developed which proved to be quite useful. Methylation of the dihydroisoquinoline'with methyl iodide in methanol , and subsequent reduction with sodium borohydride provided (±)-6-bromolaudanosine. Act ion of potassamide or sodamide in anhydrous liquid ammonia on (±)-6-bromolaudanosine yielded the corresponding amino derivative along with other products. Diazotization and ring closure of (±)-6-aminolaudanosine then a f forded (±)-glaucine which was isolated as methiodide. ii) - Intramolecular Capture of Aryne During Glaucine Synthesis, and Subsequent Reactions . This section deals with the by-products formed under the conditions of the aryne stage of t he glaucine synthesis. The crude product, obtained in the reaction of potassamide or sodamide in liquid ammonia on (±)-6-bromolaudanosine, was s eparated by chromatography, Three products were separated and identified. a ) - 5,6-Dimethoxy-2-( 3,4-dimethoxy-6-ethylphenyl)-lmethylindole. Two mechanisms are proposed for the formation of this interesting product. This compound also was prepared by the action of potassamide in l,iquid ammonia on 5,6 ,l2,l2atetrahydro- 2,3,9,lO-tetramethoxy-7-methyldibenz[b,g]indolizinium i odide . b) - 5,6-Dimethoxy-2-(3,4-dimethoxy-6-vinylphenyl)-lmethylindoline. Its formation represented a new method of Hofmann degradation . Further confirmation of structure was done by performing the normal Hofmann reaction on 5, 6,12,12a-tetrahydro -2/3,9,lO-tetramethoxy ~7-methyldibe nz[ b,g]indolizinium iodide. The indoline prepared i n this way was identical in all respects with that prepared above . c) - 1- (2-amino-4,5-dimethoxybenzyl ) -l,2,3,4-tetrahydro-2- methyl-6,7-dimethoxyisoquinoline, was converted t o glaucine as stated in section 1 . iii) - Attempt:,ed Sxnthesis of Liriodenine Piperonal was converted into 3,4-methylenedioxyinitrostyrene which on reduction with lithium aluminium hydride provided 3,4-methylenedioxyphenylethylamine. The method of extraction after the reduction was improved t o some extent. The amine on condensation with m-chlorophenylacetyl chloride, prepared by the action of oxalyl chloride on 3,4-methylenedioxyphenylacetic acid, provided N-[ ~ -(3,4-methylenedioxyphenyl)- e thyl)-3-chlorophenylacetamide. This on dehydration with phosphoryl chloride in dry benzene followed by air oxidation afforded l-(3-chlorobenzoyl)-6,7-methylenedioxyi soquinoline. This compound on r eaction with potassamide in liquid ammonia afforded a crude product from which. one product was separated by chromatography i n a pure condition . This yellow compound analysed as,c17Hl ON2021 and was t he main product i n the reaction ; a t entative structure is proposed. A second compound, not obtained in pure condition, was submitted to Pschorr reaction in the hope of obtaining liriodenine, but without success.
Resumo:
Latin America, a region rich in both energy resources and native heritage, faces a rising politico-social confrontation that has been growing for over two decades. While resources like oil and gas are exploited to enhance the state’s economic growth, indigenous groups feel threatened because the operations related to this exploitation are infringing on their homelands. Furthermore, they believe that the potential resource wealth found in these environmentally-sensitive regions is provoking an “intrusion” in their ancestral territory of either government agencies or corporations allowed by governmental decree. Indigenous groups, which have achieved greater political voice over the past decade, are protesting against government violations. These protests have reached the media and received international attention, leading the discourse on topics such as civil and human rights violations. When this happens, the State finds itself “between a rock and a hard place”: In a debate between indigenous groups’ rights and economic sustainability.
Resumo:
Conventional topic models are ineffective for topic extraction from microblog messages since the lack of structure and context among the posts renders poor message-level word co-occurrence patterns. In this work, we organize microblog posts as conversation trees based on reposting and replying relations, which enrich context information to alleviate data sparseness. Our model generates words according to topic dependencies derived from the conversation structures. In specific, we differentiate messages as leader messages, which initiate key aspects of previously focused topics or shift the focus to different topics, and follower messages that do not introduce any new information but simply echo topics from the messages that they repost or reply. Our model captures the different extents that leader and follower messages may contain the key topical words, thus further enhances the quality of the induced topics. The results of thorough experiments demonstrate the effectiveness of our proposed model.