5 results for automatic content extraction

at Universidad de Alicante


Relevance:

40.00%

Abstract:

This paper presents an automatic system for extracting and aligning syntactic-semantic patterns, applied to the development of multilingual processing tools. To handle more than one language effectively, we propose the use of syntactic-semantic patterns. Each pattern consists of a verbal head and its main arguments, and the patterns are aligned across languages. The system extracts and aligns the patterns from two manually annotated corpora, and we evaluate the main linguistic problems that arise in the alignment process.

Relevance:

30.00%

Abstract:

In recent years, Twitter has become one of the most important microblogging services of the Web 2.0. Among its possible uses, it can be employed for communicating and broadcasting information in real time. The goal of this research is to analyze the task of automatic tweet generation from a text summarization perspective in the context of the journalism genre. To achieve this, different state-of-the-art summarizers are selected and employed for producing tweets in two languages (English and Spanish). A wide experimental framework is proposed, comprising the creation of a new corpus, the generation of the automatic tweets, and their assessment through a quantitative and a qualitative evaluation, where informativeness, indicativeness and interest are the key criteria to be ensured in the proposed context. The results showed that although the original tweets were considered model tweets with respect to their informativeness, they were not among the most interesting ones from a human viewpoint. Therefore, relying only on these tweets may not be the ideal way to communicate news through Twitter, especially if a more personalized and catchy way of reporting news is desired. In contrast, we showed that recent text summarization techniques may be more appropriate, reflecting a balance between indicativeness and interest, even if their content differed from the tweets delivered by the news providers.

Relevance:

30.00%

Abstract:

An exhaustive characterization of the biogas from some waste disposal facilities has been carried out. The analysis includes the main components (methane, carbon dioxide, nitrogen and oxygen) as well as trace components such as hydrogen sulphide, ammonia and VOCs (volatile organic compounds), including siloxanes and halogenated compounds. VOCs were measured by GC/MS (Gas Chromatography/Mass Spectrometry) using two different procedures: thermal desorption of the Tenax TA and Carbotrap 349 tubes, and SPME (Solid Phase Micro-Extraction). A method has been established to measure the total halogen content of the biogas with the AOX (adsorbable organically bound halogens) technique. The equipment used to analyze the samples was a Total Organic Halogen Analyzer (TOX-100). Similar results were obtained when comparing the TOX (Total Organic Halogen) values with those obtained by GC/MS. The halogen content in all the samples was under 22 mg Cl/Nm³, which is below the limit of 150 mg/Nm³ proposed in the Spanish Regulations for any use of the biogas. The low chlorine content in the biogas studied, as well as the low content of other trace compounds, makes it suitable for use as a fuel for electricity generating engines.

Relevance:

30.00%

Abstract:

A microwave-assisted extraction (MAE) procedure to isolate phenolic compounds from almond skin byproducts was optimized. A three-level, three-factor Box–Behnken design was used to evaluate the effect of almond skin weight, microwave power, and irradiation time on total phenolic content (TPC) and antioxidant activity (DPPH). Almond skin weight was the most important parameter in the studied responses. The best extraction was achieved using 4 g, 60 s, 100 W, and 60 mL of 70% (v/v) ethanol. TPC, antioxidant activity (DPPH, FRAP), and chemical composition (HPLC-DAD-ESI-MS/MS) were determined by using the optimized method from seven different almond cultivars. Successful discrimination was obtained for all cultivars by using multivariate linear discriminant analysis (LDA), suggesting the influence of cultivar type on polyphenol content and antioxidant activity. The results show the potential of almond skin as a natural source of phenolics and the effectiveness of MAE for the reutilization of these byproducts.
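
As a rough illustration of the three-level, three-factor Box–Behnken design mentioned in this abstract, the sketch below enumerates the runs in coded levels (-1, 0, +1). The function name `box_behnken` and the single centre point are assumptions made for illustration; the abstract does not state the number of centre replicates or the physical values behind each coded level.

```python
from itertools import combinations, product

def box_behnken(n_factors, n_center=1):
    """Return coded Box-Behnken runs (-1, 0, +1) as a list of tuples.

    Each pair of factors is varied over its four (+/-1, +/-1) corners
    while every other factor is held at its centre level (0).
    n_center is an assumed number of centre-point replicates.
    """
    runs = []
    for i, j in combinations(range(n_factors), 2):
        for a, b in product((-1, 1), repeat=2):
            run = [0] * n_factors
            run[i], run[j] = a, b
            runs.append(tuple(run))
    runs.extend([(0,) * n_factors] * n_center)
    return runs

# Factor names taken from the abstract; levels stay coded because the
# abstract does not list the low/centre/high values that were used.
factors = ("almond skin weight", "microwave power", "irradiation time")
design = box_behnken(len(factors))

for run in design:
    print(dict(zip(factors, run)))
```

With three factors this gives 12 edge-midpoint runs plus the assumed centre run, far fewer than the 27 runs of a full three-level factorial.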

Relevance:

30.00%

Abstract:

Automatic video segmentation plays a vital role in sports video annotation. This paper presents a fully automatic and computationally efficient algorithm for the analysis of sports videos. Various methods of automatic shot boundary detection have been proposed to perform automatic video segmentation. These investigations mainly concentrate on detecting fades and dissolves for fast processing of the entire video scene, without providing any additional feedback on object relativity within the shots. The goal of the proposed method is to identify regions that perform certain activities in a scene. The model uses low-level video feature processing algorithms to extract the shot boundaries from a video scene and to identify dominant colours within these boundaries. An object classification method is used to cluster the seed distributions of the dominant colours into homogeneous regions. A simple tracking method then classifies these regions as active or static. The efficiency of the proposed framework is demonstrated on a standard video benchmark with numerous types of sports events, and the experimental results show that the algorithm can be used with high accuracy for automatic annotation of active regions in sports videos.
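
The first two stages described in this abstract (shot boundary extraction and dominant colour identification) can be sketched as follows. This is not the authors' algorithm: it swaps in a plain colour-histogram difference that detects hard cuts only, and every function name, the bin count `levels` and the `threshold` value are assumptions chosen for illustration. The clustering and tracking stages are not covered.

```python
from collections import Counter

def colour_histogram(frame, levels=4):
    """Count pixels per coarse RGB bin; frame is a list of (r, g, b) in 0..255."""
    step = 256 // levels
    return Counter((r // step, g // step, b // step) for r, g, b in frame)

def histogram_distance(h1, h2):
    """Normalised L1 distance between two histograms, in [0, 1]."""
    total = sum(h1.values()) + sum(h2.values())
    if total == 0:
        return 0.0
    diff = sum(abs(h1[k] - h2[k]) for k in set(h1) | set(h2))
    return diff / total

def shot_boundaries(frames, threshold=0.5, levels=4):
    """Indices i at which a new shot starts (hard cuts only, assumed threshold)."""
    hists = [colour_histogram(f, levels) for f in frames]
    return [i for i in range(1, len(hists))
            if histogram_distance(hists[i - 1], hists[i]) > threshold]

def dominant_colours(frames, boundaries, levels=4):
    """Most frequent colour bin in each shot delimited by the boundaries."""
    edges = [0, *boundaries, len(frames)]
    result = []
    for start, end in zip(edges, edges[1:]):
        shot_hist = Counter()
        for frame in frames[start:end]:
            shot_hist.update(colour_histogram(frame, levels))
        if shot_hist:  # skip empty shots (e.g. no frames at all)
            result.append(shot_hist.most_common(1)[0][0])
    return result

# Toy input: three uniformly green frames followed by two red ones.
green = [(0, 200, 0)] * 16
red = [(200, 0, 0)] * 16
frames = [green] * 3 + [red] * 2

cuts = shot_boundaries(frames)
colours = dominant_colours(frames, cuts)
print(cuts, colours)
```

Gradual transitions such as the fades and dissolves mentioned in the abstract would need a windowed or cumulative comparison instead of the frame-to-frame test used here.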