881 results for Information extraction, unstructured data analysis, Semantic Web, data mining, text mining, big data, open data, text classification.


Relevance:

100.00% 100.00%

Publisher:

Abstract:

We propose using Tensor Space Modeling (TSM) to represent and analyze users’ web log data, which reflect multiple interests and span multiple dimensions. We further propose using the decomposition factors of the tensors to cluster users by similarity of search behaviour. Preliminary results show that the proposed method outperforms traditional clustering based on the Vector Space Model (VSM).
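To make the contrast between the two representations concrete, the following is a minimal sketch, not the authors' method. It assumes a toy tensor indexed by user, search term, and time slot (the abstract does not name the dimensions), and it only compares a flattened term-count view with the full user slice; the decomposition and clustering steps are not implemented. All names and data are hypothetical.

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length numeric lists."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

def term_totals(user_slice):
    """VSM-style view: collapse the time dimension, keeping one count per term."""
    return [sum(row) for row in user_slice]

def flatten(user_slice):
    """Tensor-style view: keep every (term, time slot) cell of the user's slice."""
    return [value for row in user_slice for value in row]

# Hypothetical data: tensor[user][term][time_slot] = number of searches.
tensor = {
    "user_a": [[3, 0], [1, 0]],  # searches terms 0 and 1 only in slot 0
    "user_b": [[0, 3], [0, 1]],  # same terms, but only in slot 1
}

vsm_similarity = cosine(term_totals(tensor["user_a"]), term_totals(tensor["user_b"]))
tsm_similarity = cosine(flatten(tensor["user_a"]), flatten(tensor["user_b"]))

print(vsm_similarity)  # close to 1.0: the term-only view cannot tell the users apart
print(tsm_similarity)  # 0.0: the extra dimension separates them
```

In this toy case the two users look identical once the time dimension is collapsed, which is the kind of information a multi-dimensional model is meant to retain.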

Relevance:

100.00% 100.00%

Publisher:

Abstract:

The objective of this exploratory study was to identify the key factors that enhance and inhibit the export activities of wineries and to identify differences between exporters and non-exporters. Based on data collected from Chilean wineries, the findings suggest that the major constraints for non-exporters are a lack of financial resources, limited quantities of stock for market expansion, management’s lack of knowledge and experience, and the high cost of travelling to and participating in trade shows. In addition, the main international markets for Chilean wineries were not psychically close markets as has been found for Australian or other wine industries. For wineries oriented towards the domestic market, cellar door sales were an important source of revenue. Finally, the results show that managers have educational levels and international experience exceeding those of other comparable New World wineries.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Many existing information retrieval models do not explicitly take into account information about word associations. Our approach makes use of first and second order relationships found in natural language, known as syntagmatic and paradigmatic associations, respectively. This is achieved by using a formal model of word meaning within the query expansion process. On ad hoc retrieval, our approach achieves statistically significant improvements in MAP (0.158) and P@20 (0.396) over our baseline model. The ERR@20 and nDCG@20 of our system were 0.249 and 0.192, respectively. Our results and discussion suggest that information about both syntagmatic and paradigmatic associations can help improve retrieval effectiveness on ad hoc retrieval.
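The distinction between first order (syntagmatic) and second order (paradigmatic) associations can be illustrated with a toy sketch. It is not the formal model of word meaning used in the paper, which the abstract does not specify. It assumes that words co-occurring in the same sentence are syntagmatically associated, and that words with similar co-occurrence profiles are paradigmatically associated. The corpus and function names are hypothetical.

```python
import math
from collections import Counter, defaultdict

def cooccurrence(sentences):
    """Count, for each word, how often every other word shares a sentence with it."""
    counts = defaultdict(Counter)
    for sentence in sentences:
        words = set(sentence.split())
        for word in words:
            for other in words:
                if other != word:
                    counts[word][other] += 1
    return counts

def cosine(profile_a, profile_b):
    """Cosine similarity between two sparse co-occurrence profiles."""
    dot = sum(profile_a[key] * profile_b[key] for key in profile_a if key in profile_b)
    norm_a = math.sqrt(sum(v * v for v in profile_a.values()))
    norm_b = math.sqrt(sum(v * v for v in profile_b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

# Hypothetical four-sentence corpus.
sentences = ["strong coffee", "strong tea", "hot coffee", "hot tea"]
counts = cooccurrence(sentences)

# First order: "coffee" and "strong" appear together once.
syntagmatic = counts["coffee"]["strong"]
# "coffee" and "tea" never appear together...
direct = counts["coffee"]["tea"]
# ...but they occur with the same neighbours, so their profiles match.
paradigmatic = cosine(counts["coffee"], counts["tea"])

print(syntagmatic, direct, paradigmatic)
```

A query expansion step could, under these assumptions, draw candidate terms from both kinds of association: direct neighbours of a query word and words that are substitutable for it.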

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Background: Use of cetuximab, a monoclonal antibody targeting the epidermal growth factor receptor (EGFR), has the potential to increase survival in patients with advanced non-small-cell lung cancer. We therefore compared chemotherapy plus cetuximab with chemotherapy alone in patients with advanced EGFR-positive non-small-cell lung cancer. Methods: In a multinational, multicentre, open-label, phase III trial, chemotherapy-naive patients (≥18 years) with advanced EGFR-expressing, histologically or cytologically proven stage wet IIIB or stage IV non-small-cell lung cancer were randomly assigned in a 1:1 ratio to chemotherapy plus cetuximab or chemotherapy alone. Chemotherapy (cisplatin 80 mg/m² intravenous infusion on day 1, and vinorelbine 25 mg/m² intravenous infusion on days 1 and 8 of every 3-week cycle) was given for up to six cycles. Cetuximab, at a starting dose of 400 mg/m² intravenous infusion over 2 h on day 1 and from day 8 onwards at 250 mg/m² over 1 h per week, was continued after the end of chemotherapy until disease progression or unacceptable toxicity had occurred. The primary endpoint was overall survival. Analysis was by intention to treat. This study is registered with ClinicalTrials.gov, number NCT00148798. Findings: Between October, 2004, and January, 2006, 1125 patients were randomly assigned to chemotherapy plus cetuximab (n=557) or chemotherapy alone (n=568). Patients given chemotherapy plus cetuximab survived longer than those in the chemotherapy-alone group (median 11·3 months vs 10·1 months; hazard ratio for death 0·871 [95% CI 0·762-0·996]; p=0·044). The main cetuximab-related adverse event was acne-like rash (57 [10%] of 548, grade 3). Interpretation: Addition of cetuximab to platinum-based chemotherapy represents a new treatment option for patients with advanced non-small-cell lung cancer. Funding: Merck KGaA. © 2009 Elsevier Ltd. All rights reserved.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Two new statistics based on extreme value theory, Δ(χ²) and Δ(χ), were derived by Gupta et al. We use these statistics to study direction dependence in the HST Key Project data, which provide one of the most precise measurements of the Hubble constant. We also use them to study non-Gaussianity in this data set. Our results for Δ(χ²) show that the significance of direction-dependent systematics is restricted to well below the 1σ confidence limit; however, the presence of non-Gaussian features is subtle. On the other hand, the Δ(χ) statistic, which is more sensitive to direction dependence, shows direction-dependent systematics at a slightly higher confidence level and non-Gaussian features at a level similar to that of the Δ(χ²) statistic.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

R. Jensen and Q. Shen, 'Fuzzy-Rough Attribute Reduction with Application to Web Categorization,' Fuzzy Sets and Systems, vol. 141, no. 3, pp. 469-485, 2004.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

BACKGROUND: Scientists rarely reuse expert knowledge of phylogeny, in spite of years of effort to assemble a great "Tree of Life" (ToL). A notable exception involves the use of Phylomatic, which provides tools to generate custom phylogenies from a large, pre-computed, expert phylogeny of plant taxa. This suggests great potential for a more generalized system that, starting with a query consisting of a list of any known species, would rectify non-standard names, identify expert phylogenies containing the implicated taxa, prune away unneeded parts, and supply branch lengths and annotations, resulting in a custom phylogeny suited to the user's needs. Such a system could become a sustainable community resource if implemented as a distributed system of loosely coupled parts that interact through clearly defined interfaces. RESULTS: With the aim of building such a "phylotastic" system, the NESCent Hackathons, Interoperability, Phylogenies (HIP) working group recruited 2 dozen scientist-programmers to a weeklong programming hackathon in June 2012. During the hackathon (and a three-month follow-up period), 5 teams produced designs, implementations, documentation, presentations, and tests including: (1) a generalized scheme for integrating components; (2) proof-of-concept pruners and controllers; (3) a meta-API for taxonomic name resolution services; (4) a system for storing, finding, and retrieving phylogenies using semantic web technologies for data exchange, storage, and querying; (5) an innovative new service, DateLife.org, which synthesizes pre-computed, time-calibrated phylogenies to assign ages to nodes; and (6) demonstration projects. These outcomes are accessible via a public code repository (GitHub.com), a website (http://www.phylotastic.org), and a server image. 
CONCLUSIONS: Approximately 9 person-months of effort (centered on a software development hackathon) resulted in the design and implementation of proof-of-concept software for 4 core phylotastic components, 3 controllers, and 3 end-user demonstration tools. While these products have substantial limitations, they suggest considerable potential for a distributed system that makes phylogenetic knowledge readily accessible in computable form. Widespread use of phylotastic systems will create an electronic marketplace for sharing phylogenetic knowledge that will spur innovation in other areas of the ToL enterprise, such as annotation of sources and methods and third-party methods of quality assessment.
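The pruning step described in this abstract (reducing a large expert phylogeny to the taxa in a user's query) can be sketched as follows. This is a minimal illustration under stated assumptions, not the phylotastic implementation: the tree is represented as nested tuples with species names as leaves, the species list and function name are hypothetical, and name resolution, branch lengths, and annotations are not handled.

```python
def prune(tree, keep):
    """Restrict a tree to the leaves named in `keep`.

    A tree is either a leaf name (str) or a tuple of child subtrees.
    Returns None when no requested leaf remains; internal nodes left
    with a single child are collapsed into that child.
    """
    if isinstance(tree, str):
        return tree if tree in keep else None
    children = [sub for sub in (prune(child, keep) for child in tree) if sub is not None]
    if not children:
        return None
    if len(children) == 1:
        return children[0]
    return tuple(children)

# Hypothetical "expert" phylogeny and query list.
expert_tree = (
    (("Homo sapiens", "Pan troglodytes"), "Gorilla gorilla"),
    ("Mus musculus", "Rattus norvegicus"),
)
query = {"Homo sapiens", "Gorilla gorilla", "Mus musculus"}

custom = prune(expert_tree, query)
print(custom)  # (('Homo sapiens', 'Gorilla gorilla'), 'Mus musculus')
```

Collapsing single-child nodes keeps the pruned tree free of redundant internal nodes; a fuller version would need to sum branch lengths along each collapsed path.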

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Non-market effects of agriculture are often estimated using discrete choice models from stated preference surveys. In this context we propose two ways of modelling attribute non-attendance. The first involves constraining coefficients to zero in a latent class framework, whereas the second is based on stochastic attribute selection and grounded in Bayesian estimation. Their implications are explored in the context of a stated preference survey designed to value landscapes in Ireland. Taking account of attribute non-attendance with these data improves fit and tends to involve two attributes, one of which is likely to be cost, thereby leading to substantive changes in derived welfare estimates.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Introduction: The majority of stage III patients with non-small cell lung cancer (NSCLC) are unsuitable for concurrent chemoradiotherapy, the non-surgical gold standard of care. As the alternative treatment options of sequential chemoradiotherapy and radiotherapy alone are associated with high local failure rates, various intensification strategies have been employed. There is evidence to suggest that altered fractionation using hyperfractionation, acceleration, dose escalation, and individualisation may be of benefit. The MAASTRO group have pioneered the concept of ‘isotoxic’ radiotherapy, which allows individualised dose escalation using hyperfractionated accelerated radiotherapy based on predefined normal tissue constraints. This study aims to evaluate whether delivering isotoxic radiotherapy using intensity modulated radiotherapy (IMRT) is achievable.

Methods and analysis: Isotoxic IMRT is a multicentre feasibility study. From June 2014, a total of 35 patients from 7 UK centres, with a proven histological or cytological diagnosis of inoperable NSCLC and unsuitable for concurrent chemoradiotherapy, will be recruited. A minimum of 2 cycles of induction chemotherapy is mandated before starting isotoxic radiotherapy. The dose of radiation will be increased until the tolerance of one or more of the organs at risk, or the maximum dose of 79.2 Gy, is reached. The primary end point is feasibility, with accrual rates, local control and overall survival as secondary end points. Patients will be followed up for 5 years.

Ethics and dissemination: The study has received ethical approval (REC reference: 13/NW/0480) from the National Research Ethics Service (NRES) Committee North West - Greater Manchester South. The trial is conducted in accordance with the Declaration of Helsinki and Good Clinical Practice (GCP). The trial results will be published in a peer-reviewed journal and presented internationally.

Trial registration number: NCT01836692; Pre-results.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Doctoral thesis, Informatics (Computer Science), Universidade de Lisboa, Faculdade de Ciências, 2015

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Spatial analysis and social network analysis typically take into consideration social processes in specific contexts of geographical or network space. Research in political science increasingly strives to model heterogeneity and spatial dependence. The primary objective of the current study was to better understand, and geographically model, the relationship between “non-political” events, streaming data from social networks, and political climate. Geographic information systems (GIS) are useful tools for organizing and analyzing streaming data from social networks. In this study, geographical and statistical analyses were combined to define the temporal and spatial nature of the data emanating from the popular social network Twitter during the 2014 FIFA World Cup. The study spans the entire globe because Twitter’s geotagging function, which supplies the fundamental data that make this study possible, is not limited to a geographic area. By examining public reactions to an inherently non-political event, this study serves to illuminate broader questions about social behavior and spatial dependence. From a practical perspective, the analyses demonstrate how the discussion of political topics fluctuates according to football matches. Tableau and RapidMiner, in addition to a set of basic statistical methods, were applied to find patterns in social behavior in space and time across different geographic regions. The study yielded some insight into the relationship between an ostensibly non-political event (the World Cup) and public opinion transmitted by social media. The methodology could serve as a prototype for future studies and guide policy makers in governmental and non-governmental organizations in gauging public opinion in certain geographic locations.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

The trabecular bone score (TBS, Med-Imaps, Pessac, France) is an index of bone microarchitecture texture extracted from anteroposterior dual-energy X-ray absorptiometry images of the spine. Previous studies have documented the ability of TBS of the spine to differentiate between women with and without fractures among age- and areal bone mineral density (aBMD)-matched controls, as well as to predict future fractures. In this cross-sectional analysis of data collected from 3 geographically dispersed facilities in the United States, we investigated age-related changes in the microarchitecture of lumbar vertebrae as assessed by TBS in a cohort of non-Hispanic US white women. All subjects were 30 yr of age or older and had an L1-L4 aBMD Z-score within ±2 SD of the population mean. Individuals were excluded if they had fractures, were on any osteoporosis treatment, or had any illness that would be expected to impact bone metabolism. All data were extracted from Prodigy dual-energy X-ray absorptiometry devices (GE-Lunar, Madison, WI). Cross-calibrations between the 3 participating centers were performed for TBS and aBMD. aBMD and TBS were evaluated for spine L1-L4 and also for all other possible vertebral combinations. To validate the cohort, the aBMD normative data of our cohort were compared with the US non-Hispanic white Lunar data provided by the manufacturer. A database of 619 non-Hispanic US white women, ages 30-90 yr, was created. aBMD normative data obtained from this cohort were not statistically different from the non-Hispanic US white Lunar normative data provided by the manufacturer (p = 0.30). This outcome indirectly validates our cohort. TBS values at L1-L4 were weakly inversely correlated with body mass index (r = -0.17) and weight (r = -0.16) and not correlated with height. TBS values for all lumbar vertebral combinations decreased significantly with age. There was a linear decrease of 16.0% (-2.47 T-score) in TBS at L1-L4 between 45 and 90 yr of age (vs. -2.34 for aBMD). The microarchitectural loss rate increased by 50% after age 65 (-0.004 to -0.006). Similar results were obtained for other combinations of lumbar vertebrae. TBS, an index of bone microarchitectural texture, decreases with advancing age in non-Hispanic US white women. Little change in TBS is observed between ages 30 and 45. Thereafter, a progressive decrease is observed with advancing age. The changes we observed in these American women are similar to those previously reported for a French population of white women (r² > 0.99). This reference database will facilitate the use of TBS to assess bone microarchitectural deterioration in clinical practice.