882 results for word likelihood scores


Relevance: 20.00%

Abstract:

We propose a unifying picture where the notion of generalized entropy is related to information theory by means of a group-theoretical approach. The group structure comes from the requirement that an entropy be well defined with respect to the composition of independent systems, in the context of a recently proposed generalization of the Shannon-Khinchin axioms. We associate to each member of a large class of entropies a generalized information measure, satisfying the additivity property on a set of independent systems as a consequence of the underlying group law. At the same time, we also show that Einstein's likelihood function naturally emerges as a byproduct of our informational interpretation of (generally nonadditive) entropies. These results confirm the adequacy of composable entropies both in physical and social science contexts.
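As an illustration of what composability over independent systems means, the sketch below checks a well-known special case numerically. It uses the Tsallis entropy, which is an example chosen here and is not named in the abstract; the function names and the sample distributions are hypothetical. For independent systems A and B, the Tsallis entropy of the joint system equals S(A) + S(B) + (1 - q) S(A) S(B), so the entropy of the whole depends only on the entropies of the parts.

```python
import math


def tsallis_entropy(probs, q):
    """Tsallis entropy S_q = (1 - sum(p**q)) / (q - 1), with k = 1.

    Falls back to the Shannon entropy in the limit q -> 1.
    """
    if abs(q - 1.0) < 1e-12:
        return -sum(p * math.log(p) for p in probs if p > 0)
    return (1.0 - sum(p ** q for p in probs if p > 0)) / (q - 1.0)


def joint_independent(p_a, p_b):
    """Joint distribution of two independent systems (product of marginals)."""
    return [pa * pb for pa in p_a for pb in p_b]


def compose(s_a, s_b, q):
    """Composition law of the Tsallis entropy for independent systems."""
    return s_a + s_b + (1.0 - q) * s_a * s_b


# Hypothetical example distributions and entropic index.
p_a = [0.2, 0.3, 0.5]
p_b = [0.6, 0.4]
q = 1.7

direct = tsallis_entropy(joint_independent(p_a, p_b), q)
composed = compose(tsallis_entropy(p_a, q), tsallis_entropy(p_b, q), q)
print(abs(direct - composed) < 1e-12)
```

For q = 1 the cross term vanishes and the law reduces to the ordinary additivity of the Shannon entropy.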

Relevance: 20.00%

Abstract:

We evaluate the use of Generalized Empirical Likelihood (GEL) estimators in portfolio efficiency tests for asset pricing models in the presence of conditional information. Estimators from the GEL family present some optimal statistical properties, such as robustness to misspecification and better finite-sample properties. Unlike GMM, the bias of GEL estimators does not increase as more moment conditions are included, a situation that is expected in conditional efficiency analysis. We find some evidence that estimators from the GEL class do perform differently in small samples, where efficiency tests using GEL generate lower estimates than tests using the standard GMM approach. Monte Carlo experiments show that GEL performs better when distortions are present in the data, especially under heavy tails and Gaussian shocks.

Relevance: 20.00%

Abstract:

This article analyzes the solutions given in Spanish translations to the morphological creativity shown in the names of Marvel comic book characters. The English versions almost invariably provide a full description of the hero (or villain) by means of a wide variety of word-formation mechanisms leading to highly expressive charactonyms. Examples are listed of names of comic book heroes created through compounding, derivation (including prefixation and suffixation, both classical and Anglo-Saxon but also from other origins), lexical blending, abbreviation, clipping, onomatopoeia, and borrowings from Spanish or from other languages. Early translations into Spanish seemed to be slightly less expressive than the original, even when the same word-formation mechanism was used, usually due either to problems of transparency, mainly in some of the word parts, or to translation constraints. In later periods, a number of factors, including the influence of other media featuring the same characters and the general trend towards globalization through English, have led translators to choose repetition as the most frequent strategy, which has almost eliminated the creative power of word-formation mechanisms in Spanish and their ability to convey the stylistic effects found in the English versions.

Relevance: 20.00%

Abstract:

Unlike traditional approaches, new communicative trends disregard the role of word-formation mechanisms. They tend to focus on syntax and/or vocabulary without analyzing the mechanisms involved in the creation of lexical items. In this paper, based on the analysis of the use of prefixes by L2 learners in oral and written productions, as provided by the SULEC, we emphasize the advantages that word-formation awareness and knowledge may have for learners in terms of production, creativity, understanding, autonomy, and proficiency. Through the teaching of word-formation, learners may more easily decipher, decode, and/or encode messages, create words they have never seen before, etc.

Relevance: 20.00%

Abstract:

This paper presents a method for sentence-level subjectivity detection based on subjective word sense disambiguation. To this end, a semantic disambiguation method based on sense clustering is extended to determine when the words in a sentence are being used subjectively or objectively. Our proposal uses semantic resources annotated with polarity values and emotions to determine when a word sense can be considered subjective or objective. We present an experimental study on subjectivity detection in sentences, which considers the MPQA corpus and Movie Review Dataset collections, as well as the semantic resources SentiWordNet, Micro-WNOp, and WordNet-Affect. The results obtained show that our proposal contributes significantly to subjectivity detection.

Relevance: 20.00%

Abstract:

This study examined the reliability and validity evidence drawn from the scores of the French version of the Questionnaire about Interpersonal Difficulties for Adolescents (QIDA) in a sample of 957 adolescents (48.5% boys) ranging in age from 11 to 18 years (M = 14.48, SD = 1.85). Principal axis factoring (PAF) and confirmatory factor analyses (CFA) were performed to determine the fit of the factor structure of scores on the QIDA. PAF and CFA replicated the previously identified correlated five-factor structure of the QIDA: Assertiveness, Heterosexual Relationships, Public Speaking, Family Relationships, and Close Friendships. The QIDA yielded acceptable reliability scores for French adolescents. Validity evidence for the QIDA was also established through correlations with scores on the School Anxiety Inventory and the Social Anxiety Scale for Adolescents. Most of the correlations were positive and exceeded the established criteria of statistical significance, but their magnitude varied according to the scales of the QIDA. Results supported the reliability and validity evidence drawn from the scores of the French version of the QIDA.

Relevance: 20.00%

Abstract:

Internet traffic classification is a relevant and mature research field, yet one of growing importance and with open technical challenges, partly due to the pervasive presence of Internet-connected devices in everyday life. We claim the need for innovative traffic classification solutions that are lightweight, adopt a domain-based approach, and not only concentrate on application-level protocol categorization but also classify Internet traffic by subject. To this end, this paper proposes an original classification solution that leverages domain name information extracted from IPFIX summaries, DNS logs, and DHCP leases, and that can be applied to any kind of traffic. Our proposed solution is based on an extension of Word2vec unsupervised learning techniques running on a specialized Apache Spark cluster. In particular, learning techniques are leveraged to generate word embeddings from a mixed dataset composed of domain names and natural language corpora in a lightweight way and with general applicability. The paper also reports lessons learnt from our implementation and deployment experience, which demonstrates that our solution can process 5500 IPFIX summaries per second on an Apache Spark cluster with 1 slave instance in Amazon EC2 at a cost of $3860 per year. Reported experimental results for Precision, Recall, F-Measure, Accuracy, and Cohen's Kappa show the feasibility and effectiveness of the proposal. The experiments prove that words contained in domain names do have a relation with the kind of traffic directed towards them; therefore, using specifically trained word embeddings, we are able to classify them into customizable categories. We also show that training word embeddings on larger natural language corpora leads to improvements in precision of up to 180%.
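For readers unfamiliar with the evaluation measures named in this abstract, the following minimal sketch computes them from a confusion matrix. It assumes a binary (one category versus the rest) setting, which is a simplification made here and may not match the paper's evaluation setup; the function name and the counts are hypothetical and are not results from the paper.

```python
def classification_metrics(tp, fp, fn, tn):
    """Precision, recall, F-measure, accuracy, and Cohen's kappa.

    Inputs are the four cells of a binary confusion matrix.
    Assumes every denominator below is non-zero.
    """
    n = tp + fp + fn + tn
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    f_measure = 2 * precision * recall / (precision + recall)
    accuracy = (tp + tn) / n
    # Agreement expected by chance, from the row and column totals.
    expected = ((tp + fp) * (tp + fn) + (fn + tn) * (fp + tn)) / (n * n)
    kappa = (accuracy - expected) / (1 - expected)
    return {
        "precision": precision,
        "recall": recall,
        "f_measure": f_measure,
        "accuracy": accuracy,
        "kappa": kappa,
    }


# Hypothetical counts, for illustration only.
metrics = classification_metrics(tp=40, fp=10, fn=5, tn=45)
for name, value in metrics.items():
    print(f"{name}: {value:.4f}")
```

Cohen's kappa corrects accuracy for the agreement expected by chance, which makes it more informative than raw accuracy when traffic categories are unbalanced.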

Relevance: 20.00%

Abstract:

Eight standard inbred mouse strains were evaluated for ethanol effects on a refined battery of behavioral tests in a study that was originally designed to assess the influence of rat odors in the colony on mouse behaviors. As part of the design of the study, two experimenters conducted the tests, and the study was carefully balanced so that equal numbers of mice in all groups and times of day were tested by each experimenter. A defect in airflow in the facility compromised the odor manipulation, and in fact the different odor exposure groups did not differ in their behaviors. The two experimenters, however, obtained markedly different results for three of the tests. Certain of the experimenter effects arose from the way they judged behaviors that were not automated and had to be rated by the experimenter, such as slips on the balance beam. Others were not evident prior to ethanol injection but had a major influence after the injection. For several measures, the experimenter effects were notably different for different inbred strains. Methods to evaluate and reduce the impact of experimenter effects in future research are discussed.

Relevance: 20.00%

Abstract:

We compare eight pollen records reflecting climatic and environmental change from the tropical Andes. Our analysis focuses on the last 50 ka, with particular emphasis on the Pleistocene to Holocene transition. We explore ecological grouping and downcore ordination results as two approaches for extracting environmental variability from pollen records. We also use the records of aquatic and shoreline vegetation as markers for lake level fluctuations and precipitation change. Our analysis focuses on the signature of millennial-scale variability in the tropical Andes, in particular Heinrich stadials and Greenland interstadials. We identify rapid responses of the tropical vegetation to this climate variability, and relate differences between sites to moisture sources and site sensitivity.