Biblioteca Digital

974 resultados para Benign entity

Chinese named entity recognition based on hierarchical hybrid model

Relevância:

20.00% 20.00%

Publicador:

Veja mais

Analysis of malicious and benign Android applications

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Since its establishment, the Android applications market has been infected by a proliferation of malicious applications. Recent studies show that rogue developers are injecting malware into legitimate market applications which are then installed on open source sites for consumer uptake. Often, applications are infected several times. In this paper, we investigate the behavior of malicious Android applications, we present a simple and effective way to safely execute and analyze them. As part of this analysis, we use the Android application sandbox Droidbox to generate behavioral graphs for each sample and these provide the basis of the development of patterns to aid in identifying it. As a result, we are able to determine if family names have been correctly assigned by current anti-virus vendors. Our results indicate that the traditional anti-virus mechanisms are not able to correctly identify malicious Android applications.

Veja mais

A generic classifier-ensemble approach for biomedical named entity recognition

Relevância:

20.00% 20.00%

Publicador:

Resumo:

In named entity recognition (NER) for biomedical literature, approaches based on combined classifiers have demonstrated great performance improvement compared to a single (best) classifier. This is mainly owed to sufficient level of diversity exhibited among classifiers, which is a selective property of classifier set. Given a large number of classifiers, how to select different classifiers to put into a classifier-ensemble is a crucial issue of multiple classifier-ensemble design. With this observation in mind, we proposed a generic genetic classifier-ensemble method for the classifier selection in biomedical NER. Various diversity measures and majority voting are considered, and disjoint feature subsets are selected to construct individual classifiers. A basic type of individual classifier – Support Vector Machine (SVM) classifier is adopted as SVM-classifier committee. A multi-objective Genetic algorithm (GA) is employed as the classifier selector to facilitate the ensemble classifier to improve the overall sample classification accuracy. The proposed approach is tested on the benchmark dataset – GENIA version 3.02 corpus, and compared with both individual best SVM classifier and SVM-classifier ensemble algorithm as well as other machine learning methods such as CRF, HMM and MEMM. The results show that the proposed approach outperforms other classification algorithms and can be a useful method for the biomedical NER problem.

Veja mais

The master–slave dialectic and the “sado-masochistic entity”

Relevância:

20.00% 20.00%

Publicador:

Veja mais

Benign ovarian cysts and ovarian cancer: a cohort study with implications for screening

Relevância:

20.00% 20.00%

Publicador:

Veja mais

Application of the reporting entity concept and lodgement of special purpose financial statements

Relevância:

20.00% 20.00%

Publicador:

Resumo:

This Research Report analyses the application of the reporting entity concept and the adoption of special purpose financial reporting, particularly by entities lodging financial statements with the Australian Securities and Investments Commission (ASIC) and with state-based regulators in Australia’s three most populous states, namely, Consumer Affairs Victoria, NSW Fair Trading and Queensland Office of Fair Trading. This Report does not cover entities that have their equity interests traded in a public market, such as listed companies, and some other entities with ‘public accountability’.

Veja mais

Properties of bamboo fibres produced using an environmentally benign method

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The properties of bamboo fibres extracted from raw bamboo plants in an environmentally benign manner were investigated. To reduce environmental impacts of the manufacturing process, microwave, ultra-sonication and enzyme were used to extract the bamboo fibres, avoiding the use of hazardous chemicals. The new method enabled the extraction of single fibres while retaining a certain quantity of lignin in fibre. The retained lignin allowed the fibre to possess UV absorption and antibacterial properties, which will be advantageous for many textile applications.

Veja mais

Application of the reporting entity concept in Australia

Relevância:

20.00% 20.00%

Publicador:

Veja mais

A hybrid model for named entity recognition using unstructured medical text

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Named entity recognition (NER) is an essential step in the process of information extraction within text mining. This paper proposes a technique to extract drug named entities from unstructured and informal medical text using a hybrid model of lexicon-based and rule-based techniques. In the proposed model, a lexicon is first used as the initial step to detect drug named entities. Inference rules are then deployed to further extract undetected drug names. The designed rules employ part of speech tags and morphological features for drug name detection. The proposed hybrid model is evaluated using a benchmark data set from the i2b2 2009 medication challenge, and is able to achieve an f-score of 66.97%.

Veja mais

Enhancing medical named entity recognition with an extended segment representation technique

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Objective : The objective of this paper is to formulate an extended segment representation (SR) technique to enhance named entity recognition (NER) in medical applications.

Methods : An extension to the IOBES (Inside/Outside/Begin/End/Single) SR technique is formulated. In the proposed extension, a new class is assigned to words that do not belong to a named entity (NE) in one context but appear as an NE in other contexts. Ambiguity in such cases can negatively affect the results of classification-based NER techniques. Assigning a separate class to words that can potentially cause ambiguity in NER allows a classifier to detect NEs more accurately; therefore increasing classification accuracy.

Results : The proposed SR technique is evaluated using the i2b2 2010 medical challenge data set with eight different classifiers. Each classifier is trained separately to extract three different medical NEs, namely treatment, problem, and test. From the three experimental results, the extended SR technique is able to improve the average F1-measure results pertaining to seven out of eight classifiers. The kNN classifier shows an average reduction of 0.18% across three experiments, while the C4.5 classifier records an average improvement of 9.33%.

Veja mais

Classification ensemble to improve medical named entity recognition

Relevância:

20.00% 20.00%

Publicador:

Resumo:

An accurate Named Entity Recognition (NER) is important for knowledge discovery in text mining. This paper proposes an ensemble machine learning approach to recognise Named Entities (NEs) from unstructured and informal medical text. Specifically, Conditional Random Field (CRF) and Maximum Entropy (ME) classifiers are applied individually to the test data set from the i2b2 2010 medication challenge. Each classifier is trained using a different set of features. The first set focuses on the contextual features of the data, while the second concentrates on the linguistic features of each word. The results of the two classifiers are then combined. The proposed approach achieves an f-score of 81.8%, showing a considerable improvement over the results from CRF and ME classifiers individually which achieve f-scores of 76% and 66.3% for the same data set, respectively.

Veja mais

Enhancement of medical named entity recognition using graph-based features

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Named Entity Recognition (NER) is a crucial step in text mining. This paper proposes a new graph-based technique for representing unstructured medical text. The new representation is used to extract discriminative features that are able to enhance the NER performance. To evaluate the usefulness of the proposed graph-based technique, the i2b2 medication challenge data set is used. Specifically, the 'treatment' named entities are extracted for evaluation using six different classifiers. The F-measure results of five classifiers are enhanced, with an average improvement of up to 26% in performance.

Veja mais