995 resultados para Prototype Selection
Resumo:
The identification of non-linear systems using only observed finite datasets has become a mature research area over the last two decades. A class of linear-in-the-parameter models with universal approximation capabilities have been intensively studied and widely used due to the availability of many linear-learning algorithms and their inherent convergence conditions. This article presents a systematic overview of basic research on model selection approaches for linear-in-the-parameter models. One of the fundamental problems in non-linear system identification is to find the minimal model with the best model generalisation performance from observational data only. The important concepts in achieving good model generalisation used in various non-linear system-identification algorithms are first reviewed, including Bayesian parameter regularisation and models selective criteria based on the cross validation and experimental design. A significant advance in machine learning has been the development of the support vector machine as a means for identifying kernel models based on the structural risk minimisation principle. The developments on the convex optimisation-based model construction algorithms including the support vector regression algorithms are outlined. Input selection algorithms and on-line system identification algorithms are also included in this review. Finally, some industrial applications of non-linear models are discussed.
Resumo:
Clustering analysis of data from DNA microarray hybridization studies is an essential task for identifying biologically relevant groups of genes. Attribute cluster algorithm (ACA) has provided an attractive way to group and select meaningful genes. However, ACA needs much prior knowledge about the genes to set the number of clusters. In practical applications, if the number of clusters is misspecified, the performance of the ACA will deteriorate rapidly. In fact, it is a very demanding to do that because of our little knowledge. We propose the Cooperative Competition Cluster Algorithm (CCCA) in this paper. In the algorithm, we assume that both cooperation and competition exist simultaneously between clusters in the process of clustering. By using this principle of Cooperative Competition, the number of clusters can be found in the process of clustering. Experimental results on a synthetic and gene expression data are demonstrated. The results show that CCCA can choose the number of clusters automatically and get excellent performance with respect to other competing methods.
Resumo:
A melphalan-resistant variant (Roswell Park Memorial Institute (RPMI)-2650M1) and a paclitaxel-resistant variant (RPMI-1650Tx) of the drug-sensitive human nasal carcinoma cell line, RPMI-2650. were established. The multidrug resistance (MDR) phenotype in the RPMI-2650Tx appeared to be P-glycoprotein (PgP)-mediated. Overexpression of multidrug resistant protein (MRP) family members was observed in the RPMI-2650M1 cells, which were also much more invasive in vitro than the parental cell line or the paclitaxel-resistant variant. Increased expression of alpha (2), alpha (5), alpha (6), beta (1) and beta (4) integrin subunits, decreased expression of alpha (4) integrin subunit, stronger adhesion to collagen type IV, laminin, fibronectin and matrigel, increased expression of MMP-2 and MMP-9 and significant motility compared with the parental cells were observed, along with a high invasiveness in the RPMI-7650M1 cells. Decreased expression of the alpha (2) integrin subunit, decreased attachment to collagen type IV, absence of cytokeratin 18 expression, no detectable expression of gelatin-degrading proteases and poor motility may be associated with the non-invasiveness of the RPMI-2650Tx variant. These results suggest that melphalan exposure can result in not only a MDR phenotype. but could also make cancer cells more invasive, whereas paclitaxel exposure resulted in MDR without increasing the in vitro invasiveness in the RPMI-2650 cells. (C) 2001 Elsevier Science Ltd. All rights reserved.
Resumo:
The eng-genes concept involves the use of fundamental known system functions as activation functions in a neural model to create a 'grey-box' neural network. One of the main issues in eng-genes modelling is to produce a parsimonious model given a model construction criterion. The challenges are that (1) the eng-genes model in most cases is a heterogenous network consisting of more than one type of nonlinear basis functions, and each basis function may have different set of parameters to be optimised; (2) the number of hidden nodes has to be chosen based on a model selection criterion. This is a mixed integer hard problem and this paper investigates the use of a forward selection algorithm to optimise both the network structure and the parameters of the system-derived activation functions. Results are included from case studies performed on a simulated continuously stirred tank reactor process, and using actual data from a pH neutralisation plant. The resulting eng-genes networks demonstrate superior simulation performance and transparency over a range of network sizes when compared to conventional neural models. (c) 2007 Elsevier B.V. All rights reserved.
Resumo:
Extraction of dibenzothiophene from dodecane using ionic liquids as the extracting phase has been investigated for a range of ionic liquids with varying cation classes (imidazolium, pyridinium, and pyrrolidinium) and a range of anion types using liquid-liquid partition studies and QSPR (quantitative structure-activity relationship) analysis. The partition ratio of dibenzothiophene to the ionic liquids showed a clear variation with cation class (dimethylpyridinium > methylpyridinium > pyridinium approximate to imidazolium approximate to pyrrolidinium), with much less significant variation with anion type. Polyaromatic quinolinium-based ionic liquids showed even greater extraction potential, but were compromised by higher melting points. For example, 1-butyl-6-methylquinolinium bis{(trifluoromethyl)sulfonyl} amide (mp 47 degrees C) extracted 90% of the available dibenzothiophene from dodecane at 60 degrees C.
Resumo:
Bovine serum albumin (BSA) is a commonly used model protein in the development of pharmaceutical formulations. In order to assay its release from various dosage forms, either the bicinchoninic acid (BCA) assay or a more specific size-exclusion high performance liquid chromatography (SE-HPLC) method are commonly employed. However, these can give erroneous results in the presence of some commonly-used pharmaceutical excipients. We therefore investigated the ability of these methods to accurately determine BSA concentrations in pharmaceutical formulations that also contained various polymers and compared them with a new and compared with a new reverse-phase (RP)–HPLC technique. We found that the RP-HPLC technique was the most suitable method. It gave a linear response in the range of 0.5 -100 µg/ml with a correlation coefficient of 0.9999, a limit of detection of 0.11 µg/ml and quantification of 0.33 µg/ml. The performed ‘t’ test for the estimated and theoretical concentration indicated no significant difference between them providing the accuracy. Low % relative standard deviation values (0.8-1.39%) indicate the precision of the method. Furthermore, the method was used to quantify in vitro BSA release from polymeric freeze-dried formulations.
Resumo:
In this paper we report on our attempts to fit the optimal data selection (ODS) model (Oaksford Chater, 1994; Oaksford, Chater, & Larkin, 2000) to the selection task data reported in Feeney and Handley (2000) and Handley, Feeney, and Harper (2002). Although Oaksford (2002b) reports good fits to the data described in Feeney and Handley (2000), the model does not adequately capture the data described in Handley et al. (2002). Furthermore, across all six of the experiments modelled here, the ODS model does not predict participants' behaviour at the level of selection rates for individual cards. Finally, when people's probability estimates are used in the modelling exercise, the model adequately captures only I out of 18 conditions described in Handley et al. We discuss the implications of these results for models of the selection task and claim that they support deductive, rather than probabilistic, accounts of the task.