271 resultados para partitions
Resumo:
Packet forwarding is a memory-intensive application requiring multiple accesses through a trie structure. The efficiency of a cache for this application critically depends on the placement function to reduce conflict misses. Traditional placement functions use a one-level mapping that naively partitions trie-nodes into cache sets. However, as a significant percentage of trie nodes are not useful, these schemes suffer from a non-uniform distribution of useful nodes to sets. This in turn results in increased conflict misses. Newer organizations such as variable associativity caches achieve flexibility in placement at the expense of increased hit-latency. This makes them unsuitable for L1 caches.We propose a novel two-level mapping framework that retains the hit-latency of one-level mapping yet incurs fewer conflict misses. This is achieved by introducing a secondlevel mapping which reorganizes the nodes in the naive initial partitions into refined partitions with near-uniform distribution of nodes. Further as this remapping is accomplished by simply adapting the index bits to a given routing table the hit-latency is not affected. We propose three new schemes which result in up to 16% reduction in the number of misses and 13% speedup in memory access time. In comparison, an XOR-based placement scheme known to perform extremely well for general purpose architectures, can obtain up to 2% speedup in memory access time.
Resumo:
This paper reports on our study of the edge of the 2/5 fractional quantum Hall state, which is more complicated than the edge of the 1/3 state because of the presence of edge sectors corresponding to different partitions of composite fermions in the lowest two Lambda levels. The addition of an electron at the edge is a nonperturbative process and it is not a priori obvious in what manner the added electron distributes itself over these sectors. We show, from a microscopic calculation, that when an electron is added at the edge of the ground state in the [N(1), N(2)] sector, where N(1) and N(2) are the numbers of composite fermions in the lowest two Lambda levels, the resulting state lies in either [N(1) + 1, N(2)] or [N(1), N(2) + 1] sectors; adding an electron at the edge is thus equivalent to adding a composite fermion at the edge. The coupling to other sectors of the form [N(1) + 1 + k, N(2) - k], k integer, is negligible in the asymptotically low-energy limit. This study also allows a detailed comparison with the two-boson model of the 2/5 edge. We compute the spectral weights and find that while the individual spectral weights are complicated and nonuniversal, their sum is consistent with an effective two-boson description of the 2/5 edge.
Resumo:
Location area planning problem is to partition the cellular/mobile network into location areas with the objective of minimizing the total cost. This partitioning problem is a difficult combinatorial optimization problem. In this paper, we use the simulated annealing with a new solution representation. In our method, we can automatically generate different number of location areas using Compact Index (CI) to obtain the optimal/best partitions. We compare the results obtained in our method with the earlier results available in literature. We show that our methodology is able to perform better than earlier methods.
Resumo:
Lack of supervision in clustering algorithms often leads to clusters that are not useful or interesting to human reviewers. We investigate if supervision can be automatically transferred for clustering a target task, by providing a relevant supervised partitioning of a dataset from a different source task. The target clustering is made more meaningful for the human user by trading-off intrinsic clustering goodness on the target task for alignment with relevant supervised partitions in the source task, wherever possible. We propose a cross-guided clustering algorithm that builds on traditional k-means by aligning the target clusters with source partitions. The alignment process makes use of a cross-task similarity measure that discovers hidden relationships across tasks. When the source and target tasks correspond to different domains with potentially different vocabularies, we propose a projection approach using pivot vocabularies for the cross-domain similarity measure. Using multiple real-world and synthetic datasets, we show that our approach improves clustering accuracy significantly over traditional k-means and state-of-the-art semi-supervised clustering baselines, over a wide range of data characteristics and parameter settings.
Resumo:
The effectiveness of the last-level shared cache is crucial to the performance of a multi-core system. In this paper, we observe and make use of the DelinquentPC - Next-Use characteristic to improve shared cache performance. We propose a new PC-centric cache organization, NUcache, for the shared last level cache of multi-cores. NUcache logically partitions the associative ways of a cache set into MainWays and DeliWays. While all lines have access to the MainWays, only lines brought in by a subset of delinquent PCs, selected by a PC selection mechanism, are allowed to enter the DeliWays. The PC selection mechanism is an intelligent cost-benefit analysis based algorithm that utilizes Next-Use information to select the set of PCs that can maximize the hits experienced in DeliWays. Performance evaluation reveals that NUcache improves the performance over a baseline design by 9.6%, 30% and 33% respectively for dual, quad and eight core workloads comprised of SPEC benchmarks. We also show that NUcache is more effective than other well-known cache-partitioning algorithms.
Resumo:
Clustering has been the most popular method for data exploration. Clustering is partitioning the data set into sub-partitions based on some measures say the distance measure, each partition has its own significant information. There are a number of algorithms explored for this purpose, one such algorithm is the Particle Swarm Optimization(PSO) which is a population based heuristic search technique derived from swarm intelligence. In this paper we present an improved version of the Particle Swarm Optimization where, each feature of the data set is given significance accordingly by adding some random weights, which also minimizes the distortions in the dataset if any. The performance of the above proposed algorithm is evaluated using some benchmark datasets from Machine Learning Repository. The experimental results shows that our proposed methodology performs significantly better than the previously performed experiments.
Resumo:
We consider refined versions of Markov chains related to juggling introduced by Warrington. We further generalize the construction to juggling with arbitrary heights as well as infinitely many balls, which are expressed more succinctly in terms of Markov chains on integer partitions. In all cases, we give explicit product formulas for the stationary probabilities. The normalization factor in one case can be explicitly written as a homogeneous symmetric polynomial. We also refine and generalize enriched Markov chains on set partitions. Lastly, we prove that in one case, the stationary distribution is attained in bounded time.
Resumo:
The isomerization of glucose into fructose is a large-scale reaction for the production of high-fructose corn syrup, and is now being considered as an intermediate step in the possible route of biomass conversion into fuels and chemicals. Recently, it has been shown that a hydrophobic, large pore, silica molecular sieve having the zeolite beta structure and containing framework Sn4+ (Sn-Beta) is able to isomerize glucose into fructose in aqueous media. Here, I have investigated how this catalyst converts glucose to fructose and show that it is analogous to that achieved with metalloenzymes. Specifically, glucose partitions into the molecular sieve in the pyranose form, ring opens to the acyclic form in the presence of the Lewis acid center (framework Sn4+), isomerizes into the acyclic form of fructose and finally ring closes to yield the furanose product. Akin to the metalloenzyme, the isomerization step proceeds by intramolecular hydride transfer from C2 to C1. Extraframework tin oxides located within hydrophobic channels of the molecular sieve that exclude liquid water can also isomerize glucose to fructose in aqueous media, but do so through a base-catalyzed proton abstraction mechanism. Extraframework tin oxide particles located at the external surface of the molecular sieve crystals or on amorphous silica supports are not active in aqueous media but are able to perform the isomerization in methanol by a base-catalyzed proton abstraction mechanism. Post-synthetic exchange of Na+ with Sn-Beta alters the glucose reaction pathway from the 1,2 intramolecular hydrogen shift (isomerization) to produce fructose towards the 1,2 intramolecular carbon shift (epimerization) that forms mannose. Na+ remains exchanged onto silanol groups during reaction in methanol solvent, leading to a near complete shift in selectivity towards glucose epimerization to mannose. In contrast, decationation occurs during reaction in aqueous solutions and gradually increases the reaction selectivity to isomerization at the expense of epimerization. Decationation and concomitant changes in selectivity can be eliminated by addition of NaCl to the aqueous reaction solution. Thus, framework tin sites with a proximal silanol group are the active sites for the 1, 2 intramolecular hydride shift in the isomerization of glucose to fructose, while these sites with Na-exchanged silanol group are the active sites for the 1, 2 intramolecular carbon shift in epimerization of glucose to mannose.
Resumo:
If E and F are real Banach spaces let Cp,q(E, F) O ≤ q ≤ p ≤ ∞, denote those maps from E to F which have p continuous Frechet derivatives of which the first q derivatives are bounded. A Banach space E is defined to be Cp,q smooth if Cp,q(E,R) contains a nonzero function with bounded support. This generalizes the standard Cp smoothness classification.
If an Lp space, p ≥ 1, is Cq smooth then it is also Cq,q smooth so that in particular Lp for p an even integer is C∞,∞ smooth and Lp for p an odd integer is Cp-1,p-1 smooth. In general, however, a Cp smooth B-space need not be Cp,p smooth. Co is shown to be a non-C2,2 smooth B-space although it is known to be C∞ smooth. It is proved that if E is Cp,1 smooth then Co(E) is Cp,1 smooth and if E has an equivalent Cp norm then co(E) has an equivalent Cp norm.
Various consequences of Cp,q smoothness are studied. If f ϵ Cp,q(E,F), if F is Cp,q smooth and if E is non-Cp,q smooth, then the image under f of the boundary of any bounded open subset U of E is dense in the image of U. If E is separable then E is Cp,q smooth if and only if E admits Cp,q partitions of unity; E is Cp,psmooth, p ˂∞, if and only if every closed subset of E is the zero set of some CP function.
f ϵ Cq(E,F), 0 ≤ q ≤ p ≤ ∞, is said to be Cp,q approximable on a subset U of E if for any ϵ ˃ 0 there exists a g ϵ Cp(E,F) satisfying
sup/xϵU, O≤k≤q ‖ Dk f(x) - Dk g(x) ‖ ≤ ϵ.
It is shown that if E is separable and Cp,q smooth and if f ϵ Cq(E,F) is Cp,q approximable on some neighborhood of every point of E, then F is Cp,q approximable on all of E.
In general it is unknown whether an arbitrary function in C1(l2, R) is C2,1 approximable and an example of a function in C1(l2, R) which may not be C2,1 approximable is given. A weak form of C∞,q, q≥1, to functions in Cq(l2, R) is proved: Let {Uα} be a locally finite cover of l2 and let {Tα} be a corresponding collection of Hilbert-Schmidt operators on l2. Then for any f ϵ Cq(l2,F) such that for all α
sup ‖ Dk(f(x)-g(x))[Tαh]‖ ≤ 1.
xϵUα,‖h‖≤1, 0≤k≤q
Resumo:
A new method of finding the optimal group membership and number of groupings to partition population genetic distance data is presented. The software program Partitioning Optimization with Restricted Growth Strings (PORGS), visits all possible set partitions and deems acceptable partitions to be those that reduce mean intracluster distance. The optimal number of groups is determined with the gap statistic which compares PORGS results with a reference distribution. The PORGS method was validated by a simulated data set with a known distribution. For efficiency, where values of n were larger, restricted growth strings (RGS) were used to bipartition populations during a nested search (bi-PORGS). Bi-PORGS was applied to a set of genetic data from 18 Chinook salmon (Oncorhynchus tshawytscha) populations from the west coast of Vancouver Island. The optimal grouping of these populations corresponded to four geographic locations: 1) Quatsino Sound, 2) Nootka Sound, 3) Clayoquot +Barkley sounds, and 4) southwest Vancouver Island. However, assignment of populations to groups did not strictly reflect the geographical divisions; fish of Barkley Sound origin that had strayed into the Gold River and close genetic similarity between transferred and donor populations meant groupings crossed geographic boundaries. Overall, stock structure determined by this partitioning method was similar to that determined by the unweighted pair-group method with arithmetic averages (UPGMA), an agglomerative clustering algorithm.
Resumo:
A extração de regras de associação (ARM - Association Rule Mining) de dados quantitativos tem sido pesquisa de grande interesse na área de mineração de dados. Com o crescente aumento das bases de dados, há um grande investimento na área de pesquisa na criação de algoritmos para melhorar o desempenho relacionado a quantidade de regras, sua relevância e a performance computacional. O algoritmo APRIORI, tradicionalmente usado na extração de regras de associação, foi criado originalmente para trabalhar com atributos categóricos. Geralmente, para usá-lo com atributos contínuos, ou quantitativos, é necessário transformar os atributos contínuos, discretizando-os e, portanto, criando categorias a partir dos intervalos discretos. Os métodos mais tradicionais de discretização produzem intervalos com fronteiras sharp, que podem subestimar ou superestimar elementos próximos dos limites das partições, e portanto levar a uma representação imprecisa de semântica. Uma maneira de tratar este problema é criar partições soft, com limites suavizados. Neste trabalho é utilizada uma partição fuzzy das variáveis contínuas, que baseia-se na teoria dos conjuntos fuzzy e transforma os atributos quantitativos em partições de termos linguísticos. Os algoritmos de mineração de regras de associação fuzzy (FARM - Fuzzy Association Rule Mining) trabalham com este princípio e, neste trabalho, o algoritmo FUZZYAPRIORI, que pertence a esta categoria, é utilizado. As regras extraídas são expressas em termos linguísticos, o que é mais natural e interpretável pelo raciocício humano. Os algoritmos APRIORI tradicional e FUZZYAPRIORI são comparado, através de classificadores associativos, baseados em regras extraídas por estes algoritmos. Estes classificadores foram aplicados em uma base de dados relativa a registros de conexões TCP/IP que destina-se à criação de um Sistema de Detecção de Intrusos.
Resumo:
Effective dialogue management is critically dependent on the information that is encoded in the dialogue state. In order to deploy reinforcement learning for policy optimization, dialogue must be modeled as a Markov Decision Process. This requires that the dialogue statemust encode all relevent information obtained during the dialogue prior to that state. This can be achieved by combining the user goal, the dialogue history, and the last user action to form the dialogue state. In addition, to gain robustness to input errors, dialogue must be modeled as a Partially Observable Markov Decision Process (POMDP) and hence, a distribution over all possible states must be maintained at every dialogue turn. This poses a potential computational limitation since there can be a very large number of dialogue states. The Hidden Information State model provides a principled way of ensuring tractability in a POMDP-based dialogue model. The key feature of this model is the grouping of user goals into partitions that are dynamically built during the dialogue. In this article, we extend this model further to incorporate the notion of complements. This allows for a more complex user goal to be represented, and it enables an effective pruning technique to be implemented that preserves the overall system performance within a limited computational resource more effectively than existing approaches. © 2011 ACM.
Resumo:
The fundamental aim of clustering algorithms is to partition data points. We consider tasks where the discovered partition is allowed to vary with some covariate such as space or time. One approach would be to use fragmentation-coagulation processes, but these, being Markov processes, are restricted to linear or tree structured covariate spaces. We define a partition-valued process on an arbitrary covariate space using Gaussian processes. We use the process to construct a multitask clustering model which partitions datapoints in a similar way across multiple data sources, and a time series model of network data which allows cluster assignments to vary over time. We describe sampling algorithms for inference and apply our method to defining cancer subtypes based on different types of cellular characteristics, finding regulatory modules from gene expression data from multiple human populations, and discovering time varying community structure in a social network.
Resumo:
Amblycipitidae Day, 1873 is an Asian family of catfishes (Siluriformes) usually considered to contain 28 species placed in three genera: Amblyceps (14 spp.), Liobagrus (12 spp.) and Xiurenbagrus (2 spp.). Morphology-based systematics has supported the monophyly of this family, with some authors placing Amblycipitidae within a larger group including Akysidae, Sisoridae and Aspredinidae, termed the Sisoroidea. Here we investigate the phylogenetic relationships among four species of Amblyceps, six species of Liobagrus and the two species of Xiurenbagrus with respect to other sisoroid taxa as well as other catfish groups using 6100 aligned base pairs of DNA sequence data from the rag1 and rag2 genes of the nuclear genome and from three regions (cyt b, COL ND4 plus tRNA-His and tRNA-Ser) of the mitochondrial genome. Parsimony and Bayesian analyses of the data indicate strong support for a diphyletic Amblycipitidae in which the genus Amblyceps is the sister group to the Sisoridae and a clade formed by genera Liobagrus and Xiurenbagrus is the sister group to Akysidae. These taxa together form a well supported monophyletic group that assembles all Asian sisoroid taxa, but excludes the South American Aspredinidae. Results for aspredinids are consistent with previous molecular studies that indicate these catfishes are not sisoroids, but the sister group to the South American doradoid catfishes (Auchenipteridae + Doradidae). The redefined sisoroid clade plus Bagridae, Horabagridae and (Ailia + Laides) make up a larger monophyletic group informally termed "Big Asia." Likelihood-based SH tests and Bayes Factor comparisons of the rag and the mitochondrial data partitions considered separately and combined reject both the hypothesis of amblycipitid monophyly and the hypothesis of aspredinid inclusion within Sisoroidea. This result for amblycipitids conflicts with a number of well documented morphological synapomorphies that we briefly review. Possible nomenclatural changes for amblycipitid taxa are noted.
Resumo:
Ontologies play a core role to provide shared knowledge models to semantic-driven applications targeted by Semantic Web. Ontology metrics become an important area because they can help ontology engineers to assess ontology and better control project management and development of ontology based systems, and therefore reduce the risk of project failures. In this paper, we propose a set of ontology cohesion metrics which focuses on measuring (possibly inconsistent) ontologies in the context of dynamic and changing Web. They are: Number of Ontology Partitions (NOP), Number of Minimally Inconsistent Subsets (NMIS) and Average Value of Axiom Inconsistencies (AVAI). These ontology metrics are used to measure ontological semantics rather than ontological structure. They are theoretically validated for ensuring their theoretical soundness, and further empirically validated by a standard test set of debugging ontologies. The related algorithms to compute these ontology metrics also are discussed. These metrics proposed in this paper can be used as a very useful complementarity of existing ontology cohesion metrics.