338 results for HDFS bottleneck


Relevance:

10.00% 10.00%

Publisher:

Abstract:

This paper reviews transport and economic development trends over the last 20 years in Spain at a detailed (province, or NUTS3) level. Because Spain sustained a significant transport investment effort in this period, with the support of EU funding, the review offers an excellent perspective from which to shed further light on how the transport-and-regional-development paradigm has shaped decision-making in the transport sector. The paper reviews changes in gross domestic product (GDP), population and motorway endowment for the 47 provinces in mainland Spain. Regional development trends seem to be closely associated with particular local conditions and not clearly associated with transport (motorway) infrastructure endowment. This is consistent with the fact that transport infrastructure has not generally been a critical bottleneck for trade and economic activity during this period. The paper concludes that, in general terms, transport infrastructure investment does not seem to be clearly associated with the otherwise substantial differences in regional development among Spanish mainland provinces during this period.

Relevance:

10.00% 10.00%

Publisher:

Abstract:

Resource analysis aims at inferring the cost of executing programs for any possible input, in terms of a given resource, such as the traditional execution steps, time or memory, and, more recently, energy consumption or user-defined resources (e.g., number of bits sent over a socket, number of database accesses, number of calls to particular procedures, etc.). This is performed statically, i.e., without actually running the programs. Resource usage information is useful for a variety of optimization and verification applications, as well as for guiding software design. For example, programmers can use such information to choose between different algorithmic solutions to a problem; program transformation systems can use cost information to choose between alternative transformations; parallelizing compilers can use cost estimates for granularity control, which tries to balance the overheads of task creation and manipulation against the benefits of parallelization. In this thesis we have significantly improved an existing prototype implementation for resource usage analysis based on abstract interpretation, addressing a number of relevant challenges and overcoming many of its limitations. The goal of that prototype was to show the viability of casting resource analysis as an abstract domain, and how it could overcome important limitations of state-of-the-art resource usage analysis tools. For this purpose, it was implemented as an abstract domain in PLAI, the abstract interpretation framework of the CiaoPP system. We have improved both the design and the implementation of the prototype so that the tool can eventually evolve to the level of industrial application. The abstract operations of such a tool depend heavily on setting up, and finding closed-form solutions of, recurrence relations that represent the resource usage behavior of program components and of the whole program. While many tools exist, such as Computer Algebra Systems (CAS) and libraries, that can find closed-form solutions for some types of recurrences, none of them alone is able to handle all the types of recurrences arising during program analysis. In addition, some types of recurrences cannot be solved by any existing tool. This clearly constitutes a bottleneck for this kind of resource usage analysis. Thus, one of the major challenges we have addressed in this thesis is the design and development of a novel modular framework for solving recurrence relations, able to combine and take advantage of the results of existing solvers. Additionally, we have developed and integrated into our novel solver a technique for finding upper-bound closed-form solutions of a special class of recurrence relations that arise during the analysis of programs with accumulating parameters. Finally, we have integrated the improved resource analysis into the CiaoPP general framework for resource usage verification, and specialized the framework for verifying energy consumption specifications of embedded imperative programs in a real application, showing the usefulness and practicality of the resulting tool.
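To make the role of recurrence relations in the abstract above more concrete, here is a minimal sketch. It is not taken from the thesis or from CiaoPP: the cost model (one step per call, naive list reversal built on append) and the names `cost_append`, `cost_nrev` and `closed_form_nrev` are illustrative assumptions. It shows the kind of recurrence a cost analysis might set up, and checks a hand-derived closed form against it.

```python
from functools import lru_cache


@lru_cache(maxsize=None)
def cost_append(n: int) -> int:
    """Hypothetical step count for appending to a list of length n.

    Recurrence: C_app(0) = 1, C_app(n) = C_app(n - 1) + 1.
    """
    return 1 if n == 0 else cost_append(n - 1) + 1


@lru_cache(maxsize=None)
def cost_nrev(n: int) -> int:
    """Hypothetical step count for naive reversal of a list of length n.

    Recurrence: C_nrev(0) = 1, C_nrev(n) = C_nrev(n - 1) + C_app(n - 1) + 1.
    """
    return 1 if n == 0 else cost_nrev(n - 1) + cost_append(n - 1) + 1


def closed_form_nrev(n: int) -> int:
    """Closed form derived by hand for the recurrence above: (n + 1)(n + 2) / 2."""
    return (n + 1) * (n + 2) // 2


if __name__ == "__main__":
    for n in range(6):
        print(n, cost_nrev(n), closed_form_nrev(n))
```

A static analyzer needs the closed form rather than the recurrence itself, because only the closed form gives the cost as a direct function of input size without running anything.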

Relevance:

10.00% 10.00%

Publisher:

Abstract:

Legumes establish a root-nodule symbiosis with soil bacteria collectively known as rhizobia. This symbiosis allows legumes to benefit from the nitrogen fixation capabilities of rhizobia and thus to grow in the absence of any fixed nitrogen source. This is especially relevant for agriculture, where intensive plant growth depletes soils of usable, fixed nitrogen sources. One of the main features of the root-nodule symbiosis is its specificity: different rhizobia are able to nodulate different legumes. Rhizobium leguminosarum bv. viciae is able to establish an effective symbiosis with four different plant genera (Pisum, Lens, Vicia, Lathyrus), and any given isolate will nodulate any of the four plant genera. A population genomics study with rhizobia isolated from P. sativum, L. culinaris, V. sativa or V. faba, all originating in the same soil, showed that plants select specific genotypes from those available in that soil. This was demonstrated at the genome-wide level, but also for specific genes. Accelerated mesocosm studies with successive plant cultures provided additional evidence on this plant selection and on the nature of the genotypes selected. Finally, representatives from the major rhizobial genotypes isolated from these plants allowed characterization of the size and nature of the respective pangenome and specific genome compartments. These were compared to the different genotypes (symbiotic and non-symbiotic) present in rhizobial populations isolated directly from the soil without plant intervention.

Relevance:

10.00% 10.00%

Publisher:

Abstract:

Rhizobium leguminosarum bv. viciae (Rlv) is a soil bacterium able to establish specific root-nodule symbioses with legumes of four different genera: Pisum, Vicia, Lens and Lathyrus. Rlv isolates from nodules of any of these legumes can nodulate any of them; however, it has been shown that plants select specific rhizobial genotypes from those present in the soil (1,2). We have previously shown this at the genomic level by following a population genomics approach. Pooled genomic sequences from 100 isolates from each of four plant species (P. sativum, L. culinaris, V. faba and V. sativa) show different, specific profiles at the single nucleotide polymorphism (SNP) level for relevant genes. In this work, we describe the extent of Rlv selection from a well-characterized soil population by these four legume hosts after a medium-term mesocosm study. Direct soil isolates from each of these mesocosm studies have been tested for specific rhizobial genes (glnII and fnrN) and symbiotic genes (nodC and nifH). Different populations were characterized further by Sanger sequencing of both the rpoB phylogenetic marker gene and the symbiotic genes nodC and nifH. The distribution and size of the rhizobial population for each legume host changed during the medium-term mesocosm study. In particular, a non-symbiotic group of rhizobia was enriched by all four hosts, in contrast to the symbiotic rhizobia profile, which was specific for each legume host.

Relevance:

10.00% 10.00%

Publisher:

Abstract:

Rhizobium leguminosarum bv. viciae (Rlv) is a bacterium able to establish effective symbioses with four different legume genera: Pisum, Lens, Lathyrus and Vicia. Classic studies using trap plants have previously shown that, given a choice, different plants prefer specific genotypes of rhizobia, which are adapted to the host (1, 2). In previous work we performed a Pool-Seq analysis based on pooled DNA samples from Rlv nodule isolates obtained from Pisum sativum, Lens culinaris, Vicia faba and V. sativa plants, used as rhizobial traps. This experiment allowed us to test the host preference hypothesis: different plant hosts select specific sub-populations of rhizobia from the available population present in a given soil. We observed that plant-selected sub-populations differ at the single nucleotide polymorphism (SNP) level. We selected individual isolates from each sub-population (9 fava-bean isolates, 14 pea isolates, 9 vetch isolates and 9 lentil isolates) and sequenced their genomes at draft level (ca. 30x, 90 contigs). Genomic analyses were carried out using J-species and CMG-Biotools. All the isolates had similar genome size (7.5 Mb) and number of genes (7,300). The resulting Average Nucleotide Identity (ANIm) tree showed that Rhizobium leguminosarum bv. viciae is a highly diverse group. Each plant-selected sub-population showed a closed pangenome and core genomes of similar size (11,500 and 4,800 genes, respectively). Combining all four sub-populations results in a larger, closed pangenome of 19,040 genes and a core genome of similar size (4,392 genes). Each sub-population contains a characteristic set of genes, but no universal, plant-specific genes were found. The core genome obtained from all four sub-populations is probably a representative core genome for Rhizobium leguminosarum, given that the reference genome (Rhizobium leguminosarum bv. viciae strain 3841) contains most of the core genome.
We have also analyzed the symbiotic cluster (nod), and different nod cluster genotypes were found in each sub-population. Supported by MINECO (Consolider-Ingenio 2010, MICROGEN Project, CSD2009-00006).

Relevance:

10.00% 10.00%

Publisher:

Abstract:

Linkage disequilibrium analysis can provide high resolution in the mapping of disease genes because it incorporates information on recombinations that have occurred during the entire period from the mutational event to the present. A circumstance particularly favorable for high-resolution mapping is when a single founding mutation segregates in an isolated population. We review here the population structure of Finland in which a small founder population some 100 generations ago has expanded into 5.1 million people today. Among the 30-odd autosomal recessive disorders that are more prevalent in Finland than elsewhere, several appear to have segregated for this entire period in the “panmictic” southern Finnish population. Linkage disequilibrium analysis has allowed precise mapping and determination of genetic distances at the 0.1-cM level in several of these disorders. Estimates of genetic distance have proven accurate, but previous calculations of the confidence intervals were too small because sampling variation was ignored. In the north and east of Finland the population can be viewed as having been “founded” only after 1500. Disease mutations that have undergone such a founding bottleneck only 20 or so generations ago exhibit linkage disequilibrium and haplotype sharing over long genetic distances (5–15 cM). These features have been successfully exploited in the mapping and cloning of many genes. We review the statistical issues of fine mapping by linkage disequilibrium and suggest that improved methodologies may be necessary to map diseases of complex etiology that may have arisen from multiple founding mutations.
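The contrast the abstract above draws between old and young founder populations can be illustrated with the standard textbook model of linkage disequilibrium decay, D_t = D_0 (1 - θ)^t. This is a minimal sketch under stated assumptions, not the statistical method of the review: the function name `ld_remaining` is hypothetical, and the recombination fraction θ is approximated as the genetic distance in Morgans, which is only reasonable for short distances.

```python
def ld_remaining(generations: int, distance_cm: float) -> float:
    """Fraction of the initial disequilibrium expected to remain.

    Uses D_t / D_0 = (1 - theta) ** t, with theta approximated as
    distance_cm / 100 (an assumption that holds for small distances).
    """
    if generations < 0 or distance_cm < 0:
        raise ValueError("generations and distance_cm must be non-negative")
    theta = distance_cm / 100.0
    return (1.0 - theta) ** generations


if __name__ == "__main__":
    # Older founding event (about 100 generations), marker 0.1 cM away.
    print(f"100 generations, 0.1 cM: {ld_remaining(100, 0.1):.3f}")
    # Younger founding event (about 20 generations), markers 5 and 15 cM away.
    print(f" 20 generations, 5 cM:   {ld_remaining(20, 5.0):.3f}")
    print(f" 20 generations, 15 cM:  {ld_remaining(20, 15.0):.3f}")
```

Under this simple model, disequilibrium after about 100 generations survives only very close to the mutation, which is what permits fine mapping, whereas after about 20 generations a detectable fraction persists over several centimorgans.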

Relevance:

10.00% 10.00%

Publisher:

Abstract:

It has long been assumed that HIV-1 evolution is best described by deterministic evolutionary models because of the large population size. Recently, however, it was suggested that the effective population size (Ne) may be rather small, thereby allowing chance to influence evolution, a situation best described by a stochastic evolutionary model. To gain experimental evidence supporting one of the evolutionary models, we investigated whether the development of resistance to the protease inhibitor ritonavir affected the evolution of the env gene. Sequential serum samples from five patients treated with ritonavir were used for analysis of the protease gene and the V3 domain of the env gene. Multiple reverse transcription–PCR products were cloned, sequenced, and used to construct phylogenetic trees and to calculate the genetic variation and Ne. Genotypic resistance to ritonavir developed in all five patients, but each patient displayed a unique combination of mutations, indicating a stochastic element in the development of ritonavir resistance. Furthermore, development of resistance induced clear bottleneck effects in the env gene. The mean intrasample genetic variation, which ranged from 1.2% to 5.7% before treatment, decreased significantly (P < 0.025) during treatment. In agreement with these findings, Ne was estimated to be very small (500–15,000) compared with the total HIV-1 RNA copy number. This study combines three independent observations, strong population bottlenecking, small Ne, and selection of different combinations of protease-resistance mutations, all of which indicate that HIV-1 evolution is best described by a stochastic evolutionary model.
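As an illustration of what "mean intrasample genetic variation" can mean in practice, the sketch below computes the mean pairwise proportion of differing sites among aligned clone sequences from one sample. The abstract does not state which distance measure the authors used, so the uncorrected p-distance here is an assumption, and the function names and toy sequences are hypothetical.

```python
from itertools import combinations
from typing import Sequence


def p_distance(a: str, b: str) -> float:
    """Proportion of aligned positions at which two sequences differ."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    if not a:
        raise ValueError("sequences must not be empty")
    return sum(x != y for x, y in zip(a, b)) / len(a)


def mean_pairwise_variation(seqs: Sequence[str]) -> float:
    """Mean p-distance over all unordered pairs of sequences in one sample."""
    pairs = list(combinations(seqs, 2))
    if not pairs:
        raise ValueError("at least two sequences are required")
    return sum(p_distance(a, b) for a, b in pairs) / len(pairs)


if __name__ == "__main__":
    # Toy aligned clones, invented for illustration only.
    before = ["ACGTACGT", "ACGTACGA", "ACCTACGT", "TCGTACGT"]
    during = ["ACGTACGT", "ACGTACGT", "ACGTACGA", "ACGTACGT"]
    print(f"before treatment: {mean_pairwise_variation(before):.2%}")
    print(f"during treatment: {mean_pairwise_variation(during):.2%}")
```

A drop in this statistic between samples taken before and during treatment is the kind of signal the abstract interprets as a bottleneck.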

Relevance:

10.00% 10.00%

Publisher:

Abstract:

How a reacting system climbs through a transition state during the course of a reaction has been an intriguing subject for decades. Here we present and quantify a technique to identify and characterize local invariances about the transition state of an N-particle Hamiltonian system, using Lie canonical perturbation theory combined with microcanonical molecular dynamics simulation. We show that at least three distinct energy regimes of dynamical behavior occur in the region of the transition state, distinguished by the extent of their local dynamical invariance and regularity. Isomerization of a six-atom Lennard–Jones cluster illustrates this: up to energies high enough to make the system manifestly chaotic, approximate invariants of motion associated with a reaction coordinate in phase space imply a many-body dividing hypersurface in phase space that is free of recrossings even in a sea of chaos. The method makes it possible to visualize the stable and unstable invariant manifolds leading to and from the transition state, i.e., the reaction path in phase space, and how this regularity turns to chaos with increasing total energy of the system. This, in turn, illuminates a new type of phase space bottleneck in the region of a transition state that emerges as the total energy and mode coupling increase, which keeps a reacting system increasingly trapped in that region.

Relevance:

10.00% 10.00%

Publisher:

Abstract:

The folding mechanism of a 125-bead heteropolymer model for proteins is investigated with Monte Carlo simulations on a cubic lattice. Sequences that do and do not fold in a reasonable time are compared. The overall folding behavior is found to be more complex than that of models for smaller proteins. Folding begins with a rapid collapse followed by a slow search through the semi-compact globule for a sequence-dependent stable core with about 30 out of 176 native contacts which serves as the transition state for folding to a near-native structure. Efficient search for the core is dependent on structural features of the native state. Sequences that fold have large amounts of stable, cooperative structure that is accessible through short-range initiation sites, such as those in anti-parallel sheets connected by turns. Before folding is completed, the system can encounter a second bottleneck, involving the condensation and rearrangement of surface residues. Overly stable local structure of the surface residues slows this stage of the folding process. The relation of the results from the 125-mer model studies to the folding of real proteins is discussed.

Relevance:

10.00% 10.00%

Publisher:

Abstract:

Major histocompatibility complex (MHC) genes encode cell surface proteins whose function is to bind and present intracellularly processed peptides to T lymphocytes of the immune system. Extensive MHC diversity has been documented in many species and is maintained by some form of balancing selection. We report here that both European and North American populations of moose (Alces alces) exhibit very low levels of genetic diversity at an expressed MHC class II DRB locus. The observed polymorphism was restricted to six amino acid substitutions, all in the peptide binding site, and four of these were shared between continents. The data imply that the moose have lost MHC diversity in a population bottleneck, prior to the divergence of the Old and New World subspecies. Sequence analysis of mtDNA showed that the two subspecies diverged at least 100,000 years ago. Thus, viable moose populations with very restricted MHC diversity have been maintained for a long period of time. Both positive selection for polymorphism and intraexonic recombination have contributed to the generation of MHC diversity after the putative bottleneck.

Relevance:

10.00% 10.00%

Publisher:

Abstract:

The rise in global energy demand, the prospect of shrinking energy resources, and global concern over environmental issues have sparked interest in alternative energy sources. Lignocellulosic biomass is abundant and inexpensive, with the potential to complement large-scale fuel production. The degradation of the constituent molecules of the cell wall into fermentable sugars, and then into ethanol, occurs through enzymatic hydrolysis of the biomass. However, the use of enzymes for this purpose is still at an exploratory stage and represents a bottleneck in implementing 2G ethanol technologies at industrial scale, prompting the search for cellulases that are biochemically more active, more stable, and economically viable. This work aimed to characterize endoglucanase I from the fungus Trichoderma harzianum; to that end, the catalytic domain (ThCel7B-CCD) and the full-length protein (ThCel7B-full) were expressed and subjected to biochemical and biophysical assays. The enzyme exhibited an acidophilic profile, with optimal activity at pH 3.0 and 55°C. The protein was also able to hydrolyze a variety of substrates, with the highest hydrolytic activity on β-glucan (75 U mg⁻¹). In thermal stability measurements at 55°C and pH 5, the residual activity remained intact for more than 2 months. Another relevant feature was the high degree of synergism between ThCel7B and ThCel7A. Electron microscopy analyses of oat flakes subjected to hydrolysis with ThCel7B showed the effects of substrate degradation relative to control samples. Taken together, these results are important for understanding the molecular mechanism of ThCel7B and other endoglucanases of the GH7 family, and they also reveal an enzyme of biotechnological interest, since its acidic behavior and thermal stability are relevant characteristics for industrial applications under extremely acidic conditions.

Relevance:

10.00% 10.00%

Publisher:

Abstract:

Extension to new languages is a well-known bottleneck for rule-based systems. Transferring the knowledge available to the system from one language to a new one requires considerable human effort, which typically consists of rewriting large numbers of rules from scratch. Given sufficient annotated data, machine learning algorithms can minimize the costs of such knowledge transfer but, to date, have proved ineffective for some specific tasks. Among these, the recognition and normalization of temporal expressions remains out of their reach. Focusing on this task, and still adhering to the rule-based framework, this paper presents a set of experiments on the automatic porting to Italian of a system originally developed for Spanish. Different automatic rule translation strategies are evaluated and discussed, providing a comprehensive overview of the challenge.

Relevance:

10.00% 10.00%

Publisher:

Abstract:

This study received partial financial support from the Séneca Foundation, Agencia Regional de Ciencia y Tecnología de la Región de Murcia, Spain (11881/PI/09). The first author (MGW) was supported by Fundação para a Ciência e Tecnologia (FCT, Portugal) postdoctoral grant (SFRH/BPD/70689/2010). JD was supported through an Erasmus grant (2011–2013) of the European MSc in Marine Biodiversity and Conservation (EMBC).

Relevance:

10.00% 10.00%

Publisher:

Abstract:

Customizing shoe manufacturing is one of the great challenges in the footwear industry. It is a change of production model in which design becomes not only the main actor but also the main bottleneck. It is therefore necessary to accelerate this process by improving the accuracy of current methods. Rapid prototyping techniques are based on the reuse of manufactured footwear lasts, which can be modified with CAD systems to arrive rapidly at new shoe models. In this work, we present a fast shoe last reconstruction method that fits current design and manufacturing processes. The method is based on scanning the shoe last to obtain sections and establishing a fixed number of landmarks on those sections to reconstruct the 3D surface of the last. Automated landmark extraction is accomplished through the use of a self-organizing network, the growing neural gas (GNG), which is able to topographically map the low dimensionality of the network to the high dimensionality of the contour manifold without requiring a priori knowledge of the input space structure. Moreover, our GNG landmark method is tolerant to noise and eliminates outliers. Our method accelerates the surface reconstruction and filtering processes used by current shoe last design software by up to 12 times. The proposed method offers higher accuracy than methods of similar efficiency, such as voxel grid.

Relevance:

10.00% 10.00%

Publisher:

Abstract:

A major problem related to the treatment of ecosystems is that they have no available mathematical formalization. This implies that many of their properties are not presented as short, rigorous modalities, but rather as long expressions which, from a biological standpoint, totally capture the significance of the property, but which have the disadvantage of not being sufficiently manageable, from a mathematical standpoint. The interpretation of ecosystems through networks allows us to employ the concepts of coverage and invariance alongside other related concepts. The latter will allow us to present the two most important relations in an ecosystem – predator–prey and competition – in a different way. Biological control, defined as “the use of living organisms, their resources or their products to prevent or reduce loss or damage caused by pests”, is now considered the environmentally safest and most economically advantageous method of pest control (van Lenteren, 2011). A guild includes all those organisms that share a common food resource (Polis et al., 1989), which in the context of biological control means all the natural enemies of a given pest. There are several types of intraguild interactions, but the one that has received most research attention is intraguild predation, which occurs when two organisms share the same prey while at the same time participating in some kind of trophic interaction. However, this is not the only intraguild relationship possible, and studies are now being conducted on others, such as oviposition deterrence. In this article, we apply the developed concepts of structural functions, coverage, invariant sets, etc. (Lloret et al., 1998, Esteve and Lloret, 2006a, Esteve and Lloret, 2006b and Esteve and Lloret, 2007) to a tritrophic system that includes aphids, one of the most damaging pests and a current bottleneck for the success of biological control in Mediterranean greenhouses.