21 resultados para HDFS bottleneck
Resumo:
Due to the relative transparency of its embryos and larvae, the zebrafish is an ideal model organism for bioimaging approaches in vertebrates. Novel microscope technologies allow the imaging of developmental processes in unprecedented detail, and they enable the use of complex image-based read-outs for high-throughput/high-content screening. Such applications can easily generate Terabytes of image data, the handling and analysis of which becomes a major bottleneck in extracting the targeted information. Here, we describe the current state of the art in computational image analysis in the zebrafish system. We discuss the challenges encountered when handling high-content image data, especially with regard to data quality, annotation, and storage. We survey methods for preprocessing image data for further analysis, and describe selected examples of automated image analysis, including the tracking of cells during embryogenesis, heartbeat detection, identification of dead embryos, recognition of tissues and anatomical landmarks, and quantification of behavioral patterns of adult fish. We review recent examples for applications using such methods, such as the comprehensive analysis of cell lineages during early development, the generation of a three-dimensional brain atlas of zebrafish larvae, and high-throughput drug screens based on movement patterns. Finally, we identify future challenges for the zebrafish image analysis community, notably those concerning the compatibility of algorithms and data formats for the assembly of modular analysis pipelines.
Resumo:
This paper reviews the transport and economic development trends for the last 20 years in Spain at a detailed (province or NUTS3) level. As Spain has sustained a significant transport investment effort in this period, with the support of EU funding, this review offers an excellent perspective to put some further light on how the transport-and-regional-development paradigm has shaped decision-making in the transport sector. The paper reviews changes in gross domestic product (GDP), population and motorway endowment for the 47 provinces in mainland Spain. Regional development trends seem to be closely associated to particular local conditions, not clearly associated to transport (motorway) infrastructure endowment. This is consistent with the fact that transport infrastructure has not generally been a critical bottleneck for trade and economic activity during this period. The paper concludes that, in general terms, transport infrastructure investment does not seem to be clearly associated to the otherwise substantial differences in regional development among Spanish mainland provinces during this period.
Resumo:
Resource analysis aims at inferring the cost of executing programs for any possible input, in terms of a given resource, such as the traditional execution steps, time ormemory, and, more recently energy consumption or user defined resources (e.g., number of bits sent over a socket, number of database accesses, number of calls to particular procedures, etc.). This is performed statically, i.e., without actually running the programs. Resource usage information is useful for a variety of optimization and verification applications, as well as for guiding software design. For example, programmers can use such information to choose different algorithmic solutions to a problem; program transformation systems can use cost information to choose between alternative transformations; parallelizing compilers can use cost estimates for granularity control, which tries to balance the overheads of task creation and manipulation against the benefits of parallelization. In this thesis we have significatively improved an existing prototype implementation for resource usage analysis based on abstract interpretation, addressing a number of relevant challenges and overcoming many limitations it presented. The goal of that prototype was to show the viability of casting the resource analysis as an abstract domain, and howit could overcome important limitations of the state-of-the-art resource usage analysis tools. For this purpose, it was implemented as an abstract domain in the abstract interpretation framework of the CiaoPP system, PLAI.We have improved both the design and implementation of the prototype, for eventually allowing an evolution of the tool to the industrial application level. The abstract operations of such tool heavily depend on the setting up and finding closed-form solutions of recurrence relations representing the resource usage behavior of program components and the whole program as well. While there exist many tools, such as Computer Algebra Systems (CAS) and libraries able to find closed-form solutions for some types of recurrences, none of them alone is able to handle all the types of recurrences arising during program analysis. In addition, there are some types of recurrences that cannot be solved by any existing tool. This clearly constitutes a bottleneck for this kind of resource usage analysis. Thus, one of the major challenges we have addressed in this thesis is the design and development of a novel modular framework for solving recurrence relations, able to combine and take advantage of the results of existing solvers. Additionally, we have developed and integrated into our novel solver a technique for finding upper-bound closed-form solutions of a special class of recurrence relations that arise during the analysis of programs with accumulating parameters. Finally, we have integrated the improved resource analysis into the CiaoPP general framework for resource usage verification, and specialized the framework for verifying energy consumption specifications of embedded imperative programs in a real application, showing the usefulness and practicality of the resulting tool.---ABSTRACT---El Análisis de recursos tiene como objetivo inferir el coste de la ejecución de programas para cualquier entrada posible, en términos de algún recurso determinado, como pasos de ejecución, tiempo o memoria, y, más recientemente, el consumo de energía o recursos definidos por el usuario (por ejemplo, número de bits enviados a través de un socket, el número de accesos a una base de datos, cantidad de llamadas a determinados procedimientos, etc.). Ello se realiza estáticamente, es decir, sin necesidad de ejecutar los programas. La información sobre el uso de recursos resulta muy útil para una gran variedad de aplicaciones de optimización y verificación de programas, así como para asistir en el diseño de los mismos. Por ejemplo, los programadores pueden utilizar dicha información para elegir diferentes soluciones algorítmicas a un problema; los sistemas de transformación de programas pueden utilizar la información de coste para elegir entre transformaciones alternativas; los compiladores paralelizantes pueden utilizar las estimaciones de coste para realizar control de granularidad, el cual trata de equilibrar el coste debido a la creación y gestión de tareas, con los beneficios de la paralelización. En esta tesis hemos mejorado de manera significativa la implementación de un prototipo existente para el análisis del uso de recursos basado en interpretación abstracta, abordando diversos desafíos relevantes y superando numerosas limitaciones que éste presentaba. El objetivo de dicho prototipo era mostrar la viabilidad de definir el análisis de recursos como un dominio abstracto, y cómo se podían superar las limitaciones de otras herramientas similares que constituyen el estado del arte. Para ello, se implementó como un dominio abstracto en el marco de interpretación abstracta presente en el sistema CiaoPP, PLAI. Hemos mejorado tanto el diseño como la implementación del mencionado prototipo para posibilitar su evolución hacia una herramienta utilizable en el ámbito industrial. Las operaciones abstractas de dicha herramienta dependen en gran medida de la generación, y posterior búsqueda de soluciones en forma cerrada, de relaciones recurrentes, las cuales modelizan el comportamiento, respecto al consumo de recursos, de los componentes del programa y del programa completo. Si bien existen actualmente muchas herramientas capaces de encontrar soluciones en forma cerrada para ciertos tipos de recurrencias, tales como Sistemas de Computación Algebraicos (CAS) y librerías de programación, ninguna de dichas herramientas es capaz de tratar, por sí sola, todos los tipos de recurrencias que surgen durante el análisis de recursos. Existen incluso recurrencias que no las puede resolver ninguna herramienta actual. Esto constituye claramente un cuello de botella para este tipo de análisis del uso de recursos. Por lo tanto, uno de los principales desafíos que hemos abordado en esta tesis es el diseño y desarrollo de un novedoso marco modular para la resolución de relaciones recurrentes, combinando y aprovechando los resultados de resolutores existentes. Además de ello, hemos desarrollado e integrado en nuestro nuevo resolutor una técnica para la obtención de cotas superiores en forma cerrada de una clase característica de relaciones recurrentes que surgen durante el análisis de programas lógicos con parámetros de acumulación. Finalmente, hemos integrado el nuevo análisis de recursos con el marco general para verificación de recursos de CiaoPP, y hemos instanciado dicho marco para la verificación de especificaciones sobre el consumo de energía de programas imperativas embarcados, mostrando la viabilidad y utilidad de la herramienta resultante en una aplicación real.
Resumo:
Legumes establish a root-nodule symbiosis with soil bacteria collectively known as rhizobia. This symbiosis allows legumes to benefit from the nitrogen fixation capabilities of rhizobia and thus to grow in the absence of any fixed nitrogen source. This is especially relevant for Agriculture, where intensive plant growth depletes soils of useable, fixed nitrogen sources. One of the main features of the root nodule symbiosis is its specificity. Different rhizobia are able to nodulate different legumes. Rhizobium leguminosarum bv. viciae is able to establish an effective symbiosis with four different plant genera (Pisum, Lens, Vicia, Lathyrus), and any given isolate will nodulate any of the four plant genera. A population genomics study with rhizobia isolated from P. sativum, L. culinaris, V. sativa or V. faba, all originating in the same soil, showed that plants select specific genotypes from those available in that soil. This was demonstrated at the genome-wide level, but also for specific genes. Accelerated mesocosm studies with successive plant cultures provided additional evidence on this plant selection and on the nature of the genotypes selected. Finally, representatives from the major rhizobial genotypes isolated from these plants allowed characterization of the size and nature of the respective pangenome and specific genome compartments. These were compared to the different genotypes ?symbiotic and non-symbiotic?present in rhizobial populations isolated directly from the soil without plant intervention.
Resumo:
Rhizobium leguminosarum bv viciae (Rlv) is a soil bacterium able to establish specific root-nodule symbioses with legumes of four different genera: Pisum, Vicia, Lens and Lathyrus. Rlv isolates from nodules of any of these legumes can nodulate any of them; however, it has been shown that plants select specific rhizobial genotypes from those present in the soil (1,2). We have previously shown this at the genomic level by following a population genomics approach. Pool genomic sequences from 100 isolates from each of four plant species: P. sativum, L. culinaris, V. faba and V. sativa, show different, specific profiles at the single nucleotide polymorphism (SNP) level for relevant genes. In this work, the extent of Rlv selection from a well-characterized soil population by different legume plant hosts: P. sativum, L. culinaris, V. faba and V. sativa, after a medium-term mesocosm study is described. Direct soil isolates from each of these mesocosm studies have been tested for specific rhizobial genes (glnII and fnrN) and symbiotic genes (nodC and nifH). Different populations were characterized further by Sanger sequencing of both the rpoB phylogenetic marker gene and the symbiotic genes nodC and nifH. The distribution and size of the rhizobial population for each legume host showed changes during the medium-term mesocosm study. Particularly, a non-symbiotic group of rhizobia was enriched by all four hosts, in contrast to the symbiotic rhizobia profile, which was specific for each legume plant host.
Resumo:
Rhizobium leguminosarum bv viciae (Rlv) is a bacterium able to establish effective symbioses with four different legume genera: Pisum, Lens, Lathyrus and Vicia. Classic studies using trap plants have previously shown that, given a choice, different plants prefer specific genotypes of rhizobia, which are adapted to the host (1, 2). In previous work we have performed a Pool-Seq analysis bases on pooled DNA samples from Rlv nodule isolates obtained from Pisum sativum, Lens culinaris, Vicia fava and V. sativa plants, used as rhizobial traps. This experiment allowed us to test the host preference hypothesis: different plant hosts select specific sub-populations of rhizobia from the available population present in a given soil. We have observed that plant-selected sub-populations are different at the single nucleotide polymorphism (SNP) level. We have selected individual isolates from each sub-population (9 fava-bean isolates, 14 pea isolates 9 vetch isolates and 9 lentil isolates) and sequenced their genomes at draft level (ca. 30x, 90 contigs). Genomic analyses have been carried out using J-species and CMG-Biotools. All the isolates had similar genome size (7.5 Mb) and number of genes (7,300). The resulting Average Nucleotide Identity (ANIm) tree showed that Rhizobium leguminosarum bv viciae is a highly diverse group. Each plant-selected subpopulation showed a closed pangenome and core genomes of similar size (11,500 and 4,800 genes, respectively). The addition of all four sub-population results in a larger, closed pangenome of 19,040 genes and a core genome of similar size (4,392 genes). Each sub-population contains a characteristic set of genes but no universal, plant-specific genes were found. The core genome obtained from all four sub-populations is probably a representative core genome for Rhizobium leguminosarum, given that the reference genome (Rhizobium leguminosarum bv. viciae strain 3841) contains most of the core genome. We have also analyzed the symbiotic cluster (nod), and different nod cluster genotypes were found in each sub-population. Supported by MINECO (Consolider-Ingenio 2010, MICROGEN Project, CSD2009-00006).