969 resultados para Databases - Duplicate tuples


Relevância:

10.00% 10.00%

Publicador:

Resumo:

This study presents a first attempt to extend the “Multi-scale integrated analysis of societal and ecosystem metabolism (MuSIASEM)” approach to a spatial dimension using GIS techniques in the Metropolitan area of Barcelona. We use a combination of census and commercial databases along with a detailed land cover map to create a layer of Common Geographic Units that we populate with the local values of human time spent in different activities according to MuSIASEM hierarchical typology. In this way, we mapped the hours of available human time, in regards to the working hours spent in different locations, putting in evidence the gradients in spatial density between the residential location of workers (generating the work supply) and the places where the working hours are actually taking place. We found a strong three-modal pattern of clumps of areas with different combinations of values of time spent on household activities and on paid work. We also measured and mapped spatial segregation between these two activities and put forward the conjecture that this segregation increases with higher energy throughput, as the size of the functional units must be able to cope with the flow of exosomatic energy. Finally, we discuss the effectiveness of the approach by comparing our geographic representation of exosomatic throughput to the one issued from conventional methods.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

En termes de temps d'execució i ús de dades, les aplicacions paral·leles/distribuïdes poden tenir execucions variables, fins i tot quan s'empra el mateix conjunt de dades d'entrada. Existeixen certs aspectes de rendiment relacionats amb l'entorn que poden afectar dinàmicament el comportament de l'aplicació, tals com: la capacitat de la memòria, latència de la xarxa, el nombre de nodes, l'heterogeneïtat dels nodes, entre d'altres. És important considerar que l'aplicació pot executar-se en diferents configuracions de maquinari i el desenvolupador d'aplicacions no port garantir que els ajustaments de rendiment per a un sistema en particular continuïn essent vàlids per a d'altres configuracions. L'anàlisi dinàmica de les aplicacions ha demostrat ser el millor enfocament per a l'anàlisi del rendiment per dues raons principals. En primer lloc, ofereix una solució molt còmoda des del punt de vista dels desenvolupadors mentre que aquests dissenyen i evaluen les seves aplicacions paral·leles. En segon lloc, perquè s'adapta millor a l'aplicació durant l'execució. Aquest enfocament no requereix la intervenció de desenvolupadors o fins i tot l'accés al codi font de l'aplicació. S'analitza l'aplicació en temps real d'execució i es considra i analitza la recerca dels possibles colls d'ampolla i optimitzacions. Per a optimitzar l'execució de l'aplicació bioinformàtica mpiBLAST, vam analitzar el seu comportament per a identificar els paràmetres que intervenen en el rendiment d'ella, com ara: l'ús de la memòria, l'ús de la xarxa, patrons d'E/S, el sistema de fitxers emprat, l'arquitectura del processador, la grandària de la base de dades biològica, la grandària de la seqüència de consulta, la distribució de les seqüències dintre d'elles, el nombre de fragments de la base de dades i/o la granularitat dels treballs assignats a cada procés. El nostre objectiu és determinar quins d'aquests paràmetres tenen major impacte en el rendiment de les aplicacions i com ajustar-los dinàmicament per a millorar el rendiment de l'aplicació. Analitzant el rendiment de l'aplicació mpiBLAST hem trobat un conjunt de dades que identifiquen cert nivell de serial·lització dintre l'execució. Reconeixent l'impacte de la caracterització de les seqüències dintre de les diferents bases de dades i una relació entre la capacitat dels workers i la granularitat de la càrrega de treball actual, aquestes podrien ser sintonitzades dinàmicament. Altres millores també inclouen optimitzacions relacionades amb el sistema de fitxers paral·lel i la possibilitat d'execució en múltiples multinucli. La grandària de gra de treball està influenciat per factors com el tipus de base de dades, la grandària de la base de dades, i la relació entre grandària de la càrrega de treball i la capacitat dels treballadors.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Microarray transcript profiling and RNA interference are two new technologies crucial for large-scale gene function studies in multicellular eukaryotes. Both rely on sequence-specific hybridization between complementary nucleic acid strands, inciting us to create a collection of gene-specific sequence tags (GSTs) representing at least 21,500 Arabidopsis genes and which are compatible with both approaches. The GSTs were carefully selected to ensure that each of them shared no significant similarity with any other region in the Arabidopsis genome. They were synthesized by PCR amplification from genomic DNA. Spotted microarrays fabricated from the GSTs show good dynamic range, specificity, and sensitivity in transcript profiling experiments. The GSTs have also been transferred to bacterial plasmid vectors via recombinational cloning protocols. These cloned GSTs constitute the ideal starting point for a variety of functional approaches, including reverse genetics. We have subcloned GSTs on a large scale into vectors designed for gene silencing in plant cells. We show that in planta expression of GST hairpin RNA results in the expected phenotypes in silenced Arabidopsis lines. These versatile GST resources provide novel and powerful tools for functional genomics.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Type 2 diabetes mellitus (T2DM) is a major disease affecting nearly 280 million people worldwide. Whilst the pathophysiological mechanisms leading to disease are poorly understood, dysfunction of the insulin-producing pancreatic beta-cells is key event for disease development. Monitoring the gene expression profiles of pancreatic beta-cells under several genetic or chemical perturbations has shed light on genes and pathways involved in T2DM. The EuroDia database has been established to build a unique collection of gene expression measurements performed on beta-cells of three organisms, namely human, mouse and rat. The Gene Expression Data Analysis Interface (GEDAI) has been developed to support this database. The quality of each dataset is assessed by a series of quality control procedures to detect putative hybridization outliers. The system integrates a web interface to several standard analysis functions from R/Bioconductor to identify differentially expressed genes and pathways. It also allows the combination of multiple experiments performed on different array platforms of the same technology. The design of this system enables each user to rapidly design a custom analysis pipeline and thus produce their own list of genes and pathways. Raw and normalized data can be downloaded for each experiment. The flexible engine of this database (GEDAI) is currently used to handle gene expression data from several laboratory-run projects dealing with different organisms and platforms. Database URL: http://eurodia.vital-it.ch.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

MOTIVATION: Microarray results accumulated in public repositories are widely reused in meta-analytical studies and secondary databases. The quality of the data obtained with this technology varies from experiment to experiment, and an efficient method for quality assessment is necessary to ensure their reliability. RESULTS: The lack of a good benchmark has hampered evaluation of existing methods for quality control. In this study, we propose a new independent quality metric that is based on evolutionary conservation of expression profiles. We show, using 11 large organ-specific datasets, that IQRray, a new quality metrics developed by us, exhibits the highest correlation with this reference metric, among 14 metrics tested. IQRray outperforms other methods in identification of poor quality arrays in datasets composed of arrays from many independent experiments. In contrast, the performance of methods designed for detecting outliers in a single experiment like Normalized Unscaled Standard Error and Relative Log Expression was low because of the inability of these methods to detect datasets containing only low-quality arrays and because the scores cannot be directly compared between experiments. AVAILABILITY AND IMPLEMENTATION: The R implementation of IQRray is available at: ftp://lausanne.isb-sib.ch/pub/databases/Bgee/general/IQRray.R. CONTACT: Marta.Rosikiewicz@unil.ch SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Consider a model with parameter phi, and an auxiliary model with parameter theta. Let phi be a randomly sampled from a given density over the known parameter space. Monte Carlo methods can be used to draw simulated data and compute the corresponding estimate of theta, say theta_tilde. A large set of tuples (phi, theta_tilde) can be generated in this manner. Nonparametric methods may be use to fit the function E(phi|theta_tilde=a), using these tuples. It is proposed to estimate phi using the fitted E(phi|theta_tilde=theta_hat), where theta_hat is the auxiliary estimate, using the real sample data. This is a consistent and asymptotically normally distributed estimator, under certain assumptions. Monte Carlo results for dynamic panel data and vector autoregressions show that this estimator can have very attractive small sample properties. Confidence intervals can be constructed using the quantiles of the phi for which theta_tilde is close to theta_hat. Such confidence intervals are found to have very accurate coverage.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

The International Molecular Exchange (IMEx) consortium is an international collaboration between major public interaction data providers to share literature-curation efforts and make a nonredundant set of protein interactions available in a single search interface on a common website (http://www.imexconsortium.org/). Common curation rules have been developed, and a central registry is used to manage the selection of articles to enter into the dataset. We discuss the advantages of such a service to the user, our quality-control measures and our data-distribution practices.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

The usual way to investigate the statistical properties of finitely generated subgroups of free groups, and of finite presentations of groups, is based on the so-called word-based distribution: subgroups are generated (finite presentations are determined) by randomly chosen k-tuples of reduced words, whose maximal length is allowed to tend to infinity. In this paper we adopt a different, though equally natural point of view: we investigate the statistical properties of the same objects, but with respect to the so-called graph-based distribution, recently introduced by Bassino, Nicaud and Weil. Here, subgroups (and finite presentations) are determined by randomly chosen Stallings graphs whose number of vertices tends to infinity. Our results show that these two distributions behave quite differently from each other, shedding a new light on which properties of finitely generated subgroups can be considered frequent or rare. For example, we show that malnormal subgroups of a free group are negligible in the raph-based distribution, while they are exponentially generic in the word-based distribution. Quite surprisingly, a random finite presentation generically presents the trivial group in this new distribution, while in the classical one it is known to generically present an infinite hyperbolic group.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Eosinophilic esophagitis (EoE) is a clinicopathologic condition of increasing recognition and prevalence. In 2007, a consensus recommendation provided clinical and histopathologic guidance for the diagnosis and treatment of EoE; however, only a minority of physicians use the 2007 guidelines, which require fulfillment of both histologic and clinical features. Since 2007, the number of EoE publications has doubled, providing new disease insight. Accordingly, a panel of 33 physicians with expertise in pediatric and adult allergy/immunology, gastroenterology, and pathology conducted a systematic review of the EoE literature (since September 2006) using electronic databases. Based on the literature review and expertise of the panel, information and recommendations were provided in each of the following areas of EoE: diagnostics, genetics, allergy testing, therapeutics, and disease complications. Because accumulating animal and human data have provided evidence that EoE appears to be an antigen-driven immunologic process that involves multiple pathogenic pathways, a new conceptual definition is proposed highlighting that EoE represents a chronic, immune/antigen-mediated disease characterized clinically by symptoms related to esophageal dysfunction and histologically by eosinophil-predominant inflammation. The diagnostic guidelines continue to define EoE as an isolated chronic disorder of the esophagus diagnosed by the need of both clinical and pathologic features. Patients commonly have high rates of concurrent allergic diatheses, especially food sensitization, compared with the general population. Proved therapeutic options include chronic dietary elimination, topical corticosteroids, and esophageal dilation. Important additions since 2007 include genetic underpinnings that implicate EoE susceptibility caused by polymorphisms in the thymic stromal lymphopoietin protein gene and the description of a new potential disease phenotype, proton pump inhibitor-responsive esophageal eosinophila. Further advances and controversies regarding diagnostic methods, surrogate disease markers, allergy testing, and treatment approaches are discussed.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

The recent advances in sequencing technologies have given all microbiology laboratories access to whole genome sequencing. Providing that tools for the automated analysis of sequence data and databases for associated meta-data are developed, whole genome sequencing will become a routine tool for large clinical microbiology laboratories. Indeed, the continuing reduction in sequencing costs and the shortening of the 'time to result' makes it an attractive strategy in both research and diagnostics. Here, we review how high-throughput sequencing is revolutionizing clinical microbiology and the promise that it still holds. We discuss major applications, which include: (i) identification of target DNA sequences and antigens to rapidly develop diagnostic tools; (ii) precise strain identification for epidemiological typing and pathogen monitoring during outbreaks; and (iii) investigation of strain properties, such as the presence of antibiotic resistance or virulence factors. In addition, recent developments in comparative metagenomics and single-cell sequencing offer the prospect of a better understanding of complex microbial communities at the global and individual levels, providing a new perspective for understanding host-pathogen interactions. Being a high-resolution tool, high-throughput sequencing will increasingly influence diagnostics, epidemiology, risk management, and patient care.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

The paper discusses the utilization of new techniques ot select processes for protein recovery, separation and purification. It describesa rational approach that uses fundamental databases of proteins molecules to simplify the complex problem of choosing high resolution separation methods for multi component mixtures. It examines the role of modern computer techniques to help solving these questions.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Issues. Numerous studies have reported that brief interventions delivered in primary care are effective in reducing excessive drinking. However, much of this work has been criticised for being clinically unrepresentative. This review aimed to assess the effectiveness of brief interventions in primary care and determine if outcomes differ between efficacy and effectiveness trials. Approach. A pre-specified search strategy was used to search all relevant electronic databases up to 2006. We also hand-searched the reference lists of key articles and reviews. We included randomised controlled trials (RCT) involving patients in primary care who were not seeking alcohol treatment and who received brief intervention. Two authors independently abstracted data and assessed trial quality. Random effects meta-analyses, subgroup and sensitivity analyses and meta-regression were conducted. Key Findings. The primary meta-analysis included 22 RCT and evaluated outcomes in over 5800 patients. At 1 year follow up, patients receiving brief intervention had a significant reduction in alcohol consumption compared with controls [mean difference: -38 g week(-1), 95%CI (confidence interval): -54 to -23], although there was substantial heterogeneity between trials (I(2) = 57%). Subgroup analysis confirmed the benefit of brief intervention in men but not in women. Extended intervention was associated with a non-significantly increased reduction in alcohol consumption compared with brief intervention. There was no significant difference in effect sizes for efficacy and effectiveness trials. Conclusions. Brief interventions can reduce alcohol consumption in men, with benefit at a year after intervention, but they are unproven in women for whom there is insufficient research data. Longer counselling has little additional effect over brief intervention. The lack of differences in outcomes between efficacy and effectiveness trials suggests that the current literature is relevant to routine primary care.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

El Consorci de Biblioteques Universitàries de Catalunya (CBUC) va ser creat el 1996 amb l'objectiu de fer i mantenir el catàleg col·lectiu de les universitats de Catalunya (CCUC) però aviat va ampliar les seves activitats amb el préstec interbibliotecari i les compres conjuntes d'informació electrònica. Aquesta darrera activitat va iniciar-se a finals de 1997 quan el CBUC va presentar als vicerectors de recerca de les universitats públiques de Catalunya el projecte de comprar bases de dades de manera consorciada. Aquests van estar-hi d'acord i van manifestar el seu interès de que en les compres conjuntes també s'incloguessin revistes electròniques. El CBUC va decidir englobar aquestes activitats sota el nom Biblioteca Digital de Catalunya (BDC) la qual naixia amb la "finalitat de proporcionar un conjunt nuclear comú d'informació electrònica per a la totalitat dels usuaris de les biblioteques del CBUC". A finals de 1998 el projecte de la BDC es va presentar a la Generalitat de Catalunya i es va obtenir un finançament per al projecte que cobria el període 1999-2001. Des de llavors la BDC ha passat per almenys tres fases: Una de formació, 1999-2001, que es va iniciar amb un ajut del llavors Departament d'Universitats, Recerca i Societat de la Informació (DURSI) de la Generalitat de Catalunya, ajut que es va traduir en una inversió de 180.000€/any i que va permetre l'inici de subscripcions conjuntes, principalment bases de dades. Una de creixement, 2002-2004, realitzada a partir d'un increment de l'ajut del DURSI, ajut que s'usa com a "capital llavor" per subscriure de forma especial revistes. En aquest moment la BDC s'amplia a universitats no membres del CBUC. Una d'estabilització, 2005-2009, en la que s'han fet algunes compres per a una part de les universitats (i no per a totes com fins llavors) i s'han iniciat alguns intents d'estendre la BDC a altres institucions de recerca. L'article caracteritza les diferents fases i mostra les causes de la seva evolució. Finalment, s'exposen els principals assoliments de la BDC així com els reptes de futur més immediats.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

The first scientific meeting of the newly established European SYSGENET network took place at the Helmholtz Centre for Infection Research (HZI) in Braunschweig, April 7-9, 2010. About 50 researchers working in the field of systems genetics using mouse genetic reference populations (GRP) participated in the meeting and exchanged their results, phenotyping approaches, and data analysis tools for studying systems genetics. In addition, the future of GRP resources and phenotyping in Europe was discussed.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

BACKGROUND: Superinfection with drug resistant HIV strains could potentially contribute to compromised therapy in patients initially infected with drug-sensitive virus and receiving antiretroviral therapy. To investigate the importance of this potential route to drug resistance, we developed a bioinformatics pipeline to detect superinfection from routinely collected genotyping data, and assessed whether superinfection contributed to increased drug resistance in a large European cohort of viremic, drug treated patients. METHODS: We used sequence data from routine genotypic tests spanning the protease and partial reverse transcriptase regions in the Virolab and EuResist databases that collated data from five European countries. Superinfection was indicated when sequences of a patient failed to cluster together in phylogenetic trees constructed with selected sets of control sequences. A subset of the indicated cases was validated by re-sequencing pol and env regions from the original samples. RESULTS: 4425 patients had at least two sequences in the database, with a total of 13816 distinct sequence entries (of which 86% belonged to subtype B). We identified 107 patients with phylogenetic evidence for superinfection. In 14 of these cases, we analyzed newly amplified sequences from the original samples for validation purposes: only 2 cases were verified as superinfections in the repeated analyses, the other 12 cases turned out to involve sample or sequence misidentification. Resistance to drugs used at the time of strain replacement did not change in these two patients. A third case could not be validated by re-sequencing, but was supported as superinfection by an intermediate sequence with high degenerate base pair count within the time frame of strain switching. Drug resistance increased in this single patient. CONCLUSIONS: Routine genotyping data are informative for the detection of HIV superinfection; however, most cases of non-monophyletic clustering in patient phylogenies arise from sample or sequence mix-up rather than from superinfection, which emphasizes the importance of validation. Non-transient superinfection was rare in our mainly treatment experienced cohort, and we found a single case of possible transmitted drug resistance by this route. We therefore conclude that in our large cohort, superinfection with drug resistant HIV did not compromise the efficiency of antiretroviral treatment.