58 results for Large-scale experiments
Abstract:
Expression data contribute significantly to the biological value of the sequenced human genome, providing extensive information about gene structure and the pattern of gene expression. ESTs, together with SAGE libraries and microarray experiment information, provide a broad and rich view of the transcriptome. However, it is difficult to perform large-scale expression mining of the data generated by these diverse experimental approaches. Not only are the data stored in disparate locations, but there is frequent ambiguity in the meaning of terms used to describe the source of the material used in the experiment. Untangling semantic differences between the data provided by different resources is therefore largely reliant on the domain knowledge of a human expert. We present here eVOC, a system which associates labelled target cDNAs for microarray experiments, or cDNA libraries and their associated transcripts, with controlled terms in a set of hierarchical vocabularies. eVOC consists of four orthogonal controlled vocabularies suitable for describing the domains of human gene expression data: Anatomical System, Cell Type, Pathology and Developmental Stage. We have curated and annotated 7016 cDNA libraries represented in dbEST, as well as 104 SAGE libraries, with expression information, and provide this as an integrated, public resource that allows the linking of transcripts and libraries with expression terms. Both the vocabularies and the vocabulary-annotated libraries can be retrieved from http://www.sanbi.ac.za/evoc/. Several groups are involved in developing this resource with the aim of unifying transcript expression information.
Abstract:
We present MBIS (Multivariate Bayesian Image Segmentation tool), a clustering tool based on the mixture of multivariate normal distributions model. MBIS supports multichannel bias field correction based on a B-spline model. A second methodological novelty is the inclusion of graph-cuts optimization for the stationary anisotropic hidden Markov random field model. Along with MBIS, we release an evaluation framework that contains three different experiments on multi-site data. We first validate the accuracy of segmentation and the estimated bias field for each channel. MBIS outperforms a widely used segmentation tool in a cross-comparison evaluation. The second experiment demonstrates the robustness of results on atlas-free segmentation of two image sets from scan-rescan protocols on 21 healthy subjects. Multivariate segmentation is more replicable than the monospectral counterpart on T1-weighted images. Finally, we provide a third experiment to illustrate how MBIS can be used in a large-scale study of tissue volume change with increasing age in 584 healthy subjects. This last result is meaningful as multivariate segmentation performs robustly without the need for prior knowledge.
Abstract:
Each year, approximately five million people die worldwide from putatively vaccine-preventable mucosally transmitted diseases. With respect to mass vaccination campaigns, one strategy to cope with this formidable challenge is aerosol vaccine delivery, which offers potential safety, logistical, and cost-saving advantages over traditional vaccination routes. Additionally, aerosol vaccination may elicit pivotal mucosal immune responses that could contain or eliminate mucosally transmitted pathogens in a preventative or therapeutic vaccine context. In this current preclinical non-human primate investigation, we demonstrate the feasibility of aerosol vaccination with the recombinant poxvirus-based vaccine vectors NYVAC and MVA. Real-time in vivo scintigraphy experiments with radiolabeled, aerosol-administered NYVAC-C (Clade C, HIV-1 vaccine) and MVA-HPV vaccines revealed consistent mucosal delivery to the respiratory tract. Furthermore, aerosol delivery of the vaccines was safe, inducing no vaccine-associated pathology, in particular in the brain and lungs, and was immunogenic. Administration of a DNA-C/NYVAC-C prime/boost regime resulted in both systemic and anal-genital HIV-specific immune responses that were still detectable 5 months after immunization. Thus, aerosol vaccination with NYVAC and MVA vectored vaccines constitutes a tool for large-scale vaccine efforts against mucosally transmitted pathogens.
Abstract:
Soil pollution with hexachlorocyclohexane (HCH) has caused serious environmental problems. Here we describe the targeted degradation of all HCH isomers by applying the aerobic bacterium Sphingobium indicum B90A. In particular, we examined possibilities for large-scale cultivation of strain B90A, tested immobilization, storage and inoculation procedures, and determined the survival and HCH-degradation activity of inoculated cells in soil. Optimal growth of strain B90A was achieved in glucose-containing mineral medium, and up to 65% culturability could be maintained after 60 days of storage at 30 °C by mixing cells with sterile dry corncob powder. B90A biomass produced in water supplemented with sugarcane molasses and immobilized on corncob powder retained 15-20% culturability after 30 days of storage at 30 °C, whereas full culturability was maintained when cells were stored frozen at -20 °C. In contrast, cells stored on corncob degraded gamma-HCH faster than those that had been stored frozen, with between 15 and 85% of gamma-HCH disappearing in microcosms within 20 h at 30 °C. Soil microcosm tests at 25 °C confirmed complete mineralization of [(14)C]-gamma-HCH by corncob-immobilized strain B90A. Experiments conducted in small pits and at an HCH-contaminated agricultural site resulted in between 85 and 95% HCH degradation by strain B90A applied via corncob, depending on the type of HCH isomer and even at residual HCH concentrations. Up to 20% of the inoculated B90A cells survived under field conditions after 8 days and could be traced among other soil microorganisms by a combination of natural antibiotic resistance properties, unique pigmentation and PCR amplification of the linA genes. Neither the addition of corncob nor that of corncob-immobilized B90A measurably changed the microbial community structure, as determined by T-RFLP analysis. 
Overall, these results indicate that on-site aerobic bioremediation of HCH exploiting the biodegradation activity of S. indicum B90A cells stored on corncob powder is a promising technology.
Abstract:
With the dramatic increase in the volume of experimental results in every domain of life sciences, assembling pertinent data and combining information from different fields has become a challenge. Information is dispersed over numerous specialized databases and is presented in many different formats. Rapid access to experiment-based information about well-characterized proteins helps predict the function of uncharacterized proteins identified by large-scale sequencing. In this context, universal knowledgebases play essential roles in providing access to data from complementary types of experiments and serving as hubs with cross-references to many specialized databases. This review outlines how the value of experimental data is optimized by combining high-quality protein sequences with complementary experimental results, including information derived from protein 3D-structures, using as an example the UniProt knowledgebase (UniProtKB) and the tools and links provided on its website (http://www.uniprot.org/). It also discusses the precautions that are necessary for successful predictions and extrapolations.
Abstract:
BACKGROUND: Pseudogenes have long been considered as nonfunctional genomic sequences. However, recent evidence suggests that many of them might have some form of biological activity, and the possibility of functionality has increased interest in their accurate annotation and integration with functional genomics data. RESULTS: As part of the GENCODE annotation of the human genome, we present the first genome-wide pseudogene assignment for protein-coding genes, based on both large-scale manual annotation and in silico pipelines. A key aspect of this coupled approach is that it allows us to identify pseudogenes in an unbiased fashion as well as untangle complex events through manual evaluation. We integrate the pseudogene annotations with the extensive ENCODE functional genomics information. In particular, we determine the expression level, transcription-factor and RNA polymerase II binding, and chromatin marks associated with each pseudogene. Based on their distribution, we develop simple statistical models for each type of activity, which we validate with large-scale RT-PCR-Seq experiments. Finally, we compare our pseudogenes with conservation and variation data from primate alignments and the 1000 Genomes project, producing lists of pseudogenes potentially under selection. CONCLUSIONS: At one extreme, some pseudogenes possess conventional characteristics of functionality; these may represent genes that have recently died. On the other hand, we find interesting patterns of partial activity, which may suggest that dead genes are being resurrected as functioning non-coding RNAs. The activity data of each pseudogene are stored in an associated resource, psiDR, which will be useful for the initial identification of potentially functional pseudogenes.
Abstract:
Structural and regulatory genes involved in the synthesis of antimicrobial metabolites are essential for the biocontrol activity of fluorescent pseudomonads and, in principle, amenable to genetic engineering for strain improvement. An eventual large-scale release of such bacteria raises the question of whether such genes also contribute to the persistence and dissemination of the bacteria in soil ecosystems. Pseudomonas fluorescens wild-type strain CHA0 protects plants against a variety of fungal diseases and produces several antimicrobial metabolites. The regulatory gene gacA globally controls antibiotic production and is crucial for disease suppression in CHA0. This gene also regulates the production of extracellular protease and phospholipase. The contribution of gacA to survival and vertical translocation of CHA0 in soil microcosms of increasing complexity was studied in coinoculation experiments with the wild type and a gacA mutant which lacks antibiotics and some exoenzymes. Both strains were marked with spontaneous resistance to rifampin. In a closed system with sterile soil, strain CHA0 and the gacA mutant multiplied for several weeks, whereas these strains declined exponentially in nonsterile soil of different Swiss origins. The gacA mutant was less persistent in nonrhizosphere raw soil than was the wild type, but no competitive disadvantage when colonizing the rhizosphere and roots of wheat was found in the particular soil type and during the period studied. Vertical translocation was assessed after strains had been applied to undisturbed, long (60-cm) or short (20-cm) soil columns, both planted with wheat. A smaller number of cells of the gacA mutant than of the wild type were detected in the percolated water and in different depths of the soil column. Single-strain inoculation gave similar results in all microcosms tested. We conclude that mutation in a single regulatory gene involved in antibiotic and exoenzyme synthesis can affect the survival of P. fluorescens more profoundly in unplanted soil than in the rhizosphere.
Abstract:
BACKGROUND: The annotation of protein post-translational modifications (PTMs) is an important task of UniProtKB curators and, with continuing improvements in experimental methodology, an ever greater number of articles are being published on this topic. To help curators cope with this growing body of information we have developed a system which extracts information from the scientific literature for the most frequently annotated PTMs in UniProtKB. RESULTS: The procedure uses a pattern-matching and rule-based approach to extract sentences with information on the type and site of modification. A ranked list of protein candidates for the modification is also provided. For PTM extraction, precision varies from 57% to 94%, and recall from 75% to 95%, according to the type of modification. The procedure was used to track new publications on PTMs and to recover potential supporting evidence for phosphorylation sites annotated based on the results of large-scale proteomics experiments. CONCLUSIONS: The information retrieval and extraction method we have developed in this study forms the basis of a simple tool for the manual curation of protein post-translational modifications in UniProtKB/Swiss-Prot. Our work demonstrates that even simple text-mining tools can be effectively adapted for database curation tasks, provided that a thorough understanding of the working process and requirements is first obtained. This system can be accessed at http://eagl.unige.ch/PTM/.
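As an illustration of the kind of pattern-matching extraction and precision/recall evaluation this abstract describes, the following is a minimal sketch. The regular expression, the keyword list, and the function names are assumptions made for illustration only; they are not the rules or code of the published system.

```python
import re

# Hypothetical pattern (not the published rule set): a modification keyword
# followed, within the same sentence, by "at/on/of" and a residue with a position,
# e.g. "phosphorylated at Ser-15" or "Acetylation of Lys382".
PTM_PATTERN = re.compile(
    r"\b(?P<ptm>phosphorylat\w+|acetylat\w+|methylat\w+|ubiquitinat\w+)\b"
    r"[^.]*?\b(?:at|on|of)\s+"
    r"(?P<residue>Ser|Thr|Tyr|Lys|Arg)[- ]?(?P<position>\d+)",
    re.IGNORECASE,
)


def extract_ptm_sentences(text):
    """Return (ptm keyword, residue, position, sentence) for each matching sentence."""
    sentences = re.split(r"(?<=[.!?])\s+", text)
    hits = []
    for sentence in sentences:
        match = PTM_PATTERN.search(sentence)
        if match:
            hits.append((
                match.group("ptm").lower(),
                match.group("residue").capitalize(),
                int(match.group("position")),
                sentence,
            ))
    return hits


def precision_recall(predicted, gold):
    """Precision = TP / predicted, recall = TP / gold, computed over sets of sites."""
    predicted, gold = set(predicted), set(gold)
    true_positives = len(predicted & gold)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    return precision, recall
```

A real curation aid would need per-modification rules and protein name recognition; the sketch only shows how sentence-level matches and the two reported metrics relate to each other.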
Abstract:
Simulated-annealing-based conditional simulations provide a flexible means of quantitatively integrating diverse types of subsurface data. Although such techniques are being increasingly used in hydrocarbon reservoir characterization studies, their potential in environmental, engineering and hydrological investigations is still largely unexploited. Here, we introduce a novel simulated annealing (SA) algorithm geared towards the integration of high-resolution geophysical and hydrological data which, compared to more conventional approaches, provides significant advancements in the way that large-scale structural information in the geophysical data is accounted for. Model perturbations in the annealing procedure are made by drawing from a probability distribution for the target parameter conditioned to the geophysical data. This is the only place where geophysical information is utilized in our algorithm, which is in marked contrast to other approaches where model perturbations are made through the swapping of values in the simulation grid and agreement with soft data is enforced through a correlation coefficient constraint. Another major feature of our algorithm is the way in which available geostatistical information is utilized. Instead of constraining realizations to match a parametric target covariance model over a wide range of spatial lags, we constrain the realizations only at smaller lags where the available geophysical data cannot provide enough information. Thus we allow the larger-scale subsurface features resolved by the geophysical data to have much greater control on the output realizations. Further, since the only component of the SA objective function required in our approach is a covariance constraint at small lags, our method has improved convergence and computational efficiency over more traditional methods. 
Here, we present the results of applying our algorithm to the integration of porosity log and tomographic crosshole georadar data to generate stochastic realizations of the local-scale porosity structure. Our procedure is first tested on a synthetic data set, and then applied to data collected at the Boise Hydrogeophysical Research Site.
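The two ideas highlighted in this abstract (perturbations drawn from a distribution conditioned to the geophysical data, and an objective function that constrains covariance only at small lags) can be sketched in one dimension as follows. This is a toy sketch under stated assumptions, not the authors' algorithm: the Gaussian conditional distribution, the cooling schedule, and all names (`anneal`, `cond_mean`, `cond_std`, `target_cov`, `hard_data`) are hypothetical.

```python
import math
import random


def small_lag_covariance(values, max_lag):
    """Experimental covariance of a 1D series for lags 1..max_lag."""
    n = len(values)
    mean = sum(values) / n
    return [
        sum((values[i] - mean) * (values[i + h] - mean) for i in range(n - h)) / (n - h)
        for h in range(1, max_lag + 1)
    ]


def objective(values, target_cov):
    """Squared mismatch to the target covariance, evaluated at small lags only."""
    cov = small_lag_covariance(values, len(target_cov))
    return sum((c - t) ** 2 for c, t in zip(cov, target_cov))


def anneal(cond_mean, cond_std, target_cov, hard_data,
           n_iter=5000, t0=1.0, cooling=0.999, seed=0):
    """Return (best realization, initial objective, best objective).

    cond_mean[i] and cond_std stand in for a distribution of the target
    parameter conditioned to geophysical data (assumed Gaussian here);
    hard_data maps cell index -> measured value (e.g. a porosity log).
    """
    rng = random.Random(seed)
    model = [rng.gauss(m, cond_std) for m in cond_mean]
    for i, v in hard_data.items():
        model[i] = v  # honor the hard data exactly
    free = [i for i in range(len(model)) if i not in hard_data]
    current = objective(model, target_cov)
    initial = current
    best, best_obj = list(model), current
    temp = t0
    for _ in range(n_iter):
        i = rng.choice(free)
        old = model[i]
        # Perturbation: redraw the cell from the conditional distribution.
        model[i] = rng.gauss(cond_mean[i], cond_std)
        new = objective(model, target_cov)
        if new <= current or rng.random() < math.exp((current - new) / temp):
            current = new
            if new < best_obj:
                best, best_obj = list(model), new
        else:
            model[i] = old  # reject the move
        temp *= cooling
    return best, initial, best_obj
```

Because the large-scale trend enters only through `cond_mean`, the objective needs no long-lag covariance terms, which is the design choice the abstract credits for the improved efficiency.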
Abstract:
Advancements in high-throughput technologies to measure increasingly complex biological phenomena at the genomic level are rapidly changing the face of biological research from the single-gene single-protein experimental approach to studying the behavior of a gene in the context of the entire genome (and proteome). This shift in research methodologies has resulted in a new field of network biology that deals with modeling cellular behavior in terms of network structures such as signaling pathways and gene regulatory networks. In these networks, different biological entities such as genes, proteins, and metabolites interact with each other, giving rise to a dynamical system. Even though there exists a mature field of dynamical systems theory to model such network structures, some technical challenges are unique to biology, such as the inability to measure precise kinetic information on gene-gene or gene-protein interactions and the need to model increasingly large networks comprising thousands of nodes. These challenges have renewed interest in developing new computational techniques for modeling complex biological systems. This chapter presents a modeling framework based on Boolean algebra and finite-state machines, reminiscent of the approach used for digital circuit synthesis and simulation in the field of very-large-scale integration (VLSI). The proposed formalism provides a common mathematical framework for developing computational techniques that model different aspects of regulatory networks, such as steady-state behavior, stochasticity, and gene perturbation experiments.
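To make the Boolean, finite-state view concrete, here is a minimal sketch of a synchronous Boolean network with attractor detection and an in-silico knock-out. The three-gene network and its rules are hypothetical and chosen only for illustration; they are not taken from the chapter.

```python
from itertools import product

# Hypothetical three-gene network (illustrative rules only).
RULES = {
    "A": lambda s: not s["C"],         # C represses A
    "B": lambda s: s["A"],             # A activates B
    "C": lambda s: s["A"] and s["B"],  # A and B jointly activate C
}


def step(state, rules, knocked_out=()):
    """One synchronous update; knocked-out genes are forced to False."""
    new_state = {gene: bool(rule(state)) for gene, rule in rules.items()}
    for gene in knocked_out:
        new_state[gene] = False
    return new_state


def find_attractors(rules, knocked_out=()):
    """Enumerate all initial states and collect the cycles they fall into."""
    genes = sorted(rules)
    attractors = set()
    for bits in product([False, True], repeat=len(genes)):
        state = dict(zip(genes, bits))
        for gene in knocked_out:
            state[gene] = False
        seen = []
        key = tuple(state[g] for g in genes)
        while key not in seen:
            seen.append(key)
            state = step(state, rules, knocked_out)
            key = tuple(state[g] for g in genes)
        cycle = seen[seen.index(key):]
        # Rotate the cycle to start at its smallest state so duplicates collapse.
        start = cycle.index(min(cycle))
        attractors.add(tuple(cycle[start:] + cycle[:start]))
    return attractors
```

In this toy network the unperturbed dynamics settle into an oscillation, while forcing gene C off leaves a single steady state, which is the kind of question a gene perturbation experiment asks. Exhaustive enumeration scales as 2^n and is only practical for small networks.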
Abstract:
To date, published studies of alluvial bar architecture in large rivers have been restricted mostly to case studies of individual bars and single locations. Relatively little is known about how the depositional processes and sedimentary architecture of kilometre-scale bars vary within a multi-kilometre reach or over several hundreds of kilometres downstream. This study presents Ground Penetrating Radar and core data from 11 kilometre-scale bars from the Rio Parana, Argentina. The investigated bars are located between 30km upstream and 540km downstream of the Rio Parana - Rio Paraguay confluence, where a significant volume of fine-grained suspended sediment is introduced into the network. Bar-scale cross-stratified sets, with lengths and widths up to 600m and thicknesses up to 12m, enable the distinction of large river deposits from stacked deposits of smaller rivers, but are only present in half the surface area of the bars. Up to 90% of bar-scale sets are found on top of finer-grained ripple-laminated bar-trough deposits. Bar-scale sets make up as much as 58% of the volume of the deposits in small, incipient mid-channel bars, but this proportion decreases significantly with increasing age and size of the bars. Contrary to what might be expected, a significant proportion of the sedimentary structures found in the Rio Parana are similar in scale to those found in much smaller rivers. In other words, large river deposits are not always characterized by big structures that allow a simple interpretation of river scale. However, the large scale of the depositional units in big rivers causes small-scale structures, such as ripple sets, to be grouped into thicker cosets, which indicate river scale even when no obvious large-scale sets are present. The results also show that the composition of bars differs between the studied reaches upstream and downstream of the confluence with the Rio Paraguay. 
Relative to other controls on downstream fining, the tributary input of fine-grained suspended material from the Rio Paraguay causes a marked change in the composition of the bar deposits. Compared to the upstream reaches, the sedimentary architecture of the downstream reaches in the top ca 5m of mid-channel bars shows: (i) an increase in the abundance and thickness (up to metre-scale) of laterally extensive (hundreds of metres) fine-grained layers; (ii) an increase in the percentage of deposits comprised of ripple sets (to >40% in the upper bar deposits); and (iii) an increase in bar-trough deposits and a corresponding decrease in bar-scale cross-strata (<10%). The thalweg deposits of the Rio Parana are composed of dune sets, even directly downstream from the Rio Paraguay where the upper channel deposits are dominantly fine-grained. Thus, the change in sedimentary facies due to a tributary point-source of fine-grained sediment is primarily expressed in the composition of the upper bar deposits.
Abstract:
Background: Biological rhythmicity has been extensively studied in animals for many decades. Although temporal patterns of physical activity have been identified in humans, no large-scale, multi-national study has been published, and no comparison has been attempted of the ubiquity of activity rhythms at different time scales (such as daily, weekly, monthly, and annual). Methods: Using individually worn actigraphy devices, physical activity of 2,328 individuals from five different countries (adults of African descent from Ghana, South Africa, Jamaica, Seychelles, and the United States) was measured for seven consecutive days at different times of the year. Results: Analysis for rhythmic patterns identified daily rhythmicity of physical activity in all five of the represented nationalities. Weekly rhythmicity was found in some, but not all, of the nationalities. No significant evidence of lunar rhythmicity or seasonal rhythmicity was found in any of the groups. Conclusions: These findings extend previous small-scale observations of daily rhythmicity to a large cohort of individuals from around the world. The findings also confirm the existence of modest weekly rhythmicity but not lunar or seasonal rhythmicity in human activity. These differences in rhythm strength have implications for the management of health hazards of rhythm misalignment. Key Messages: Analysis of the pattern of physical activity of 2,328 individuals from five countries revealed strong daily rhythmicity in all five countries, moderate weekly rhythmicity in some countries, and no lunar rhythmicity or seasonal rhythmicity in any of the countries.
Abstract:
On a geological time scale, the conditions on Earth are highly variable and biological patterns (for example, the distributions of species) are very dynamic. Understanding the large-scale patterns of variation observed today thus requires a deep understanding of the historical factors that drove their evolution. In this thesis, we reevaluated the evolution and maintenance of a continental color cline observed in the European barn owl (Tyto alba) using population genetic tools. The color cline spans from south-west Europe, where most individuals have pure white underparts, to north and east Europe, where most individuals have rufous-brown underparts. Overall, our results showed that the old scenario, which holds that the color cline evolved by secondary contact of two color morphs (white and rufous) that evolved in allopatry during the last ice age, has to be revised. We collected samples of about 700 barn owls from the Western Palearctic to establish the first population genetic data set for this species. Individuals were genotyped at 22 microsatellite markers, at one mitochondrial gene, and at a candidate color gene. The color of each individual was assessed and its sex determined by molecular methods. We first showed that the genetic variation in Western Europe is very limited compared to the heritable color variation. We found no evidence of different glacial lineages, and showed that selection must be involved in the maintenance of the color cline (chapter 1). Using computer simulations, we demonstrated that the post-glacial colonization of Europe occurred from the Iberian Peninsula and that the color cline could not have evolved by neutral demographic processes during this colonization (chapter 2). 
Finally, we reevaluated the whole history of the establishment of the Western Palearctic variation of the barn owl (chapter 3). This study showed that all Western European barn owls descend from white barn owls from the Middle East that colonized the Iberian Peninsula via North Africa. Following the end of the last ice age (20,000 years ago), these white barn owls colonized Western Europe, and under selection a novel rufous phenotype evolved (during or after the colonization). An important part of the color variation could be explained by a single mutation in the melanocortin-1-receptor (MC1R) gene that appeared during or after the colonization. The colonization of Europe extended as far as Greece, where the rufous birds encountered white ones (which reached Greece from the Middle East over the Bosporus) in a secondary contact zone. Our analyses show that white and rufous barn owls in Greece interbreed only to a limited extent. This suggests that barn owls are on the verge of becoming two species in Greece and demonstrates that European barn owls represent an incipient ring species around the Mediterranean. The revisited history of the establishment of the European barn owl color cline makes this model system remarkable in several respects. It is a very clear example of strong local adaptation that can be achieved despite high gene flow (strong color and MC1R differentiation despite almost no neutral genetic differentiation). It also offers a wonderful model system to study the interactions between colonization processes and selection processes, which have so far been remarkably understudied despite their potentially ubiquitous importance. Finally, it represents a very interesting case in the speciation continuum and calls for further study of the amount of gene flow that occurs between the color morphs in Greece. 
-- On the geological time scale, conditions on Earth are highly variable and biological patterns (such as the distribution of species) are highly dynamic. To understand the patterns that can be observed at large scale today, it is first necessary to understand the historical factors that governed their establishment. In this thesis, we use modern population genetic tools to reevaluate the evolution and maintenance of a continental color cline observed in the European barn owl (Tyto alba). Overall, our results show that the scenario accepted until now, which holds that the color cline evolved from the secondary contact of two color morphs (white and rufous) that had evolved in allopatry during the last glaciations, needs to be revised. To build the first population genetic data set for this species, we collected samples from about 700 barn owls from the Western Palearctic. We genotyped all individuals at 22 microsatellite loci, at one mitochondrial gene, and at another gene involved in color determination. We also measured the color of all individuals and determined their sex genetically. We first showed that neutral genetic variation is negligible compared with the heritable color variation, that only a single European lineage exists, and that selection must be involved in maintaining the color cline (chapter 1). Using computer simulations, we demonstrated that the whole of Western Europe was recolonized from the Iberian Peninsula after the last glaciations and that the color cline cannot have evolved through neutral processes during this colonization (chapter 2). 
Finally, we reevaluated the entire postglacial history of the species in the Western Palearctic (chapter 3): all Palearctic barn owls descend from pale barn owls from the Middle East that colonized the Iberian Peninsula via North Africa. After the end of the last glaciation (20,000 years ago), these pale barn owls colonized Western Europe and evolved the rufous phenotype under selection (during or after the colonization). A substantial part of the color variation can be explained by a mutation in the MC1R gene that appeared during or just after the colonization. This wave of colonization continued as far as Greece, where these rufous barn owls met pale barn owls (which had reached Greece from the Middle East via the Bosporus) in a secondary contact zone. Our analyses show that gene flow between white and rufous barn owls is limited in Greece, which suggests that they are on the way to forming two species and shows that barn owls constitute an incipient example of ring speciation around the Mediterranean. The revisited history of the barn owls of the Western Palearctic makes them a remarkable model system in several respects. It is a very clear example of strong local adaptation maintained despite high gene flow (strong differentiation in color and at the MC1R gene despite almost no neutral structure). It also offers a very good system for studying the interaction between colonization and selection, a topic that has been remarkably little studied despite its importance. Finally, it offers a very interesting case in the "speciation continuum", and it would be very interesting to study in more detail the extent of gene flow between the color morphs in Greece.