67 results for genome project
in Consorci de Serveis Universitaris de Catalunya (CSUC), Spain
Abstract:
The success of the Human Genome Project (HGP) in 2000 brought "personalized medicine" closer to reality. The HGP's discoveries have simplified sequencing techniques to the point that anyone can now obtain their complete DNA sequence. Read mapping technology stands out among these techniques and is characterized by handling a large amount of data. Hadoop, Apache's framework for data-intensive applications under the MapReduce paradigm, is a perfect ally for this kind of technology and was the option chosen for this project. Throughout the work, the study, analysis, and experiments needed to obtain an innovative Genetic Algorithm that uses Hadoop's full potential are carried out.
Abstract:
Since the start of the Human Genome Project and its success in 2001, the genomes of a multitude of species have been sequenced. Improvements in sequencing technologies have generated data volumes that grow exponentially. The project "Análisis bioinformáticos sobre la tecnología Hadoop" (Bioinformatic analyses on Hadoop technology) covers the parallel computation of biological data such as DNA sequences. The study has been guided by the nature of the problem to be solved: the alignment of genetic sequences with the MapReduce paradigm.
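As a rough illustration of how sequence alignment can be phrased in MapReduce terms, the following minimal sketch simulates the map, shuffle, and reduce phases in plain Python. It is not the project's method: the seed length `K`, the exact-match verification, and all function names are assumptions made for this example, and a real Hadoop job would distribute these phases across a cluster.

```python
from collections import defaultdict

K = 4  # seed length; an illustrative assumption, not taken from the abstract


def map_reference(reference):
    """Map phase over the reference: emit (k-mer, ("ref", position))."""
    for pos in range(len(reference) - K + 1):
        yield reference[pos:pos + K], ("ref", pos)


def map_read(read_id, read):
    """Map phase over a read: emit its first k-mer as a seed."""
    yield read[:K], ("read", read_id)


def shuffle(pairs):
    """Group values by key, as the framework does between map and reduce."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_seed(values):
    """Reduce phase: pair each read with each reference position sharing the seed."""
    positions = [v[1] for v in values if v[0] == "ref"]
    read_ids = [v[1] for v in values if v[0] == "read"]
    for read_id in read_ids:
        for pos in positions:
            yield read_id, pos


def candidate_positions(reference, reads):
    """Return, per read, the reference positions where the whole read matches exactly."""
    pairs = list(map_reference(reference))
    for read_id, read in reads.items():
        pairs.extend(map_read(read_id, read))
    hits = defaultdict(list)
    for values in shuffle(pairs).values():
        for read_id, pos in reduce_seed(values):
            read = reads[read_id]
            # Verify the seed hit by comparing the full read (exact match only).
            if reference[pos:pos + len(read)] == read:
                hits[read_id].append(pos)
    return {read_id: sorted(found) for read_id, found in hits.items()}
```

For instance, `candidate_positions("ACGTACGTTGCA", {"r1": "ACGTT"})` seeds `r1` at two reference positions and keeps only the one where the full read matches. Keying on the k-mer means each reducer sees all reads and reference positions for one seed, which is what makes the work easy to split.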
Abstract:
The goals of the human genome project did not include sequencing of the heterochromatic regions. We describe here an initial sequence of 1.1 Mb of the short arm of human chromosome 21 (HSA21p), estimated to be 10% of 21p. This region contains extensive euchromatic-like sequence and includes on average one transcript every 100 kb. These transcripts show multiple inter- and intrachromosomal copies, and extensive copy number and sequence variability. The sequencing of the "heterochromatic" regions of the human genome is likely to reveal many additional functional elements and provide important evolutionary information.
Abstract:
With the increasing availability of various 'omics data, high-quality orthology assignment is crucial for evolutionary and functional genomics studies. We here present the fourth version of the eggNOG database (available at http://eggnog.embl.de) that derives nonsupervised orthologous groups (NOGs) from complete genomes, and then applies a comprehensive characterization and analysis pipeline to the resulting gene families. Compared with the previous version, we have more than tripled the underlying species set to cover 3686 organisms, keeping track with genome project completions while prioritizing the inclusion of high-quality genomes to minimize error propagation from incomplete proteome sets. Major technological advances include (i) a robust and scalable procedure for the identification and inclusion of high-quality genomes, (ii) provision of orthologous groups for 107 different taxonomic levels compared with 41 in eggNOGv3, (iii) identification and annotation of particularly closely related orthologous groups, facilitating analysis of related gene families, (iv) improvements of the clustering and functional annotation approach, (v) adoption of a revised tree building procedure based on the multiple alignments generated during the process and (vi) implementation of quality control procedures throughout the entire pipeline. As in previous versions, eggNOGv4 provides multiple sequence alignments and maximum-likelihood trees, as well as broad functional annotation. Users can access the complete database of orthologous groups via a web interface, as well as through bulk download.
Abstract:
Background: We present the results of EGASP, a community experiment to assess the state-of-the-art in genome annotation within the ENCODE regions, which span 1% of the human genome sequence. The experiment had two major goals: the assessment of the accuracy of computational methods to predict protein coding genes; and the overall assessment of the completeness of the current human genome annotations as represented in the ENCODE regions. For the computational prediction assessment, eighteen groups contributed gene predictions. We evaluated these submissions against each other based on a ‘reference set’ of annotations generated as part of the GENCODE project. These annotations were not available to the prediction groups prior to the submission deadline, so that their predictions were blind and an external advisory committee could perform a fair assessment. Results: The best methods had at least one gene transcript correctly predicted for close to 70% of the annotated genes. Nevertheless, the multiple transcript accuracy, taking into account alternative splicing, reached only approximately 40% to 50% accuracy. At the coding nucleotide level, the best programs reached an accuracy of 90% in both sensitivity and specificity. Programs relying on mRNA and protein sequences were the most accurate in reproducing the manually curated annotations. Experimental validation shows that only a very small percentage (3.2%) of the selected 221 computationally predicted exons outside of the existing annotation could be verified. Conclusions: This is the first such experiment in human DNA, and we have followed the standards established in a similar experiment, GASP1, in Drosophila melanogaster. We believe the results presented here contribute to the value of ongoing large-scale annotation projects and should guide further experimental methods when being scaled up to the entire human genome sequence.
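To make the nucleotide-level figures concrete, here is a small sketch of how sensitivity and specificity could be computed for one prediction. The abstract does not give formulas, so this assumes the convention common in gene-prediction evaluation: sensitivity is TP / (TP + FN), and "specificity" is TP / (TP + FP), which other fields would call precision. The half-open interval representation and the function names are illustrative choices only.

```python
def coding_positions(exons):
    """Expand (start, end) exon intervals, end exclusive, into a set of positions."""
    positions = set()
    for start, end in exons:
        positions.update(range(start, end))
    return positions


def nucleotide_accuracy(annotated_exons, predicted_exons):
    """Return (sensitivity, specificity) at the coding nucleotide level.

    Assumed definitions (gene-prediction convention, not stated in the abstract):
      sensitivity = TP / (TP + FN)  -> share of annotated coding bases predicted
      specificity = TP / (TP + FP)  -> share of predicted coding bases annotated
    """
    annotated = coding_positions(annotated_exons)
    predicted = coding_positions(predicted_exons)
    true_positives = len(annotated & predicted)
    sensitivity = true_positives / len(annotated) if annotated else 0.0
    specificity = true_positives / len(predicted) if predicted else 0.0
    return sensitivity, specificity
```

With one annotated exon covering positions 0 to 99 and a prediction shifted by ten bases, `nucleotide_accuracy([(0, 100)], [(10, 110)])`, 90 of the 100 bases in each set overlap, so both measures come out at 0.9.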
Abstract:
Background: Non-long terminal repeat (non-LTR) retrotransposons have contributed to shaping the structure and function of genomes. In silico and experimental approaches have been used to identify the non-LTR elements of the urochordate Ciona intestinalis. Knowledge of the types and abundance of non-LTR elements in urochordates is a key step in understanding their contribution to the structure and function of vertebrate genomes. Results: Consensus elements phylogenetically related to the I, LINE1, LINE2, LOA and R2 elements of the 14 eukaryotic non-LTR clades are described from C. intestinalis. The ascidian elements showed conservation of both the reverse transcriptase coding sequence and the overall structural organization seen in each clade. The apurinic/apyrimidinic endonuclease and nucleic-acid-binding domains encoded upstream of the reverse transcriptase, and the RNase H and the restriction enzyme-like endonuclease motifs encoded downstream of the reverse transcriptase were identified in the corresponding Ciona families. Conclusions: The genome of C. intestinalis harbors representatives of at least five clades of non-LTR retrotransposons. The copy number per haploid genome of each element is low, less than 100, far below the values reported for vertebrate counterparts but within the range for protostomes. Genomic and sequence analysis shows that the ascidian non-LTR elements are unmethylated and flanked by genomic segments with a gene density lower than average for the genome. The analysis provides valuable data for understanding the evolution of early chordate genomes and enlarges the view on the distribution of the non-LTR retrotransposons in eukaryotes.
Abstract:
Study carried out during a stay at the J.W. Jenkinson Laboratory for Evolution and Development of the University of Oxford, United Kingdom, between 2010 and 2012. I was a member of Professor Peter W.H. Holland's laboratory as a Beatriu de Pinós postdoctoral fellow from September 2010 to September 2012. Our research project focuses on the comparative genomic analysis of the Animal Kingdom, exploring genome content across all branches of the animal tree. All references to my publications during this postdoc can be found at http://about.me/jordi_paps. I believe that the number and quality of the results of my postdoc, a total of 8 publications including two articles in the prestigious journal Nature, are proof of its success. Prof. Peter W.H. Holland (Department of Zoology, University of Oxford) and I are co-authors of three comparative genomics articles that are direct results of this project: 1) a comparison of gene families between vertebrates and invertebrates (Briefings in Functional Genomics), 2) the oyster genome (published in Nature), and 3) the genomes of 6 parasitic flatworms (also accepted in Nature). In addition, we have 2 other papers in preparation. One of them analyzes the evolution, expression, and function of the Hox genes in the tapeworm Hymenolepis. The fine-scale profile of these key developmental genes sheds light on changes in the organisms' lifestyles. During this latest postdoc I also took part in several collaborations, including an analysis of aging genes in flatworms, a study of the phylogeny of the group Gastrotricha, a review of the evolution of the phylum Platyhelminthes, and a book chapter on the evolution of bilaterian animals. Finally, thanks to the Beatriu de Pinós fellowship, Prof. Peter W.H. Holland has invited me to join his team as a postdoctoral researcher on his current ERC Advanced project on genome duplications.
Abstract:
There is great scientific and popular interest in understanding the genetic history of populations in the Americas. We wish to understand when different regions of the continent were inhabited, where settlers came from, and how current inhabitants relate genetically to earlier populations. Recent studies unraveled parts of the genetic history of the continent using genotyping arrays and uniparental markers. The 1000 Genomes Project provides a unique opportunity for improving our understanding of population genetic history by providing over a hundred sequenced low coverage genomes and exomes from Colombian (CLM), Mexican-American (MXL), and Puerto Rican (PUR) populations. Here, we explore the genomic contributions of African, European, and especially Native American ancestry to these populations. Estimated Native American ancestry is 48% in MXL, 25% in CLM, and 13% in PUR. Native American ancestry in PUR is most closely related to populations surrounding the Orinoco River basin, confirming the Southern American ancestry of the Taíno people of the Caribbean. We present new methods to estimate the allele frequencies in the Native American fraction of the populations, and model their distribution using a demographic model for three ancestral Native American populations. These ancestral populations likely split in close succession: the most likely scenario, based on a peopling of the Americas 16 thousand years ago (kya), supports that the MXL Ancestors split 12.2kya, with a subsequent split of the ancestors to CLM and PUR 11.7kya. The model also features effective populations of 62,000 in Mexico, 8,700 in Colombia, and 1,900 in Puerto Rico. Modeling Identity-by-descent (IBD) and ancestry tract length, we show that post-contact populations also differ markedly in their effective sizes and migration patterns, with Puerto Rico showing the smallest effective size and the earlier migration from Europe. 
Finally, we compare IBD and ancestry assignments to find evidence for relatedness among European founders to the three populations.
Abstract:
In the evolution of Catalan nationalism, both political and cultural, the period of the Second Spanish Republic (1931-1939) was essential. The attainment of the Statute of Autonomy (1931-1932) marked the beginning of a stage of expansion in many respects. One of these was contact with the Catalanist nuclei in the rest of the Catalan-language cultural space, which at that time began to be called the Catalan Countries (Balearic Islands, Valencian Country, Andorra, Rosselló, and l'Alguer). In these collaborations between cultural and political organizations and private individuals, Catalonia was always the model to follow. The growing connections were visible in the press, as well as in cultural celebrations, party policy, and the Constituent Cortes. This evolution was cut short by Franco's victory in the Civil War in 1939.
Abstract:
The aim of the project has been to demonstrate how the farm animal breeding industry can utilise gene mapping technology to accelerate genetic improvement. Previous theoretical studies had suggested that the use of marker assisted selection could potentially increase the annual improvement for quantitative traits like backfat by about 10% and for more difficult traits such as meat quality and reproduction by as much as 40-60% compared with existing technology. The work has comprised two major tasks: 1. Commercially relevant populations have been screened for segregation at QTLs identified in experimental populations. The aim has been to establish optimal strategies for QTL detection in commercial pig populations and the extent to which QTLs explaining major phenotypic differences between divergent lines used in experimental studies also explain quantitative variation within commercial lines. The results are important for specifying future strategies for finding economically valuable QTLs. 2. Marker assisted backcrossing has been used to demonstrate how a QTL allele can be introgressed from one breed to another. The work has focused on the major fatness QTL on pig chromosome 4 previously identified in a wild pig/Large White intercross. The end result was not designed to be a commercially viable product in its own right, but the process has validated a number of points of major importance for the exploitation of QTLs in livestock.
Abstract:
Agents voluntarily contribute to an infinitely repeated joint project. We investigate the conditions for cooperation to be a renegotiation-proof and coalition-proof equilibrium before examining the influence of output share inequality on the sustainability of cooperation. When shares are not equally distributed, cooperation requires agents to be more patient than under perfect equality. Beyond a certain degree of share inequality, full efficiency cannot be reached without redistribution. This model also explains the coexistence of one cooperating and one free-riding coalition. In this case, increasing inequality can have a positive or negative impact on the aggregate level of effort.
Abstract:
Research project developed during a stay at Stanford University, USA, between 2007 and 2009. In recent years there has been spectacular progress in the technology applied to genome and proteome analysis (microarrays, real-time quantitative PCR, two-dimensional electrophoresis, mass spectrometry, etc.), allowing complex samples to be resolved and different genes and proteins to be detected quantitatively in a single experiment. Its importance also lies in the ability to identify potential therapeutic targets and possible drugs, as well as in its application to the design and development of new diagnostic tools. The applicability of current techniques, however, is limited by the level at which tissue can be dissected. Although they give valuable information on the expression of genes and proteins involved in a disease or in the response to a drug, for example, in no case do they provide in situ information, spatial information, or temporal resolution, nor do they provide information on in vivo systems. The aim of this project is to develop and validate a new high-resolution, ultrasensitive, easy-to-use microscope that allows both the real-time detection of metabolites, genes, or proteins in the living cell and the study of their function, thereby obtaining a detailed description of the protein/gene interactions that take place inside the cell. This microscope will be a sensitive, selective, fast, robust, automated, and moderately priced instrument that will carry out genetic, medical, chemical, and pharmaceutical high-throughput screening (for diagnostic applications and for the identification and selection of active compounds) more efficiently.
To achieve these objectives, the microscope will make use of the newest technologies: 1) optical microscopy and imaging, to improve spatial visualization and image sensitivity; 2) new detection methods, including the latest advances in nanoparticles; 3) the creation of computational methods to acquire, store, and process the images obtained.
Abstract:
The premise of this work is the application of Google Art Project technology to the Museu d'Art Contemporani de Barcelona (MACBA) as a way of bringing art closer to a more international audience. To this end, a digital communication strategy will be developed that takes this tool as its main element and extends to other interactive methods on social networks and other 2.0 socialization spaces. The development of this strategy will be grounded in the real context of contemporary art in Barcelona and in its closest possible fit with this innovative initiative.
Abstract:
With the advent of high performance computing (HPC), it is now possible to achieve orders-of-magnitude gains in performance and computational efficiency over conventional computer architectures. This thesis explores the potential of using high performance computing to accelerate whole genome alignment (WGA). A parallel technique is applied to an algorithm for whole genome alignment; the technique is explained, and experiments were carried out to test it. The technique is based on a fair usage of the available resources to execute genome alignment and on how this can be used in HPC clusters. This work is a first approximation to whole genome alignment, and it shows the advantages of parallelism and some of the drawbacks of our technique. It describes the resource limitations of current WGA applications when dealing with large quantities of sequences, and it proposes a parallel heuristic to distribute the load and to ensure that alignment quality is maintained.