19 resultados para FORESTs Genome Project database

em Consorci de Serveis Universitaris de Catalunya (CSUC), Spain


Relevância:

100.00% 100.00%

Publicador:

Resumo:

With the increasing availability of various 'omics data, high-quality orthology assignment is crucial for evolutionary and functional genomics studies. We here present the fourth version of the eggNOG database (available at http://eggnog.embl.de) that derives nonsupervised orthologous groups (NOGs) from complete genomes, and then applies a comprehensive characterization and analysis pipeline to the resulting gene families. Compared with the previous version, we have more than tripled the underlying species set to cover 3686 organisms, keeping track with genome project completions while prioritizing the inclusion of high-quality genomes to minimize error propagation from incomplete proteome sets. Major technological advances include (i) a robust and scalable procedure for the identification and inclusion of high-quality genomes, (ii) provision of orthologous groups for 107 different taxonomic levels compared with 41 in eggNOGv3, (iii) identification and annotation of particularly closely related orthologous groups, facilitating analysis of related gene families, (iv) improvements of the clustering and functional annotation approach, (v) adoption of a revised tree building procedure based on the multiple alignments generated during the process and (vi) implementation of quality control procedures throughout the entire pipeline. As in previous versions, eggNOGv4 provides multiple sequence alignments and maximum-likelihood trees, as well as broad functional annotation. Users can access the complete database of orthologous groups via a web interface, as well as through bulk download.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

L’èxit del Projecte Genoma Humà (PGH) l’any 2000 va fer de la “medicina personalitzada” una realitat més propera. Els descobriments del PGH han simplificat les tècniques de seqüenciació de tal manera que actualment qualsevol persona pot aconseguir la seva seqüència d’ADN complerta. La tecnologia de Read Mapping destaca en aquest tipus de tècniques i es caracteritza per manegar una gran quantitat de dades. Hadoop, el framework d’Apache per aplicacions intensives de dades sota el paradigma Map Reduce, resulta un aliat perfecte per aquest tipus de tecnologia i ha sigut l’opció escollida per a realitzar aquest projecte. Durant tot el treball es realitza l’estudi, l’anàlisi i les experimentacions necessàries per aconseguir un Algorisme Genètic innovador que utilitzi tot el potencial de Hadoop.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Desde el inicio del proyecto del genoma humano y su éxito en el año 2001 se han secuenciado genomas de multitud de especies. La mejora en las tecnologías de secuenciación ha generado volúmenes de datos con un crecimiento exponencial. El proyecto Análisis bioinformáticos sobre la tecnología Hadoop abarca la computación paralela de datos biológicos como son las secuencias de ADN. El estudio ha sido encauzado por la naturaleza del problema a resolver. El alineamiento de secuencias genéticas con el paradigma MapReduce.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Es un proyecto sobre todos los pasos a seguir para un proyecto informático. Este proyecto está realizado sobre una gestión de amonestaciones y sanciones en un centro educativo. Tenemos el diseño de la base de datos, su implementación y todas las diferentes pruebas para su testeo.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The goals of the human genome project did not include sequencing of the heterochromatic regions. We describe here an initial sequence of 1.1 Mb of the short arm of human chromosome 21 (HSA21p), estimated to be 10% of 21p. This region contains extensive euchromatic-like sequence and includes on average one transcript every 100 kb. These transcripts show multiple inter- and intrachromosomal copies, and extensive copy number and sequence variability. The sequencing of the "heterochromatic" regions of the human genome is likely to reveal many additional functional elements and provide important evolutionary information.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

Background: We present the results of EGASP, a community experiment to assess the state-ofthe-art in genome annotation within the ENCODE regions, which span 1% of the human genomesequence. The experiment had two major goals: the assessment of the accuracy of computationalmethods to predict protein coding genes; and the overall assessment of the completeness of thecurrent human genome annotations as represented in the ENCODE regions. For thecomputational prediction assessment, eighteen groups contributed gene predictions. Weevaluated these submissions against each other based on a ‘reference set’ of annotationsgenerated as part of the GENCODE project. These annotations were not available to theprediction groups prior to the submission deadline, so that their predictions were blind and anexternal advisory committee could perform a fair assessment.Results: The best methods had at least one gene transcript correctly predicted for close to 70%of the annotated genes. Nevertheless, the multiple transcript accuracy, taking into accountalternative splicing, reached only approximately 40% to 50% accuracy. At the coding nucleotidelevel, the best programs reached an accuracy of 90% in both sensitivity and specificity. Programsrelying on mRNA and protein sequences were the most accurate in reproducing the manuallycurated annotations. Experimental validation shows that only a very small percentage (3.2%) of the selected 221 computationally predicted exons outside of the existing annotation could beverified.Conclusions: This is the first such experiment in human DNA, and we have followed thestandards established in a similar experiment, GASP1, in Drosophila melanogaster. We believe theresults presented here contribute to the value of ongoing large-scale annotation projects and shouldguide further experimental methods when being scaled up to the entire human genome sequence.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

The NW Mediterranean region experiences every year heavy rainfall and flash floods that occasionally produce catastrophic damages. Less frequent are floods that affect large regions. Although a large number of databases devoted exclusively to floods or considering all kind of natural hazards do exist, usually they only record catastrophic flood events. This paper deals with the new flood database that is being developed within the framework of HYMEX project. Results are focused on four regions representative of the NW sector of Mediterranean Europe: Catalonia, Spain; the Balearic Islands, Spain; Calabria, Italy; and Languedoc-Roussillon, Midi-Pyrenées and PACA, France. The common available 30-yr period starts in 1981 and ends in 2010. The paper shows the database structure and criteria, the comparison with other flood databases, some statistics on spatial and temporal distribution, and an identification of the most important events. The paper also provides a table that includes the date and affected region of all the catastrophic events identified in the regions of study, in order to make this information available for all audiences.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

Background: Non-long terminal repeat (non-LTR) retrotransposons have contributed to shaping the structure and function of genomes. In silico and experimental approaches have been used to identify the non-LTR elements of the urochordate Ciona intestinalis. Knowledge of the types and abundance of non-LTR elements in urochordates is a key step in understanding their contribution to the structure and function of vertebrate genomes. Results: Consensus elements phylogenetically related to the I, LINE1, LINE2, LOA and R2 elements of the 14 eukaryotic non-LTR clades are described from C. intestinalis. The ascidian elements showed conservation of both the reverse transcriptase coding sequence and the overall structural organization seen in each clade. The apurinic/apyrimidinic endonuclease and nucleic-acid-binding domains encoded upstream of the reverse transcriptase, and the RNase H and the restriction enzyme-like endonuclease motifs encoded downstream of the reverse transcriptase were identified in the corresponding Ciona families. Conclusions: The genome of C. intestinalis harbors representatives of at least five clades of non-LTR retrotransposons. The copy number per haploid genome of each element is low, less than 100, far below the values reported for vertebrate counterparts but within the range for protostomes. Genomic and sequence analysis shows that the ascidian non-LTR elements are unmethylated and flanked by genomic segments with a gene density lower than average for the genome. The analysis provides valuable data for understanding the evolution of early chordate genomes and enlarges the view on the distribution of the non-LTR retrotransposons in eukaryotes.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Heriot-Watt University uses a software package called Syllabus Plus for its timetabling. This package can perform scheduling functions however it is currently employed only as a room booking system at present. In academic session 2008-2009 the university will be restructuring its academic year from 3 terms of 10 weeks to semesters of 14 weeks and therefore major changes will be required to the timetabling information. This project has two functions, both with practical and relevant applications to the timetabling of the university. The aims of the project are the ability to change population number of modules and activities, delete term 3 modules and activities, the ability to change module and activity name, and change the teaching week pattern from the semester

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Estudi realitzat a partir d’una estada a la Institut J.W. Jenkinson Laboratory for Evolution and Development of the University of Oxford, Regne Unit, entre 2010 i 2012. He estat membre del laboratori del Professor Peter W.H. Holland com a becari post-doctoral Beatriu de Pinós des de setembre de 2010 al setembre de 2012. El nostre projecte de recerca se centra en l'anàlisi genòmic comparatiu del Regne Animal, tot explorant el contingut dels genomes a través de totes les branques de l'arbre dels animals. Totes les referències a les meves publicacions durant aquest post-doc es poden trobar a http://about.me/jordi_paps. Crec que el nombre i la qualitat dels resultats del meu post-doc, un total de 8 publicacions incloent dos articles a la prestigiosa revista Nature, són prova de l'èxit d'aquest post-doc. Prof Peter W. H. Holland (Departament de Zoologia de la Universitat d'Oxford) i jo som coautors de tres articles de genòmica comparativa, resultats directes d'aquest projecte: 1) comparació de families gèniques entre vertebrats invertebrats (Briefings in Functional Genomics), 2) el genoma de l'ostra (publicat a la revista Nature), i 3) els genomes de 6 platihelmints paràsits (acceptat també a Nature). A més, tenim altres 2 treballs en preparació. Un d'ells analitza l'evolució, expressió i funció dels gens Hox al a la tènia Hymenolepis. El perfil fi d'aquests gens clau del desenvolupament esclareix els canvis d'estil de vida dels organismes. A més, durant aquest últim post-doc he participat en diverses col•laboracions, incloent anàlisi de gens d'envelliment a cucs plans, un estudi sobre la filogènia del grup Gastrotricha, una revisió de l'evolució phylum Platyhelminthes, així com un capítol d'un llibre sobre l'evolució dels animals bilaterals. Finalment, gràcies a la beca Beatriu de Pinós, el Prof. Peter W.H. Holland m'ha convidat a formar part del seu equip com un investigador post-doctoral en el seu projecte ERC Advance actual sobre duplicacions genòmiques.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Annotation of protein-coding genes is a key goal of genome sequencing projects. In spite of tremendous recent advances in computational gene finding, comprehensive annotation remains a challenge. Peptide mass spectrometry is a powerful tool for researching the dynamic proteome and suggests an attractive approach to discover and validate protein-coding genes. We present algorithms to construct and efficiently search spectra against a genomic database, with no prior knowledge of encoded proteins. By searching a corpus of 18.5 million tandem mass spectra (MS/MS) from human proteomic samples, we validate 39,000 exons and 11,000 introns at the level of translation. We present translation-level evidence for novel or extended exons in 16 genes, confirm translation of 224 hypothetical proteins, and discover or confirm over 40 alternative splicing events. Polymorphisms are efficiently encoded in our database, allowing us to observe variant alleles for 308 coding SNPs. Finally, we demonstrate the use of mass spectrometry to improve automated gene prediction, adding 800 correct exons to our predictions using a simple rescoring strategy. Our results demonstrate that proteomic profiling should play a role in any genome sequencing project.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Selenoproteins are a diverse group of proteinsusually misidentified and misannotated in sequencedatabases. The presence of an in-frame UGA (stop)codon in the coding sequence of selenoproteingenes precludes their identification and correctannotation. The in-frame UGA codons are recodedto cotranslationally incorporate selenocysteine,a rare selenium-containing amino acid. The developmentof ad hoc experimental and, more recently,computational approaches have allowed the efficientidentification and characterization of theselenoproteomes of a growing number of species.Today, dozens of selenoprotein families have beendescribed and more are being discovered in recentlysequenced species, but the correct genomic annotationis not available for the majority of thesegenes. SelenoDB is a long-term project that aims toprovide, through the collaborative effort of experimentaland computational researchers, automaticand manually curated annotations of selenoproteingenes, proteins and SECIS elements. Version 1.0 ofthe database includes an initial set of eukaryoticgenomic annotations, with special emphasis on thehuman selenoproteome, for immediate inspectionby selenium researchers or incorporation into moregeneral databases. SelenoDB is freely available athttp://www.selenodb.org.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

A new multimodal biometric database designed and acquired within the framework of the European BioSecure Network of Excellence is presented. It is comprised of more than 600 individuals acquired simultaneously in three scenarios: 1) over the Internet, 2) in an office environment with desktop PC, and 3) in indoor/outdoor environments with mobile portable hardware. The three scenarios include a common part of audio/video data. Also, signature and fingerprint data have been acquired both with desktop PC and mobile portable hardware. Additionally, hand and iris data were acquired in the second scenario using desktop PC. Acquisition has been conducted by 11 European institutions. Additional features of the BioSecure Multimodal Database (BMDB) are: two acquisitionsessions, several sensors in certain modalities, balanced gender and age distributions, multimodal realistic scenarios with simple and quick tasks per modality, cross-European diversity, availability of demographic data, and compatibility with other multimodal databases. The novel acquisition conditions of the BMDB allow us to perform new challenging research and evaluation of eithermonomodal or multimodal biometric systems, as in the recent BioSecure Multimodal Evaluation campaign. A description of this campaign including baseline results of individual modalities from the new database is also given. The database is expected to beavailable for research purposes through the BioSecure Association during 2008.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

One of the aims of the MEDEX project is to improve the knowledge of high-impact weather events in the Mediterranean. According to the guidelines of this project, a pilot study was carried out in two regions of Spain (the Balearic Islands and Catalonia) by the Social Impact Research group of MEDEX. The main goal is to suggest some general and suitable criteria about how to analyse requests received in Meteorological Services arising out of the damage caused by weather events. Thus, all the requests received between 2000 and 2002 at the Servei Meteorològic de Catalunya as well as at the Division of AEMET in the Balearic Islands were analysed. Firstly, the proposed criteria in order to build the database are defined and discussed. Secondly, the temporal distribution of the requests for damage claims is analysed. On average, almost half of them were received during the first month after the event happened. During the first six months, the percentage increases by 90%. Thirdly, various factors are taken into account to determine the impact of specific events on society. It is remarkable that the greatest number of requests is for those episodes with simultaneous heavy rain and strong wind, and finally, those that are linked to high population density.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Information about the genomic coordinates and the sequence of experimentally identified transcription factor binding sites is found scattered under a variety of diverse formats. The availability of standard collections of such high-quality data is important to design, evaluate and improve novel computational approaches to identify binding motifs on promoter sequences from related genes. ABS (http://genome.imim.es/datasets/abs2005/index.html) is a public database of known binding sites identified in promoters of orthologous vertebrate genes that have been manually curated from bibliography. We have annotated 650 experimental binding sites from 68 transcription factors and 100 orthologous target genes in human, mouse, rat or chicken genome sequences. Computational predictions and promoter alignment information are also provided for each entry. A simple and easy-to-use web interface facilitates data retrieval allowing different views of the information. In addition, the release 1.0 of ABS includes a customizable generator of artificial datasets based on the known sites contained in the collection and an evaluation tool to aid during the training and the assessment of motif-finding programs.