738 resultados para Annotation de génomes


Relevância:

10.00% 10.00%

Publicador:

Resumo:

The discovery and clinical application of molecular biomarkers in solid tumors, increasingly relies on nucleic acid extraction from FFPE tissue sections and subsequent molecular profiling. This in turn requires the pathological review of haematoxylin & eosin (H&E) stained slides, to ensure sample quality, tumor DNA sufficiency by visually estimating the percentage tumor nuclei and tumor annotation for manual macrodissection. In this study on NSCLC, we demonstrate considerable variation in tumor nuclei percentage between pathologists, potentially undermining the precision of NSCLC molecular evaluation and emphasising the need for quantitative tumor evaluation. We subsequently describe the development and validation of a system called TissueMark for automated tumor annotation and percentage tumor nuclei measurement in NSCLC using computerized image analysis. Evaluation of 245 NSCLC slides showed precise automated tumor annotation of cases using Tissuemark, strong concordance with manually drawn boundaries and identical EGFR mutational status, following manual macrodissection from the image analysis generated tumor boundaries. Automated analysis of cell counts for % tumor measurements by Tissuemark showed reduced variability and significant correlation (p < 0.001) with benchmark tumor cell counts. This study demonstrates a robust image analysis technology that can facilitate the automated quantitative analysis of tissue samples for molecular profiling in discovery and diagnostics.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Static timing analysis provides the basis for setting the clock period of a microprocessor core, based on its worst-case critical path. However, depending on the design, this critical path is not always excited and therefore dynamic timing margins exist that can theoretically be exploited for the benefit of better speed or lower power consumption (through voltage scaling). This paper introduces predictive instruction-based dynamic clock adjustment as a technique to trim dynamic timing margins in pipelined microprocessors. To this end, we exploit the different timing requirements for individual instructions during the dynamically varying program execution flow without the need for complex circuit-level measures to detect and correct timing violations. We provide a design flow to extract the dynamic timing information for the design using post-layout dynamic timing analysis and we integrate the results into a custom cycle-accurate simulator. This simulator allows annotation of individual instructions with their impact on timing (in each pipeline stage) and rapidly derives the overall code execution time for complex benchmarks. The design methodology is illustrated at the microarchitecture level, demonstrating the performance and power gains possible on a 6-stage OpenRISC in-order general purpose processor core in a 28nm CMOS technology. We show that employing instruction-dependent dynamic clock adjustment leads on average to an increase in operating speed by 38% or to a reduction in power consumption by 24%, compared to traditional synchronous clocking, which at all times has to respect the worst-case timing identified through static timing analysis.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

The introduction of Next Generation Sequencing (NGS) has revolutionised population genetics, providing studies of non-model species with unprecedented genomic coverage, allowing evolutionary biologists to address questions previously far beyond the reach of available resources. Furthermore, the simple mutation model of Single Nucleotide Polymorphisms (SNPs) permits cost-effective high-throughput genotyping in thousands of individuals simultaneously. Genomic resources are scarce for the Atlantic herring (Clupea harengus), a small pelagic species that sustains high revenue fisheries. This paper details the development of 578 SNPs using a combined NGS and high-throughput genotyping approach. Eight individuals covering the species distribution in the eastern Atlantic were bar-coded and multiplexed into a single cDNA library and sequenced using the 454 GS FLX platform. SNP discovery was performed by de novo sequence clustering and contig assembly, followed by the mapping of reads against consensus contig sequences. Selection of candidate SNPs for genotyping was conducted using an in silico approach. SNP validation and genotyping were performed simultaneously using an Illumina 1,536 GoldenGate assay. Although the conversion rate of candidate SNPs in the genotyping assay cannot be predicted in advance, this approach has the potential to maximise cost and time efficiencies by avoiding expensive and time-consuming laboratory stages of SNP validation. Additionally, the in silico approach leads to lower ascertainment bias in the resulting SNP panel as marker selection is based only on the ability to design primers and the predicted presence of intron-exon boundaries. Consequently SNPs with a wider spectrum of minor allele frequencies (MAFs) will be genotyped in the final panel. The genomic resources presented here represent a valuable multi-purpose resource for developing informative marker panels for population discrimination, microarray development and for population genomic studies in the wild.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

The growing accessibility to genomic resources using next-generation sequencing (NGS) technologies has revolutionized the application of molecular genetic tools to ecology and evolutionary studies in non-model organisms. Here we present the case study of the European hake (Merluccius merluccius), one of the most important demersal resources of European fisheries. Two sequencing platforms, the Roche 454 FLX (454) and the Illumina Genome Analyzer (GAII), were used for Single Nucleotide Polymorphisms (SNPs) discovery in the hake muscle transcriptome. De novo transcriptome assembly into unique contigs, annotation, and in silico SNP detection were carried out in parallel for 454 and GAII sequence data. High-throughput genotyping using the Illumina GoldenGate assay was performed for validating 1,536 putative SNPs. Validation results were analysed to compare the performances of 454 and GAII methods and to evaluate the role of several variables (e.g. sequencing depth, intron-exon structure, sequence quality and annotation). Despite well-known differences in sequence length and throughput, the two approaches showed similar assay conversion rates (approximately 43%) and percentages of polymorphic loci (67.5% and 63.3% for GAII and 454, respectively). Both NGS platforms therefore demonstrated to be suitable for large scale identification of SNPs in transcribed regions of non-model species, although the lack of a reference genome profoundly affects the genotyping success rate. The overall efficiency, however, can be improved using strict quality and filtering criteria for SNP selection (sequence quality, intron-exon structure, target region score).

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Evidence that persistent environmental pollutants may target the male reproductive system is increasing. The male reproductive system is regulated by secretion of testosterone by testicular Leydig cells, and perturbation of Leydig cell function may have ultimate consequences. 3-Methylsulfonyl-DDE (3-MeSO2-DDE) is a potent adrenal toxicants formed from the persistent insecticide DDT. Although studies have revealed the endocrine disruptive effect of 3-MeSO2-DDE, the underlying mechanisms at cellular level in steroidogenic Leydig cells remains to be established. The current study addresses the effect of 3-MeSO2-DDE on viability, hormone production and proteome response of primary neonatal porcine Leydig cells. The AlamarBlue™ assay was used to evaluate cell viability. Solid phase radioimmunoassay was used to measure concentration of hormones produced by both unstimulated and Luteinizing hormone (LH)-stimulated Leydig cells following 48h exposure. Protein samples from Leydig cells exposed to a non-cytotoxic concentration of 3-MeSO2-DDE (10μM) were subjected to nano-LC-MS/MS and analyzed on a Q Exactive mass spectrometer and quantified using label-free quantitative algorithm. Gene Ontology (GO) and Ingenuity Pathway Analysis (IPA) were carried out for functional annotation and identification of protein interaction networks. 3-MeSO2-DDE regulated Leydig cell steroidogenesis differentially depending on cell culture condition. Whereas its effect on testosterone secretion at basal condition was stimulatory, the effect on LH-stimulated cells was inhibitory. From triplicate experiments, a total of 6804 proteins were identified in which the abundance of 86 proteins in unstimulated Leydig cells and 145 proteins in LH-stimulated Leydig cells was found to be significantly regulated in response to 3-MeSO2-DDE exposure. These proteins not only are the first reported in relation to 3-MeSO2-DDE exposure, but also display small number of proteins shared between culture conditions, suggesting the action of 3-MeSO2-DDE on several targeted pathways, including mitochondrial dysfunction, oxidative phosphorylation, EIF2-signaling, and glutathione-mediated detoxification. Further identification and characterization of these proteins and pathways may build our understanding to the molecular basis of 3-MeSO2-DDE induced endocrine disruption in Leydig cells.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Introduction: Fewer than 50% of adults and 40% of youth meet US CDC guidelines for physical activity (PA) with the built environment (BE) a culprit for limited PA. A challenge in evaluating policy and BE change is the forethought to capture a priori PA behaviors and the ability to eliminate bias in post-change environments. The present objective was to analyze existing public data feeds to quantify effectiveness of BE interventions. The Archive of Many Outdoor Scenes (AMOS) has collected 135 million images of outdoor environments from 12,000 webcams since 2006. Many of these environments have experienced BE change. Methods: One example of BE change is the addition of protected bike lanes and a bike share program in Washington, DC.Weselected an AMOS webcam that captured this change. AMOS captures a photograph from eachwebcamevery half hour.AMOScaptured the 120 webcam photographs between 0700 and 1900 during the first work week of June 2009 and the 120 photographs from the same week in 2010. We used the Amazon Mechanical Turk (MTurk) website to crowd-source the image annotation. MTurk workers were paid US$0.01 to mark each pedestrian, cyclist and vehicle in a photograph. Each image was coded 5 unique times (n=1200). The data, counts of transportation mode, was downloaded to SPSS for analysis. Results: The number of cyclists per scene increased four-fold between 2009 and 2010 (F=36.72, p=0.002). There was no significant increase in pedestrians between the two years, however there was a significant increase in number of vehicles per scene (F=16.81, p

Relevância:

10.00% 10.00%

Publicador:

Resumo:

O nosso estudo debruça-se sobre o uso de estruturadores do discurso na interacção verbal, em contexto pedagógico. As nossas referências teóricas estão vinculadas à Análise do Discurso, quer à escola francesa (com origem na Linguística), quer à escola anglo-saxónica (com origem na Antropologia). Em relação à área da Linguística, buscámos os pressupostos da Pragmática, Sociolinguística e Psicolinguística; relativamente à Antropologia, seguimos as abordagens etnográficas, etnometodológicas e interaccionistas. Nesta pesquisa participaram 15 professores e 778 alunos de cinco escolas do ensino secundário/equiparado da cidade da Beira e da região de Maputo (Moçambique), que integravam, nomeadamente, as turmas do 1.º e 2.º ano do ramo comercial e 9.ª e 10.ª classe do ensino secundário geral. Observámos 40 aulas, das quais foram transcritas e analisadas 10 aulas. A transcrição e a anotação foram realizadas com o auxílio do programa Transcriber. Usámos métodos qualitativos e quantitativos e, predominantemente, procedimentos descritivos. Identificámos 4700 marcadores discursivos distribuídos nas seguintes subcategorias: marcadores discursivos directivos, marcadores discursivos de confirmação, marcadores discursivos de natureza fáctica e de concordância e as interjeições como marcadores discursivos. Os resultados da nossa pesquisa permitiram-nos concluir que os marcadores discursivos e as disfluências desempenham funções ligadas à estruturação textual-interactiva. Estes fenómenos linguísticos, ao estruturarem o discurso de professores e alunos, contribuem para a produção/compreensão de sentido das frases.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

O conhecimento de mecanismos de genómica funcional tem sido maioritariamente adquirido pela utilização de organismos modelo que são mantidos em condições laboratoriais. Contudo, estes organismos não reflectem as respostas a alterações ambientais. Por outro lado, várias espécies, ecologicamente bem estudadas, reflectem bem as interacções entre genes e ambiente mas que, das quais não existem recursos genéticos disponíveis. O imposex, caracterizado pela superimposição de caracteres sexuais masculinos em fêmeas, é induzido pelo tributilestanho (TBT) e trifenilestanho (TPT) e representa um dos melhores exemplos de disrupção endócrina com causas antropogénicas no ambiente aquático. Com o intuito de elucidar as bases moleculares deste fenómeno, procedeu-se à combinação das metodologias de pirosequenciação (sequenciação 454 da Roche) e microarrays (Agilent 4*180K) de forma a contribuir para um melhor conhecimento desta interacção gene-ambiente no gastrópode Nucella lapillus, uma espécie sentinela para imposex. O trancriptoma de N. lapillus foi sequenciado, reconstruído e anotado e posteriormente utilizado para a produção de um “array” de nucleótidos. Este array foi então utilizado para explorar níveis de expressão génica em resposta à contaminação por TBT. Os resultados obtidos confirmaram as hipóteses anteriormente propostas (esteróidica, neuroendócrina, retinóica) e adicionalmente revelou a existência de potenciais novos mecanismos envolvidos no fenómeno imposex. Evidência para alvos moleculares de disrupção endócrina não relacionados com funções reprodutoras, tais como, sistema imunitário, apoptose e supressores de tumores, foram identificados. Apesar disso, tendo em conta a forte componente reprodutiva do imposex, esta componente funcional foi a mais explorada. Assim, factores de transcrição e receptores nucleares lipofílicos, funções mitocondriais e actividade de transporte celular envolvidos na diferenciação de géneros estão na base de potenciais novos mecanismos associados ao imposex em N. lapillus. Em particular, foi identificado como estando sobre-expresso, um possível homólogo do receptor nuclear “peroxisome proliferator-activated receptor gamma” (PPARγ), cuja função na indução de imposex foi confirmada experimentalmente in vivo após injecção dos animais com Rosiglitazone, um conhecido ligando de PPARγ em vertebrados. De uma forma geral, os resultados obtidos mostram que o fenómeno imposex é um mecanismo complexo, que possivelmente envolve a cascata de sinalização envolvendo o receptor retinoid X (RXR):PPARγ “heterodimer” que, até à data não foi descrito em invertebrados. Adicionalmente, os resultados obtidos apontam para alguma conservação de mecanismos de acção envolvidos na disrupção endócrina em invertebrados e vertebrados. Finalmente, a informação molecular produzida e as ferramentas moleculares desenvolvidas contribuem de forma significativa para um melhor conhecimento do fenómeno imposex e constituem importantes recursos para a continuação da investigação deste fenómeno e, adicionalmente, poderão vir a ser aplicadas no estudo de outras respostas a alterações ambientais usando N. lapillus como organismo modelo. Neste sentido, N. lapillus foi também utilizada para explorar a adaptação na morfologia da concha em resposta a alterações naturais induzidas por acção das ondas e pelo risco de predação por caranguejos. O contributo da componente genética, plástica e da sua interacção para a expressão fenotípica é crucial para compreender a evolução de caracteres adaptativos a ambientes heterogéneos. A contribuição destes factores na morfologia da concha de N. lapillus foi explorada recorrendo a transplantes recíprocos e experiências laboratoriais em ambiente comum (com e sem influência de predação) e complementada com análises genéticas, utilizando juvenis provenientes de locais representativos de costas expostas e abrigadas da acção das ondas. As populações estudadas são diferentes geneticamente mas possuem o mesmo cariótipo. Adicionalmente, análises morfométricas revelaram plasticidade da morfologia da concha em ambas as direcções dos transplantes recíprocos e também a retenção parcial, em ambiente comum, da forma da concha nos indivíduos da F2, indicando uma correlação positiva (co-gradiente) entre heritabilidade e plasticidade. A presença de estímulos de predação por caranguejos estimulou a produção de conchas com labros mais grossos, de forma mais evidente em animais recolhidos de costas expostas e também provocou alterações na forma da concha em animais desta proveniência. Estes dados sugerem contra-gradiente em alterações provocadas por predação na morfologia da concha, na produção de labros mais grossos e em níveis de crescimento. O estudo das interacções gene-ambiente descritas acima demonstram a actual possibilidade de produzir recursos e conhecimento genómico numa espécie bem caracterizada ecologicamente mas com limitada informação genómica. Estes recursos permitem um maior conhecimento biológico desta espécie e abrirão novas oportunidades de investigação, que até aqui seriam impossíveis de abordar.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

The rapid evolution and proliferation of a world-wide computerized network, the Internet, resulted in an overwhelming and constantly growing amount of publicly available data and information, a fact that was also verified in biomedicine. However, the lack of structure of textual data inhibits its direct processing by computational solutions. Information extraction is the task of text mining that intends to automatically collect information from unstructured text data sources. The goal of the work described in this thesis was to build innovative solutions for biomedical information extraction from scientific literature, through the development of simple software artifacts for developers and biocurators, delivering more accurate, usable and faster results. We started by tackling named entity recognition - a crucial initial task - with the development of Gimli, a machine-learning-based solution that follows an incremental approach to optimize extracted linguistic characteristics for each concept type. Afterwards, Totum was built to harmonize concept names provided by heterogeneous systems, delivering a robust solution with improved performance results. Such approach takes advantage of heterogenous corpora to deliver cross-corpus harmonization that is not constrained to specific characteristics. Since previous solutions do not provide links to knowledge bases, Neji was built to streamline the development of complex and custom solutions for biomedical concept name recognition and normalization. This was achieved through a modular and flexible framework focused on speed and performance, integrating a large amount of processing modules optimized for the biomedical domain. To offer on-demand heterogenous biomedical concept identification, we developed BeCAS, a web application, service and widget. We also tackled relation mining by developing TrigNER, a machine-learning-based solution for biomedical event trigger recognition, which applies an automatic algorithm to obtain the best linguistic features and model parameters for each event type. Finally, in order to assist biocurators, Egas was developed to support rapid, interactive and real-time collaborative curation of biomedical documents, through manual and automatic in-line annotation of concepts and relations. Overall, the research work presented in this thesis contributed to a more accurate update of current biomedical knowledge bases, towards improved hypothesis generation and knowledge discovery.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Dissertação de mestrado, Engenharia Informática, Faculdade de Ciências e Tecnologia, Universidade do Algarve, 2015

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Tese de mestrado em Bioinformática e Biologia Computacional (Bioinformática), apresentada à Universidade de Lisboa, através da Faculdade de Ciências, 2014

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Tese de doutoramento, Ciências Biomédicas (Biologia Celular e Molecular), Universidade de Lisboa, Faculdade de Medicina, 2015

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Cost-effective semantic description and annotation of shared knowledge resources has always been of great importance for digital libraries and large scale information systems in general. With the emergence of the Social Web and Web 2.0 technologies, a more effective semantic description and annotation, e.g., folksonomies, of digital library contents is envisioned to take place in collaborative and personalised environments. However, there is a lack of foundation and mathematical rigour for coping with contextualised management and retrieval of semantic annotations throughout their evolution as well as diversity in users and user communities. In this paper, we propose an ontological foundation for semantic annotations of digital libraries in terms of flexonomies. The proposed theoretical model relies on a high dimensional space with algebraic operators for contextualised access of semantic tags and annotations. The set of the proposed algebraic operators, however, is an adaptation of the set theoretic operators selection, projection, difference, intersection, union in database theory. To this extent, the proposed model is meant to lay the ontological foundation for a Digital Library 2.0 project in terms of geometric spaces rather than logic (description) based formalisms as a more efficient and scalable solution to the semantic annotation problem in large scale.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Fusobacterium necrophorum is a causative agent of persistent sore throat syndrome, tonsillar abscesses and Lemierre’s syndrome (LS) in humans. LS is characterised by thrombophlebitis of the jugular vein and bacteraemia. It is a Gram-negative, anaerobic bacterium which to date has no available reference genome. Draft genomes suggest it to be a single circular chromosome of approximately 2.2Mb. A reference strain of each of the two F. necrophorum subspecies and a clinical isolate from a LS patient were sequenced on a Roche 454 GS-FLX+. Sequence data was assembled using Roche GS Assembler and the resulting contigs annotated using xBASE, Pfam and BLAST. The annotation data was mined for gene products associated with virulence revealing a leukotoxin, haemolysin, filamentous haemagglutinnin, adhesin, hemin receptor, phage genes, CRISPR-associated proteins, ecotin and a putative type V secretion system. Data will be presented on comparative genomics of the three strains, with a focus on putative virulence genes. Tools such as Artemis Comparison Tool and ClustalO were used for sequence alignments and PhyML was used to generate phylogenetic trees. Conserved motifs associated with virulence were also located. Understanding variations at the genomic level may help to explain the increased virulence of some F. necrophorum strains.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

We have developed an in-house pipeline for the processing and analyses of sequence data generated during Illumina technology-based metagenomic studies of the human gut microbiota. Each component of the pipeline has been selected following comparative analysis of available tools; however, the modular nature of software facilitates replacement of any individual component with an alternative should a better tool become available in due course. The pipeline consists of quality analysis and trimming followed by taxonomic filtering of sequence data allowing reads associated with samples to be binned according to whether they represent human, prokaryotic (bacterial/archaeal), viral, parasite, fungal or plant DNA. Viral, parasite, fungal and plant DNA can be assigned to species level on a presence/absence basis, allowing – for example – identification of dietary intake of plant-based foodstuffs and their derivatives. Prokaryotic DNA is subject to taxonomic and functional analyses, with assignment to taxonomic hierarchies (kingdom, class, order, family, genus, species, strain/subspecies) and abundance determination. After de novo assembly of sequence reads, genes within samples are predicted and used to build a non-redundant catalogue of genes. From this catalogue, per-sample gene abundance can be determined after normalization of data based on gene length. Functional annotation of genes is achieved through mapping of gene clusters against KEGG proteins, and InterProScan. The pipeline is undergoing validation using the human faecal metagenomic data of Qin et al. (2014, Nature 513, 59–64). Outputs from the pipeline allow development of tools for the integration of metagenomic and metabolomic data, moving metagenomic studies beyond determination of gene richness and representation towards microbial-metabolite mapping. There is scope to improve the outputs from viral, parasite, fungal and plant DNA analyses, depending on the depth of sequencing associated with samples. The pipeline can easily be adapted for the analyses of environmental and non-human animal samples, and for use with data generated via non-Illumina sequencing platforms.