980 resultados para genome project


Relevância:

60.00% 60.00%

Publicador:

Resumo:

SNARE proteins have been classified as vesicular (v)- and target (t)-SNAREs and play a central role in the various membrane interactions in eukaryotic cells. Based on the Paramecium genome project, we have identified a multigene family of at least 26 members encoding the t-SNARE syntaxin (PtSyx) that can be grouped into 15 subfamilies. Paramecium syntaxins match the classical build-up of syntaxins, being 'tail-anchored' membrane proteins with an N-terminal cytoplasmic domain and a membrane-bound single C-terminal hydrophobic domain. The membrane anchor is preceded by a conserved SNARE domain of approximately 60 amino acids that is supposed to participate in SNARE complex assembly. In a phylogenetic analysis, most of the Paramecium syntaxin genes were found to cluster in groups together with those from other organisms in a pathway-specific manner, allowing an assignment to different compartments in a homology-dependent way. However, some of them seem to have no counterparts in metazoans. In another approach, we fused one representative member of each of the syntaxin isoforms to green fluorescent protein and assessed the in vivo localization, which was further supported by immunolocalization of some syntaxins. This allowed us to assign syntaxins to all important trafficking pathways in Paramecium.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Background Arsenic is one of the most ubiquitous toxins and endangers the health of tens of millions of humans worldwide. It is a mainly a water-borne contaminant. Inorganic trivalent arsenic (AsIII) is one of the major species that exists environmentally. The transport of AsIII has been studied in microbes, plants and mammals. Members of the aquaglyceroporin family have been shown to actively conduct AsIII and its organic metabolite, monomethylarsenite (MAsIII). However, the transport of AsIII and MAsIII in in any fish species has not been characterized. Results In this study, five members of the aquaglyceroporin family from zebrafish (Danio rerio) were cloned, and their ability to transport water, glycerol, and trivalent arsenicals (AsIII and MAsIII) and antimonite (SbIII) was investigated. Genes for at least seven aquaglyceroporins have been annotated in the zebrafish genome project. Here, five genes which are close homologues to human AQP3, AQP9 and AQP10 were cloned from a zebrafish cDNA preparation. These genes were namedaqp3, aqp3l, aqp9a, aqp9b and aqp10 according to their similarities to the corresponding human AQPs. Expression of aqp9a, aqp9b, aqp3, aqp3l and aqp10 in multiple zebrafish organs were examined by RT-PCR. Our results demonstrated that these aquaglyceroporins exhibited different tissue expression. They are all detected in more than one tissue. The ability of these five aquaglyceroporins to transport water, glycerol and the metalloids arsenic and antimony was examined following expression in oocytes from Xenopus leavis. Each of these channels showed substantial glycerol transport at equivalent rates. These aquaglyceroporins also facilitate uptake of inorganic AsIII, MAsIII and SbIII. Arsenic accumulation in fish larvae and in different tissues from adult zebrafish was studied following short-term arsenic exposure. The results showed that liver is the major organ of arsenic accumulation; other tissues such as gill, eye, heart, intestine muscle and skin also exhibited significant ability to accumulate arsenic. The zebrafish larvae also accumulate considerable amounts of arsenic. Conclusion This is the first molecular identification of fish arsenite transport systems and we propose that the extensive expression of the fish aquaglyceroporins and their ability to transport metalloids suggests that aquaglyceroporins are the major pathways for arsenic accumulation in a variety of zebrafish tissues. Uptake is one important step of arsenic metabolism. Our results will contribute to a new understanding of aquatic arsenic metabolism and will support the use of zebrafish as a new model system to study arsenic associated human diseases.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Email exchange in 2013 between Kathryn Maxson (Duke) and Kris Wetterstrand (NHGRI), regarding country funding and other data for the HGP sequencing centers. Also includes the email request for such information, from NHGRI to the centers, in 2000, and the aggregate data collected.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Jean Weissenbach, telephone interview by Kathryn Maxson and Robert Cook-Deegan, conducted from Durham, NC 09 February 2012. Jean Weissenbach, a leader in French genetic mapping, directed the French national sequencing center, Généthon, during the HGP and was instrumental in helping to build agreement to the Bermuda Principles in France.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Mark Guyer and Jane Peterson, in-person interview with Kathryn Maxson and Robert Cook-Deegan, conducted in Rockville, MD (NIH campus), 18 August 2011. Mark Guyer and Jane Peterson were grants program officers at the NIH during the HGP, and were some of the longest-standing employees in the HGP administrative structure. Both witnessed the transformation of the Office of Genome Research into the National Center for Human Genome Research and, finally, the National Human Genome Research Institute. They were close participants in the history of the Bermuda Principles within the NIH.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Autism Spectrum Disorder (ASD) is a frequent and complex neurodevelopmental disorder, characterized by impairments in social communication and repetitive behaviors and with a high male to female ratio: ~4:1. Genetic factors, including rare Copy Number Variants (CNVs), have a substantial impact in ASD risk1, and are associated with specific phenotypic manifestations2. Recent studies reported that rare inherited CNVs are enriched in mothers of ASD children compared with mothers of controls and are preferentially transmitted from mothers to ASD children suggesting a sex bias in CNV transmission; further, the imbalanced transmission of small pathogenic CNVs from unaffected mothers to their sons with ASD has been described3,4. An increased prevalence of autism-like personality traits is found in unaffected relatives of ASD children, suggesting a genetic liability of a broader autism phenotype (BAP)5. The BAP in parents of autistic children can be assessed by the Social Responsiveness Scale (SRS)6 and Broad Autism Phenotype Questionnaire (BAPQ)7 reports. The SRS is 65-item questionnaire to identify sub-clinical social impairments and interpersonal behaviour in individuals. The BAPQ is a 36-item questionnaire measures social aloofness, rigid personality, and pragmatic language deficits in both parents and children.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

The Pine Wood Nematode (PWN) Bursaphelenchus xylophilus is a severe forest pathogen in countries where it has been introduced and is considered a worldwide quarantine organism. In this study, protein markers for differentiating populations of this nematode were identified by studying differences among four selected Iberian and one American population. These populations were compared by quantitative proteomics (iTRAQ). From a total of 2860 proteins identified using the public database from the B. xylophilus genome project, 216 were unambiguous and significantly differentially regulated in the studied populations. Comparisons of their pairwise ratio were statistically treated and supported in order to convert them into discrete character states, suggesting that 141 proteins were not informative as population specific markers. Application of the Character Compatibility methodology on the remaining 75 proteins (belonging to families with different biological functions) excludes 27 which are incompatible among them. Considering only the compatible proteins, the method selects a subset of 30 specific unique protein markers which allowed the compared classification of the Iberian isolates. This approach makes it easier search for diagnostic tools and phylogenetic inference within species and populations of a pathogen exhibiting a high level of genetic diversity.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

We report the generation and analysis of functional data from multiple, diverse experiments performed on a targeted 1% of the human genome as part of the pilot phase of the ENCODE Project. These data have been further integrated and augmented by a number of evolutionary and computational analyses. Together, our results advance the collective knowledge about human genome function in several major areas. First, our studies provide convincing evidence that the genome is pervasively transcribed, such that the majority of its bases can be found in primary transcripts, including non-protein-coding transcripts, and those that extensively overlap one another. Second, systematic examination of transcriptional regulation has yielded new understanding about transcription start sites, including their relationship to specific regulatory sequences and features of chromatin accessibility and histone modification. Third, a more sophisticated view of chromatin structure has emerged, including its inter-relationship with DNA replication and transcriptional regulation. Finally, integration of these new sources of information, in particular with respect to mammalian evolution based on inter- and intra-species sequence comparisons, has yielded new mechanistic and evolutionary insights concerning the functional landscape of the human genome. Together, these studies are defining a path for pursuit of a more comprehensive characterization of human genome function.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

With the widespread availability of high-throughput sequencing technologies, sequencing projects have become pervasive in the molecular life sciences. The huge bulk of data generated daily must be analyzed further by biologists with skills in bioinformatics and by "embedded bioinformaticians," i.e., bioinformaticians integrated in wet lab research groups. Thus, students interested in molecular life sciences must be trained in the main steps of genomics: sequencing, assembly, annotation and analysis. To reach that goal, a practical course has been set up for master students at the University of Lausanne: the "Sequence a genome" class. At the beginning of the academic year, a few bacterial species whose genome is unknown are provided to the students, who sequence and assemble the genome(s) and perform manual annotation. Here, we report the progress of the first class from September 2010 to June 2011 and the results obtained by seven master students who specifically assembled and annotated the genome of Estrella lausannensis, an obligate intracellular bacterium related to Chlamydia. The draft genome of Estrella is composed of 29 scaffolds encompassing 2,819,825 bp that encode for 2233 putative proteins. Estrella also possesses a 9136 bp plasmid that encodes for 14 genes, among which we found an integrase and a toxin/antitoxin module. Like all other members of the Chlamydiales order, Estrella possesses a highly conserved type III secretion system, considered as a key virulence factor. The annotation of the Estrella genome also allowed the characterization of the metabolic abilities of this strictly intracellular bacterium. Altogether, the students provided the scientific community with the Estrella genome sequence and a preliminary understanding of the biology of this recently-discovered bacterial genus, while learning to use cutting-edge technologies for sequencing and to perform bioinformatics analyses.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

Background: We present the results of EGASP, a community experiment to assess the state-ofthe-art in genome annotation within the ENCODE regions, which span 1% of the human genomesequence. The experiment had two major goals: the assessment of the accuracy of computationalmethods to predict protein coding genes; and the overall assessment of the completeness of thecurrent human genome annotations as represented in the ENCODE regions. For thecomputational prediction assessment, eighteen groups contributed gene predictions. Weevaluated these submissions against each other based on a ‘reference set’ of annotationsgenerated as part of the GENCODE project. These annotations were not available to theprediction groups prior to the submission deadline, so that their predictions were blind and anexternal advisory committee could perform a fair assessment.Results: The best methods had at least one gene transcript correctly predicted for close to 70%of the annotated genes. Nevertheless, the multiple transcript accuracy, taking into accountalternative splicing, reached only approximately 40% to 50% accuracy. At the coding nucleotidelevel, the best programs reached an accuracy of 90% in both sensitivity and specificity. Programsrelying on mRNA and protein sequences were the most accurate in reproducing the manuallycurated annotations. Experimental validation shows that only a very small percentage (3.2%) of the selected 221 computationally predicted exons outside of the existing annotation could beverified.Conclusions: This is the first such experiment in human DNA, and we have followed thestandards established in a similar experiment, GASP1, in Drosophila melanogaster. We believe theresults presented here contribute to the value of ongoing large-scale annotation projects and shouldguide further experimental methods when being scaled up to the entire human genome sequence.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

The GENCODE Consortium aims to identify all gene features in the human genome using a combination of computational analysis, manual annotation, and experimental validation. Since the first public release of this annotation data set, few new protein-coding loci have been added, yet the number of alternative splicing transcripts annotated has steadily increased. The GENCODE 7 release contains 20,687 protein-coding and 9640 long noncoding RNA loci and has 33,977 coding transcripts not represented in UCSC genes and RefSeq. It also has the most comprehensive annotation of long noncoding RNA (lncRNA) loci publicly available with the predominant transcript form consisting of two exons. We have examined the completeness of the transcript annotation and found that 35% of transcriptional start sites are supported by CAGE clusters and 62% of protein-coding genes have annotated polyA sites. Over one-third of GENCODE protein-coding genes are supported by peptide hits derived from mass spectrometry spectra submitted to Peptide Atlas. New models derived from the Illumina Body Map 2.0 RNA-seq data identify 3689 new loci not currently in GENCODE, of which 3127 consist of two exon models indicating that they are possibly unannotated long noncoding loci. GENCODE 7 is publicly available from gencodegenes.org and via the Ensembl and UCSC Genome Browsers.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

BACKGROUND: We present the results of EGASP, a community experiment to assess the state-of-the-art in genome annotation within the ENCODE regions, which span 1% of the human genome sequence. The experiment had two major goals: the assessment of the accuracy of computational methods to predict protein coding genes; and the overall assessment of the completeness of the current human genome annotations as represented in the ENCODE regions. For the computational prediction assessment, eighteen groups contributed gene predictions. We evaluated these submissions against each other based on a 'reference set' of annotations generated as part of the GENCODE project. These annotations were not available to the prediction groups prior to the submission deadline, so that their predictions were blind and an external advisory committee could perform a fair assessment. RESULTS: The best methods had at least one gene transcript correctly predicted for close to 70% of the annotated genes. Nevertheless, the multiple transcript accuracy, taking into account alternative splicing, reached only approximately 40% to 50% accuracy. At the coding nucleotide level, the best programs reached an accuracy of 90% in both sensitivity and specificity. Programs relying on mRNA and protein sequences were the most accurate in reproducing the manually curated annotations. Experimental validation shows that only a very small percentage (3.2%) of the selected 221 computationally predicted exons outside of the existing annotation could be verified. CONCLUSION: This is the first such experiment in human DNA, and we have followed the standards established in a similar experiment, GASP1, in Drosophila melanogaster. We believe the results presented here contribute to the value of ongoing large-scale annotation projects and should guide further experimental methods when being scaled up to the entire human genome sequence.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

Background: Non-long terminal repeat (non-LTR) retrotransposons have contributed to shaping the structure and function of genomes. In silico and experimental approaches have been used to identify the non-LTR elements of the urochordate Ciona intestinalis. Knowledge of the types and abundance of non-LTR elements in urochordates is a key step in understanding their contribution to the structure and function of vertebrate genomes. Results: Consensus elements phylogenetically related to the I, LINE1, LINE2, LOA and R2 elements of the 14 eukaryotic non-LTR clades are described from C. intestinalis. The ascidian elements showed conservation of both the reverse transcriptase coding sequence and the overall structural organization seen in each clade. The apurinic/apyrimidinic endonuclease and nucleic-acid-binding domains encoded upstream of the reverse transcriptase, and the RNase H and the restriction enzyme-like endonuclease motifs encoded downstream of the reverse transcriptase were identified in the corresponding Ciona families. Conclusions: The genome of C. intestinalis harbors representatives of at least five clades of non-LTR retrotransposons. The copy number per haploid genome of each element is low, less than 100, far below the values reported for vertebrate counterparts but within the range for protostomes. Genomic and sequence analysis shows that the ascidian non-LTR elements are unmethylated and flanked by genomic segments with a gene density lower than average for the genome. The analysis provides valuable data for understanding the evolution of early chordate genomes and enlarges the view on the distribution of the non-LTR retrotransposons in eukaryotes.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

Genomics is expanding the horizons of epidemiology, providing a new dimension for classical epidemiological studies and inspiring the development of large-scale multicenter studies with the statistical power necessary for the assessment of gene-gene and gene-environment interactions in cancer etiology and prognosis. This paper describes the methodology of the Clinical Genome of Cancer Project in São Paulo, Brazil (CGCP), which includes patients with nine types of tumors and controls. Three major epidemiological designs were used to reach specific objectives: cross-sectional studies to examine gene expression, case-control studies to evaluate etiological factors, and follow-up studies to analyze genetic profiles in prognosis. The clinical groups included patients' data in the electronic database through the Internet. Two approaches were used for data quality control: continuous data evaluation and data entry consistency. A total of 1749 cases and 1509 controls were entered into the CGCP database from the first trimester of 2002 to the end of 2004. Continuous evaluation showed that, for all tumors taken together, only 0.5% of the general form fields still included potential inconsistencies by the end of 2004. Regarding data entry consistency, the highest percentage of errors (11.8%) was observed for the follow-up form, followed by 6.7% for the clinical form, 4.0% for the general form, and only 1.1% for the pathology form. Good data quality is required for their transformation into useful information for clinical application and for preventive measures. The use of the Internet for communication among researchers and for data entry is perhaps the most innovative feature of the CGCP. The monitoring of patients' data guaranteed their quality.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq)