964 resultados para Comparative genomics


Relevância:

60.00% 60.00%

Publicador:

Resumo:

Background The turbot (Scophthalmus maximus) is a highly appreciated European aquaculture species. Growth related traits constitute the main goal of the ongoing genetic breeding programs of this species. The recent construction of a consensus linkage map in this species has allowed the selection of a panel of 100 homogeneously distributed markers covering the 26 linkage groups (LG) suitable for QTL search. In this study we addressed the detection of QTL with effect on body weight, length and Fulton's condition factor. Results Eight families from two genetic breeding programs comprising 814 individuals were used to search for growth related QTL using the panel of microsatellites available for QTL screening. Two different approaches, maximum likelihood and regression interval mapping, were used in order to search for QTL. Up to eleven significant QTL were detected with both methods in at least one family: four for weight on LGs 5, 14, 15 and 16; five for length on LGs 5, 6, 12, 14 and 15; and two for Fulton's condition factor on LGs 3 and 16. In these LGs an association analysis was performed to ascertain the microsatellite marker with the highest apparent effect on the trait, in order to test the possibility of using them for marker assisted selection. Conclusions The use of regression interval mapping and maximum likelihood methods for QTL detection provided consistent results in many cases, although the high variation observed for traits mean among families made it difficult to evaluate QTL effects. Finer mapping of detected QTL, looking for tightly linked markers to the causative mutation, and comparative genomics are suggested to deepen in the analysis of QTL in turbot so they can be applied in marker assisted selection programs.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

The pufferfish Fugu rubripes has a genome ≈7.5 times smaller than that of mammals but with a similar number of genes. Although conserved synteny has been demonstrated between pufferfish and mammals across some regions of the genome, there is some controversy as to what extent Fugu will be a useful model for the human genome, e.g., [Gilley, J., Armes, N. & Fried, M. (1997) Nature (London) 385, 305–306]. We report extensive conservation of synteny between a 1.5-Mb region of human chromosome 11 and <100 kb of the Fugu genome in three overlapping cosmids. Our findings support the idea that the majority of DNA in the region of human chromosome 11p13 is intergenic. Comparative analysis of three unrelated genes with quite different roles, WT1, RCN1, and PAX6, has revealed differences in their structural evolution. Whereas the human WT1 gene can generate 16 protein isoforms via a combination of alternative splicing, RNA editing, and alternative start site usage, our data predict that Fugu WT1 is capable of generating only two isoforms. This raises the question of the extent to which the evolution of WT1 isoforms is related to the evolution of the mammalian genitourinary system. In addition, this region of the Fugu genome shows a much greater overall compaction than usual but with significant noncoding homology observed at the PAX6 locus, implying that comparative genomics has identified regulatory elements associated with this gene.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

The complete DNA sequence of Pseudomonas aeruginosa provides an opportunity to apply functional genomics to a major human pathogen. A comparative genomics approach combined with genetic footprinting was used as a strategy to identify genes required for viability in P. aeruginosa. Use of a highly efficient in vivo mariner transposition system in P. aeruginosa facilitated the analysis of candidate genes of this class. We have developed a rapid and efficient allelic exchange system by using the I-SceI homing endonuclease in conjunction with in vitro mariner mutagenesis to generate mutants within targeted regions of the P. aeruginosa chromosome for genetic footprinting analyses. This technique for generating transposon insertion mutants should be widely applicable to other organisms that are not naturally transformable or may lack well developed in vivo transposition systems. We tested this system with three genes in P. aeruginosa that have putative essential homologs in Haemophilus influenzae. We show that one of three H. influenzae essential gene homologs is needed for growth in P. aeruginosa, validating the practicality of this comparative genomics strategy to identify essential genes in P. aeruginosa.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Thousands of genes have been painstakingly identified and characterized a few genes at a time. Many thousands more are being predicted by large scale cDNA and genomic sequencing projects, with levels of evidence ranging from supporting mRNA sequence and comparative genomics to computing ab initio models. This, coupled with the burgeoning scientific literature, makes it critical to have a comprehensive directory for genes and reference sequences for key genomes. The NCBI provides two resources, LocusLink and RefSeq, to meet these needs. LocusLink organizes information around genes to generate a central hub for accessing gene-specific information for fruit fly, human, mouse, rat and zebrafish. RefSeq provides reference sequence standards for genomes, transcripts and proteins; human, mouse and rat mRNA RefSeqs, and their corresponding proteins, are discussed here. Together, RefSeq and LocusLink provide a non-redundant view of genes and other loci to support research on genes and gene families, variation, gene expression and genome annotation. Additional information about LocusLink and RefSeq is available at http://www.ncbi.nlm.nih.gov/LocusLink/.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Olfactory receptor (OR) genes represent ≈1% of genomic coding sequence in mammals, and these genes are clustered on multiple chromosomes in both the mouse and human genomes. We have taken a comparative genomics approach to identify features that may be involved in the dynamic evolution of this gene family and in the transcriptional control that results in a single OR gene expressed per olfactory neuron. We sequenced ≈350 kb of the murine P2 OR cluster and used synteny, gene linkage, and phylogenetic analysis to identify and sequence ≈111 kb of an orthologous cluster in the human genome. In total, 18 mouse and 8 human OR genes were identified, including 7 orthologs that appear to be functional in both species. Noncoding homology is evident between orthologs and generally is confined within the transcriptional unit. We find no evidence for common regulatory features shared among paralogs, and promoter regions generally do not contain strong promoter motifs. We discuss these observations, as well as OR clustering, in the context of evolutionary expansion and transcriptional regulation of OR repertoires.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

We generated draft genome sequences for two cold-adapted Archaea, Methanogenium frigidum and Methanococcoides burtonii, to identify genotypic characteristics that distinguish them from Archaea with a higher optimal growth temperature (OGT). Comparative genomics revealed trends in amino acid and tRNA composition, and structural features of proteins. Proteins from the cold-adapted Archaea are characterized by a higher content of noncharged polar amino acids, particularly Gin and Thr and a lower content of hydrophobic amino acids, particularly Leu. Sequence data from nine methanogen genomes (OGT 15degrees-98degreesC) were used to generate IIII modeled protein structures. Analysis of the models from the cold-adapted Archaea showed a strong tendency in the solvent-accessible area for more Gin, Thr, and hydrophobic residues and fewer charged residues. A cold shock domain (CSD) protein (CspA homolog) was identified in M. frigidum, two hypothetical proteins with CSD-folds in M. burtonii, and a unique winged helix DNA-binding domain protein in M. burtonii. This suggests that these types of nucleic acid binding proteins have a critical role in cold-adapted Archaea. Structural analysis of tRNA sequences from the Archaea indicated that GC content is the major factor influencing tRNA stability in hyperthermophiles, but not in the psychrophiles, mesophiles or moderate thermophiles. Below an OGT of 60degreesC, the GC content in tRNA was largely unchanged, indicating that any requirement for flexibility of tRNA in psychrophiles is mediated by other means. This is the first time that comparisons have been performed with genome data from Archaea spanning the growth temperature extremes. from psychrophiles to hyperthermophiles

Relevância:

60.00% 60.00%

Publicador:

Resumo:

With the sequencing and annotation of genomes and transcriptomes of several eukaryotes, the importance of noncoding RNA (ncRNA)-RNA molecules that are not translated to protein products-has become more evident. A subclass of ncRNA transcripts are encoded by highly regulated, multi-exon, transcriptional units, are processed like typical protein-coding mRNAs and are increasingly implicated in regulation of many cellular functions in eukaryotes. This study describes the identification of candidate functional ncRNAs from among the RIKEN mouse full-length cDNA collection, which contains 60,770 sequences, by using a systematic computational filtering approach. We initially searched for previously reported ncRNAs and found nine murine ncRNAs and homologs of several previously described nonmouse ncRNAs. Through our computational approach to filter artifact-free clones that lack protein coding potential, we extracted 4280 transcripts as the largest-candidate set. Many clones in the set had EST hits, potential CpG islands surrounding the transcription start sites, and homologies with the human genome. This implies that many candidates are indeed transcribed in a regulated manner. Our results demonstrate that ncRNAs are a major functional subclass of processed transcripts in mammals.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Do non-coding RNAs that are derived from the introns and exons of protein-coding and non-protein-coding genes represent a fundamental advance in the genetic operating system of higher organisms? Recent evidence from comparative genomics and molecular genetics indicates that this might be the case. If so, there will be profound consequences for our understanding of the genetics of these organisms, and in particular how the trajectories of differentiation and development and the differences among individuals and species are genomically programmed. But how might this hypothesis be tested?

Relevância:

60.00% 60.00%

Publicador:

Resumo:

The phylum Planctomycetes of the domain Bacteria consists of budding, peptidoglycan-less organisms important for understanding the origins of complex cell organization. Their significance for cell biology lies in their possession of intracellular membrane compartmentation. All planctomycetes share a unique cell plan, in which the cell cytoplasm is divided into compartments by one or more membranes, including a major cell compartment containing the nucleoid. Of special significance is Gemmata obscuriglobus, in which the nucleoid is enveloped in two membranes to form a nuclear body that is analogous to the structure of a eukaryotic nucleus. Planctomycete compartmentation may have functional physiological roles, as in the case of anaerobic ammonium-oxidizing anammox planctomycetes, in which the anammoxosome harbors specialized enzymes and is wrapped in an envelope possessing unique ladderane lipids. Organisms in phyla other than the phylum Planctomycetes may possess compartmentation similar to that of some planctomycetes, as in the case of members of the phylum Poribacteria from marine sponges.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Cross-species comparative genomics is a powerful strategy for identifying functional regulatory elements within noncoding DNA. In this paper, comparative analysis of human and mouse intronic sequences in the breast cancer susceptibility gene (BRCA1) revealed two evolutionarily conserved noncoding sequences (CNS) in intron 2, 5 kb downstream of the core BRCA1 promoter. The functionality of these elements was examined using homologous-recombination-based mutagenesis of reporter gene-tagged cosmids incorporating these regions and flanking sequences from the BRCA1 locus. This showed that CNS-1 and CNS-2 have differential transcriptional regulatory activity in epithelial cell lines. Mutation of CNS-1 significantly reduced reporter gene expression to 30% of control levels. Conversely mutation of CNS-2 increased expression to 200% of control levels. Regulation is at the level of transcription and shows promoter specificity. Both elements also specifically bind nuclear proteins in vitro. These studies demonstrate that the combination of comparative genomics and functional analysis is a successful strategy to identify novel regulatory elements and provide the first direct evidence that conserved noncoding sequences in BRCA1 regulate gene expression. (c) 2005 Elsevier Inc. All rights reserved.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

The function of the prion protein gene (PRNP) and its normal product PrPC is elusive. We used comparative genomics as a strategy to understand the normal function of PRNP. As the reliability of comparisons increases with the number of species and increased evolutionary distance, we isolated and sequenced a 66.5 kb BAC containing the PRNP gene from a distantly related mammal, the model Australian marsupial Macropus eugenii (tammar wallaby). Marsupials are separated from eutherians such as human and mouse by roughly 180 million years of independent evolution. We found that tammar PRNP, like human PRNP, has two exons. Prion proteins encoded by the tammar wallaby and a distantly related marsupial, Monodelphis domestica (Brazilian opossum) PRNP contain proximal PrP repeats with a distinct, marsupial-specific composition and a variable number. Comparisons of tammar wallaby PRNP with PRNPs from human, mouse, bovine and ovine allowed us to identify non-coding gene regions conserved across the marsupial-eutherian evolutionary distance, which are candidates for regulatory regions. In the PRNP 3' UTR we found a conserved signal for nuclear-specific polyadenylation and the putative cytoplasmic polyadenylation element (CPE), indicating that post-transcriptional control of PRNP mRNA activity is important. Phylogenetic footprinting revealed conserved potential binding sites for the MZF-1 transcription factor in both upstream promoter and intron/intron 1, and for the MEF2, MyTI, Oct-1 and NFAT transcription factors in the intron(s). The presence of a conserved NFAT-binding site and CPE indicates involvement of PrPC in signal transduction and synaptic plasticity. (c) 2004 Elsevier B.V. All rights reserved.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

The southern cattle tick, Boophilus microplus (Canestrini), causes annual economic losses in the hundreds of millions of dollars to cattle producers throughout the world, and ranks as the most economically important tick from a global perspective. Control failures attributable to the development of pesticide resistance have become commonplace, and novel control technologies are needed. The availability of the genome sequence will facilitate the development of these new technologies, and we are proposing sequencing to a 4-6X draft coverage. Many existing biological resources are available to facilitate a genome sequencing project, including several inbred laboratory tick strains, a database of approximate to 45,000 expressed sequence tags compiled into a B. microplus Gene Index, a bacterial artificial chromosome (BAC) library, an established B. microplus cell line, and genomic DNA suitable for library synthesis. Collaborative projects are underway to map BACs and cDNAs to specific chromosomes and to sequence selected BAC clones. When completed, the genome sequences from the cow, B. microphis, and the B. microphis-borne pathogens Babesia bovis and Anaplasma marginale will enhance studies of host-vector-pathogen systems. Genes involved in the regeneration of amputated tick limbs and transitions through developmental stages are largely unknown. Studies of these and other interesting biological questions will be advanced by tick genome sequence data. Comparative genomics offers the prospect of new insight into many, perhaps all, aspects of the biology of ticks and the pathogens they transmit to farm animals and people. The B. microplus genome sequence will fill a major gap in comparative genomics: a sequence from the Metastriata lineage of ticks. The purpose of the article is to synergize interest in and provide rationales for sequencing the genome of B. microplus and for publicizing currently available genomic resources for this tick.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Systems biology is based on computational modelling and simulation of large networks of interacting components. Models may be intended to capture processes, mechanisms, components and interactions at different levels of fidelity. Input data are often large and geographically disperse, and may require the computation to be moved to the data, not vice versa. In addition, complex system-level problems require collaboration across institutions and disciplines. Grid computing can offer robust, scaleable solutions for distributed data, compute and expertise. We illustrate some of the range of computational and data requirements in systems biology with three case studies: one requiring large computation but small data (orthologue mapping in comparative genomics), a second involving complex terabyte data (the Visible Cell project) and a third that is both computationally and data-intensive (simulations at multiple temporal and spatial scales). Authentication, authorisation and audit systems are currently not well scalable and may present bottlenecks for distributed collaboration particularly where outcomes may be commercialised. Challenges remain in providing lightweight standards to facilitate the penetration of robust, scalable grid-type computing into diverse user communities to meet the evolving demands of systems biology.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

To carry out their specific roles in the cell, genes and gene products often work together in groups, forming many relationships among themselves and with other molecules. Such relationships include physical protein-protein interaction relationships, regulatory relationships, metabolic relationships, genetic relationships, and much more. With advances in science and technology, some high throughput technologies have been developed to simultaneously detect tens of thousands of pairwise protein-protein interactions and protein-DNA interactions. However, the data generated by high throughput methods are prone to noise. Furthermore, the technology itself has its limitations, and cannot detect all kinds of relationships between genes and their products. Thus there is a pressing need to investigate all kinds of relationships and their roles in a living system using bioinformatic approaches, and is a central challenge in Computational Biology and Systems Biology. This dissertation focuses on exploring relationships between genes and gene products using bioinformatic approaches. Specifically, we consider problems related to regulatory relationships, protein-protein interactions, and semantic relationships between genes. A regulatory element is an important pattern or "signal", often located in the promoter of a gene, which is used in the process of turning a gene "on" or "off". Predicting regulatory elements is a key step in exploring the regulatory relationships between genes and gene products. In this dissertation, we consider the problem of improving the prediction of regulatory elements by using comparative genomics data. With regard to protein-protein interactions, we have developed bioinformatics techniques to estimate support for the data on these interactions. While protein-protein interactions and regulatory relationships can be detected by high throughput biological techniques, there is another type of relationship called semantic relationship that cannot be detected by a single technique, but can be inferred using multiple sources of biological data. The contributions of this thesis involved the development and application of a set of bioinformatic approaches that address the challenges mentioned above. These included (i) an EM-based algorithm that improves the prediction of regulatory elements using comparative genomics data, (ii) an approach for estimating the support of protein-protein interaction data, with application to functional annotation of genes, (iii) a novel method for inferring functional network of genes, and (iv) techniques for clustering genes using multi-source data.