928 resultados para genomics
Resumo:
The mango industry in Australia is worth in excess of $150 million annually with the Kensington Pride (KP) cultivar capturing 60% of the domestic market. Valued by consumers for desirable taste and colour characteristics, KP has been used extensively as a parent in the Department of Agriculture and Fisheries’ (Queensland, Australia) mango breeding program with over 400 hybrid trees sharing KP as the male parent. In order to gain a better understanding of Australia’s most significant mango variety, Horticulture Innovation Australia had led an international collaboration between the Queensland Department of Agriculture and Fisheries (Australia), the International Crops Research Institute for the Semi-Arid Tropics (ICRISAT, India) and the Beijing Genomics Institute (China) to sequence the KP genome. Preliminary de novo assembly of illumina short read sequence data suggests that the KP genome is highly heterozygous and has an estimated genome size of 407 Mb. As refinements and additional sequence data are added to the assembly, a more complete picture of the mango genome will be elucidated.
Resumo:
While many placental herpesvirus genomes have been fully sequenced, the complete genome of a marsupial herpesvirus has not been described. Here we present the first genome sequence of a metatherian herpesvirus, Macropodid herpesvirus 1 (MaHV-1).
Resumo:
Background The obligate intracellular bacterium Chlamydia pneumoniae is a common respiratory pathogen, which has been found in a range of hosts including humans, marsupials and amphibians. Whole genome comparisons of human C. pneumoniae have previously highlighted a highly conserved nucleotide sequence, with minor but key polymorphisms and additional coding capacity when human and animal strains are compared. Results In this study, we sequenced three Australian human C. pneumoniae strains, two of which were isolated from patients in remote indigenous communities, and compared them to all available C. pneumoniae genomes. Our study demonstrated a phylogenetically distinct human C. pneumoniae clade containing the two indigenous Australian strains, with estimates that the most recent common ancestor of these strains predates the arrival of European settlers to Australia. We describe several polymorphisms characteristic to these strains, some of which are similar in sequence to animal C. pneumoniae strains, as well as evidence to suggest that several recombination events have shaped these distinct strains. Conclusions Our study reveals a greater sequence diversity amongst both human and animal C. pneumoniae strains, and suggests that a wider range of strains may be circulating in the human population than current sampling indicates.
Resumo:
During the past ten years, large-scale transcript analysis using microarrays has become a powerful tool to identify and predict functions for new genes. It allows simultaneous monitoring of the expression of thousands of genes and has become a routinely used tool in laboratories worldwide. Microarray analysis will, together with other functional genomics tools, take us closer to understanding the functions of all genes in genomes of living organisms. Flower development is a genetically regulated process which has mostly been studied in the traditional model species Arabidopsis thaliana, Antirrhinum majus and Petunia hybrida. The molecular mechanisms behind flower development in them are partly applicable in other plant systems. However, not all biological phenomena can be approached with just a few model systems. In order to understand and apply the knowledge to ecologically and economically important plants, other species also need to be studied. Sequencing of 17 000 ESTs from nine different cDNA libraries of the ornamental plant Gerbera hybrida made it possible to construct a cDNA microarray with 9000 probes. The probes of the microarray represent all different ESTs in the database. From the gerbera ESTs 20% were unique to gerbera while 373 were specific to the Asteraceae family of flowering plants. Gerbera has composite inflorescences with three different types of flowers that vary from each other morphologically. The marginal ray flowers are large, often pigmented and female, while the central disc flowers are smaller and more radially symmetrical perfect flowers. Intermediate trans flowers are similar to ray flowers but smaller in size. This feature together with the molecular tools applied to gerbera, make gerbera a unique system in comparison to the common model plants with only a single kind of flowers in their inflorescence. In the first part of this thesis, conditions for gerbera microarray analysis were optimised including experimental design, sample preparation and hybridization, as well as data analysis and verification. Moreover, in the first study, the flower and flower organ-specific genes were identified. After the reliability and reproducibility of the method were confirmed, the microarrays were utilized to investigate transcriptional differences between ray and disc flowers. This study revealed novel information about the morphological development as well as the transcriptional regulation of early stages of development in various flower types of gerbera. The most interesting finding was differential expression of MADS-box genes, suggesting the existence of flower type-specific regulatory complexes in the specification of different types of flowers. The gerbera microarray was further used to profile changes in expression during petal development. Gerbera ray flower petals are large, which makes them an ideal model to study organogenesis. Six different stages were compared and specifically analysed. Expression profiles of genes related to cell structure and growth implied that during stage two, cells divide, a process which is marked by expression of histones, cyclins and tubulins. Stage 4 was found to be a transition stage between cell division and expansion and by stage 6 cells had stopped division and instead underwent expansion. Interestingly, at the last analysed stage, stage 9, when cells did not grow any more, the highest number of upregulated genes was detected. The gerbera microarray is a fully-functioning tool for large-scale studies of flower development and correlation with real-time RT-PCR results show that it is also highly sensitive and reliable. Gene expression data presented here will be a source for gene expression mining or marker gene discovery in the future studies that will be performed in the Gerbera Laboratory. The publicly available data will also serve the plant research community world-wide.
Resumo:
Protein Kinase-Like Non-kinases (PKLNKs), which are closely related to protein kinases, lack the crucial catalytic aspartate in the catalytic loop, and hence cannot function as protein kinase, have been analysed. Using various sensitive sequence analysis methods, we have recognized 82 PKLNKs from four higher eukaryotic organisms, namely, Homo sapiens, Mus musculus, Rattus norvegicus, and Drosophila melanogaster. On the basis of their domain combination and function, PKLNKs have been classified mainly into four categories: (1) Ligand binding PKLNKs, (2) PKLNKs with extracellular protein-protein interaction domain, (3) PKLNKs involved in dimerization, and (4) PKLNKs with cytoplasmic protein-protein interaction module. While members of the first two classes of PKLNKs have transmembrane domain tethered to the PKLNK domain, members of the other two classes of PKLNKs are cytoplasmic in nature. The current classification scheme hopes to provide a convenient framework to classify the PKLNKs from other eukaryotes which would be helpful in deciphering their roles in cellular processes.
Resumo:
Background: The Mycobacterium leprae genome has less than 50% coding capacity and 1,133 pseudogenes. Preliminary evidence suggests that some pseudogenes are expressed. Therefore, defining pseudogene transcriptional and translational potentials of this genome should increase our understanding of their impact on M. leprae physiology. Results: Gene expression analysis identified transcripts from 49% of all M. leprae genes including 57% of all ORFs and 43% of all pseudogenes in the genome. Transcribed pseudogenes were randomly distributed throughout the chromosome. Factors resulting in pseudogene transcription included: 1) co-orientation of transcribed pseudogenes with transcribed ORFs within or exclusive of operon-like structures; 2) the paucity of intrinsic stem-loop transcriptional terminators between transcribed ORFs and downstream pseudogenes; and 3) predicted pseudogene promoters. Mechanisms for translational ``silencing'' of pseudogene transcripts included the lack of both translational start codons and strong Shine-Dalgarno (SD) sequences. Transcribed pseudogenes also contained multiple ``in-frame'' stop codons and high Ka/Ks ratios, compared to that of homologs in M. tuberculosis and ORFs in M. leprae. A pseudogene transcript containing an active promoter, strong SD site, a start codon, but containing two in frame stop codons yielded a protein product when expressed in E. coli. Conclusion: Approximately half of M. leprae's transcriptome consists of inactive gene products consuming energy and resources without potential benefit to M. leprae. Presently it is unclear what additional detrimental affect(s) this large number of inactive mRNAs has on the functional capability of this organism. Translation of these pseudogenes may play an important role in overall energy consumption and resultant pathophysiological characteristics of M. leprae. However, this study also demonstrated that multiple translational ``silencing'' mechanisms are present, reducing additional energy and resource expenditure required for protein production from the vast majority of these transcripts.
Resumo:
Acute anterior uveitis (AAU) involves inflammation of the iris and ciliary body of the eye. It occurs both in isolation and as a complication of ankylosing spondylitis (AS). It is strongly associated with HLA-B*27, but previous studies have suggested that further genetic factors may confer additional risk. We sought to investigate this using the Illumina Exomechip microarray, to compare 1504 cases with AS and AAU, 1805 with AS but no AAU and 21 133 healthy controls. We also used a heterogeneity test to test the differences in effect size between AS with AAU and AS without AAU. In the analysis comparing AS+AAU+ cases versus controls, HLA-B*27 and HLA-A*02:01 were significantly associated with the presence of AAU (P<10−300 and P=6 × 10−8, respectively). Secondary independent association with PSORS1C3 (P=4.7 × 10−5) and TAP2 (P=1.1 × 10−5) were observed in the major histocompatibility complex. There was a new suggestive association with a low-frequency variant at zinc-finger protein 154 in the AS without AAU versus control analysis (zinc-finger protein 154 (ZNF154), P=2.2 × 10−6). Heterogeneity testing showed that rs30187 in ERAP1 has a larger effect on AAU compared with that in AS alone. These findings also suggest that variants in ERAP1 have a differential impact on the risk of AAU when compared with AS, and hence the genetic risk for AAU differs from AS.
Resumo:
Plants are sessile organisms that have evolved a variety of mechanisms to maintain their cellular homeostasis under stressful environmental conditions. Survival of plants under abiotic stress conditions requires specialized group of heat shock protein machinery, belonging to Hsp70:J-protein family. These heat shock proteins are most ubiquitous types of chaperone machineries involved in diverse cellular processes including protein folding, translocation across cell membranes, and protein degradation. They play a crucial role in maintaining the protein homeostasis by reestablishing functional native conformations under environmental stress conditions, thus providing protection to the cell. J-proteins are co-chaperones of Hsp70 machine, which play a critical role by stimulating Hsp70s ATPase activity, thereby stabilizing its interaction with client proteins. Using genome-wide analysis of Arabidopsis thaliana, here we have outlined identification and systematic classification of J-protein co-chaperones which are key regulators of Hsp70s function. In comparison with Saccharomyces cerevisiae model system, a comprehensive domain structural organization, cellular localization, and functional diversity of A. thaliana J-proteins have also been summarized. Electronic supplementary material The online version of this article (doi:10.1007/s10142-009-0132-0) contains supplementary material, which is available to authorized users.
Resumo:
Filamentous fungi of the subphylum Pezizomycotina are well known as protein and secondary metabolite producers. Various industries take advantage of these capabilities. However, the molecular biology of yeasts, i.e. Saccharomycotina and especially that of Saccharomyces cerevisiae, the baker's yeast, is much better known. In an effort to explain fungal phenotypes through their genotypes we have compared protein coding gene contents of Pezizomycotina and Saccharomycotina. Only biomass degradation and secondary metabolism related protein families seem to have expanded recently in Pezizomycotina. Of the protein families clearly diverged between Pezizomycotina and Saccharomycotina, those related to mitochondrial functions emerge as the most prominent. However, the primary metabolism as described in S. cerevisiae is largely conserved in all fungi. Apart from the known secondary metabolism, Pezizomycotina have pathways that could link secondary metabolism to primary metabolism and a wealth of undescribed enzymes. Previous studies of individual Pezizomycotina genomes have shown that regardless of the difference in production efficiency and diversity of secreted proteins, the content of the known secretion machinery genes in Pezizomycotina and Saccharomycotina appears very similar. Genome wide analysis of gene products is therefore needed to better understand the efficient secretion of Pezizomycotina. We have developed methods applicable to transcriptome analysis of non-sequenced organisms. TRAC (Transcriptional profiling with the aid of affinity capture) has been previously developed at VTT for fast, focused transcription analysis. We introduce a version of TRAC that allows more powerful signal amplification and multiplexing. We also present computational optimisations of transcriptome analysis of non-sequenced organism and TRAC analysis in general. Trichoderma reesei is one of the most commonly used Pezizomycotina in the protein production industry. In order to understand its secretion system better and find clues for improvement of its industrial performance, we have analysed its transcriptomic response to protein secretion stress conditions. In comparison to S. cerevisiae, the response of T. reesei appears different, but still impacts on the same cellular functions. We also discovered in T. reesei interesting similarities to mammalian protein secretion stress response. Together these findings highlight targets for more detailed studies.
Resumo:
The extent to which low-frequency (minor allele frequency (MAF) between 1-5%) and rare (MAF = 1%) variants contribute to complex traits and disease in the general population is mainly unknown. Bone mineral density (BMD) is highly heritable, a major predictor of osteoporotic fractures, and has been previously associated with common genetic variants, as well as rare, population-specific, coding variants. Here we identify novel non-coding genetic variants with large effects on BMD (ntotal = 53,236) and fracture (ntotal = 508,253) in individuals of European ancestry from the general population. Associations for BMD were derived from whole-genome sequencing (n = 2,882 from UK10K (ref. 10); a population-based genome sequencing consortium), whole-exome sequencing (n = 3,549), deep imputation of genotyped samples using a combined UK10K/1000 Genomes reference panel (n = 26,534), and de novo replication genotyping (n = 20,271). We identified a low-frequency non-coding variant near a novel locus, EN1, with an effect size fourfold larger than the mean of previously reported common variants for lumbar spine BMD (rs11692564(T), MAF = 1.6%, replication effect size = +0.20 s.d., Pmeta = 2 x 10(-14)), which was also associated with a decreased risk of fracture (odds ratio = 0.85; P = 2 x 10(-11); ncases = 98,742 and ncontrols = 409,511). Using an En1(cre/flox) mouse model, we observed that conditional loss of En1 results in low bone mass, probably as a consequence of high bone turnover. We also identified a novel low-frequency non-coding variant with large effects on BMD near WNT16 (rs148771817(T), MAF = 1.2%, replication effect size = +0.41 s.d., Pmeta = 1 x 10(-11)). In general, there was an excess of association signals arising from deleterious coding and conserved non-coding variants. These findings provide evidence that low-frequency non-coding variants have large effects on BMD and fracture, thereby providing rationale for whole-genome sequencing and improved imputation reference panels to study the genetic architecture of complex traits and disease in the general population.
Resumo:
Background: The members of cupin superfamily exhibit large variations in their sequences, functions, organization of domains, quaternary associations and the nature of bound metal ion, despite having a conserved beta-barrel structural scaffold. Here, an attempt has been made to understand structure-function relationships among the members of this diverse superfamily and identify the principles governing functional diversity. The cupin superfamily also contains proteins for which the structures are available through world-wide structural genomics initiatives but characterized as ``hypothetical''. We have explored the feasibility of obtaining clues to functions of such proteins by means of comparative analysis with cupins of known structure and function. Methodology/Principal Findings: A 3-D structure-based phylogenetic approach was undertaken. Interestingly, a dendrogram generated solely on the basis of structural dissimilarity measure at the level of domain folds was found to cluster functionally similar members. This clustering also reflects an independent evolution of the two domains in bicupins. Close examination of structural superposition of members across various functional clusters reveals structural variations in regions that not only form the active site pocket but are also involved in interaction with another domain in the same polypeptide or in the oligomer. Conclusions/Significance: Structure-based phylogeny of cupins can influence identification of functions of proteins of yet unknown function with cupin fold. This approach can be extended to other proteins with a common fold that show high evolutionary divergence. This approach is expected to have an influence on the function annotation in structural genomics initiatives.
Resumo:
Background: Protein kinases are involved in diverse spectrum of cellular processes. Availability of draft version of the human genomic data in the year 2001 enabled recognition of repertoire of protein kinases. However, over the years the human genomic data is being refined and the current release of human genomic data has helped us to recognize a larger repertoire of over 900 human protein kinases represented mainly by splice variants. Results: Many of these identified protein kinases are alternatively spliced products. Interestingly, some of the human kinase splice variants appear to be significantly diverged in terms of their functional properties as represented by incorporation or absence of one or more domains. Many sets of protein kinase splice variants have substantially different domain organization and in a few sets of splice variants kinase domains belong to different subfamilies of kinases suggesting potential participation in different signal transduction pathways. Conclusions: Addition or deletion of a domain between splice variants of multi-domain kinases appears to be a means of generating differences in the functional features of otherwise similar kinases. It is intriguing that marked sequence diversity within the catalytic regions of some of the splice variant kinases result in kinases belonging to different subfamilies. These human kinase splice variants with different functions might contribute to diversity of eukaryotic cellular signaling.
Resumo:
Richard Lewontin proposed that the ability of a scientific field to create a narrative for public understanding garners it social relevance. This article applies Lewontin's conceptual framework of the functions of science (manipulatory and explanatory) to compare and explain the current differences in perceived societal relevance of genetics/genomics and proteomics. We provide three examples to illustrate the social relevance and strong cultural narrative of genetics/genomics for which no counterpart exists for proteomics. We argue that the major difference between genetics/genomics and proteomics is that genomics has a strong explanatory function, due to the strong cultural narrative of heredity. Based on qualitative interviews and observations of proteomics conferences, we suggest that the nature of proteins, lack of public understanding, and theoretical complexity exacerbates this difference for proteomics. Lewontin's framework suggests that social scientists may find that omics sciences affect social relations in different ways than past analyses of genetics.
Resumo:
Meibomian cell carcinoma (MCC) is a malignant tumor of the meibomian glands located in the eyelids. No information exists on the cytogenctic and genetic aspects of MCC. There is no report on the gene expression profile of MCC. Thus there is a need, for both scientific and clinical reasons, to identify genes and pathways that are involved in the development and progression of MCC. We analyzed the gene expression profile of MCC by the microarray technique. Forty-four genes were upregulated and 149 genes were downregulated in MCC. Differential expression data were confirmed for 5 genes by semiquantitative RT-PCR in MCC tumors: GTF2H4, RBM12, UBE2D3, DDX17, and LZTS1. We found dysregulation of two major pathways in MCC: MAPK and JAK/STAT. Clusters of genes on chromosomes 1, 12, and 19 were dysregUlated in MCC. The data presented here will facilitate the identification of specific markers and therapeutic targets for the treatment of MCC patients. (c) 2007 Elsevier Inc. All rights reserved.
Resumo:
Autism is a childhood-onset developmental disorder characterized by deficits in reciprocal social interaction, verbal and non-verbal communication, and dependence on routines and rituals. It belongs to a spectrum of disorders (autism spectrum disorders, ASDs) which share core symptoms but show considerable variation in severity. The whole spectrum affects 0.6-0.7% of children worldwide, inducing a substantial public health burden and causing suffering to the affected families. Despite having a very high heritability, ASDs have shown exceptional genetic heterogeneity, which has complicated the identification of risk variants and left the etiology largely unknown. However, recent studies suggest that rare, family-specific factors contribute significantly to the genetic basis of ASDs. In this study, we investigated the role of DISC1 (Disrupted-in-schizophrenia-1) in ASDs, and identified association with markers and haplotypes previously associated with psychiatric phenotypes. We identified four polymorphic micro-RNA target sites in the 3 UTR of DISC1, and showed that hsa-miR-559 regulates DISC1 expression in vitro in an allele-specific manner. We also analyzed an extended autism pedigree with genealogical roots in Central Finland reaching back to the 17th century. To take advantage of the beneficial characteristics of population isolates to gene mapping and reduced genetic heterogeneity observed in distantly related individuals, we performed a microsatellite-based genome-wide screen for linkage and linkage disequilibrium in this pedigree. We identified a putative autism susceptibility locus on chromosome 19p13.3 and obtained further support for previously reported loci at 1q23 and 15q11-q13. To follow-up these findings, we extended our study sample from the same sub-isolate and initiated a genome-wide analysis of homozygosity and allelic sharing using high-density SNP markers. We identified a small number of haplotypes shared by different subsets of the genealogically connected cases, along with convergent biological pathways from SNP and gene expression data, which highlighted axon guidance molecules in the pathogenesis of ASDs. In conclusion, the results obtained in this thesis show that multiple distinct genetic variants are responsible for the ASD phenotype even within single pedigrees from an isolated population. We suggest that targeted resequencing of the shared haplotypes, linkage regions, and other susceptibility loci is essential to identify the causal variants. We also report a possible micro-RNA mediated regulatory mechanism, which might partially explain the wide-range neurobiological effects of the DISC1 gene.