224 resultados para bioinformatic
Resumo:
Predictive models of peptide-Major Histocompatibility Complex (MHC) binding affinity are important components of modern computational immunovaccinology. Here, we describe the development and deployment of a reliable peptide-binding prediction method for a previously poorly-characterized human MHC class I allele, HLA-Cw*0102.
Resumo:
Quantitative structure-activity relationship (QSAR) analysis is a cornerstone of modern informatics. Predictive computational models of peptide-major histocompatibility complex (MHC)-binding affinity based on QSAR technology have now become important components of modern computational immunovaccinology. Historically, such approaches have been built around semiqualitative, classification methods, but these are now giving way to quantitative regression methods. We review three methods--a 2D-QSAR additive-partial least squares (PLS) and a 3D-QSAR comparative molecular similarity index analysis (CoMSIA) method--which can identify the sequence dependence of peptide-binding specificity for various class I MHC alleles from the reported binding affinities (IC50) of peptide sets. The third method is an iterative self-consistent (ISC) PLS-based additive method, which is a recently developed extension to the additive method for the affinity prediction of class II peptides. The QSAR methods presented here have established themselves as immunoinformatic techniques complementary to existing methodology, useful in the quantitative prediction of binding affinity: current methods for the in silico identification of T-cell epitopes (which form the basis of many vaccines, diagnostics, and reagents) rely on the accurate computational prediction of peptide-MHC affinity. We have reviewed various human and mouse class I and class II allele models. Studied alleles comprise HLA-A*0101, HLA-A*0201, HLA-A*0202, HLA-A*0203, HLA-A*0206, HLA-A*0301, HLA-A*1101, HLA-A*3101, HLA-A*6801, HLA-A*6802, HLA-B*3501, H2-K(k), H2-K(b), H2-D(b) HLA-DRB1*0101, HLA-DRB1*0401, HLA-DRB1*0701, I-A(b), I-A(d), I-A(k), I-A(S), I-E(d), and I-E(k). In this chapter we show a step-by-step guide into predicting the reliability and the resulting models to represent an advance on existing methods. The peptides used in this study are available from the AntiJen database (http://www.jenner.ac.uk/AntiJen). The PLS method is available commercially in the SYBYL molecular modeling software package. The resulting models, which can be used for accurate T-cell epitope prediction, will be made are freely available online at the URL http://www.jenner.ac.uk/MHCPred.
Resumo:
The accurate identification of T-cell epitopes remains a principal goal of bioinformatics within immunology. As the immunogenicity of peptide epitopes is dependent on their binding to major histocompatibility complex (MHC) molecules, the prediction of binding affinity is a prerequisite to the reliable prediction of epitopes. The iterative self-consistent (ISC) partial-least-squares (PLS)-based additive method is a recently developed bioinformatic approach for predicting class II peptide−MHC binding affinity. The ISC−PLS method overcomes many of the conceptual difficulties inherent in the prediction of class II peptide−MHC affinity, such as the binding of a mixed population of peptide lengths due to the open-ended class II binding site. The method has applications in both the accurate prediction of class II epitopes and the manipulation of affinity for heteroclitic and competitor peptides. The method is applied here to six class II mouse alleles (I-Ab, I-Ad, I-Ak, I-As, I-Ed, and I-Ek) and included peptides up to 25 amino acids in length. A series of regression equations highlighting the quantitative contributions of individual amino acids at each peptide position was established. The initial model for each allele exhibited only moderate predictivity. Once the set of selected peptide subsequences had converged, the final models exhibited a satisfactory predictive power. Convergence was reached between the 4th and 17th iterations, and the leave-one-out cross-validation statistical terms - q2, SEP, and NC - ranged between 0.732 and 0.925, 0.418 and 0.816, and 1 and 6, respectively. The non-cross-validated statistical terms r2 and SEE ranged between 0.98 and 0.995 and 0.089 and 0.180, respectively. The peptides used in this study are available from the AntiJen database (http://www.jenner.ac.uk/AntiJen). The PLS method is available commercially in the SYBYL molecular modeling software package. The resulting models, which can be used for accurate T-cell epitope prediction, will be made freely available online (http://www.jenner.ac.uk/MHCPred).
Resumo:
TAP is responsible for the transit of peptides from the cytosol to the lumen of the endoplasmic reticulum. In an immunological context, this event is followed by the binding of peptides to MHC molecules before export to the cell surface and recognition by T cells. Because TAP transport precedes MHC binding, TAP preferences may make a significant contribution to epitope selection. To assess the impact of this preselection, we have developed a scoring function for TAP affinity prediction using the additive method, have used it to analyze and extend the TAP binding motif, and have evaluated how well this model acts as a preselection step in predicting MHC binding peptides. To distinguish between MHC alleles that are exclusively dependent on TAP and those exhibiting only a partial dependence on TAP, two sets of MHC binding peptides were examined: HLA-A*0201 was selected as a representative of partially TAP-dependent HLA alleles, and HLA-A*0301 represented fully TAP-dependent HLA alleles. TAP preselection has a greater impact on TAP-dependent alleles than on TAP-independent alleles. The reduction in the number of nonbinders varied from 10% (TAP-independent) to 33% (TAP-dependent), suggesting that TAP preselection is an important component in the successful in silico prediction of T cell epitopes.
Resumo:
The ability to define and manipulate the interaction of peptides with MHC molecules has immense immunological utility, with applications in epitope identification, vaccine design, and immunomodulation. However, the methods currently available for prediction of peptide-MHC binding are far from ideal. We recently described the application of a bioinformatic prediction method based on quantitative structure-affinity relationship methods to peptide-MHC binding. In this study we demonstrate the predictivity and utility of this approach. We determined the binding affinities of a set of 90 nonamer peptides for the MHC class I allele HLA-A*0201 using an in-house, FACS-based, MHC stabilization assay, and from these data we derived an additive quantitative structure-affinity relationship model for peptide interaction with the HLA-A*0201 molecule. Using this model we then designed a series of high affinity HLA-A2-binding peptides. Experimental analysis revealed that all these peptides showed high binding affinities to the HLA-A*0201 molecule, significantly higher than the highest previously recorded. In addition, by the use of systematic substitution at principal anchor positions 2 and 9, we showed that high binding peptides are tolerant to a wide range of nonpreferred amino acids. Our results support a model in which the affinity of peptide binding to MHC is determined by the interactions of amino acids at multiple positions with the MHC molecule and may be enhanced by enthalpic cooperativity between these component interactions.
Resumo:
Classification of MHC molecules into supertypes in terms of peptide-binding specificities is an important issue, with direct implications for the development of epitope-based vaccines with wide population coverage. In view of extremely high MHC polymorphism (948 class I and 633 class II HLA alleles) the experimental solution of this task is presently impossible. In this study, we describe a bioinformatics strategy for classifying MHC molecules into supertypes using information drawn solely from three-dimensional protein structure. Two chemometric techniques–hierarchical clustering and principal component analysis–were used independently on a set of 783 HLA class I molecules to identify supertypes based on structural similarities and molecular interaction fields calculated for the peptide binding site. Eight supertypes were defined: A2, A3, A24, B7, B27, B44, C1, and C4. The two techniques gave 77% consensus, i.e., 605 HLA class I alleles were classified in the same supertype by both methods. The proposed strategy allowed “supertype fingerprints” to be identified. Thus, the A2 supertype fingerprint is Tyr9/Phe9, Arg97, and His114 or Tyr116; the A3-Tyr9/Phe9/Ser9, Ile97/Met97 and Glu114 or Asp116; the A24-Ser9 and Met97; the B7-Asn63 and Leu81; the B27-Glu63 and Leu81; for B44-Ala81; the C1-Ser77; and the C4-Asn77. action fields calculated for the peptide binding site. Eight supertypes were defined: A2, A3, A24, B7, B27, B44, C1, and C4. The two techniques gave 77% consensus, i.e., 605 HLA class I alleles were classified in the same supertype by both methods. The proposed strategy allowed “supertype fingerprints” to be identified. Thus, the A2 supertype fingerprint is Tyr9/Phe9, Arg97, and His114 or Tyr116; the A3-Tyr9/Phe9/Ser9, Ile97/Met97 and Glu114 or Asp116; the A24-Ser9 and Met97; the B7-Asn63 and Leu81; the B27-Glu63 and Leu81; for B44-Ala81; the C1-Ser77; and the C4-Asn77.
Resumo:
To carry out their specific roles in the cell, genes and gene products often work together in groups, forming many relationships among themselves and with other molecules. Such relationships include physical protein-protein interaction relationships, regulatory relationships, metabolic relationships, genetic relationships, and much more. With advances in science and technology, some high throughput technologies have been developed to simultaneously detect tens of thousands of pairwise protein-protein interactions and protein-DNA interactions. However, the data generated by high throughput methods are prone to noise. Furthermore, the technology itself has its limitations, and cannot detect all kinds of relationships between genes and their products. Thus there is a pressing need to investigate all kinds of relationships and their roles in a living system using bioinformatic approaches, and is a central challenge in Computational Biology and Systems Biology. This dissertation focuses on exploring relationships between genes and gene products using bioinformatic approaches. Specifically, we consider problems related to regulatory relationships, protein-protein interactions, and semantic relationships between genes. A regulatory element is an important pattern or "signal", often located in the promoter of a gene, which is used in the process of turning a gene "on" or "off". Predicting regulatory elements is a key step in exploring the regulatory relationships between genes and gene products. In this dissertation, we consider the problem of improving the prediction of regulatory elements by using comparative genomics data. With regard to protein-protein interactions, we have developed bioinformatics techniques to estimate support for the data on these interactions. While protein-protein interactions and regulatory relationships can be detected by high throughput biological techniques, there is another type of relationship called semantic relationship that cannot be detected by a single technique, but can be inferred using multiple sources of biological data. The contributions of this thesis involved the development and application of a set of bioinformatic approaches that address the challenges mentioned above. These included (i) an EM-based algorithm that improves the prediction of regulatory elements using comparative genomics data, (ii) an approach for estimating the support of protein-protein interaction data, with application to functional annotation of genes, (iii) a novel method for inferring functional network of genes, and (iv) techniques for clustering genes using multi-source data.
Resumo:
This work is concerned with the genetic basis of normal human pigmentation variation. Specifically, the role of polymorphisms within the solute carrier family 45 member 2 (SLC45A2 or membrane associated transporter protein; MATP) gene were investigated with respect to variation in hair, skin and eye colour ― both between and within populations. SLC45A2 is an important regulator of melanin production and mutations in the gene underly the most recently identified form of oculocutaneous albinism. There is evidence to suggest that non-synonymous polymorphisms in SLC45A2 are associated with normal pigmentation variation between populations. Therefore, the underlying hypothesis of this thesis is that polymorphisms in SLC45A2 will alter the function or regulation of the protein, thereby altering the important role it plays in melanogenesis and providing a mechanism for normal pigmentation variation. In order to investigate the role that SLC45A2 polymorphisms play in human pigmentation variation, a DNA database was established which collected pigmentation phenotypic information and blood samples of more than 700 individuals. This database was used as the foundation for two association studies outlined in this thesis, the first of which involved genotyping two previously-described non-synonymous polymorphisms, p.Glu272Lys and p.Phe374Leu, in four different population groups. For both polymorphisms, allele frequencies were significantly different between population groups and the 272Lys and 374Leu alleles were strongly associated with black hair, brown eyes and olive skin colour in Caucasians. This was the first report to show that SLC45A2 polymorphisms were associated with normal human intra-population pigmentation variation. The second association study involved genotyping several SLC45A2 promoter polymorphisms to determine if they also played a role in pigmentation variation. Firstly, the transcription start site (TSS), and hence putative proximal promoter region, was identified using 5' RNA ligase mediated rapid amplification of cDNA ends (RLM-RACE). Two alternate TSSs were identified and the putative promoter region was screened for novel polymorphisms using denaturing high performance liquid chromatography (dHPLC). A novel duplication (c.–1176_–1174dupAAT) was identified along with other previously described single nucleotide polymorphisms (c.–1721C>G and c.–1169G>A). Strong linkage disequilibrium ensured that all three polymorphisms were associated with skin colour such that the –1721G, +dup and –1169A alleles were associated with olive skin in Caucasians. No linkage disequilibrium was observed between the promoter and coding region polymorphisms, suggesting independent effects. The association analyses were complemented with functional data, showing that the –1721G, +dup and –1169A alleles significantly decreased SLC45A2 transcriptional activity. Based on in silico bioinformatic analysis that showed these alleles remove a microphthalmia-associated transcription factor (MITF) binding site, and that MITF is a known regulator of SLC45A2 (Baxter and Pavan, 2002; Du and Fisher, 2002), it was postulated that SLC45A2 promoter polymorphisms could contribute to the regulation of pigmentation by altering MITF binding affinity. Further characterisation of the SLC45A2 promoter was carried out using luciferase reporter assays to determine the transcriptional activity of different regions of the promoter. Five constructs were designed of increasing length and their promoter activity evaluated. Constitutive promoter activity was observed within the first ~200 bp and promoter activity increased as the construct size increased. The functional impact of the –1721G, +dup and –1169A alleles, which removed a MITF consensus binding site, were assessed using electrophoretic mobility shift assays (EMSA) and expression analysis of genotyped melanoblast and melanocyte cell lines. EMSA results confirmed that the promoter polymorphisms affected DNA-protein binding. Interestingly, however, the protein/s involved were not MITF, or at least MITF was not the protein directly binding to the DNA. In an effort to more thoroughly characterise the functional consequences of SLC45A2 promoter polymorphisms, the mRNA expression levels of SLC45A2 and MITF were determined in melanocyte/melanoblast cell lines. Based on SLC45A2’s role in processing and trafficking TYRP1 from the trans-Golgi network to stage 2 melanosmes, the mRNA expression of TYRP1 was also investigated. Expression results suggested a coordinated expression of pigmentation genes. This thesis has substantially contributed to the field of pigmentation by showing that SLC45A2 polymorphisms not only show allele frequency differences between population groups, but also contribute to normal pigmentation variation within a Caucasian population. In addition, promoter polymorphisms have been shown to have functional consequences for SLC45A2 transcription and the expression of other pigmentation genes. Combined, the data presented in this work supports the notion that SLC45A2 is an important contributor to normal pigmentation variation and should be the target of further research to elucidate its role in determining pigmentation phenotypes. Understanding SLC45A2’s function may lead to the development of therapeutic interventions for oculocutaneous albinism and other disorders of pigmentation. It may also help in our understanding of skin cancer susceptibility and evolutionary adaptation to different UV environments, and contribute to the forensic application of pigmentation phenotype prediction.
Resumo:
Chlamydia pneumoniae is a common human and animal pathogen associated with a wide range of upper and lower respiratory tract infections. In more recent years there has been increasing evidence to suggest a link between C. pneumoniae and chronic diseases in humans, including atherosclerosis, stroke and Alzheimer’s disease. C. pneumoniae human strains show little genetic variation, indicating that the human-derived strain originated from a common ancestor in the recent past. Despite extensive information on the genetics and morphology processes of the human strain, knowledge concerning many other hosts (including marsupials, amphibians, reptiles and equines) remains virtually unexplored. The koala (Phascolarctos cinereus) is a native Australian marsupial under threat due to habitat loss, predation and disease. Koalas are very susceptible to chlamydial infections, most commonly affecting the conjunctiva, urogenital tract and/or respiratory tract. To address this gap in the literature, the present study (i) provides a detailed description of the morphologic and genomic architecture of the C. pneumoniae koala (and human) strain, and shows that the koala strain is microscopically, developmentally and genetically distinct from the C. pneumoniae human strain, and (ii) examines the genetic relationship of geographically diverse C. pneumoniae isolates from human, marsupial, amphibian, reptilian and equine hosts, and identifies two distinct lineages that have arisen from animal-to-human cross species transmissions. Chapter One of this thesis explores the scientific problem and aims of this study, while Chapter Two provides a detailed literature review of the background in this field of work. Chapter Three, the first results chapter, describes the morphology and developmental stages of C. pneumoniae koala isolate LPCoLN, as revealed by fluorescence and transmission electron microscopy. The profile of this isolate, when cultured in HEp-2 human epithelial cells, was quite different to the human AR39 isolate. Koala LPCoLN inclusions were larger; the elementary bodies did not have the characteristic pear-shaped appearance, and the developmental cycle was completed within a shorter period of time (as confirmed by quantitative real-time PCR). These in vitro findings might reflect biological differences between koala LPCoLN and human AR39 in vivo. Chapter Four describes the complete genome sequence of the koala respiratory pathogen, C. pneumoniae LPCoLN. This is the first animal isolate of C. pneumoniae to be fully-sequenced. The genome sequence provides new insights into genomic ‘plasticity’ (organisation), evolution and biology of koala LPCoLN, relative to four complete C. pneumoniae human genomes (AR39, CWL029, J138 and TW183). Koala LPCoLN contains a plasmid that is not shared with any of the human isolates, there is evidence of gene loss in nucleotide salvage pathways, and there are 10 hot spot genomic regions of variation that were previously not identified in the C. pneumoniae human genomes. Sequence (partial-length) from a second, independent, wild koala isolate (EBB) at several gene loci confirmed that the koala LPCoLN isolate was representative of a koala C. pneumoniae strain. The combined sequence data provides evidence that the C. pneumoniae animal (koala LPCoLN) genome is ancestral to the C. pneumoniae human genomes and that human infections may have originated from zoonotic infections. Chapter Five examines key genome components of the five C. pneumoniae genomes in more detail. This analysis reveals genomic features that are shared by and/or contribute to the broad ecological adaptability and evolution of C. pneumoniae. This analysis resulted in the identification of 65 gene sequences for further analysis of intraspecific variation, and revealed some interesting differences, including fragmentation, truncation and gene decay (loss of redundant ancestral traits). This study provides valuable insights into metabolic diversity, adaptation and evolution of C. pneumoniae. Chapter Six utilises a subset of 23 target genes identified from the previous genomic comparisons and makes a significant contribution to our understanding of genetic variability among C. pneumoniae human (11) and animal (6 amphibian, 5 reptilian, 1 equine and 7 marsupial hosts) isolates. It has been shown that the animal isolates are genetically diverse, unlike the human isolates that are virtually clonal. More convincing evidence that C. pneumoniae originated in animals and recently (in the last few hundred thousand years) crossed host species to infect humans is provided in this study. It is proposed that two animal-to-human cross species events have occurred in the context of the results, one evident by the nearly clonal human genotype circulating in the world today, and the other by a more animal-like genotype apparent in Indigenous Australians. Taken together, these data indicate that the C. pneumoniae koala LPCoLN isolate has morphologic and genomic characteristics that are distinct from the human isolates. These differences may affect the survival and activity of the C. pneumoniae koala pathogen in its natural host, in vivo. This study, by utilising the genetic diversity of C. pneumoniae, identified new genetic markers for distinguishing human and animal isolates. However, not all C. pneumoniae isolates were genetically diverse; in fact, several isolates were highly conserved, if not identical in sequence (i.e. Australian marsupials) emphasising that at some stage in the evolution of this pathogen, there has been an adaptation/s to a particular host, providing some stability in the genome. The outcomes of this study by experimental and bioinformatic approaches have significantly enhanced our knowledge of the biology of this pathogen and will advance opportunities for the investigation of novel vaccine targets, antimicrobial therapy, or blocking of pathogenic pathways.
Resumo:
Bioinformatics is dominated by online databases and sophisticated web-accessible tools. As such, it is ideally placed to benefit from the rapid, purpose specific combination of services achievable via web mashups. The recent introduction of a number of sophisticated frameworks has greatly simplified the mashup creation process, making them accessible to scientists with limited programming expertise. In this paper we investigate the feasibility of mashups as a new approach to bioinformatic experimentation, focusing on an exploratory niche between interactive web usage and robust workflows, and attempting to identify the range of computations for which mashups may be employed. While we treat each of the major frameworks, we illustrate the ideas with a series of examples developed under the Popfly framework
Resumo:
To identify microRNAs potentially involved in melanomagenesis, we compared microRNA expression profiles between melanoma cell lines and cultured melanocytes. The most differentially expressed microRNA between the normal and tumor cell lines was miR-211. We focused on this pigment-cell-enriched miRNA as it is derived from the microphthalmia-associated transcription factor (MITF)-regulated gene, TRPM1 (melastatin). We find that miR-211 expression is greatly decreased in melanoma cells and melanoblasts compared to melanocytes. Bioinformatic analysis identified a large number of potential targets of miR-211, including POU3F2 (BRN2). Inhibition of miR-211 in normal melanocytes resulted in increased BRN2 protein, indicating that endogenous miR-211 represses BRN2 in differentiated cells. Over-expression of miR-211 in melanoma cell lines changed the invasive potential of the cells in vitro through directly targeting BRN2 translation. We propose a model for the apparent non-overlapping expression levels of BRN2 and MITF in melanoma, mediated by miR-211 expression.
Resumo:
Vitamin A deficiency (VAD) is a serious problem in developing countries, affecting approximately 127 million children of preschool age and 7.2 million pregnant women each year. However, this deficiency is readily treated and prevented through adequate nutrition. This can potentially be achieved through genetically engineered biofortification of staple food crops to enhance provitamin A (pVA) carotenoid content. Bananas are the fourth most important food crop with an annual production of 100 million tonnes and are widely consumed in areas affected by VAD. However, the fruit pVA content of most widely consumed banana cultivars is low (~ 0.2 to 0.5 ìg/g dry weight). This includes cultivars such as the East African highland banana (EAHB), the staple crop in countries such as Uganda, where annual banana consumption is approximately 250 kg per person. This fact, in addition to the agronomic properties of staple banana cultivars such as vegetative reproduction and continuous cropping, make bananas an ideal target for pVA enhancement through genetic engineering. Interestingly, there are banana varieties known with high fruit pVA content (up to 27.8 ìg/g dry weight), although they are not widely consumed due to factors such as cultural preference and availability. The genes involved in carotenoid accumulation during banana fruit ripening have not been well studied and an understanding of the molecular basis for the differential capacity of bananas to accumulate carotenoids may impact on the effective production of genetically engineered high pVA bananas. The production of phytoene by the enzyme phytoene synthase (PSY) has been shown to be an important rate limiting determinant of pVA accumulation in crop systems such as maize and rice. Manipulation of this gene in rice has been used successfully to produce Golden Rice, which exhibits higher seed endosperm pVA levels than wild type plants. Therefore, it was hypothesised that differences between high and low pVA accumulating bananas could be due either to differences in PSY enzyme activity or factors regulating the expression of the psy gene. Therefore, the aim of this thesis was to investigate the role of PSY in accumulation of pVA in banana fruit of representative high (Asupina) and low (Cavendish) pVA banana cultivars by comparing the nucleic acid and encoded amino acid sequences of the banana psy genes, in vivo enzyme activity of PSY in rice callus and expression of PSY through analysis of promoter activity and mRNA levels. Initially, partial sequences of the psy coding region from five banana cultivars were obtained using reverse transcriptase (RT)-PCR with degenerate primers designed to conserved amino acids in the coding region of available psy sequences from other plants. Based on phylogenetic analysis and comparison to maize psy sequences, it was found that in banana, psy occurs as a gene family of at least three members (psy1, psy2a and psy2b). Subsequent analysis of the complete coding regions of these genes from Asupina and Cavendish suggested that they were all capable of producing functional proteins due to high conservation in the catalytic domain. However, inability to obtain the complete mRNA sequences of Cavendish psy2a, and isolation of two non-functional Cavendish psy2a coding region variants, suggested that psy2a expression may be impaired in Cavendish. Sequence analysis indicated that these Cavendish psy2a coding region variants may have resulted from alternate splicing. Evidence of alternate splicing was also observed in one Asupina psy1 coding region variant, which was predicted to produce a functional PSY1 isoform. The complete mRNA sequence of the psy2b coding regions could not be isolated from either cultivar. Interestingly, psy1 was cloned predominantly from leaf while psy2 was obtained preferentially from fruit, suggesting some level of tissue-specific expression. The Asupina and Cavendish psy1 and psy2a coding regions were subsequently expressed in rice callus and the activity of the enzymes compared in vivo through visual observation and quantitative measurement of carotenoid accumulation. The maize B73 psy1 coding region was included as a positive control. After several weeks on selection, regenerating calli showed a range of colours from white to dark orange representing various levels of carotenoid accumulation. These results confirmed that the banana psy coding regions were all capable of producing functional enzymes. No statistically significant differences in levels of activity were observed between banana PSYs, suggesting that differences in PSY activity were not responsible for differences in the fruit pVA content of Asupina and Cavendish. The psy1 and psy2a promoter sequences were isolated from Asupina and Cavendish gDNA using a PCR-based genome walking strategy. Interestingly, three Cavendish psy2a promoter clones of different sizes, representing possible allelic variants, were identified while only single promoter sequences were obtained for the other Asupina and Cavendish psy genes. Bioinformatic analysis of these sequences identified motifs that were previously characterised in the Arabidopsis psy promoter. Notably, an ATCTA motif associated with basal expression in Arabidopsis was identified in all promoters with the exception of two of the Cavendish psy2a promoter clones (Cpsy2apr2 and Cpsy2apr3). G1 and G2 motifs, linked to light-regulated responses in Arabidopsis, appeared to be differentially distributed between psy1 and psy2a promoters. In the untranscribed regulatory regions, the G1 motifs were found only in psy1 promoters, while the G2 motifs were found only in psy2a. Interestingly, both ATCTA and G2 motifs were identified in the 5’ UTRs of Asupina and Cavendish psy1. Consistent with other monocot promoters, introns were present in the Asupina and Cavendish psy1 5’ UTRs, while none were observed in the psy2a 5’ UTRs. Promoters were cloned into expression constructs, driving the â-glucuronidase (GUS) reporter gene. Transient expression of the Asupina and Cavendish psy1 and psy2a promoters in both Cavendish embryogenic cells and Cavendish fruit demonstrated that all promoters were active, except Cpsy2apr2 and Cpsy2apr3. The functional Cavendish psy2a promoter (Cpsy2apr1) appeared to have activity similar to the Asupina psy2a promoter. The activities of the Asupina and Cavendish psy1 promoters were similar to each other, and comparable to those of the functional psy2a promoters. Semi-quantitative PCR analysis of Asupina and Cavendish psy1 and psy2a transcripts showed that psy2a levels were high in green fruit and decreased during ripening, reinforcing the hypothesis that fruit pVA levels were largely dependent on levels of psy2a expression. Additionally, semi-quantitative PCR using intron-spanning primers indicated that high levels of unprocessed psy2a and psy2b mRNA were present in the ripe fruit of Cavendish but not in Asupina. This raised the possibility that differences in intron processing may influence pVA accumulation in Asupina and Cavendish. In this study the role of PSY in banana pVA accumulation was analysed at a number of different levels. Both mRNA accumulation and promoter activity of psy genes studied were very similar between Asupina and Cavendish. However, in several experiments there was evidence of cryptic or alternate splicing that differed in Cavendish compared to Asupina, although these differences were not conclusively linked to the differences in fruit pVA accumulation between Asupina and Cavendish. Therefore, other carotenoid biosynthetic genes or regulatory mechanisms may be involved in determining pVA levels in these cultivars. This study has contributed to an increased understanding of the role of PSY in the production of pVA carotenoids in banana fruit, corroborating the importance of this enzyme in regulating carotenoid production. Ultimately, this work may serve to inform future research into pVA accumulation in important crop varieties such as the EAHB and the discovery of avenues to improve such crops through genetic modification.
Resumo:
Prior to the completion of the human genome project, the human genome was thought to have a greater number of genes as it seemed structurally and functionally more complex than other simpler organisms. This along with the belief of “one gene, one protein”, were demonstrated to be incorrect. The inequality in the ratio of gene to protein formation gave rise to the theory of alternative splicing (AS). AS is a mechanism by which one gene gives rise to multiple protein products. Numerous databases and online bioinformatic tools are available for the detection and analysis of AS. Bioinformatics provides an important approach to study mRNA and protein diversity by various tools such as expressed sequence tag (EST) sequences obtained from completely processed mRNA. Microarrays and deep sequencing approaches also aid in the detection of splicing events. Initially it was postulated that AS occurred only in about 5%; of all genes but was later found to be more abundant. Using bioinformatic approaches, the level of AS in human genes was found to be fairly high with 35-59%; of genes having at least one AS form. Our ability to determine and predict AS is important as disorders in splicing patterns may lead to abnormal splice variants resulting in genetic diseases. In addition, the diversity of proteins produced by AS poses a challenge for successful drug discovery and therefore a greater understanding of AS would be beneficial.
Resumo:
Recent developments in genomic technologies have resulted in increased understanding of pathogenic mechanisms and emphasized the importance of central survival pathways. Here, we use a novel bioinformatic based integrative genomic profiling approach to elucidate conserved mechanisms of lymphomagenesis in the three commonest non-Hodgkin's lymphoma (NHL) entities: diffuse large B-cell lymphoma, follicular lymphoma, and B-cell chronic lymphocytic leukemia. By integrating genome-wide DNA copy number analysis and transcriptome profiling of tumor cohorts, we identified genetic lesions present in each entity and highlighted their likely target genes. This revealed a significant enrichment of components of both the apoptosis pathway and the mitogen activated protein kinase pathway, including amplification of the MAP3K12 locus in all three entities, within the set of genes targeted by genetic alterations in these diseases. Furthermore, amplification of 12p13.33 was identified in all three entities and found to target the FOXM1 oncogene. Amplification of FOXM1 was subsequently found to be associated with an increased MYC oncogenic signaling signature, and siRNA-mediated knock-down of FOXM1 resulted in decreased MYC expression and induced G2 arrest. Together, these findings underscore genetic alteration of the MAPK and apoptosis pathways, and genetic amplification of FOXM1 as conserved mechanisms of lymphomagenesis in common NHL entities. Integrative genomic profiling identifies common central survival mechanisms and highlights them as attractive targets for directed therapy.