44 resultados para genomic phenotype
em DigitalCommons@The Texas Medical Center
Resumo:
This research characterized a serologically indistinguishable form of HLA-DR1 that: (1) cannot stimulate some DR1-restricted or specific T-lymphocyte clones; (2) displays an unusual electrophoretic pattern on two dimensional gels; and (3) is marked by a polymorphic restriction site of the alpha gene. Inefficient stimulation of some DR1-restricted clones was a property of DR1$\sp{+}$ cells that shared HLA-B14 on the same haplotype and/or were carriers of 21-hydroxylase (21-OH) deficiency. Nonclassical 21-OH deficiency frequently demonstrates genetic linkage with HLA-B14;DR1 haplotypes and associates with duplications of C4B and one 21-OH gene. Cells having both stimulatory (DR1$\sb{\rm n}$) and nonstimulatory (DR1$\sb{\rm x}$) parental haplotypes did not mediate proliferation of these clones. However, heterozygous DR1$\sb{\rm x}$, 2 and DR1$\sb{\rm x}$, 7 cells were efficient stimulators of DR2 and DR7 specific clones, respectively, suggesting that a trans acting factor may modify DR1 alleles or products to yield a dominant DR1$\sb{\rm x}$ phenotype. Incompetent stimulator populations did not secrete an intercellular soluble or contact dependent suppressor factor nor did they express interleukin-2 receptors competing for T-cell growth factors. Two dimensional gel analysis of anti-DR immunoprecipitates revealed, in addition to normal DR$\alpha$ and DR$\beta$ chains, a 50kD species from DR1$\sb{\rm x}$ but not from the majority of DR1$\sb{\rm n}$ or non-DR1 cells. The 50kD structure was stable under reducing conditions in SDS and urea, had antigenic homology with DR, and dissociated after boiling into 34kD and 28kD peptide chains apparently identical with DR$\alpha$ and DR$\beta$ as shown by limited digest peptide maps. N-linked glycosylation and sialation of DRgp50 appeared to be unchanged from normal DR$\alpha$ and DR$\beta$. Bg1II digestion and $DR\alpha$ probing of DR1$\sb{\rm x}$ genomic DNA revealed a 4.5kb fragment while DR1$\sb{\rm n}$ DNA yielded 3.8 and 0.76kb fragments; all restriction sites mapped to the 3$\sp\prime$ untranslated region of $DR\alpha$. Collectively, these data suggest that DRgp50 represents a novel combinatorial association between constitutive chains of DR that may interfere with or compete for normal T cell receptor recognition of DR1 as both an alloantigen and restricting element. Furthermore, extensive chromosomal abnormalities previously mapped to the class III region of B14;DR1 haplotypes may extend into the adjacent class II region with consequent intrusion on immune function. ^
Resumo:
Next-generation DNA sequencing platforms can effectively detect the entire spectrum of genomic variation and is emerging to be a major tool for systematic exploration of the universe of variants and interactions in the entire genome. However, the data produced by next-generation sequencing technologies will suffer from three basic problems: sequence errors, assembly errors, and missing data. Current statistical methods for genetic analysis are well suited for detecting the association of common variants, but are less suitable to rare variants. This raises great challenge for sequence-based genetic studies of complex diseases.^ This research dissertation utilized genome continuum model as a general principle, and stochastic calculus and functional data analysis as tools for developing novel and powerful statistical methods for next generation of association studies of both qualitative and quantitative traits in the context of sequencing data, which finally lead to shifting the paradigm of association analysis from the current locus-by-locus analysis to collectively analyzing genome regions.^ In this project, the functional principal component (FPC) methods coupled with high-dimensional data reduction techniques will be used to develop novel and powerful methods for testing the associations of the entire spectrum of genetic variation within a segment of genome or a gene regardless of whether the variants are common or rare.^ The classical quantitative genetics suffer from high type I error rates and low power for rare variants. To overcome these limitations for resequencing data, this project used functional linear models with scalar response to develop statistics for identifying quantitative trait loci (QTLs) for both common and rare variants. To illustrate their applications, the functional linear models were applied to five quantitative traits in Framingham heart studies. ^ This project proposed a novel concept of gene-gene co-association in which a gene or a genomic region is taken as a unit of association analysis and used stochastic calculus to develop a unified framework for testing the association of multiple genes or genomic regions for both common and rare alleles. The proposed methods were applied to gene-gene co-association analysis of psoriasis in two independent GWAS datasets which led to discovery of networks significantly associated with psoriasis.^
Resumo:
My dissertation focuses on two aspects of RNA sequencing technology. The first is the methodology for modeling the overdispersion inherent in RNA-seq data for differential expression analysis. This aspect is addressed in three sections. The second aspect is the application of RNA-seq data to identify the CpG island methylator phenotype (CIMP) by integrating datasets of mRNA expression level and DNA methylation status. Section 1: The cost of DNA sequencing has reduced dramatically in the past decade. Consequently, genomic research increasingly depends on sequencing technology. However it remains elusive how the sequencing capacity influences the accuracy of mRNA expression measurement. We observe that accuracy improves along with the increasing sequencing depth. To model the overdispersion, we use the beta-binomial distribution with a new parameter indicating the dependency between overdispersion and sequencing depth. Our modified beta-binomial model performs better than the binomial or the pure beta-binomial model with a lower false discovery rate. Section 2: Although a number of methods have been proposed in order to accurately analyze differential RNA expression on the gene level, modeling on the base pair level is required. Here, we find that the overdispersion rate decreases as the sequencing depth increases on the base pair level. Also, we propose four models and compare them with each other. As expected, our beta binomial model with a dynamic overdispersion rate is shown to be superior. Section 3: We investigate biases in RNA-seq by exploring the measurement of the external control, spike-in RNA. This study is based on two datasets with spike-in controls obtained from a recent study. We observe an undiscovered bias in the measurement of the spike-in transcripts that arises from the influence of the sample transcripts in RNA-seq. Also, we find that this influence is related to the local sequence of the random hexamer that is used in priming. We suggest a model of the inequality between samples and to correct this type of bias. Section 4: The expression of a gene can be turned off when its promoter is highly methylated. Several studies have reported that a clear threshold effect exists in gene silencing that is mediated by DNA methylation. It is reasonable to assume the thresholds are specific for each gene. It is also intriguing to investigate genes that are largely controlled by DNA methylation. These genes are called “L-shaped” genes. We develop a method to determine the DNA methylation threshold and identify a new CIMP of BRCA. In conclusion, we provide a detailed understanding of the relationship between the overdispersion rate and sequencing depth. And we reveal a new bias in RNA-seq and provide a detailed understanding of the relationship between this new bias and the local sequence. Also we develop a powerful method to dichotomize methylation status and consequently we identify a new CIMP of breast cancer with a distinct classification of molecular characteristics and clinical features.
Resumo:
Mutations in the p53 tumor suppressor gene are found in over 50% of human tumors and in the germline of Li-Fraumeni syndrome families. About 80% of these mutations are missense in nature. In order to study how p53 missense mutations affect tumorigenesis in vivo, we focused on the murine p53 arg-to-his mutation at amino acid 172, which corresponds to the human hot spot mutation at amino acid 175. The double replacement procedure was employed to introduce the p53 R172H mutation into the p53 locus of ES cells and mice were generated. An additional 1bp deletion in the intron 2 splice acceptor site was detected in the same allele in mice. We named this allele p53R172HΔg. This allele makes a small amount of full length p53 mutant protein. ^ Spontaneous tumor formation and survival were studied in these mice. Mice heterozygous for the p53R172HΔg allele showed 50% survival at 17 months of age, similar to the p53+/− mice. Moreover, the p53R172HΔg/+ mice showed a distinct tumor spectrum: 55% sarcomas, including osteosarcoms, fibrosarcomas and angiosarcomas; 27% carcinomas, including lung adenocarcinomas, squamous cell carcinomas, hepatocellular carcinomas and islet cell carcinomas; and 18% lymphomas. Compared to the p53+/− mice, there was a clear increase in the frequency of carcinoma development and a decrease in lymphoma incidence. Among the sarcomas that developed, fibrosarcomas in the skin were also more frequently observed. More importantly, osteosarcomas and carinomas that developed in the p53R172HΔg/+ mice metastasized at very high frequency (64% and 67%, respectively) compared with less than 10% in the p53+/− mice. The metastatic lesions were usually found in lung and liver, and less frequently in other tissues. The altered tumor spectrum in the mice and increased metastatic potential of the tumors suggested that the p53R172H mutation represents a gain-of-function. ^ Mouse embryonic fibroblasts (MEFs) from the mice homozygous and heterozygous for the p53R172HΔg allele were studied for growth characteristics, immortalization potential and genomic instability. All of the p53R172HΔg /+ MEF lines are immortalized under a 3T3 protocol while under the same protocol p53+/− MEFs are not immortalized. Karyotype analysis showed a persistent appearance of chromosome end-to-end fusion in the MEFs both homozygous and heterozygous for the p53R172HΔg allele. These observations suggest that increased genomic instability in the cells may cause the altered tumor phenotypes. ^
Resumo:
A wealth of genetic associations for cardiovascular and metabolic phenotypes in humans has been accumulating over the last decade, in particular a large number of loci derived from recent genome wide association studies (GWAS). True complex disease-associated loci often exert modest effects, so their delineation currently requires integration of diverse phenotypic data from large studies to ensure robust meta-analyses. We have designed a gene-centric 50 K single nucleotide polymorphism (SNP) array to assess potentially relevant loci across a range of cardiovascular, metabolic and inflammatory syndromes. The array utilizes a "cosmopolitan" tagging approach to capture the genetic diversity across approximately 2,000 loci in populations represented in the HapMap and SeattleSNPs projects. The array content is informed by GWAS of vascular and inflammatory disease, expression quantitative trait loci implicated in atherosclerosis, pathway based approaches and comprehensive literature searching. The custom flexibility of the array platform facilitated interrogation of loci at differing stringencies, according to a gene prioritization strategy that allows saturation of high priority loci with a greater density of markers than the existing GWAS tools, particularly in African HapMap samples. We also demonstrate that the IBC array can be used to complement GWAS, increasing coverage in high priority CVD-related loci across all major HapMap populations. DNA from over 200,000 extensively phenotyped individuals will be genotyped with this array with a significant portion of the generated data being released into the academic domain facilitating in silico replication attempts, analyses of rare variants and cross-cohort meta-analyses in diverse populations. These datasets will also facilitate more robust secondary analyses, such as explorations with alternative genetic models, epistasis and gene-environment interactions.
Resumo:
Empirical evidence and theoretical studies suggest that the phenotype, i.e., cellular- and molecular-scale dynamics, including proliferation rate and adhesiveness due to microenvironmental factors and gene expression that govern tumor growth and invasiveness, also determine gross tumor-scale morphology. It has been difficult to quantify the relative effect of these links on disease progression and prognosis using conventional clinical and experimental methods and observables. As a result, successful individualized treatment of highly malignant and invasive cancers, such as glioblastoma, via surgical resection and chemotherapy cannot be offered and outcomes are generally poor. What is needed is a deterministic, quantifiable method to enable understanding of the connections between phenotype and tumor morphology. Here, we critically assess advantages and disadvantages of recent computational modeling efforts (e.g., continuum, discrete, and cellular automata models) that have pursued this understanding. Based on this assessment, we review a multiscale, i.e., from the molecular to the gross tumor scale, mathematical and computational "first-principle" approach based on mass conservation and other physical laws, such as employed in reaction-diffusion systems. Model variables describe known characteristics of tumor behavior, and parameters and functional relationships across scales are informed from in vitro, in vivo and ex vivo biology. We review the feasibility of this methodology that, once coupled to tumor imaging and tumor biopsy or cell culture data, should enable prediction of tumor growth and therapy outcome through quantification of the relation between the underlying dynamics and morphological characteristics. In particular, morphologic stability analysis of this mathematical model reveals that tumor cell patterning at the tumor-host interface is regulated by cell proliferation, adhesion and other phenotypic characteristics: histopathology information of tumor boundary can be inputted to the mathematical model and used as a phenotype-diagnostic tool to predict collective and individual tumor cell invasion of surrounding tissue. This approach further provides a means to deterministically test effects of novel and hypothetical therapy strategies on tumor behavior.
Resumo:
Repressor element 1 (RE1)-silencing transcription factor (REST)/neuron-restrictive silencer factor (NRSF) can repress several terminal neuronal differentiation genes by binding to a specific DNA sequence (RE1/neuron-restrictive silencer element [NRSE]) present in their regulatory regions. REST-VP16 binds to the same RE1/NRSE, but activates these REST/NRSF target genes. However, it is unclear whether REST-VP16 expression is sufficient to cause formation of functional neurons either from neural stem cells or from heterologous stem cells. Here we show that the expression of REST-VP16 in myoblasts grown under muscle differentiation conditions blocked entry into the muscle differentiation pathway, countered endogenous REST/NRSF-dependent repression, activated the REST/NRSF target genes, and, surprisingly, activated other neuronal differentiation genes and converted the myoblasts to a physiologically active neuronal phenotype. Furthermore, in vitro differentiated neurons produced by REST-VP16-expressing myoblasts, when injected into mouse brain, survived, incorporated into the normal brain, and did not form tumors. This is the first instance in which myoblasts were converted to a neuronal phenotype. Our results suggest that direct activation of REST/NRSF target genes with a single transgene, REST-VP16, is sufficient to activate other terminal neuronal differentiation genes and to override the muscle differentiation pathways, and they suggest that this approach provides an efficient way of triggering neuronal differentiation in myoblasts and possibly other stem cells.
Resumo:
The Mendelian inheritance of genetic mutations can lead to adult-onset cardiovascular disease. Several genetic loci have been mapped for the familial form of Thoracic Aortic Aneurysms (TAA), and many causal mutations have been identified for this disease. Intracranial Aneurysms (ICA) also show linkage heterogeneity, but no mutations have been identified causing familial ICA alone. Here, we characterized a large family (TAA288) with an autosomal dominant pattern of inherited aneurysms. It is intriguing that female patients predominantly present with ICA and male patients predominantly with TAA in this family. To identify a causal mutation in this family, a genome-wide linkage analysis was previously performed on nine members of this family using the 50k GenChips Hind array from Affymetrix. This analysis eventually identified a single disease-segregating locus, on chromosome 5p15. We build upon this previous analysis in this study, hypothesizing that a genetic mutation inherited in this locus leads to the sex-specific phenotype of TAA and ICA in this family First we refined the boundaries of the 5p15 disease linked locus down to the genomic coordinates 5p15: 3,424,465- 6,312,925 (GRCh37/hg19 Assembly). This locus was named the TAA288 critical interval. Next, we sequenced candidate genes within the TAA288 critical interval. The selection of genes was simplified by the relatively small number of well-characterized genetic elements within the region. Seeking novel or rare disease-segregating variants, we initially observed a single point alteration in the metalloproteinase gene ADAMTS16 fulfilling this criteria. This variant was later classified as a low-frequency population polymorphism (rs72647757), but we continued to explore the potential role of the ADAMTS16 as the cause of disease in TAA288. We observed that fibroblasts cultured from TAA288 patients consistently upregulated the expression of this gene more strongly compared to matched control fibroblasts when treated with the cytokine TGF-β1, though there was some variation in the exact nature of this expression. We also observed evidence that this protein is expressed at elevated levels in aortic aneurysm tissue from patients with mutations in the gene TGFBR2 and Marfan syndrome, shown by immunohistochemical detection of this protein.
Resumo:
Our recent studies have shown that the FoxM1B transcription factor is overexpressed in human glioma tissues and that the level of its expression correlates directly with glioma grade. However, whether FoxM1B plays a role in the early development of glioma (i.e., in transformation) is unknown. In this study, we found that the FoxM1B molecule causes cellular transformation and tumor formation in normal human astrocytes (NHA) immortalized by p53 and pRB inhibition. Moreover, brain tumors that arose from intracranial injection of FoxM1B-expressing immortalized NHAs displayed glioblastoma multiforme (GBM) phenotypes, suggesting that FoxM1B overexpression in immortalized NHAs not only transforms the cells but also leads to GBM formation. Mechanistically, our results showed that overexpression of FoxM1B upregulated NEDD4-1, an E3 ligase that mediates the degradation and downregulation of phosphatase and tensin homologue (PTEN) in multiple cell lines. Decreased PTEN in turn resulted in the hyperactivation of Akt, which led to phosphorylation and cytoplasmic retention of FoxO3a. Blocking Akt activation with phosphoinositide 3-kinase/Akt inhibitors inhibited the FoxM1B-induced transformation of immortalized NHAs. Furthermore, overexpression of FoxM1B in immortalized NHAs increased the expression of survivin, cyclin D1, and cyclin E, which are important molecules for tumor growth. Collectively, these results indicate that overexpression of FoxM1B, in cooperation with p53 and pRB inhibition in NHA cells, promotes astrocyte transformation and GBM formation through multiple mechanisms.
Resumo:
Nonsyndromic cleft lip with or without cleft palate (NSCLP) is a common birth anomaly that requires prolonged multidisciplinary rehabilitation. Although variation in several genes has been identified as contributing to NSCLP, most of the genetic susceptibility loci have yet to be defined. To identify additional contributory genes, a high-throughput genomic scan was performed using the Illumina Linkage IVb Panel platform. We genotyped 6008 SNPs in nine non-Hispanic white NSCLP multiplex families and a single large African-American NSCLP multiplex family. Fourteen chromosomal regions were identified with LOD>1.5, including six regions not previously reported. Analysis of the data from the African-American and non-Hispanic white families revealed two likely chromosomal regions: 8q21.3-24.12 and 22q12.2-12.3 with LOD scores of 2.98 and 2.66, respectively. On the basis of biological function, syndecan 2 (SDC2) and growth differentiation factor 6 (GDF6) in 8q21.3-24.12 and myosin heavy-chain 9, non-muscle (MYH9) in 22q12.2-12.3 were selected as candidate genes. Association analyses from these genes yielded marginally significant P-values for SNPs in SDC2 and GDF6 (0.01
Resumo:
Gene silencing due to epigenetic mechanisms shows evidence of significant contributions to cancer development. We hypothesis that the genetic architecture based on retrotransposon elements surrounding the transcription start site, plays an important role in the suppression and promotion of DNA methylation. In our investigation we found a high rate of SINE and LINEs retrotransposon elements near the transcription start site of unmethylated genes when compared to methylated genes. The presence of these elements were positively associated with promoter methylation, contrary to logical expectations, due to the malicious effects of retrotransposon elements which insert themselves randomly into the genome causing possible loss of gene function. In our genome wide analysis of human genes, results suggested that 22% of the genes in cancer were predicted to be methylation-prone; in cancer these genes are generally down-regulated and function in the development process. In summary, our investigation validated our hypothesis and showed that these widespread genomic elements in cancer are highly associated with promoter DNA methylation and may further participate in influencing epigenetic regulation.
Resumo:
Nephroblastoma or Wilms' tumor is a pediatric renal malignancy that is the most frequently occurring childhood solid tumor. Approximately 1-2% of children with Wilms' tumor also present with aniridia, a congenital absence of all or part of the iris of the eye. These children also have high rates of genitourinary anomalies and mental retardation resulting in what is called the WAGR (Wilms' tumor, aniridia, genitourinary anomaly, mental retardation) syndrome. Cytogenetic analysis of metaphase chromosomes from these patients revealed a consistent deletion of band P13 on chromosome 11. These observations suggest close physical linkage between the disease-related loci, and further imply that development of each phenotype results from the loss of normal gene function.^ The objective of this work is to understand the molecular events at chromosome band 11p13 that are essential to the development of sporadic Wilms' tumor and sporadic aniridia. Two human/hamster somatic cell hybrids have been used to identify sixteen independent DNA probes that map to this segment of the human genome. These newly identified DNA probes and four previously reported probes (CAT, FSHB, D11S16, and HBVIS) have been used to subdivide 11p13 into five intervals defined by overlapping constitutional deletions from several WAGR patients. A long-range physical map of 11p13 has been constructed using each of these probes in Southern blot analysis of genomic DNA after digestion with infrequently cutting restriction enzymes and pulse-field gel electrophoresis. This map, established primarily with MluI and NotI, spans approximately 13 $\times$ 10$\sp{6}$ bp and encompasses deletion and translocation breakpoints associated with genitourinary anomalies, aniridia, and sporadic Wilms' tumor. This complete physical map of human chromosome band 11p13 enables us to localize the genes for sporadic Wilms' tumor and sporadic aniridia to a small number of specific NotI fragments. ^
Resumo:
Cmd4 is a colcemid-sensitive CHO cell line that is temperature sensitive for growth and expresses an altered $\beta$-tubulin, $\beta\sb1$. One revertant of this cell line, D2, exhibits a further alteration in $\beta\sb1$ resulting in an acidic shift in its isoelectric point and a decrease in its molecular weight to 40 kD, as measured by two dimensional gel electrophoresis. This $\beta$-tubulin variant has been shown to be assembly-defective and unstable. Characterization of the mutant $\beta\sb1$ in D2 by high pressure liquid chromatography (HPLC) revealed the loss of methionine containing tryptic peptides 7,8,9, and 10. Southern analysis of the genomic DNA digested with several different restriction enzymes resulted in the appearance of new restriction fragments 250 base pairs shorter than the corresponding fragments from the wild-type $\beta\sb1$-tubulin gene. Northern analysis on mRNA from D2 revealed two new message products that also differed by 250 bases from the corresponding wild type $\beta$-tubulin transcripts. To precisely define the region of the alteration, cloning and sequencing of the mutant and wild type genomic $\beta$-tubulin genes were conducted. A size-selected EcoRI genomic library was prepared using the Stratagene lambda Zap II phage cloning system. Using subclones of CHO $\beta$-tubulin cDNA as probes, a 2.5 kb wild type clone and a 2.3 kb mutant clone were identified from this library. Each of these was shown to contain a portion of the gene extending from intron 3 through the end of the coding sequence in exon 4 and into the 3$\sp\prime$ untranslated region on the basis of alignment with the published human $\beta$-tubulin sequence. Sequencing of the mutant 2.3 kb clone revealed that the mutation is due to a 246 base pair internal deletion in exon 4 (base pair 756-1001) that encodes amino acids 253-334. This deletion results in the loss of a putative binding site for GTP which could potentially explain the phenotype of this mutant $\beta$-tubulin. Also sequence comparison of the 3$\sp\prime$ untranslated region between different species revealed the conservation of 200 base pairs with 78% homology. It is proposed that this region could play an important role in the regulation of $\beta$-tubulin gene expression. ^
Resumo:
I have cloned cDNAs corresponding to two distinct genes, Xlmf1 and Xlmf25, which encode skeletal muscle-specific, transcriptional regulatory proteins. These proteins are members of the helix-loop-helix family of DNA binding factors, and are most homologous to MyoD1. These two genes have disparate temporal expression patterns during early embryogenesis; although, both transcripts are present exclusively in skeletal muscle of the adult. Xlmf1 is first detected 7 hours after fertilization, shortly after the midblastula transition. Xlmf25 is detected in maternal stores of mRNA, during early cleavage stages of the embryo and throughout later development. Both Xlmf1 and Xlmf25 transcripts are detected prior to the expression of other, previously characterized, muscle-specific genes. The ability of Xlmf1 and Xlmf25 to convert mouse 10T1/2 fibroblasts to a myogenic phenotype demonstrates their activity as myogenic regulatory factors. Additionally, Xlmf1 and Xlmf25 can directly transactivate a reporter gene linked to the muscle-specific, muscle creatine kinase (MCK) enhancer. The functional properties of Xlmf1 and Xlmf25 proteins were further explored by investigating their interactions with the binding site in the MCK enhancer. Analysis of dissociation rates revealed that Xlmf25-E12 dimers had a two-fold lower avidity for this site than did Xlmf1-E12 dimers. Clones containing genomic sequence of Xlmf1 and Xlmf25 have been isolated. Reporter gene constructs containing a lac-z gene driven by Xlmf1 regulatory sequences were analyzed by embryo injections and transfections into cultured muscle cells. Elements within $-$200 bp of the transcription start site can promote high levels of muscle specific expression. Embryo injections show that 3500 bp of upstream sequence is sufficient to drive somite specific expression. EMSAs and DNAse I footprint analysis has shown the discrete interaction of factors with several cis-elements within 200 bp of the transcription start site. Mutation of several of these elements shows a positive requirement for two CCAAT boxes and two E boxes. It is evident from the work performed with this promoter that Xlmf1 is tightly regulated during muscle cell differentiation. This is not surprising given the fact that its gene product is crucial to the determination of cell fate choices. ^
Resumo:
The initial step in coronavirus-mouse hepatitis virus (MHV) replication is the synthesis of negative strand RNA from a positive strand genomic RNA template. Our approach to studying MHV RNA replication is to identify the cis-acting signals for RNA synthesis and the protein(s) which recognizes these signals at the 3$\sp\prime$ end of genomic RNA of MHV. To determine whether host cellular and/or virus-specific proteins interact with the 3$\sp\prime$ end of the coronavirus genome, an RNase T$\sb1$ protection/gel mobility shift electrophoresis assay was used to examine cytoplasmic extracts from either mock- or MHV-JHM-infected 17Cl-1 murine cells for the ability to form complexes with defined regions of the genomic RNA. A conserved 11 nucleotide sequence UGAAUGAAGUU at nucleotide positions 36 to 26 from the 3$\sp\prime$ end of genomic RNA was identified to be responsible for the specific binding of host proteins, by using a series of RNA probes with deletions and mutations in this region. The RNA probe containing the 11 nucleotide sequence bound approximately four host cellular proteins with a highly labeled 120 kDa and three minor species with sizes of 103, 81 and 55 kDa, assayed by UV-induced covalent cross-linking. Mutation of the 11 nucleotide motif strongly inhibited cellular protein binding, and decreased the amount of the 103 and 81 kDa proteins in the complex to undetectable levels and strongly reduced the binding of the 120 kDa protein. Less extensive mutations within this 11 nucleotide motif resulted in variable decreases in RNA-protein complex formation depending on each probe tested. The RNA-protein complexes observed with cytoplasmic extracts from MHV-JHM-infected cells in both RNase protection/gel mobility shift and UV cross-linking assays were indistinguishable to those observed with extracts from uninfected cells.^ To investigate the possible role of this 3$\sp\prime$ protein binding element in viral RNA replication in vivo, defective interfering RNA molecules with complete or partial mutations of the 11 nucleotide conserved sequence were transcribed in vitro, transfected to host 17Cl-1 cells in the presence of helper virus MHV-JHM and analyzed by agarose gel electrophoresis, competitive RT-PCR and direct sequencing of the RT-PCR products. Both negative strand synthesis and positive strand replication of DI RNA were affected by mutation that disrupts RNA-protein complex formation, even though the 11 mutated nucleotides were converted to wild type sequence, presumably by recombination with helper virus. Kinetic analysis indicated that recombination between DI RNA and helper virus occurred 5.5 to 7.5 hours post infection when replication of positive strand DI RNA was barely observed. Replication of positive strand DI RNAs carrying partial mutations within the 11 nucleotide motif was dependent upon recombination events after transfection. Replication was strongly inhibited when reversion to wild type sequence did not occur, and after recombination, reached similar levels as wild type DI RNA. A DI RNA with mutation upstream of the protein binding motif replicated as efficiently as wild type without undergoing recombination. Thus the conserved 11 nucleotide host protein binding motif appears to play an important role in viral RNA replication. ^