995 results for DNA-CLONING
Summary:
Background: The CACTA (also called En/Spm) superfamily of DNA-only transposons, which contain the core sequence CACTA in their Terminal Inverted Repeats (TIRs), has so far been described only in plants. Large transcriptome and genome sequence data have recently become publicly available for Schistosoma mansoni, a digenetic blood fluke that is a major causative agent of schistosomiasis in humans, and have provided a comprehensive repository for the discovery of novel genes and repetitive elements. Despite the extensive description of retroelements in S. mansoni, just a single DNA-only transposon, belonging to the Merlin family, has so far been reported in this organism. Results: We describe a novel S. mansoni transposon named SmTRC1, for S. mansoni Transposon Related to CACTA 1, an element that shares several characteristics with plant CACTA transposons. Southern blotting indicates approximately 30–300 copies of SmTRC1 in the S. mansoni genome. Using genomic PCR followed by cloning and sequencing, we amplified and characterized a full-length and a truncated copy of this element. RT-PCR using S. mansoni mRNA followed by cloning and sequencing revealed several alternatively spliced transcripts of this transposon, resulting in distinct ORFs coding for different proteins. Interestingly, a survey of complete genomes from animals and fungi revealed several other novel TRC elements, indicating new families of DNA transposons belonging to the CACTA superfamily that have not previously been reported in these kingdoms. The first three bases in the S. mansoni TIR are CCC, and they are identical to those in the TIRs of the insects Aedes aegypti and Tribolium castaneum, suggesting that animal TRCs may display a CCC core sequence.
Conclusion: The DNA-only transposable element SmTRC1 from S. mansoni exhibits various characteristics, such as generation of multiple alternatively spliced transcripts, the presence of terminal inverted repeats at the extremities of the elements flanked by direct repeats, and the presence of a Transposase_21 domain, that suggest a distant relationship to CACTA transposons from Magnoliophyta. Several sequences from other Metazoa and Fungi code for proteins similar to those encoded by SmTRC1, suggesting that such elements have a common ancestry, and indicating inheritance through vertical transmission before separation of the Eumetazoa, Fungi and Plants.
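The terminal inverted repeats mentioned in this abstract can be illustrated with a minimal Python sketch. It assumes a perfect (mismatch-free) repeat, and the toy sequence below is invented for illustration; it is not the SmTRC1 sequence. The function names `reverse_complement` and `tir_length` are hypothetical helpers, not taken from the study.

```python
def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T, N only)."""
    complement = {"A": "T", "C": "G", "G": "C", "T": "A", "N": "N"}
    return "".join(complement[base] for base in reversed(seq.upper()))


def tir_length(element: str, max_len: int = 50) -> int:
    """Length of the longest perfect terminal inverted repeat of an element.

    The 5' end forms a TIR with the 3' end when it equals the reverse
    complement of the 3' end, which is the same as saying that the element
    and its own reverse complement share a common prefix.
    """
    element = element.upper()
    rc = reverse_complement(element)
    limit = min(max_len, len(element) // 2)
    n = 0
    while n < limit and element[n] == rc[n]:
        n += 1
    return n


# Invented toy element: a 9-base TIR starting with CACTA, an 8-base
# internal region, and the reverse complement of the TIR at the 3' end.
toy_element = "CACTACAAG" + "GATTACAG" + "CTTGTAGTG"
print(tir_length(toy_element))  # 9
```

Real elements usually carry imperfect repeats, so a practical search would tolerate mismatches; this sketch deliberately does not.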
Summary:
Background: The CHD7 (Chromodomain Helicase DNA binding protein 7) gene encodes a member of the chromodomain family of ATP-dependent chromatin remodeling enzymes. Mutations in the CHD7 gene are found in individuals with CHARGE, a syndrome characterized by multiple birth malformations in several tissues. CHD7 was identified as a binding partner of the PBAF complex (Polybromo and BRG Associated Factor containing complex), playing a central role in the transcriptional reprogramming process associated with the formation of multipotent migratory neural crest, a transient cell population associated with the genesis of various tissues. CHD7 is a large gene containing 38 annotated exons and spanning 200 kb of genomic sequence. Although genes containing such a number of exons are expected to have several alternative transcripts, there is very little evidence of alternative transcripts associated with CHD7 to date, indicating that alternative splicing of this gene is poorly characterized. Findings: Here, we report the cloning and characterization, by experimental and computational studies, of a novel alternative transcript of human CHD7 (named CHD7 CRA_e), which lacks most of its coding exons. We confirmed by overexpression of the CHD7 CRA_e alternative transcript that it is translated into a protein isoform lacking most of the domains displayed by the canonical isoform. Expression of the CHD7 CRA_e transcript was detected in normal liver, in addition to the DU145 human prostate carcinoma cell line from which it was originally isolated. Conclusions: Our findings indicate that the splicing event associated with the CHD7 CRA_e alternative transcript is functional. The characterization of the novel CHD7 CRA_e isoform presented here not only sets the basis for more detailed functional studies of this isoform but also contributes to the alternative splicing annotation of the CHD7 gene and to the design of future functional studies aimed at elucidating the molecular functions of its gene products.
Summary:
The corpus luteum (CL) is a temporary organ involved in the maintenance of pregnancy. In the course of its life-cycle, the CL undergoes two distinct and consecutive processes for its inevitable removal through apoptosis: functional and structural luteolysis. We isolated a gene encoding a novel rat zinc finger protein (ZFP), named rat ZFP96 (rZFP96), from an ovarian lambda cDNA library. Sequence analysis revealed close sequence and structural similarity to mouse ZFP96 and human zinc finger protein 305 (ZNF305). Quantitative reverse transcription-polymerase chain reaction analysis revealed a positive correlation with the end of pregnancy, that is, the onset of structural luteolysis of the CL. Messenger RNA levels increased 3-fold (P < 0.01) between days 13 and 22 of pregnancy and 8-fold (P < 0.01) between day 13 of pregnancy and day 1 post-partum. In addition, we detected rZFP96 expression in mammary tissue, placenta, heart, kidney and skeletal muscle. Sequence analysis predicted that rZFP96 has a high probability of localizing to the nuclear compartment. The presence of both a perfect consensus TGEKP linker sequence between zinc fingers 2 and 3 and several similar sequences between the other zinc fingers suggests physical interaction with DNA. Speculatively, rZFP96 may therefore function as a transcription factor, switching off pro-survival genes and/or upregulating pro-apoptotic genes and thereby contributing to the demise of the CL.
Summary:
Human × rodent somatic cell hybrids have played an important role in human genetics research. They have been especially useful for assigning genes to chromosomes and isolating DNA markers from specific regions of the human genome.
By employing a combination of somatic cell genetic, recombinant DNA, and cytogenetic techniques, the human DNA excision repair gene ERCC4 was mapped regionally to human 16p13.13-13.2, even though the gene has not been cloned. Human × Chinese hamster ovary (CHO) cell hybrids selected for human ERCC4 activity and containing 16p13.1-p13.3 as the only human genetic material were identified. These hybrids were used to order DNA markers located in 16p13.1-p13.3. New DNA markers physically close to ERCC4 were isolated from such hybrids. Using amplified human DNA from the hybrids as probe in fluorescent in situ hybridization, the short arm breakpoint in the chromosome 16 inversion associated with acute myelomonocytic leukemia (AMML) was found to be physically close to the ERCC4 gene. The physical mapping and, eventually, the cloning of the ERCC4 gene will benefit the understanding of the DNA repair system and the study of other important biomedical problems such as tumorigenesis.
To facilitate the cloning of the ERCC4 gene and, in general, the cloning of genes from any defined region of the human genome, a method was developed for the direct isolation of human transcribed genes from somatic cell hybrids. cDNA was prepared from human × rodent hybrids by using consensus 5′ splice site sequences as primers. These primers were designed to select immature, unspliced messenger RNA (still retaining species-specific repeat sequences) as templates. Screening of a derived cDNA library for human repeat sequences resulted in the isolation of human clones at the anticipated frequency with characteristics expected of exons of transcribed human genes. The usefulness of the splice site specific primers was analyzed and the cDNA synthesis conditions with these primers were optimized. The procedure was shown to be sensitive enough to clone weakly expressed genes. Studying the expression of the represented genes with the isolated clones was shown to be feasible. Such region-specific human gene fragments will be very valuable for many human genetic studies, such as the search for inherited disease genes and the construction of a cDNA map of the human genome.
Summary:
Aniridia (AN) is a congenital, panocular disorder of the eye characterized by the complete or partial absence of the iris. The disease occurs in both sporadic and familial forms; the familial form is inherited as an autosomal dominant trait with high penetrance. The objective of this study was to isolate and characterize the genes involved in AN and Sey, and thereby to gain a better understanding of the molecular basis of the two disorders.
Using a positional cloning strategy, I have approached and cloned from the AN locus in human chromosomal band 11p13 a cDNA that is deleted in two patients with AN. The deletions in these patients overlap by about 70 kb and encompass the 3′ end of the cDNA. This cDNA detects a 2.7 kb mRNA encoded by a transcription unit estimated to span approximately 50 kb of genomic DNA. The message is specifically expressed in all tissues affected in all forms of AN, namely within the presumptive iris, lens, neuroretina, the superficial layers of the cornea, the olfactory bulbs, and the cerebellum. Sequence analysis of the AN cDNA revealed a number of motifs characteristic of certain transcription factors. Chief among these are the paired domain, the homeodomain, and a carboxy-terminal domain rich in serine, threonine and proline residues. The overall structure shows high homology to the Drosophila segmentation gene paired and members of the murine Pax family of developmental control genes.
Utilizing a conserved human genomic DNA sequence as probe, I was able to isolate an embryonic murine cDNA which is over 92% homologous in nucleotide sequence and virtually identical at the amino acid level to the human AN cDNA. The expression pattern of the murine gene is the same as that in man, supporting the conclusion that it probably corresponds to the Sey gene. Its specific expression in the neuroectodermal component of the eye and in glioblastomas, but not in the neural crest-derived PC12 pheochromocytoma cell line, suggests that a defect in neuroectodermal rather than mesodermal development might be the common etiological factor underlying AN and Sey.
Summary:
Expression of the differentiated skeletal muscle phenotype is a process that appears to occur in at least two stages. First, pluripotent stem cells become committed to the myogenic lineage. Although undifferentiated and capable of continued proliferation, determined myoblasts are restricted to a single developmental fate. Upon receiving the appropriate environmental signals, these determined myoblasts withdraw from the cell cycle, fuse to form multinucleated myotubes, and begin to express a battery of muscle-specific gene products that make up the functional and contractile apparatus of the muscle. This project is aimed at the identification and characterization of factors that control the determination and differentiation of myogenic cells. We have cloned a cDNA, called myogenin, that plays an important role in these processes. Myogenin is expressed exclusively in skeletal muscle in vivo and myogenic cell lines in vitro. Its expression is sharply upregulated during differentiation. When constitutively expressed in fibroblasts, myogenin converts these cells to the myogenic lineage. Transfected cells behave as myogenic tissue culture cells with respect to the genes they express and the way they respond to environmental cues, and they are capable of fusing to form multinucleated myotubes. Sequence analysis showed that this cDNA has homology to a family of transcription factors in a region of 72 amino acids known as the basic helix-loop-helix motif. This domain appears to mediate binding to a DNA sequence element known as an E-box (CANNTG), which is essential for the activity of the enhancers of many muscle-specific genes.
Analysis of myogenin in tissue culture cells showed that its expression is responsive to many of the environmental cues, such as the presence of growth factors and oncogenes, that modulate myogenesis. In an attempt to identify the cis- and trans-elements that control myogenin expression, and thereby understand what factors are responsible for the establishment of the myogenic lineage, we have cloned the myogenin gene. After analysis of the gene structure, we constructed a series of reporter constructs from the 5′ upstream sequence of the myogenin gene to determine which cis-acting sequences might be important in myogenin regulation. We found that 184 nucleotides of the 5′ sequence were sufficient to direct high-level muscle-specific expression of the reporter gene. Two sequence elements present in the 184-nucleotide fragment, an E-box and a MEF-2 site, have been shown previously to be important in muscle-specific transcription. Mutagenesis of these sites revealed that both are necessary for full activity of the myogenin promoter, and suggests that a complex hierarchy of transcription factors controls myogenic differentiation.
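As a rough illustration of how the E-box consensus CANNTG named in this abstract can be located in a sequence, here is a minimal Python sketch. The promoter string is an invented toy example, not the myogenin 5′ sequence, and `find_eboxes` is a hypothetical helper name.

```python
import re

# CANNTG, with N standing for any of the four bases. The lookahead lets
# overlapping matches be reported as well.
EBOX_PATTERN = re.compile(r"(?=(CA[ACGT]{2}TG))")


def find_eboxes(sequence: str) -> list[tuple[int, str]]:
    """Return (0-based start, matched site) for every CANNTG occurrence."""
    return [(m.start(), m.group(1)) for m in EBOX_PATTERN.finditer(sequence.upper())]


# Invented toy sequence containing two E-box-like sites.
toy_promoter = "GGCAGCTGAACACCTGTT"
print(find_eboxes(toy_promoter))  # [(2, 'CAGCTG'), (10, 'CACCTG')]
```

Because the reverse complement of CANNTG is again CANNTG, scanning one strand is enough to find sites on both. A sequence match only flags a candidate site; whether it is functional is what the mutagenesis described above tests.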
Summary:
The Mixed Function Oxidase System metabolizes a wide range of biochemicals including drugs, pesticides and steroids. Cytochrome P450 reductase is a key enzymatic component of this system, supplying reducing equivalents from NADPH to cytochrome P450. The electrons are shuttled through reductase via two flavin moieties: FAD and FMN. Although the exact mechanism of flavin action is not known, the enzymatic features of reductase greatly depleted of either FMN or FAD have been characterized. Additionally, flavin location within reductase has been proposed by homology and chemical modification studies. This study seeks to extend the flavin depletion analysis in a more controlled system by eliminating the proposed FMN binding domain with recombinant DNA techniques and biochemical analysis. Two P450 reductase cDNA clones containing only the FMN and NADPH binding domain were isolated and expressed, and the protein products were purified and analysed. This study confirms the proposed FAD binding site and the role of FAD in the electron shuttling pathway, and provides new methods to study the FAD binding domain.
Summary:
Cell differentiation is associated with activation of cell lineage-specific genes. The LpS1β gene of Lytechinus pictus is activated at the late cleavage stage. LpS1β transcripts accumulate exclusively in aboral ectoderm lineages. Previous studies demonstrated two G-string DNA elements, the proximal and distal G-strings, which bind to an ectoderm-enriched nuclear factor. In order to define the cis-elements which control positive expression of the LpS1β gene, the regulatory region from -108 to +17 bp of the LpS1β gene promoter was characterized. The ectoderm G-string factor binds to a G/C-rich region larger than the G-string itself, and the binding of the G-string factor requires sequences immediately downstream from the G-string. These downstream sequences are essential for full promoter activity. In addition, only 108 bp of LpS1β 5′ flanking DNA drives LpS1β gene expression in aboral ectoderm/mesenchyme cells. Therefore, for positive control of LpS1β gene expression, two regions of 5′ flanking DNA are required: region I, from base pairs -762 to -511, and region II, which includes the G/C-rich element, from base pairs -108 to -61. A mesenchyme cell repressor element is located within region I.
DNA-binding proteins play key roles in determination of cell differentiation. The zinc finger domain is a DNA-binding domain present in many transcription factors. Based on homologies in zinc fingers, a zinc finger-encoding gene, SpKrox-1, was cloned from S. purpuratus. The putative SpKrox-1 protein has all the structural characteristics of a transcription factor: four zinc fingers for DNA binding; an acidic domain for transactivation; a basic domain for nuclear targeting; and a leucine zipper for dimerization. SpKrox-1 RNA transcripts showed a transient expression pattern which correlates largely with early embryonic development. The spatial expression of SpKrox-1 mRNA was distributed throughout the gastrula and larva ectodermal wall. However, SpKrox-1 was not expressed in pigment cells. The SpKrox-1 gene is thus a marker of a subset of SMCs or ectoderm cells. The structural features, and the transient temporal and restricted spatial expression patterns, suggest that SpKrox-1 plays a role in a specific developmental event.
Summary:
An important question in biology is to understand the role of specific gene products in regulating embryogenesis and cellular differentiation. Many of the regulatory proteins possess specific motifs, such as the homeodomain, basic helix-loop-helix structure, zinc finger, and leucine zipper. These sequence motifs participate in specific protein-DNA, protein-RNA, and protein-protein interactions, and are important for the function of these regulatory proteins.
The human rfp (ret finger protein) belongs to a novel zinc finger protein family, the B box zinc finger family. Most of the B box proteins, including rfp, have a conserved tripartite motif, consisting of two novel zinc fingers (the RING finger and the B box) and a coiled-coil domain. Interestingly, a fusion protein between the tripartite motif of rfp and the tyrosine kinase domain of c-ret has transforming activity. In this study, we examined the expression of rfp during mouse development, and characterized the role of the tripartite motif in rfp function.
We cloned the mouse rfp cDNA, which shares 98.4% homology with the human sequence at the amino acid level. Such a strikingly high degree of homology indicates high evolutionary pressure on the conservation of the sequence, suggesting that rfp may have an important function. Using the somatic cell hybrid system, we assigned the rfp gene to mouse chromosome 13 and human chromosome 6. Rfp transcripts and protein were ubiquitous in day 10.5-13.5 mouse embryos; however, they were restricted in adult mice, with the highest level of expression in the testis. Rfp expression in the testis is detected only in late pachytene spermatocytes and round spermatids. In both embryos and spermatogenic cells, rfp protein was distributed within cell nuclei in a punctate pattern, similar to the PODs (PML oncogenic domains) observed with another B box protein, PML. In cultured mammalian cells, we found that rfp was indeed co-localized to the PODs with PML. Using the yeast two-hybrid system, we showed that rfp could specifically interact with PML, and that the interaction was dependent on the distal portion of the rfp coiled-coil domain.
We also showed that rfp could form homodimers, and that both the B box and the coiled-coil domain were required for proper dimerization. It seems that the proximal portion of the coiled-coil domain provides the interacting interface, while the B box zinc finger orients the coil and maintains the correct structure of the whole molecule. Our data are consistent with the zinc-binding property and structural analysis of the B box. The RING finger seems to be involved in rfp nuclear localization through interaction with other proteins. We believe that homodimerization and interaction with PML are important for the normal interaction of rfp during development and differentiation. In addition, rfp homodimerization may also be essential for the oncogenic activation of the rfp-ret fusion protein.
Summary:
A 14-kDa outer membrane protein (OMP) was purified from Actinobacillus pleuropneumoniae serotype 2. The protein strongly reacts with sera from pigs experimentally or naturally infected with any of the 12 serotypes of A. pleuropneumoniae. The gene encoding this protein was isolated from a gene library of A. pleuropneumoniae serotype 2 reference strain by immunoscreening. Expression of the cloned gene in Escherichia coli revealed that the protein is also located in the outer membrane fraction of the recombinant host. DNA sequence analysis of the gene reveals high similarity of the protein's amino acid sequence to that of the E. coli peptidoglycan-associated lipoprotein PAL, to the Haemophilus influenzae OMP P6 and to related proteins of several other Gram-negative bacteria. We have therefore named the 14-kDa protein PalA, and its corresponding gene, palA. The 20 amino-terminal amino acid residues of PalA constitute a signal sequence characteristic of membrane lipoproteins of prokaryotes with a recognition site for the signal sequence peptidase II and a sorting signal for the final localization of the mature protein in the outer membrane. The DNA sequence upstream of palA contains an open reading frame which is highly similar to the E. coli tolB gene, indicating a gene cluster in A. pleuropneumoniae which is very similar to the E. coli tol locus. The palA gene is conserved and expressed in all A. pleuropneumoniae serotypes and in A. lignieresii. A very similar palA gene is present in A. suis and A. equuli.
Summary:
Retroviruses are RNA viruses that replicate through a double-stranded DNA intermediate. The viral enzyme reverse transcriptase copies the retroviral genomic RNA into this DNA intermediate through the process of reverse transcription. Many variables can affect the fidelity of reverse transcriptase during reverse transcription, including specific sequences within the retroviral genome.
Previous studies have observed that multiple cloning sites (MCS) and sequences predicted to form stable hairpin structures are hotspots for deletion during retroviral replication. The studies described in this dissertation were performed to elucidate the variables that affect the stability of MCS and hairpin structures in retroviral vectors. Two series of retroviral vectors were constructed and characterized in these studies.
Spleen necrosis virus-based vectors were constructed containing separate MCS insertions of varying length, orientation, and symmetry. The only MCS that was a hotspot for deletion formed a stable hairpin structure. Upon more detailed study, the MCS previously reported as a hotspot for deletion was found to contain a tandem linker insertion that formed a hairpin structure. Murine leukemia virus-based vectors were constructed containing separate sequence insertions of either inverted repeat symmetry (122IR) that could form a hairpin structure, or little symmetry (122c) that would form a less stable structure. These insertions were made into either the neomycin resistance marker (neo) or the hygromycin resistance marker (hyg) of the vector. 122c was stable in both neo and hyg, while 122IR was preferentially deleted in neo and was remarkably unstable in hyg.
These results suggest that MCS are hotspots for deletion in retroviral vectors if they can form hairpin structures, and that hairpin structures can be highly unstable at certain locations in retroviral vectors. This information may contribute to improved design of retroviral vectors for such uses as human gene therapy, and will contribute to a greater understanding of the basic science of retroviral reverse transcription.
Summary:
Genetic analyses, both karyotyping and comparative genomic hybridization, of prostate cancer cell lines and specimens have revealed multiple areas of concordant increases in DNA content. An increase of DNA in specific regions of the genome in cancer is often associated with the amplification of oncogenes. Based on these observations, we have hypothesized that oncogenes are involved in the initiation or progression of prostate cancer. An expression cloning approach was utilized to identify candidate oncogenes in prostate cancer.
A full-length, unidirectional cDNA expression library was constructed from DU145 prostate cancer cells. The cDNA library was screened using CP12, a rat prostate epithelial cell line. In soft agarose assays, CP12 cells (parental or vector-transfected) do not form colonies. However, upon the introduction of a number of known oncogenes, CP12 becomes anchorage independent in soft agarose. Based on this in vitro phenotypic shift, a DU145 cDNA library was stably transfected into CP12, and cells were selected for anchorage independence. Two hundred fifty-nine anchorage-independent clones were isolated. Some colonies contained more than one insert, bringing the candidate oncogene pool to approximately 400. Seven inserts were sequenced at random. Using the sequences obtained, GenBank was screened, and matches were found with p53, PARG1, a mitochondrial ATPase, RNF6, and three unknown genes that mapped to Unigene clusters. As the pool of cDNA inserts appeared promising, overexpressed genes were further selected. From 259 clones, 17 clones were overexpressed more than 6-fold in DU145 compared to normal prostate. From the 17 clones, 12 cDNA inserts were strongly expressed in DU145 and were isolated for sequencing.
Two of the sequences, 1G6 and 3E9, were identical. Expression of 1G6/2G9/3E9 was tested by RT-PCR. 1G6/2G9/3E9 was not expressed in normal prostate, but was expressed in all prostate cancer cell lines tested as well as six prostate cancer samples. When retransfected into CP12, 1G6/2G9/3E9 induced the formation of foci and anchorage-independent colonies. Thus, functional and expression data suggest that 1G6/2G9/3E9 may be a prostate cancer oncogene.
Summary:
Retinitis pigmentosa (RP) is a genetically heterogeneous group of retinal degenerations that affects over one million people worldwide. To date, 11 autosomal dominant, 13 autosomal recessive, and 5 X-linked forms of retinitis pigmentosa have been identified through linkage analysis, but the disease-causing genes and mutations have been found for only half of these loci. My research uses a positional candidate cloning approach to identify the gene and mutations responsible for one type of autosomal dominant retinitis pigmentosa, RP10. The premise is that identifying the genes and mutations responsible for disease will provide insight into disease mechanisms and provide treatment options. Previous research mapped the RP10 locus to a 5 cM region on chromosome 7q31 between markers D7S686 and D7S530. Linkage and fine-point haplotype analysis was used to reduce and refine the RP10 disease interval to a 4 cM region located between D7S2471 and a new marker located 45,000 bp telomeric of D7S461. In order to identify genes located in the RP10 interval, an extensive EST map was created of this region. Five EST clusters from this map were analyzed to determine if mutations in these genes cause the RP10 form of retinitis pigmentosa. The genomic structure of a known metabotropic glutamate receptor, GRM8, was determined first. DNA sequencing of GRM8 in RP10 family members did not identify any disease-causing mutations. Four other EST clusters (A170, A173, A189, and A258) were characterized and determined to be part of the same gene, UBNL1 (ubinuclein-like 1). The full-length mRNA sequence and genomic structure of UBNL1 were determined and then screened in patients. No disease-causing mutations were identified in any of the RP10 family members tested. Recent data made available with the release of the public and Celera genome assemblies indicate that UBNL1 is outside of the RP10 disease region. Despite this complication, characterization of UBNL1 is still important in the understanding of normal visual processes, and it is possible that mutations in UBNL1 could cause other forms of retinopathy. The EST map and list of RP10 candidates will continue to aid others in the search for the RP10 gene and mutations.
Summary:
The protein p53 binding protein one (53BP1) was discovered in a yeast two-hybrid screen that used the DNA binding domain of p53 as bait. Cloning of full-length 53BP1 showed that the protein contains several domains, including two tandem BRCT domains and an amino-terminal serine/glutamine cluster domain (SCD). These two domains are often seen in factors that are involved in the cellular response to DNA damage and the control of cell cycle checkpoints, and we hypothesize that 53BP1 is involved in the cellular response to DNA damage. In support of this hypothesis, we observe that 53BP1 is phosphorylated and undergoes a dramatic nuclear re-localization in response to DNA damaging agents. 53BP1 also interacts with several factors that are important in the cellular response to DNA damage, such as the BRCA1 tumor suppressor, ATM and Rad3 related (ATR), and the phosphorylated version of the histone variant H2AX. Mice deficient in 53BP1 display increased sensitivity to ionizing radiation (IR), a DNA damaging agent that introduces DNA double strand breaks (DSBs). In addition, 53BP1-deficient mice do not properly undergo the process of class switch recombination (CSR). We also observe that when a defect in 53BP1 is combined with a defect in p53, the resulting mice have an increased rate of formation of spontaneous tumors, notably B and T lineage lymphomas. The T lineage tumors arise by two distinct mechanisms: one driven by defects in cell cycle regulation and a second driven by defects in the ability to repair DNA DSBs. The B lineage tumors arise by the inability to repair DNA damage and over-expression of the oncogene c-myc.
With these observations, we conclude that not only does 53BP1 function in the cellular response to DNA damage, but it also works in concert with p53 to suppress tumor formation.
Summary:
Many eukaryotic promoters contain a CCAAT element at a site close (-80 to -120) to the transcription initiation site. CBF (CCAAT Binding Factor), also called NF-Y and CP1, was initially identified as a transcription factor binding to such sites in the promoters of the Type I collagen, albumin and MHC class II genes. CBF is a heteromeric transcription factor, and purification and cloning of two of the subunits, CBF-A and CBF-B, revealed that it was evolutionarily conserved, with striking sequence identities with the yeast polypeptides HAP3 and HAP2, which are components of a CCAAT binding factor in yeast. Recombinant CBF-A and CBF-B, however, failed to bind to DNA containing CCAAT sequences. Biochemical experiments led to the identification of a third subunit, CBF-C, which co-purified with CBF-A and complemented the DNA binding of recombinant CBF-A and CBF-B. We have recently isolated CBF-C cDNAs and have shown that bacterially expressed purified CBF-C binds to CCAAT-containing DNA in the presence of recombinant CBF-A and CBF-B. Our experiments also show that a single molecule of each of the three subunits is present in the protein-DNA complex. Interestingly, CBF-C is also evolutionarily conserved, and the conserved domain between CBF-C and its yeast homolog HAP5 is sufficient for CBF-C activity. Using GST-pulldown experiments, we have demonstrated the existence of a protein-protein interaction between CBF-A and CBF-C in the absence of CBF-B and DNA. CBF-B, on the other hand, requires both CBF-A and CBF-C to form a ternary complex which then binds to DNA. Mutational studies of CBF-A have revealed different domains of the protein which are involved in CBF-C interaction and CBF-B interaction. In addition, CBF-A harbors a domain which is involved in DNA recognition along with CBF-B. Dominant negative analogs of CBF-A have also substantiated our initial observation of assembly of CBF subunits. Our studies define a novel DNA binding structure of heterotrimeric CBF, where the three subunits of CBF follow a particular pathway of assembly that leads to CBF binding to DNA and activating transcription.