916 results for Genome-specific Sequence
Abstract:
Chromosome identification is an essential step in genomic research, which so far has not been possible in oysters. We tested bacteriophage P1 clones for chromosomal identification in the eastern oyster Crassostrea virginica, using fluorescence in situ hybridization (FISH). P1 clones were labeled with digoxigenin-11-dUTP using nick translation. Hybridization was detected with fluorescein-isothiocyanate-labeled anti-digoxigenin antibodies and amplified with 2 layers of antibodies. Nine of the 21 P1 clones tested produced clear and consistent FISH signals when Cot-1 DNA was used as a blocking agent against repetitive sequences. Karyotypic analysis and cohybridization positively assigned the 9 P1 clones to 7 chromosomes. The remaining 3 chromosomes can be separated by size and arm ratio. Five of the 9 P1 clones were sequenced at both ends, providing sequence-tagged sites that can be used to integrate linkage and cytogenetic maps. One sequence is part of the bone morphogenetic protein type 1b receptor, a member of the transforming growth factor superfamily, and mapped to the telomeric region of the long arm of chromosome 2. This study shows that large-insert clones such as P1 are useful as chromosome-specific FISH probes and for gene mapping in oysters.
Abstract:
The x- and y-type high molecular weight (HMW) glutenin subunits are conserved seed storage proteins in wheat and related species. Here we describe investigations on the HMW glutenin subunits from several Pseudoroegneria accessions. The electrophoretic mobilities of the HMW glutenin subunits from Pd. stipifolia, Pd. tauri and Pd. strigosa were much faster than those of orthologous wheat subunits, indicating that their protein size may be smaller than that of wheat subunits. The coding sequence of the Glu-1St1 subunit (encoded by the Pseudoroegneria stipifolia accession PI325181) was isolated, and found to represent the native open reading frame (ORF) by in vitro expression. The deduced amino acid sequence of Glu-1St1 matched that determined from the native subunit by mass spectrometric analysis. The domain organization in Glu-1St1 showed high similarity with that of typical HMW glutenin subunits. However, Glu-1St1 exhibited several distinct characteristics. First, its repetitive domain was substantially shorter than that of conventional subunits, which explains its much faster electrophoretic mobility in SDS-PAGE. Second, although the N-terminal domain of Glu-1St1 resembled that of y-type subunits, its C-terminal domain was more similar to that of x-type subunits. Third, the N- and C-terminal domains of Glu-1St1 shared conserved features with those of barley D-hordein, but the repeat motifs and the organization of its repetitive domain were more similar to those of HMW glutenin subunits than to those of D-hordein. We conclude that Glu-1St1 is a novel variant of HMW glutenin subunits. The analysis of Glu-1St1 may provide new insight into the evolution of HMW glutenin subunits in Triticeae species. (C) 2007 Elsevier Ltd. All rights reserved.
Abstract:
"Da-Huang" (Radix et Rhizoma Rhei, medicinal rhubarb), a famous and important Traditional Chinese Medicine, is often confused with adulterant species of the same genus, Rheum. By sequencing the trnL (UAA)/trnF (GAA) regions of the chloroplast DNA of thirteen Rheum species (three medicinal rhubarb species and ten adulterants), a molecular marker for the medicinal species was found. A pair of PCR primers was designed from these sequences; under optimized PCR conditions, it amplified a highly specific DNA fragment exclusively in medicinal rhubarb and in none of the adulterants.
Abstract:
Danny S. Tuckwell, Matthew J. Nicholson, Christopher S. McSweeney, Michael K. Theodorou and Jayne L. Brookman (2005). The rapid assignment of ruminal fungi to presumptive genera using ITS1 and ITS2 RNA secondary structures to produce group-specific fingerprints. Microbiology, 151 (5) pp.1557-1567 Sponsorship: BBSRC / Stapledon Memorial Trust RAE2008
Abstract:
Gustavo Chemale, Arjan J. van Rossum, James R. Jefferies, John Barrett, Peter M. Brophy, Henrique B. Ferreira, Arnaldo Zaha (2003). Proteomic analysis of the larval stage of the parasite Echinococcus granulosus: causative agent of cystic hydatid disease. Proteomics, 3(8), 1633-1636. Sponsorship: CNPq / PADCT/CNPq / FAPERGS (Brazil)/ BBSRC (UK) RAE2008
Abstract:
INTRODUCTION: Subclinical atherosclerosis (SCA) measures in multiple arterial beds are heritable phenotypes that are associated with increased incidence of cardiovascular disease. We conducted a genome-wide association study (GWAS) for SCA measurements in the community-based Framingham Heart Study. METHODS: Over 100,000 single nucleotide polymorphisms (SNPs) were genotyped (Human 100K GeneChip, Affymetrix) in 1345 subjects from 310 families. We calculated sex-specific age-adjusted and multivariable-adjusted residuals in subjects tested for quantitative SCA phenotypes, including ankle-brachial index, coronary artery calcification (CAC) and abdominal aortic calcification using multi-detector computed tomography, and carotid intimal medial thickness (IMT) using carotid ultrasonography. We evaluated associations of these phenotypes with 70,987 autosomal SNPs with minor allele frequency ≥ 0.10, call rate ≥ 80%, and Hardy-Weinberg p-value ≥ 0.001 in samples ranging from 673 to 984 subjects, using linear regression with generalized estimating equations (GEE) methodology and family-based association testing (FBAT). Variance components LOD scores were also calculated. RESULTS: No association result met criteria for genome-wide significance, but our methods identified 11 SNPs with p < 10⁻⁵ by GEE and five SNPs with p < 10⁻⁵ by FBAT for multivariable-adjusted phenotypes. Among the associated variants were SNPs in or near genes that may be considered candidates for further study, such as rs1376877 (GEE p < 0.000001, located in ABI2) for maximum internal carotid artery IMT and rs4814615 (FBAT p = 0.000003, located in PCSK2) for maximum common carotid artery IMT. Modestly significant associations were noted with various SCA phenotypes for variants in previously reported atherosclerosis candidate genes, including NOS3 and ESR1.
Associations of a region on chromosome 9p21 with CAC phenotypes were also noted, confirming associations with coronary heart disease and CAC in two recently reported genome-wide association studies. In linkage analyses, several regions of genome-wide linkage were noted, confirming previously reported linkage of internal carotid artery IMT on chromosome 12. All GEE, FBAT and linkage results are provided as an open-access results resource at http://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/study.cgi?id=phs000007. CONCLUSION: The results from this GWAS generate hypotheses regarding several SNPs that may be associated with SCA phenotypes in multiple arterial beds. Given the number of tests conducted, subsequent independent replication in a staged approach is essential to identify genetic variants that may be implicated in atherosclerosis.
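The SNP inclusion criteria quoted in the METHODS (autosomal, minor allele frequency ≥ 0.10, call rate ≥ 80%, Hardy-Weinberg p-value ≥ 0.001) can be sketched as a simple filter. This is only an illustration of the stated thresholds, not the study's pipeline: the `SnpStats` record layout, the function names, and the assumption that per-SNP summary statistics are already computed are all hypothetical.

```python
from dataclasses import dataclass
from typing import Iterable, List

# Thresholds as quoted in the abstract.
MIN_MAF = 0.10
MIN_CALL_RATE = 0.80
MIN_HWE_P = 0.001


@dataclass(frozen=True)
class SnpStats:
    """Per-SNP summary statistics (hypothetical record layout)."""
    snp_id: str
    chromosome: str   # "1".."22", "X", "Y", ...
    maf: float        # minor allele frequency
    call_rate: float  # fraction of samples with a genotype call
    hwe_p: float      # Hardy-Weinberg equilibrium test p-value


def passes_qc(snp: SnpStats) -> bool:
    """True if the SNP is autosomal and meets all three thresholds."""
    is_autosomal = snp.chromosome.isdigit() and 1 <= int(snp.chromosome) <= 22
    return (
        is_autosomal
        and snp.maf >= MIN_MAF
        and snp.call_rate >= MIN_CALL_RATE
        and snp.hwe_p >= MIN_HWE_P
    )


def filter_snps(snps: Iterable[SnpStats]) -> List[SnpStats]:
    """Keep only the SNPs that pass every inclusion criterion."""
    return [snp for snp in snps if passes_qc(snp)]
```

A call such as `filter_snps(all_snps)` would return the subset carried forward to association testing; the identifiers in any test data are placeholders rather than real SNPs.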
Abstract:
BACKGROUND: Blood lipid levels including low-density lipoprotein cholesterol (LDL-C), high-density lipoprotein cholesterol (HDL-C), and triglycerides (TG) are highly heritable. Genome-wide association is a promising approach to map genetic loci related to these heritable phenotypes. METHODS: In 1087 Framingham Heart Study Offspring cohort participants (mean age 47 years, 52% women), we conducted genome-wide analyses (Affymetrix 100K GeneChip) for fasting blood lipid traits. Total cholesterol, HDL-C, and TG were measured by standard enzymatic methods and LDL-C was calculated using the Friedewald formula. The long-term averages of up to seven measurements of LDL-C, HDL-C, and TG over a ~30 year span were the primary phenotypes. We used generalized estimating equations (GEE), family-based association tests (FBAT) and variance components linkage to investigate the relationships between SNPs (on autosomes, with minor allele frequency ≥ 10%, genotypic call rate ≥ 80%, and Hardy-Weinberg equilibrium p ≥ 0.001) and multivariable-adjusted residuals. We pursued a three-stage replication strategy of the GEE association results with 287 SNPs (P < 0.001 in Stage I) tested in Stage II (n ~1450 individuals) and 40 SNPs (P < 0.001 in joint analysis of Stages I and II) tested in Stage III (n ~6650 individuals). RESULTS: Long-term averages of LDL-C, HDL-C, and TG were highly heritable (h² = 0.66, 0.69, 0.58, respectively; each P < 0.0001). Of 70,987 tests for each of the phenotypes, two SNPs had p < 10⁻⁵ in GEE results for LDL-C, four for HDL-C, and one for TG. For each multivariable-adjusted phenotype, the number of SNPs with association p < 10⁻⁴ ranged from 13 to 18 and with p < 10⁻³, from 94 to 149. Some results confirmed previously reported associations with candidate genes including variation in the lipoprotein lipase gene (LPL) and HDL-C and TG (rs7007797; P = 0.0005 for HDL-C and 0.002 for TG).
The full set of GEE, FBAT and linkage results is posted at the database of Genotype and Phenotype (dbGaP). After three stages of replication, there was no convincing statistical evidence for association (i.e., combined P < 10⁻⁵ across all three stages) between any of the tested SNPs and lipid phenotypes. CONCLUSION: Using a 100K genome-wide scan, we have generated a set of putative associations for common sequence variants and lipid phenotypes. Validation of selected hypotheses in additional samples did not identify any new loci underlying variability in blood lipids. Lack of replication may be due to inadequate statistical power to detect modest quantitative trait locus effects (i.e., < 1% of trait variance explained) or reduced genomic coverage of the 100K array. GWAS in FHS using a denser genome-wide genotyping platform and a better-powered replication strategy may identify novel loci underlying blood lipids.
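The METHODS mention that LDL-C was calculated with the Friedewald formula and that long-term averages of repeated measurements were the primary phenotypes. A minimal sketch of both steps follows. It assumes concentrations in mg/dL (the abstract does not state units) and uses the conventional form LDL-C = total cholesterol − HDL-C − TG/5, which is generally not applied when TG is 400 mg/dL or higher; the function names and the use of `None` for a missed visit are hypothetical.

```python
from typing import List, Optional, Sequence


def friedewald_ldl(total_chol: float, hdl: float, triglycerides: float) -> float:
    """Estimate LDL-C (mg/dL) as TC - HDL-C - TG/5.

    Assumes all inputs are fasting values in mg/dL. The estimate is
    conventionally considered unreliable for TG >= 400 mg/dL, so that
    case is rejected here.
    """
    if triglycerides >= 400:
        raise ValueError("Friedewald estimate not valid for TG >= 400 mg/dL")
    return total_chol - hdl - triglycerides / 5.0


def long_term_average(measurements: Sequence[Optional[float]]) -> float:
    """Average the available measurements, skipping missed visits (None)."""
    values: List[float] = [m for m in measurements if m is not None]
    if not values:
        raise ValueError("no measurements available")
    return sum(values) / len(values)
```

For example, `friedewald_ldl(200, 50, 150)` gives 120.0, and `long_term_average([120.0, None, 130.0])` gives 125.0.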
Abstract:
Restless Legs Syndrome (RLS) is a common neurological disorder affecting nearly 15% of the general population. Ironically, RLS can be described as the most common condition one has never heard of. It is usually characterised by uncomfortable, unpleasant sensations in the lower limbs inducing an uncontrollable desire to move the legs. RLS exhibits a circadian pattern with symptoms present predominantly in the evening or at night, thus leading to sleep disruption and daytime somnolence. RLS is generally classified into primary (idiopathic) and secondary (symptomatic) forms. Primary RLS includes sporadic and familial cases, in which the age of onset is usually under 45 years and the disorder progresses slowly, with a female to male ratio of 2:1. Secondary forms often occur as a complication of another health condition, such as iron deficiency or thyroid dysfunction. The age of onset is usually over 45 years, with an equal male to female ratio and more rapid progression. Ekbom described the familial component of the disorder in 1945 and since then many studies have been published on the familial forms of the disorder. Molecular genetic studies have so far identified ten loci (5q, 12q, 14p, 9p, 20p, 16p, 19p, 4q, 17p). No specific gene within these loci has been identified thus far. Association mapping has highlighted a further five areas of interest. RLS6 has been found to be associated with SNPs in the BTBD9 gene. Four other variants were found within intronic and intergenic regions of MEIS1, MAP2K5/LBXCOR1, PTPRD and NOS1. The pathophysiology of RLS is complex and remains to be fully elucidated. Conditions associated with secondary RLS, such as pregnancy or end-stage renal disease, are characterised by iron deficiency, which suggests that disturbed iron homeostasis plays a role. Dopaminergic dysfunction in subcortical systems also appears to play a central role.
An ongoing study within the Department of Pathology (University College Cork) is investigating the genetic characteristics of RLS in Irish families. A three-generation RLS pedigree, RLS3002, consisting of 11 affected and 7 unaffected living family members, was recruited. The family had been examined for four of the known loci (5q, 12q, 14p and 9p) (Abdulrahim 2008). The aim of this study was to continue examining this Irish RLS pedigree for possible linkage to the previously described loci and associated regions. Using informative microsatellite markers, linkage was excluded at the loci on 5q, 12q, 14p, 9p, 20p, 16p, 19p, 4q and 17p, and also within the regions reported to be associated with RLS. This suggested the presence of a new, unidentified locus. A genome-wide scan was performed using two microsatellite marker screening sets (Research Genetics Inc. Mapping set and the Applied Biosystems Linkage mapping set version 2.5). Linkage analysis was conducted under an autosomal dominant model with a penetrance of 95% and an allele frequency of 0.01. A maximum LOD score of 3.59 at θ=0.00 for marker D19S878 indicated significant linkage on chromosome 19p. Haplotype analysis defined a genetic region of 6.57 cM on chromosome 19p13.3, corresponding to 2.5 Mb. There are approximately 100 genes annotated within the critical region. Sequencing of two candidate genes, KLF16 and GAMT, selected on the basis of the assumed pathophysiology of RLS, did not identify any sequence variant. This study provides evidence of a novel RLS locus in an Irish pedigree, thus supporting the picture of RLS as a genetically heterogeneous trait.
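To make the reported LOD score easier to interpret, the sketch below computes a two-point LOD score in the simplest textbook setting: fully informative, phase-known meioses with complete penetrance. This is a deliberate simplification and not the model used in the study, which assumed 95% penetrance and a disease allele frequency of 0.01 and would normally be evaluated with dedicated linkage software. The function name and arguments are hypothetical.

```python
import math


def lod_score(n_meioses: int, n_recombinants: int, theta: float) -> float:
    """Two-point LOD score for phase-known, fully informative meioses.

    LOD(theta) = log10( L(theta) / L(0.5) ), where
    L(theta) = theta**k * (1 - theta)**(n - k)
    for k recombinants observed among n meioses.
    """
    if not 0.0 <= theta <= 0.5:
        raise ValueError("theta must lie in [0, 0.5]")
    if not 0 <= n_recombinants <= n_meioses:
        raise ValueError("n_recombinants must lie in [0, n_meioses]")

    k, n = n_recombinants, n_meioses
    likelihood = (theta ** k) * ((1.0 - theta) ** (n - k))
    if likelihood == 0.0:
        # Any observed recombinant excludes theta = 0 outright.
        return float("-inf")
    null_likelihood = 0.5 ** n  # free recombination, theta = 0.5
    return math.log10(likelihood / null_likelihood)
```

Under these simplifying assumptions, each non-recombinant meiosis contributes log10(2) ≈ 0.30 at θ = 0, so `lod_score(12, 0, 0.0)` is about 3.61, above the conventional threshold of 3 for significant linkage.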
Abstract:
An aim of proactive risk management strategies is the timely identification of safety-related risks. One way to achieve this is by deploying early warning systems. Early warning systems aim to provide, in a timely manner, useful information on the presence of potential threats to a system, the system's level of vulnerability, or both. This information can then be used to take proactive safety measures. The United Nations has recommended that any early warning system have four essential elements: risk knowledge, a monitoring and warning service, dissemination and communication, and a response capability. This research deals with the risk knowledge element of an early warning system, which contains models of possible accident scenarios. These accident scenarios are created using hazard analysis techniques, which are categorised as traditional or contemporary. Traditional hazard analysis techniques assume that accidents occur due to a sequence of events, whereas contemporary hazard analysis techniques assume that safety is an emergent property of complex systems. The problem is that no software editor is available with which analysts can create models of accident scenarios based on contemporary hazard analysis techniques and, at the same time, generate computer code that represents the models. This research aims to enhance the process of generating computer code from graphical models that associate early warning signs and causal factors with a hazard, based on contemporary hazard analysis techniques. For this purpose, the thesis investigates the use of Domain Specific Modeling (DSM) technologies.
The contribution of this thesis is the design and development of a set of three graphical Domain Specific Modeling Languages (DSMLs) that, when combined, provide all of the constructs necessary for safety experts and practitioners to conduct hazard and early warning analysis based on a contemporary hazard analysis approach. The languages represent the elements and relations necessary to define accident scenarios and their associated early warning signs. The three DSMLs were incorporated into a prototype software editor that enables safety scientists and practitioners to create and edit hazard and early warning analysis models in a usable manner and, as a result, to generate executable code automatically. This research shows that DSM technologies can be used to develop a set of three DSMLs that allow users to conduct hazard and early warning analysis in a more usable manner. Furthermore, the three DSMLs and their dedicated editor, which are presented in this thesis, may significantly enhance the process of creating the risk knowledge element of computer-based early warning systems.
Abstract:
RNA editing is a biological phenomenon that alters nascent RNA transcripts by insertion, deletion and/or substitution of one or a few nucleotides. It is ubiquitous in all kingdoms of life and in viruses. The predominant editing event in organisms with a developed central nervous system is Adenosine to Inosine deamination. Inosine is recognized as Guanosine by the translational machinery and reverse-transcriptase. In primates, RNA editing occurs frequently in transcripts from repetitive regions of the genome. In humans, more than 500,000 editing instances have been identified by applying computational pipelines to available ESTs and high-throughput sequencing data, and by using chemical methods. However, the functions of only a small number of cases have been studied thoroughly. RNA editing instances have been found to have roles in the synthesis of peptide variants through non-synonymous codon substitutions, in transcript variants through alterations in splicing sites, and in gene silencing through miRNA sequence modifications. We established the Database of RNA EDiting (DARNED) to accommodate the reference genomic coordinates of substitution editing in human, mouse and fly transcripts from the published literature, with additional information on edited genomic coordinates collected from various databases, e.g. UCSC and NCBI. DARNED contains mostly Adenosine to Inosine editing and allows searches based on genomic region, gene ID, and user-provided sequence. The database is accessible at http://darned.ucc.ie. RNA editing instances in coding regions are likely to result in recoding in protein synthesis. This encouraged me to focus my research on occurrences of RNA editing specific to CDS and non-Alu exonic regions. By applying various filters to discrepancies between available ESTs and their corresponding reference genomic sequences, putative RNA editing candidates were identified. High-throughput sequencing was used to validate these candidates.
All predicted coordinates appeared to be either SNPs or unedited.
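Because inosine is read as guanosine, an A-to-I editing event shows up as an A-to-G discrepancy between the reference genome and a transcript-derived sequence. The sketch below illustrates that screening idea only; it is not the thesis pipeline, whose filters are not described here. It assumes a gap-free, same-strand alignment supplied as two equal-length strings, and the function name and the `known_snps` set are hypothetical. For genes on the reverse strand, the same event would appear as T-to-C relative to the forward-strand reference, which this sketch does not handle.

```python
from typing import AbstractSet, List


def candidate_editing_sites(
    reference: str,
    est: str,
    ref_start: int,
    known_snps: AbstractSet[int] = frozenset(),
) -> List[int]:
    """Return reference coordinates where the genome has A and the EST has G.

    `reference` and `est` must be pre-aligned, gap-free, and on the same
    strand. `ref_start` is the genomic coordinate of the first base.
    Positions listed in `known_snps` are discarded, since an A/G SNP
    produces the same discrepancy as an editing event.
    """
    if len(reference) != len(est):
        raise ValueError("sequences must be aligned to equal length")

    sites: List[int] = []
    for offset, (ref_base, est_base) in enumerate(zip(reference.upper(), est.upper())):
        position = ref_start + offset
        if ref_base == "A" and est_base == "G" and position not in known_snps:
            sites.append(position)
    return sites
```

As the abstract's own outcome shows, sites passing such a screen are only candidates: they still need independent validation, because SNPs and sequencing errors can mimic the signal.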
Abstract:
Lactococcus lactis is used extensively worldwide for the production of fermented dairy products. Bacteriophages (phages) infecting L. lactis can result in slow or incomplete fermentations, or may even cause total fermentation failure. Therefore, bacteriophages disrupting L. lactis fermentation are of economic concern. This thesis employed a multifaceted approach to investigate various molecular aspects of phage-host interaction in L. lactis. The genome sequence of an Irish dairy starter strain, the prophage-cured L. lactis subsp. cremoris UC509.9, was studied. The 2,250,427 bp circular chromosome is the smallest among its sequenced lactococcal equivalents. The genome displays clear genetic adaptation to the dairy niche in the form of extensive reductive evolution. Gene prediction identified 2066 protein-encoding genes, including 104 which showed significant homology to transposase-specifying genes. Over 9% of the identified genes appear to be inactivated through stop codons or frameshift mutations. Many pseudogenes were found among genes assigned to the carbohydrate and amino acid transport and metabolism orthologous groups, reflecting L. lactis UC509.9's adaptation to the lactose- and casein-rich dairy environment. Sequence analysis of the eight plasmids of L. lactis revealed extensive adaptation to the dairy environment. Key industrial phenotypes were mapped and novel lactococcal plasmid-associated genes highlighted. In addition to chromosomally encoded bacteriophage resistance systems, six such functional systems were identified, including two abortive infection systems, AbiB and AbiD1, explaining the observed phage resistance of L. lactis UC509.9. Molecular analysis indicates that constitutive expression of AbiB is not lethal to cells, suggesting that the protein is expressed in an unactivated or inactivated form.
Analysis of 936 species phage sk1-escape mutants of AbiB revealed that all such mutants harbour mutations in orf6, which encodes the major capsid protein. Results suggest that the major capsid protein is required for activation of the AbiB system, although this requires further investigation. Temporal transcriptomes of L. lactis UC509.9 undergoing lytic infection with either one of two distinct bacteriophages, Tuc2009 and c2, were determined and compared to the transcriptome of uninfected UC509.9 cells. Whole genome microarrays performed at various time-points post-infection demonstrated a rather modest impact on host transcription. Alterations in the UC509.9 transcriptome during lytic infection appear phage-specific, with a relatively small number of differentially transcribed genes shared between infection with either Tuc2009 or c2. Transcriptional profiles of both bacteriophages during lytic infection were shown to generally correlate with previous studies and allowed the confirmation of previously predicted promoter sequences. Bioinformatic analysis of genomic regions encoding the presumed cell wall polysaccharide (CW PS) biosynthesis gene cluster of several strains of L. lactis was performed. Results demonstrate the presence of three dominant genetic types of this gene cluster, termed type A, B and C. These regions were used for the development of a multiplex PCR to identify the CW PS genotype of various lactococcal strains. Analysis of 936 species phage receptor binding protein (RBP) phylogeny and CW PS genotype revealed an apparent correlation between RBP phylogeny and CW PS type, thereby providing a partial explanation for the observed narrow host range of 936 phages. Further analysis of the genetic locus encompassing the presumed CW PS biosynthesis operon of eight strains identified as belonging to the CW PS C (geno)type revealed the presence of a variable region among the examined strains.
This comparative analysis allowed the identification of five subgroups of the C type, named C1 to C5. We purified an acidic polysaccharide from the cell wall of L. lactis 3107 (C2 subtype) and confirmed that it is structurally different from the CW PS of the C1 subtype L. lactis MG1363. Combinations of genes from the variable region of the C2 subtype were amplified from L. lactis 3107 and introduced into a mutant of the C1 subtype L. lactis NZ9000 (a direct derivative of MG1363) deficient in CW PS biosynthesis. The resulting recombinant mutant synthesized a CW PS with a composition characteristic of the C2 subtype L. lactis 3107 and not of the wild-type C1 L. lactis NZ9000. The recombinant mutant exhibited a changed phage resistance/sensitivity profile consistent with that of L. lactis 3107, which unambiguously demonstrated that L. lactis 3107 CW PS is the host cell surface receptor of two bacteriophages belonging to the P335 species as well as phages that are members of the 936 species. The research presented in this thesis has significantly advanced our understanding of L. lactis bacteriophage-host interactions in several ways. Firstly, the examination of plasmid-encoded bacteriophage resistance systems has allowed inferences to be made regarding the mode of action of AbiB, thereby providing a platform for further elucidation of the molecular trigger of this system. Secondly, the phage infection transcriptome data presented, in addition to previous work, have made L. lactis a model organism in terms of transcriptomic studies of bacteriophage-host interactions. Finally, the research described in this thesis has for the first time explicitly revealed the nature of a carbohydrate bacteriophage receptor in L. lactis, while also providing a logical explanation for the observed narrow host ranges exhibited by 936 and P335 phages. Future research in discerning the structures of other L. lactis CW PS, combined with the determination of the molecular interplay between receptor binding proteins of these phages and CW PS, will allow an in-depth understanding of the mechanism by which the most prevalent lactococcal phages identify and adsorb to their specific host.
Abstract:
This report describes the identification of a novel protein named PS1D (Genbank accession number ), which is composed of an S1-like RNA-binding domain, a (cysteine)x3-(histidine) CCCH-zinc finger, and a very basic carboxyl domain. PS1D is expressed as two isoforms, probably resulting from alternative splicing of the mRNA. The long PS1D isoform differs from the short one by the presence of 48 additional amino acids at its amino-terminal extremity. Analysis of PS1D subcellular distribution by cell fractionation reveals that this protein belongs to the core of the eukaryotic 60S ribosomal subunit. Interestingly, PS1D is highly conserved among mammals, as murine, human, and simian PS1D homologues share more than 95% identity. In contrast, no homologous protein is found in lower eukaryotes such as yeast and Caenorhabditis elegans. These observations indicate that PS1D is the first eukaryotic ribosomal protein that is specific to higher eukaryotes.
Abstract:
We describe here a patient with a clinical and molecular diagnosis of recombinase activating gene 1-deficient (RAG1-deficient) SCID, who produced specific antibodies despite minimal B cell numbers. Memory B cells were detected and antibodies were produced not only against some vaccines and infections, but also against autoantigens. The patient had severely reduced levels of oligoclonal T cells expressing the αβ TCR but surprisingly normal numbers of T cells expressing the γδ TCR. Analysis at a clonal level and TCR complementarity-determining region-3 spectratyping for γδ T cells revealed a diversified oligoclonal repertoire with predominance of cells expressing a γ4-δ3 TCR. Several γδ T cell clones displayed reactivity against CMV-infected cells. These observations are compatible with 2 non-mutually exclusive explanations for the γδ T cell predominance: a developmental advantage and infection-triggered, antigen-driven peripheral expansion. The patient carried the homozygous hypomorphic R561H RAG1 mutation leading to reduced V(D)J recombination but lacked all clinical features characteristic of Omenn syndrome. This report describes a new phenotype of RAG deficiency and shows that the ability to form specific antibodies does not exclude the diagnosis of SCID.
Abstract:
Somatostatin receptor 2 (SSTR2) is expressed by most medulloblastomas (MEDs). We isolated monoclonal antibodies (MAbs) to the 12-mer ³³QTEPYYDLTSNA⁴⁴, which resides in the extracellular domain of the SSTR2 amino terminus, screened the peptide-bound MAbs by fluorescence microassay on D341 and D283 MED cells, and demonstrated homogeneous cell-surface binding, indicating that all cells expressed cell surface-detectable epitopes. Five radiolabeled MAbs were tested for immunoreactive fraction (IRF), affinity (KA) (Scatchard analysis vs. D341 MED cells), and internalization by MED cells. One IgG3 MAb exhibited a 50-100% IRF, but low KA. Four IgG2a MAbs had 46-94% IRFs and modest KAs versus intact cells (0.21-1.2 × 10⁸ M⁻¹). Following binding of radiolabeled MAbs to D341 MED at 4 °C, no significant internalization was observed, which is consistent with results obtained in the absence of ligand. However, all MAbs exhibited long-term association with the cells; binding at 37 °C after 2 h was 65-66%, and after 24 h, 52-64%. In tests with MAbs C10 and H5, the number of cell surface receptors per cell, estimated by Scatchard and quantitative FACS analyses, was 3.9 × 10⁴ for the "glial" phenotype DAOY MED cell line and 0.6-8.8 × 10⁵ for four neuronal phenotype MED cell lines. Our results indicate a potential immunotherapeutic application for these MAbs.
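The affinity constants and receptor counts above were estimated by Scatchard analysis. As a rough illustration of the principle, and not of the authors' procedure, the sketch below fits the Scatchard relation Bound/Free = KA × (Bmax − Bound) by ordinary least squares, so that the slope gives −KA and the x-intercept gives Bmax. It assumes a single class of non-interacting binding sites and concentrations in mol/L; the function name is hypothetical.

```python
from typing import Sequence, Tuple


def scatchard_fit(bound: Sequence[float], free: Sequence[float]) -> Tuple[float, float]:
    """Estimate (KA, Bmax) from paired bound and free ligand concentrations.

    Fits the line  Bound/Free = KA * Bmax - KA * Bound  by least squares.
    With concentrations in mol/L, KA is returned in M^-1 and Bmax in mol/L.
    """
    if len(bound) != len(free) or len(bound) < 2:
        raise ValueError("need at least two paired observations")

    ratios = [b / f for b, f in zip(bound, free)]
    n = len(bound)
    mean_x = sum(bound) / n
    mean_y = sum(ratios) / n

    # Ordinary least-squares slope and intercept of ratios versus bound.
    sxx = sum((x - mean_x) ** 2 for x in bound)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(bound, ratios))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x

    ka = -slope            # association constant
    bmax = intercept / ka  # x-intercept: total binding-site concentration
    return ka, bmax
```

Converting Bmax to receptors per cell would additionally require the cell concentration in the assay and Avogadro's number. Scatchard plots are sensitive to error at low occupancy, so nonlinear fitting of the binding isotherm is often preferred in practice.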
Abstract:
The BUZ/Znf-UBP domain is a protein module found in the cytoplasmic deacetylase HDAC6, the E3 ubiquitin ligase BRAP2/IMP, and a subfamily of ubiquitin-specific proteases. Although several BUZ domains have been shown to bind ubiquitin with high affinity by recognizing its C-terminal sequence (RLRGG-COOH), it is currently unknown whether the interaction is sequence-specific or whether the BUZ domains are capable of binding to proteins other than ubiquitin. In this work, the BUZ domains of HDAC6 and Ubp-M were screened against a one-bead-one-compound (OBOC) peptide library that displayed random peptide sequences with free C-termini. Sequence analysis of the selected binding peptides as well as alanine scanning studies revealed that the BUZ domains require a C-terminal Gly-Gly motif for binding. At the more N-terminal positions, the two BUZ domains have distinct sequence specificities, allowing them to bind to different peptides and/or proteins. A database search of the human proteome on the basis of the BUZ domain specificities identified 11 and 24 potential partner proteins for the Ubp-M and HDAC6 BUZ domains, respectively. Peptides corresponding to the C-terminal sequences of four of the predicted binding partners (FBXO11, histone H4, PTOV1, and FAT10) were synthesized and tested for binding to the BUZ domains by fluorescence polarization. All four peptides bound to the HDAC6 BUZ domain with low micromolar KD values and less tightly to the Ubp-M BUZ domain. Finally, in vitro pull-down assays showed that the Ubp-M BUZ domain was capable of binding to the histone H3-histone H4 tetramer protein complex. Our results suggest that BUZ domains are sequence-specific protein-binding modules, with each BUZ domain potentially binding to a different subset of proteins.
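The proteome search described above can be pictured, in its crudest form, as a scan for proteins whose sequence ends in Gly-Gly. The sketch below applies only that shared requirement. The domain-specific preferences at the more N-terminal positions are not given in the abstract, so this filter would select far more candidates than the 11 and 24 reported. The function names and the dictionary input format are hypothetical, and a trailing `*` is assumed to be a stop-codon marker from sequence files.

```python
from typing import Dict, List


def ends_with_gly_gly(sequence: str) -> bool:
    """True if the protein sequence has a C-terminal Gly-Gly (..GG) motif."""
    # Strip a trailing '*' stop marker, if present, before checking.
    return sequence.upper().rstrip("*").endswith("GG")


def candidate_partners(proteome: Dict[str, str]) -> List[str]:
    """Return the names of proteins ending in Gly-Gly, sorted alphabetically.

    `proteome` maps a protein name to its one-letter amino acid sequence.
    """
    return sorted(name for name, seq in proteome.items() if ends_with_gly_gly(seq))
```

A fuller version would score the residues preceding the Gly-Gly motif against each domain's measured preferences, which is the step that makes the predictions domain-specific.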