923 results for Genome scans


Relevance:

20.00% 20.00%

Publisher:

Abstract:

When compared to other model organisms whose genome has been sequenced, the number of mutations identified in the mouse appears extremely reduced, and this situation seriously hampers our understanding of mammalian gene function(s). Another important consequence of this shortage is that a majority of human genetic diseases still await an animal model. To improve the situation, two strategies are currently used: the first makes use of embryonic stem cells, in which one can induce knockout mutations almost at will; the second consists of a genome-wide random chemical mutagenesis, followed by screening for mutant phenotypes and subsequent identification of the genetic alteration(s). Several projects are now in progress making use of one or the other of these strategies. Here, we report an original effort in which we mutagenized BALB/c males with the mutagen ethylnitrosourea. Offspring of these males were screened for dominant mutations, and a three-generation breeding protocol was set up to recover recessive mutations. Eleven mutations were identified (one dominant and ten recessive). Three of these mutations are new alleles (Otop1mlh, Foxn1sepe and probably rodador) at loci where mutations have already been reported, while four are new and original alleles (carc, eqlb, frqz, and Sacc). This result indicates that the mouse genome, as expected, is far from being saturated with mutations. More mutations would certainly be discovered using more sophisticated phenotyping protocols. Seven of the 11 new mutant alleles induced in our experiment have been localized on the genetic map as a first step towards positional cloning.

Abstract:

Data on genome damage, lipid peroxidation, and levels of glutathione peroxidase (GPX) in newborns after transplacental exposure to xenobiotics are rare and insufficient for risk assessment. The aim of the current study was to analyze, in an animal model, transplacental genotoxicity, lipid peroxidation, and detoxification disturbances caused by the following drugs commonly prescribed to pregnant women: paracetamol, fluconazole, 5-nitrofurantoin, and sodium valproate. Genome damage in dams and their newborn pups transplacentally exposed to these drugs was investigated using the in vivo micronucleus (MN) assay. The drugs were administered to dams intraperitoneally in three consecutive daily doses between days 12 and 14 of pregnancy. The results were correlated with the detoxification capacity of the newborn pups, measured by the levels of GPX in blood, and with lipid peroxidation in liver, measured by malondialdehyde (HPLC-MDA) levels. Sodium valproate and 5-nitrofurantoin significantly increased MN frequency in pregnant dams. A significant increase in the MN frequency of newborn pups was detected for all drugs tested. This paper also provides reference levels of MDA in newborn pups, according to which all drugs tested significantly lowered MDA levels of newborn pups, while blood GPX activity dropped significantly only after exposure to paracetamol. The GPX reduction reflected systemic oxidative stress, which is known to occur with paracetamol treatment. The reduction of MDA in the liver is suggested to be an unspecific metabolic reaction to the drugs that express cytotoxic, in particular hepatotoxic, effects associated with oxidative stress and lipid peroxidation.

Abstract:

DNA methylation is essential in X chromosome inactivation and genomic imprinting, maintaining repression of XIST in the active X chromosome and monoallelic repression of imprinted genes. Disruption of the DNA methyltransferase genes DNMT1 and DNMT3B in the HCT116 cell line (DKO cells) leads to global DNA hypomethylation and biallelic expression of the imprinted gene IGF2 but does not lead to reactivation of XIST expression, suggesting that XIST repression is due to a more stable epigenetic mark than imprinting. To test this hypothesis, we induced acute hypomethylation in HCT116 cells by 5-aza-2′-deoxycytidine (5-aza-CdR) treatment (HCT116-5-aza-CdR) and compared that to DKO cells, evaluating DNA methylation by microarray and monitoring the expression of XIST and imprinted genes IGF2, H19, and PEG10. Whereas imprinted genes showed biallelic expression in HCT116-5-aza-CdR and DKO cells, the XIST locus was hypomethylated and weakly expressed only under acute hypomethylation conditions, indicating the importance of XIST repression in the active X to cell survival. Given that DNMT3A is the only active DNMT in DKO cells, it may be responsible for ensuring the repression of XIST in those cells. Taken together, our data suggest that XIST repression is more tightly controlled than genomic imprinting and, at least in part, is due to DNMT3A.

Abstract:

Ropinirole (ROP) is a dopamine agonist that has been used as therapy for Parkinson's disease. In the present study, we aimed to detect whether gene expression was modulated by ROP in SH-SY5Y cells. SH-SY5Y cell lines were treated with 10 µM ROP for 2 h, after which total RNA was extracted for whole genome analysis. Gene expression profiling revealed that 113 genes were differentially expressed after ROP treatment compared with control cells. Further pathway analysis revealed modulation of the phosphatidylinositol 3-kinase (PI3K) signaling pathway, with prominent upregulation of PIK3C2B. Moreover, batches of regulated genes, including PIK3C2B, were found to be located on chromosome 1. These findings were validated by quantitative RT-PCR and Western blot analysis. Our study, therefore, revealed that ROP altered gene expression in SH-SY5Y cells, and future investigation of PIK3C2B and other loci on chromosome 1 may provide long-term implications for identifying novel target genes of Parkinson's disease.

Abstract:

Personalized medicine will revolutionize our capabilities to combat disease. Working toward this goal, a fundamental task is the deciphering of genetic variants that are predictive of complex diseases. Modern studies, in the form of genome-wide association studies (GWAS), have afforded researchers the opportunity to reveal new genotype-phenotype relationships through the extensive scanning of genetic variants. These studies typically contain over half a million genetic features for thousands of individuals. Examining this with methods other than univariate statistics is a challenging task requiring advanced algorithms that are scalable to the genome-wide level. In the future, next-generation sequencing (NGS) studies will contain an even larger number of common and rare variants. Machine learning-based feature selection algorithms have been shown to be able to create effective predictive models for various genotype-phenotype relationships. This work explores the problem of selecting genetic variant subsets that are the most predictive of complex disease phenotypes through various feature selection methodologies, including filter, wrapper and embedded algorithms. The examined machine learning algorithms were demonstrated not only to be effective at predicting the disease phenotypes, but also to do so efficiently through the use of computational shortcuts. While much of the work could be run on high-end desktops, some work was further extended so that it could be implemented on parallel computers, helping to ensure that the methods will also scale to NGS data sets. Further, these studies analyzed the relationships between various feature selection methods and demonstrated the need for careful testing when selecting an algorithm. 
It was shown that there is no universally optimal algorithm for variant selection in GWAS, but rather methodologies need to be selected based on the desired outcome, such as the number of features to be included in the prediction model. It was also demonstrated that without proper model validation, for example using nested cross-validation, the models can result in overly-optimistic prediction accuracies and decreased generalization ability. It is through the implementation and application of machine learning methods that one can extract predictive genotype–phenotype relationships and biological insights from genetic data sets.
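The point about nested cross-validation can be illustrated with a minimal sketch. It is not the thesis's implementation: the mean-difference filter, the nearest-centroid classifier, and every function name below are assumptions chosen only to keep the example dependency-free. What it shows is the structure: the number of selected variants is tuned in an inner loop that sees training rows only, so the outer-fold accuracy is not inflated by the selection step.

```python
import random

def avg(values):
    values = list(values)
    return sum(values) / len(values)

def stratified_folds(rows, y, k):
    """Deal cases and controls round-robin into k folds so each fold holds both."""
    folds = [[] for _ in range(k)]
    for label in (0, 1):
        members = [r for r in rows if y[r] == label]
        for i, r in enumerate(members):
            folds[i % k].append(r)
    return folds

def top_features(X, y, rows, n):
    """Filter step (assumed): rank variants by the absolute case/control
    difference in mean genotype, computed on the given rows only."""
    cases = [r for r in rows if y[r] == 1]
    ctrls = [r for r in rows if y[r] == 0]
    def score(j):
        return abs(avg(X[r][j] for r in cases) - avg(X[r][j] for r in ctrls))
    return sorted(range(len(X[0])), key=score, reverse=True)[:n]

def accuracy(X, y, train, test, n):
    """Select n variants and fit a nearest-centroid model on train; score on test."""
    feats = top_features(X, y, train, n)
    cents = {lab: [avg(X[r][j] for r in train if y[r] == lab) for j in feats]
             for lab in (0, 1)}
    def predict(x):
        return min((0, 1), key=lambda lab: sum((x[j] - c) ** 2
                                               for j, c in zip(feats, cents[lab])))
    return avg(predict(X[r]) == y[r] for r in test)

def nested_cv(X, y, candidates=(1, 5, 20), outer_k=5, inner_k=3, seed=0):
    """Return one accuracy per outer fold; the feature count is tuned per fold."""
    rows = list(range(len(X)))
    random.Random(seed).shuffle(rows)
    outer_scores = []
    for test in stratified_folds(rows, y, outer_k):
        held_out = set(test)
        train = [r for r in rows if r not in held_out]

        def inner_score(n):
            # Inner loop: only training rows are ever used to compare settings.
            scores = []
            for val in stratified_folds(train, y, inner_k):
                val_set = set(val)
                fit = [r for r in train if r not in val_set]
                scores.append(accuracy(X, y, fit, val, n))
            return avg(scores)

        best_n = max(candidates, key=inner_score)
        outer_scores.append(accuracy(X, y, train, test, best_n))
    return outer_scores
```

Here `X` is assumed to be a list of per-individual genotype lists coded 0/1/2 and `y` a list of 0/1 phenotype labels. Running feature selection once on all individuals before cross-validating would leak test information into the model, which is the over-optimism the abstract warns about.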

Abstract:

Lichens are symbiotic organisms, which consist of the fungal partner and the photosynthetic partner, which can be either an alga or a cyanobacterium. In some lichen species the symbiosis is tripartite, where the relationship includes both an alga and a cyanobacterium alongside the primary symbiont, the fungus. The lichen symbiosis is an evolutionarily old adaptation to life on land and many extant fungal species have evolved from lichenised ancestors. Lichens inhabit a wide range of habitats and are capable of living in harsh environments and on nutrient-poor substrates, such as bare rocks, often enduring frequent cycles of drying and wetting. Most lichen species are desiccation tolerant, and they can survive long periods of dehydration, but can rapidly resume photosynthesis upon rehydration. The molecular mechanisms behind lichen desiccation tolerance are still largely uncharacterised and little information is available for any lichen species at the genomic or transcriptomic level. The emergence of the high-throughput next generation sequencing (NGS) technologies and the subsequent decrease in the cost of sequencing new genomes and transcriptomes has enabled non-model organism research on the whole genome level. In this doctoral work the transcriptome and genome of the grey reindeer lichen, Cladonia rangiferina, were sequenced, de novo assembled and characterised using NGS and traditional expressed sequence tag (EST) technologies. RNA extraction methods were optimised to improve the yield and quality of RNA extracted from lichen tissue. The effects of rehydration and desiccation on C. rangiferina gene expression on whole transcriptome level were studied and the most differentially expressed genes were identified. The secondary metabolites present in C. rangiferina decreased the quality (integrity, optical characteristics and utility for sensitive molecular biological applications) of the extracted RNA, requiring an optimised RNA extraction method for isolating sufficient quantities of high-quality RNA from lichen tissue in a time- and cost-efficient manner. The de novo assembly of the transcriptome of C. rangiferina was used to produce a set of contiguous unigene sequences that were used to investigate the biological functions and pathways active in a hydrated lichen thallus. The de novo assembly of the genome yielded an assembly containing mostly genes derived from the fungal partner. The assembly was of sufficient quality, in size similar to other lichen-forming fungal genomes and included most of the core eukaryotic genes. Differences in gene expression were detected in all studied stages of desiccation and rehydration, but the largest changes occurred during the early stages of rehydration. The most differentially expressed genes did not have any annotations, making them potentially lichen-specific genes, but several genes known to participate in environmental stress tolerance in other organisms were also identified as differentially expressed.

Abstract:

Adenoviruses are non-enveloped, icosahedral particles that possess a double-stranded DNA genome. Currently, nearly 100 serotypes of adenoviruses have been identified, 48 of which are of human origin. Bovine adenoviruses (BAVs), which cause mild respiratory and/or enteral diseases in cattle, have been reported in many countries all over the world. Currently, nine serotypes of BAVs have been isolated and placed into two subgroups based on a number of characteristics, including complement fixation tests as well as the ability to replicate in various cell lines. Bovine adenovirus type 2 (BAV2), belonging to subgroup I, is able to cause pneumonia as well as pneumonia-like symptoms in calves. In this study, the genome of BAV2 (strain No. 19) was subcloned into the plasmid vector pUC19. In total, 16 plasmids were constructed; three carry internal SalI fragments (spanning 3.1 to 65.2%), and 10 carry internal PstI fragments (spanning 4.9 to 97.4%), of the viral genome. Each of these plasmids was analyzed using twelve restriction endonucleases: BamHI, ClaI, EcoRI, HindIII, KpnI, NotI, NsiI, PstI, PvuI, SalI, XbaI, and XhoI. Terminal end fragments were also cloned and analyzed, subsequent to the removal of the 5' terminal protein, in the form of two BamHI B fragments, cloned in opposite orientations (spanning 0 to 18.1%), and one PstI fragment (spanning 97.4 to 100%). These cloned fragments, along with two other plasmids previously constructed carrying internal EcoRI fragments (spanning 20.6 to 90.5%), were then used to construct a detailed physical restriction map using the twelve restriction endonucleases, as well as to estimate the size of the BAV2 genome (32.5 kbp). The DNA sequences of the early region 1 (E1) and hexon-associated gene (protein IX) have also been determined. The amino acid sequences of four open reading frames (ORFs) have been compared to those of the E1 proteins and protein IX from other Ads.
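The map-unit spans quoted above can be checked with a few lines of arithmetic. The sketch below is only an illustration, not part of the study: it takes the terminal spans as 0 to 18.1% and 97.4 to 100%, uses the 32.5 kbp genome size from the abstract, and treats the dictionary keys as informal labels. It merges the spans to see whether the cloned fragments jointly cover the genome and converts map units to kbp.

```python
GENOME_KBP = 32.5  # genome size estimate quoted in the abstract

# Spans in map units (% of genome length); keys are informal labels only.
CLONED_SPANS = {
    "BamHI B (terminal)": (0.0, 18.1),
    "SalI (internal)": (3.1, 65.2),
    "PstI (internal)": (4.9, 97.4),
    "PstI (terminal)": (97.4, 100.0),
    "EcoRI (internal)": (20.6, 90.5),
}

def to_kbp(pct, genome_kbp=GENOME_KBP):
    """Convert a map-unit position (percent of genome) to kbp."""
    return pct / 100.0 * genome_kbp

def merge_spans(spans):
    """Merge overlapping or touching (start, end) spans into disjoint ones."""
    merged = []
    for start, end in sorted(spans):
        if merged and start <= merged[-1][1]:
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    return [tuple(m) for m in merged]

if __name__ == "__main__":
    for label, (start, end) in CLONED_SPANS.items():
        print(f"{label}: {to_kbp(start):.2f} to {to_kbp(end):.2f} kbp")
    print("merged coverage (map units):", merge_spans(CLONED_SPANS.values()))
```

A single merged span from 0 to 100 map units indicates that the listed clones, taken together, leave no part of the genome unrepresented.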

Abstract:

Sequence repeats are an important phenomenon in the human genome, playing important roles in genomic alteration, often with phenotypic consequences. The two major types of repeat elements in the human genome are tandem repeats (TRs), including microsatellites, minisatellites, and satellites, and transposable elements (TEs). So far, very little is known about the relationship between these two types of repeats. In this study, we identified TRs that are derived from TEs based on either sequence similarity or overlapping genomic positions. We then analyzed the distribution of these TRs among TE families/subfamilies. Our study shows that at least 7,276 TRs, or 23% of all minisatellites/satellites, are derived from TEs, contributing ∼0.32% of the human genome. TRs seem more likely to be generated from younger/more active TEs, and once initiated they expand over time via local duplication of the repeat units. The currently postulated mechanisms for the origin of TRs can explain only 6% of all TE-derived TRs, indicating the presence of one or more yet to be identified mechanisms for the initiation of such repeats. Our results suggest that TEs contribute to genome expansion and alteration not only by transposition but also by generating tandem repeats.
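The positional criterion mentioned above (TRs whose genomic coordinates overlap a TE annotation) can be sketched as a simple interval comparison. This is an illustration under stated assumptions, not the study's pipeline: the half-open coordinates, the `min_frac` threshold, the function name, and the toy annotations are all hypothetical, and the sequence-similarity criterion is not covered.

```python
from collections import defaultdict

def te_derived_trs(trs, tes, min_frac=0.5):
    """Return (TR, TE family) pairs where at least min_frac of the TR's
    length lies inside a single TE annotation on the same chromosome.

    trs: iterable of (chrom, start, end), half-open coordinates
    tes: iterable of (chrom, start, end, family)
    """
    tes_by_chrom = defaultdict(list)
    for chrom, start, end, family in tes:
        tes_by_chrom[chrom].append((start, end, family))

    hits = []
    for chrom, start, end in trs:
        length = end - start
        for te_start, te_end, family in tes_by_chrom[chrom]:
            # Negative values mean the two intervals do not overlap at all.
            shared = min(end, te_end) - max(start, te_start)
            if length > 0 and shared / length >= min_frac:
                hits.append(((chrom, start, end), family))
                break  # count each TR once
    return hits

# Toy annotations (invented for illustration only).
trs = [("chr1", 100, 160), ("chr1", 500, 540), ("chr2", 10, 50)]
tes = [("chr1", 90, 400, "AluY"), ("chr1", 530, 900, "L1"), ("chr2", 100, 300, "SVA")]
derived = te_derived_trs(trs, tes)
print(f"{len(derived)} of {len(trs)} toy TRs fall inside a TE")
```

For genome-scale annotation files, the nested loop would normally be replaced by a sorted sweep or an interval index; the quadratic version is kept here only because it is the easiest to read.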

Abstract:

Genome sequence varies in numerous ways among individuals although the gross architecture is fixed for all humans. Retrotransposons create one of the most abundant classes of structural variants in the human genome and are divided into many families, with certain members in some families, e.g., L1, Alu, SVA, and HERV-K, remaining active for transposition. Along with other types of genomic variants, retrotransposon-derived variants contribute to the whole spectrum of genome variants in humans. With the advancement of sequencing techniques, many human genomes are being sequenced at the individual level, fueling comparative research on these variants among individuals. In this thesis, the evolution and functional impact of structural variations are examined, focusing primarily on retrotransposons in the context of human evolution. The thesis comprises three different studies, presented in three data chapters. First, the recent evolution of all human-specific AluYb members, representing the second most active subfamily of Alus, was tracked to identify their source/master copy using a novel approach. All human-specific AluYb elements from the reference genome were extracted and aligned with one another to construct clusters of similar copies, and each cluster was analyzed to infer the evolutionary relationship between its members. The approach resulted in the identification of one major driver copy of all human-specific Yb8 elements and the source copy of the Yb9 lineage. Three new subfamilies within the AluYb family (Yb8a1, Yb10 and Yb11) were also identified, with Yb11 being the youngest and most polymorphic. Second, an attempt to establish a relationship between transposable elements (TEs) and tandem repeats (TRs) was made at a genome-wide scale for the first time. Upon sequence comparison, positional cross-checking and other relevant analyses, it was observed that over 20% of all TRs are derived from TEs. 
This result established the first connection between these two types of repetitive elements, and extends our appreciation for the impact of TEs on genomes. Furthermore, only 6% of these TE-derived TRs follow the already postulated initiation and expansion mechanisms, suggesting that the others are likely to follow a yet-unidentified mechanism. Third, by taking a combination of multiple computational approaches involving all types of genetic variations published so far including transposable elements, the first whole genome sequence of the most recent common ancestor of all modern human populations that diverged into different populations around 125,000-100,000 years ago was constructed. The study shows that the current reference genome sequence is 8.89 million base pairs larger than our common ancestor’s genome, contributed by a whole spectrum of genetic mechanisms. The use of this ancestral reference genome to facilitate the analysis of personal genomes was demonstrated using an example genome and more insightful recent evolutionary analyses involving the Neanderthal genome. The three data chapters presented in this thesis conclude that the tandem repeats and transposable elements are not two entirely distinctly isolated elements as over 20% TRs are actually derived from TEs. Certain subfamilies of TEs themselves are still evolving with the generation of newer subfamilies. The evolutionary analyses of all TEs along with other genomic variants helped to construct the genome sequence of the most recent common ancestor to all modern human populations which provides a better alternative to human reference genome and can be a useful resource for the study of personal genomics, population genetics, human and primate evolution.

Abstract:

The complete genome of an Erwinia amylovora bacteriophage, vB_EamM_Ea35-70 (Ea35-70), is 271,084 bp, encodes 318 putative proteins, and contains one tRNA. Comparative analysis with other Myoviridae genomes suggests that Ea35-70 is related to the Phikzlikevirus genus within the family Myoviridae, since 26% of Ea35-70 proteins share homology to proteins in Pseudomonas phage φKZ.

Abstract:

Affiliation: Henner Brinkmann : Département de biochimie, Faculté de médecine, Université de Montreal

Abstract:

Proteolytic processing of the CUX1 transcription factor generates an isoform, p110, that accelerates entry into S phase. To identify targets of p110 CUX1 that are involved in cell cycle progression, we performed genome-wide location analysis using a promoter microarray. Since there are no antibodies that specifically recognize p110 but not the full-length protein, we expressed physiological levels of a p110 isoform with two tags and purified chromatin by tandem affinity purification (ChAP). Conventional ChIP performed on synchronized populations of cells confirmed that p110 CUX1 is recruited to the promoter of cell cycle-related targets preferentially during S phase. Multiple approaches, including silencing RNA (siRNA), transient infection with retroviral vectors, constitutive expression and reporter assays, demonstrated that most cell cycle targets are activated whereas a few are repressed or not affected by p110 CUX1. Functional classes that were over-represented among targets included DNA replication initiation. Consistent with this finding, constitutive expression of p110 CUX1 led to a premature and more robust induction of replication genes during cell cycle progression, and stimulated the long-term replication of a plasmid bearing the oriP replicator of Epstein-Barr virus (EBV).

Abstract:

Alzheimer's dementia is a neurodegenerative disease characterized by a progressive and irreversible loss of cognitive functions and intellectual abilities. Alzheimer's disease occurs in two forms: the familial or early-onset form (EOAD), which accounts for 5% of cases and is linked to genetic mutations affecting the metabolism of amyloid peptides; and the late-onset or sporadic form (LOAD), which accounts for 95% of cases but whose etiology is still poorly defined. Nevertheless, aging remains the main risk factor for developing LOAD. Epigenetic changes involving histone modifications play a crucial role in neurodegenerative diseases and in aging. Recent data have described LOAD as a disorder of the epigenome and have associated it with genomic instability. Polycomb proteins are epigenetic modifiers that induce chromatin remodeling and gene repression at facultative heterochromatin. We report that mice heterozygous for a Polycomb protein develop with age a neurological disorder resembling LOAD, characterized by impaired cognitive function, phosphorylation of the tau protein, accumulation of amyloid peptides, and synaptic dysfunction. This pathological phenotype is preceded by decondensation of neuronal heterochromatin and activation of the DNA damage response. In parallel, reduced Polycomb expression, malformations of neuronal heterochromatin, and accumulation of DNA damage were also present in the brains of LOAD patients. Remarkably, the DNA damage is not randomly distributed across the genome but is enriched at repetitive sequences. 
The findings presented in this thesis identify specific epigenetic modifications that lead to aberrant genomic instability resulting in LOAD. These results will help in the development of new treatments that could potentially slow neurodegeneration.

Abstract:

DNA supercoiling is important for all cellular processes that require separation of the DNA strands. It is regulated by the enzymatic activity of topoisomerases. Gyrase (gyrA and gyrB) uses ATP to introduce negative supercoils into DNA, whereas topoisomerase I (topA) and topoisomerase IV (parC and parE) remove them. Cells lacking topoisomerase I are viable if they carry compensatory mutations in one of the genes encoding a gyrase subunit. These mutations reduce the level of negative supercoiling of the chromosome and allow bacterial growth. One of these mutations results in the production of a thermosensitive gyrase. The supercoiling activity of gyrase in the absence of topoisomerase I causes the accumulation of hypernegatively supercoiled DNA because of R-loop formation. Overproduction of RNase HI (rnhA), an enzyme that degrades the RNA of R-loops, prevents the accumulation of excess negative supercoiling. In the absence of RNase HI, R-loops are also formed and can be used to initiate DNA replication independently of the normal oriC/DnaA system, a phenomenon known as constitutive stable DNA replication (cSDR). To better understand the link between R-loop formation and excess negative supercoiling, we constructed a conditional topA rnhA gyrB(Ts) mutant with inducible expression of RNase HI from a plasmid. We found that the DNA of cells of this mutant was excessively relaxed, rather than hypernegatively supercoiled, under conditions of RNase HI depletion. This DNA relaxation was shown to be independent of topoisomerase IV activity. Cells of the topA rnhA gyrB(Ts) triple mutant form very long filaments packed with DNA, indicating a chromosome segregation defect. 
Overproduction of topoisomerase III (topB), an enzyme that can decatenate DNA, corrected the segregation problems without restoring the level of DNA supercoiling. We found that protein extracts from the topA rnhA gyrB(Ts) mutant could inhibit the negative supercoiling activity of gyrase in extracts from a wild-type strain, suggesting that RNase HI depletion had triggered a cellular response that inhibits this gyrase activity. In addition, in vivo and in vitro experiments showed that in the absence of RNase HI, the ATP-dependent negative supercoiling activity of gyrase was inhibited, whereas the ATP-independent activity of the enzyme remained intact. Extragenic suppressors of the growth defect of the topA rnhA gyrB(Ts) triple mutant that also correct the supercoiling and chromosome segregation problems were mostly mapped to genes involved in DNA replication, R-loop metabolism, or fimbriae formation. The second part of this project aimed to understand the roles of type IA topoisomerases (topoisomerase I and topoisomerase III) in chromosome segregation and genome stability in Escherichia coli. To study these roles, we used genetic approaches combined with flow cytometry, Western blot analysis, and microscopy. We found that the Par- phenotype and the chromosome segregation defects of a gyrB(Ts) mutant were corrected by inactivating topA, but only in the presence of the topB gene. Furthermore, we showed that overproduction of topoisomerase III could correct the Par- phenotype of the gyrB(Ts) mutant without correcting its growth defects. Overproduction of topoisomerase IV, the enzyme responsible for chromosome decatenation in E. coli, could not substitute for topoisomerase III. 
Our results suggest that type IA topoisomerases play an important role in chromosome segregation when gyrase is inefficient. To study the role of type IA topoisomerases in genome stability, the third part of the project, we used genetic approaches combined with spot tests and microscopy. We found that cells lacking topoisomerase I had chromosome segregation and growth defects linked to excess negative supercoiling, and that these defects could be corrected by inactivating recQ or recA, or by overproducing topoisomerase III. The extragenic suppressor oriC15::aph, isolated in the first part of the project, could also correct these problems. Cells lacking type IA topoisomerases formed very long filaments packed with DNA that appeared diffuse and was unevenly distributed within the cell. These phenotypes could be partially corrected by overproducing RNase HI or by inactivating recA, or by suppressors isolated in the first part of the project and involved in cSDR (dnaT18::aph and rne59::aph). Thus, in E. coli, type IA topoisomerases play a role in genome stability by inhibiting inappropriate replication from oriC and from R-loops, and by preventing segregation defects linked to RecA-dependent recombination through their action with RecQ. The work reported here reveals that inappropriate, deregulated replication is a major source of genomic instability. Preventing inappropriate replication allows chromosome segregation and the maintenance of a stable genome. RNase HI and type IA topoisomerases play a major role in preventing inappropriate replication. RNase HI accomplishes this by modulating the ATP-dependent supercoiling activity of gyrase and by preventing replication from R-loops. 
Type IA topoisomerases maintain genome stability by preventing inappropriate replication from oriC and R-loops and by acting with RecQ to resolve RecA-dependent recombination intermediates so that chromosomes can segregate.

Abstract:

The centromere is the chromosomal region where the kinetochore assembles during mitosis. Unlike certain genic features, the centromeric sequence is neither conserved between species nor sufficient for centromere function. It is therefore well accepted in the literature that the centromere is regulated epigenetically by a histone H3 variant, CENP-A. KNL-2, also known as M18BP1, and its partners Mis18α and Mis18β are proteins essential for the incorporation of newly synthesized CENP-A at centromeres. Experimental evidence shows that KNL-2, which has a DNA-binding domain called Myb, is the most upstream protein for CENP-A incorporation at centromeres in G1 phase. However, its function in the process of CENP-A incorporation at centromeres is not well understood, and not all of its binding partners are known. New binding partners of KNL-2 were identified by immunoprecipitation experiments followed by mass spectrometry analysis. A role in the incorporation of newly synthesized CENP-A at centromeres was attributed to MgcRacGAP, one of the 60 proteins identified by the assay. MgcRacGAP, along with the protein ECT-2 (a GEF) and the small GTPase Cdc42, was shown to be required for the stability of CENP-A incorporated at centromeres. These observations led to the identification of a third step, at the molecular level, in the incorporation of newly synthesized CENP-A in G1 phase: the stabilization of newly incorporated CENP-A at centromeres. This step is important for maintaining centromere identity at each cell division. To characterize the function of KNL-2 during the incorporation of newly synthesized CENP-A at centromeres, a high-resolution microscopy technique coupled with image quantification was used. 
The results show that recruitment of KNL-2 to the centromere is rapid, occurring about 5 minutes after mitotic exit. In addition, the structure of the Myb domain of KNL-2 from the nematode C. elegans was solved by NMR and shows a helix-turn-helix motif, a structure known for DNA-binding domains of the Myb family. Moreover, the human (HsMyb) and C. elegans (CeMyb) Myb domains bind DNA in vitro, but no sequence is specifically recognized by these domains. However, it was possible to show that both domains preferentially bind CENP-A-YFP chromatin over H2B-GFP chromatin in a modified SIMPull assay under the TIRF microscope. Thus, the Myb domain of KNL-2 is sufficient to specifically recognize centromeric chromatin. Finally, the element recognized by the Myb domains in vitro has potentially been identified. Indeed, the HsMyb and CeMyb domains were shown to bind single-stranded DNA in vitro. Furthermore, the HsMyb and CeMyb domains do not colocalize with CENP-A when expressed in HeLa cells, but rather with PML nuclear bodies, nuclear structures composed of RNA. Thus, by potentially binding centromeric transcripts, the Myb domains of KNL-2 could specify the incorporation of newly synthesized CENP-A exclusively at centromeric regions.