935 results for Complete Genome Sequence


Relevance:

30.00%

Abstract:

Transcription factors (TFs) control the temporal and spatial expression of target genes by interacting with DNA in a sequence-specific manner. Recent advances in high throughput experiments that measure TF-DNA interactions in vitro and in vivo have facilitated the identification of DNA binding sites for thousands of TFs. However, it remains unclear how each individual TF achieves its specificity, especially in the case of paralogous TFs that recognize distinct target genomic sites despite sharing very similar DNA binding motifs. In my work, I used a combination of high throughput in vitro protein-DNA binding assays and machine-learning algorithms to characterize and model the binding specificity of 11 paralogous TFs from 4 distinct structural families. My work proves that even very closely related paralogous TFs, with indistinguishable DNA binding motifs, oftentimes exhibit differential binding specificity for their genomic target sites, especially for sites with moderate binding affinity. Importantly, the differences I identify in vitro and through computational modeling help explain, at least in part, the differential in vivo genomic targeting by paralogous TFs. Future work will focus on in vivo factors that might also be important for specificity differences between paralogous TFs, such as DNA methylation, interactions with protein cofactors, or the chromatin environment. In this larger context, my work emphasizes the importance of intrinsic DNA binding specificity in targeting of paralogous TFs to the genome.

Relevance:

30.00%

Abstract:

The complete and faithful duplication of the genome is essential to ensure normal cell division and organismal development. Eukaryotic DNA replication is initiated at multiple sites, termed origins of replication, that are activated at different times throughout S phase. The replication timing program is regulated by the S-phase checkpoint, which signals and repairs replicative stress. Eukaryotic DNA is packaged with histones into chromatin; thus, DNA-templated processes, including replication, are modulated by the local chromatin environment, such as post-translational modifications (PTMs) of histones.

One such epigenetic mark, methylation of lysine 20 on histone H4 (H4K20), has been linked to chromatin compaction, transcription, DNA repair and DNA replication. H4K20 can be mono-, di- and tri-methylated. Monomethylation of H4K20 (H4K20me1) is mediated by the cell cycle-regulated histone methyltransferase PR-Set7 and subsequent di-/tri- methylation is catalyzed by Suv4-20. Prior studies have shown that PR-Set7 depletion in mammalian cells results in defective S phase progression and the accumulation of DNA damage, which may be partially attributed to defects in origin selection and activation. Meanwhile, overexpression of mammalian PR-Set7 recruits components of pre-Replication Complex (pre-RC) onto chromatin and licenses replication origins for re-replication. However, these studies were limited to only a handful of mammalian origins, and it remains unclear how PR-Set7 impacts the replication program on a genomic scale. Finally, the methylation substrates of PR-Set7 include both histone (H4K20) and non-histone targets, therefore it is necessary to directly test the role of H4K20 methylation in PR-Set7 regulated phenotypes.

I employed genetic, cytological, and genomic approaches to better understand the role of H4K20 methylation in regulating DNA replication and genome stability in Drosophila melanogaster cells. Depletion of Drosophila PR-Set7 by RNAi in cultured Kc167 cells led to an ATR-dependent cell cycle arrest with near 4N DNA content and the accumulation of DNA damage, indicating a defect in completing S phase. The cells arrested at the second S phase following PR-Set7 downregulation, suggesting an epigenetic effect coupled to the dilution of the histone modification over multiple cell cycles. To directly test the role of H4K20 methylation in regulating genome integrity, I collaborated with the Duronio Lab and observed spontaneous DNA damage in the wing imaginal discs of third instar mutant larvae carrying an alanine substitution at H4K20 (H4K20A), which cannot be methylated, confirming that H4K20 is a bona fide target of PR-Set7 in maintaining genome integrity.

One possible source of DNA damage due to loss of PR-Set7 is reduced origin activity. I used BrdU-seq to profile the genome-wide origin activation pattern. However, I found that deregulation of H4K20 methylation states by manipulating the H4K20 methyltransferases PR-Set7 and Suv4-20 had no impact on origin activation throughout the genome. I then mapped the genomic distribution of DNA damage upon PR-Set7 depletion. Surprisingly, ChIP-seq of the DNA damage marker γ-H2A.v located the DNA damage to late replicating euchromatic regions of the Drosophila genome, and the strength of γ-H2A.v signal was uniformly distributed and spanned the entire late replication domain, implying stochastic replication fork collapse within late replicating regions. Together these data suggest that PR-Set7-mediated monomethylation of H4K20 is critical for maintaining the genomic integrity of late replicating domains, presumably via stabilization of late replicating forks.

In addition to investigating the function of H4K20me, I also used immunofluorescence to characterize the cell cycle regulated chromatin loading of Mcm2-7 complex, the DNA helicase that licenses replication origins, using H4K20me1 level as a proxy for cell cycle stages. In parallel with chromatin spindown data by Powell et al. (Powell et al. 2015), we showed a continuous loading of Mcm2-7 during G1 and a progressive removal from chromatin through S phase.

Relevance:

30.00%

Abstract:

We present an extensive photometric catalog for 548 CALIFA galaxies observed as of the summer of 2015. CALIFA is currently lacking photometry matching the scale and diversity of its spectroscopy; this work is intended to meet all photometric needs for CALIFA galaxies while also identifying best photometric practices for upcoming integral field spectroscopy surveys such as SAMI and MaNGA. This catalog comprises gri surface brightness profiles derived from Sloan Digital Sky Survey (SDSS) imaging, a variety of non-parametric quantities extracted from these profiles, and parametric models fitted to the i-band profiles (1D) and original galaxy images (2D). To complement our photometric analysis, we contrast the relative performance of our 1D and 2D modelling approaches. The ability of each measurement to characterize the global properties of galaxies is quantitatively assessed, in the context of constructing the tightest scaling relations. Where possible, we compare our photometry with existing photometrically or spectroscopically obtained measurements from the literature. Close agreement is found with Walcher et al. (2014), the current source of basic photometry and classifications of CALIFA galaxies, while comparisons with spectroscopically derived quantities reveal the effect of CALIFA's limited field of view compared to broadband imaging surveys such as the SDSS. The colour-magnitude diagram, star formation main sequence, and Tully-Fisher relation of CALIFA galaxies are studied, to give a small example of the investigations possible with this rich catalog. We conclude with a discussion of points of concern for ongoing integral field spectroscopy surveys and directions for future expansion and exploitation of this work.

Relevance:

30.00%

Abstract:

PURPOSE: We analyzed patients with hairy cell leukemia (HCL) to achieve a better understanding of the differentiation stage reached by HCL cells and to define the key role of the diversification of cell surface markers, especially CD25 expression. PATIENTS AND METHODS: We analyzed 38 previously untreated patients with HCL to characterize their complete (VDJ(H)) and incomplete (DJ(H)) immunoglobulin (Ig) heavy chain (IgH) rearrangements, including somatic hypermutation pattern and gene segment use. RESULTS: A correlation between immunophenotypic profile and molecular data was seen. All 38 cases showed monoclonal amplifications: VDJ(H) in 97%, DJ(H) in 42%, and both in 39%. Segments from the D(H)3 family were used more in complete compared with incomplete rearrangements (45% vs. 12%; P < .005). Furthermore, comparison between molecular and immunophenotypic characteristics disclosed differences in the expression of CD25 antigen; CD25(-) cases, a phenotype associated with HCL variant, showed complete homology to the germline in 3 of 5 cases (60%), whereas this characteristic was never observed in CD25(+) cases (P < .005). Moreover, V(H)4-34, V(H)1-08, and J(H)3 segments appeared in 2, 1, and 2 CD25(-) cases, respectively, whereas they were absent in all CD25(+) cases. CONCLUSION: These results support that HCL is a heterogeneous entity including subgroups with different molecular characteristics, which reinforces the need for additional studies with a larger number of patients to clarify the real role of gene rearrangements in HCL.

Relevance:

30.00%

Abstract:

Genome-wide association studies (GWAS) have identified several risk variants for late-onset Alzheimer's disease (LOAD). These common variants have replicable but small effects on LOAD risk and generally do not have obvious functional effects. Low-frequency coding variants, not detected by GWAS, are predicted to include functional variants with larger effects on risk. To identify low-frequency coding variants with large effects on LOAD risk, we carried out whole-exome sequencing (WES) in 14 large LOAD families and follow-up analyses of the candidate variants in several large LOAD case–control data sets. A rare variant in PLD3 (phospholipase D3; Val232Met) segregated with disease status in two independent families and doubled risk for Alzheimer’s disease in seven independent case–control series with a total of more than 11,000 cases and controls of European descent. Gene-based burden analyses in 4,387 cases and controls of European descent and 302 African American cases and controls, with complete sequence data for PLD3, reveal that several variants in this gene increase risk for Alzheimer’s disease in both populations. PLD3 is highly expressed in brain regions that are vulnerable to Alzheimer’s disease pathology, including hippocampus and cortex, and is expressed at significantly lower levels in neurons from Alzheimer’s disease brains compared to control brains. Overexpression of PLD3 leads to a significant decrease in intracellular amyloid-β precursor protein (APP) and extracellular Aβ42 and Aβ40 (the 42- and 40-residue isoforms of the amyloid-β peptide), and knockdown of PLD3 leads to a significant increase in extracellular Aβ42 and Aβ40. Together, our genetic and functional data indicate that carriers of PLD3 coding variants have a twofold increased risk for LOAD and that PLD3 influences APP processing.
This study provides an example of how densely affected families may help to identify rare variants with large effects on risk for disease or other complex traits.

Relevance:

30.00%

Abstract:

For this project, we developed a platform for genome-wide analysis of DNA methylation in cattle that is compatible with small samples. This tool is used to study the genetic and epigenetic (DNA methylation) characteristics of gametes subjected to assisted reproduction procedures and of early embryos. First, a microarray analysis platform specific to the study of DNA methylation in the bovine species was developed. This platform was then optimized to produce reliable and reproducible genome-wide DNA methylation analyses from very small samples such as early embryos (≥ 10 ng of DNA was used, corresponding to 10 expanding blastocysts). In addition, this tool made it possible to assess DNA methylation and the transcriptome simultaneously in the same sample, thus providing a complete picture of the genetic and epigenetic (DNA methylation) profiles. As a proof of concept, the comparative DNA methylation profiles of bovine sperm and blastocysts were analyzed across the whole genome. Second, using this platform, the global DNA methylation profiles of monozygotic (MZ) twin bulls were analyzed. Although they are genetically identical, MZ twin bulls have offspring with different performance. Consequently, it was hypothesized that the sperm DNA methylation profile of MZ twin bulls differs. In our study, significant differences between the MZ twins were found in semen characteristics as well as in DNA methylation, each of which may contribute to the unexpectedly divergent performance of the daughters sired by these MZ twins.
In the third part of this project, the same platform was used to uncover the effects of supplementation with a high concentration of a universal methyl donor on early bovine embryos. Supplementation with large amounts of folic acid (FA) has been widely used and recommended in pregnant women for its well-established ability to prevent neural tube defects in children. More recently, however, several studies have reported adverse effects of FA used at high concentrations, not only on embryo development but also in adults. At the cellular level, FA enters one-carbon metabolism, the only pathway producing S-adenosyl methionine (SAM), a universal donor of methyl groups for a wide variety of biomolecules, including DNA. Therefore, to address this controversy, a high dose of SAM was used to treat bovine embryos produced in vitro. This not only influenced the phenotype of the early embryos but also affected the transcriptome and the DNA methylome. In summary, the present project enabled the development of a genome-wide DNA methylation analysis platform for cattle that is reasonably priced, easy to use, and compatible with early embryos. Moreover, as this is one of the first studies of its kind in bovine reproductive biology, the project had three objectives that yielded several new results, including comparative DNA methylation profiles of: i) blastocysts versus spermatozoa; ii) semen from MZ twin bulls; and iii) early embryos treated with high doses of SAM versus untreated early embryos.

Relevance:

30.00%

Abstract:

Plant reproduction depends on the concerted activation of many genes to ensure correct communication between pollen and pistil. Here, we queried the whole transcriptome of Arabidopsis (Arabidopsis thaliana) in order to identify genes with specific reproductive functions. We used the Affymetrix ATH1 whole genome array to profile wild-type unpollinated pistils and unfertilized ovules. By comparing the expression profile of pistils at 0.5, 3.5, and 8.0 h after pollination and applying a number of statistical and bioinformatics criteria, we found 1,373 genes differentially regulated during pollen-pistil interactions. Robust clustering analysis grouped these genes in 16 time-course clusters representing distinct patterns of regulation. Coregulation within each cluster suggests the presence of distinct genetic pathways, which might be under the control of specific transcriptional regulators. A total of 78% of the regulated genes were expressed initially in unpollinated pistil and/or ovules, 15% were initially detected in the pollen data sets as enriched or preferentially expressed, and 7% were induced upon pollination. Among those, we found a particular enrichment for unknown transcripts predicted to encode secreted proteins or representing signaling and cell wall-related proteins, which may function by remodeling the extracellular matrix or as extracellular signaling molecules. A strict regulatory control in various metabolic pathways suggests that fine-tuning of the biochemical and physiological cellular environment is crucial for reproductive success. Our study provides a unique and detailed temporal and spatial gene expression profile of in vivo pollen-pistil interactions, providing a framework to better understand the basis of the molecular mechanisms operating during the reproductive process in higher plants.

Relevance:

30.00%

Abstract:

Next-generation sequencing of complete genomes has given researchers unprecedented levels of information to study the multifaceted evolutionary changes that have shaped elite plant germplasm. In conjunction with population genetic analytical techniques and detailed online databases, we can more accurately capture the effects of domestication on entire biological pathways of agronomic importance. In this study, we explore the genetic diversity and signatures of selection in all predicted gene models of the storage starch synthesis pathway of Sorghum bicolor, utilizing a diversity panel containing lines categorized as either ‘Landraces’ or ‘Wild and Weedy’ genotypes. Amongst a total of 114 genes involved in starch synthesis, 71 had at least a single signal of purifying selection, 62 had a signal of balancing selection, and others had a mix of both. These included key genes such as STARCH PHOSPHORYLASE 2 (SbPHO2, under balancing selection), PULLULANASE (SbPUL, under balancing selection) and ADP-glucose pyrophosphorylases (SHRUNKEN2, SbSH2, under purifying selection). Effectively, many genes within the primary starch synthesis pathway had a clear reduction in nucleotide diversity between the Landraces and wild and weedy lines, indicating that the ancestral effects of domestication are still clearly identifiable. There was evidence of positional rate variation within the well-characterized primary starch synthesis pathway of sorghum, particularly in the Landraces, whereby low evolutionary rates upstream and high rates downstream in the metabolic pathway were expected. This observation did not extend to the wild and weedy lines or the minor starch synthesis pathways.
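
The reduction in nucleotide diversity mentioned in this abstract is commonly quantified as π, the average number of pairwise differences per site among aligned sequences. The following is a minimal sketch of that calculation, assuming equal-length, gap-free aligned sequences. The function name and the toy sequences are illustrative and are not taken from the study, whose own estimation procedure is not described in the abstract.

```python
from itertools import combinations


def nucleotide_diversity(seqs):
    """Return pi: mean pairwise differences per site for aligned sequences."""
    if len(seqs) < 2:
        raise ValueError("need at least two sequences")
    length = len(seqs[0])
    if length == 0 or any(len(s) != length for s in seqs):
        raise ValueError("sequences must be non-empty and of equal length")

    pairs = list(combinations(seqs, 2))
    # Count mismatching positions for every pair of sequences.
    total_diffs = sum(
        sum(a != b for a, b in zip(s1, s2)) for s1, s2 in pairs
    )
    return total_diffs / (len(pairs) * length)


# Toy alignment (hypothetical data): 3 sequences, 4 sites.
toy_alignment = ["AAAA", "AAAT", "AATT"]
pi = nucleotide_diversity(toy_alignment)
```

Comparing π computed separately for two groups of lines (for example, landraces and wild accessions) is one simple way to express a diversity reduction as a ratio.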

Relevance:

30.00%

Abstract:

Cauliflower (Brassica oleracea var. botrytis) is a vernalization-responsive crop. High ambient temperatures delay harvest time. Elucidating the genetic regulation of floral transition is of great interest for precise harvest scheduling and for ensuring a stable market supply. This study aims at the genetic dissection of temperature-dependent curd induction in cauliflower by genome-wide association studies and gene expression analysis. To assess temperature-dependent curd induction, two greenhouse trials under distinct temperature regimes were conducted on a diversity panel consisting of 111 cauliflower commercial parent lines, genotyped with 14,385 SNPs. Broad phenotypic variation and high heritability (0.93) were observed for temperature-related curd induction within the cauliflower population. GWA mapping identified a total of 18 QTL localized on chromosomes O1, O2, O3, O4, O6, O8, and O9 for curding time under two distinct temperature regimes. Among those, several QTL are localized within regions of promising candidate flowering genes. Inference of population structure and genetic relatedness among the diversity set assigned three main genetic clusters. Linkage disequilibrium (LD) patterns gave an estimated global LD extent of r² = 0.06 and a maximum physical distance of 400 kb for genetic linkage. Transcriptional profiling of the flowering genes FLOWERING LOCUS C (BoFLC) and VERNALIZATION 2 (BoVRN2) was performed, showing increased expression levels of BoVRN2 in genotypes with faster curding. However, the functional relevance of BoVRN2 and BoFLC2 could not be consistently supported, which suggests that these genes act facultatively and/or points to BoVRN2/BoFLC-independent mechanisms in temperature-regulated floral transition in cauliflower. Genetic insights into temperature-regulated curd induction can underpin genetically informed phenology models and benefit molecular breeding strategies toward the development of thermo-tolerant cultivars.
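
The LD statistic r² reported in this abstract can be illustrated for a single pair of biallelic SNPs. This is a minimal sketch assuming phased haplotypes coded as 0/1 alleles. The function name and toy data are hypothetical, and the abstract does not state how the study itself estimated LD.

```python
def ld_r2(haplotypes):
    """Return r^2 between two biallelic SNPs.

    `haplotypes` is a list of (allele_at_snp_a, allele_at_snp_b) tuples,
    each allele coded 0 or 1 (assumes phased data).
    """
    n = len(haplotypes)
    if n == 0:
        raise ValueError("no haplotypes given")

    p_a = sum(a for a, _ in haplotypes) / n   # frequency of allele 1 at SNP A
    p_b = sum(b for _, b in haplotypes) / n   # frequency of allele 1 at SNP B
    p_ab = sum(1 for a, b in haplotypes if a == 1 and b == 1) / n

    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    if denom == 0:
        raise ValueError("r^2 is undefined when a SNP is monomorphic")

    d = p_ab - p_a * p_b                      # coefficient of disequilibrium D
    return d * d / denom


# Toy data (hypothetical): perfectly correlated vs. independent SNP pairs.
linked = [(0, 0), (0, 0), (1, 1), (1, 1)]
unlinked = [(0, 0), (0, 1), (1, 0), (1, 1)]
```

A low genome-wide average such as the reported r² = 0.06 would correspond to most SNP pairs behaving more like the `unlinked` example than the `linked` one.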

Relevance:

30.00%

Abstract:

The primary goal of systems biology is to integrate complex omics data and data obtained from traditional experimental studies in order to provide a holistic understanding of organismal function. One way of achieving this aim is to generate genome-scale metabolic models (GEMs), which contain information on all metabolites, enzyme-coding genes, and biochemical reactions in a biological system. A Drosophila melanogaster GEM has not been reconstructed to date. A constraint-free genome-wide metabolic model of the fruit fly has been reconstructed in our lab, identifying gaps where no enzyme was identified and metabolites were either only produced or only consumed. The main focus of the work presented in this thesis was to develop a pipeline for efficient gap filling using metabolomics approaches combined with standard reverse genetics methods, using 5-hydroxyisourate hydrolase (5-HIUH) as an example. 5-HIUH plays a role in the urate degradation pathway. Inability to degrade urate can lead to inborn errors of metabolism (IEMs) in humans, including hyperuricemia. Based on sequence analysis, the Drosophila CG30016 gene was hypothesised to encode 5-HIUH. CG30016 knockout flies were examined to identify a Malpighian tubule phenotype, and their shortened lifespan might reflect the kidney disorders seen in human hyperuricemia. Moreover, LC-MS analysis of mutant tubules revealed that CG30016 is involved in purine metabolism, and specifically in the urate degradation pathway. However, the exact role of the gene has not been identified, and the complete method for gap filling has not been developed. Nevertheless, thanks to the work presented here, we are a step closer to the development of a gap-filling pipeline for the Drosophila melanogaster GEM. Importantly, the areas that require further optimisation were identified and are the focus of future research. Moreover, LC-MS analysis confirmed that tubules, rather than the whole fly, were more suitable for metabolomics analysis of purine metabolism.
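
The gaps described in this abstract, metabolites that are only produced or only consumed, can be found by scanning reaction stoichiometry. The following is a minimal sketch, assuming every reaction is irreversible and written left to right, with negative coefficients for substrates and positive ones for products. The function name, reaction IDs, and the simplified toy network are hypothetical illustrations and do not reproduce the thesis's model or pipeline.

```python
def dead_end_metabolites(reactions):
    """Find metabolites that are only produced or only consumed.

    `reactions` maps a reaction ID to {metabolite: coefficient}, where a
    negative coefficient marks a substrate and a positive one a product.
    Assumes all reactions are irreversible.
    """
    produced, consumed = set(), set()
    for stoichiometry in reactions.values():
        for metabolite, coeff in stoichiometry.items():
            if coeff > 0:
                produced.add(metabolite)
            elif coeff < 0:
                consumed.add(metabolite)
    return {
        "only_produced": sorted(produced - consumed),
        "only_consumed": sorted(consumed - produced),
    }


# Toy network (hypothetical IDs, cofactors omitted): the step that would
# degrade 5-hydroxyisourate is missing, so it shows up as a gap.
toy_network = {
    "R1": {"xanthine": -1, "urate": 1},
    "R2": {"urate": -1, "5-hydroxyisourate": 1},
}
gaps = dead_end_metabolites(toy_network)
```

In this toy case `xanthine` is also flagged as only consumed. A fuller model would usually supply such inputs through uptake or exchange reactions, so only the remaining flagged metabolites would be candidates for gap filling.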
Previously, the Dow/Davies lab generated the most complete tissue-specific transcriptomic atlas for Drosophila, FlyAtlas.org, which provides data on gene expression across multiple tissues of the adult fly and larva. FlyAtlas revealed that transcripts of many genes are enriched in specific Drosophila tissues, and that it is possible to deduce the functions of individual tissues within the fly. Based on FlyAtlas data, it has become clear that the fly (like other metazoan species) must be considered as a set of tissues, each with its own distinct transcriptional and functional profile. Moreover, it revealed that for about 30% of the genome, reverse genetic methods (i.e. mutation in an unknown gene followed by observation of phenotype) are only useful if specific tissues are investigated. Based on the FlyAtlas findings, we aimed to build a primary tissue-specific metabolome of the fruit fly, in order to establish whether different Drosophila tissues have different metabolomes and whether they correspond to the tissue-specific transcriptome of the fruit fly (FlyAtlas.org). Different fly tissues were dissected and their metabolomes elucidated using LC-MS. The results confirmed that tissue metabolomes differ significantly from each other and from the whole fly, and that some of these differences can be correlated with tissue function. The results illustrate the need to study individual tissues as well as the whole organism. It is clear that some metabolites that play an important role in a given tissue might not be detected in the whole-fly sample because their abundance is much lower than that of other metabolites present in all tissues, which prevents the detection of the tissue-specific compound.

Relevance:

30.00%

Abstract:

The quality and speed of genome sequencing have advanced as technological boundaries have been pushed. This advancement has so far been divided into three generations. First-generation methods enabled sequencing of clonal DNA populations. Second-generation methods massively increased throughput by parallelizing many reactions, while third-generation methods allow direct sequencing of single DNA molecules. The first techniques to sequence DNA were not developed until the mid-1970s, when two distinct sequencing methods were developed almost simultaneously, one by Alan Maxam and Walter Gilbert and the other by Frederick Sanger. The first is a chemical method that cleaves DNA at specific points; the second uses dideoxynucleotides (ddNTPs), which terminate synthesis of a copy of the DNA template strand. Both methods generate fragments of varying lengths that are then separated by electrophoresis. Until the 1990s, DNA sequencing was relatively expensive and was seen as a long process. The use of radiolabeled nucleotides compounded the problem by raising safety concerns and preventing automation. Advancements within the first generation include the replacement of radioactive labels with fluorescently labeled ddNTPs and cycle sequencing with thermostable DNA polymerase, which allow automation and signal amplification, making the process cheaper, safer and faster. Another method is pyrosequencing, which is based on the “sequencing by synthesis” principle. It differs from Sanger sequencing in that it relies on the detection of pyrophosphate release upon nucleotide incorporation. By the end of the last millennium, parallelization of this method started Next Generation Sequencing (NGS), with 454 as the first of many methods that can process multiple samples, known as second-generation sequencing. Here electrophoresis was completely eliminated.
One method that is sometimes used is SOLiD, which is based on sequencing by ligation of fluorescently labeled di-base probes that compete to ligate to the sequencing primer. Specificity of the di-base probe is achieved by interrogating every 1st and 2nd base in each ligation reaction. The widely used Solexa/Illumina method uses modified dNTPs containing so-called “reversible terminators”, which block further polymerization. The terminator also carries a fluorescent label, which can be detected by a camera. The last step before the third generation was taken by Ion Torrent, which developed a technique based on “sequencing by synthesis”. Its main feature is the detection of hydrogen ions released during base incorporation. The third generation, in turn, draws on advances in nanotechnology to process single DNA molecules, as in real-time sequencing-by-synthesis systems such as PacBio. Finally, nanopore sequencing, proposed since 1995, uses nanosensors that form channels, derived from bacteria, which conduct the sample to a sensor that allows the detection of each nucleotide residue in the DNA strand. Technological advances have been so quick that one wonders: how do we imagine the next generation?
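
The pyrosequencing principle described in this abstract, where a signal is produced each time nucleotides are incorporated, can be sketched as an idealized flowgram. The sketch below assumes noise-free signals proportional to homopolymer length and a fixed cyclic flow order. The function names, the flow order "TACG", and the example read are illustrative assumptions, not a description of any instrument's software.

```python
def flowgram(read, flow_order="TACG", max_flows=400):
    """Simulate idealized pyrosequencing of the strand being synthesized.

    Nucleotides are flowed one type at a time in `flow_order`. Each flow
    yields a signal equal to the number of bases incorporated, which is
    the length of the homopolymer run at the current position (0 if the
    flowed nucleotide does not match).
    """
    signals = []
    pos = 0
    flow = 0
    while pos < len(read) and flow < max_flows:
        base = flow_order[flow % len(flow_order)]
        run = 0
        while pos < len(read) and read[pos] == base:
            run += 1
            pos += 1
        signals.append((base, run))
        flow += 1
    return signals


def decode(signals):
    """Rebuild the sequence from (base, signal) pairs."""
    return "".join(base * count for base, count in signals)


example_signals = flowgram("TTAGC")
example_decoded = decode(example_signals)
```

The same flow-by-flow logic applies to the Ion Torrent approach mentioned above, with released hydrogen ions rather than pyrophosphate as the detected signal. In practice, signals for long homopolymer runs are harder to resolve than this idealized model suggests.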

Relevance:

30.00%

Abstract:

This is the author’s version of a work that was accepted for publication in AIDS Research and Human Retroviruses.

Relevance:

30.00%

Abstract:

Mycobacterium avium subsp. paratuberculosis is an important animal pathogen widely disseminated in the environment that has also been associated with Crohn's disease in humans. Three M. avium subsp. paratuberculosis genomotypes are recognized, but genomic differences have not been fully described. To further investigate these potential differences, a 60-mer oligonucleotide microarray (designated the MAPAC array), based on the combined genomes of M. avium subsp. paratuberculosis (strain K-10) and Mycobacterium avium subsp. hominissuis (strain 104), was designed and validated. By use of a test panel of defined M. avium subsp. paratuberculosis strains, the MAPAC array was able to identify a set of large sequence polymorphisms (LSPs) diagnostic for each of the three major M. avium subsp. paratuberculosis types. M. avium subsp. paratuberculosis type II strains contained a smaller genomic complement than M. avium subsp. paratuberculosis type I and M. avium subsp. paratuberculosis type III genomotypes, which included a set of genomic regions also found in M. avium subsp. hominissuis 104. Specific PCRs for genes within LSPs that differentiated M. avium subsp. paratuberculosis types were devised and shown to accurately screen a panel (n = 78) of M. avium subsp. paratuberculosis strains. Analysis of insertion/deletion region INDEL12 showed deletion events causing a reduction in the complement of mycobacterial cell entry genes in M. avium subsp. paratuberculosis type II strains and significantly altering the coding of a major immunologic protein (MPT64) associated with persistence and granuloma formation. Analysis of MAPAC data also identified signal variations in several genomic regions, termed variable genomic islands (vGIs), suggestive of transient duplication/deletion events. vGIs contained significantly low GC% and were immediately flanked by insertion sequences, integrases, or short inverted repeat sequences. 
Quantitative PCR demonstrated that variation in vGI signals could be associated with colony growth rate and morphology.
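
The low GC% noted for the variable genomic islands can be illustrated with a simple sliding-window GC calculation. This is a minimal sketch. The function names, window parameters, and toy sequence are hypothetical, and the abstract does not describe how GC content was actually computed in the study.

```python
def gc_percent(seq):
    """Return the GC percentage of a DNA sequence, ignoring non-ACGT symbols."""
    counted = [base for base in seq.upper() if base in "ACGT"]
    if not counted:
        raise ValueError("sequence contains no A, C, G or T")
    gc = sum(base in "GC" for base in counted)
    return 100.0 * gc / len(counted)


def windowed_gc(seq, window, step):
    """Return (start, GC%) for each full window along the sequence."""
    return [
        (start, gc_percent(seq[start:start + window]))
        for start in range(0, len(seq) - window + 1, step)
    ]


# Toy sequence (hypothetical): a GC-rich stretch followed by an AT-rich one.
profile = windowed_gc("GGCCATAT", window=4, step=4)
```

Regions whose windowed GC% falls well below the genome-wide value are the kind of signal that would mark a candidate island in such a scan.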

Relevance:

30.00%

Abstract:

Lactobacillus rossiae is an obligately hetero-fermentative lactic acid bacterium, which can be isolated from a broad range of environments including sourdoughs, vegetables, fermented meat and flour, as well as the gastrointestinal tract of both humans and animals. In order to unravel distinctive genomic features of this particular species and investigate the phylogenetic positioning within the genus Lactobacillus, comparative genomics and phylogenomic approaches, followed by functional analyses were performed on L. rossiae DSM 15814(T), showing how this type strain not only occupies an independent phylogenetic branch, but also possesses genomic features underscoring its biotechnological potential. This strain in fact represents one of a small number of bacteria known to encode a complete de novo biosynthetic pathway of vitamin B-12 (in addition to other B vitamins such as folate and riboflavin). In addition, it possesses the capacity to utilize an extensive set of carbon sources, a characteristic that may contribute to environmental adaptation, perhaps enabling the strain's ability to populate different niches.