931 results for genomic
Abstract:
Background: Citrus bacterial canker is a disease that has severe economic impact on citrus industries worldwide and is caused by a few species and pathotypes of Xanthomonas. X. citri subsp. citri strain 306 (XccA306) is a type A (Asiatic) strain with a wide host range, whereas its variant X. citri subsp. citri strain Aw12879 (Xcaw12879, Wellington strain) is restricted to Mexican lime. Results: To characterize the mechanism behind the differences in host range between XccA and Xcaw, the recently completed genome of Xcaw12879 was compared with the XccA306 genome. The effectors xopAF and avrGf1 are present in Xcaw12879 but absent in XccA306. AvrGf1 of Xcaw was previously shown to cause a hypersensitive response in Duncan grapefruit. Mutation analysis of xopAF indicates that the gene contributes to Xcaw growth in Mexican lime but does not contribute to the limited host range of Xcaw. RNA-Seq analysis was conducted to compare the expression profiles of Xcaw12879 and XccA306 in Nutrient Broth (NB) medium and in XVM2 medium, which induces hrp gene expression. In XVM2 compared with NB, 292 and 281 genes showed differential expression for XccA306 and Xcaw12879, respectively. Twenty-five type 3 secretion system genes were up-regulated in XVM2 for both XccA and Xcaw. Among the 4,370 genes common to Xcaw12879 and XccA306, 603 genes in NB and 450 genes in XVM2 were differentially regulated between the two strains. Xcaw12879 showed higher protease activity but lower pectate lyase activity than XccA306. Conclusions: Comparative genomic analysis of XccA306 and Xcaw12879 identified strain-specific genes. Our study indicated that AvrGf1 contributes to the host range limitation of Xcaw12879, whereas XopAF contributes to virulence. Transcriptome analyses of XccA306 and Xcaw12879 provided insights into gene expression in the two closely related strains of X. citri subsp. citri.
Virulence genes including genes encoding T3SS components and effectors are induced in XVM2 medium. Numerous genes with differential expression in Xcaw12879 and XccA306 were identified. This study provided the foundation to further characterize the mechanisms for virulence and host range of pathotypes of X. citri subsp. citri.
Abstract:
Background: Trypanosomatids of the genera Angomonas and Strigomonas live in a mutualistic association characterized by extensive metabolic cooperation with obligate endosymbiotic Betaproteobacteria. However, the role played by the symbiont has been inferred by indirect means rather than demonstrated. Symbiont-harboring trypanosomatids, in contrast to their counterparts lacking symbionts, exhibit lower nutritional requirements and are autotrophic for essential amino acids. To demonstrate the symbiont's contributions to this autotrophy, entire genomes of symbionts and of trypanosomatids with and without symbionts were sequenced here. Results: Analyses of the essential amino acid pathways revealed that most biosynthetic routes are in the symbiont genome. By contrast, the host trypanosomatid genome contains fewer genes, about half of which originated from different bacterial groups, perhaps only one of which (ornithine cyclodeaminase, EC:4.3.1.12) derived from the symbiont. Nutritional, enzymatic, and genomic data were jointly analyzed to construct an integrated view of essential amino acid metabolism in symbiont-harboring trypanosomatids. This comprehensive analysis showed perfect concordance among all these data and revealed that the symbiont contains genes for enzymes that complete essential biosynthetic routes for host amino acid production, thus explaining the low requirement for these elements in symbiont-harboring trypanosomatids. Phylogenetic analyses show that the cooperation between symbionts and their hosts is complemented by multiple horizontal gene transfers from bacterial lineages to trypanosomatids, which occurred several times in the course of their evolution. Transfers occur preferentially in parts of the pathways that are missing from other eukaryotes.
Conclusion: We have herein uncovered the genetic and evolutionary bases of essential amino acid biosynthesis in several trypanosomatids with and without endosymbionts, explaining and complementing decades of experimental results. We uncovered remarkable plasticity in the evolution of essential amino acid biosynthesis pathways in these protozoans, demonstrating the heavy influence of horizontal gene transfer events, from Bacteria to trypanosomatid nuclei, on the evolution of these pathways.
Abstract:
The tumorigenesis of pituitary adenomas is poorly understood. Mutations of the PIK3CA proto-oncogene, which encodes the p110-α catalytic subunit of PI3K, have been reported in various types of human cancers, reflecting the role of the gene in cell proliferation and survival through activation of the PI3K/Akt signaling pathway. Only one Chinese study has described somatic mutations and amplification of the PIK3CA gene in a large series of pituitary adenomas. The aim of the present study was to determine genetic alterations of PIK3CA in a second series that consisted of 33 pituitary adenomas of different subtypes diagnosed by immunohistochemistry: 6 adrenocorticotropic hormone-secreting microadenomas, 5 growth hormone-secreting macroadenomas, 7 prolactin-secreting macroadenomas, and 15 nonfunctioning macroadenomas. Direct sequencing of exons 9 and 20 was employed to investigate the presence of mutations, and qPCR to assess genomic amplification, defined as a copy number ≥4. Previously identified PIK3CA mutations (exon 20) were detected in four cases (12.1%). Interestingly, the Chinese study reported mutations only in invasive tumors, while we found a PIK3CA mutation in one noninvasive corticotroph microadenoma. PIK3CA amplification was observed in 21.2% (7/33) of the cases. This study demonstrates the presence of somatic mutations and amplifications of the PIK3CA gene in a second series of pituitary adenomas, corroborating the previously described involvement of the PI3K/Akt signaling pathway in the tumorigenic process of this gland.
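The frequencies quoted in this abstract can be re-derived from the counts it gives. The following is only a minimal sketch of that arithmetic: the subtype counts, the four mutated cases, the seven amplified cases and the copy-number threshold of 4 come from the abstract, while the function and variable names are hypothetical and no raw qPCR data are assumed.

```python
# Subtype counts as listed in the abstract (series of pituitary adenomas).
subtype_counts = {
    "ACTH-secreting microadenoma": 6,
    "GH-secreting macroadenoma": 5,
    "prolactin-secreting macroadenoma": 7,
    "nonfunctioning macroadenoma": 15,
}

# Amplification is defined in the abstract as a copy number >= 4.
AMPLIFICATION_THRESHOLD = 4


def is_amplified(copy_number: float) -> bool:
    """Hypothetical helper: apply the abstract's copy-number cutoff."""
    return copy_number >= AMPLIFICATION_THRESHOLD


def percent(count: int, total: int) -> float:
    """Share of cases as a percentage, rounded to one decimal place."""
    return round(100 * count / total, 1)


total_cases = sum(subtype_counts.values())
mutation_rate = percent(4, total_cases)       # four mutated cases
amplification_rate = percent(7, total_cases)  # seven amplified cases

print(total_cases, mutation_rate, amplification_rate)
```

The subtype counts sum to the stated series size, and the two percentages match the values reported in the abstract.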
Abstract:
Human endogenous retroviruses (HERVs) arise from ancient infections of host germline cells by exogenous retroviruses and constitute 8% of the human genome. Elevated levels of envelope transcripts from HERV-W have been detected in CSF, plasma and brain tissues from patients with multiple sclerosis (MS), most of them from loci at Xq22.3, 15q21.3, and 6q21. However, since the Xq22.3 locus (ERVWE2) lacks the 5' LTR promoter and the putative protein should be truncated by a stop codon, we investigated the ERVWE2 genomic locus in 84 individuals, including MS patients with active HERV-W expression detected in PBMC. In addition, an automated search for promoter sequences was performed in the 20 kb region near the ERVWE2 reference sequence. Several putative binding sites for cellular cofactors and enhancers were found, suggesting that transcription may occur via alternative promoters. However, ERVWE2 DNA sequencing of MS patients and healthy individuals revealed that all of them harbor a stop codon at site 39, precluding the expression of a full-length protein. Finally, since plaque formation in the central nervous system (CNS) of MS patients is attributed to immunological mechanisms triggered by autoimmune attack against myelin, we also investigated the level of similarity between the envelope protein and myelin oligodendrocyte glycoprotein (MOG). Comparison of MOG with the envelope identified five retroviral regions similar to the Ig-like domain of MOG. Interestingly, one of them includes T and B cell epitopes capable of inducing T effector functions and circulating antibodies in rats. In sum, although no DNA substitutions that would link ERVWE2 to MS pathogenesis were found, the similarity between the envelope protein and MOG supports the idea that ERVWE2 may be involved in the immunopathogenesis of MS, perhaps by facilitating recognition of MOG by the immune system.
Although experimental evidence is still awaited, the data presented here may expand the scope of endogenous retrovirus involvement in MS pathogenesis.
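The check for a premature stop codon described in this abstract can be illustrated with a short scan over an open reading frame. This is a minimal sketch, not the authors' method: it assumes "site 39" means the 39th codon of the reading frame, uses the standard genetic code's three stop codons, and runs on a made-up toy sequence rather than the real ERVWE2 sequence. The function name is hypothetical.

```python
# Stop codons of the standard genetic code (DNA alphabet).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def first_stop_codon(cds: str):
    """Return the 1-based codon position of the first in-frame stop codon.

    The sequence is read in frame from its first base; None is returned
    when no stop codon is found.
    """
    cds = cds.upper()
    for start in range(0, len(cds) - 2, 3):
        if cds[start:start + 3] in STOP_CODONS:
            return start // 3 + 1
    return None


# Toy sequence (NOT the real locus): start codon, 37 alanine codons,
# a stop at codon 39, then a few trailing codons.
toy_cds = "ATG" + "GCT" * 37 + "TGA" + "GCT" * 5

print(first_stop_codon(toy_cds))
```

Running the same scan over each individual's sequenced allele would show whether the truncating codon is shared by all samples, which is the observation the abstract reports.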
Abstract:
Some non-pathogenic trypanosomatids maintain a mutualistic relationship with a betaproteobacterium of the Alcaligenaceae family. Intensive nutritional exchanges have been reported between the two partners, indicating that these protozoa are excellent biological models to study metabolic co-evolution. We previously sequenced, and herein investigate, the entire genomes of five trypanosomatids that harbor a symbiotic bacterium (SHTs, for Symbiont-Harboring Trypanosomatids) and of the respective bacteria (TPEs, for Trypanosomatid Proteobacterial Endosymbionts), as well as of two trypanosomatids without symbionts (RTs, for Regular Trypanosomatids), for the presence of genes of the classical pathways for vitamin biosynthesis. Our data show that genes for the biosynthetic pathways of thiamine, biotin, and nicotinic acid are absent from all trypanosomatid genomes. This is in agreement with the absolute growth requirement for these vitamins in all protozoa of the family. Also absent from the genomes of RTs are the genes for the synthesis of pantothenic acid, folic acid, riboflavin, and vitamin B6. This is also in agreement with the available data showing that RTs are auxotrophic for these essential vitamins. On the other hand, SHTs are autotrophic for such vitamins. Indeed, all the genes of the corresponding biosynthetic pathways were identified, most of them in the symbiont genomes, while a few genes, mostly of eukaryotic origin, were found in the host genomes. The only exceptions to the latter are the gene coding for the enzyme ketopantoate reductase (EC:1.1.1.169), which is related instead to the Firmicutes bacteria, and two other genes, one involved in the salvage pathway of pantothenic acid and the other in the synthesis of ubiquinone, that are related to Gammaproteobacteria. Their presence in trypanosomatids may result from lateral gene transfer.
Taken together, our results reinforce the idea that the low nutritional requirement of SHTs is associated with the presence of the symbiotic bacterium, which contains most genes for vitamin production.
Abstract:
Recombination is a significant factor driving genomic evolution, but it is not well understood in Dengue virus. We used phylogenetic methods to search for recombination in 636 Dengue virus type 3 (DENV-3) genomes and unveiled complex recombination patterns in two strains, which appear to be the outcome of recombination between genotype II and genotype I parental DENV-3 lineages. Our findings of genomic mosaic structures suggest that strand switching during RNA synthesis may be involved in the generation of genetic diversity in dengue viruses.
Viruses in the marine environment: community dynamics, phage-host interactions and genomic structure
Abstract:
There are an estimated 10^30 viruses in the world's oceans, the majority of which are phages (viruses that infect bacteria). Extensive research has demonstrated the significant influence of marine phages on microbial abundance, community structure, genetic exchange and global biogeochemical cycles. In this thesis, we contribute to the knowledge of the ecological role of viruses in marine systems, and we also aim to provide a better understanding of the interactions between phages and their hosts and of the genetic pool and biogeography of some of the isolated phage genomes.
Abstract:
Nandrolone and other anabolic androgenic steroids (AAS) at elevated concentrations can alter the expression and function of neurotransmitter systems and contribute to neuronal cell death. This effect may explain the behavioural changes, drug dependence and neurodegeneration observed in steroid abusers. Nandrolone treatment (10^-8 M to 10^-5 M) caused a time- and concentration-dependent downregulation of mu opioid receptor (MOPr) transcripts in SH-SY5Y human neuroblastoma cells. This effect was prevented by the androgen receptor (AR) antagonist hydroxyflutamide. Receptor binding assays confirmed a decrease in MOPr of approximately 40% in nandrolone-treated cells. Treatment with actinomycin D (10^-5 M), a transcription inhibitor, revealed that nandrolone may regulate MOPr mRNA stability. In SH-SY5Y cells transfected with a human MOPr luciferase promoter/reporter construct, nandrolone did not alter the rate of gene transcription. These results suggest that nandrolone may regulate MOPr expression through post-transcriptional mechanisms requiring the AR. Cytotoxicity assays demonstrated a time- and concentration-dependent decrease in cell viability in SH-SY5Y cells exposed to steroids (10^-6 M to 10^-4 M). This toxic effect is independent of activation of the AR and the sigma-2 receptor. An increase in caspase-3 activity was observed in cells treated with 10^-6 M nandrolone for 48 h. Collectively, these data support the existence of two cellular mechanisms that might explain the neurological syndromes observed in steroid abusers.
Abstract:
This study provides a comprehensive genetic overview of the endangered Italian wolf population. In particular, it focuses on two research lines. On the one hand, we focused on melanism in the wolf in order to isolate a mutation related to black coat colour in canids. With several reported black individuals (an exception at the European level), the Italian wolf population constituted a challenging research field posing many unanswered questions. As found in North American wolves, we report that melanism in the Italian population is caused by a different melanocortin pathway component, the K locus, in which a beta-defensin protein acts as an alternative ligand for Mc1r. This research project was conducted in collaboration with Prof. Gregory Barsh, Department of Genetics and Paediatrics, Stanford University. On the other hand, we analysed a large number of SNPs using a customized canine microarray, which can complement or replace STR markers for genotyping individuals and detecting wolf-dog hybrids. Thanks to DNA microchip technology, we obtained a large amount of genetic data, which provides a solid base for future functional genomic studies. This study was undertaken in collaboration with Prof. Robert K. Wayne, Department of Ecology and Evolutionary Biology, University of California, Los Angeles (UCLA).
Abstract:
Apple consumption is highly recommended for a healthy diet, and apple is the most important fruit produced in temperate climate regions. Unfortunately, it is also one of the fruits that most often provokes allergy in atopic patients, and the only treatment available to date for these apple-allergic patients is avoidance. Apple allergy is due to the presence of four major classes of allergens: Mal d 1 (PR-10/Bet v 1-like proteins), Mal d 2 (thaumatin-like proteins), Mal d 3 (lipid transfer protein) and Mal d 4 (profilin). In this work, new advances in the characterization of apple allergen gene families were achieved using a multidisciplinary approach. First, a genomic approach was used to characterize the allergen gene families of Mal d 1 (Chapter 1) and of Mal d 2 and Mal d 4 (Chapter 5). In particular, in Chapter 1 the study of two large contiguous blocks of DNA sequence containing the Mal d 1 gene cluster on LG16 yielded many new findings on the number and orientation of genes in the cluster, their physical distances, their regulatory sequences and the presence of other genes or pseudogenes in this genomic region. Three new members were discovered co-localizing with the other Mal d 1 genes of LG16, suggesting that the complexity of the genetic basis of allergenicity will increase with new advances. Many retrotransposon elements were also found in this cluster. Thanks to the development of molecular markers on the two sequences, the physical and genetic maps of the region were successfully anchored. Moreover, in Chapter 5 the existence of loci for the thaumatin-like protein family in apple (Mal d 2.03 on LG4 and Mal d 2.02 on LG17) other than the one reported so far was demonstrated for the first time. In addition, one new locus for profilins (Mal d 4.04) was mapped on LG2, close to the Mal d 4.02 locus, suggesting a cluster organization for this gene family, as is well documented for the Mal d 1 family.
Second, a methodological approach was used to set up a highly specific tool to discriminate and quantify the expression of each Mal d 1 allergen gene (Chapter 2). In particular, a set of 20 Mal d 1 gene-specific primer pairs for quantitative real-time PCR was validated and optimized. As a first application, this tool was used on leaf and fruit tissues of the cultivar Florina to identify the Mal d 1 allergen genes that are expressed in different tissues. The differential expression found in this study revealed tissue specificity for some Mal d 1 genes: 10/20 Mal d 1 genes were expressed in fruits and are therefore probably more involved in allergic reactions, while 17/20 Mal d 1 genes were expressed in leaves challenged with the fungus Venturia inaequalis and are therefore probably of interest for the study of the plant defense mechanism. In Chapter 3 the specific expression levels of the 10 Mal d 1 isoallergen genes found to be expressed in fruits were studied for the first time in the skin and flesh of apples of different genotypes. A complex gene expression profile was obtained owing to the high gene, tissue and genotype variability. Despite this, the Mal d 1.06A and Mal d 1.07 expression patterns were particularly associated with the degree of allergenicity of the different cultivars. They were not the most highly expressed Mal d 1 genes in apple, but it is hypothesized here that both qualitative and quantitative aspects of Mal d 1 gene expression are important in determining allergenicity. In Chapter 4 a clear modulation of all 17 PR-10 genes tested in young leaves of Florina after challenge with the fungus V. inaequalis is reported, with a distinct expression profile for each gene. Interestingly, all the Mal d 1 genes were up-regulated except Mal d 1.10, which was down-regulated after challenge with the fungus.
The differences in direction, timing and magnitude of induction seem to confirm the hypothesis of subfunctionalization within the gene family despite high sequence and structure similarity. Moreover, the modulation of PR-10 genes shown in both compatible (Gala-V. inaequalis) and incompatible (Florina-V. inaequalis) interactions helps to validate the hypothesis of an indirect role for at least some of these proteins in the induced defense responses. Finally, a certain modulation of PR-10 transcripts found also in leaves treated with water confirms their ability to respond to abiotic stress as well. To conclude, the genomic approach used here made it possible to create a comprehensive inventory of all the genes of the allergen families, especially in the case of extended gene families like Mal d 1. This knowledge can be considered a basic prerequisite for many further studies. On the other hand, the specific transcriptional approach made it possible to evaluate the behavior of the Mal d 1 genes in different samples and conditions and, therefore, to speculate on their involvement in apple allergenicity. Considering the dual nature of Mal d 1 proteins, as apple allergens and as PR-10 proteins, the gene expression analysis upon fungal attack laid the basis for unraveling the biological functions of Mal d 1. In particular, the knowledge acquired in this work about the PR-10 genes putatively more involved in the specific Malus-V. inaequalis interaction will help, in the future, to drive apple breeding toward hypo-allergenic genotypes without compromising the plants' mechanisms of response to stress conditions. In the future, the survey of differences in allergenicity among cultivars has to be made more thorough by including other genotypes and allergic patients in the tests.
After this, allelic diversity analysis of the high- and low-allergenic cultivars across all the allergen genes, in particular those whose transcription levels correlate with allergenicity, will reveal the genetic background of the low-allergenic ones. This step from genes to alleles will allow the development of molecular markers that might be used to effectively direct apple breeding toward hypo-allergenicity. Another important step forward for the study of apple allergens will be the use of a specific proteomic approach, since apple allergy is a multifactorial disease and only an interdisciplinary and integrated approach can be effective for its prevention and treatment.
Abstract:
Part I: A zinc finger gene, Tzf1, was cloned in earlier work of the lab by screening a λ-DASH2 cDNA expression library with an anti-Rat SC antibody. A λ-DASH2 genomic DNA library and a cosmid lawrist 4 genomic DNA library were screened with the cDNA fragment of Tzf1 to determine the genomic organization of Tzf1. Another putative zinc finger gene, Tzf2, was found about 700 bp upstream of Tzf1. RACE experiments were carried out for both genes to establish the full-length cDNA. The cDNA sequences of Tzf and Tzf2 were used to search the Flybase (Version Nov, 2000). They correspond to two genes found in the Flybase, CG4413 and CG4936. The CG4413 transcript seems to be a splicing variant of Tzf transcripts. Another two zinc finger genes, Tzf3 and Tzf4, were discovered in silico. They are located 300 bp away from Tzf and Tzf2, and the four genes form a non-tandem cluster. All four genes encode proteins with a very similar modular structure, since they all have five C2H2-type zinc fingers at their C-terminal ends. This is the most compact zinc finger protein gene cluster found in Drosophila melanogaster. Part II: 34,056 bp insert of the cosmid 19G11
Abstract:
A comparative genomic sequence analysis of a region in human chromosome 11p15.3 and its homologous segment in mouse chromosome 7, between the ST5 and LMO1 genes, has been performed. A total of 158,201 bases were sequenced in the mouse and compared with the syntenic region in human, partially available in the public databases. The analysed region exhibits the typical eukaryotic genomic structure and, compared with the close neighbouring regions, strikingly reflects the mosaic distribution pattern of (G+C) and repeat content despite its relatively short size. Within this region the novel gene STK33 (Stk33 in the mouse), which codes for a serine/threonine kinase, was discovered. The finding of this gene constitutes an excellent example of the strength of the comparative sequencing approach. Poor gene predictions in the mouse genomic sequence were corrected and improved by comparison with the unordered data from the publicly available human genomic sequence. Phylogenetic analysis suggests that STK33 belongs to the calcium/calmodulin-dependent protein kinase group and seems to be a novelty in the chordate lineage. The gene as a whole seems to evolve under purifying selection, whereas some regions appear to be under strong positive selection. Both the human and mouse versions of serine/threonine kinase 33 consist of seventeen exons that are highly conserved in the coding regions, particularly in those coding for the core protein kinase domain. The exon/intron structure in the coding regions of the gene is also conserved between human and mouse. The existence and functionality of the gene is supported by the presence of entries in the EST databases and was fully confirmed in vivo by isolating specific transcripts from human uterus total RNA and from several mouse tissues. Strong evidence for alternative splicing was found, which may result in tissue-specific starting points of transcription and, to some extent, in different protein N-termini.
RT-PCR and hybridisation experiments suggest that STK33/Stk33 is differentially expressed in a few tissues and at relatively low levels. STK33 has been shown to be reproducibly down-regulated in tumor tissues, particularly in ovarian tumors. RNA in-situ hybridisation experiments using mouse Stk33-specific probes showed expression in dividing cells from lung and germinal epithelium and possibly also in macrophages from kidney and lungs. Preliminary experiments with antibodies designed in this work, performed in parallel with the preparation of this manuscript, seem to confirm this expression pattern. The fact that the chromosomal region 11p15, in which STK33 is located, may be associated with several human diseases, including tumor development, suggests that further investigation is necessary to establish the role of STK33 in human health.
Abstract:
Here I will focus on three main topics that best cover the projects I worked on during my three-year PhD, which I spent in different research laboratories addressing, both computationally and practically, important problems related to modern molecular genomics. The first topic is the use of a livestock species (the pig) as a model of obesity, a complex human dysfunction. My efforts here concern the detection and annotation of single nucleotide polymorphisms (SNPs). I developed a pipeline for mining human and porcine sequences. Starting from a set of human genes related to obesity, the platform returns a list of annotated porcine SNPs extracted from a new set of potential obesity genes. Of these SNPs, 565 were analyzed on an Illumina chip to test their involvement in obesity in a population of more than 500 pigs. Results will be discussed. All the computational analyses and experiments were done in collaboration with the Biocomputing group and Dr. Luca Fontanesi, respectively, under the direction of Prof. Rita Casadio at Bologna University, Italy. The second topic concerns developing a methodology, based on Factor Analysis, to simultaneously mine information from different levels of biological organization. With specific test cases we develop models of the complexity of the mRNA-miRNA molecular interaction in brain tumors, measured indirectly by microarray and quantitative PCR. This work was done under the supervision of Prof. Christine Nardini at the "CAS-MPG Partner Institute for Computational Biology" of Shanghai, China (co-founded jointly by the Max Planck Society and the Chinese Academy of Sciences). The third topic concerns the development of a new method to overcome the variety of PCR technologies routinely adopted to characterize unknown flanking DNA regions of a viral integration locus in the human genome after clinical gene therapy.
This new method is entirely based on next-generation sequencing; it reduces the time required to detect insertion sites and decreases the complexity of the procedure. This work was done in collaboration with the group of Dr. Manfred Schmidt at the Nationales Centrum für Tumorerkrankungen (Heidelberg, Germany), supervised by Dr. Annette Deichmann and Dr. Ali Nowrouzi. Furthermore, I add as an Appendix the description of an R package for gene network reconstruction that I helped to develop for scientific usage (http://www.bioconductor.org/help/bioc-views/release/bioc/html/BUS.html).
Abstract:
The 9p21 locus encodes important proteins involved in cell cycle regulation and apoptosis; it contains the p16/CDKN2A (cyclin-dependent kinase inhibitor 2A) tumor suppressor gene and two other related genes, p14/ARF and p15/CDKN2B. This locus is a major target of inactivation in the pathogenesis of a number of human tumors, both solid and haematologic, and is also a frequent site of loss or deletion in acute lymphoblastic leukemia (ALL), with reported frequencies ranging from 18% to 45%. In order to explore, at high resolution, the frequency and size of alterations affecting this locus in adult BCR-ABL1-positive ALL and to investigate their prognostic value, 112 patients (101 de novo and 11 relapse cases) were analyzed by genome-wide single nucleotide polymorphism arrays and deep exon sequencing of candidate genes. Paired diagnosis-relapse samples were also available and were analyzed for 19 (19%) cases. CDKN2A/ARF and CDKN2B genomic alterations were identified in 29% and 25% of newly diagnosed patients, respectively. Deletions were monoallelic in 72% of cases, and in 43% the minimal overlapping region of the lost area spanned only the CDKN2A/2B gene locus. Analysis at the time of relapse showed a borderline-significant increase in the detection rate of CDKN2A/ARF loss (47%) compared with diagnosis (p = 0.06). Point mutations within the 9p21 locus were found at a very low frequency, with only one non-synonymous substitution in exon 2 of CDKN2A. Finally, correlation with clinical outcome showed that deletions of CDKN2A/B are significantly associated with poor outcome in terms of overall survival (p = 0.0206), disease-free survival (p = 0.0010) and cumulative incidence of relapse (p = 0.0014). Inactivation of the 9p21 locus by genomic deletions is a frequent event in BCR-ABL1-positive ALL. Deletions are frequently acquired at leukemia progression and act as a poor prognostic marker.
Abstract:
Enterobacteriaceae genomes evolve through mutations, rearrangements and horizontal gene transfer (HGT). The latter evolutionary pathway works through the acquisition of DNA modules of foreign origin (genomic islands, GEIs) that enhance the fitness of the host in a given environment. The genome of E. coli IHE3034, a strain isolated from a case of neonatal meningitis, has recently been sequenced, and subsequent sequence analysis predicted 18 possible GEIs, of which 8 have not been previously described, 5 fully meet the pathogenicity island definition and at least 10 seem to be of prophage origin. In order to study the distribution of the GEIs of our reference strain, we screened a panel of 132 strains, representative of E. coli diversity, for the presence of the 18 GEIs. Also, using an inverse nested PCR approach, we identified 9 GEIs that can form an extrachromosomal circular intermediate (CI), together with their respective attachment sites (att). Further, we set up a qPCR approach that allowed us to determine the excision rates of 5 genomic islands in different growth conditions. Four islands, specific to strains belonging to the sequence type complex 95 (STC95), were deleted in order to assess their function in a Dictyostelium discoideum grazing assay. Overall, the distribution data presented here indicate that 16 IHE3034 GEIs are more strongly associated with the STC95 strains. In addition, the functional and genetic characterization showed that GEI 13, 17 and 19 are involved in resistance to phagocytosis by Dictyostelium discoideum, suggesting a possible role in the adaptation of the pathogen during certain stages of infection.