918 resultados para Genome duplication
Resumo:
Avian pathogenic Escherichia coli (APEC) infections are responsible for significant losses in the poultry industry worldwide. A zoonotic risk has been attributed to APEC strains because they present similarities to extraintestinal pathogenic E. coli (ExPEC) associated with illness in humans, mainly urinary tract infections and neonatal meningitis. Here, we present in silico analyses with pathogenic E. coli genome sequences, including recently available APEC genomes. The phylogenetic tree, based on multi-locus sequence typing (MLST) of seven housekeeping genes, revealed high diversity in the allelic composition. Nevertheless, despite this diversity, the phylogenetic tree was able to cluster the different pathotypes together. An in silico virulence gene profile was also determined for each of these strains, through the presence or absence of 83 well-known virulence genes/traits described in pathogenic E. coli strains. The MLST phylogeny and the virulence gene profiles demonstrated a certain genetic similarity between Brazilian APEC strains, APEC isolated in the United States, UPEC (uropathogenic E. coli) and diarrheagenic strains isolated from humans. This correlation corroborates and reinforces the zoonotic potential hypothesis proposed to APEC.
Resumo:
Presentation at Open Repositories 2014, Helsinki, Finland, June 9-13, 2014
Resumo:
Dense molecular genetic maps are used for an efficient quantitative trait loci (QTL) mapping and in the marker-assisted selection programs. A dense genetic map was generated with 139 microsatellite markers using 256 F2 plants generated by the crossing of two tropical maize inbred lines (L-02-03D and L-20-01F). This map presented 1,858.61 cM in length, where 10 linkage groups were found spanned, with an average interval of 13.47 cM between adjacent markers. Seventy seven percent of the maize genetic mapping bins were covered, which means an increase of 14% coverage in relation to the previous tropical maize maps. The results provide a more detailed and informative genetic map in a tropical maize population representing the first step to make possible the studies of genetic architecture to identify and map QTL and estimate their effects on the variation of quantitative traits, thus allowing the manipulation and use in tropical maize breeding programs.
Resumo:
Several human genetic syndromes have long been recognized to be defective in DNA repair mechanisms. This was first discovered by Cleaver (1968), who showed that cells from patients with xeroderma pigmentosum (XP) were defective for the ability to remove ultraviolet (UV)-induced lesions from their genome. Since then, new discoveries have promoted DNA repair studies to one of the most exciting areas of molecular biology. The present work intends to give a brief summary of the main known human genetic diseases related to DNA repair and how they may be linked to acquired diseases such as cancer
Resumo:
The Thr(118)Met substitution in the peripheral myelin protein 22 (PMP22) gene has been detected in a number of families with demyelinating Charcot-Marie-Tooth (CMT1) neuropathy or with the hereditary neuropathy with liability to pressure palsy, but in none of them has it consistently segregated with the peripheral neuropathy. We describe here a CMT1 family (a 63-year-old man, his brother and his niece) in which two mutations on different chromosomes were found in the PMP22 gene, the 17p duplication, detected by fluorescent semiquantitative polymerase chain reaction (PCR) of microsatellite markers localized within the duplicated region on chromosome 17p11.2-p12, and the Thr(118)Met substitution, detected by direct sequencing the four coding exons of the PMP22 gene. A genotype/phenotype correlation study showed that the neuropathy segregates with the duplication and that the amino acid substitution does not seem to modify the clinical characteristics or the severity of the peripheral neuropathy. We did not find any evidence to characterize this substitution as a polymorphism in the population studied and we propose that the high frequency reported for this point mutation in the literature suggests that the Thr(118)Met substitution may be a hotspot for mutations in the PMP22 gene.
Resumo:
We report novel features of the genome sequence of Leptospira interrogans serovar Copenhageni, a highly invasive spirochete. Leptospira species colonize a significant proportion of rodent populations worldwide and produce life-threatening infections in mammals. Genomic sequence analysis reveals the presence of a competent transport system with 13 families of genes encoding for major transporters including a three-member component efflux system compatible with the long-term survival of this organism. The leptospiral genome contains a broad array of genes encoding regulatory system, signal transduction and methyl-accepting chemotaxis proteins, reflecting the organism's ability to respond to diverse environmental stimuli. The identification of a complete set of genes encoding the enzymes for the cobalamin biosynthetic pathway and the novel coding genes related to lipopolysaccharide biosynthesis should bring new light to the study of Leptospira physiology. Genes related to toxins, lipoproteins and several surface-exposed proteins may facilitate a better understanding of the Leptospira pathogenesis and may serve as potential candidates for vaccine.
Resumo:
Genomics is expanding the horizons of epidemiology, providing a new dimension for classical epidemiological studies and inspiring the development of large-scale multicenter studies with the statistical power necessary for the assessment of gene-gene and gene-environment interactions in cancer etiology and prognosis. This paper describes the methodology of the Clinical Genome of Cancer Project in São Paulo, Brazil (CGCP), which includes patients with nine types of tumors and controls. Three major epidemiological designs were used to reach specific objectives: cross-sectional studies to examine gene expression, case-control studies to evaluate etiological factors, and follow-up studies to analyze genetic profiles in prognosis. The clinical groups included patients' data in the electronic database through the Internet. Two approaches were used for data quality control: continuous data evaluation and data entry consistency. A total of 1749 cases and 1509 controls were entered into the CGCP database from the first trimester of 2002 to the end of 2004. Continuous evaluation showed that, for all tumors taken together, only 0.5% of the general form fields still included potential inconsistencies by the end of 2004. Regarding data entry consistency, the highest percentage of errors (11.8%) was observed for the follow-up form, followed by 6.7% for the clinical form, 4.0% for the general form, and only 1.1% for the pathology form. Good data quality is required for their transformation into useful information for clinical application and for preventive measures. The use of the Internet for communication among researchers and for data entry is perhaps the most innovative feature of the CGCP. The monitoring of patients' data guaranteed their quality.
Resumo:
When compared to other model organisms whose genome is sequenced, the number of mutations identified in the mouse appears extremely reduced and this situation seriously hampers our understanding of mammalian gene function(s). Another important consequence of this shortage is that a majority of human genetic diseases still await an animal model. To improve the situation, two strategies are currently used: the first makes use of embryonic stem cells, in which one can induce knockout mutations almost at will; the second consists of a genome-wide random chemical mutagenesis, followed by screening for mutant phenotypes and subsequent identification of the genetic alteration(s). Several projects are now in progress making use of one or the other of these strategies. Here, we report an original effort where we mutagenized BALB/c males, with the mutagen ethylnitrosourea. Offspring of these males were screened for dominant mutations and a three-generation breeding protocol was set to recover recessive mutations. Eleven mutations were identified (one dominant and ten recessives). Three of these mutations are new alleles (Otop1mlh, Foxn1sepe and probably rodador) at loci where mutations have already been reported, while 4 are new and original alleles (carc, eqlb, frqz, and Sacc). This result indicates that the mouse genome, as expected, is far from being saturated with mutations. More mutations would certainly be discovered using more sophisticated phenotyping protocols. Seven of the 11 new mutant alleles induced in our experiment have been localized on the genetic map as a first step towards positional cloning.
Resumo:
Data on genome damage, lipid peroxidation, and levels of glutathione peroxidase (GPX) in newborns after transplacental exposure to xenobiotics are rare and insufficient for risk assessment. The aim of the current study was to analyze, in an animal model, transplacental genotoxicity, lipid peroxidation, and detoxification disturbances caused by the following drugs commonly prescribed to pregnant women: paracetamol, fluconazole, 5-nitrofurantoin, and sodium valproate. Genome damage in dams and their newborn pups transplacentally exposed to these drugs was investigated using the in vivo micronucleus (MN) assay. The drugs were administered to dams intraperitoneally in three consecutive daily doses between days 12 and 14 of pregnancy. The results were correlated, with detoxification capacity of the newborn pups measured by the levels of GPX in blood and lipid peroxidation in liver measured by malondialdehyde (HPLC-MDA) levels. Sodium valproate and 5-nitrofurantoin significantly increased MN frequency in pregnant dams. A significant increase in the MN frequency of newborn pups was detected for all drugs tested. This paper also provides reference levels of MDA in newborn pups, according to which all drugs tested significantly lowered MDA levels of newborn pups, while blood GPX activity dropped significantly only after exposure to paracetamol. The GPX reduction reflected systemic oxidative stress, which is known to occur with paracetamol treatment. The reduction of MDA in the liver is suggested to be an unspecific metabolic reaction to the drugs that express cytotoxic, in particular hepatotoxic, effects associated with oxidative stress and lipid peroxidation.
Resumo:
DNA methylation is essential in X chromosome inactivation and genomic imprinting, maintaining repression of XIST in the active X chromosome and monoallelic repression of imprinted genes. Disruption of the DNA methyltransferase genes DNMT1 and DNMT3B in the HCT116 cell line (DKO cells) leads to global DNA hypomethylation and biallelic expression of the imprinted gene IGF2 but does not lead to reactivation of XIST expression, suggesting thatXIST repression is due to a more stable epigenetic mark than imprinting. To test this hypothesis, we induced acute hypomethylation in HCT116 cells by 5-aza-2′-deoxycytidine (5-aza-CdR) treatment (HCT116-5-aza-CdR) and compared that to DKO cells, evaluating DNA methylation by microarray and monitoring the expression of XIST and imprinted genes IGF2, H19, and PEG10. Whereas imprinted genes showed biallelic expression in HCT116-5-aza-CdR and DKO cells, the XIST locus was hypomethylated and weakly expressed only under acute hypomethylation conditions, indicating the importance ofXIST repression in the active X to cell survival. Given that DNMT3A is the only active DNMT in DKO cells, it may be responsible for ensuring the repression of XIST in those cells. Taken together, our data suggest that XIST repression is more tightly controlled than genomic imprinting and, at least in part, is due to DNMT3A.
Resumo:
Ropinirole (ROP) is a dopamine agonist that has been used as therapy for Parkinson's disease. In the present study, we aimed to detect whether gene expression was modulated by ROP in SH-SY5Y cells. SH-SY5Y cell lines were treated with 10 µM ROP for 2 h, after which total RNA was extracted for whole genome analysis. Gene expression profiling revealed that 113 genes were differentially expressed after ROP treatment compared with control cells. Further pathway analysis revealed modulation of the phosphatidylinositol 3-kinase (PI3K) signaling pathway, with prominent upregulation of PIK3C2B. Moreover, batches of regulated genes, including PIK3C2B, were found to be located on chromosome 1. These findings were validated by quantitative RT-PCR and Western blot analysis. Our study, therefore, revealed that ROP altered gene expression in SH-SY5Y cells, and future investigation of PIK3C2B and other loci on chromosome 1 may provide long-term implications for identifying novel target genes of Parkinson's disease.
Resumo:
Personalized medicine will revolutionize our capabilities to combat disease. Working toward this goal, a fundamental task is the deciphering of geneticvariants that are predictive of complex diseases. Modern studies, in the formof genome-wide association studies (GWAS) have afforded researchers with the opportunity to reveal new genotype-phenotype relationships through the extensive scanning of genetic variants. These studies typically contain over half a million genetic features for thousands of individuals. Examining this with methods other than univariate statistics is a challenging task requiring advanced algorithms that are scalable to the genome-wide level. In the future, next-generation sequencing studies (NGS) will contain an even larger number of common and rare variants. Machine learning-based feature selection algorithms have been shown to have the ability to effectively create predictive models for various genotype-phenotype relationships. This work explores the problem of selecting genetic variant subsets that are the most predictive of complex disease phenotypes through various feature selection methodologies, including filter, wrapper and embedded algorithms. The examined machine learning algorithms were demonstrated to not only be effective at predicting the disease phenotypes, but also doing so efficiently through the use of computational shortcuts. While much of the work was able to be run on high-end desktops, some work was further extended so that it could be implemented on parallel computers helping to assure that they will also scale to the NGS data sets. Further, these studies analyzed the relationships between various feature selection methods and demonstrated the need for careful testing when selecting an algorithm. It was shown that there is no universally optimal algorithm for variant selection in GWAS, but rather methodologies need to be selected based on the desired outcome, such as the number of features to be included in the prediction model. It was also demonstrated that without proper model validation, for example using nested cross-validation, the models can result in overly-optimistic prediction accuracies and decreased generalization ability. It is through the implementation and application of machine learning methods that one can extract predictive genotype–phenotype relationships and biological insights from genetic data sets.
Resumo:
Lichens are symbiotic organisms, which consist of the fungal partner and the photosynthetic partner, which can be either an alga or a cyanobacterium. In some lichen species the symbiosis is tripartite, where the relationship includes both an alga and a cyanobacterium alongside the primary symbiont, fungus. The lichen symbiosis is an evolutionarily old adaptation to life on land and many extant fungal species have evolved from lichenised ancestors. Lichens inhabit a wide range of habitats and are capable of living in harsh environments and on nutrient poor substrates, such as bare rocks, often enduring frequent cycles of drying and wetting. Most lichen species are desiccation tolerant, and they can survive long periods of dehydration, but can rapidly resume photosynthesis upon rehydration. The molecular mechanisms behind lichen desiccation tolerance are still largely uncharacterised and little information is available for any lichen species at the genomic or transcriptomic level. The emergence of the high-throughput next generation sequencing (NGS) technologies and the subsequent decrease in the cost of sequencing new genomes and transcriptomes has enabled non-model organism research on the whole genome level. In this doctoral work the transcriptome and genome of the grey reindeer lichen, Cladonia rangiferina, were sequenced, de novo assembled and characterised using NGS and traditional expressed sequence tag (EST) technologies. RNA extraction methods were optimised to improve the yield and quality of RNA extracted from lichen tissue. The effects of rehydration and desiccation on C. rangiferina gene expression on whole transcriptome level were studied and the most differentially expressed genes were identified. The secondary metabolites present in C. rangiferina decreased the quality – integrity, optical characteristics and utility for sensitive molecular biological applications – of the extracted RNA requiring an optimised RNA extraction method for isolating sufficient quantities of high-quality RNA from lichen tissue in a time- and cost-efficient manner. The de novo assembly of the transcriptome of C. rangiferina was used to produce a set of contiguous unigene sequences that were used to investigate the biological functions and pathways active in a hydrated lichen thallus. The de novo assembly of the genome yielded an assembly containing mostly genes derived from the fungal partner. The assembly was of sufficient quality, in size similar to other lichen-forming fungal genomes and included most of the core eukaryotic genes. Differences in gene expression were detected in all studied stages of desiccation and rehydration, but the largest changes occurred during the early stages of rehydration. The most differentially expressed genes did not have any annotations, making them potentially lichen-specific genes, but several genes known to participate in environmental stress tolerance in other organisms were also identified as differentially expressed.
Resumo:
Adenoviruses are non-enveloped icosahedral-shaped particles which possess a double-stranded DNA genome. Currently, nearly 100 serotypes of adenoviruses have been identified, 48 of which are of human origin. Bovine adenoviruses (BAVs), causing both mild respiratory and/or enteral diseases in cattle, have been reported in many countries all over the world. Currently, nine serotypes of SAVs have been isolated which have been placed into two subgroups based on a number of characteristics which include complement fixation tests as well as the ability to replicate in various cell lines. Bovine adenovirus type 2 (BAV2), belonging to subgroup I, is able to cause pneumonia as well as pneumonic-like symptoms in calves. In this study, the genome of BAV2 (strain No. 19) was subcloned into the plasmid vector pUC19. In total, 16 plasmids were constructed; three carry internal San fragments (spanning 3.1 to 65.2% ), and 10 carry internal Pstl fragments (spanning 4.9 to 97.4%), of the viral genome. Each of these plasmids was analyzed using twelve restriction endonucleases; BamHI, CiaI, EcoRl, HiOOlll, Kpnl, Noll, NS(N, Ps~, Pvul, Saj, Xbal, and Xhol. Terminal end fragments were also cloned and analyzed, sUbsequent to the removal of the 5' terminal protein, in the form of 2 BamHI B fragments, cloned in opposite orientations (spanning 0 to 18.1°k), and one Pstll fragment (spanning 97.4 to 1000/0). These cloned fragments, along with two other plasmids previously constructed carrying internal EcoRI fragments (spanning 20.6 to 90.5%), were then used to construct a detailed physical restriction map using the twelve restriction endonucleases, as well as to estimate the size of the genome for BAV2(32.5 Kbp). The DNA sequences of the early region 1 (E1) and hexon-associated gene (protein IX) have also been determined. The amino acid sequences of four open reading frames (ORFs) have been compared to those of the E1 proteins and protein IX from other Ads.
Resumo:
Genome sequence varies in numerous ways among individuals although the gross architecture is fixed for all humans. Retrotransposons create one of the most abundant structural variants in the human genome and are divided in many families, with certain members in some families, e.g., L1, Alu, SVA, and HERV-K, remaining active for transposition. Along with other types of genomic variants, retrotransponson-derived variants contribute to the whole spectrum of genome variants in humans. With the advancement of sequencing techniques, many human genomes are being sequenced at the individual level, fueling the comparative research on these variants among individuals. In this thesis, the evolution and functional impact of structural variations is examined primarily focusing on retrotransposons in the context of human evolution. The thesis comprises of three different studies on the topics that are presented in three data chapters. First, the recent evolution of all human specific AluYb members, representing the second most active subfamily of Alus, was tracked to identify their source/master copy using a novel approach. All human-specific AluYb elements from the reference genome were extracted, aligned with one another to construct clusters of similar copies and each cluster was analyzed to generate the evolutionary relationship between the members of the cluster. The approach resulted in identification of one major driver copy of all human specific Yb8 and the source copy of the Yb9 lineage. Three new subfamilies within the AluYb family – Yb8a1, Yb10 and Yb11 were also identified, with Yb11 being the youngest and most polymorphic. Second, an attempt to construct a relation between transposable elements (TEs) and tandem repeats (TRs) was made at a genome-wide scale for the first time. Upon sequence comparison, positional cross-checking and other relevant analyses, it was observed that over 20% of all TRs are derived from TEs. This result established the first connection between these two types of repetitive elements, and extends our appreciation for the impact of TEs on genomes. Furthermore, only 6% of these TE-derived TRs follow the already postulated initiation and expansion mechanisms, suggesting that the others are likely to follow a yet-unidentified mechanism. Third, by taking a combination of multiple computational approaches involving all types of genetic variations published so far including transposable elements, the first whole genome sequence of the most recent common ancestor of all modern human populations that diverged into different populations around 125,000-100,000 years ago was constructed. The study shows that the current reference genome sequence is 8.89 million base pairs larger than our common ancestor’s genome, contributed by a whole spectrum of genetic mechanisms. The use of this ancestral reference genome to facilitate the analysis of personal genomes was demonstrated using an example genome and more insightful recent evolutionary analyses involving the Neanderthal genome. The three data chapters presented in this thesis conclude that the tandem repeats and transposable elements are not two entirely distinctly isolated elements as over 20% TRs are actually derived from TEs. Certain subfamilies of TEs themselves are still evolving with the generation of newer subfamilies. The evolutionary analyses of all TEs along with other genomic variants helped to construct the genome sequence of the most recent common ancestor to all modern human populations which provides a better alternative to human reference genome and can be a useful resource for the study of personal genomics, population genetics, human and primate evolution.