4 results for Genomic selection
in DigitalCommons@The Texas Medical Center
Abstract:
Murine sarcoma viruses constitute a class of replication-defective retroviruses. These viruses can induce cellular transformation in vitro, and animals infected with them may develop fibrosarcomas in vivo (Tooze, 1973; Bishop, 1978). Hybridization studies suggest that murine sarcoma viruses arose by recombination between nondefective murine leukemia virus sequences and certain cellular sequences present in uninfected mouse cells (Hu et al., 1977). A specific gene product, however, has not been implicated in murine sarcoma virus transformation. One line of murine sarcoma virus-producing cells, Mo-MuSV-clone 124 (Ball et al., 1973), was studied biochemically because it mainly produces the sarcoma virus as a pseudotype packaged with helper murine leukemia virus proteins. The sarcoma viral RNA was translated in a sophisticated cell-free protein synthesizing system (Murphy and Arlinghaus, 1978). The translation products were analyzed by a number of techniques, including electrophoresis in denaturing SDS-polyacrylamide gels, immunoprecipitation, and peptide mapping. The major products of the total RNA purified from the virus preparation were shown to have molecular weights of about 63,000 (P63(gag)), 42,000 (P42), 40,000 (P40), 38,000 (P38), and 23,000 (P23). The size class of mRNA coding for each of the cell-free products was estimated using a poly(A) selection technique and sucrose gradient fractionation. These analyses were used to localize, within the sarcoma virus genome, the coding information for each of the products synthesized in the cell-free system. The major findings of these studies were: (1) the 5' half of the sarcoma viral RNA codes for the 63,000 dalton polypeptide and the 42,000 to 38,000 dalton polypeptides derived from the "gag" gene; and (2) the 3' half of the sarcoma viral RNA codes for a 38,000 dalton polypeptide possibly derived from the acquired cellular sequences.
Abstract:
Many human diseases are associated with the expansion of DNA repeats. Myotonic dystrophy type 2 (DM2) is one such disease, characterized by expansions of a (CCTG)•(CAGG) repeat tract in intron 1 of zinc finger protein 9 (ZNF9) on chromosome 3q21.3. The DM2 repeat tract contains a flanking region 5' to the tract that consists of a polymorphic repetitive sequence, (TG)14-25(TCTG)4-11(CCTG)n. The (CCTG)•(CAGG) repeat is typically 11-26 repeats long in persons without the disease, but can expand up to 11,000 repeats in affected individuals, which is the largest expansion seen in DNA repeat diseases to date. This DNA tract remains one of the least characterized disease-associated DNA repeats, and the mechanisms causing the repeat expansion in humans have yet to be elucidated. Alternative, non-B-DNA structures formed by the expanded repeats are typical in DNA repeat expansion diseases. These sequences may promote instability of the repeat tracts. I determined that slipped strand structure formation occurs for (CCTG)•(CAGG) repeats at a length of 42 or more. In addition, Z-DNA structure forms in the flanking human sequence adjacent to the (CCTG)•(CAGG) repeat tract. I have also performed genetic assays in E. coli cells, and the results indicate that the (CCTG)•(CAGG) repeats are more similar to the highly unstable (CTG)•(CAG) repeat tracts seen in Huntington's disease and myotonic dystrophy type 1 than to the more stable (ATTCT)•(AGAAT) repeat tracts of spinocerebellar ataxia type 10. This instability, however, is RecA-independent in the (CCTG)•(CAGG) and (ATTCT)•(AGAAT) repeats, whereas it is RecA-dependent in the (CTG)•(CAG) repeats. Structural studies of the (CCTG)•(CAGG) repeat tract and the flanking sequence, as well as genetic selection assays, may reveal the mechanisms responsible for the repeat instability in E. coli, and this may lead to a better understanding of the mechanisms contributing to the human disease state.
Abstract:
The basis for the recent transition of Enterococcus faecium from a primarily commensal organism to one of the leading causes of hospital-acquired infections in the United States is not yet understood. To address this, the first part of my project assessed isolates from early outbreaks in the USA and South America using sequence analysis, colony hybridizations, and minimal inhibitory concentrations (MICs), which showed that clinical isolates possess virulence and antibiotic resistance determinants that are less abundant or lacking in community isolates. I also revealed that the level of ampicillin resistance increased over time in clinical strains. By sequencing the pbp5 gene, I demonstrated an ~5% difference in the pbp5 gene between strains with MICs <4 µg/ml and those with MICs >4 µg/ml, but no specific sequence changes correlated with increases in MICs within the latter group. A 3-10% nucleotide difference was also seen in three other genes analyzed, which suggested the existence of two distinct subpopulations of E. faecium. This led to the second part of my project, in which analyses of concatenated core gene sequences, SNPs, 16S rRNA, and phylogenetics of 21 E. faecium genomes confirmed two distinct clades: a community-associated (CA) clade and a hospital-associated (HA) clade. Molecular clock calculations indicate that these two clades likely diverged ~300,000 to >1 million years ago, long before the modern antibiotic era. Genomic analysis also showed that, in addition to core genomic differences, HA E. faecium harbor specific accessory genetic elements that may confer selection advantages over CA E. faecium. The third part of my project discovered 6 E. faecium genes with the newly identified "WxL" domain. My analyses, using RT-PCR, western blots, patient sera, whole-cell ELISA, and immunogold electron microscopy, indicated that E. faecium WxL genes exist in operons and encode proteins localized to the bacterial cell surface, and that WxL proteins are antigenic in humans and more exposed on the surface of clinical isolates than of community isolates (even though they are ubiquitous in both clades). ELISAs and BIAcore analyses also showed that proteins encoded by these operons bind to several different host extracellular matrix proteins, as well as to each other, suggesting a novel cell-surface complex. In summary, my studies provide new insights into the evolution of E. faecium by showing that there are two distantly related clades, one being more successful in the hospital setting. My studies also identified operons encoding WxL proteins whose characteristics could contribute to colonization and virulence within this species.
Abstract:
The genomic era, brought about by recent advances in next-generation sequencing technology, has made genome-wide scans for natural selection a reality. Currently, almost all statistical tests and analytical methods for identifying genes under selection are performed on an individual-gene basis. Although these methods have the power to identify genes subject to strong selection, they have limited power to discover genes targeted by moderate or weak selection forces, which are crucial for understanding the molecular mechanisms of complex phenotypes and diseases. The recent availability and rapidly improving completeness of many gene network and protein-protein interaction databases accompanying the genomic era open avenues for enhancing the power to discover genes under natural selection. The aim of the thesis is to explore and develop normal mixture model based methods that leverage gene network information to enhance the power of discovering natural selection target genes. The results show that the developed statistical method, which combines the posterior log odds of the standard normal mixture model and the Guilt-By-Association score of the gene network in a naïve Bayes framework, has the power to discover genes under moderate or weak selection that bridge the genes under strong selection, and that it aids our understanding of the biology underlying complex diseases and related natural selection phenotypes.
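As a rough illustration of the combination described in the abstract above, the following Python sketch adds a network term to the posterior log odds of a two-component normal mixture. It is not the thesis's implementation. The mixture parameters (`pi_sel`, `mu_sel`, `sigma_sel`), the pseudocount, the way the Guilt-By-Association score is converted into a log ratio, and the toy network with its gene names are all assumptions made for illustration only.

```python
import math


def normal_pdf(x, mu, sigma):
    """Density of a normal distribution with mean mu and std. dev. sigma."""
    return math.exp(-0.5 * ((x - mu) / sigma) ** 2) / (sigma * math.sqrt(2.0 * math.pi))


def mixture_log_odds(z, pi_sel=0.1, mu_sel=2.0, sigma_sel=1.0):
    """Posterior log odds of 'selected' vs 'neutral' for a per-gene z-score.

    Neutral genes are modelled as N(0, 1); selected genes as
    N(mu_sel, sigma_sel). All parameter values are assumed, not estimated.
    """
    selected = pi_sel * normal_pdf(z, mu_sel, sigma_sel)
    neutral = (1.0 - pi_sel) * normal_pdf(z, 0.0, 1.0)
    return math.log(selected / neutral)


def gba_score(gene, network, seeds):
    """Guilt-By-Association score: fraction of a gene's neighbours that are seeds."""
    neighbours = network.get(gene, set())
    if not neighbours:
        return 0.0
    return len(neighbours & seeds) / len(neighbours)


def combined_log_odds(gene, z, network, seeds, pseudo=0.05):
    """Naive-Bayes-style combination: the two evidence terms are simply added.

    The network term compares the gene's GBA score with the background
    fraction of seed genes; this conversion is a hypothetical choice.
    """
    background = len(seeds) / len(network)
    network_term = math.log((gba_score(gene, network, seeds) + pseudo) / (background + pseudo))
    return mixture_log_odds(z) + network_term


# Hypothetical toy network: gene C links the two strongly selected genes A and B.
network = {
    "A": {"C"},
    "B": {"C"},
    "C": {"A", "B"},
    "D": {"E"},
    "E": {"D"},
}
seeds = {"A", "B"}  # genes assumed to be under strong selection

score_c = combined_log_odds("C", 1.2, network, seeds)
score_d = combined_log_odds("D", 1.2, network, seeds)
```

In this toy setup, genes C and D have the same moderate z-score, so the mixture model alone cannot separate them. The network term raises the score of C, whose neighbours are both seed genes, and lowers the score of D, whose only neighbour is not.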