16 results for The cancer genome atlas

at Brock University, Canada


Relevance:

100.00% 100.00%

Publisher:

Abstract:

Sequence repeats are an important phenomenon in the human genome, playing important roles in genomic alteration, often with phenotypic consequences. The two major types of repeat elements in the human genome are tandem repeats (TRs), including microsatellites, minisatellites, and satellites, and transposable elements (TEs). So far, very little is known about the relationship between these two types of repeats. In this study, we identified TRs that are derived from TEs based on either sequence similarity or overlapping genomic positions. We then analyzed the distribution of these TRs among TE families/subfamilies. Our study shows that at least 7,276 TRs, or 23% of all minisatellites/satellites, are derived from TEs, contributing ∼0.32% of the human genome. TRs seem more likely to be generated from younger/more active TEs, and once initiated they are expanded over time via local duplication of the repeat units. The currently postulated mechanisms for the origin of TRs can explain only 6% of all TE-derived TRs, indicating the presence of one or more yet-to-be-identified mechanisms for the initiation of such repeats. Our results suggest that TEs contribute to genome expansion and alteration not only by transposition but also by generating tandem repeats.
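The positional criterion mentioned in this abstract (TRs whose genomic positions overlap TEs) can be illustrated with a small interval check. This is only a sketch under stated assumptions: the coordinates, element names, and the half-open interval convention are hypothetical, and the abstract does not say what overlap threshold or tooling the study actually used.

```python
def overlaps(a_start, a_end, b_start, b_end):
    """True if half-open intervals [a_start, a_end) and [b_start, b_end) share a base."""
    return a_start < b_end and b_start < a_end


def te_overlapping_trs(trs, tes):
    """Return names of TRs that overlap at least one TE on the same chromosome.

    Each record is (chrom, start, end, name). Any overlap counts here;
    that rule is an assumption made for illustration.
    """
    hits = []
    for chrom, start, end, name in trs:
        if any(c == chrom and overlaps(start, end, s, e) for c, s, e, _ in tes):
            hits.append(name)
    return hits


# Hypothetical toy annotations, not data from the study.
trs = [
    ("chr1", 100, 160, "TR_a"),
    ("chr1", 500, 540, "TR_b"),
    ("chr2", 100, 160, "TR_c"),
]
tes = [("chr1", 90, 400, "TE_x")]

print(te_overlapping_trs(trs, tes))
```

The nested scan is quadratic; for genome-scale annotations a sorted sweep or an interval tree would be the usual choice.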

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Lung cancer is a major chronic disease responsible for the highest mortality rate among all types of cancer, and represents 29% of all deaths in Canada. The clinical diagnosis of lung carcinoma still relies on a standard diagnostic approach, as there are no symptoms in its early stage. Therefore, it is usually diagnosed at a later stage, when the survival rate is low. With the recent advancement in molecular biology and biotechnology, a molecular biomarker approach for the diagnosis of early lung cancer seems to be a potential option. In this study, we aimed to investigate and standardize a promising lung cancer biomarker by studying the aberrant methylation of two tumour suppressor genes, namely RASSF1A and RAR-β, and the miRNA profiling of four commonly deregulated miRNAs (miR-199a-3p, miR-182, miR-100 and miR-221). Four lung cancer cell lines were used (two SCLC and two NSCLC), with comparisons being made with normal lung cell lines. We found that neither of these genes was methylated. We then evaluated TP53, and found the promoter of this gene to be methylated in the cancer cell lines, as compared to the normal cell lines, indicating gene inactivation. We carried out miRNA profiling of the cancer cell lines and found that 80 miRNAs are deregulated in lung cancer cell lines as compared to the normal cell lines. Our study was the first of its kind to indicate that hsa-mir-4301, hsa-mir-4707-5p and hsa-mir-4497 (newly discovered miRNAs) are deregulated in lung cancer cell lines. We also investigated miR-199a-3p, miR-100 and miR-182, and found that miR-199a-3p and miR-100 were down-regulated in the cancer cell lines, whereas miR-182 was up-regulated. In the final part of the study we observed that miR-221 could be a putative biomarker to distinguish between the two types of lung cancer because it was down-regulated in the SCLC cell lines and up-regulated in the NSCLC cell lines.
In conclusion, we found four miRNA molecular biomarkers that could possibly be used in the early diagnosis of lung cancer. More studies with larger numbers of samples are still required to establish these as molecular biomarkers for the diagnosis of lung cancer.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Genome sequence varies in numerous ways among individuals, although the gross architecture is fixed for all humans. Retrotransposons create one of the most abundant classes of structural variants in the human genome and are divided into many families, with certain members in some families, e.g., L1, Alu, SVA, and HERV-K, remaining active for transposition. Along with other types of genomic variants, retrotransposon-derived variants contribute to the whole spectrum of genome variants in humans. With the advancement of sequencing techniques, many human genomes are being sequenced at the individual level, fueling comparative research on these variants among individuals. In this thesis, the evolution and functional impact of structural variations are examined, focusing primarily on retrotransposons in the context of human evolution. The thesis comprises three different studies, presented in three data chapters. First, the recent evolution of all human-specific AluYb members, representing the second most active subfamily of Alus, was tracked to identify their source/master copy using a novel approach. All human-specific AluYb elements from the reference genome were extracted and aligned with one another to construct clusters of similar copies, and each cluster was analyzed to determine the evolutionary relationship between its members. The approach resulted in the identification of one major driver copy of all human-specific Yb8 elements and the source copy of the Yb9 lineage. Three new subfamilies within the AluYb family (Yb8a1, Yb10 and Yb11) were also identified, with Yb11 being the youngest and most polymorphic. Second, an attempt to establish a relationship between transposable elements (TEs) and tandem repeats (TRs) was made at a genome-wide scale for the first time. Upon sequence comparison, positional cross-checking and other relevant analyses, it was observed that over 20% of all TRs are derived from TEs.
This result established the first connection between these two types of repetitive elements, and extends our appreciation of the impact of TEs on genomes. Furthermore, only 6% of these TE-derived TRs follow the already postulated initiation and expansion mechanisms, suggesting that the others are likely to follow a yet-unidentified mechanism. Third, by combining multiple computational approaches involving all types of genetic variations published so far, including transposable elements, the first whole genome sequence of the most recent common ancestor of all modern human populations, which diverged into different populations around 125,000-100,000 years ago, was constructed. The study shows that the current reference genome sequence is 8.89 million base pairs larger than our common ancestor’s genome, a difference contributed by a whole spectrum of genetic mechanisms. The use of this ancestral reference genome to facilitate the analysis of personal genomes was demonstrated using an example genome, together with more insightful recent evolutionary analyses involving the Neanderthal genome. The three data chapters presented in this thesis conclude that tandem repeats and transposable elements are not two entirely distinct classes of elements, as over 20% of TRs are actually derived from TEs. Certain subfamilies of TEs are themselves still evolving, with the generation of newer subfamilies. The evolutionary analyses of all TEs along with other genomic variants helped to construct the genome sequence of the most recent common ancestor of all modern human populations, which provides a better alternative to the human reference genome and can be a useful resource for the study of personal genomics, population genetics, and human and primate evolution.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Receipt for the New Topographical Atlas of the Dominion, paid for by S.D. Woodruff, Apr. 2, 1875.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

The relative ease of concentrating and purifying adenoviruses, their well-characterized mid-sized genome, and the ability to delete non-essential regions from their genome to accommodate foreign genes have made adenoviruses suitable candidates for the construction of vectors. The use of adenoviral vectors in gene therapy, vaccination, and as a general vector system for expressing foreign genes has been documented for some time. In this study, the objective was to rescue a BAV3 E1 or E3 recombinant vector carrying the kanamycin resistance gene, a dominant selectable marker with useful applications in studying vectored gene expression in mammalian cells. To accomplish this objective, more information about BAV3 DNA sequences was required in order to make manipulation of the virus genome accessible. Therefore, sequencing of the BAV3 genome from 11.7% to 30.8% was carried out. Analysis of the determined sequences revealed the primary structure of important viral gene products coded by E2, including BAV3 DNA pol and the precursor to terminal protein. Comparative analysis of these proteins with their counterparts from human and non-human adenoviruses revealed important insights into the evolutionary lineage of BAV3. In order to insert the kanamycin resistance gene in either E1 or E3, it was necessary to delete BAV3 sequences to accommodate the foreign gene so as not to exceed the packaging capacity of the virus. To construct a recombinant BAV3 in which a foreign gene was inserted in the deleted E1 region, an E1 shuttle vector was constructed. This involved deleting the region between 1.3% and 9% from the viral sequences and inserting the kanamycin resistance gene in its place. The E1 shuttle vector contained the left (0% to 53.9%) segment of the genome and was expected to generate BAV3 recombinants that can be grown and propagated in cells that complement the missing E1 functions.
To construct a similar shuttle vector for E3 deletion, DNA sequences extending from 78.9% to 82.5% (1281 bp) were deleted from within the E3 region that had been cloned into a plasmid vector. The deleted region corresponds to sequences that have been shown to be non-essential for viral replication in cell culture. The resulting plasmid was used to construct another recombinant plasmid with BAV3 DNA sequences extending from 37.1% to 100% and with a deletion of E3 sequences that were replaced by the kanamycin resistance gene. This shuttle plasmid was used in cotransfections with digested viral DNA in an attempt to rescue a recombinant BAV3 carrying the kanamycin resistance gene in place of the deleted E3. In spite of repeated transfection attempts, E1 or E3 recombinant BAV3 were not isolated. It seems that other approaches should be applied before a final conclusion on BAV3 infectivity can be made.
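The E3 deletion above is given both as map percentages (78.9% to 82.5%) and as a length (1281 bp), so the two figures together imply a genome length if map percentages are assumed to scale linearly with base pairs. The short Python sketch below works through that arithmetic; the function name is illustrative, and the result is only what the quoted figures imply, not a genome size reported in the abstract.

```python
def implied_genome_length(start_pct, end_pct, span_bp):
    """Genome length (bp) implied by a span given in map percent and in base pairs.

    Assumes map percentages are a linear fraction of total genome length.
    """
    fraction = (end_pct - start_pct) / 100.0
    return span_bp / fraction


# E3 deletion quoted in the abstract: 78.9% to 82.5%, 1281 bp.
length_bp = implied_genome_length(78.9, 82.5, 1281)
print(f"{length_bp / 1000:.1f} kb")  # about 35.6 kb under the linearity assumption
```

The same conversion, run in the other direction, turns any of the quoted percentage ranges into approximate base-pair coordinates.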

Relevance:

100.00% 100.00%

Publisher:

Abstract:

In the developing mouse embryo, the diploid trophectoderm is known to undergo a diploid to giant cell transformation. These cells arise by a process of endoreduplication, characterized by replication of the entire genome without subsequent mitosis or cell division, leading to polyploidy and the formation of giant nuclei. Studies of 13.5 day rat trophoblast derived from the parietal yolk sac have indicated a relatively low rate of activity of DNA polymerase α, the normal eukaryotic replicase, in comparison to that of DNA polymerase g. These results have suggested that endoreduplication in trophoblast giant cells may not employ the normal replicase enzyme, DNA polymerase α. In order to determine whether a 'switch' from DNA polymerase α to DNA polymerase g is a necessary concomitant of the diploid to giant cell transformation, two distinct populations of trophoblast giant cells were used: the primary giant cell derived from the mural trophectoderm and the secondary giant cell derived from the polar trophectoderm. These two populations of trophoblast giant cells can be obtained from the tissue outgrowths of 3.5 day blastocysts and from the extraembryonic ectoderm (EX) and ectoplacental cone (EPC) of 7.5 day embryos, respectively. Tissue outgrowths were treated with aphidicolin, a specific reversible inhibitor of eukaryotic DNA polymerase α, on various days after explantation. The effect of aphidicolin treatment was assessed both qualitatively, using autoradiography, and quantitatively, by scintillation counting and Feulgen staining. DNA synthesis was measured in control and treated cultures after a ³H-thymidine pulse. Scintillation counts of the embryo proper revealed that DNA synthesis was consistently inhibited by greater than 90% in the presence of aphidicolin. Inhibition of DNA synthesis in the EX and EPC varied between 81-95% and 82-98% respectively, indicating that most DNA synthesis was mediated by DNA polymerase α, but that a small but significant amount of residual synthesis remained.
A qualitative approach was then applied to determine whether the apparent residual DNA synthesis was restricted to a subpopulation of giant cells or whether all giant cells displayed a low level of DNA synthesis. Autoradiographs showed that DNA synthesis in the ICM of blastocysts and in the embryo proper of 7.5 day embryos, which acted as the diploid control populations, was completely inhibited regardless of duration in explant culture. In contrast, primary trophoblast giant cells derived from blastocysts and secondary giant cells derived from the EX and EPC were observed to include some heavily labelled cells after aphidicolin treatment. These results suggest that although DNA polymerase α is the primary replicating enzyme responsible for endoreduplication in mouse trophoblast giant cells, some non-α activity is also observed. A DNA polymerase assay employing tissue lysates of outgrown 7.5 day embryo, EX and EPC tissues was used to attempt to confirm the presence of higher non-α activity in tissues possessing trophoblast giant cells. Based on a series of inhibitors of DNA polymerases, it would appear that DNA polymerase α is the major polymerase active in all tissues of the 7.5 day mouse embryo. The nature of the putative residual DNA synthetic activity could not be unequivocally determined in this study. These results therefore suggest that both primary and secondary trophoblast giant cells possess and use DNA polymerase α in endoreduplicative DNA synthesis. It would appear that the high levels of DNA polymerase g activity reported in trophoblast tissue derived from the 13.5 day rat yolk sac are not a general feature of all endoreduplication.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

The regenerating urodele limb is a useful model system in which to study, in vivo, the controls of cell proliferation and differentiation. Techniques are available which enable one to experimentally manipulate mitogenic influences upon the blastema, as well as the morphogenesis of the regenerating limb. Although classical regeneration studies have generated a wealth of knowledge concerning tissue interactions, little is known about the process at the level of gene expression. The aim of this project was to clone potentially developmentally regulated genes from a newt genomic library for use in future studies of gene expression during limb regeneration. We decided to clone the cytoskeletal actin gene for the following reasons: (1) its expression reflects the proliferative and differentiative states of cells in other systems; (2) the high copy number of cytoplasmic actin pseudogenes in other vertebrates and the high degree of evolutionary sequence conservation among actin genes increased the chance of cloning one of the newt cytoplasmic actin genes; and (3) preliminary experiments indicated that a newt actin could probably be identified using an available chick β-actin gene as a molecular probe. Two independent recombinant phage clones, containing actin-homologous inserts, were isolated from a newt genomic library by hybridization with the chick actin probe. Restriction mapping identified actin-homologous sequences within the newt DNA inserts, which were subcloned into the plasmid pTZ19R. The recombinant plasmids were transformed into the Escherichia coli strain DH5α. Detailed restriction maps were produced of the 5.7 Kb and 3.1 Kb newt DNA inserts in the plasmids, designated pTNA1 and pTNA2. The short (<1.3 Kb) length of the actin-homologous sequence in pTNA2 indicated that it was possibly a reverse transcript pseudogene.
Problems associated with molecular cloning of DNA sequences from N. viridescens are discussed with respect to the large genome size and abundant highly repetitive DNA sequences.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

This study examined the factors affecting treatment decision making for young women with early stage breast cancer. Thirty women, aged 35 to 52 years, were presented with information about two equally effective chemotherapy treatments following surgery for breast cancer using an educational instrument called a "decision board." Although equally effective, the treatments differ with regard to side effects and treatment schedule. The purpose of this research was to investigate what factors affect the decision-making process. Following administration of the decision board, women were given a take-home version to review and asked to return one to two weeks later with a decision, at which time they completed a questionnaire. The theoretical framework for this study was constructed from the literature on self-directed learning and critical thinking. Overall, the factors rated most important to the treatment decision were related to quality of life, side effects, and length of treatment. Five factors were rated significantly differently, in terms of importance to their decision, by the women who chose one treatment versus the other. These were side effects in general, vomiting, hair loss, family role, and the number of trips to the cancer centre required for treatment. Implications and recommendations for patient education, research, and practice evolved from the findings of this study.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

The neuropeptide FMRFamide, with the sequence Phe-Met-Arg-Phe-amide, was originally isolated from the clam Macrocallista nimbosa (Price and Greenberg, 1977). Since its discovery, a large family of FMRFamide-related peptides, termed FaRPs, has been found to be present in all major animal phyla, with functions ranging from modulation of neuronal activity to alteration of muscular contractions. However, little is known about the genes encoding these peptides, especially in invertebrates. As FaRP-encoding genes have yet to be investigated in the invertebrate Malacostracean subphylum, the isolation and characterization of FaRP-encoding DNA and mRNA was pursued in this project. The immediate aims of this thesis were: (1) to amplify mRNA sequences of Procambarus clarkii using a degenerate oligonucleotide primer deduced from the common amino acid sequence of isolated Procambarus FaRPs, (2) to determine if these amplification products encode FaRP gene sequences, and (3) to create a selective cDNA library of sequences recognized by the degenerate oligonucleotide primer. The polymerase chain reaction - rapid amplification of cDNA ends (PCR-RACE) is a procedure in which a single gene-specific primer is used in conjunction with a generalized 3' or 5' primer to amplify copies of the region between a single point in the transcript and the 3' or 5' end of the cDNA of interest (Frohman et al., 1988). PCR-RACE reactions were optimized with respect to primers used, buffer composition, cycle number, nature of the genetic substrate to be amplified, annealing, extension and denaturation temperatures and times, and use of reamplification procedures. Amplification products were cloned into plasmid vectors and recombinant products were isolated, as were the recombinant plaques formed in the selective cDNA library. Labeled amplification products were hybridized to recombinant bacteriophage to determine whether the ligated amplification product was present.
When sequenced, the five isolated PCR-RACE amplification products were determined not to possess FaRP-encoding sequences. The 200 bp, 450 bp, and 1500 bp sequences showed homology to the Caenorhabditis elegans cosmid K09A11, which encodes cytochrome P450, transfer RNA, transposase, and tRNA-Tyr, while the 500 bp and 750 bp sequences showed homology with the complete genome of the Vaccinia virus. Under the amplification conditions employed, the degenerate oligonucleotide primer was observed to bind to and amplify sequences with identity at only 9 or 10 of its 17 bp. The selective cDNA library was observed to be of extremely low titre. When library titre was increased, white plaques were isolated. Amplification analysis of eight isolated λgt11 sequences from these plaques indicated the absence of an insertion sequence. The degenerate 17-base oligonucleotide primer synthesized from the common amino acid sequence of isolated Procambarus FaRPs was thus determined to be non-specific in its binding under the conditions required for its use, and to be insufficient for the isolation and identification of FaRP-encoding sequences. A more specific primer of longer sequence, lower degeneracy, and higher melting temperature (Tm) is recommended for further investigation into the FaRP-encoding genes of Procambarus clarkii.
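The recommendation of a primer with "lower degeneracy" can be made concrete by counting how many distinct nucleotide sequences encode a given peptide under the standard genetic code. The sketch below does this for the FMRF tetrapeptide named in the abstract. It is illustrative only: the thesis's actual 17-base primer sequence is not given in the abstract, and the function name is an assumption.

```python
from math import prod

# Number of codons per amino acid in the standard genetic code (61 sense codons).
CODONS_PER_AA = {
    "A": 4, "R": 6, "N": 2, "D": 2, "C": 2, "Q": 2, "E": 2, "G": 4,
    "H": 2, "I": 3, "L": 6, "K": 2, "M": 1, "F": 2, "P": 4, "S": 6,
    "T": 4, "W": 1, "Y": 2, "V": 4,
}


def degeneracy(peptide):
    """Count the nucleotide sequences that encode `peptide` (one-letter codes).

    Counts whole codons only, so a primer that ends mid-codon would have a
    somewhat lower degeneracy than this figure.
    """
    return prod(CODONS_PER_AA[aa] for aa in peptide)


print(degeneracy("FMRF"))  # Phe (2) x Met (1) x Arg (6) x Phe (2)
```

Arginine's six codons dominate the count here, which is why primer design typically favours stretches rich in Met and Trp and poor in Arg, Leu, and Ser.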

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Adenoviruses are non-enveloped icosahedral particles which possess a double-stranded DNA genome. Currently, nearly 100 serotypes of adenoviruses have been identified, 48 of which are of human origin. Bovine adenoviruses (BAVs), causing mild respiratory and/or enteric diseases in cattle, have been reported in many countries all over the world. Currently, nine serotypes of BAVs have been isolated, which have been placed into two subgroups based on a number of characteristics, including complement fixation tests as well as the ability to replicate in various cell lines. Bovine adenovirus type 2 (BAV2), belonging to subgroup I, is able to cause pneumonia as well as pneumonic-like symptoms in calves. In this study, the genome of BAV2 (strain No. 19) was subcloned into the plasmid vector pUC19. In total, 16 plasmids were constructed; three carry internal SalI fragments (spanning 3.1 to 65.2%), and 10 carry internal PstI fragments (spanning 4.9 to 97.4%), of the viral genome. Each of these plasmids was analyzed using twelve restriction endonucleases: BamHI, ClaI, EcoRI, HindIII, KpnI, NotI, NS(N, PstI, PvuI, SalI, XbaI, and XhoI. Terminal end fragments were also cloned and analyzed, subsequent to the removal of the 5' terminal protein, in the form of 2 BamHI B fragments, cloned in opposite orientations (spanning 0 to 18.1%), and one PstI fragment (spanning 97.4 to 100%). These cloned fragments, along with two other previously constructed plasmids carrying internal EcoRI fragments (spanning 20.6 to 90.5%), were then used to construct a detailed physical restriction map using the twelve restriction endonucleases, as well as to estimate the size of the BAV2 genome (32.5 Kbp). The DNA sequences of the early region 1 (E1) and hexon-associated gene (protein IX) have also been determined. The amino acid sequences of four open reading frames (ORFs) have been compared to those of the E1 proteins and protein IX from other Ads.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

The cloned dihydrofolate reductase gene of Saccharomyces cerevisiae (DFR 1) is expressed in Escherichia coli. Bacterial strain JF1754 transformed with plasmids containing DFR 1 is at least 5 times more resistant to inhibition by the folate antagonist trimethoprim. Expression of yeast DFR 1 in E. coli suggests that the gene probably lacks intervening sequences. The 1.8 kbp DNA fragment encoding yeast dhfr activity probably has its own promoter, as the gene is expressed in both orientations in E. coli. Expression of the yeast dhfr gene cloned into M13 viral vectors allowed positive selection of DFR 1 - M13 bacterial transfectants in medium supplemented with trimethoprim. A series of nested deletions generated by nuclease Bal 31 digestion and by restriction endonuclease cleavage of plasmids containing DFR 1 physically mapped the gene to a 930 bp region between the PstI and SalI cut sites. This is consistent with the 21,000 molecular weight attributed to yeast dhfr in previous reports. From preliminary DNA sequence analysis of the dhfr DNA fragment, the 3' terminus of DFR 1 was assigned to a position 27 nucleotides from the EcoRI cut site on the BamHI - EcoRI DNA segment. Several putative yeast transcription termination consensus sequences were identified 3' to the opal stop codon. DFR 1 is expressed in yeast and confers resistance to the antifolate methotrexate when the gene is present in 2 - 10 copies per cell. Plasmid-dependent resistance to methotrexate is also observed in a rad 6 background, although the effect is somewhat less than that conferred on wild-type or rad 18 cells. Strains with DFR 1 integrated into the yeast genome showed an intermediate sensitivity to folate antagonists. This may suggest a gene dosage effect. No change in petite induction was observed in transformed cells of these yeast strains containing yeast dhfr plasmids.
The sensitivity of rad 6, rad 18 and wild-type cell populations to trimethoprim was unaffected by the presence of DFR 1 in transformants. Moreover, trimethoprim did not induce petites in any strain tested, which is the normal result when dhfr is inhibited by other antifolates such as methotrexate. This may suggest that the dhfr enzyme is not the only possible target of trimethoprim in yeast. rad 6 mutants showed a very low level of spontaneous petite formation. Methotrexate failed to induce respiratory-deficient mutants in this strain, which suggested that rad 6 might be an obligate grande. However, ethidium bromide induced petites to a level approximately 50% of that exhibited by wild-type and rad 18 strains.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Recombinant Adenoviruses (Ads) have been shown to have potential applications in three areas: gene therapy, high-level protein expression and recombinant vaccines. At least three different locations within the Ad genome can be deleted and subsequently used for the insertion of foreign sequences. These include the Early 3 (E3), Early 1 (E1) and Early 4 (E4) regions. Viral vectors of this type have been well studied in Human Ads 2 and 5; however, one has not yet been constructed for Bovine Adenovirus Type 2 (BAV2). The E3 region is located between 76.6 and 86 m.u. on the r-strand and is transcribed in a rightward direction. The gene products of the E3 region have been shown to be non-essential for viral replication in vitro, but are required for host immunosurveillance. This study describes the cloning and reconstitution of a BAV2 E3 deletion mutant. A deletion of 1800 bp was made within the E3 region of BAV2 and the thymidine kinase gene was subsequently inserted in the deleted area. The plasmid pdlE3-4tk1 (23.4 Kbp) was constructed and used to facilitate homologous recombination with wild-type BAV2 to produce a mutant. Southern blotting and hybridization results suggest the presence of a BAV2 E3 deletion mutant carrying thymidine kinase sequences. The E4 region of Human Adenovirus types 2 and 5 is located at the extreme right end of the genome (91.3 map units - 99.1 map units) and is transcribed in a leftward direction, giving rise to a complicated set of differentially spliced mRNAs. Essentially there are 7 open reading frames (ORFs) encoding at least 7 polypeptides. The gene products encoded by the E4 region have been shown to be essential for the expression of late viral genes, host cell shutoff and normal viral growth. We have cloned and sequenced the right end segment between 90.5 map units and 100 map units of the BAV2 genome.
The results show several open reading frames which encode polypeptides exhibiting homology to three polypeptides encoded by the E4 region of human adenovirus type 2. These include the 14 kDa protein encoded by ORF1, the 34 kDa protein encoded by ORF6 and the 13 kDa protein encoded by ORF3. The nucleotide sequence, restriction enzyme map, and ORF map of the E4 region could be very useful in future molecular manipulation of this region and could possibly explain the slow growth rate of BAV2 in MDBK cells.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

The ease of its production and manipulation has made plasmid DNA a prime candidate for use in gene transfer technologies such as gene therapy and DNA vaccines. The major drawback of plasmid DNA, however, is its poor stability within mammalian cells. Plasmid DNA is usually lost through cellular mechanisms or by simple dilution as a result of mitosis. This study set out to search for mammalian genomic DNA sequences that would enhance the stability of plasmid DNA in mammalian cells. By creating a plasmid-based genomic DNA library, we were able to screen the human genome by transfecting the library into Human Embryonic Kidney (HEK 293) cells. Cells that contained plasmid DNA were selected using G418 for 14 days. The resulting population was then screened for the presence of biologically active plasmid DNA using the process of transformation as a detector. A commercially available plasmid DNA isolation kit was modified to extract plasmid DNA from mammalian cells. The standardized protocol had a detection limit of ~0.6 plasmids per cell in one million cells. This allowed for the detection of 45 plasmids that were maintained for 32 days in the HEK 293 cells. Sequencing of selected inserts revealed a significantly higher thymine content in comparison to the human genome. Sequences with high A/T content have been associated with Scaffold/Matrix Attachment Region (S/MAR) sequences in mammalian cells. Therefore, association with the nuclear matrix might be required for the stability of plasmids in mammalian cells.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Retrotransposons, which used to be considered “junk DNA”, have begun to reveal their immense value to genome evolution and human biology in recent studies. They make up at least ~45% of the human genome, and the proportion is more or less the same in other mammalian genomes. Retrotransposon elements (REs) are known to affect the human genome through many different mechanisms, such as generating insertion mutations, genomic instability, and alterations in gene expression. Previous studies have suggested that several RE subfamilies, such as Alu, L1, SVA and LTR, are currently active in the human genome, and that they are an important source of genetic diversity between humans and other primates, as well as among humans. Although several groups have used Retrotransposon Insertion Polymorphisms (RIPs) as markers in studying primate evolutionary history, no study has specifically focused on identifying Human-Specific Retrotransposon Elements (HS-REs) and their roles in human genome evolution. In this study, by computationally comparing the human genome to 4 primate genomes, we identified a total of 18,860 HS-REs, among which are 11,664 Alus, 4,887 L1s, 1,526 SVAs and 783 LTRs (222 full-length entries), representing the largest and most comprehensive list of HS-REs generated to date. Together, these HS-REs contributed a total sequence increase of 14.2 Mb from the inserted REs and Target Site Duplications (TSDs), a 71.6 Kb increase from transductions, and 268.2 Kb of sequence deletion from insertion-mediated deletion, leading to a net increase of ~14 Mb of sequence in the human genome. Furthermore, we observed for the first time that the Y chromosome might be a hot target for new retrotransposon insertions in general and particularly for LTRs. The data also allowed, for the first time, a survey of the frequency of TE insertions inside other TEs in comparison with TE insertions into non-TE regions.
In summary, our data suggest that retrotransposon elements have played a significant role in the evolution of Homo sapiens.
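The counts and sequence totals quoted in this abstract can be cross-checked with a few lines of arithmetic. The Python sketch below uses only the figures given above and assumes decimal units (1 Mb = 1000 Kb), which the abstract does not state explicitly.

```python
# Per-family HS-RE counts as quoted in the abstract.
counts = {"Alu": 11_664, "L1": 4_887, "SVA": 1_526, "LTR": 783}
total = sum(counts.values())

KB_PER_MB = 1000.0  # assumption: decimal units

gain_insertions_mb = 14.2                      # inserted REs plus TSDs
gain_transductions_mb = 71.6 / KB_PER_MB       # 71.6 Kb
loss_deletions_mb = 268.2 / KB_PER_MB          # insertion-mediated deletions

net_mb = gain_insertions_mb + gain_transductions_mb - loss_deletions_mb

print(total)              # should match the 18,860 HS-REs reported
print(f"{net_mb:.2f} Mb")  # consistent with the reported net gain of ~14 Mb
```

Both checks agree with the abstract: the family counts sum to the stated total, and the gains minus the deletions leave roughly 14 Mb.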

Relevance:

100.00% 100.00%

Publisher:

Abstract:

DNA assembly is among the most fundamental and difficult problems in bioinformatics. Near-optimal assembly solutions are available for bacterial and other small genomes; however, assembling large and complex genomes, especially the human genome, using Next-Generation-Sequencing (NGS) technologies has proven very difficult because of the highly repetitive and complex nature of the human genome, short read lengths, uneven data coverage and tools that are not specifically built for human genomes. Moreover, many algorithms do not scale to human genome datasets containing hundreds of millions of short reads. The DNA assembly problem is usually divided into several subproblems, including DNA data error detection and correction, contig creation, scaffolding and contig orientation; each can be seen as a distinct research area. This thesis focuses specifically on creating contigs from short reads and combining them with outputs from other tools in order to obtain better results. Three different assemblers, SOAPdenovo [Li09], Velvet [ZB08] and Meraculous [CHS+11], are selected for comparative purposes in this thesis. The results show that this thesis’ work produces results comparable to those of other assemblers, and that combining our contigs with outputs from other tools produces the best results, outperforming all other investigated assemblers.
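To give a sense of what "creating contigs from short reads" involves, here is a minimal de Bruijn graph sketch in Python. It is a generic textbook technique, not the method developed in the thesis, which the abstract does not describe. The reads, the value of k, and all function names are hypothetical, and the sketch ignores sequencing errors, reverse complements, and coverage.

```python
from collections import defaultdict


def build_graph(reads, k):
    """Link each (k-1)-mer to the (k-1)-mers that follow it in some read."""
    succ, pred = defaultdict(set), defaultdict(set)
    for read in reads:
        for i in range(len(read) - k + 1):
            kmer = read[i:i + k]
            succ[kmer[:-1]].add(kmer[1:])
            pred[kmer[1:]].add(kmer[:-1])
    return succ, pred


def contigs(reads, k):
    """Spell out maximal non-branching paths of the de Bruijn graph.

    Isolated cycles (every node has one predecessor and one successor)
    are skipped in this sketch.
    """
    succ, pred = build_graph(reads, k)
    nodes = set(succ) | set(pred)

    def is_inner(node):
        # Exactly one way in and one way out: the path continues unambiguously.
        return len(pred[node]) == 1 and len(succ[node]) == 1

    result = []
    for start in sorted(nodes):
        if is_inner(start):
            continue  # paths begin only at branch points or dead ends
        for nxt in sorted(succ[start]):
            path = start + nxt[-1]
            while is_inner(nxt):
                nxt = next(iter(succ[nxt]))
                path += nxt[-1]
            result.append(path)
    return sorted(result)


# Hypothetical overlapping reads drawn from one short sequence.
reads = ["ATGGCGT", "GGCGTGC", "GTGCAAT"]
print(contigs(reads, 4))
```

With repeats longer than k-1 bases the graph branches and the paths break into several shorter contigs, which is one reason repetitive genomes such as the human genome are hard to assemble from short reads.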