955 resultados para genetic sequencing
Resumo:
The sequencing analysis of the mitochondrial DNA control region (mtCR DNA) was performed to assess the genetic divergence and population structure of the Chinese sucker Myxocyprinus asiaticus (Cypriniformes Catostomidae) using four sample lots from natural populations of the Yangtze River. The mtCR DNA sequences of approximately 920 base pairs were obtained. A total of 223 nucleotide positions were polymorphic, and these defined 39 haplotypes. Of the 39 haplotypes, 37 (90%) were not shared, and among the populations as a whole there was little sharing of haplotypes. The average haplotype diversity (0.958) and the average nucleotide diversity (0.052) indicated a higher level of genetic diversity of Chinese sucker through the river. Analysis of molecular variation (AMOVA) of data revealed significant partitioning of variance (P<0.001) among populations (60.29%), and within populations (39.71%). The topology according to the neighbor joining and maximum parsimony methods showed mosaic composition of the 39 haplotypes, suggesting that the populations wore not completely divergent. The pairwise F statistic values, however, indicated that the population structuring existed to some extent among the geographic populations. There was a positive relationship between the aquatic distance and the genetic distance (Fst) among the populations (P<0.05). Based on our data, it is suggested that genetic drift, gene flow, and stochastic events are the possible factors influencing the population structure and genetic variation.
Resumo:
Although common carp is the major fish species in Asian and European aquaculture and many domestic varieties have occurred, there is a controversy about the origination of European domestic common carp. Some scientists affirmed that the ancestor of European domestic common carp was Danube River wild common carp, but others considered it might be Asian common carp. For elucidating origination of European domestic common carp, we chose two representative European domestic common carp strains (German mirror carp and Russian scattered scaled mirror carp) and one wild common carp strain of Cyprinus carpio carpio subspecies (Volga River wild common carp) and two Asian common carp strains, the Yangtze River wild common carp (Cyprinus carpio haematopterus) and traditionally domestic Xingguo red common carp, as experimental materials. ND5-ND6 and D-loop segments of mitochondrial DNA were amplified by polymerase chain reaction and analyzed through restriction fragment length polymorphism (RFLP) and sequencing respectively. The results revealed that HaeIII and DdeI digestion patterns of ND5-ND6 segment and sequences of control region were different between European subspecies C. carpio carpio and Asian subspecies C. carpio haematopterus. Phylogenetic analysis showed that German mirror carp and Russian scattered scaled mirror carp belonged to two subspecies, C. carpio carpio and C. carpio haematopterus, respectively. Therefore, there were different ancestors for domestic carp in Europe: German mirror carp was domesticated from European subspecies C. carpio carpio and Russian scattered scaled mirror carp originated from Asian subspecies C. carpio haematopterus.
Resumo:
This article documents the addition of 512 microsatellite marker loci and nine pairs of Single Nucleotide Polymorphism (SNP) sequencing primers to the Molecular Ecology Resources Database. Loci were developed for the following species: Alcippe morrisonia morrisonia, Bashania fangiana, Bashania fargesii, Chaetodon vagabundus, Colletes floralis, Coluber constrictor flaviventris, Coptotermes gestroi, Crotophaga major, Cyprinella lutrensis, Danaus plexippus, Fagus grandifolia, Falco tinnunculus, Fletcherimyia fletcheri, Hydrilla verticillata, Laterallus jamaicensis coturniculus, Leavenworthia alabamica, Marmosops incanus, Miichthys miiuy, Nasua nasua, Noturus exilis, Odontesthes bonariensis, Quadrula fragosa, Pinctada maxima, Pseudaletia separata, Pseudoperonospora cubensis, Podocarpus elatus, Portunus trituberculatus, Rhagoletis cerasi, Rhinella schneideri, Sarracenia alata, Skeletonema marinoi, Sminthurus viridis, Syngnathus abaster, Uroteuthis (Photololigo) chinensis, Verticillium dahliae, Wasmannia auropunctata, and Zygochlamys patagonica. These loci were cross-tested on the following species: Chaetodon baronessa, Falco columbarius, Falco eleonorae, Falco naumanni, Falco peregrinus, Falco subbuteo, Didelphis aurita, Gracilinanus microtarsus, Marmosops paulensis, Monodelphis Americana, Odontesthes hatcheri, Podocarpus grayi, Podocarpus lawrencei, Podocarpus smithii, Portunus pelagicus, Syngnathus acus, Syngnathus typhle,Uroteuthis (Photololigo) edulis, Uroteuthis (Photololigo) duvauceli and Verticillium albo-atrum. This article also documents the addition of nine sequencing primer pairs and sixteen allele specific primers or probes for Oncorhynchus mykiss and Oncorhynchus tshawytscha; these primers and assays were cross-tested in both species.
Resumo:
Genetic differentiation of the shrimp Penaeus chinensis in the Yellow Sea and Bohai Sea was investigated using the mitochondrial control region (CR). RFLP of a partial CR segment (613 bp) shows that 106 out of 122 (86.9%) individuals from six sampling localities along the coast of northern China and the west coast of the Korean Peninsula share the same haplotype, and the haplotype frequencies among localities are not significantly different. The findings are further confirmed by sequencing the complete CR. Divergence of the complete CR (992 bp) is less than 1.6% in 14 individuals from the six localities. F-statistics based on RFLP data and the TCS network of sequencing data suggest little genetic differentiation of P. chinensis in the Yellow Sea and Bohai Sea. Mismatch analysis suggests a rapid expansion of P. chinensis population to the Yellow Sea and the Bohai Sea, which probably occurred with the rapid rise in sea level after the last glacial maximum. Despite the lack of genetic heterogeneity, we propose that P. chinensis populations in this region should be treated as separate management units, as fishery management programs have to be applied on a local basis by different governments.
Resumo:
Colorectal cancer is the most common cause of death due to malignancy in nonsmokers in the western world. In 1995 there were 1,757 cases of colon cancer in Ireland. Most colon cancer is sporadic, however ten percent of cases occur where there is a previous family history of the disease. In an attempt to understand the tumorigenic pathway in Irish colon cancer patients, a number of genes associated with colorectal cancer development were analysed in Irish sporadic and HNPCC colon cancer patients. The hereditary forms of colon cancer include Familial adenomatous polyposis coli (FAP) and Hereditary Non-Polyposis Colon Cancer (HNPCC). Genetic analysis of the gene responsible for FAP, (the APC gene) has been previously performed on Irish families, however the genetic analysis of HNPCC families is limited. In an attempt to determine the mutation spectrum in Irish HNPCC pedigrees, the hMSH2 and hMLHl mismatch repair genes were screened in 18 Irish HNPCC families. Using SSCP analysis followed by DNA sequencing, five mutations were identified, four novel and a previously reported mutation. In families where a mutation was detected, younger asyptomatic members were screened for the presence of the predisposing mutation (where possible). Detection of mutations is particularly important for the identification of at risk individuals as the early diagnosis of cancer can vastly improve the prognosis. The sensitive and efficient detection of multiple different mutations and polymorphisms in DNA is of prime importance for genetic diagnosis and the identification of disease genes. A novel mutation detection technique has recently been developed in our laboratory. In order to assess the efficacy and application of the methodology in the analysis of cancer associated genes, a protocol for the analysis of the K-ras gene was developed and optimised. Matched normal and tumour DNA from twenty sporadic colon cancer patients was analysed for K-ras mutations using the Glycosylase Mediated Polymorphism Detection technique. Five mutations of the K-ras gene were detected using this technology. Sequencing analysis verified the presence of the mutations and SSCP analysis of the same samples did not identify any additional mutations. The GMPD technology proved to be highly sensitive, accurate and efficient in the identification of K-ras gene mutations. In order to investigate the role of the replication error phenomenon in Irish colon cancer, 3 polyA tract repeat loci were analysed. The repeat loci included a 10 bp intragenic repeat of the TGF-β-RII gene. TGF-β-RII is involved in the TGF-β epithelial cell growth pathway and mutation of the gene is thought to play a role in cell proliferation and tumorigenesis. Due to the presence of a repeat sequence within the gene, TGFB-RII defects are associated with tumours that display the replication error phenomenon. Analysis of the TGF-β-RII 10 bp repeat failed to identify mutations in any colon cancer patients. Analysis of the Bat26 and Bat 40 polyA repeat sequences in the sporadic and HNPCC families revealed that instability is associated with HNPCC tumours harbouring mismatch repair defects and with 20 % of sporadic colon cancer tumours. No correlation between K-ras gene mutations and the RER+ phenotype was detected in sporadic colon cancer tumours.
Resumo:
Bacteriophages, viruses infecting bacteria, are uniformly present in any location where there are high numbers of bacteria, both in the external environment and the human body. Knowledge of their diversity is limited by the difficulty to culture the host species and by the lack of the universal marker gene present in all viruses. Metagenomics is a powerful tool that can be used to analyse viral communities in their natural environments. The aim of this study was to investigate diverse populations of uncultured viruses from clinical (a sputum of patient with cystic fibrosis, CF) and environmental samples (a sludge from a dairy food wastewater treatment plant) containing rich bacterial populations using genetic and metagenomic analyses. Metagenomic sequencing of viruses obtained from these samples revealed that the majority of the metagenomic reads (97-99%) were novel when compared to the NCBI protein database using BLAST. A large proportion of assembled contigs were assignable as novel phages or uncharacterised prophages, the next largest assignable group being single-stranded eukaryotic virus genomes. Sputum from a cystic fibrosis patient contained DNA typical of phages of bacteria that are traditionally involved in CF lung infections and other bacteria that are part of the normal oral flora. The only eukaryotic virus detected in the CF sputum was Torque Teno virus (TTV). A substantial number of assigned sequences from dairy wastewater could be affiliated with phages of bacteria that are typically found in the soil and aquatic environments, including wastewater. Eukaryotic viral sequences were dominated by plant pathogens from the Geminiviridae and Nanoviridae families, and animal pathogens from the Circoviridae family. Antibiotic resistance genes were detected in both metagenomes suggesting phages could be a source for transmissible antimicrobial resistance. Overall, diversity of viruses in the CF sputum was low, with 89 distinct viral genotypes predicted, and higher (409 genotypes) in the wastewater. Function-based screening of a metagenomic library constructed from DNA extracted from dairy food wastewater viruses revealed candidate promoter sequences that have ability to drive expression of GFP in a promoter-trap vector in Escherichia coli. The majority of the cloned DNA sequences selected by the assay were related to ssDNA circular eukaryotic viruses and phages which formed a minority of the metagenome assembly, and many lacked any significant homology to known database sequences. Natural diversity of bacteriophages in wastewater samples was also examined by PCR amplification of the major capsid protein sequences, conserved within T4-type bacteriophages from Myoviridae family. Phylogenetic analysis of capsid sequences revealed that dairy wastewater contained mainly diverse and uncharacterized phages, while some showed a high level of similarity with phages from geographically distant environments.
Resumo:
Phages belonging to the 936 group represent one of the most prevalent and frequently isolated phages in dairy fermentation processes using Lactococcus lactis as the primary starter culture. In recent years extensive research has been carried out to characterise this phage group at a genomic level in an effort to understand how the 936 group phages dominate this particular niche and cause regular problems during large scale milk fermentations. This thesis describes a large scale screening of industrial whey samples, leading to the isolation of forty three genetically different lactococcal phages. Using multiplex PCR, all phages were identified as members of the 936 group. The complete genome of thirty eight of these phages was determined using next generation sequencing technologies which identified several regions of divergence. These included the structural region surrounding the major tail protein, the replication region as well as the genes involved in phage DNA packing. For a number of phages the latter genomic region was found to harbour genes encoding putative orphan methyltransferases. Using small molecule real time (SMRT) sequencing and heterologous gene expression, the target motifs for several of these MTases were determined and subsequently shown to actively protect phage DNA from restriction endonuclease activity. Comparative analysis of the thirty eight phages with fifty two previously sequenced members of this group showed that the core genome consists of 28 genes, while the non-core genome was found to fluctuate irrespective of geographical location or time of isolation. This study highlights the continued need to perform large scale characterisation of the bacteriophage populations infecting industrial fermentation facilities in effort to further our understanding dairy phages and ways to control their proliferation.
Resumo:
BACKGROUND: The rate of emergence of human pathogens is steadily increasing; most of these novel agents originate in wildlife. Bats, remarkably, are the natural reservoirs of many of the most pathogenic viruses in humans. There are two bat genome projects currently underway, a circumstance that promises to speed the discovery host factors important in the coevolution of bats with their viruses. These genomes, however, are not yet assembled and one of them will provide only low coverage, making the inference of most genes of immunological interest error-prone. Many more wildlife genome projects are underway and intend to provide only shallow coverage. RESULTS: We have developed a statistical method for the assembly of gene families from partial genomes. The method takes full advantage of the quality scores generated by base-calling software, incorporating them into a complete probabilistic error model, to overcome the limitation inherent in the inference of gene family members from partial sequence information. We validated the method by inferring the human IFNA genes from the genome trace archives, and used it to infer 61 type-I interferon genes, and single type-II interferon genes in the bats Pteropus vampyrus and Myotis lucifugus. We confirmed our inferences by direct cloning and sequencing of IFNA, IFNB, IFND, and IFNK in P. vampyrus, and by demonstrating transcription of some of the inferred genes by known interferon-inducing stimuli. CONCLUSION: The statistical trace assembler described here provides a reliable method for extracting information from the many available and forthcoming partial or shallow genome sequencing projects, thereby facilitating the study of a wider variety of organisms with ecological and biomedical significance to humans than would otherwise be possible.
Resumo:
BACKGROUND: There is considerable interest in the development of methods to efficiently identify all coding variants present in large sample sets of humans. There are three approaches possible: whole-genome sequencing, whole-exome sequencing using exon capture methods, and RNA-Seq. While whole-genome sequencing is the most complete, it remains sufficiently expensive that cost effective alternatives are important. RESULTS: Here we provide a systematic exploration of how well RNA-Seq can identify human coding variants by comparing variants identified through high coverage whole-genome sequencing to those identified by high coverage RNA-Seq in the same individual. This comparison allowed us to directly evaluate the sensitivity and specificity of RNA-Seq in identifying coding variants, and to evaluate how key parameters such as the degree of coverage and the expression levels of genes interact to influence performance. We find that although only 40% of exonic variants identified by whole genome sequencing were captured using RNA-Seq; this number rose to 81% when concentrating on genes known to be well-expressed in the source tissue. We also find that a high false positive rate can be problematic when working with RNA-Seq data, especially at higher levels of coverage. CONCLUSIONS: We conclude that as long as a tissue relevant to the trait under study is available and suitable quality control screens are implemented, RNA-Seq is a fast and inexpensive alternative approach for finding coding variants in genes with sufficiently high expression levels.
Resumo:
We used ultra-deep sequencing to obtain tens of thousands of HIV-1 sequences from regions targeted by CD8+ T lymphocytes from longitudinal samples from three acutely infected subjects, and modeled viral evolution during the critical first weeks of infection. Previous studies suggested that a single virus established productive infection, but these conclusions were tempered because of limited sampling; now, we have greatly increased our confidence in this observation through modeling the observed earliest sample diversity based on vastly more extensive sampling. Conventional sequencing of HIV-1 from acute/early infection has shown different patterns of escape at different epitopes; we investigated the earliest escapes in exquisite detail. Over 3-6 weeks, ultradeep sequencing revealed that the virus explored an extraordinary array of potential escape routes in the process of evading the earliest CD8 T-lymphocyte responses--using 454 sequencing, we identified over 50 variant forms of each targeted epitope during early immune escape, while only 2-7 variants were detected in the same samples via conventional sequencing. In contrast to the diversity seen within epitopes, non-epitope regions, including the Envelope V3 region, which was sequenced as a control in each subject, displayed very low levels of variation. In early infection, in the regions sequenced, the consensus forms did not have a fitness advantage large enough to trigger reversion to consensus amino acids in the absence of immune pressure. In one subject, a genetic bottleneck was observed, with extensive diversity at the second time point narrowing to two dominant escape forms by the third time point, all within two months of infection. Traces of immune escape were observed in the earliest samples, suggesting that immune pressure is present and effective earlier than previously reported; quantifying the loss rate of the founder virus suggests a direct role for CD8 T-lymphocyte responses in viral containment after peak viremia. Dramatic shifts in the frequencies of epitope variants during the first weeks of infection revealed a complex interplay between viral fitness and immune escape.
Resumo:
BACKGROUND: Parrots belong to a group of behaviorally advanced vertebrates and have an advanced ability of vocal learning relative to other vocal-learning birds. They can imitate human speech, synchronize their body movements to a rhythmic beat, and understand complex concepts of referential meaning to sounds. However, little is known about the genetics of these traits. Elucidating the genetic bases would require whole genome sequencing and a robust assembly of a parrot genome. FINDINGS: We present a genomic resource for the budgerigar, an Australian Parakeet (Melopsittacus undulatus) -- the most widely studied parrot species in neuroscience and behavior. We present genomic sequence data that includes over 300× raw read coverage from multiple sequencing technologies and chromosome optical maps from a single male animal. The reads and optical maps were used to create three hybrid assemblies representing some of the largest genomic scaffolds to date for a bird; two of which were annotated based on similarities to reference sets of non-redundant human, zebra finch and chicken proteins, and budgerigar transcriptome sequence assemblies. The sequence reads for this project were in part generated and used for both the Assemblathon 2 competition and the first de novo assembly of a giga-scale vertebrate genome utilizing PacBio single-molecule sequencing. CONCLUSIONS: Across several quality metrics, these budgerigar assemblies are comparable to or better than the chicken and zebra finch genome assemblies built from traditional Sanger sequencing reads, and are sufficient to analyze regions that are difficult to sequence and assemble, including those not yet assembled in prior bird genomes, and promoter regions of genes differentially regulated in vocal learning brain regions. This work provides valuable data and material for genome technology development and for investigating the genomics of complex behavioral traits.
Resumo:
Limited data are available regarding the molecular epidemiology of Mycobacterium tuberculosis (Mtb) strains circulating in Guatemala. Beijing-lineage Mtb strains have gained prevalence worldwide and are associated with increased virulence and drug resistance, but there have been only a few cases reported in Central America. Here we report the first whole genome sequencing of Central American Beijing-lineage strains of Mtb. We find that multiple Beijing-lineage strains, derived from independent founding events, are currently circulating in Guatemala, but overall still represent a relatively small proportion of disease burden. Finally, we identify a specific Beijing-lineage outbreak centered on a poor neighborhood in Guatemala City.
Resumo:
Plastid microsatellite loci developed for Cephalanthera longifolia were used to examine the level of genetic variation within and between populations of the three widespread Cephalanthera species (C. damasonium, C. longifolia and C. rubra). The most detailed sampling was in C. longifolia (42 localities from Ireland to China; 147 individuals). Eight haplotypes were detected. One was detected in the vast majority of individuals and occurred from Ireland to Iran. Three others were only found in Europe (Ireland to Italy, England to Italy and Austria to Croatia). Two were only found in the Middle East and two only in Asia. In C. damasonium, 21 individuals from 10 populations (England to Turkey) were sampled. Only one haplotype was detected. In C. rubra, 34 individuals from eight populations (England to Turkey) were sampled. Although it was not possible to amplify all loci for all samples of this species, nine haplotypes were detected. Short alleles for the trnS-trnG region found in two populations of C. rubra were characterized by sequencing and were caused by deletions of 26 and 30 base pairs. At this level of sampling, it appears that C. rubra shows the greatest genetic variability. Cephalanthera longifolia, C. rubra and C. damasonium have previously been characterized as outbreeding, outbreeding with facultative vegetative reproduction and inbreeding, respectively. Patterns of genetic variation here are discussed in the light of these reproductive system differences. The primers used in these three species of Cephalanthera were also demonstrated to amplify these loci in another five species (C. austiniae, C. calcarata, C. epipactoides, C. falcata and C. yunnanensis). Although it is sometimes treated as a synonym of C. damasonium, the single sample of C. yunnanensis from China had a markedly different haplotype from that found in C. damasonium. All three loci were successfully amplified in two achlorophyllous, myco-heterotrophic species, C. austinae and C. calcarata. © 2010 The Linnean Society of London.
Resumo:
This article documents the addition of 512 microsatellite marker loci and nine pairs of Single Nucleotide Polymorphism (SNP) sequencing primers to the Molecular Ecology Resources Database. Loci were developed for the following species: Alcippe morrisonia morrisonia, Bashania fangiana, Bashania fargesii, Chaetodon vagabundus, Colletes floralis, Coluber constrictor flaviventris, Coptotermes gestroi, Crotophaga major, Cyprinella lutrensis, Danaus plexippus, Fagus grandifolia, Falco tinnunculus, Fletcherimyia fletcheri, Hydrilla verticillata, Laterallus jamaicensis coturniculus, Leavenworthia alabamica, Marmosops incanus, Miichthys miiuy, Nasua nasua, Noturus exilis, Odontesthes bonariensis, Quadrula fragosa, Pinctada maxima, Pseudaletia separata, Pseudoperonospora cubensis, Podocarpus elatus, Portunus trituberculatus, Rhagoletis cerasi, Rhinella schneideri, Sarracenia alata, Skeletonema marinoi, Sminthurus viridis, Syngnathus abaster, Uroteuthis (Photololigo) chinensis, Verticillium dahliae, Wasmannia auropunctata, and Zygochlamys patagonica. These loci were cross-tested on the following species: Chaetodon baronessa, Falco columbarius, Falco eleonorae, Falco naumanni, Falco peregrinus, Falco subbuteo, Didelphis aurita, Gracilinanus microtarsus, Marmosops paulensis, Monodelphis Americana, Odontesthes hatcheri, Podocarpus grayi, Podocarpus lawrencei, Podocarpus smithii, Portunus pelagicus, Syngnathus acus, Syngnathus typhle,Uroteuthis (Photololigo) edulis, Uroteuthis (Photololigo) duvauceli and Verticillium albo-atrum. This article also documents the addition of nine sequencing primer pairs and sixteen allele specific primers or probes for Oncorhynchus mykiss and Oncorhynchus tshawytscha; these primers and assays were cross-tested in both species.
Resumo:
BRCA1 encodes a tumour suppressor protein that plays pivotal roles in homologous recombination (HR) DNA repair, cell-cycle checkpoints, and transcriptional regulation. BRCA1 germline mutations confer a high risk of early-onset breast and ovarian cancer. In more than 80% of cases, tumours arising in BRCA1 germline mutation carriers are oestrogen receptor (ER)-negative; however, up to 15% are ER-positive. It has been suggested that BRCA1 ER-positive breast cancers constitute sporadic cancers arising in the context of a BRCA1 germline mutation rather than being causally related to BRCA1 loss-of-function. Whole-genome massively parallel sequencing of ER-positive and ER-negative BRCA1 breast cancers, and their respective germline DNAs, was used to characterize the genetic landscape of BRCA1 cancers at base-pair resolution. Only BRCA1 germline mutations, somatic loss of the wild-type allele, and TP53 somatic mutations were recurrently found in the index cases. BRCA1 breast cancers displayed a mutational signature consistent with that caused by lack of HR DNA repair in both ER-positive and ER-negative cases. Sequencing analysis of independent cohorts of hereditary BRCA1 and sporadic non-BRCA1 breast cancers for the presence of recurrent pathogenic mutations and/or homozygous deletions found in the index cases revealed that DAPK3, TMEM135, KIAA1797, PDE4D, and GATA4 are potential additional drivers of breast cancers. This study demonstrates that BRCA1 pathogenic germline mutations coupled with somatic loss of the wild-type allele are not sufficient for hereditary breast cancers to display an ER-negative phenotype, and has led to the identification of three potential novel breast cancer genes (ie DAPK3, TMEM135, and GATA4).