35 resultados para CpGV resistance baculovirus whole genome sequencing
em Chinese Academy of Sciences Institutional Repositories Grid Portal
Resumo:
The human genome project has been recently complemented by whole-genome assessment sequence of 32 mammals and 24 nonmammalian vertebrate species suitable for comparative genomic analyses. Here we anticipate a precipitous drop in costs and increase in sequ
Resumo:
Mitochondria are essential for cellular energy production in most eukaryotic organisms. However, when glucose is abundant, yeast species that underwent whole-genome duplication (WGD) mostly conduct fermentation even under aerobic conditions, and most can
Resumo:
The Chinese pangolin (Manis pentadactyla), a representative species of the order Pholidota, has been enlisted in the mammalian whole-genome sequencing project mainly because of its phylogenetic importance. Previous studies showed that the diploid number o
Resumo:
Highly pathogenic avian influenza H5N1 virus has swept west across the globe and caused serious debates on the roles of migratory birds in virus circulation since the first large-scale outbreak in migratory birds of Lake Qinghai, 2005. In May 2006, another outbreak struck Lake Qinghai and six novel strains were isolated. To elucidate these QH06 viruses, the six isolates were subjected to whole-genome sequencing. Phylogenetic analyses show that QH06 viruses are derived from the lineages of Lake Qinghai, 2005. Five of the six novel isolates are adjacent to the strain A/Cygnus olor/Croatia/1/05, and the last one is related to the strain A/duck/Novosibirsk/ 02/05, an isolate of the flyway. Antigenic analyses suggest that QH06 and QH05 viruses are similar to each other. These findings implicate that QH06 viruses of Lake Qinghai may travel back via migratory birds, though not ruling out the possibility of local circulation of viruses of Lake Qinghai.
Sequencing, annotation and comparative analysis of nine BACs of giant panda (Ailuropoda melanoleuca)
Resumo:
A 10-fold BAC library for giant panda was constructed and nine BACs were selected to generate finish sequences. These BACs could be used as a validation resource for the de novo assembly accuracy of the whole genome shotgun sequencing reads of giant panda newly generated by the Illumina GA sequencing technology. Complete sanger sequencing, assembly, annotation and comparative analysis were carried out on the selected BACs of a joint length 878 kb. Homologue search and de novo prediction methods were used to annotate genes and repeats. Twelve protein coding genes were predicted, seven of which could be functionally annotated. The seven genes have an average gene size of about 41 kb, an average coding size of about 1.2 kb and an average exon number of 6 per gene. Besides, seven tRNA genes were found. About 27 percent of the BAC sequence is composed of repeats. A phylogenetic tree was constructed using neighbor-join algorithm across five species, including giant panda, human, dog, cat and mouse, which reconfirms dog as the most related species to giant panda. Our results provide detailed sequence and structure information for new genes and repeats of giant panda, which will be helpful for further studies on the giant panda.
Resumo:
Using next-generation sequencing technology alone, we have successfully generated and assembled a draft sequence of the giant panda genome. The assembled contigs (2.25 gigabases (Gb)) cover approximately 94% of the whole genome, and the remaining gaps (0.05 Gb) seem to contain carnivore-specific repeats and tandem repeats. Comparisons with the dog and human showed that the panda genome has a lower divergence rate. The assessment of panda genes potentially underlying some of its unique traits indicated that its bamboo diet might be more dependent on its gut microbiome than its own genetic composition. We also identified more than 2.7 million heterozygous single nucleotide polymorphisms in the diploid genome. Our data and analyses provide a foundation for promoting mammalian genetic research, and demonstrate the feasibility for using next-generation sequencing technologies for accurate, cost-effective and rapid de novo assembly of large eukaryotic genomes.
Resumo:
With complete sets of chromosome-specific painting probes derived from flow-sorted chromosomes of human and grey squirrel (Sciurus carolinensis), the whole genome homologies between human and representatives of tree squirrels (Sciurus carolinensis, Callosciurus erythraeus), flying squirrels (Petaurista albiventer) and chipmunks (Tamias sibiricus) have been defined by cross-species chromosome painting. The results show that, unlike the highly rearranged karyotypes of mouse and rat, the karyotypes of squirrels are highly conserved. Two methods have been used to reconstruct the genome phylogeny of squirrels with the laboratory rabbit (Oryctolagus cuniculus) as the out-group: ( 1) phylogenetic analysis by parsimony using chromosomal characters identified by comparative cytogenetic approaches; ( 2) mapping the genome rearrangements onto recently published sequence-based molecular trees. Our chromosome painting results, in combination with molecular data, show that flying squirrels are phylogenetically close to New World tree squirrels. Chromosome painting and G-banding comparisons place chipmunks ( Tamias sibiricus), with a derived karyotype, outside the clade comprising tree and flying squirrels. The superorder Glires (order Rodentia + order Lagomorpha) is firmly supported by two conserved syntenic associations between human chromosomes 1 and 10p homologues, and between 9 and 11 homologues.
Resumo:
Background: Giardia are a group of widespread intestinal protozoan parasites in a number of vertebrates. Much evidence from G. lamblia indicated they might be the most primitive extant eukaryotes. When and how such a group of the earliest branching unicellular eukaryotes developed the ability to successfully parasitize the latest branching higher eukaryotes (vertebrates) is an intriguing question. Gene duplication has long been thought to be the most common mechanism in the production of primary resources for the origin of evolutionary novelties. In order to parse the evolutionary trajectory of Giardia parasitic lifestyle, here we carried out a genome-wide analysis about gene duplication patterns in G. lamblia. Results: Although genomic comparison showed that in G. lamblia the contents of many fundamental biologic pathways are simplified and the whole genome is very compact, in our study 40% of its genes were identified as duplicated genes. Evolutionary distance analyses of these duplicated genes indicated two rounds of large scale duplication events had occurred in G. lamblia genome. Functional annotation of them further showed that the majority of recent duplicated genes are VSPs (Variant-specific Surface Proteins), which are essential for the successful parasitic life of Giardia in hosts. Based on evolutionary comparison with their hosts, it was found that the rapid expansion of VSPs in G. lamblia is consistent with the evolutionary radiation of placental mammals. Conclusions: Based on the genome-wide analysis of duplicated genes in G. lamblia, we found that gene duplication was essential for the origin and evolution of Giardia parasitic lifestyle. The recent expansion of VSPs uniquely occurring in G. lamblia is consistent with the increment of its hosts. Therefore we proposed a hypothesis that the increment of Giradia hosts might be the driving force for the rapid expansion of VSPs.
Resumo:
The Sox gene family is found in a broad range of animal taxa and encodes important gene regulatory proteins involved in a variety of developmental processes. We have obtained clones representing the HMG boxes of twelve Sox genes from grass carp (Ctenopharyngodon idella), one of the four major domestic carps in China. The cloned Sox genes belong to group B1, B2 and C. Our analyses show that whereas the human genome contains a single copy of Sox4, Sox11 and Sox14, each of these genes has two co-orthologs in grass carp, and the duplication of Sox4 and Sox11 occurred before the divergence of grass carp and zebrafish, which support the "fish-specific whole-genome duplication" theory. An estimation for the origin of grass carp based on the molecular clock using Sox1, Sox3 and Sox11 genes as markers indicates that grass carp (subfamily Leuciscinae) and zebrafish (subfamily Danioninae) diverged approximately 60 million years ago. The potential uses of Sox genes as markers in revealing the evolutionary history of grass carp are discussed.
Resumo:
Vibrio anguillarum is a common bacterial pathogen in fish. However, little is known about its pathogenic mechanism, in part, because the entire genome has not been completely sequenced. We constructed a fosmid library for V. anguillarum containing 960 clones with an average insert size of 37.7 kb and 8.6-fold genome coverage. We characterized the library by end-sequencing 50 randomly selected clones. This generated 93 sequences with a total length of 57 485 by covering 1.4% of the whole genome. Of these sequences, 58 (62.4%) were homologous to known genes, 30 (32.3%) were genes with hypothetical functions, and the remaining 5 (5.3%) were unknown genes. We demonstrated the utility of this library by PCR screening of 10 genes. This resulted in an average of 6.2 fosmid clones per screening. This fosmid library offers a new tool for gene screening and cloning of V. anguillarum, and for comparative genomic studies among Vibrio species.
Resumo:
Transcription factor binding sites (TFBS) play key roles in genebior 6.8 wavelet expression and regulation. They are short sequence segments with de¯nite structure and can be recognized by the corresponding transcription factors correctly. From the viewpoint of statistics, the candidates of TFBS should be quite di®erent from the segments that are randomly combined together by nucleotide. This paper proposes a combined statistical model for ¯nding over- represented short sequence segments in di®erent kinds of data set. While the over-represented short sequence segment is described by position weight matrix, the nucleotide distribution at most sites of the segment should be far from the background nucleotide distribution. The central idea of this approach is to search for such kind of signals. This algorithm is tested on 3 data sets, including binding sites data set of cyclic AMP receptor protein in E.coli, PlantProm DB which is a non-redundant collection of proximal promoter sequences from di®erent species, collection of the intergenic sequences of the whole genome of E.Coli. Even though the complexity of these three data sets is quite di®erent, the results show that this model is rather general and sensible.
Resumo:
The Indian muntjac (Muntiacus muntjak vaginalis) has a karyotype of 2n=6 in the female and 7 in the male, the karyotypic evolution of which through extensive tandem fusions and several centric fusions has been well-documented by recent molecular cytogenetic studies. In an attempt to define the fusion orientations of conserved chromosomal segments and the molecular mechanisms underlying the tandem fusions, we have constructed a highly redundant (more than six times of whole genome coverage) bacterial artificial chromosome (BAC) library of Indian muntjac. The BAC library contains 124,800 clones with no chromosome bias and has an average insert DNA size of 120 kb. A total of 223 clones have been mapped by fluorescent in situ hybridization onto the chromosomes of both Indian muntjac and Chinese muntjac and a high-resolution comparative map has been established. Our mapping results demonstrate that all tandem fusions that occurred during the evolution of Indian muntjac karyotype from the acrocentric 2n=70 hypothetical ancestral karyotype are centromere-telomere (head-tail) fusions.
Resumo:
We constructed a high redundancy bacterial artificial chromosome library of a seriously endangered Old World Monkey, the Yunnan snub-nosed monkey (Rhinopithecus bieti) from China. This library contains a total of 136 320 BAC clones. The average insert size of BAC clones was estimated to be 148 kb. The percentage of small inserts (50-100 kb) is 2.74%, and only 2.67% non-recombinant clones were observed. Assuming a similar genome size with closely related primate species, the Yunnan snub-nosed monkey BAC library has at least six times the genome coverage. By end sequencing of randomly selected BAC clones, we generated 201 sequence tags for the library. A total of 139 end-sequenced BAC clones were mapped onto the chromosomes of Yunnan snub-nosed monkey by fluorescence in-situ hybridization, demonstrating a high degree of synteny conservation between humans and Yunnan snub-nosed monkeys. Blast search against human genome showed a good correlation between the number of hit clones and the size of the chromosomes, an indication of unbiased chromosomal distribution of the BAC library. This library and the mapped BAC clones will serve as a valuable resource in comparative genomics studies and large-scale genome sequencing of nonhuman primates. The DNA sequence data reported in this paper were deposited in GenBank and assigned the accession number CG891489-CG891703.
Resumo:
The mitochondrial DNA (mtDNA) control region is believed to play an important biological role in mtDNA replication. Large deletions in this region are rarely found, but when they do occur they might be expected to interfere with the replication of the molecule, thus leading to a reduction of mtDNA copy number. During a survey for mtDNA sequence variations in 5,559 individuals from the general Chinese population and 2,538 individuals with medical disorders, we identified a 50-bp deletion (m.298_347del50) in the mtDNA control region in a member of a healthy Han Chinese family belonging to haplogroup B4c1b2, as suggested by complete mtDNA genome sequencing. This deletion removes the conserved sequence block II (CSBII; region 299-315) and the replication primer location (region 317-321). However, quantification of the mtDNA copy number in this subject showed a value within a range that was observed in 20 healthy subjects without the deletion. The deletion was detected in the hair samples of the maternal relatives of the subject and exhibited variable heteroplasmy. Our current observation, together with a recent report for a benign 154-bp deletion in the mtDNA control region, suggests that the control of mtDNA replication may be more complex than we had thought. Hum Mutat 31:538-543, 2010. (C) 2010 Wiley-Liss, Inc.
Resumo:
We report improved whole-genome shotgun sequences for the genomes of indica and japonica rice, both with multimegabase contiguity, or almost 1,000-fold improvement over the drafts of 2002. Tested against a nonredundant collection of 19,079 full-length cDNAs, 97.7% of the genes are aligned, without fragmentation, to the mapped superscaffolds of one or the other genome. We introduce a gene identification procedure for plants that does not rely on similarity to known genes to remove erroneous predictions resulting from transposable elements. Using the available EST data to adjust for residual errors in the predictions, the estimated gene count is at least 38,000 - 40,000. Only 2% - 3% of the genes are unique to any one subspecies, comparable to the amount of sequence that might still be missing. Despite this lack of variation in gene content, there is enormous variation in the intergenic regions. At least a quarter of the two sequences could not be aligned, and where they could be aligned, single nucleotide polymorphism ( SNP) rates varied from as little as 3.0 SNP/kb in the coding regions to 27.6 SNP/kb in the transposable elements. A more inclusive new approach for analyzing duplication history is introduced here. It reveals an ancient whole-genome duplication, a recent segmental duplication on Chromosomes 11 and 12, and massive ongoing individual gene duplications. We find 18 distinct pairs of duplicated segments that cover 65.7% of the genome; 17 of these pairs date back to a common time before the divergence of the grasses. More important, ongoing individual gene duplications provide a never-ending source of raw material for gene genesis and are major contributors to the differences between members of the grass family.