935 resultados para Complete Genome Sequence
Resumo:
We experimentally identified the activities of six predicted heptosyltransferases in Actinobacillus pleuropneumoniae genome serotype 5b strain L20 and serotype 3 strain JL03. The initial identification was based on a bioinformatic analysis of the amino acid similarity between these putative heptosyltrasferases with others of known function from enteric bacteria and Aeromonas. The putative functions of all the Actinobacillus pleuropneumoniae heptosyltrasferases were determined by using surrogate LPS acceptor molecules from well-defined A. hydrophyla AH-3 and A. salmonicida A450 mutants. Our results show that heptosyltransferases APL_0981 and APJL_1001 are responsible for the transfer of the terminal outer core D-glycero-D-manno-heptose (D,D-Hep) residue although they are not currently included in the CAZY glycosyltransferase 9 family. The WahF heptosyltransferase group signature sequence [S(T/S)(GA)XXH] differs from the heptosyltransferases consensus signature sequence [D(TS)(GA)XXH], because of the substitution of D(261) for S(261), being unique.
Resumo:
Myriapods (e.g., centipedes and millipedes) display a simple homonomous body plan relative to other arthropods. All members of the class are terrestrial, but they attained terrestriality independently of insects. Myriapoda is the only arthropod class not represented by a sequenced genome. We present an analysis of the genome of the centipede Strigamia maritima. It retains a compact genome that has undergone less gene loss and shuffling than previously sequenced arthropods, and many orthologues of genes conserved from the bilaterian ancestor that have been lost in insects. Our analysis locates many genes in conserved macro-synteny contexts, and many small-scale examples of gene clustering. We describe several examples where S. maritima shows different solutions from insects to similar problems. The insect olfactory receptor gene family is absent from S. maritima, and olfaction in air is likely effected by expansion of other receptor gene families. For some genes S. maritima has evolved paralogues to generate coding sequence diversity, where insects use alternate splicing. This is most striking for the Dscam gene, which in Drosophila generates more than 100,000 alternate splice forms, but in S. maritima is encoded by over 100 paralogues. We see an intriguing linkage between the absence of any known photosensory proteins in a blind organism and the additional absence of canonical circadian clock genes. The phylogenetic position of myriapods allows us to identify where in arthropod phylogeny several particular molecular mechanisms and traits emerged. For example, we conclude that juvenile hormone signalling evolved with the emergence of the exoskeleton in the arthropods and that RR-1 containing cuticle proteins evolved in the lineage leading to Mandibulata. We also identify when various gene expansions and losses occurred. The genome of S. maritima offers us a unique glimpse into the ancestral arthropod genome, while also displaying many adaptations to its specific life history.
Resumo:
Two Brazilian Potato virus Y (PVY) isolates were biologically characterized as necrotic (PVY-NBR) and common (PVY-OBR) based upon symptoms on test plants. Additional characterization was performed by sequencing a cDNA corresponding to the 3' terminal region of the viral genome. The sequence consisted of 195 nucleotides (nt) coding part of the nuclear inclusion body b (NIb) gene, 804 nt of the coat protein (CP) gene, and 328 nt (PVY-OBR) or 326 nt (PVY-NBR) of the 3'-untranslated region (UTR). Translation of the sequence resulted in one single open reading frame with part of the NIb and a CP of 267 amino acids. The two isolates shared 95.1% similarity in the CP amino acid sequence. The CP and the 3'-UTR sequence of the Brazilian isolates were compared to those of other PVY isolates previously reported and unrooted phylogenetic trees were constructed. The trees revealed a separation of two distinct clusters, one comprising most of the common strains and the other comprising the necrotic strains. PVY-OBR was clustered in the common group and PVY-NBR in the necrotic one.
Resumo:
Leafroll is an economically important disease affecting grapevines (Vitis spp.). Nine serologically distinct viruses, Grapevine leafroll-associated virus-1 through 9, are associated with this disease. The present study describes the coat protein gene sequence of four GLRaV-3 isolates occurring in the São Francisco River basin, Northeastern Brazil. The viral RNA was extracted from GLRaV-3 ELISA-positive plants and the complete coat protein gene was amplified by RT-PCR. Sequences were generated automatically and compared to the complete coat protein sequence from North American (NY1) and Chinese (Dawanhong Nº2 and SL10) GLRaV-3 isolates. The four studied isolates, named Pet-1 through 4, showed deduced amino acid identities of 98-100% (Pet-1 through 3) and 95% (Pet-4) with North American and Chinese isolates. A total of seventeen amino acid substitutions was detected among the four characterized isolates in comparison to the NY1, Dawanhong No.2 and SL10 sequences. The results indicated the existence of natural variation among GLRaV-3 isolates from grapevines, also demonstrating a lack of correlation between sequence data and geographic origin. This variability should be considered when selecting regions of the viral genome targeted for reliable and consistent virus molecular detection.
Resumo:
Reverse transcriptase (RT) sequence analysis is an important technique used to detect the presence of transposable elements in a genome. Putative RT sequences were analyzed in the genome of the pathogenic fungus C. perniciosa, the causal agent of witches' broom disease of cocoa. A 394 bp fragment was amplified from genomic DNA of different isolates of C. perniciosa belonging to C-, L-, and S-biotypes and collected from various geographical areas. The cleavage of PCR products with restriction enzymes and the sequencing of various RT fragments indicated the presence of several sequences showing transition events (G:C to A:T). Southern blot analysis revealed high copy numbers of RT signals, forming different patterns among C-, S-, and L-biotype isolates. Sequence comparisons of the predicted RT peptide indicate a close relationship with the RT protein from thegypsy family of LTR-retrotransposons. The possible role of these retrotransposons in generating genetic variability in the homothallic C. perniciosa is discussed.
Resumo:
A dichorionic twin pregnancy with complete hydatidiform mole and coexistent fetus is a rare and challenging situation, whose pathogenesis has not been yet fully understood. We present a case of a 39-year-old woman who underwent intracytoplasmic sperm injection with two embryos transfer. The 12-week gestation ultrasound examination revealed normal fetus and placenta with features of hydatidiform mole, leading to pregnancy termination. Autopsy and histological examinations diagnosed a complete mole coexisting with a normal fetus, and the genetic analysis showed a diploid fetus with biparental genome and molar tissue with paternal diploidy. This case highlighted that complete molar pregnancies may still occur even though pregnancy is achieved after intracytoplasmic sperm injection. A review of the literature was performed by collecting data from the few similar reported cases and by commenting on the pathogenesis of this rare condition.
Resumo:
Avian pathogenic Escherichia coli (APEC) infections are responsible for significant losses in the poultry industry worldwide. A zoonotic risk has been attributed to APEC strains because they present similarities to extraintestinal pathogenic E. coli (ExPEC) associated with illness in humans, mainly urinary tract infections and neonatal meningitis. Here, we present in silico analyses with pathogenic E. coli genome sequences, including recently available APEC genomes. The phylogenetic tree, based on multi-locus sequence typing (MLST) of seven housekeeping genes, revealed high diversity in the allelic composition. Nevertheless, despite this diversity, the phylogenetic tree was able to cluster the different pathotypes together. An in silico virulence gene profile was also determined for each of these strains, through the presence or absence of 83 well-known virulence genes/traits described in pathogenic E. coli strains. The MLST phylogeny and the virulence gene profiles demonstrated a certain genetic similarity between Brazilian APEC strains, APEC isolated in the United States, UPEC (uropathogenic E. coli) and diarrheagenic strains isolated from humans. This correlation corroborates and reinforces the zoonotic potential hypothesis proposed to APEC.
Resumo:
Measles virus is a highly contagious agent which causes a major health problem in developing countries. The viral genomic RNA is single-stranded, nonsegmented and of negative polarity. Many live attenuated vaccines for measles virus have been developed using either the prototype Edmonston strain or other locally isolated measles strains. Despite the diverse geographic origins of the vaccine viruses and the different attenuation methods used, there was remarkable sequence similarity of H, F and N genes among all vaccine strains. CAM-70 is a Japanese measles attenuated vaccine strain widely used in Brazilian children and produced by Bio-Manguinhos since 1982. Previous studies have characterized this vaccine biologically and genomically. Nevertheless, only the F, H and N genes have been sequenced. In the present study we have sequenced the remaining P, M and L genes (approximately 1.6, 1.4 and 6.5 kb, respectively) to complete the genomic characterization of CAM-70 and to assess the extent of genetic relationship between CAM-70 and other current vaccines. These genes were amplified using long-range or standard RT-PCR techniques, and the cDNA was cloned and automatically sequenced using the dideoxy chain-termination method. The sequence analysis comparing previously sequenced genotype A strains with the CAM-70 Bio-Manguinhos strain showed a low divergence among them. However, the CAM-70 strains (CAM-70 Bio-Manguinhos and a recently sequenced CAM-70 submaster seed strain) were assigned to a specific group by phylogenetic analysis using the neighbor-joining method. Information about our product at the genomic level is important for monitoring vaccination campaigns and for future studies of measles virus attenuation.
Resumo:
Calves born persistently infected with non-cytopathic bovine viral diarrhea virus (ncpBVDV) frequently develop a fatal gastroenteric illness called mucosal disease. Both the original virus (ncpBVDV) and an antigenically identical but cytopathic virus (cpBVDV) can be isolated from animals affected by mucosal disease. Cytopathic BVDVs originate from their ncp counterparts by diverse genetic mechanisms, all leading to the expression of the non-structural polypeptide NS3 as a discrete protein. In contrast, ncpBVDVs express only the large precursor polypeptide, NS2-3, which contains the NS3 sequence within its carboxy-terminal half. We report here the investigation of the mechanism leading to NS3 expression in 41 cpBVDV isolates. An RT-PCR strategy was employed to detect RNA insertions within the NS2-3 gene and/or duplication of the NS3 gene, two common mechanisms of NS3 expression. RT-PCR amplification revealed insertions in the NS2-3 gene of three cp isolates, with the inserts being similar in size to that present in the cpBVDV NADL strain. Sequencing of one such insert revealed a 296-nucleotide sequence with a central core of 270 nucleotides coding for an amino acid sequence highly homologous (98%) to the NADL insert, a sequence corresponding to part of the cellular J-Domain gene. One cpBVDV isolate contained a duplication of the NS3 gene downstream from the original locus. In contrast, no detectable NS2-3 insertions or NS3 gene duplications were observed in the genome of 37 cp isolates. These results demonstrate that processing of NS2-3 without bulk mRNA insertions or NS3 gene duplications seems to be a frequent mechanism leading to NS3 expression and BVDV cytopathology.
Resumo:
The present study examined the distribution of hepatitis C virus (HCV) genotypes and subtypes in a hemodialysis population in Goiás State, Central Brazil, and evaluated the efficiency of two genotyping methods: line probe assay (LiPA) based on the 5' noncoding region and nucleotide sequencing of the nonstructural 5B (NS5B) region of the genome. A total of 1095 sera were tested for HCV RNA by RT-nested PCR of the 5' noncoding region. The LiPA assay was able to genotype all 131 HCV RNA-positive samples. Genotypes 1 (92.4%) and 3 (7.6%) were found. Subtype 1a (65.7%) was the most prevalent, followed by subtypes 1b (26.7%) and 3a (7.6%). Direct nucleotide sequencing of 340 bp from the NS5B region was performed in 106 samples. The phylogenetic tree showed that 98 sequences (92.4%) were classified as genotype 1, subtypes 1a (72.6%) and 1b (19.8%), and 8 sequences (7.6%) as subtype 3a. The two genotyping methods gave concordant results within HCV genotypes and subtypes in 100 and 96.2% of cases, respectively. Only four samples presented discrepant results, with LiPA not distinguishing subtypes 1a and 1b. Therefore, HCV genotype 1 (subtype 1a) is predominant in hemodialysis patients in Central Brazil. By using sequence analysis of the NS5B region as a reference standard method for HCV genotyping, we found that LiPA was efficient at the genotype level, although some discrepant results were observed at the subtype level (sensitivity of 96.1% for subtype 1a and 95.2% for subtype 1b). Thus, analysis of the NS5B region permitted better discrimination between HCV subtypes, as required in epidemiological investigations.
Resumo:
Lichens are symbiotic organisms, which consist of the fungal partner and the photosynthetic partner, which can be either an alga or a cyanobacterium. In some lichen species the symbiosis is tripartite, where the relationship includes both an alga and a cyanobacterium alongside the primary symbiont, fungus. The lichen symbiosis is an evolutionarily old adaptation to life on land and many extant fungal species have evolved from lichenised ancestors. Lichens inhabit a wide range of habitats and are capable of living in harsh environments and on nutrient poor substrates, such as bare rocks, often enduring frequent cycles of drying and wetting. Most lichen species are desiccation tolerant, and they can survive long periods of dehydration, but can rapidly resume photosynthesis upon rehydration. The molecular mechanisms behind lichen desiccation tolerance are still largely uncharacterised and little information is available for any lichen species at the genomic or transcriptomic level. The emergence of the high-throughput next generation sequencing (NGS) technologies and the subsequent decrease in the cost of sequencing new genomes and transcriptomes has enabled non-model organism research on the whole genome level. In this doctoral work the transcriptome and genome of the grey reindeer lichen, Cladonia rangiferina, were sequenced, de novo assembled and characterised using NGS and traditional expressed sequence tag (EST) technologies. RNA extraction methods were optimised to improve the yield and quality of RNA extracted from lichen tissue. The effects of rehydration and desiccation on C. rangiferina gene expression on whole transcriptome level were studied and the most differentially expressed genes were identified. The secondary metabolites present in C. rangiferina decreased the quality – integrity, optical characteristics and utility for sensitive molecular biological applications – of the extracted RNA requiring an optimised RNA extraction method for isolating sufficient quantities of high-quality RNA from lichen tissue in a time- and cost-efficient manner. The de novo assembly of the transcriptome of C. rangiferina was used to produce a set of contiguous unigene sequences that were used to investigate the biological functions and pathways active in a hydrated lichen thallus. The de novo assembly of the genome yielded an assembly containing mostly genes derived from the fungal partner. The assembly was of sufficient quality, in size similar to other lichen-forming fungal genomes and included most of the core eukaryotic genes. Differences in gene expression were detected in all studied stages of desiccation and rehydration, but the largest changes occurred during the early stages of rehydration. The most differentially expressed genes did not have any annotations, making them potentially lichen-specific genes, but several genes known to participate in environmental stress tolerance in other organisms were also identified as differentially expressed.
Resumo:
The construction of adenovirus vectors for cloning and foreign gene expression requires packaging cell lines that can complement missing viral functions caused by sequence deletions and/or replacement with foreign DNA sequences. In this study, packaging cell lines were designed to provide in trans the missing bovine adenovirus functions, so that recombinant viruses could be generated. Fetal bovine kidney and lUng cells, acquired at the trimester term from a pregnant cow, were tranfected with both digested wild type BAV2 genomic DNA and pCMV-EI. The plasmid pCMV-EI was specifically constructed to express El of BAV2 under the control of the cytomegalovirus enhancer/promoter (CMV). Selection for "true" transformants by continuous passaging showed no success in isolating immortalised cells, since the cells underwent crisis resulting in complete cell death. Moreover, selection for G418 resistance, using the same cells, also did not result in the isolation of an immortalised cell line and the same culture-collapse event was observed. The lack of success in establishing an immortalised cell line from fetal tissue prompted us to transfect a pre-established cell line. We began by transfecting MDBK (Mardin-Dardy bovine kidney) cells with pCMV-El-neo, which contain the bacterial selectable marker neo gene. A series of MDBK-derived cell lines, that constitutively express bovine adenoviral (BAV) early region 1 (El), were then isolated. Cells selected for resistance to the drug G418 were isolated collectively for full characterisation to assess their suitability as packaging cell lines. Individual colonies were isolated by limiting dilution and further tested for El expression and efficiency of DNA uptake. Two cell lines, L-23 and L-24, out of 48 generated foci tested positive for £1 expression using Northern Blot analysis. DNA uptake studies, using both lipofectamine and calcium phosphate methods, were performed to compare these cells, their parental MDBK cells, 8 and the unrelated human 293 cells as a benchmark. The results revealed that the new MDBKderived clones were no more efficient than MDBK cells in the transient expression of transfected DNA and that they were inferior to 293 cells, when using lacZ as the reporter gene. In view of the inherently poor transfection efficiency of MDBK cells and their derivatives, a number of other bovine cells were investigated for their potential as packaging cells. The cell line CCL40 was chosen for its high efficiency in DNA uptake and subsequently transfected with the plasmid vector pCMV El-neo. By selection with the drug G418, two cell lines were isolated, ProCell 1 and ProCell 2. These cell lines were tested for El expression, permissivity to BAV2 and DNA uptake efficiency, revealing a DNA uptake efficiency of 37 % , comparable to that of CCL40. Attempts to rescue BAV2 mutants carrying the lacZ gene in place of £1 or £3 were carried out by co-transfecting wild type viral DNA with either the plasmid pdlElE-Z (which contains BAV2 sequences from 0% to 40.4% with the lacZ gene in place of the £1 region from 1.1% to 8.25%) or with the plasmid pdlE3-5-Z (which contains BAV2 sequences from 64.8% to 100% with the lacZ gene in place of the E3 region from 75.8% to 81.4%). These cotransfections did not result in the generation of a viral mutant. The lack of mutant generation was thought to be caused by the relative inefficiency ofDNA uptake. Consequently, cosBAV2, a cosmid vector carrying the BAV2 genome, was modified to carry the neo reporter gene in place of the £3 region from 75.8% to 81.4%. The use of a single cosmid vector earring the whole genome would eliminate the need for homologous recombination in order to generate a viral vector. Unfortunately, the transfection of cosBAV2- neo also did not result in the generation of a viral mutant. This may have been caused by the size of the £3 deletion, where excess sequences that are essential to the virus' survival might have been deleted. As an extension to this study, the spontaneous E3 deletion, accidently discovered in our viral stock, could be used as site of foreign gene insertion.
Resumo:
Adenoviruses are non-enveloped icosahedral-shaped particles which possess a double-stranded DNA genome. Currently, nearly 100 serotypes of adenoviruses have been identified, 48 of which are of human origin. Bovine adenoviruses (BAVs), causing both mild respiratory and/or enteral diseases in cattle, have been reported in many countries all over the world. Currently, nine serotypes of SAVs have been isolated which have been placed into two subgroups based on a number of characteristics which include complement fixation tests as well as the ability to replicate in various cell lines. Bovine adenovirus type 2 (BAV2), belonging to subgroup I, is able to cause pneumonia as well as pneumonic-like symptoms in calves. In this study, the genome of BAV2 (strain No. 19) was subcloned into the plasmid vector pUC19. In total, 16 plasmids were constructed; three carry internal San fragments (spanning 3.1 to 65.2% ), and 10 carry internal Pstl fragments (spanning 4.9 to 97.4%), of the viral genome. Each of these plasmids was analyzed using twelve restriction endonucleases; BamHI, CiaI, EcoRl, HiOOlll, Kpnl, Noll, NS(N, Ps~, Pvul, Saj, Xbal, and Xhol. Terminal end fragments were also cloned and analyzed, sUbsequent to the removal of the 5' terminal protein, in the form of 2 BamHI B fragments, cloned in opposite orientations (spanning 0 to 18.1°k), and one Pstll fragment (spanning 97.4 to 1000/0). These cloned fragments, along with two other plasmids previously constructed carrying internal EcoRI fragments (spanning 20.6 to 90.5%), were then used to construct a detailed physical restriction map using the twelve restriction endonucleases, as well as to estimate the size of the genome for BAV2(32.5 Kbp). The DNA sequences of the early region 1 (E1) and hexon-associated gene (protein IX) have also been determined. The amino acid sequences of four open reading frames (ORFs) have been compared to those of the E1 proteins and protein IX from other Ads.
Resumo:
Variations in different types of genomes have been found to be responsible for a large degree of physical diversity such as appearance and susceptibility to disease. Identification of genomic variations is difficult and can be facilitated through computational analysis of DNA sequences. Newly available technologies are able to sequence billions of DNA base pairs relatively quickly. These sequences can be used to identify variations within their specific genome but must be mapped to a reference sequence first. In order to align these sequences to a reference sequence, we require mapping algorithms that make use of approximate string matching and string indexing methods. To date, few mapping algorithms have been tailored to handle the massive amounts of output generated by newly available sequencing technologies. In otrder to handle this large amount of data, we modified the popular mapping software BWA to run in parallel using OpenMPI. Parallel BWA matches the efficiency of multithreaded BWA functions while providing efficient parallelism for BWA functions that do not currently support multithreading. Parallel BWA shows significant wall time speedup in comparison to multithreaded BWA on high-performance computing clusters, and will thus facilitate the analysis of genome sequencing data.
Resumo:
Sequence repeats are an important phenomenon in the human genome, playing important roles in genomic alteration often with phenotypic consequences. The two major types of repeat elements in the human genome are tandem repeats (TRs) including microsatellites, minisatellites, and satellites and transposable elements (TEs). So far, very little has been known about the relationship between these two types of repeats. In this study, we identified TRs that are derived from TEs either based on sequence similarity or overlapping genomic positions. We then analyzed the distribution of these TRs among TE families/subfamilies. Our study shows that at least 7,276 TRs or 23% of all minisatellites/satellites is derived from TEs, contributing ∼0.32% of the human genome. TRs seem to be generated more likely from younger/more active TEs, and once initiated they are expanded with time via local duplication of the repeat units. The currently postulated mechanisms for origin of TRs can explain only 6% of all TE-derived TRs, indicating the presence of one or more yet to be identified mechanisms for the initiation of such repeats. Our result suggests that TEs are contributing to genome expansion and alteration not only by transposition but also by generating tandem repeats.