897 results for patent sequence datasets
Abstract:
Enterococcus hirae ATCC 9790 is a Gram-positive lactic acid bacterium that has been used in basic research for over 4 decades. Here we report the sequence and annotation of the 2.8-Mb genome of E. hirae and its endemic 29-kb plasmid pTG9790.
Abstract:
Human immunodeficiency virus (HIV) has infected millions of people worldwide, many of whom are unaware that they are infected, so it is vital to understand how the virus functions at the molecular level. Because there is currently no vaccine and no way to eradicate the virus from an infected person, any information about how the virus interacts with its host improves our understanding of HIV and brings scientists one step closer to combating it. Thousands of HIV isolates have been sequenced and are publicly available in online databases. Attributes linked to each sequence include the viral load within the host and the patient's current disease state. Being able to predict a patient's stage of infection is valuable, as it could aid in choosing treatment options and appropriate medication. Our approach, which analyzes region-specific amino acid composition for select genes, predicts patient disease state with an accuracy of up to 85.4%. Moreover, we output a set of sequence-based classification rules that may prove useful for predicting the expected clinical outcome of an infected patient.
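As a rough illustration of what a region-specific amino acid composition feature might look like, the following minimal sketch computes the fraction of each residue within a window of a protein sequence. The function name `region_composition`, the example sequence, and the window coordinates are hypothetical; the abstract does not specify which regions, genes, or classifier were used.

```python
from collections import Counter

# The 20 standard amino acids (one-letter codes).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def region_composition(protein, start, end):
    """Return the fraction of each standard amino acid in protein[start:end].

    Coordinates are 0-based with an exclusive end (an assumption made for
    this sketch). Non-standard symbols are ignored.
    """
    region = protein[start:end].upper()
    counts = Counter(region)
    total = sum(counts[a] for a in AMINO_ACIDS)
    if total == 0:
        return {a: 0.0 for a in AMINO_ACIDS}
    return {a: counts[a] / total for a in AMINO_ACIDS}


# Hypothetical toy sequence and window; real input would be a viral protein.
features = region_composition("MKVLAAGKKA", 2, 8)
```

A vector of such fractions, computed per region and per gene, could then be passed to any rule-based classifier to relate composition to disease state.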
Abstract:
Digital signal processing (DSP) techniques for biological sequence analysis continue to grow in popularity due to the inherently digital nature of these sequences. DSP methods have demonstrated early success in detecting coding regions in a gene. More recently, these methods have been used to establish similarity between genes. We present the inter-coefficient difference (ICD) transformation, a novel extension of the discrete Fourier transformation, which can be applied to any DNA sequence. The ICD method is a mathematical, alignment-free DNA comparison method that generates a genetic signature for any DNA sequence; these signatures are then used to compute relative measures of similarity among DNA sequences. We demonstrate our method on a set of insulin genes obtained from an evolutionarily wide range of species, and on a set of avian influenza viral sequences, which represents a set of highly similar sequences. We compare phylogenetic trees generated using our technique against trees generated using traditional alignment techniques and demonstrate that the ICD method produces a highly accurate tree without requiring an alignment prior to establishing sequence similarity.
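The abstract does not define the ICD transformation in detail, so the following is only a sketch of the general idea of an alignment-free, DFT-based signature. It assumes binary indicator signals for the four bases, a fixed-length DFT, and differences between adjacent power-spectrum coefficients as the signature; all function names and the `n_points` parameter are hypothetical and may differ from the authors' method.

```python
import numpy as np


def indicator_signals(dna):
    """Encode a DNA string as four binary indicator signals (rows: A, C, G, T)."""
    dna = dna.upper()
    return np.array([[1.0 if base == b else 0.0 for base in dna] for b in "ACGT"])


def icd_signature(dna, n_points=64):
    """Assumed signature: differences between adjacent DFT power coefficients.

    The signal is zero-padded to n_points (or truncated if longer), so
    sequences of different lengths yield signatures of equal length.
    """
    signals = indicator_signals(dna)
    spectrum = np.abs(np.fft.fft(signals, n=n_points, axis=1)) ** 2
    # Sum power over the four bases; drop the DC term and the redundant half.
    power = spectrum.sum(axis=0)[1 : n_points // 2 + 1]
    total = power.sum()
    if total == 0:
        return np.zeros(len(power) - 1)
    return np.diff(power / total)


def signature_distance(a, b, n_points=64):
    """Euclidean distance between two signatures (no alignment needed)."""
    return float(np.linalg.norm(icd_signature(a, n_points) - icd_signature(b, n_points)))


same = signature_distance("ATGCATGCATGC", "ATGCATGCATGC")
different = signature_distance("ATGCATGCATGC", "AAAATTTTCCCC")
```

A matrix of such pairwise distances could be fed to a standard tree-building method such as neighbor joining to obtain a phylogenetic tree for comparison with alignment-based trees.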
Polymerization of Styrene and Cyclization to Macrocyclic Polystyrene in a One-Pot, Two-Step Sequence
Abstract:
Dibrominated polystyrene (BrPStBr) was produced by atom transfer radical polymerization (ATRP) at 80 °C, using the bifunctional initiator benzal bromide to afford the telechelic precursor. The ATRP reaction was stopped at around 40% monomer conversion and directly converted into a radical trap-assisted atom transfer radical coupling (RTA-ATRC) reaction by lowering the temperature to 50 °C and adding the radical trap 2-methyl-2-nitrosopropane (MNP) along with additional catalyst, reducing agent, and ligand to match ATRC-type reaction conditions. In an attempt to induce intramolecular coupling, rather than solely intermolecular coupling and elongation, the total reaction volume was increased by the addition of varying amounts of THF. Cyclization, along with intermolecular coupling and elongation, occurred in all cases, with the extent of ring closure a function of the total reaction volume. The cyclic portion of the coupled product was found to have a (G) value around 0.8 by GPC analysis, consistent with the reduction in hydrodynamic volume of a cyclic polymer compared to its linear analog. Analysis of the sequence by ¹H NMR confirmed that propagation was suppressed nearly completely during the RTA-ATRC phase, with percent monomer conversion remaining constant after the ATRP phase. © 2013 Elsevier Ltd. All rights reserved.
Abstract:
With the advent of high-throughput sequencing (HTS), the emerging science of metagenomics is transforming our understanding of the relationships of microbial communities with their environments. While metagenomics aims to catalogue the genes present in a sample, metatranscriptomics, by assessing which genes are actively expressed, can provide a mechanistic understanding of community inter-relationships. To achieve these goals, several challenges need to be addressed, from sample preparation to sequence processing, statistical analysis, and functional annotation. Here we use an inbred non-obese diabetic (NOD) mouse model, in which germ-free animals were colonized with a defined mixture of eight commensal bacteria, to explore methods of RNA extraction and to develop a pipeline for the generation and analysis of metatranscriptomic data. Applying the Illumina HTS platform, we sequenced 12 NOD cecal samples prepared using multiple RNA-extraction protocols. The absence of a complete set of reference genomes necessitated a peptide-based search strategy. Up to 16% of sequence reads could be matched to a known bacterial gene. Phylogenetic analysis of the mapped ORFs revealed a distribution consistent with ribosomal RNA, the majority from Bacteroides or Clostridium species. To place these HTS data within a systems context, we mapped the relative abundance of corresponding Escherichia coli homologs onto metabolic and protein-protein interaction networks. These maps identified bacterial processes with components that were well represented in the datasets. In summary, this study highlights the potential of exploiting the economy of HTS platforms for metatranscriptomics.
Abstract:
The rupture of intracranial aneurysms leads to subarachnoid hemorrhage, which is often associated with poor outcome. Preventive treatment of unruptured intracranial aneurysms is possible and recommended. However, the lack of candidate genes precludes identifying patients at risk by genetic analyses. We observed intracranial aneurysms in 2 patients with von Hippel-Lindau (VHL) disease and the known disease-causing mutation c.292T>C (p.Tyr98His) in the VHL tumor suppressor gene. This study investigates whether the VHL gene is a possible candidate gene for aneurysm formation.
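The notation c.292T>C (p.Tyr98His) can be checked with a small arithmetic sketch: coding position 292 is the first base of codon 98, and replacing T with C at the first position of either tyrosine codon yields a histidine codon. The helper names below are hypothetical, and the sketch does not assume which tyrosine codon the VHL gene actually uses at this position.

```python
# Subset of the standard genetic code relevant to this variant.
CODON_TABLE_SUBSET = {"TAT": "Tyr", "TAC": "Tyr", "CAT": "His", "CAC": "His"}


def codon_position(cdna_pos):
    """Map a 1-based coding position to (1-based codon number, 0-based offset)."""
    return (cdna_pos - 1) // 3 + 1, (cdna_pos - 1) % 3


def apply_substitution(codon, offset, ref, alt):
    """Replace the base at `offset`, checking that it matches the reference base."""
    if codon[offset] != ref:
        raise ValueError(f"expected {ref} at offset {offset} in {codon}")
    return codon[:offset] + alt + codon[offset + 1:]


codon_number, offset = codon_position(292)

# Try both tyrosine codons, since the actual reference codon is not given here.
results = {
    codon: CODON_TABLE_SUBSET[apply_substitution(codon, offset, "T", "C")]
    for codon in ("TAT", "TAC")
}
```

Both tyrosine codons map to histidine under this substitution, which is consistent with the protein-level description p.Tyr98His.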