Digital Library

970 results for DNA sequence analysis

Identification of the dominant translation start site in the attB1 sequence of the pET-DEST42 Gateway vector

Relevance: 90.00%

Abstract:

Gateway technology is a powerful system for converting a single entry vector into a wide variety of expression vectors. We expressed recombinant influenza matrix protein M1 (FMP), a potent antigen for cytotoxic T cells, using the Gateway vector pET-DEST42 containing the FMP cDNA, and purified the expressed FMP as a single 32 kDa recombinant protein. N-terminal and internal protein sequencing, however, showed that the recombinant FMP contained an extra 10 amino acids fused to the N-terminus of native FMP. Further investigation of the DNA sequence adjacent to the 5'-FMP cDNA indicated that the TTG in the attB1 site (30 bp upstream of the ATG in the 5'-FMP cDNA) behaved as a dominant translation start site, resulting in a 10 amino acid extension of the recombinant FMP. Thus, it is possible that recombinant proteins produced by this Gateway vector contain unexpected vector-derived peptides, which may affect experimental outcomes. (c) 2006 Elsevier Inc. All rights reserved.
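The arithmetic behind this finding can be illustrated with a short sketch: an in-frame start codon 30 bp upstream of the native ATG adds 30 / 3 = 10 residues. The function names and the toy sequence below are hypothetical illustrations, not the real attB1 sequence or the authors' analysis; the set of alternative start codons (ATG, GTG, TTG) is the usual bacterial one.

```python
# Minimal sketch: find in-frame, start-like codons upstream of a native ATG
# and report how many residues each would add to the N-terminus.
START_CODONS = {"ATG", "GTG", "TTG"}  # start codons used in bacteria
STOP_CODONS = {"TAA", "TAG", "TGA"}


def n_terminal_extension(offset_bp):
    """Residues added when translation starts offset_bp upstream of the ATG."""
    if offset_bp <= 0 or offset_bp % 3 != 0:
        raise ValueError("upstream start must be in frame with the native ATG")
    return offset_bp // 3


def upstream_in_frame_starts(seq, atg_index):
    """Walk upstream codon by codon until an in-frame stop codon is reached."""
    seq = seq.upper()
    hits = []
    pos = atg_index - 3
    while pos >= 0:
        codon = seq[pos:pos + 3]
        if codon in STOP_CODONS:
            break  # translation from further upstream would terminate here
        if codon in START_CODONS:
            hits.append((pos, codon, n_terminal_extension(atg_index - pos)))
        pos -= 3
    return hits


# Hypothetical toy sequence: TTG, then 9 filler codons, then the native ATG.
toy = "TTG" + "GCT" * 9 + "ATGAGC"
print(upstream_in_frame_starts(toy, atg_index=30))  # [(0, 'TTG', 10)]
```

A scan like this only flags candidate start sites; whether one is actually used depends on context such as the ribosome-binding site, which is why the abstract relies on protein sequencing.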

Predicting the solvent accessibility of transmembrane residues from protein sequence

Relevance: 90.00%

Abstract:

In this study, we propose a novel method to predict the solvent accessible surface areas of transmembrane residues. For both transmembrane alpha-helix and beta-barrel residues, the correlation coefficients between the predicted and observed accessible surface areas are around 0.65. On the basis of predicted accessible surface areas, residues exposed to the lipid environment or buried inside a protein can be identified by using certain cutoff thresholds. We have extensively examined our approach based on different definitions of accessible surface areas and a variety of sets of control parameters. Given that experimentally determining the structures of membrane proteins is very difficult and membrane proteins are actually abundant in nature, our approach is useful for theoretically modeling membrane protein tertiary structures, particularly for modeling the assembly of transmembrane domains. This approach can be used to annotate the membrane proteins in proteomes to provide extra structural and functional information.
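The two evaluation steps mentioned here, correlating predicted with observed accessible surface areas and labelling residues with a cutoff, can be sketched as follows. The cutoff value and the example numbers are assumptions chosen for illustration; the abstract does not state the thresholds or the prediction model used.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


def classify(predicted_asa, cutoff):
    """Label each residue as exposed or buried using a single cutoff."""
    return ["exposed" if asa >= cutoff else "buried" for asa in predicted_asa]


# Hypothetical per-residue accessible surface areas (in square angstroms).
predicted = [12.0, 85.0, 40.0, 5.0, 60.0]
observed = [20.0, 70.0, 55.0, 10.0, 45.0]

print(round(pearson(predicted, observed), 2))
print(classify(predicted, cutoff=40.0))  # cutoff is an assumed value
```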

Sequence variation in the Newcastle disease virus genome

Relevance: 90.00%

Abstract:

Full-length genome sequences of five virulent and five avirulent strains of Newcastle disease virus isolated between 1998 and 2002 in Victoria and New South Wales, Australia, were determined. Comparisons between these strains revealed that the coding sequences of the haemagglutinin-neuraminidase (HN), matrix (M) and phosphoprotein (P) genes appeared to be more variable than those of the fusion (F), nucleocapsid (N) and RNA-dependent RNA replicase (L) genes. Sequence analysis of a number of other isolates made during the recent virulent NDV outbreaks also identified a number of variants with altered F gene cleavage sites, which resulted in altered biological properties of those viruses. Quasispecies analysis of a number of field isolates indicated the presence of virulent virus in one particular isolate. Gene sequence analysis of the progenitor virus isolated in 1998 showed very little sequence variation when compared to that of a progenitor-like virus isolated in 2001, demonstrating that, in the field, viral genome sequence variation appears to be biologically restricted to that of a consensus sequence. (c) 2005 Elsevier B.V. All rights reserved.

Transposon-free regions in mammalian genomes

Relevance: 90.00%

Abstract:

Despite the presence of over 3 million transposons separated on average by ~500 bp, the human and mouse genomes each contain almost 1000 transposon-free regions (TFRs) over 10 kb in length. The majority of human TFRs correlate with orthologous TFRs in the mouse, despite the fact that most transposons are lineage specific. Many human TFRs also overlap with orthologous TFRs in the marsupial opossum, indicating that these regions have remained refractory to transposon insertion for long evolutionary periods. Over 90% of the bases covered by TFRs are noncoding, much of which is not highly conserved. Most TFRs are not associated with unusual nucleotide composition, but are significantly associated with genes encoding developmental regulators, suggesting that they represent extended regions of regulatory information that are largely unable to tolerate insertions, a conclusion difficult to reconcile with current conceptions of gene regulation.
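One plausible way to locate such regions is to scan annotated transposon intervals along a chromosome and keep every gap longer than 10 kb. The sketch below assumes half-open `(start, end)` coordinates and uses made-up intervals; it is an illustration of the idea, not the procedure used in the study.

```python
def transposon_free_regions(transposons, chrom_length, min_length=10_000):
    """Return gaps longer than min_length between transposon annotations.

    transposons: iterable of half-open (start, end) intervals on one chromosome.
    """
    regions = []
    cursor = 0  # end of the furthest-reaching transposon seen so far
    for start, end in sorted(transposons):
        if start - cursor > min_length:
            regions.append((cursor, start))
        cursor = max(cursor, end)  # handles overlapping or nested annotations
    if chrom_length - cursor > min_length:
        regions.append((cursor, chrom_length))
    return regions


# Hypothetical annotations on a 20 kb toy chromosome.
annotations = [(1_000, 1_300), (1_800, 2_100), (15_000, 15_400)]
print(transposon_free_regions(annotations, chrom_length=20_000))
# [(2100, 15000)]
```

Tracking the running maximum end coordinate, rather than the end of the previous interval, keeps the gap calculation correct when annotations overlap.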

Origin of the West Nile virus responsible for an outbreak of encephalitis in the northeastern United States

Relevance: 90.00%

Abstract:

In late summer 1999, an outbreak of human encephalitis occurred in the northeastern United States that was concurrent with extensive mortality in crows (Corvus species) as well as the deaths of several exotic birds at a zoological park in the same area. Complete genome sequencing of a flavivirus isolated from the brain of a dead Chilean flamingo (Phoenicopterus chilensis), together with partial sequence analysis of envelope glycoprotein (E-glycoprotein) genes amplified from several other species including mosquitoes and two fatal human cases, revealed that West Nile (WN) virus circulated in natural transmission cycles and was responsible for the human disease. Antigenic mapping with E-glycoprotein-specific monoclonal antibodies and E-glycoprotein phylogenetic analysis confirmed these viruses as WN. This North American WN virus was most closely related to a WN virus isolated from a dead goose in Israel in 1998.

Identification, mutagenesis, and transcriptional analysis of the methanesulfonate transport operon of Methylosulfonomonas methylovora

Relevance: 90.00%

Abstract:

Recently identified genes located downstream (3') of the msmEF (transport encoding) gene cluster, msmGH, and located 5' of the structural genes for methanesulfonate monooxygenase (MSAMO) are described from Methylosulfonomonas methylovora. Sequence analysis of the derived polypeptide sequences encoded by these genes revealed a high degree of identity to ABC-type transporters. MsmE showed similarity to a putative periplasmic substrate binding protein, MsmF resembled an integral membrane-associated protein, and MsmG was a putative ATP-binding enzyme. MsmH was thought to be the cognate permease component of the sulfonate transport system. The close association of these putative transport genes to the MSAMO structural genes msmABCD suggested a role for these genes in transport of methanesulfonic acid (MSA) into M. methylovora. msmEFGH and msmABCD constituted two operons for the coordinated expression of MSAMO and the MSA transporter systems. Reverse-transcription-PCR analysis of msmABCD and msmEFGH revealed differential expression of these genes during growth on MSA and methanol. The msmEFGH operon was constitutively expressed, whereas MSA induced expression of msmABCD. A mutant defective in msmE had considerably slower growth rates than the wild type, thus supporting the proposed role of MsmE in the transport of MSA into M. methylovora.

The development of a model system and high throughput assay for the study of interactions between zinc finger proteins and DNA

Relevance: 90.00%

Abstract:

It has been recognised for some time that a full code of amino acid-based recognition of DNA sequences would be useful. Several approaches, which utilise small DNA binding motifs called zinc fingers, are presently employed. None of the current approaches successfully combine a combinatorial approach to the elucidation of a code with a single stage high throughput screening assay. The work outlined here describes the development of a model system for the study of DNA protein interactions and the development of a high throughput assay for detection of such interactions. A zinc finger protein was designed which will bind with high affinity and specificity to a known DNA sequence. For future work it is possible to mutate the region of the zinc finger responsible for the specificity of binding, in order to observe the effect on the DNA / protein interactions. The zinc finger protein was initially synthesised as a His tagged product. It was not possible however to develop a high throughput assay using the His tagged zinc finger protein. The gene encoding the zinc finger protein was altered and the protein synthesised as a Glutathione S-Transferase (GST) fusion product. A successful assay was developed using the GST protein and Scintillation Proximity Assay technology (Amersham Pharmacia Biotech). The scintillation proximity assay is a dynamic assay that allows the DNA protein interactions to be studied in "real time". This assay not only provides a high throughput method of screening zinc finger proteins for potential ligands but also allows the effect of addition of reagents or competitor ligands to be monitored.

Discovering properties of new DNA-binding activity of proteins

Relevance: 90.00%

Abstract:

Protein-DNA interactions are an essential feature in the genetic activities of life, and the ability to predict and manipulate such interactions has applications in a wide range of fields. This thesis presents methods for modelling the properties of protein-DNA interactions. In particular, it investigates methods for visualising and predicting the specificity of the DNA-binding Cys2His2 zinc finger interaction. The Cys2His2 zinc finger proteins interact via their individual fingers with base pair subsites on the target DNA. Four key residue positions on the α-helix of the zinc fingers make non-covalent, sequence-specific interactions with the DNA. Mutating these key residues generates combinatorial possibilities that could potentially bind to any DNA segment of interest. Many attempts have been made to predict the binding interaction using structural and chemical information, but with only limited success. The most important contribution of the thesis is that the developed model allows the binding properties of a given protein-DNA pair to be visualised in relation to other protein-DNA combinations without having to explicitly physically model the specific protein molecule and specific DNA sequence. To demonstrate this, various databases were generated, including a synthetic database which includes all possible combinations of the DNA-binding Cys2His2 zinc finger interactions. NeuroScale, a topographic visualisation technique, is exploited to represent the geometric structures of the protein-DNA interactions by measuring dissimilarity between the data points. In order to verify the effect of visualisation on understanding the binding properties of the DNA-binding Cys2His2 zinc finger interaction, various prediction models are constructed using both the original high-dimensional data and the data represented in a low-dimensional feature space. Finally, novel data sets are studied through the selected visualisation models based on the experimental DNA-zinc finger protein database. The result of the NeuroScale projection shows that different dissimilarity representations give distinctive structural groupings, while still clustering in biologically interesting ways. This method can be used to forecast the physicochemical properties of novel proteins, which may be beneficial for therapeutic purposes involving genome targeting in general.

Real-time detection of DNA interactions with long-period fiber-grating-based biosensor

Relevance: 90.00%

Abstract:

Using an optical biosensor based on a dual-peak long-period fiber grating, we have demonstrated the detection of interactions between biomolecules in real time. Silanization of the grating surface was successfully realized for the covalent immobilization of probe DNA, which was subsequently hybridized with the complementary target DNA sequence. Notably, the DNA biosensor was reusable after the hybridized target DNA had been stripped from the grating surface, demonstrating that it can be used multiple times. © 2007 Optical Society of America.

Molecular Mechanisms Involved in Regulating the Mucoid-to-Nonmucoid Conversion of Pseudomonas aeruginosa Associated with Cystic Fibrosis Patients

Relevance: 90.00%

Abstract:

Pseudomonas aeruginosa is an opportunistic pathogen that has received attention because of its close association with cystic fibrosis (CF). Chronic pulmonary infection with the mucoid P. aeruginosa is the leading cause of mortality in CF patients. This bacterium has the ability to sense and adapt to the harsh environment in the CF lung by converting from a nonmucoid to a mucoid state. The mucoid phenotype is caused by overproduction of a polysaccharide called alginate. Alginate production is regulated by the algT/U operon containing five genes, algT/U-mucA-mucB-mucC-mucD. Alginate overproduction in CF isolates has been partially attributed to a loss-of-function mutation in mucA that results in the overexpression of algT. This mucoid phenotype is unstable, reverting to the nonmucoid form when the isolates are cultured outside of the CF lung. This study was undertaken to determine the mechanisms involved in the conversion from the mucoid to the nonmucoid form. Thirty-six spontaneous nonmucoid variants of a known mucoid isolate with a mucA mutation were analyzed. Ten of these isolates were complemented in trans by plasmids containing the algT operon and the algT gene. Chromosomal DNA was extracted and the mucA and algT genes were amplified by the polymerase chain reaction. Sequence analysis of the genes showed that these mutants retained the original mucA mutation but acquired secondary mutations in the algT gene.

Heterologous expression of biosurfactants identified in metagenomic libraries (Expressão heteróloga de biossurfactantes identificados em bibliotecas metagenômicas)

Relevance: 90.00%

Abstract:

Microorganisms have vast genetic diversity and are present throughout the biosphere; however, only about 1% of species can be cultivated by traditional cultivation techniques. Within this diversity there is a huge genetic and biological pool still to be explored. Metagenomics has enabled direct access to microbial genomes derived from environmental samples using cultivation-independent methods. The methodology makes it possible to obtain functional information about proteins, as well as to identify potential products of biotechnological interest and new industrially exploitable biological resources, such as new solutions to environmental impacts. Oil-contaminated areas are characterized by a large accumulation of hydrocarbons, and surfactants may be used for bioremediation. Thus, the metagenomic approach was used in this study to select genes involved in hydrocarbon degradation and emulsification. In previous work, environmental DNA (eDNA) was extracted from soil samples collected from two different areas (Caatinga and Saline River) of Rio Grande do Norte (Brazil), and metagenomic libraries were constructed and functionally analyzed. The clone able to degrade oil was evaluated for its ability to synthesize biosurfactants. Sequence analysis revealed an ORF of 897 bp encoding 298 amino acids, corresponding to a protein of around 34 kDa. A homology search in GenBank revealed sequence similarity with a hypothetical protein from representatives of the Halobacteriaceae family, which were recently shown to be biosurfactant-producing strains. The presence of the inserted coding sequence and the acquired phenotype was confirmed. Primers were designed and the ORF was amplified by PCR. The ORF was subcloned into the pETDuet-1 expression vector for subsequent purification of the protein of interest carrying a histidine tag. The tests performed to confirm biosurfactant activity and hydrocarbon degradation ability showed positive results. The immunodetection test (western blot) using the monoclonal AntiHis® antibody confirmed the presence of the environmental protein. This study was the first to report a possible protein with biosurfactant activity obtained through a metagenomic approach.
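The reported ORF figures are mutually consistent, which a short calculation can confirm: 897 bp is 299 codons, and dropping the stop codon leaves 298 residues. The mass estimate below uses the common rule of thumb of roughly 110 Da per residue, which is an assumption for illustration and not a value from the abstract; an expression tag would add a little to it.

```python
def orf_protein_length(orf_bp, includes_stop=True):
    """Number of amino acids encoded by an ORF of orf_bp base pairs."""
    if orf_bp % 3 != 0:
        raise ValueError("ORF length must be a multiple of 3")
    codons = orf_bp // 3
    return codons - 1 if includes_stop else codons


def approx_mass_kda(n_residues, avg_residue_da=110.0):
    """Rough protein mass, assuming an average residue mass (rule of thumb)."""
    return n_residues * avg_residue_da / 1000.0


length = orf_protein_length(897)
print(length)                              # 298
print(round(approx_mass_kda(length), 1))   # 32.8
```

An estimate of about 32.8 kDa for the untagged polypeptide is in line with the reported value of around 34 kDa.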

New Advancements of Scalable Statistical Methods for Learning Latent Structures in Big Data

Relevance: 90.00%

Abstract:

Constant technology advances have caused data explosion in recent years. Accordingly, modern statistical and machine learning methods must be adapted to deal with complex and heterogeneous data types. This phenomenon is particularly true for analyzing biological data. For example, DNA sequence data can be viewed as categorical variables with each nucleotide taking four different categories. The gene expression data, depending on the quantitative technology, could be continuous numbers or counts. With the advancement of high-throughput technology, the abundance of such data becomes unprecedentedly rich. Therefore, efficient statistical approaches are crucial in this big data era.
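Treating each nucleotide as a categorical variable with four levels is often implemented as a one-hot encoding. The sketch below is one such encoding, with the column order A, C, G, T chosen arbitrarily; it is an illustration and not the representation used in the dissertation.

```python
NUCLEOTIDES = "ACGT"  # column order of the encoding (an arbitrary choice)


def one_hot(sequence):
    """Encode a DNA string as a list of 4-element indicator vectors."""
    encoded = []
    for base in sequence.upper():
        if base not in NUCLEOTIDES:
            raise ValueError(f"unexpected symbol: {base!r}")
        encoded.append([1 if base == n else 0 for n in NUCLEOTIDES])
    return encoded


print(one_hot("ACGT"))
# [[1, 0, 0, 0], [0, 1, 0, 0], [0, 0, 1, 0], [0, 0, 0, 1]]
```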

Previous statistical methods for big data often aim to find low-dimensional structures in the observed data. For example, in a factor analysis model a latent Gaussian-distributed multivariate vector is assumed. With this assumption, a factor model produces a low-rank estimate of the covariance of the observed variables. Another example is the latent Dirichlet allocation model for documents, in which the mixture proportions of topics are assumed to be represented by a Dirichlet-distributed variable. This dissertation proposes several novel extensions to the previous statistical methods that are developed to address challenges in big data. Those novel methods are applied in multiple real-world applications including construction of condition-specific gene co-expression networks, estimating shared topics among newsgroups, analysis of promoter sequences, analysis of political-economics risk data and estimating population structure from genotype data.

The Genetics of Success: How Single-Nucleotide Polymorphisms Associated With Educational Attainment Relate to Life-Course Development.

Relevance: 90.00%

Abstract:

A previous genome-wide association study (GWAS) of more than 100,000 individuals identified molecular-genetic predictors of educational attainment. We undertook in-depth life-course investigation of the polygenic score derived from this GWAS using the four-decade Dunedin Study (N = 918). There were five main findings. First, polygenic scores predicted adult economic outcomes even after accounting for educational attainments. Second, genes and environments were correlated: Children with higher polygenic scores were born into better-off homes. Third, children's polygenic scores predicted their adult outcomes even when analyses accounted for their social-class origins; social-mobility analysis showed that children with higher polygenic scores were more upwardly mobile than children with lower scores. Fourth, polygenic scores predicted behavior across the life course, from early acquisition of speech and reading skills through geographic mobility and mate choice and on to financial planning for retirement. Fifth, polygenic-score associations were mediated by psychological characteristics, including intelligence, self-control, and interpersonal skill. Effect sizes were small. Factors connecting DNA sequence with life outcomes may provide targets for interventions to promote population-wide positive development.
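A polygenic score is conventionally computed as a weighted sum of a person's allele counts, with weights taken from GWAS effect-size estimates. The sketch below shows that general definition; the SNP identifiers, weights, and genotypes are hypothetical, and the study's actual scoring pipeline is not described in the abstract.

```python
def polygenic_score(allele_counts, effect_sizes):
    """Weighted sum of allele counts (0, 1 or 2) over SNPs with known weights."""
    return sum(
        effect_sizes[snp] * count
        for snp, count in allele_counts.items()
        if snp in effect_sizes  # SNPs without a GWAS weight are skipped
    )


# Hypothetical effect sizes and one person's genotype.
weights = {"snp_a": 0.02, "snp_b": -0.01, "snp_c": 0.03}
genotype = {"snp_a": 2, "snp_b": 1, "snp_c": 0, "snp_d": 2}

print(round(polygenic_score(genotype, weights), 4))  # 0.03
```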

Transcriptomic Analysis of the Host Response and Innate Resilience to Enterotoxigenic Escherichia coli Infection in Humans.

Relevance: 90.00%

Abstract:

BACKGROUND: Enterotoxigenic Escherichia coli (ETEC) is a globally prevalent cause of diarrhea. Though usually self-limited, it can be severe and debilitating. Little is known about the host transcriptional response to infection. We report the first gene expression analysis of the human host response to experimental challenge with ETEC. METHODS: We challenged 30 healthy adults with an unattenuated ETEC strain, and collected serial blood samples shortly after inoculation and daily for 8 days. We performed gene expression analysis on whole peripheral blood RNA samples from subjects in whom severe symptoms developed (n = 6) and a subset of those who remained asymptomatic (n = 6) despite shedding. RESULTS: Compared with baseline, symptomatic subjects demonstrated significantly different expression of 406 genes highlighting increased immune response and decreased protein synthesis. Compared with asymptomatic subjects, symptomatic subjects differentially expressed 254 genes primarily associated with immune response. This comparison also revealed 29 genes differentially expressed between groups at baseline, suggesting innate resilience to infection. Drug repositioning analysis identified several drug classes with potential utility in augmenting immune response or mitigating symptoms. CONCLUSIONS: There are statistically significant and biologically plausible differences in host gene expression induced by ETEC infection. Differential baseline expression of some genes may indicate resilience to infection.
