993 resultados para sequence homology


Relevância:

60.00% 60.00%

Publicador:

Resumo:

The occurrence of DNA architectural proteins containing two functional domains derived from two different architectural proteins is an interesting emerging research theme in the field of nucleoid structure and function. Mycobacterium tuberculosis HupB, unlike Escherichia coli HU, is a two-domain protein that, in the N-terminal region, shows broad sequence homology with bacterial HU. The long C-terminal extension, on the other hand, contains seven PAKK/KAAK motifs, which are characteristic of the histone H1/H5 family of proteins. In this article, we describe several aspects of HupB function, in comparison with its truncated derivatives lacking either the C-terminus or N-terminus. We found that HupB binds a variety of DNA repair and replication intermediates with K(d) values in the nanomolar range. By contrast, the N-terminal fragment of M. tuberculosis HupB (HupB(MtbN)) showed diminished DNA-binding activity, with K(d) values in the micromolar range, and the C-terminal domain was completely devoid of DNA-binding activity. Unlike HupB(MtbN), HupB was able to constrain DNA in negative supercoils and introduce negative superhelical turns into relaxed DNA. Similarly, HupB exerted a robust inhibitory effect on DNA strand exchange promoted by cognate and noncognate RecA proteins, whereas HupB(MtbN), even at a 50-fold molar excess, had no inhibitory effect. Considered together, these results suggest that synergy between the N-terminal and C-terminal domains of HupB is essential for its DNA-binding ability, and to modulate the topological features of DNA, which has implications for processes such as DNA compaction, gene regulation, homologous recombination, and DNA repair.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

The fidelity of the folding pathways being encoded in the amino acid sequence is met with challenge in instances where proteins with no sequence homology, performing different functions and no apparent evolutionary linkage, adopt a similar fold. The problem stated otherwise is that a limited fold space is available to a repertoire of diverse sequences. The key question is what factors lead to the formation of a fold from diverse sequences. Here, with the NAD(P)-binding Rossmann fold domains as a case study and using the concepts of network theory, we have unveiled the consensus structural features that drive the formation of this fold. We have proposed a graph theoretic formalism to capture the structural details in terms of the conserved atomic interactions in global milieu, and hence extract the essential topological features from diverse sequences. A unified mathematical representation of the different structures together with a judicious concoction of several network parameters enabled us to probe into the structural features driving the adoption of the NAD(P)-binding Rossmann fold. The atomic interactions at key positions seem to be better conserved in proteins, as compared to the residues participating in these interactions. We propose a ``spatial motif'' and several ``fold specific hot spots'' that form the signature structural blueprints of the NAD(P)-binding Rossmann fold domain. Excellent agreement of our data with previous experimental and theoretical studies validates the robustness and validity of the approach. Additionally, comparison of our results with statistical coupling analysis (SCA) provides further support. The methodology proposed here is general and can be applied to similar problems of interest.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

The evolutionary diversity of the HSP70 gene family at the genetic level has generated complex structural variations leading to altered functional specificity and mode of regulation in different cellular compartments. By utilizing Saccharomyces cerevisiae as a model system for better understanding the global functional cooperativity between Hsp70 paralogs, we have dissected the differences in functional properties at the biochemical level between mitochondrial heat shock protein 70 (mtHsp70) Ssc1 and an uncharacterized Ssc3 paralog. Based on the evolutionary origin of Ssc3 and a high degree of sequence homology with Ssc1, it has been proposed that both have a close functional overlap in the mitochondrial matrix. Surprisingly, our results demonstrate that there is no functional cross-talk between Ssc1 and Ssc3 paralogs. The lack of in vivo functional overlap is due to altered conformation and significant lower stability associated with Ssc3. The substrate-binding domain of Ssc3 showed poor affinity toward mitochondrial client proteins and Tim44 due to the open conformation in ADP-bound state. In addition to that, the nucleotide-binding domain of Ssc3 showed an altered regulation by the Mge1 co-chaperone due to a high degree of conformational plasticity, which strongly promotes aggregation. Besides, Ssc3 possesses a dysfunctional inter-domain interface thus rendering it unable to perform functions similar to generic Hsp70s. Moreover, we have identified the critical amino acid sequence of Ssc1 and Ssc3 that can “make or break” mtHsp70 chaperone function. Together, our analysis provides the first evidence to show that the nucleotide-binding domain of mtHsp70s plays a critical role in determining the functional specificity among paralogs and orthologs across kingdoms.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Distinct structures delineating the introns of Simian Virus 40 T-antigen and Adenovirus 2 E1A genes have been discovered. The structures, which are centered around the branch points of the genes inserted in supercoiled double-stranded plasmids, are specifically targeted through photoactivated strand cleavage by the metal complex tris(4,7-diphenyl-1,10-phenanthroline)rhodium(III). The DNA sites that are recognized lack sequence homology but are similar in demarcating functionally important sites on the RNA level. The single-stranded DNA fragments corresponding to the coding strands of the genes were also found to fold into a structure apparently identical to that in the supercoiled genes based on the recognition by the metal complex. Further investigation of different single-stranded DNA fragments with other structural probes, such as another metal complex bis(1,10-phenanthroline)(phenanthrenequinone diimine)rhodium(III), AMT (4'aminomethyl-4,5',8 trimethylpsoralen), restriction enzyme Mse I, and mung bean nuclease, showed that the structures require the sequ ences at both ends of the intron plus the flanking sequences but not the middle of the intron. The two ends form independent helices which interact with each other to form the global tertiary structures. Both of the intron structures share similarities to the structure of the Holliday junction, which is also known to be specifically targeted by the former metal complex. These structures may have arisen from early RNA intron structures and may have been used to facilitate the evolution of genes through exon shuffling by acting as target sites for recombinase enzymes.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

G protein-coupled receptors (GPCRs) are the largest family of proteins within the human genome. They consist of seven transmembrane (TM) helices, with a N-terminal region of varying length and structure on the extracellular side, and a C-terminus on the intracellular side. GPCRs are involved in transmitting extracellular signals to cells, and as such are crucial drug targets. Designing pharmaceuticals to target GPCRs is greatly aided by full-atom structural information of the proteins. In particular, the TM region of GPCRs is where small molecule ligands (much more bioavailable than peptide ligands) typically bind to the receptors. In recent years nearly thirty distinct GPCR TM regions have been crystallized. However, there are more than 1,000 GPCRs, leaving the vast majority of GPCRs with limited structural information. Additionally, GPCRs are known to exist in a myriad of conformational states in the body, rendering the static x-ray crystal structures an incomplete reflection of GPCR structures. In order to obtain an ensemble of GPCR structures, we have developed the GEnSeMBLE procedure to rapidly sample a large number of variations of GPCR helix rotations and tilts. The lowest energy GEnSeMBLE structures are then docked to small molecule ligands and optimized. The GPCR family consists of five subfamilies with little to no sequence homology between them: class A, B1, B2, C, and Frizzled/Taste2. Almost all of the GPCR crystal structures have been of class A GPCRs, and much is known about their conserved interactions and binding sites. In this work we particularly focus on class B1 GPCRs, and aim to understand that family’s interactions and binding sites both to small molecules and their native peptide ligands. Specifically, we predict the full atom structure and peptide binding site of the glucagon-like peptide receptor and the TM region and small molecule binding sites for eight other class B1 GPCRs: CALRL, CRFR1, GIPR, GLR, PACR, PTH1R, VIPR1, and VIPR2. Our class B1 work reveals multiple conserved interactions across the B1 subfamily as well as a consistent small molecule binding site centrally located in the TM bundle. Both the interactions and the binding sites are distinct from those seen in the more well-characterized class A GPCRs, and as such our work provides a strong starting point for drug design targeting class B1 proteins. We also predict the full structure of CXCR4 bound to a small molecule, a class A GPCR that was not closely related to any of the class A GPCRs at the time of the work.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Part I. The regions of sequence homology and non-homology between the DNA molecules of T2, T4, and T6 have been mapped by the electron microscopic heteroduplex method. The heteroduplex maps have been oriented with respect to the T4 genetic map. They show characteristic, reproducible patterns of substitution and deletion loops. All heteroduplex molecules show more than 85% homology. Some of the loop patterns in T2/T4 heteroduplexes are similar to those in T4/T6.

We find that the rII, the lysozyme and ac genes, the D region, and gene 52 are homologous in T2, T4, and T6. Genes 43 and 47 are probably homologous between T2 and T4. The region of greatest homology is that bearing the late genes. The host range region, which comprises a part of gene 37 and all of gene 38, is heterologous in T2, T4, and T6. The remainder of gene 37 is partially homologous in the T2/T4 heteroduplex (Beckendorf, Kim and Lielausis, 1972) but it is heterologous in T4/T6 and in T2/T6. Some of the tRNA genes are homologous and some are not. The internal protein genes in general seem to be non-homologous.

The molecular lengths of the T-even DNAs are the same within the limit of experimental error; their calculated molecular weights are correspondingly different due to unequal glucosylation. The size of the T2 genome is smaller than that of T4 or T6, but the terminally repetitious region in T2 is larger. There is a length distribution of the terminal repetition for any one phage DNA, indicating a variability in length of the DNA molecules packaged within the phage.

Part II. E. coli cells infected with phage strains carrying extensive deletions encompassing the gene for the phage ser-tRNA are missing the phage tRNAs normally present in wild type infected cells. By DNA-RNA hybridization we have demonstrated that the DNA complementary to the missing tRNAs is also absent in such deletion mutants. Thus the genes for these tRNAs must be clustered in the same region of the genome as the ser-tRNA gene. Physical mapping of several deletions of the ser-tRNA and lysozyme genes, by examination of heteroduplex DNA in the electron microscope, has enabled us to locate the cluster, to define its maximum size, and to order a few of the tRNA genes within it. That such deletions can be isolated indicates that the phage-specific tRNAs from this cluster are dispensable.

Part III. Genes 37 and 38 between closely related phages T2 and T4 have been compared by genetic, biochemical, and hetero-duplex studies. Homologous, partially homologous and non-homologous regions of the gene 37 have been mapped. The host range determinant which interacts with the gene 38 product is identified.

Part IV. A population of double-stranded ØX-RF DNA molecules carrying a deletion of about 9% of the wild-type DNA has been discovered in a sample cultivated under conditions where the phage lysozyme gene is nonessential. The structures of deleted monomers, dimers, and trimers have been studied by the electron microscope heteroduplex method. The dimers and trimers are shown to be head-to-tail repeats of the deleted monomers. Some interesting examples of the dynamical phenomenon of branch migration in vitro have been observed in heteroduplexes of deleted dimer and trimer strands with undeleted wild-type monomer viral strands.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Viral infections remain a serious global health issue. Metagenomic approaches are increasingly used in the detection of novel viral pathogens but also to generate complete genomes of uncultivated viruses. In silico identification of complete viral genomes from sequence data would allow rapid phylogenetic characterization of these new viruses. Often, however, complete viral genomes are not recovered, but rather several distinct contigs derived from a single entity are, some of which have no sequence homology to any known proteins. De novo assembly of single viruses from a metagenome is challenging, not only because of the lack of a reference genome, but also because of intrapopulation variation and uneven or insufficient coverage. Here we explored different assembly algorithms, remote homology searches, genome-specific sequence motifs, k-mer frequency ranking, and coverage profile binning to detect and obtain viral target genomes from metagenomes. All methods were tested on 454-generated sequencing datasets containing three recently described RNA viruses with a relatively large genome which were divergent to previously known viruses from the viral families Rhabdoviridae and Coronaviridae. Depending on specific characteristics of the target virus and the metagenomic community, different assembly and in silico gap closure strategies were successful in obtaining near complete viral genomes.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Trimeresurus stejnegeri venom, which contains TSV-PA (a specific plasminogen activator sharing 60-70% sequence homology with venom fibrinogen-clotting enzymes), also possesses fibrinogen-clotting activity in vitro. A fibrinogen-clotting enzyme (stejnobin) has been purified to homogeneity by gel filtration and ion-exchange chromatography on a Mono-Q column. It is a single-chain glycoprotein with a mol. wt of 44,000. The NH2-terminal amino acid sequence of stejnobin shows great homology with venom fibrinogen-clotting enzymes and TSV-PA. Like TSV-PA, stejnobin was able to hydrolyse several chromogenic substrates. Comparative study of substrate specificities of stejnobin and other venom proteases purified in our laboratory was carried out on five chromogenic substrates. Stejnobin clotted human fibrinogen with a specific activity of 122 NIH thrombin-equivalent units/mg protein. However, stejnobin did not act on other blood coagulation factors, such as factor X, prothrombin and plasminogen. Diisopropyl fluorophosphate and phenylmethanesulfonyl fluoride inhibited its activity, whereas ethylenediamine tetracetic acid had no effect on it, indicating that it is a serine protease. Although stejnobin showed strong immunological cross-reaction with polyclonal antibodies raised against TSV-PA, it was interesting to observe that, unlike the case of TSV-PA, these antibodies did not inhibit the amidolytic and fibrinogen-clotting activities of stejnobin. (C) 1998 Elsevier Science Ltd. All rights reserved.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

In the present study, EA-CATH1 and EA-CATH2 were identified from a constructed lung cDNA library of donkey (Equus asinus) as members of cathelicidin-derived antimicrobial peptides, using a nested PCR-based cloning strategy. Composed of 25 and 26 residues, respectively, EA-CATH1 and EA-CATH2 are smaller than most other cathelicidins and have no sequence homology to other cathelicidins identified to date. Chemically synthesized EA-CATH1 exerted potent antimicrobial activity against most of the 32 strains of bacteria and fungi tested, especially the clinically isolated drug-resistant strains, and minimal inhibitory concentration values against Gram-positive bacteria were mostly in the range of 0.3-2.4 mu g center dot mL-1. EA-CATH1 showed an extraordinary serum stability and no haemolytic activity against human erythrocytes in a dose up to 20 mu g center dot mL-1. CD spectra showed that EA-CATH1 mainly adopts an alpha-helical conformation in a 50% trifluoroethanol/water solution, but a random coil in aqueous solution. Scanning electron microscope observations of Staphylococcus aureus (ATCC2592) treated with EA-CATH1 demonstrated that EA-CATH could cause rapid disruption of the bacterial membrane, and in turn lead to cell lysis. This might explain the much faster killing kinetics of EA-CATH1 than conventional antibiotics revealed by killing kinetics data. In the presence of CaCl2, EA-CATH1 exerted haemagglutination activity, which might potentiate an inhibition against the bacterial polyprotein interaction with the host erythrocyte surface, thereby possibly restricting bacterial colonization and spread.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

In order to identify genes encoding the outer membrane proteins (OMPs) of the myxobacter Flavobacterium columnare G(4), the expression library of the bacterium was screened by using rabbit antisera developed against its OMPs. Positive colonies of Escherichia coli M15 containing fragments encoding the bacterial OMPs were selected for cloning the relevant genes by genomic walking methods. Two genes encoding a membrane-associated zinc metalloprotease and prolyl oligopeptidase are reported in this paper. The membrane-associated zinc metalloprotease gene (map) is 1800 bp in length, coding for 449 amino acids (aa). Despite the presence of a conserved motif HEXXH for all metalloproteases, the special HEXXH similar to 32 aa similar to E motif of the F. columnare G(4) Map and its low level of identity with other reported zinc-containing metalloproteases may imply that the membrane-associated zinc metalloprotease of F. columnare G(4) represents a new family of zincins. The gene encoding prolyl oligopeptidase (Pop), a serine proteinase, is 2352 bp in length, coding for 649 aa. Sequence homology analysis revealed that the Pop is also novel as it has <50% identity with other reported prolyl oligopeptidase family proteins. The present study represents the first to employ anti-fish bacterial OMP sera to screen genes of membrane-associated proteases of fish pathogenic bacteria, and to provide necessary information for the examination of the role of the two genes in the infection and pathogenesis of F. columnare.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

The F1F0 ATP synthase has been purified from the hyperthermophilic eubacterium Aquifex aeolicus and characterized. Its subunits have been identified by MALDI-mass spectrometry through peptide mass fingerprinting and MS/MS. It contains the canonical subunits alpha, beta, gamma, delta and epsilon of F-1 and subunits a and c of F-0. Two versions of the b subunit were found, which show a low sequence homology to each other. Most likely they form a heterodimer. An electron microscopic single particle analysis revealed clear structural details, including two stalks connecting F-1 and F-0. In several orientations the central stalk appears to be tilted and/or kinked. It is unclear whether there is a direct connection between the peripheral stalk and the 6 subunit. (c) 2006 Federation of European Biochemical Societies. Published by Elsevier B.V. All rights reserved.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

BACKGROUND: The rate of emergence of human pathogens is steadily increasing; most of these novel agents originate in wildlife. Bats, remarkably, are the natural reservoirs of many of the most pathogenic viruses in humans. There are two bat genome projects currently underway, a circumstance that promises to speed the discovery host factors important in the coevolution of bats with their viruses. These genomes, however, are not yet assembled and one of them will provide only low coverage, making the inference of most genes of immunological interest error-prone. Many more wildlife genome projects are underway and intend to provide only shallow coverage. RESULTS: We have developed a statistical method for the assembly of gene families from partial genomes. The method takes full advantage of the quality scores generated by base-calling software, incorporating them into a complete probabilistic error model, to overcome the limitation inherent in the inference of gene family members from partial sequence information. We validated the method by inferring the human IFNA genes from the genome trace archives, and used it to infer 61 type-I interferon genes, and single type-II interferon genes in the bats Pteropus vampyrus and Myotis lucifugus. We confirmed our inferences by direct cloning and sequencing of IFNA, IFNB, IFND, and IFNK in P. vampyrus, and by demonstrating transcription of some of the inferred genes by known interferon-inducing stimuli. CONCLUSION: The statistical trace assembler described here provides a reliable method for extracting information from the many available and forthcoming partial or shallow genome sequencing projects, thereby facilitating the study of a wider variety of organisms with ecological and biomedical significance to humans than would otherwise be possible.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

BACKGROUND: There is considerable interest in the development of methods to efficiently identify all coding variants present in large sample sets of humans. There are three approaches possible: whole-genome sequencing, whole-exome sequencing using exon capture methods, and RNA-Seq. While whole-genome sequencing is the most complete, it remains sufficiently expensive that cost effective alternatives are important. RESULTS: Here we provide a systematic exploration of how well RNA-Seq can identify human coding variants by comparing variants identified through high coverage whole-genome sequencing to those identified by high coverage RNA-Seq in the same individual. This comparison allowed us to directly evaluate the sensitivity and specificity of RNA-Seq in identifying coding variants, and to evaluate how key parameters such as the degree of coverage and the expression levels of genes interact to influence performance. We find that although only 40% of exonic variants identified by whole genome sequencing were captured using RNA-Seq; this number rose to 81% when concentrating on genes known to be well-expressed in the source tissue. We also find that a high false positive rate can be problematic when working with RNA-Seq data, especially at higher levels of coverage. CONCLUSIONS: We conclude that as long as a tissue relevant to the trait under study is available and suitable quality control screens are implemented, RNA-Seq is a fast and inexpensive alternative approach for finding coding variants in genes with sufficiently high expression levels.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

BACKGROUND: Microsporidia are obligate intracellular, eukaryotic pathogens that infect a wide range of animals from nematodes to humans, and in some cases, protists. The preponderance of evidence as to the origin of the microsporidia reveals a close relationship with the fungi, either within the kingdom or as a sister group to it. Recent phylogenetic studies and gene order analysis suggest that microsporidia share a particularly close evolutionary relationship with the zygomycetes. METHODOLOGY/PRINCIPAL FINDINGS: Here we expanded this analysis and also examined a putative sex-locus for variability between microsporidian populations. Whole genome inspection reveals a unique syntenic gene pair (RPS9-RPL21) present in the vast majority of fungi and the microsporidians but not in other eukaryotic lineages. Two other unique gene fusions (glutamyl-prolyl tRNA synthetase and ubiquitin-ribosomal subunit S30) that are present in metazoans, choanoflagellates, and filasterean opisthokonts are unfused in the fungi and microsporidians. One locus previously found to be conserved in many microsporidian genomes is similar to the sex locus of zygomycetes in gene order and architecture. Both sex-related and sex loci harbor TPT, HMG, and RNA helicase genes forming a syntenic gene cluster. We sequenced and analyzed the sex-related locus in 11 different Encephalitozoon cuniculi isolates and the sibling species E. intestinalis (3 isolates) and E. hellem (1 isolate). There was no evidence for an idiomorphic sex-related locus in this Encephalitozoon species sample. According to sequence-based phylogenetic analyses, the TPT and RNA helicase genes flanking the HMG genes are paralogous rather than orthologous between zygomycetes and microsporidians. CONCLUSION/SIGNIFICANCE: The unique genomic hallmarks between microsporidia and fungi are independent of sequence based phylogenetic comparisons and further contribute to define the borders of the fungal kingdom and support the classification of microsporidia as unusual derived fungi. And the sex/sex-related loci appear to have been subject to frequent gene conversion and translocations in microsporidia and zygomycetes.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

The mammalian odorant receptor (OR) repertoire is an attractive model to study evolution, because ORs have been subjected to rapid evolution between species, presumably caused by changes of the olfactory system to adapt to the environment. However, functional assessment of ORs in related species remains largely untested. Here we investigated the functional properties of primate and rodent ORs to determine how well evolutionary distance predicts functional characteristics. Using human and mouse ORs with previously identified ligands, we cloned 18 OR orthologs from chimpanzee and rhesus macaque and 17 mouse-rat orthologous pairs that are broadly representative of the OR repertoire. We functionally characterized the in vitro responses of ORs to a wide panel of odors and found similar ligand selectivity but dramatic differences in response magnitude. 87% of human-primate orthologs and 94% of mouse-rat orthologs showed differences in receptor potency (EC50) and/or efficacy (dynamic range) to an individual ligand. Notably dN/dS ratio, an indication of selective pressure during evolution, does not predict functional similarities between orthologs. Additionally, we found that orthologs responded to a common ligand 82% of the time, while human OR paralogs of the same subfamily responded to the common ligand only 33% of the time. Our results suggest that, while OR orthologs tend to show conserved ligand selectivity, their potency and/or efficacy dynamically change during evolution, even in closely related species. These functional changes in orthologs provide a platform for examining how the evolution of ORs can meet species-specific demands.