931 results for Genomic sequence database
Abstract:
VP6, the intermediate capsid protein of the virion, determines the subgroup specificity of rotavirus. It is also the most conserved protein, at both the nucleotide and amino acid levels, among group A rotaviruses and is the target of choice for rotavirus detection. In this study we report the sequence of the subgroup I (SGI)-specific VP6 from the serotype G2 strain IS2, isolated from a child suffering from acute diarrhoea in Bangalore, and its comparison with published VP6 sequences. Interestingly, IS2 gene 6 shared the highest homology with that of the bovine UK strain, and the protein contained lysine substitutions at amino acid positions 97 and 134. In contrast, the amino acids Met and Glu/Asp at these respective positions are highly conserved in all other group A rotaviruses sequenced so far. These observations have obvious implications for the evolution of serotype G2 and G2-like strains circulating in India. The SGI VP6 of a human rotavirus, possessing epitopes that are conformationally similar to those found in the native protein in the virion, was successfully expressed in E. coli and purified for the first time by single-step affinity chromatography.
Abstract:
We report the cloning of the DNA encoding winged bean basic agglutinin (WBA I). Using oligonucleotide primers corresponding to the N- and C-termini of the mature lectin, the complete coding sequence for WBA I could be amplified from genomic DNA. DNA sequence determination by the chain termination method revealed the absence of any intervening sequences in the gene. The DNA-deduced amino acid sequence of WBA I displayed some differences from its primary structure established previously by chemical means. Comparison of the sequence of WBA I with those of other legume lectins highlighted several interesting features, including the existence of the largest specificity-determining loop, which might account for its oligosaccharide-binding specificity, and the presence of an additional N-glycosylation site. These data also throw some light on the relationship between the primary structure of the protein and its probable mode of dimerization.
Abstract:
The time of the large sequencing projects has enabled unprecedented possibilities of investigating more complex aspects of living organisms. Among the high-throughput technologies based on the genomic sequences, the DNA microarrays are widely used for many purposes, including the measurement of the relative quantity of the messenger RNAs. However, the reliability of microarrays has been strongly doubted as robust analysis of the complex microarray output data has been developed only after the technology had already been spread in the community. An objective of this study consisted of increasing the performance of microarrays, and was measured by the successful validation of the results by independent techniques. To this end, emphasis has been given to the possibility of selecting candidate genes with remarkable biological significance within specific experimental design. Along with literature evidence, the re-annotation of the probes and model-based normalization algorithms were found to be beneficial when analyzing Affymetrix GeneChip data. Typically, the analysis of microarrays aims at selecting genes whose expression is significantly different in different conditions followed by grouping them in functional categories, enabling a biological interpretation of the results. Another approach investigates the global differences in the expression of functionally related groups of genes. Here, this technique has been effective in discovering patterns related to temporal changes during infection of human cells. Another aspect explored in this thesis is related to the possibility of combining independent gene expression data for creating a catalog of genes that are selectively expressed in healthy human tissues. Not all the genes present in human cells are active; some involved in basic activities (named housekeeping genes) are expressed ubiquitously. 
Other genes (named tissue-selective genes) provide more specific functions and are expressed preferentially in certain cell types or tissues. Defining the tissue-selective genes is also important, as these genes can cause disease with a phenotype in the tissues where they are expressed. The hypothesis that gene expression could be used as a measure of the relatedness of tissues was also supported. Microarray experiments provide long lists of candidate genes that are often difficult to interpret and prioritize. The power of microarray results can be extended by inferring the relationships among genes under certain conditions. Gene transcription is constantly regulated by the coordinated binding of proteins, named transcription factors, to specific portions of the gene's promoter sequence. In this study, the analysis of promoters from groups of candidate genes has been utilized for predicting gene networks and highlighting modules of transcription factors playing a central role in the regulation of their transcription. Specific modules have been found regulating the expression of genes selectively expressed in the hippocampus, an area of the brain having a central role in Major Depressive Disorder. Similarly, gene networks derived from microarray results have elucidated aspects of the development of the mesencephalon, another region of the brain, which is involved in Parkinson's disease.
Abstract:
Gastric cancer is the fourth most common cancer and the second most common cause of cancer-related death worldwide. Due to lack of early symptoms, gastric cancer is characterized by late stage diagnosis and unsatisfactory options for curative treatment. Several genomic alterations have been identified in gastric cancer, but the major factors contributing to initiation and progression of gastric cancer remain poorly known. Gene copy number alterations play a key role in the development of gastric cancer, and a change in gene copy number is one of the fundamental mechanisms for a cancer cell to control the expression of potential oncogenes and tumor suppressor genes. This thesis aims at clarifying the complex genomic alterations of gastric cancer to identify novel molecular biomarkers for diagnostic purposes as well as for targeted treatment. To highlight genes of potential biological and clinical relevance, we carried out a systematic microarray-based survey of gene expression and copy number levels in primary gastric tumors and gastric cancer cell lines. Results were validated using immunohistochemistry, real-time qRT-PCR, and affinity capture-based transcript (TRAC) assay. Altogether 192 clinical gastric tissue samples and 7 gastric cancer cell lines were included in this study. Multiple chromosomal regions with recurrent copy number alterations were detected. The most frequent chromosomal alterations included gains at 7q, 8q, 17q, 19q, and 20q and losses at 9p, 18q, and 21q. Distinctive patterns of copy number alterations were detected for different histological subtypes (intestinal and diffuse) and for cancers located in different parts of the stomach. The impact of copy number alterations on gene expression was significant, as 6-10% of genes located in the regions of gains and losses also showed concomitant alterations in their expression. 
By combining the information from the DNA- and RNA-level analyses many novel gastric cancer-related genes, such as ALPK2, ENAH, HHIPL2, and OSMR, were identified. Independent genome-wide gene expression analysis of Finnish and Japanese gastric tumors revealed an additional set of genes that was differentially expressed in cancerous gastric tissues compared with normal tissue. Overexpression of one of these genes, CXCL1, was associated with an improved survival of gastric cancer. Thus, using an integrative microarray analysis, several novel genes were identified that may be critically important for gastric carcinogenesis. Further studies of these genes may lead to novel biomarkers for gastric cancer diagnosis and targeted therapy.
Abstract:
The genome sequence of Caloramator mitchellensis strain VF08, a rod-shaped, heterotrophic, strictly anaerobic bacterium isolated from the free-flowing waters of a Great Artesian Basin (GAB) bore well located in Mitchell, an outback Queensland town in Australia, is reported here. Analysis of the 2.42-Mb genome sequence indicates that the attributes of the genome are consistent with its physiological and phenotypic traits.
Abstract:
Extraintestinal pathogenic Escherichia coli (ExPEC) represent a diverse group of E. coli strains that infect extraintestinal sites, such as the urinary tract, the bloodstream, the meninges, the peritoneal cavity, and the lungs. Urinary tract infections (UTIs) caused by uropathogenic E. coli (UPEC), the major subgroup of ExPEC, are among the most prevalent microbial diseases worldwide and a substantial burden for public health care systems. UTIs are responsible for serious morbidity and mortality in the elderly, in young children, and in immune-compromised and hospitalized patients. ExPEC strains differ, from both genetic and clinical perspectives, from commensal E. coli strains belonging to the normal intestinal flora and from intestinal pathogenic E. coli strains causing diarrhea. ExPEC strains are characterized by a broad range of alternate virulence factors, such as adhesins, toxins, and iron accumulation systems. Unlike diarrheagenic E. coli, whose distinctive virulence determinants evoke characteristic diarrheagenic symptoms and signs, ExPEC strains are exceedingly heterogeneous and are known to possess no specific virulence factor or set of factors that is obligatory for the infection of a certain extraintestinal site (e.g. the urinary tract). The ExPEC genomes are highly diverse mosaic structures in permanent flux. These strains have obtained a significant amount of DNA (predictably up to 25% of the genome) through acquisition of foreign DNA from diverse related or non-related donor species by lateral transfer of mobile genetic elements, including pathogenicity islands (PAIs), plasmids, phages, transposons, and insertion elements. The ability of ExPEC strains to cause disease is mainly derived from this horizontally acquired gene pool; the exogenous DNA facilitates rapid adaptation of the pathogen to changing conditions and hence extends the spectrum of sites that can be infected.
However, neither the amount of unique DNA in different ExPEC strains (or UPEC strains) nor the mechanisms behind the observed genomic mobility are known. Due to this extreme heterogeneity of the UPEC and ExPEC populations in general, the routine surveillance of ExPEC is exceedingly difficult. In this project, we presented a novel virulence gene algorithm (VGA) for the estimation of the extraintestinal virulence potential (VP, pathogenicity risk) of clinically relevant ExPECs and fecal E. coli isolates. The VGA was based on a DNA microarray specific for the ExPEC phenotype (ExPEC pathoarray). This array contained 77 DNA probes homologous with known (e.g. adhesion factors, iron accumulation systems, and toxins) and putative (e.g. genes predictably involved in adhesion, iron uptake, or in metabolic functions) ExPEC virulence determinants. In total, 25 DNA probes homologous with known virulence factors and 36 DNA probes representing putative extraintestinal virulence determinants were found at significantly higher frequency in virulent ExPEC isolates than in commensal E. coli strains. We showed that the ExPEC pathoarray and the VGA could be readily used to differentiate highly virulent ExPECs both from less virulent ExPEC clones and from commensal E. coli strains. When the VGA was applied to a group of unknown ExPECs (n=53) and fecal E. coli isolates (n=37), 83% of strains were correctly identified as extraintestinal virulent or commensal E. coli. Conversely, 15% of clinical ExPECs and 19% of fecal E. coli strains failed to cluster into their respective pathogenic and non-pathogenic groups. Clinical data and virulence gene profiles of these strains supported the estimated VPs; UPEC strains with atypically low risk-ratios were largely isolated from patients with a certain medical history, including diabetes mellitus or catheterization, or from elderly patients. In addition, fecal E. coli strains with VPs characteristic of ExPEC were shown to represent the diagnostically important fraction of resident strains of the gut flora with a high potential for causing extraintestinal infections. Interestingly, a large fraction of DNA probes associated with the ExPEC phenotype corresponded to novel DNA sequences without any known function in UTIs and thus represented new genetic markers for extraintestinal virulence. These DNA probes included unknown DNA sequences originating from the genomic subtractions of four clinical ExPEC isolates as well as from five novel cosmid sequences identified in the UPEC strains HE300 and JS299. The characterized cosmid sequences (pJS332, pJS448, pJS666, pJS700, and pJS706) revealed complex modular DNA structures with known and unknown DNA fragments arranged in a puzzle-like manner and integrated into the common E. coli genomic backbone. Furthermore, cosmid pJS332 of the UPEC strain HE300, which carried a chromosomal virulence gene cluster (iroBCDEN) encoding the salmochelin siderophore system, was shown to be part of a transmissible plasmid of Salmonella enterica. Taken together, the results of this project pointed towards two assumptions: (i) homologous recombination, even within coding genes, contributes to the observed mosaicism of ExPEC genomes, and (ii) besides en bloc transfer of large DNA regions (e.g. chromosomal PAIs), rearrangements of small DNA modules also provide a means of genomic plasticity. The data presented in this project supplemented previous whole genome sequencing projects of E. coli and indicated that each E. coli genome displays a unique assemblage of individual mosaic structures, which enable these strains to successfully colonize and infect different anatomical sites.
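The 83% figure in the abstract above can be cross-checked against the stated misclassification rates. The short Python sketch below does that arithmetic. The whole-strain counts are an assumption obtained by rounding 15% of 53 and 19% of 37, since the abstract gives only percentages, and the function name is illustrative.

```python
# Cross-check of the classification rate reported in the abstract.
# Group sizes and misclassification percentages come from the abstract;
# the rounded whole-strain counts are an assumption.

def correctly_classified_fraction(groups):
    """groups: list of (group_size, misclassified_fraction) pairs."""
    total = sum(size for size, _ in groups)
    misclassified = sum(round(size * frac) for size, frac in groups)
    return (total - misclassified) / total

groups = [
    (53, 0.15),  # clinical ExPEC isolates, 15% misclassified
    (37, 0.19),  # fecal E. coli isolates, 19% misclassified
]
accuracy = correctly_classified_fraction(groups)
print(f"{accuracy:.1%}")
```

Under this rounding assumption the two group-level rates are consistent with the overall figure of about 83%.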
Abstract:
Evolutionary genetics incorporates traditional population genetics and studies of the origins of genetic variation by mutation and recombination, and the molecular evolution of genomes. Among the primary forces that have the potential to affect genetic variation within and among populations, including those that may lead to adaptation and speciation, are genetic drift, gene flow, mutation and natural selection. The main challenge in understanding the genetic basis of evolutionary change is to distinguish the adaptive selection forces that cause existing DNA sequence variants and also to identify the nucleotide differences responsible for the observed phenotypic variation. To understand the effects of various forces, interpretation of gene sequence variation has been the principal basis of many evolutionary genetic studies. The main aim of this thesis was to assess different forms of teleost gene sequence polymorphisms in evolutionary genetic studies of Atlantic salmon (Salmo salar) and other species. Firstly, the extent to which Darwinian adaptive evolution affected coding regions of the growth hormone (GH) gene during teleost evolution was investigated based on the sequence data existing in public databases. Secondly, a target gene approach was used to identify within-population variation in the growth hormone 1 (GH1) gene in salmon. Then, a new strategy for single nucleotide polymorphism (SNP) discovery in salmonid fishes was introduced, and, finally, the usefulness of a limited number of SNP markers as molecular tools in several applications of population genetics in Atlantic salmon was assessed. This thesis showed that the gene sequences in databases can be utilized to perform comparative studies of molecular evolution, and some putative evidence of the existence of Darwinian selection during teleost GH evolution was presented.
In addition, existing sequence data were exploited to investigate GH1 gene variation within Atlantic salmon populations throughout the species' range. Purifying selection is suggested to be the predominant evolutionary force controlling the genetic variation of this gene in salmon, and some support for gene flow between continents was also observed. The novel approach to SNP discovery in species with duplicated genome fragments introduced here proved to be an effective method, and this may have several applications in evolutionary genetics with different species, e.g. when developing gene-targeted markers to investigate quantitative genetic variation. The thesis also demonstrated that even a few SNPs gave signals highly similar to those of microsatellite markers in some of the population genetic analyses. This may have useful applications when estimating genetic diversity in genes having a potential role in ecological and conservation issues, or when using difficult biological samples in genetic studies, as SNPs can be applied to relatively highly degraded DNA.
Abstract:
Microbial degradation pathways play a key role in the detoxification and mineralization of polyaromatic hydrocarbons (PAHs), which are widespread pollutants in soil and constituents of petroleum hydrocarbons. In microbiology the aromatic degradation pathways are traditionally studied in single bacterial strains with the capacity to degrade a certain pollutant. In soil the degradation of aromatics is performed by a diverse community of micro-organisms. The aim of this thesis was to study biodegradation on different levels, starting from a versatile aromatic degrader, Sphingobium sp. HV3, and its megaplasmid, extending to the diversity of key catabolic enzymes in the environment, and finally studying birch rhizoremediation in PAH-polluted soil. To understand biodegradation of aromatics at the bacterial species level, the aromatic degradation capacity of Sphingobium sp. HV3 and the role of the plasmid pSKY4 were studied. Toluene, m-xylene, biphenyl, fluorene, and phenanthrene were identified as carbon and energy sources of the HV3 strain. Tn5 transposon mutagenesis linked the degradation capacity for toluene, m-xylene, biphenyl and naphthalene to the pSKY4 plasmid, and qPCR expression analysis showed that the plasmid extradiol dioxygenase genes (bphC and xylE) are induced by phenanthrene, m-xylene and biphenyl, whereas the herbicide 2,4-dichlorophenoxyacetic acid induced the chlorocatechol 1,2-dioxygenase gene (tfdC) from the ortho-pathway. A method to study upper meta-pathway extradiol dioxygenase gene diversity in soil was developed. The extradiol dioxygenases catalyse cleavage of the aromatic ring between a hydroxylated carbon and an adjacent non-hydroxylated carbon (meta-cleavage). A high diversity of extradiol dioxygenases was detected in polluted soils. The detected extradiol dioxygenases showed sequence similarity to known catabolic genes of Alpha-, Beta-, and Gammaproteobacteria.
Five groups of extradiol dioxygenases contained sequences with no close homologues in the database, representing novel genes. In a rhizoremediation experiment with birch (Betula pendula), treatment-specific changes in extradiol dioxygenase communities were shown. PAH pollution changed the bulk soil extradiol dioxygenase community structure, and the birch rhizosphere contained a more diverse extradiol dioxygenase community than the bulk soil, showing a rhizosphere effect. The degradation of pyrene in soil was enhanced with birch seedlings compared to soil without birch. The complete 280,923 bp nucleotide sequence of the pSKY4 plasmid was determined. The open reading frames of pSKY4 were divided into putative conjugative transfer, aromatic degradation, replication/maintenance and transposition/integration function-encoding proteins. Aromatic degradation ORFs shared high similarity with corresponding genes in pNL1, a plasmid from the deep subsurface strain Novosphingobium aromaticivorans F199. The plasmid backbones were considerably more divergent, with lower similarity, which suggests that the aromatic pathway has functioned as a plasmid-independent mobile genetic element. The functional diversity of microbial communities in soil is still largely unknown. Several novel clusters of extradiol dioxygenases representing catabolic bacteria, whose function, biodegradation pathways and phylogenetic position are not known, were amplified with a single primer pair from polluted soils. These extradiol dioxygenase communities were shown to change upon PAH pollution, which indicates that their hosts function in PAH biodegradation in soil. Although the degradation pathways of specific bacterial species are substantially better described than pathways in situ, the evolution of degradation pathways for xenobiotic compounds is largely unknown. The pSKY4 plasmid contains aromatic degradation genes in a putative mobile genetic element, which confers flexibility/instability on the pathway.
The localisation of the aromatic biodegradation pathway in mobile genetic elements suggests that gene transfer and rearrangements are a competitive advantage for Sphingomonas bacteria in the environment.
Abstract:
Visual pigments of different animal species must have evolved at some stage to match the prevailing light environments, since all visual functions depend on their ability to absorb available photons and transduce the event into a reliable neural signal. There is a large literature on the correlation between light environment and spectral sensitivity across different fish species. However, little work has been done on evolutionary adaptation between separated populations within species. More generally, little is known about the rate of evolutionary adaptation to changing spectral environments. The objective of this thesis is to illuminate the constraints under which the evolutionary tuning of visual pigments works, as evident in scope, tempo, available molecular routes, and signal/noise trade-offs. Aquatic environments offer Nature's own laboratories for research on visual pigment properties, as naturally occurring light environments offer an enormous range of variation in both spectral composition and intensity. The present thesis focuses on the visual pigments that serve dim-light vision in two groups of model species, teleost fishes and mysid crustaceans. The geographical emphasis is on the brackish Baltic Sea area with its well-known postglacial isolation history and its aquatic fauna of both marine and fresh-water origin. The absorbance spectra of the (single) dim-light visual pigment were recorded by microspectrophotometry (MSP) in single rods of 26 fish species and single rhabdoms of 8 opossum shrimp populations of the genus Mysis inhabiting marine, brackish or freshwater environments. Additionally, spectral sensitivity was determined from six Mysis populations by electroretinogram (ERG) recording. The rod opsin gene was sequenced in individuals of four allopatric populations of the sand goby (Pomatoschistus minutus). Rod opsins of two other goby species were investigated as outgroups for comparison.
Rod absorbance spectra of the Baltic subspecies or populations of the primarily marine species herring (Clupea harengus membras), sand goby (P. minutus), and flounder (Platichthys flesus) were long-wavelength-shifted compared to their marine populations. The spectral shifts are consistent with adaptation for improved quantum catch (QC) as well as improved signal-to-noise ratio (SNR) of vision in the Baltic light environment. Since the chromophore of the pigment was pure A1 in all cases, this has apparently been achieved by evolutionary tuning of the opsin visual pigment. By contrast, no opsin-based differences were evident between lake and sea populations of species of fresh-water origin, which can tune their pigment by varying chromophore ratios. A more detailed analysis of differences in absorbance spectra and opsin sequence between and within populations was conducted using the sand goby as model species. Four allopatric populations from the Baltic Sea (B), Swedish west coast (S), English Channel (E), and Adriatic Sea (A) were examined. Rod absorbance spectra, characterized by the wavelength of maximum absorbance (λmax), differed between populations and correlated with differences in the spectral light transmission of the respective water bodies. The greatest λmax shift as well as the greatest opsin sequence difference was between the Baltic and the Adriatic populations. The significant within-population variation of the Baltic λmax values (506-511 nm) was analyzed on the level of individuals and was shown to correlate well with opsin sequence substitutions. The sequences of individuals with λmax at shorter wavelengths were identical to that of the Swedish population, whereas those with λmax at longer wavelengths additionally had substitution F261F/Y in the sixth transmembrane helix of the protein. This substitution (Y261) was also present in the Baltic common gobies and is known to redshift spectra. 
The tuning mechanism of the long-wavelength type Baltic sand gobies is assumed to be the co-expression of F261 and Y261 in all rods to produce ≈ 5 nm redshift. The polymorphism of the Baltic sand goby population possibly indicates ambiguous selection pressures in the Baltic Sea. The visual pigments of all lake populations of the opossum shrimp (Mysis relicta) were red-shifted by 25 nm compared with all Baltic Sea populations. This is calculated to confer a significant advantage in both QC and SNR in many humus-rich lakes with reddish water. Since only A2 chromophore was present, the differences obviously reflect evolutionary tuning of the visual protein, the opsin. The changes have occurred within the ca. 9000 years that the lakes have been isolated from the Sea after the most recent glaciation. At present, it seems that the mechanism explaining the spectral differences between lake and sea populations is not an amino acid substitution at any other conventional tuning site, but the mechanism is yet to be found.
Abstract:
Sequence-specific interaction between DNA and protein molecules has been a subject of active investigation for decades. Here, we have chosen single-promoter-containing bacteriophage Delta D-III T7 DNA and Escherichia coli RNA polymerase and followed their recognition at the air-water interface by using the surface plasmon resonance (SPR) technique, where the movement of one of the reacting species is restricted by arraying it on an immobilized support. For the Langmuir monolayer studies, we used an RNA polymerase with a histidine tag attached to one of its subunits, thus making it an excellent substrate for Ni(II) ions, while the SPR studies were done using biotin-labeled DNA immobilized on a streptavidin-coated chip. Detailed analysis of the thermodynamic parameters as a function of concentration and temperature revealed that the interaction of RNA polymerase with T7 DNA is largely entropy driven (83 (+/- 12) kcal mol(-1)) with a positive enthalpy of 13.6 (+/- 3.6) kcal mol(-1). The free energy of reaction determined by SPR and the Langmuir-Blodgett technique was -11 (+/- 2) and -15.6 kcal mol(-1), respectively. The ability of these methods to retain the specificity of the recognition process was also established.
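The thermodynamic values quoted above can be related through the Gibbs relation, ΔG = ΔH − TΔS. The sketch below is only a plausibility check under two assumptions that the abstract does not state: that the entropy figure of 83 is a ΔS in cal mol(-1) K(-1) rather than kcal mol(-1) as printed, and that the temperature is 298.15 K. The function name is illustrative.

```python
# Gibbs relation: dG = dH - T*dS.
# Assumptions (not stated in the abstract): the entropy value is in
# cal mol^-1 K^-1 and the temperature is 298.15 K.

def gibbs_free_energy(dh_kcal, ds_cal_per_k, temp_k=298.15):
    """Return dG in kcal/mol from dH (kcal/mol) and dS (cal/mol/K)."""
    return dh_kcal - temp_k * ds_cal_per_k / 1000.0

dg = gibbs_free_energy(dh_kcal=13.6, ds_cal_per_k=83.0)
print(f"dG = {dg:.1f} kcal/mol")
```

Under these assumptions the result falls close to the SPR-derived free energy of -11 (+/- 2) kcal mol(-1) quoted in the abstract.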
Abstract:
An isolated wind power generation scheme using slip ring induction machine (SRIM) is proposed. The proposed scheme maintains constant load voltage and frequency irrespective of the wind speed or load variation. The power circuit consists of two back-to-back connected inverters with a common dc link, where one inverter is directly connected to the rotor side of SRIM and the other inverter is connected to the stator side of the SRIM through LC filter. Developing a negative sequence compensation method to ensure that, even under the presence of unbalanced load, the generator experiences almost balanced three-phase current and most of the unbalanced current is directed through the stator side converter is the focus here. The SRIM controller varies the speed of the generator with variation in the wind speed to extract maximum power. The difference of the generated power and the load power is either stored in or extracted from a battery bank, which is interfaced to the common dc link through a multiphase bidirectional fly-back dc-dc converter. The SRIM control scheme, maximum power point extraction algorithm and the fly-back converter topology are incorporated from available literature. The proposed scheme is both simulated and experimentally verified.
Abstract:
CONTEXT: Polyalanine tract variations in transcription factors have been identified for a wide spectrum of developmental disorders. The thyroid transcription factor forkhead factor E1 (FOXE1) contains a polymorphic polyalanine tract with 12-22 alanines. Single-nucleotide polymorphisms (SNP) close to this locus are associated with papillary thyroid cancer (PTC), and a strong linkage disequilibrium block extends across this region. OBJECTIVE: The objective of the study was to assess whether the FOXE1 polyalanine repeat region was associated with PTC and to assess the effect of polyalanine repeat region variants on protein expression, DNA binding, and transcriptional function on FOXE1-responsive promoters. DESIGN: This was a case-control study. SETTING: The study was conducted at a tertiary referral hospital. PATIENTS AND METHODS: The FOXE1 polyalanine repeat region and tag SNP were genotyped in 70 PTC, with a replication in a further 92 PTC, and compared with genotypes in 5767 healthy controls (including 5667 samples from the Wellcome Trust Case Control Consortium). In vitro studies were performed to examine the protein expression, DNA binding, and transcriptional function for FOXE1 variants of different polyalanine tract lengths. RESULTS: All the genotyped SNP were in tight linkage disequilibrium, including the FOXE1 polyalanine repeat region. We confirmed the strong association of rs1867277 with PTC (overall P = 1 × 10(-7), odds ratio 1.84, confidence interval 1.31-2.57). rs1867277 was in tight linkage disequilibrium with the FOXE1 polyalanine repeat region (r(2) = 0.95). FOXE1(16Ala) was associated with PTC with an odds ratio of 2.23 (confidence interval 1.42-3.50; P = 0.0005). Functional studies in vitro showed that FOXE1(16Ala) was transcriptionally impaired compared with FOXE1(14Ala), which was not due to differences in protein expression or DNA binding. CONCLUSIONS: We have confirmed the previous association of FOXE1 with PTC. 
Our data suggest that the coding polyalanine expansion in FOXE1 may be responsible for the observed association between FOXE1 and PTC.
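For readers unfamiliar with how odds ratios and confidence intervals of this kind are obtained from a case-control table, a minimal Python sketch follows. It uses the Wald (log-scale) interval, which is one common choice; the abstract does not say which method the authors used, and the counts below are hypothetical, not data from the study.

```python
import math

def odds_ratio_wald_ci(a, b, c, d, z=1.96):
    """Odds ratio and Wald 95% CI for a 2x2 table.

    a: cases with the variant,    b: cases without it
    c: controls with the variant, d: controls without it
    """
    odds_ratio = (a * d) / (b * c)
    se_log_or = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = math.exp(math.log(odds_ratio) - z * se_log_or)
    upper = math.exp(math.log(odds_ratio) + z * se_log_or)
    return odds_ratio, lower, upper

# Hypothetical counts for illustration only (not from the study).
or_, lo, hi = odds_ratio_wald_ci(a=20, b=10, c=10, d=20)
print(f"OR = {or_:.2f}, 95% CI {lo:.2f}-{hi:.2f}")
```

An interval that excludes 1, as the intervals reported in the abstract do, indicates an association at the corresponding significance level.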
Pi-turns in proteins and peptides: Classification, conformation, occurrence, hydration and sequence.
Abstract:
The i + 5-->i hydrogen bonded turn conformation (pi-turn) with the fifth residue adopting alpha L conformation is frequently found at the C-terminus of helices in proteins and hence is speculated to be a "helix termination signal." An analysis of the occurrence of the i + 5-->i hydrogen bonded turn conformation at any general position in proteins (not specifically at the helix C-terminus), using coordinates of 228 protein crystal structures determined by X-ray crystallography to better than 2.5 Å resolution, is reported in this paper. Of 486 detected pi-turn conformations, 367 have the (i + 4)th residue in alpha L conformation, generally occurring at the C-terminus of alpha-helices, consistent with previous observations. However, a significant number (111) of pi-turn conformations also occur with the (i + 4)th residue in alpha R conformation, generally occurring in alpha-helices as distortions either at the termini or in the middle, a novel finding. These two sets of pi-turn conformations are referred to as pi alpha L-turns and pi alpha R-turns, respectively, depending upon whether the (i + 4)th residue adopts alpha L or alpha R conformation. Four pi-turns, named pi alpha L'-turns, were noticed to be mirror images of pi alpha L-turns, and four more pi-turns, which have the (i + 4)th residue in beta conformation and are denoted pi beta-turns, occur as part of a hairpin bend connecting twisted beta-strands. Consecutive pi-turns occur, but only with pi alpha R-turns. The preference for amino acid residues is different in pi alpha L and pi alpha R-turns. However, both show a preference for Pro after the C-termini. Hydrophilic residues are preferred at positions i + 1, i + 2, and i + 3 of pi alpha L-turns, whereas positions i and i + 5 prefer hydrophobic residues. Residue i + 4 in pi alpha L-turns is mainly Gly and less often Asn. Although pi alpha R-turns generally occur as distortions in helices, their amino acid preference is different from that of helices.
Poor helix formers, such as His, Tyr, and Asn, also were found to be preferred for pi alpha R-turns, whereas good helix former Ala is not preferred. pi-Turns in peptides provide a picture of the pi-turn at atomic resolution. Only nine peptide-based pi-turns are reported so far, and all of them belong to pi alpha L-turn type with an achiral residue in position i + 4. The results are of importance for structure prediction, modeling, and de novo design of proteins.
Abstract:
The complete sequence of a P4 type VP4 gene from a G2 serotype human rotavirus, IS2, isolated in India has been determined. Although the IS2 VP4 is highly homologous to the other P4 type alleles, it contained acidic amino acid substitutions at several positions that make it acidic, whereas the other P4 type alleles are basic. Moreover, comparative sequence analysis revealed unusual polymorphism in members of the P4 type at amino acid position 393, which is highly conserved in members of other VP4 types. To date, expression of complete VP4 in E. coli has not been achieved. In this study we present successful expression in E. coli of the complete VP4 as well as the VP8* and VP5* cleavage subunits in soluble form as fusion proteins of the maltose-binding protein (MBP) and their purification by single-step affinity chromatography. The hemagglutinating activity exhibited by the recombinant protein was specifically inhibited by the antiserum raised against it. Availability of pure VP4 proteins should facilitate development of polyclonal and monoclonal antibodies (MAbs) for P serotyping of rotaviruses.