212 resultados para homologous sequence
em Chinese Academy of Sciences Institutional Repositories Grid Portal
Resumo:
The expressed sequence tags (EST) has been proved to be a useful tool for discovering and identifying functional genes, especially in some species whose genetic information is unavailable. A total of 180 ESTs have been generated from a cDNA library of gametophytic Gracilaria lemaneiformis in this study. These clones are clustered into 151 groups, among which 8 groups are highly homologous to chloroplast genes and are abundant in the library. After searching for matches in the EST database of red alga, 22 groups are found to match with the registered ESTs of Rhadophyta and 6 with Gracilaria. Searching in the protein database reveal that 73 non-redundant clones have significant similarity to some known sequences, the majority of which are involved in photosynthesis, DNA transcription or translation, and 6, 4 and 3 clones are associated with growth or development, signal transduction and stress or defense response, respectively.
Resumo:
In plants and less-advanced animal species, such as C.elegans, introduction of exogenous double-stranded RNA (dsRNA) into cells would trigger degradation of the mRNA with homologous sequence and interfere with the endogenous gene expression. It might represent an ancient anti-virus response which could prevent the mutation in the genome that was caused by virus infection or mobile DNA elements insertion. This phenomenon was named RNA interference, or RNAi. In this study, RNAi was used to investigate the function of basonuclin gene during oogenesis. Microinjection of dsRNA directed towards basonuclin into mouse germinal-vesicle-intact (GV) oocytes brought down the abundance of the cognate mRNA effectively in a time- and concentration-dependent manner. This reduction effect was sequence-specific and showed no negative effect on other non-homologous gene expression in oocytes, which indicated that dsRNA can recognize and cause the degradation of the transcriptional products of endogenous basonuclin gene in a sequence-specific manner. Immunofluorescence results showed that RNAi could reduce the concentration of basonuclin protein to some extent, but the effect was less efficient than the dsRNA targeting towards tPA and cMos which was also expressed in oocytes. This result might be due to the long half life of basonuclin protein in oocytes and the short reaction time which was posed by the limited life span of GV oocytes cultured in vitro. In summary, dsRNA could inhibit the expression of the cognate gene in oocytes at both mRNA and protein levels. The effect was similar to Knock-out technique which was based on homologous recombination. Furthermore, hairpin-style dsRNA targeting basonuclin gene could be produced by transcription from a recombinant plasmid and worked efficiently to deplete the cognate mRNA in oocytes. This finding offered a new way to study the function of basonuclin in the early stage of oogenesis by infection of primordial oocytes with the plasmid expressing hairpin-style basonuclin dsRNA.
Resumo:
The complete mitochondrial genome sequence of the Chinese hook snout carp, Opsariichthys bidens, was newly determined using the long and accurate polymerase chain reaction method. The 16,611-nucleotide mitogenome contains 13 protein-coding genes, two rRNA genes (12S, 16S) 22 tRNA genes, and a noncoding control region. We use these data and homologous sequence data from multiple other ostariophysan fishes in a phylogenetic evaluation to test hypothesis pertaining to codon usage pattern of O. bidens mitochondrial protein genes as well as to re-examine the ostariophysan phylogeny. The mitochondrial genome of O. bidens reveals an alternative pattern of vertebrate mitochondrial evolution. For the mitochondrial protein genes of O. bidens, the most frequently used codon generally ends with either A or C, with C preferred over A for most fourfold degenerate codon families; the relative synonymous codon usage of G-ending codons is greatly elevated in all categories. The codon usage pattern of O. bidens mitochondrial protein genes is remarkably different from the general pattern found previously in the relatively closely 9 related zebrafish and most other vertebrate mitochondria. Nucleotide bias at third codon positions is the main cause of codon bias in the mitochondrial protein genes of O. bidens, as it is biased particularly in favor of C over A. Bayesian analysis of 12 concatenated mitochondrial protein sequences for O. bidens and 46 other teleostean taxa supports the monophyly of Cypriniformes and Otophysi and results in a robust estimate of the otophysan phylogeny. (C) 2007 Published by Elsevier B.V.
Resumo:
The gene targeting technique is a powerful tool for analyzing functions of cloned genes and for generating transgenic animals with site-directed integration of foreign genes. In order to develop this technique in fish, positive-negative selection (PNS) and homologous recombination vectors were constructed, and their expression was examined in fish cells. A vector (pNK) for PNS consists of the neomycin resistance gene (neo) as a positive selectable marker gene and the herpes simplex virus (HSV) thymidine kinase (tk) gene as a negative selectable marker gene. Positive selection with geneticin (G418) of epithelioma papulosum of carp (EPC) cells transfected with linearized pNK vector yielded 350 colonies, while double selection of transfected EPC cells with G418 and gancyclovir (Gc) resulted in nearly complete cell death, demonstrating that the PNS procedure is effective in fish cells. Homologous recombination vectors consist of the Xiphophorus melanoma receptor kinase (X mrk(Y)) gene as homologous sequence in addition to the neo and tk genes. Conditions for homologous recombination vector transfection and drug selection were established. After verification of the feasibility of expression of homologous recombination vectors in EPC cells, the first gene targeting experiments were attempted in the Xiphophorus melanoma cell line, PSM. Positive-negative selection of the targeting vector-transfectants led to a low enrichment in this particular cell line. The reasons for the low enrichment in PSM cells were discussed. (C) 2002 Elsevier Science B.V. All rights reserved.
Resumo:
In protein sequence alignment, residue similarity is usually evaluated by substitution matrix, which scores all possible exchanges of one amino acid with another. Several matrices are widely used in sequence alignment, including PAM matrices derived from homologous sequence and BLOSUM matrices derived from aligned segments of BLOCKS. However, most matrices have not addressed the high-order residue-residue interactions that are vital to the bioproperties of protein.With consideration for the inherent correlation in residue triplet, we present a new scoring scheme for sequence alignment. Protein sequence is treated as overlapping and successive 3-residue segments. Two edge residues of a triplet are clustered into hydrophobic or polar categories, respectively. Protein sequence is then rewritten into triplet sequence with 2 · 20 · 2 = 80 alphabets. Using a traditional approach, we construct a new scoring scheme named TLESUMhp (TripLEt SUbstitution Matrices with hydropobic and polar information) for pairwise substitution of triplets, which characterizes the similarity of residue triplets. The applications of this matrix led to marked improvements in multiple sequence alignment and in searching structurally alike residue segments. The reason for the occurrence of the ‘‘twilight zone,’’ i.e., structure explosion of lowidentity sequences, is also discussed.
Resumo:
The molecular mechanics property is the foundation of many characters of proteins. Based on intramolecular hydrophobic force network, the representative family character underlying a protein’s mechanics property is described by a simple two-letter scheme. The tendency of a sequence to become a member of a protein family is scored according to this mathematical representation. Remote homologs of the WW-domain family could be easily designed using such a mechanistic signature of protein homology. Experimental validation showed that nearly all artificial homologs have the representative folding and bioactivity of their assigned family. Since the molecular mechanics property is the only consideration in this study, the results indicate its possible role in the generation of new members of a protein family during evolution.
Resumo:
The chemokine receptor CCR5 can serve as a coreceptor for M-tropic HIV-1 infection and both M-tropic and T-tropic SIV infection. We sequenced the entire CCR5 gene from 10 nonhuman primates: Pongo pygmaeus, Hylobates leucogenys, Trachypithecus francoisi, Trachypithecus phayrei, Pygathrix nemaeus, Rhinopithecus roxellanae, Rhinopithecus bieti, Rhinopithecus avunculus, Macaca assamensis, and Macaca arctoides. When compared with CCR5 sequences from humans and other primates, our results demonstrate that:(1) nucleotide and amino acid sequences of CCR5 among primates are highly homologous, with variations slightly concentrated on the amino and carboxyl termini; and (2) site Asp13, which is critical for CD4-independent binding of SIV gp120 to Macaca mulatta CCR5, was also present in all other nonhuman primates tested here, suggesting that those nonhuman primate CCR5s might also bind SIV gp120 without the presence of CD4. The topologies of CCR5 gene trees constructed here conflict with the putative opinion that the snub-nosed langurs compose a monophyletic group, suggesting that the CCR5 gene may not be a good genetic marker for low-level phylogenetic analysis. The evolutionary rate of CCR5 was calculated, and our results suggest a slowdown in primates after they diverged from rodents. The synonymous mutation rate of CCR5 in primates is constant, about 1.1 x 10(-9) synonymous mutations per site per year. Comparisons of K-a and K-s suggest that the CCR5 genes have undergone negative or purifying selection. K-a/K-s ratios from cercopithecines and colobines are significantly different, implying that selective pressures have played different roles in the two lineages.
Resumo:
The sequences of the ITS (internal transcribed spacer) and 5.8S rDNA of three cultivated strains of Porphyra haitanensis thalli (NB, PT and ST) were amplified, sequenced and analyzed. In addition, the phylogenic relationships of the sequences identified in this study with those of other Porphyra retrieved from GenBank were evaluated. The results are as follows: the sequences of the ITS and 5.8S rDNA were essentially identical among the three strains. The sequences of ITS l were 331 by to 334 bp, while those of the 5.8S rDNA were 158 by and the sequences of ITS2 ranged from 673 by to 681 bp. The sequences of the ITS had a high level of homology (up to 99.5%) with that of P. haitanensis (DQ662228) retrieved from GenBank, but were only approximately 50% homologous with those of other species of Porphyra. The results obtained when a phylogenetic tree was constructed coincided with the results of the homology analysis. These results suggest that the three cultivated strains of P. haitanensis evolved conservatively and that the ITS showed evolutionary consistency. However, the sequences of the ITS and 5.8S rDNA of different Porphyra species showed great variations. Therefore, the relationship of Porphyra interspecies phyletic evolution could be judged, which provides the proof for Porphyra identification study. However, proper classifications of the subspecies and the populations of Porphyra should be determined through the use of other molecular techniques to determine the genetic variability and rational phylogenetic relationships.
Resumo:
A large number of polymorphic simple sequence repeats (SSRs) or microsatellites are needed to develop a genetic map for shrimp. However, developing an SSR map is very time-consuming, expensive, and most SSRs are not specifically linked to gene loci of immediate interest. We report here on our strategy to develop polymorphic markers using expressed sequence tags (ESTs) by designing primers flanking single or multiple SSRs with three or more repeats. A subtracted cDNA library was prepared using RNA from specific pathogen-free (SPF) Litopenaeus vannamei juveniles (similar to 1 g) collected before (0) and after (48 h) inoculation with the China isolate of white spot syndrome virus (WSSV). A total of 224 clones were sequenced, 194 of which were useful for homology comparisons against annotated genes in NCBI nonredundant (nr) and protein databases, providing 179 sequences encoded by nuclear DNA, 4 mitochondrial DNA, and 11 were similar to portions of WSSV genome. The nuclear sequences clustered in 43 groups, 11 of which were homologous to various ESTs of unknown function, 4 had no homology to any sequence, and 28 showed similarities to known genes of invertebrates and vertebrates, representatives of cellular metabolic processes such as calcium ion balance, cytoskeleton mRNAs, and protein synthesis. A few sequences were homologous to immune system-related (allergens) genes and two were similar to motifs of the sex-lethal gene of Drosophila. A large number of EST sequences were similar to domains of the EF-hand superfamily (Ca2+ binding motif and FRQ protein domain of myosin light chains). Single or multiple SSRs with three or more repeats were found in approximately 61 % of the 179 nuclear sequences. Primer sets were designed from 28 sequences representing 19 known or putative genes and tested for polymorphism (EST-SSR marker) in a small test panel containing 16 individuals. Ten (53%) of the 19 putative or unknown function genes were polymorphic, 4 monomorphic, and 3 either failed to satisfactorily amplify genomic DNA or the allele amplification conditions need to be further optimized. Five polymorphic ESTs were genotyped with the entire reference mapping family, two of them (actin, accession #CX535973 and shrimp allergen arginine kinase, accession #CX535999) did not amplify with all offspring of the IRMF panel suggesting presence of null alleles, and three of them amplified in most of the IRM F offspring and were used for linkage analysis. EF-hand motif of myosin light chain (accession #CX535935) was placed in ShrimpMap's linkage group 7, whereas ribosomal protein S5 (accession #CX535957) and troponin I (accession #CX535976) remained unassigned. Results indicate that (a) a large number of ESTs isolated from this cDNA library are similar to cytoskeleton mRNAs and may reflect a normal pathway of the cellular response after im infection with WSSV, and (b) primers flanking single or multiple SSRs with three or more repeats from shrimp ESTs could be an efficient approach to develop polymorphic markers useful for linkage mapping. Work is underway to map additional SSR-containing ESTs from this and other cDNA libraries as a plausible strategy to increase marker density in ShrimpMap.
Resumo:
Amino acid substitution matrices play an essential role in protein sequence alignment, a fundamental task in bioinformatics. Most widely used matrices, such as PAM matrices derived from homologous sequences and BLOSUM matrices derived from aligned segments of PROSITE, did not integrate conformation information in their construction. There are a few structure-based matrices, which are derived from limited data of structure alignment. Using databases PDB_SELECT and DSSP, we create a database of sequence-conformation blocks which explicitly represent sequence-structure relationship. Members in a block are identical in conformation and are highly similar in sequence. From this block database, we derive a conformation-specific amino acid substitution matrix CBSM60. The matrix shows an improved performance in conformational segment search and homolog detection.
Resumo:
The theory of the loading/unloading response ratio (LURR) was applied to the Jiashi earthquake sequence which occurred at the beginning of 1997 in Xinjiang, and found that, before the earthquakes with relatively high magnitudes In the sequence, the ratio showed anomalies of high values. That is to say, the LURR theory can be applied to the short-term earthquake prediction in some cases, especially in the early period after a strong earthquake, such as the forecasts for some strong earthquakes in the Jiashi sequence.
Resumo:
Here we attempt to characterize protein evolution by residue features which dominate residue substitution in homologous proteins. Evolutionary information contained in residue substitution matrix is abstracted with the method of eigenvalue decomposition. Top eigenvectors in the eigenvalue spectrums are analyzed as function of the level of similarity, i.e. sequence identity (SI) between homologous proteins. It is found that hydrophobicity and volume are two significant residue features conserved in protein evolution. There is a transition point at SI approximate to 45%. Residue hydrophobicity is a feature governing residue substitution as SI >= 45%. Whereas below this SI level, residue volume is a dominant feature. (C) 2007 Elsevier B.V. All rights reserved.
Resumo:
Features of homologous relationship of proteins can provide us a general picture of protein universe, assist protein design and analysis, and further our comprehension of the evolution of organisms. Here we carried Out a Study of the evolution Of protein molecules by investigating homologous relationships among residue segments. The motive was to identify detailed topological features of homologous relationships for short residue segments in the whole protein universe. Based on the data of a large number of non-redundant Proteins, the universe of non-membrane polypeptide was analyzed by considering both residue mutations and structural conservation. By connecting homologous segments with edges, we obtained a homologous relationship network of the whole universe of short residue segments, which we named the graph of polypeptide relationships (GPR). Since the network is extremely complicated for topological transitions, to obtain an in-depth understanding, only subgraphs composed of vital nodes of the GPR were analyzed. Such analysis of vital subgraphs of the GPR revealed a donut-shaped fingerprint. Utilization of this topological feature revealed the switch sites (where the beginning of exposure Of previously hidden "hot spots" of fibril-forming happens, in consequence a further opportunity for protein aggregation is Provided; 188-202) of the conformational conversion of the normal alpha-helix-rich prion protein PrPC to the beta-sheet-rich PrPSc that is thought to be responsible for a group of fatal neurodegenerative diseases, transmissible spongiform encephalopathies. Efforts in analyzing other proteins related to various conformational diseases are also introduced. (C) 2009 Elsevier Ltd. All rights reserved.
Resumo:
As a basic tool of modern biology, sequence alignment can provide us useful information in fold, function, and active site of protein. For many cases, the increased quality of sequence alignment means a better performance. The motivation of present work is to increase ability of the existing scoring scheme/algorithm by considering residue–residue correlations better. Based on a coarse-grained approach, the hydrophobic force between each pair of residues is written out from protein sequence. It results in the construction of an intramolecular hydrophobic force network that describes the whole residue–residue interactions of each protein molecule, and characterizes protein's biological properties in the hydrophobic aspect. A former work has suggested that such network can characterize the top weighted feature regarding hydrophobicity. Moreover, for each homologous protein of a family, the corresponding network shares some common and representative family characters that eventually govern the conservation of biological properties during protein evolution. In present work, we score such family representative characters of a protein by the deviation of its intramolecular hydrophobic force network from that of background. Such score can assist the existing scoring schemes/algorithms, and boost up the ability of multiple sequences alignment, e.g. achieving a prominent increase (50%) in searching the structurally alike residue segments at a low identity level. As the theoretical basis is different, the present scheme can assist most existing algorithms, and improve their efficiency remarkably.
Resumo:
Using an unperturbed scattering theory, the characteristics of H atom photoionization are studied respectively by a linearly- and by a circularly- polarized one-cycle laser pulse sequence. The asymmetry for photoelectrons in two directions opposite to each other is investigated. It is found that the asymmetry degree varies with the carrier-envelope (CE) phase, laser intensity, as well as the kinetic energy of photoelectrons. For the linear polarization, the maximal ionization rate varies with the CE phase, and the asymmetry degree varies with the CE phase in a sine-like pattern. For the circular polarization, the maximal ionization rate keeps constant for various CE phases, but the variation of asymmetry degree is still in a sine-like pattern.