208 resultados para Sequence Motifs
em Chinese Academy of Sciences Institutional Repositories Grid Portal
Resumo:
The genes encoding triosephosphate isomerase (TIM) in three species of Microcystis (M. aeruginosa, M. viridis and M. wesenbergii) were investigated. Reverse transcriptase-polymerase chain reaction indicated that they were transcribed in the cells. Analyses showed that their DNA and deduced amino acid sequences were highly conserved between all the three species, only a single nonsynonymous substitution was seen at position 31, from an Asp in M. aeruginosa and M. viridis to Glu in M. wesenbergii. Sequence alignment of these with 12 other known cyanobacterial TIM sequences showed that all the cyanobacterial TIMs had a very high level of amino acid identity (over 50% between each two). Comparison of the cyanobacterial TIMs with other reported TIMs (from diverse lineages of the three Domains) showed that they possessed common active-site residues and sequence motifs. All cyanobacterial TIMs have two common cysteine residues (Cys127 and Cys176), and the Cys176 is almost cyanobacteria-specific with only one exception in Streptomyces coelicolor. Both secondary structure alignment and comparative modelling of Synechocystis sp. TIM showed that Cys176 was located at the hinge region of the flexible loop-6 and might therefore be critical to the movement of TIM's loop-6, which is important to the function of the enzyme. Thus, the cyanobacterial TIM-specific Cys176 may be a potential site for the discovery of suitable drugs against cyanobacteria, and such drugs may have utility in controlling water blooms due to cyanobacteria.
Resumo:
Since the first intein (Sce VMA) was found in Saccharomydes cerevisiae ATPases gene in 1990, more and more inteins were identified. It is necessary to analyze the new inteins to understand the sequence charateristics of inteins. By searching protein and n
Resumo:
Transcription factor binding sites (TFBS) play key roles in genebior 6.8 wavelet expression and regulation. They are short sequence segments with de¯nite structure and can be recognized by the corresponding transcription factors correctly. From the viewpoint of statistics, the candidates of TFBS should be quite di®erent from the segments that are randomly combined together by nucleotide. This paper proposes a combined statistical model for ¯nding over- represented short sequence segments in di®erent kinds of data set. While the over-represented short sequence segment is described by position weight matrix, the nucleotide distribution at most sites of the segment should be far from the background nucleotide distribution. The central idea of this approach is to search for such kind of signals. This algorithm is tested on 3 data sets, including binding sites data set of cyclic AMP receptor protein in E.coli, PlantProm DB which is a non-redundant collection of proximal promoter sequences from di®erent species, collection of the intergenic sequences of the whole genome of E.Coli. Even though the complexity of these three data sets is quite di®erent, the results show that this model is rather general and sensible.
Resumo:
Hemorrhagic disease, caused by the grass carp reovirus (GCRV), is one of the major diseases of grass carp in China. Little is known about the structure and function of the gene segments of this reovirus. The S10 genome segment of GCRV was cloned and the complete nucleotide sequence is reported here. The S10 is 909 nucleotides long and contains a large open reading frame (ORF) encoding a protein of 276 amino acids with a deduced molecular weight of approximately 29.7 kDa. Comparisons of the deduced amino acid sequence of GCRV S10 with those of other reoviruses revealed no significant homologies. However, GCRV S10 shared a putative zinc-finger sequence and a similar distribution of hydrophilic motifs with the outer capsid proteins encoded by Coho salmon aquareovirus (SCSV) S10, striped bass reovirus (SBRV) S10, and mammalian reovirus (MRV) S4. It was predicted that this segment gene encodes an outer capsid protein.
Resumo:
A large number of polymorphic simple sequence repeats (SSRs) or microsatellites are needed to develop a genetic map for shrimp. However, developing an SSR map is very time-consuming, expensive, and most SSRs are not specifically linked to gene loci of immediate interest. We report here on our strategy to develop polymorphic markers using expressed sequence tags (ESTs) by designing primers flanking single or multiple SSRs with three or more repeats. A subtracted cDNA library was prepared using RNA from specific pathogen-free (SPF) Litopenaeus vannamei juveniles (similar to 1 g) collected before (0) and after (48 h) inoculation with the China isolate of white spot syndrome virus (WSSV). A total of 224 clones were sequenced, 194 of which were useful for homology comparisons against annotated genes in NCBI nonredundant (nr) and protein databases, providing 179 sequences encoded by nuclear DNA, 4 mitochondrial DNA, and 11 were similar to portions of WSSV genome. The nuclear sequences clustered in 43 groups, 11 of which were homologous to various ESTs of unknown function, 4 had no homology to any sequence, and 28 showed similarities to known genes of invertebrates and vertebrates, representatives of cellular metabolic processes such as calcium ion balance, cytoskeleton mRNAs, and protein synthesis. A few sequences were homologous to immune system-related (allergens) genes and two were similar to motifs of the sex-lethal gene of Drosophila. A large number of EST sequences were similar to domains of the EF-hand superfamily (Ca2+ binding motif and FRQ protein domain of myosin light chains). Single or multiple SSRs with three or more repeats were found in approximately 61 % of the 179 nuclear sequences. Primer sets were designed from 28 sequences representing 19 known or putative genes and tested for polymorphism (EST-SSR marker) in a small test panel containing 16 individuals. Ten (53%) of the 19 putative or unknown function genes were polymorphic, 4 monomorphic, and 3 either failed to satisfactorily amplify genomic DNA or the allele amplification conditions need to be further optimized. Five polymorphic ESTs were genotyped with the entire reference mapping family, two of them (actin, accession #CX535973 and shrimp allergen arginine kinase, accession #CX535999) did not amplify with all offspring of the IRMF panel suggesting presence of null alleles, and three of them amplified in most of the IRM F offspring and were used for linkage analysis. EF-hand motif of myosin light chain (accession #CX535935) was placed in ShrimpMap's linkage group 7, whereas ribosomal protein S5 (accession #CX535957) and troponin I (accession #CX535976) remained unassigned. Results indicate that (a) a large number of ESTs isolated from this cDNA library are similar to cytoskeleton mRNAs and may reflect a normal pathway of the cellular response after im infection with WSSV, and (b) primers flanking single or multiple SSRs with three or more repeats from shrimp ESTs could be an efficient approach to develop polymorphic markers useful for linkage mapping. Work is underway to map additional SSR-containing ESTs from this and other cDNA libraries as a plausible strategy to increase marker density in ShrimpMap.
Resumo:
The theory of the loading/unloading response ratio (LURR) was applied to the Jiashi earthquake sequence which occurred at the beginning of 1997 in Xinjiang, and found that, before the earthquakes with relatively high magnitudes In the sequence, the ratio showed anomalies of high values. That is to say, the LURR theory can be applied to the short-term earthquake prediction in some cases, especially in the early period after a strong earthquake, such as the forecasts for some strong earthquakes in the Jiashi sequence.
Resumo:
Using an unperturbed scattering theory, the characteristics of H atom photoionization are studied respectively by a linearly- and by a circularly- polarized one-cycle laser pulse sequence. The asymmetry for photoelectrons in two directions opposite to each other is investigated. It is found that the asymmetry degree varies with the carrier-envelope (CE) phase, laser intensity, as well as the kinetic energy of photoelectrons. For the linear polarization, the maximal ionization rate varies with the CE phase, and the asymmetry degree varies with the CE phase in a sine-like pattern. For the circular polarization, the maximal ionization rate keeps constant for various CE phases, but the variation of asymmetry degree is still in a sine-like pattern.
Resumo:
Our study of a novel technique for adaptive image sequence coding is reported. The number of reference frames and the intervals between them are adjusted to improve the temporal compensability of the input video. The bits are distributed more efficiently on different frame types according to temporal and spatial complexity of the image scene. Experimental results show that this dynamic group-of-picture (GOP) structure coding scheme is not only feasible but also better than the conventional fixed GOP method in terms of perceptual quality and SNR. (C) 1996 Society of Photo-Optical Instrumentation Engineers.