39 results for DATABASES
in Chinese Academy of Sciences Institutional Repositories Grid Portal
Abstract:
Reliable turbulent channel flow databases at several Reynolds numbers have been established by large eddy simulation (LES), with two of them validated by comparison with typical direct numerical simulation (DNS) results. Statistics such as the velocity profile, turbulent intensities and shear stress were obtained, as well as the temporal and spatial structure of turbulent bursts. Based on the available LES databases, conditional sampling methods are used to detect the structures of burst events. A method to determine the grouping parameter from the probability distribution function (pdf) curve of the time separation between ejection events is proposed to avoid errors in the detected results. As a result, the dependence of the average burst period on thresholds is considerably weakened. The average burst-to-bed area ratios are also detected. It is found that the Reynolds number has little effect on the burst period and the burst-to-bed area ratio.
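The grouping step mentioned in this abstract can be illustrated with a minimal Python sketch. It assumes the grouping parameter (here called `tau`, a hypothetical name) is already known; the abstract's method derives it from the pdf of the time separation between ejections, which is not reproduced here. The function names are illustrative only.

```python
def group_ejections(ejection_times, tau):
    """Group ejection times into bursts.

    Consecutive ejections separated by at most tau are assigned to the
    same burst. tau is assumed to be given; this sketch does not estimate it.
    """
    bursts = []
    for t in sorted(ejection_times):
        if bursts and t - bursts[-1][-1] <= tau:
            bursts[-1].append(t)   # close enough: same burst
        else:
            bursts.append([t])     # gap larger than tau: new burst
    return bursts


def mean_burst_period(ejection_times, tau):
    """Average time between the starts of successive bursts (None if < 2 bursts)."""
    starts = [b[0] for b in group_ejections(ejection_times, tau)]
    if len(starts) < 2:
        return None
    return (starts[-1] - starts[0]) / (len(starts) - 1)


if __name__ == "__main__":
    times = [0.0, 0.5, 1.0, 10.0, 10.4, 20.0]
    print(group_ejections(times, tau=1.0))
    print(mean_burst_period(times, tau=1.0))
```

Changing `tau` changes how many ejections are merged into one burst, which is why the choice of grouping parameter affects the detected burst period.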
Abstract:
This paper reports the availability of a database of protein structural domains (DDBASE), an alignment database of homologous proteins (HOMSTRAD) and a database of structurally aligned superfamilies (CAMPASS) on the World Wide Web (WWW). DDBASE contains information on the organization of structural domains and their boundaries; it includes only one representative domain from each of the homologous families. This database has been derived by identifying the presence of structural domains in proteins on the basis of inter-secondary structural distances using the program DIAL [Sowdhamini & Blundell (1995), Protein Sci. 4, 506-520]. The alignment of proteins in superfamilies has been performed on the basis of the structural features and relationships of individual residues using the program COMPARER [Sali & Blundell (1990), J. Mol. Biol. 212, 403-428]. The alignment databases contain information on the conserved structural features in homologous proteins and those belonging to superfamilies. Available data include the sequence alignments in structure-annotated formats and the provision for viewing superposed structures of proteins using a graphical interface. Such information, which is freely accessible on the WWW, should be of value to crystallographers in the comparison of newly determined protein structures with previously identified protein domains or existing families.
Abstract:
One of the most important kinds of queries in Spatial Network Databases (SNDB) supporting location-based services (LBS) is the shortest path query. Given an object in a network, e.g. the location of a car on a road network, and a set of objects of interest, e.g. hotels, gas stations, and cars, the shortest path query returns the shortest path from the query object to the objects of interest. Studies of the shortest path query take two approaches: online processing and preprocessing. Preprocessing studies assume that the objects of interest are static. This paper proposes a shortest path algorithm with a set of index structures to support moving objects. The algorithm transforms a dynamic problem into a static one. In this paper we focus on road networks. However, our algorithms do not use any domain-specific information, and can therefore be applied to any network. The algorithm's complexity is O(klog2 i), whereas that of the traditional Dijkstra algorithm is O((i + k)^2).
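For orientation, the online baseline that this abstract compares against can be sketched as a plain Dijkstra search that stops at the first object of interest reached. This is not the paper's index-based algorithm; the graph layout (a dict of adjacency lists) and all names are assumptions made for illustration.

```python
import heapq


def shortest_path_to_nearest(graph, source, targets):
    """Dijkstra search from source to the nearest node in targets.

    graph maps each node to a list of (neighbor, non-negative weight) pairs.
    Returns (distance, path), or (inf, []) if no target is reachable.
    """
    targets = set(targets)
    dist = {source: 0.0}
    prev = {}
    heap = [(0.0, source)]
    while heap:
        d, u = heapq.heappop(heap)
        if d > dist.get(u, float("inf")):
            continue  # stale heap entry
        if u in targets:
            # Walk predecessor links back to the source.
            path = [u]
            while path[-1] != source:
                path.append(prev[path[-1]])
            return d, path[::-1]
        for v, w in graph.get(u, []):
            nd = d + w
            if nd < dist.get(v, float("inf")):
                dist[v] = nd
                prev[v] = u
                heapq.heappush(heap, (nd, v))
    return float("inf"), []


if __name__ == "__main__":
    roads = {
        "car": [("a", 2), ("b", 5)],
        "a": [("b", 1), ("hotel", 7)],
        "b": [("hotel", 2), ("gas", 6)],
    }
    print(shortest_path_to_nearest(roads, "car", {"hotel", "gas"}))
```

Because the search restarts from scratch for every query, it does not depend on the objects of interest being static, but it also reuses no precomputed structure.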
Abstract:
China Computer Federation (中国计算机学会)
Abstract:
Amino acid substitution matrices play an essential role in protein sequence alignment, a fundamental task in bioinformatics. Most widely used matrices, such as PAM matrices derived from homologous sequences and BLOSUM matrices derived from aligned segments of PROSITE, did not integrate conformation information in their construction. There are a few structure-based matrices, which are derived from limited data of structure alignment. Using databases PDB_SELECT and DSSP, we create a database of sequence-conformation blocks which explicitly represent sequence-structure relationship. Members in a block are identical in conformation and are highly similar in sequence. From this block database, we derive a conformation-specific amino acid substitution matrix CBSM60. The matrix shows an improved performance in conformational segment search and homolog detection.
Abstract:
This paper deals with turbulence behavior in benthal boundary layers by means of large eddy simulation (LES). The flow is modeled by moving an infinite plate in otherwise quiescent water with an oscillatory and a steady velocity component. The oscillatory component simulates the wave effect on the flow. A number of large-scale turbulence databases have been established, from which we have obtained turbulence statistics of the boundary layers, such as Reynolds stress, turbulence intensity, skewness and flatness of turbulence, and the temporal and spatial scales of turbulent bursts. Particular attention is paid to the dependence of those statistics on two nondimensional parameters, namely the Reynolds number and the current-wave velocity ratio, defined as the steady current velocity over the oscillatory velocity amplitude. It is found that the Reynolds stress and turbulence intensity profiles differ from phase to phase and exhibit two types of distribution in an oscillatory cycle. One is monotonic, occurring when the current and wave-induced components are in the same direction, and the other is inflectional, occurring when the current and wave-induced components are in opposite directions. The current component makes the time series of Reynolds stress and turbulence intensity asymmetrical, although the mean velocity series is symmetrical as a sine/cosine function. The skewness and flatness variations suggest that the turbulence distribution is not normal but approaches a normal one as the Reynolds number and the current-wave velocity ratio increase. As for turbulent bursting, the dimensionless period and the mean area of all bursts per unit bed area tend to increase with Reynolds number and current-wave velocity ratio, rather than being constant as in steady channel flows.
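The skewness and flatness statistics referred to in this abstract are the normalized third and fourth central moments; a normal distribution has skewness 0 and flatness 3. A minimal Python sketch follows, assuming a plain list of velocity-fluctuation samples (the function name and the sample data are illustrative, not from the paper).

```python
def skewness_and_flatness(samples):
    """Return (skewness, flatness) from population central moments.

    skewness = m3 / m2**1.5 and flatness = m4 / m2**2, where mk is the
    k-th central moment. Assumes the samples are not all identical.
    """
    n = len(samples)
    mean = sum(samples) / n
    m2 = sum((x - mean) ** 2 for x in samples) / n
    m3 = sum((x - mean) ** 3 for x in samples) / n
    m4 = sum((x - mean) ** 4 for x in samples) / n
    return m3 / m2 ** 1.5, m4 / m2 ** 2


if __name__ == "__main__":
    # Symmetric toy sample: skewness is zero, flatness is well below 3.
    print(skewness_and_flatness([-2.0, -1.0, 0.0, 1.0, 2.0]))
```

Comparing the two returned values with 0 and 3 gives a quick indication of how far a sampled signal departs from a normal distribution.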
Abstract:
Amphibian skin is a rich resource of bioactive peptides such as proline-rich bombesin from the frog Bombina maxima. A novel cDNA clone encoding a precursor protein that comprises proline-rich bombesin and a novel peptide, designated bombestatin, was isolated from a skin cDNA library of B. maxima. The predicted primary structure of the novel peptide is WEVLLNVALIRLELLSCRSSKDQDQKESCGMHSW, in which two cysteines form a disulfide bond. A BLAST search of databases did not detect sequences with significant similarity. Bombestatin possesses dose-dependent contractile activity on rat stomach strips. The differences between cDNAs encoding PR-bombesin plus bombestatin and PR-bombesin alone are due to fragment insertions located in the 3'-coding region and the 3'-untranslated region, respectively. (c) 2005 Elsevier B.V. All rights reserved.
Abstract:
There is increasing evidence that many of the mitochondrial DNA (mtDNA) databases published in the fields of forensic science and molecular anthropology are flawed. An a posteriori phylogenetic analysis of the sequences could help to eliminate most of the errors and thus greatly improve data quality. However, previously published caveats and recommendations along these lines were not yet picked up by all researchers. Here we call for stringent quality control of mtDNA data by haplogroup-directed database comparisons. We take some problematic databases of East Asian mtDNAs, published in the Journal of Forensic Sciences and Forensic Science International, as examples to demonstrate the process of pinpointing obvious errors. Our results show that data sets are not only notoriously plagued by base shifts and artificial recombination but also by lab-specific phantom mutations, especially in the second hypervariable region (HVR-II). (C) 2003 Elsevier Ireland Ltd. All rights reserved.
Abstract:
Superimposed on the activation of the embryonic genome in the preimplantation mouse embryo is the formation of a transcriptionally repressive state during the two-cell stage. This repression appears to be mediated at the level of chromatin structure, because it is reversed by inducing histone hyperacetylation or inhibiting the second round of DNA replication. We report that of more than 200 amplicons analyzed by mRNA differential display, about 45% are repressed between the two-cell and four-cell stages. This repression is scored either as a decrease in amplicon expression that occurs between the two-cell and four-cell stages or by the ability of either trichostatin A (an inhibitor of histone deacetylases) or aphidicolin (an inhibitor of replicative DNA polymerases) to increase the level of amplicon expression. Results of this study also indicate that about 16% of the amplicons analyzed are likely novel genes whose sequences do not correspond to sequences in the current databases, whereas about 20% of the sequences expressed during this transition are likely repetitive sequences. Lastly, inducing histone hyperacetylation in two-cell embryos inhibits cleavage to the four-cell stage. These results suggest that genome activation is global and relatively promiscuous and that a function of the transcriptionally repressive state is to dictate the appropriate profile of gene expression that is compatible with further development.
Abstract:
In recent years, the number of sequenced RNAs has increased, leading to the development of new RNA databases. Predicting RNA structure from multiple alignments is therefore important for understanding RNA function. Since RNA secondary structures are often conserved in evolution, methods that identify covarying sites in an alignment can be essential for discovering structural elements. Structure Logo is a technique based on entropy and mutual information measures for analyzing RNA sequences from an alignment. We propose an efficient Structure Logo approach to analyze conservations and correlations in a set of Cardioviral RNA sequences. The entropy and mutual information content were measured to examine the conservations and correlations, respectively. The conserved secondary structure motifs were predicted on the basis of the conservation and correlation analyses. Our predicted motifs were similar to the ones observed in the viral RNA structure database, and the correlations between bases also corresponded to the secondary structure in the database.
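The two quantities named in this abstract can be illustrated with a short Python sketch: Shannon entropy of a single alignment column (conservation) and mutual information between two columns (covariation). This is a simplified illustration, not the Structure Logo method itself; gap handling and small-sample corrections are omitted, and the toy alignment and function names are assumptions.

```python
import math
from collections import Counter


def column_entropy(column):
    """Shannon entropy (bits) of one alignment column; 0 means fully conserved."""
    n = len(column)
    return -sum((c / n) * math.log2(c / n) for c in Counter(column).values())


def mutual_information(col_i, col_j):
    """Mutual information (bits) between two alignment columns of equal length."""
    n = len(col_i)
    pi, pj = Counter(col_i), Counter(col_j)
    pij = Counter(zip(col_i, col_j))
    return sum(
        (c / n) * math.log2((c / n) / ((pi[a] / n) * (pj[b] / n)))
        for (a, b), c in pij.items()
    )


if __name__ == "__main__":
    # Toy alignment: columns 0 and 2 always form complementary pairs.
    alignment = ["GACU", "CAGU", "AAUU", "UAAU"]
    cols = list(zip(*alignment))
    print(column_entropy(cols[0]), column_entropy(cols[1]))
    print(mutual_information(cols[0], cols[2]))
```

A pair of columns with high mutual information but high individual entropy is the typical signature of a conserved base pair whose bases vary in a compensating way.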
Abstract:
Microsatellites have become the preferred molecular markers for strain selection and genetic breeding in fish. In this study a total of 105 microsatellites were isolated and identified in gibel carp (Carassius auratus gibelio) by microsatellite sequence searches in GenBank and other databases and by screening and sequencing of positive clones from the genomic library enriched for AG and GATA repeats. Moreover, nineteen microsatellites were randomly selected to design locus-specific primer pairs, and these were successfully used to identify and discriminate different cultured strains of gibel carp including strains A, D, L, and F. Three different types of microsatellite pattern were distinguished by the number and length of fragments amplified from the 19 primer pairs, and some microsatellite primer pairs were found to produce different microsatellite patterns among strains and strain-specific fragments. In addition, some duplicated alleles were also detected in two microsatellite patterns. Therefore, the current study provides direct molecular markers to discriminate among different cultured strains for selective breeding and aquaculture practice of gibel carp.
Abstract:
C1q is the first subcomponent of the classical pathway in the complement system and a major link between innate and acquired immunity. The globular (gC1q) domain, similar to that of C1q, is also found in many non-complement C1q-domain-containing (C1qDC) proteins, which have a crystal structure similar to that of the multifunctional tumor necrosis factor (TNF) ligand family and also have diverse functions. In this study, we identified a total of 52 independent gene sequences encoding C1q-domain-containing proteins through comprehensive searches of zebrafish genome, cDNA and EST databases. In comparison with 31 orthologous genes in human and different numbers in other species, a significant selective pressure during vertebrate evolution is suggested. The domain organization of C1qDC proteins mainly includes a leading signal peptide, a collagen-like region of variable length, and a C-terminal C1q domain. There are 11 highly conserved residues within the C1q domain, of which 2 are invariant within the zebrafish gene set. More extensive database searches also revealed homologous C1qDC proteins in other vertebrates, invertebrates and even bacteria, but no homologous sequences encoding C1qDC proteins were found in many species that have a more recent evolutionary history with zebrafish. Therefore, further studies on C1q-domain-containing genes among different species will help us understand the evolutionary mechanism of innate and acquired immunity.
Abstract:
Expressed sequence tags (ESTs) are a source for microsatellite development. In the present study, EST-derived microsatellites (EST-SSRs) were generated and characterized in the common carp (Cyprinus carpio) by data mining from updated public EST databases and by subsequent testing for polymorphism. About 5.5% (555) of 10,088 ESTs contain repeat motifs of various types and lengths, with CA being the most abundant dinucleotide motif. Out of the 60 EST-SSRs for which PCR primers were designed, 25 loci showed polymorphism in a common carp population, with the number of alleles per locus ranging from 3 to 17 (mean 7). The observed (HO) and expected (HE) heterozygosities of these EST-SSRs were 0.13-1.00 and 0.12-0.91, respectively. Six EST-SSR loci deviated significantly from the Hardy-Weinberg equilibrium (HWE) expectation, and the remaining 19 loci were in HWE. Of the 60 primer sets, the rates of polymorphic EST-SSRs were 42% in common carp, 17% in crucian carp (Carassius auratus), and 5% in silver carp (Hypophthalmichthys molitrix). These new EST-SSR markers would provide sufficient polymorphism for population genetic studies and genome mapping of the common carp and its closely related fishes. (c) 2007 Published by Elsevier B.V.
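The heterozygosity figures reported in this abstract can be understood through a minimal Python sketch of how observed and expected heterozygosity are commonly computed at one diploid locus. The abstract does not state which estimator was used; the sketch assumes the simple form HE = 1 - sum(p_i^2) without small-sample correction, and the genotype data and function names are made up for illustration.

```python
from collections import Counter


def observed_heterozygosity(genotypes):
    """Fraction of individuals carrying two different alleles at the locus."""
    return sum(1 for a, b in genotypes if a != b) / len(genotypes)


def expected_heterozygosity(genotypes):
    """1 - sum of squared allele frequencies (no small-sample correction)."""
    alleles = [allele for genotype in genotypes for allele in genotype]
    n = len(alleles)
    return 1.0 - sum((c / n) ** 2 for c in Counter(alleles).values())


if __name__ == "__main__":
    # Hypothetical allele sizes (bp) for four diploid individuals.
    locus = [(150, 150), (150, 154), (154, 158), (150, 158)]
    print(observed_heterozygosity(locus))
    print(expected_heterozygosity(locus))
```

A large gap between the two values at a locus is one informal hint of departure from Hardy-Weinberg expectations, which a formal test would then assess.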
Abstract:
Peptidoglycan recognition protein (PGRP) specifically binds to peptidoglycan and is considered to be one of the pattern recognition proteins in the innate immunity of insects and mammals. Using a database mining approach and RT-PCR, multiple PGRP-like genes have been discovered in fish, including zebrafish Danio rerio, Japanese pufferfish Takifugu rubripes and spotted green pufferfish Tetraodon nigroviridis. They share the common features of PGRPs in arthropods and mammals, containing a conserved PGRP domain. Based on the predicted structures, the identified zebrafish PGRP homologs resemble short and long PGRP members in arthropods and mammals. The identified PGRP genes in T. nigroviridis and Takifugu rubripes resemble the long PGRPs, and short PGRP genes have not been found in the T. nigroviridis and Takifugu rubripes databases. Computer modelling of these molecules revealed the presence of three alpha-helices and five or six beta-strands in all fish PGRPs reported in the present study. The long PGRPs in teleost fish have multiple alternatively spliced forms, and some of the identified spliced variants, e.g., tnPGRP-L3 and tnPGRP-L4 (tn: Tetraodon nigroviridis), exhibited no characters present in the PGRP homolog domain. The coding regions of zfPGRP6 (zf: zebrafish), zfPGRP2-A, zfPGRP2-B and zfPGRP-L contain five exons and four introns; however, the other PGRP-like genes, including zfPGRPSC1a, zfPGRPSC2, tnPGRP-L1, tnPGRP-L2 and frPGRP-L (fr: Takifugu rubripes), contain four exons and three introns. In zebrafish, the long and short PGRP genes identified are located on different chromosomes, and an unknown locus containing another long PGRP-like gene has also been found, demonstrating that multiple PGRP loci may be present in fish.
In zebrafish, the constitutive expression of zfPGRP-L, zfPGRP-6 and zfPGRP-SC during ontogeny from unfertilized eggs to larvae and in different organs of adults, and the inductive expression following stimulation by Flavobacterium columnare, were detected by real-time PCR, but the levels and patterns varied among PGRP genes, implying that different short and long PGRPs may play different roles in the innate immune response. (c) 2007 Elsevier Ltd. All rights reserved.