891 results for sequence database


Relevance:

20.00%

Publisher:

Abstract:

Only a small proportion of the mouse genome is transcribed into mature messenger RNA transcripts. There is an international collaborative effort to identify all full-length mRNA transcripts from the mouse, and to ensure that each is represented in a physical collection of clones. Here we report the manual annotation of 60,770 full-length mouse complementary DNA sequences. These are clustered into 33,409 'transcriptional units', contributing 90.1% of a newly established mouse transcriptome database. Of these transcriptional units, 4,258 are new protein-coding and 11,665 are new non-coding messages, indicating that non-coding RNA is a major component of the transcriptome. 41% of all transcriptional units showed evidence of alternative splicing. In protein-coding transcripts, 79% of splice variations altered the protein product. Whole-transcriptome analyses resulted in the identification of 2,431 sense-antisense pairs. The present work, completely supported by physical clones, provides the most comprehensive survey of a mammalian transcriptome so far, and is a valuable resource for functional genomics.

Relevance:

20.00%

Publisher:

Abstract:

As end-user computing becomes more pervasive, an organization's success increasingly depends on the ability of end-users, usually in managerial positions, to extract appropriate data from both internal and external sources. Many of these data sources include or are derived from the organization's accounting information systems. Managerial end-users with different personal characteristics and approaches are likely to compose queries of differing levels of accuracy when searching the data contained within these accounting information systems. This research investigates how cognitive style elements of personality influence managerial end-user performance in database querying tasks. A laboratory experiment was conducted in which participants generated queries to retrieve information from an accounting information system to satisfy typical information requirements. The experiment investigated the influence of personality on the accuracy of queries of varying degrees of complexity. Relying on the Myers–Briggs personality instrument, results show that perceiving individuals (as opposed to judging individuals) who rely on intuition (as opposed to sensing) composed queries more accurately. As expected, query complexity and academic performance also explain the success of data extraction tasks.

Relevance:

20.00%

Publisher:

Abstract:

OTseeker (Occupational Therapy Systematic Evaluation of Evidence) is a new resource for occupational therapists that has been designed with the principal aim of increasing access to research to support clinical decisions. It contains abstracts of systematic reviews and quality ratings of randomized controlled trials (RCTs) relevant to occupational therapy. It is available, free of charge, at www.otseeker.com. This paper describes the OTseeker database and provides an example of how it may support occupational therapy practice.

Relevance:

20.00%

Publisher:

Abstract:

One of the most important advantages of database systems is that the underlying mathematics is rich enough to specify very complex operations with a small number of statements in the database language. This research covers an aspect of biological informatics, the marriage of information technology and biology, involving the study of real-world phenomena using virtual plants derived from L-systems simulation. L-systems were introduced by Aristid Lindenmayer as a mathematical model of multicellular organisms. Not much consideration has been given to the problem of persistent storage for these simulations. Current procedures for querying data generated by L-systems for scientific experiments, simulations and measurements are also inadequate. To address these problems, this paper presents a generic process for data-modeling tools (L-DBM) that bridges L-systems and database systems. It shows how L-system productions can be generically and automatically represented in database schemas and how a database can be populated from the L-system strings. It further describes the idea of pre-computing recursive structures in the data into derived attributes using compiler generation. A method is supplied to establish a correspondence between biologists' terms and compiler-generated terms in a biologist's computing environment. Once the L-DBM receives a specific set of L-system productions and their declarations, it can generate the specific schema for both simple correspondence terminology and complex recursive-structure data attributes and relationships.
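The idea of storing L-system productions and populating a database from the generated strings can be illustrated with a minimal sketch. It is not the L-DBM tool described in the abstract: the table names (`production`, `module`), the function names, and the use of SQLite are assumptions made for illustration, and the example rules are Lindenmayer's classic algae system.

```python
import sqlite3

def expand(axiom, rules, steps):
    """Rewrite every symbol in parallel with its production, `steps` times."""
    s = axiom
    for _ in range(steps):
        # Symbols without a production are copied unchanged.
        s = "".join(rules.get(ch, ch) for ch in s)
    return s

def store(conn, axiom, rules, steps):
    """Persist the productions and every generated string (hypothetical schema)."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS production ("
        "predecessor TEXT PRIMARY KEY, successor TEXT NOT NULL)"
    )
    conn.execute(
        "CREATE TABLE IF NOT EXISTS module ("
        "generation INTEGER, position INTEGER, symbol TEXT, "
        "PRIMARY KEY (generation, position))"
    )
    conn.executemany("INSERT INTO production VALUES (?, ?)", rules.items())
    for generation in range(steps + 1):
        string = expand(axiom, rules, generation)
        conn.executemany(
            "INSERT INTO module VALUES (?, ?, ?)",
            [(generation, i, ch) for i, ch in enumerate(string)],
        )
    conn.commit()

# Lindenmayer's algae system: A -> AB, B -> A.
ALGAE_RULES = {"A": "AB", "B": "A"}
```

With the strings stored one symbol per row, a question such as "how many A modules exist in generation 4" becomes a single `SELECT COUNT(*)` query rather than a re-run of the simulation.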

Relevance:

20.00%

Publisher:

Abstract:

Using a subtractive hybridisation approach, we enriched for genes likely to play a role in embryonic development of the mammalian face and other structures. This was achieved by subtracting cDNA derived from adult mouse liver from that derived from 10.5 dpc mouse embryonic branchial arches 1 and 2. Random sequencing of clones from the resultant library revealed that a high percentage correspond to genes with a previously established role in embryonic development and disease, while 15% represent novel or uncharacterised genes. Whole mount in situ hybridisation analysis of novel genes revealed that approximately 50% have restricted expression during embryonic development. In addition to expression in branchial arches, these genes showed a range of expression domains commonly including neural tube and somites. Notably, all genes analysed were found to be expressed not only in the branchial arches but also in the developing limb buds, providing support for the hypothesis that development of the limbs and face is likely to involve analogous molecular processes. (C) 2003 Wiley-Liss, Inc.

Relevance:

20.00%

Publisher:

Abstract:

A proportion of melanoma-prone individuals in both familial and non-familial contexts has been shown to carry inactivating mutations in either CDKN2A or, rarely, CDK4. CDKN2A is a complex locus that encodes two unrelated proteins from alternately spliced transcripts that are read in different frames. The alpha transcript (exons 1alpha, 2, and 3) produces the p16INK4A cyclin-dependent kinase inhibitor, while the beta transcript (exons 1beta and 2) is translated as p14ARF, a stabilizing factor of p53 levels through binding to MDM2. Mutations in exon 2 can impair both polypeptides, and insertions and deletions in exons 1alpha, 1beta, and 2 can theoretically generate p16INK4A-p14ARF fusion proteins. No online database currently takes into account all the consequences of these genotypes, a situation compounded by some problematic previous annotations of CDKN2A-related sequences and descriptions of their mutations. As an initiative of the international Melanoma Genetics Consortium, we have therefore established a database of germline variants observed in all loci implicated in familial melanoma susceptibility. Such a comprehensive, publicly accessible database is an essential foundation for research on melanoma susceptibility and its clinical application. Our database serves two types of data as defined by HUGO. The core dataset includes the nucleotide variants on the genomic and transcript levels, amino acid variants, and citation. The ancillary dataset includes keyword description of events at the transcription and translation levels and epidemiological data. The application that handles users' queries was designed in the model-view-controller architecture and was implemented in Java. The object-relational database schema was deduced using functional dependency analysis. We hereby present our first functional prototype of eMelanoBase. The service is accessible via the URL www.wmi.usyd.edu.au:8080/melanoma.html.
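The statement that a shared exon can be read in different frames to give unrelated proteins can be made concrete with a short sketch. The sequence below is a made-up toy string, not CDKN2A sequence, and `translate` is a hypothetical helper that uses the standard genetic code.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna, frame=0):
    """Translate `dna` starting at offset `frame`; trailing partial codons are dropped."""
    codons = (dna[i:i + 3] for i in range(frame, len(dna) - 2, 3))
    return "".join(CODON_TABLE[codon] for codon in codons)

# Toy sequence (an assumption for illustration only).
TOY_EXON = "ATGGCCAAGTGA"
```

Shifting the start by one nucleotide changes every codon, so `translate(TOY_EXON, 0)` and `translate(TOY_EXON, 1)` share no residues. The same effect is why a variant record for an exon read in two frames needs a separate amino acid consequence for each transcript.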

Relevance:

20.00%

Publisher:

Abstract:

A general overview of the protein sequence set for the mouse transcriptome produced during the FANTOM2 sequencing project is presented here. We applied different algorithms to characterize protein sequences derived from a nonredundant representative protein set (RPS) and a variant protein set (VPS) of the mouse transcriptome. The functional characterization and assignment of Gene Ontology terms were done by analysis of the proteome using InterPro. The Superfamily database analyses gave a detailed structural classification according to SCOP and provided additional evidence for the functional characterization of the proteome data. The MDS database analysis revealed new domains that are not present in existing protein domain databases. Thus the transcriptome gives us a unique source of data for the detection of new functional groups. The data obtained for the RPS and VPS sets facilitated the comparison of different patterns of protein expression. A comparison with other existing mouse and human protein sequence sets (e.g., the International Protein Index) demonstrates the common patterns in mammalian proteomes. The analysis of the membrane organization within the transcriptome of multiple eukaryotes provides valuable statistics about the distribution of secretory and transmembrane proteins.

Relevance:

20.00%

Publisher:

Abstract:

We have developed a computational strategy to identify the set of soluble proteins secreted into the extracellular environment of a cell. Within the protein sequences predominantly derived from the RIKEN representative transcript and protein set, we identified 2033 unique soluble proteins that are potentially secreted from the cell. These proteins contain a signal peptide required for entry into the secretory pathway and lack any transmembrane domains or intracellular localization signals. This class of proteins, which we have termed the mouse secretome, included >500 novel proteins and 92 proteins
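The selection criteria stated in this abstract (a signal peptide is present, and there are no transmembrane domains or intracellular localization signals) can be sketched as a simple filter. This is a sketch under stated assumptions, not the authors' pipeline: the `ProteinAnnotation` record and its field names are hypothetical, and the per-protein predictions are assumed to come from upstream tools that are not shown.

```python
from dataclasses import dataclass

@dataclass
class ProteinAnnotation:
    """Hypothetical per-protein predictions produced by upstream tools."""
    protein_id: str
    has_signal_peptide: bool
    transmembrane_domains: int
    has_intracellular_signal: bool  # e.g. a retention or organelle-targeting signal

def is_candidate_secreted(p):
    """Apply the three criteria from the abstract to one protein."""
    return (
        p.has_signal_peptide
        and p.transmembrane_domains == 0
        and not p.has_intracellular_signal
    )

def secretome(proteins):
    """Return the IDs of proteins that pass all three criteria."""
    return [p.protein_id for p in proteins if is_candidate_secreted(p)]
```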

Relevance:

20.00%

Publisher:

Abstract:

Our previous studies have shown that two distinct genotypes of Sindbis (SIN) virus occur in Australia. One of these, the Oriental/Australian type, circulates throughout most of the Australian continent, whereas the recently identified south-west (SW) genetic type appears to be restricted to a distinct geographic region located in the temperate south-west of Australia. We have now determined the complete nucleotide and translated amino acid sequences of a SW isolate of SIN virus (SW6562) and performed comparative analyses with other SIN viruses at the genomic level. The genome of SW6562 is 11,569 nucleotides in length, excluding the cap nucleotide and poly(A) tail. Overall, this virus differs from the prototype SIN virus (strain AR339) by 23% in nucleotide sequence and 12.5% in amino acid sequence. Partial sequences of four regions of the genome of four SW isolates were determined and compared with the corresponding sequences from a number of SIN isolates from different regions of the world. These regions are the non-structural protein (nsP3), the E2 gene, the capsid gene, and the repeated sequence elements (RSE) of the 3'UTR. These comparisons revealed that the SW SIN viruses were more closely related to South African and European strains than to other Australian isolates of SIN virus. Thus the SW genotype of SIN virus may have been introduced into this region of Australia by viremic humans or migratory birds and subsequently evolved independently in the region. The sequence data also revealed that the SW genotype contains a unique deletion in the RSE of the 3'UTR of the genome. Previous studies have shown that deletions in this region of the SIN genome can have significant effects on virus replication in mosquito and avian cells, which may explain the restricted distribution of this genotype of SIN virus.

Relevance:

20.00%

Publisher:

Abstract:

The alternative sigma factor sigB gene is involved in the stress response regulation of Listeria monocytogenes, and contributes towards growth and survival in adverse conditions. This gene was examined to determine if it could be a useful indicator of lineage differentiation, similar to the established method based on ribotyping. The sigB sequence was resolved in four local L. monocytogenes strains, and the phylogenetic relationship among these and a further 21 sigB gene sequences obtained from the GenBank database, from strains of different serotype and lineage including two Listeria innocua strains, was determined. The sigB nucleotide sequences of these 25 Listeria strains were then examined for single nucleotide polymorphic (SNP) sites that could differentiate between the three lineages. Based on nucleotide sequences, L. monocytogenes lineage I/serotype 1/2b and 4b strains clustered together, lineage II/serotype 1/2a and 1/2c strains clustered together, lineage III/serotype 4a and 4c strains clustered together, and L. innocua strains clustered together as an outgroup. SNPs differentiating the three lineages were identified. Individual allele-specific PCR reactions based on these polymorphisms were successful in grouping known and a further 37 local L. monocytogenes isolates into the three lineages. (C) 2003 Elsevier B.V. All rights reserved.
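One way to look for SNP sites that differentiate lineages can be sketched as follows. It is not the authors' procedure: the function name, the input layout (pre-aligned, equal-length sequences grouped by lineage), and the rule that a site counts only if every lineage is internally uniform are assumptions for illustration.

```python
def diagnostic_snps(aligned_by_lineage):
    """Return {position: {lineage: base}} for alignment columns where each
    lineage carries a single base and at least two lineages differ."""
    first_lineage = next(iter(aligned_by_lineage))
    length = len(aligned_by_lineage[first_lineage][0])
    sites = {}
    for pos in range(length):
        alleles = {}
        for lineage, sequences in aligned_by_lineage.items():
            bases = {seq[pos] for seq in sequences}
            if len(bases) != 1:
                break  # polymorphic within a lineage: not diagnostic
            alleles[lineage] = bases.pop()
        else:
            # Reached only if no lineage was internally polymorphic.
            if len(set(alleles.values())) > 1:
                sites[pos] = alleles
    return sites

# Toy alignment (made-up sequences, not sigB data).
TOY_ALIGNMENT = {
    "I": ["ACGTA", "ACGTA"],
    "II": ["ACTTA", "ACTTA"],
    "III": ["GCTTA", "GCTTC"],
}
```

In the toy alignment, position 2 separates lineage I from lineages II and III, and position 0 separates lineage III from the other two, so the pair of sites is enough to assign a sequence to one of the three groups. Position 4 is skipped because lineage III is polymorphic there.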

Relevance:

20.00%

Publisher:

Abstract:

The pathogenesis-related (PR) protein superfamily is widely distributed in the animal, plant, and fungal kingdoms and is implicated in human brain tumor growth and plant pathogenesis. The precise biological activity of PR proteins, however, has remained elusive. Here we report the characterization, cloning and structural homology modeling of Tex31 from the venom duct of Conus textile. Tex31 was isolated to >95% purity by activity-guided fractionation using a para-nitroanilide substrate based on the putative cleavage site residues found in the propeptide precursor of conotoxin TxVIA. Tex31 requires four residues including a leucine N-terminal of the cleavage site for efficient substrate processing. The sequence of Tex31 was determined using two degenerate PCR primers designed from N-terminal and tryptic digest Edman sequences. A BLAST search revealed that Tex31 was a member of the PR protein superfamily and most closely related to the CRISP family of mammalian proteins that have a cysteine-rich C-terminal tail. A homology model constructed from two PR proteins revealed that the likely catalytic residues in Tex31 fall within a structurally conserved domain found in PR proteins. Thus, it is possible that other PR proteins may also be substrate-specific proteases.

Relevance:

20.00%

Publisher:

Abstract:

This article is published online with Open Access and distributed under the terms of the Creative Commons Attribution Non-Commercial License.

Relevance:

20.00%

Publisher: