50 results for patent sequence datasets

in University of Queensland eSpace - Australia


Relevance:

90.00%

Publisher:

Abstract:

Bellerophon is a program for detecting chimeric sequences in multiple sequence datasets by an adaption of partial treeing analysis. Bellerophon was specifically developed to detect 16S rRNA gene chimeras in PCR-clone libraries of environmental samples but can be applied to other nucleotide sequence alignments.

Relevance:

30.00%

Publisher:

Abstract:

With rapid advances in video processing technologies and ever-faster growth in network bandwidth, the popularity of video content publishing and sharing has made similarity search an indispensable operation for retrieving videos of interest to users. Video similarity is usually measured by the percentage of similar frames shared by two video sequences, and each frame is typically represented as a high-dimensional feature vector. Unfortunately, the high complexity of video content poses the following major challenges for fast retrieval: (a) effective and compact video representations, (b) efficient similarity measurements, and (c) efficient indexing on the compact representations. In this paper, we propose a number of methods to achieve fast similarity search for very large video databases. First, each video sequence is summarized into a small number of clusters, each of which contains similar frames and is represented by a novel compact model called Video Triplet (ViTri). ViTri models a cluster as a tightly bounded hypersphere described by its position, radius, and density. The ViTri similarity is measured by the volume of intersection between two hyperspheres multiplied by the minimal density, i.e., the estimated number of similar frames shared by two clusters. The total number of similar frames is then estimated to derive the overall similarity between two video sequences. Hence the time complexity of the video similarity measure can be reduced greatly. To further reduce the number of similarity computations on ViTris, we introduce a new one-dimensional transformation technique which rotates and shifts the original axis system using PCA in such a way that the original inter-distance between two high-dimensional vectors can be maximally retained after mapping. An efficient B+-tree is then built on the transformed one-dimensional values of ViTris' positions. Such a transformation enables the B+-tree to achieve its optimal performance by quickly filtering a large portion of non-similar ViTris. Our extensive experiments on real large video datasets prove the effectiveness of our proposals, which outperform existing methods significantly.
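The baseline measure mentioned in this abstract, the percentage of similar frames shared by two video sequences, can be illustrated with a minimal sketch. This is not the ViTri method itself: it is the brute-force frame-by-frame comparison that ViTri is designed to avoid. The function name `video_similarity`, the Euclidean distance, and the threshold `eps` are assumptions made for illustration only.

```python
import math

def video_similarity(frames_x, frames_y, eps):
    """Fraction of frames (over both videos) that have a near neighbour
    in the other video. Frames are equal-length feature vectors.
    Assumption: two frames are 'similar' if their Euclidean distance <= eps."""
    def count_matched(a, b):
        # Count frames of a that lie within eps of at least one frame of b.
        return sum(1 for f in a if any(math.dist(f, g) <= eps for g in b))

    total = len(frames_x) + len(frames_y)
    if total == 0:
        return 0.0
    matched = count_matched(frames_x, frames_y) + count_matched(frames_y, frames_x)
    return matched / total

# Toy 2-D "feature vectors"; real frame features are high-dimensional.
x = [(0.0, 0.0), (1.0, 0.0), (5.0, 5.0)]
y = [(0.0, 0.1), (9.0, 9.0)]
score = video_similarity(x, y, eps=0.5)
```

The cost of this direct comparison grows with the product of the two frame counts, which is the motivation the abstract gives for summarizing each video into a few clusters first.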

Relevance:

20.00%

Publisher:

Abstract:

Genetic recombination can produce heterogeneous phylogenetic histories within a set of homologous genes. Delineating recombination events is important in the study of molecular evolution, as inference of such events provides a clearer picture of the phylogenetic relationships among different gene sequences or genomes. Nevertheless, detecting recombination events can be a daunting task, as the performance of different recombination-detecting approaches can vary, depending on evolutionary events that take place after recombination. We recently evaluated the effects of post-recombination events on the prediction accuracy of recombination-detecting approaches using simulated nucleotide sequence data. The main conclusion, supported by other studies, is that one should not depend on a single method when searching for recombination events. In this paper, we introduce a two-phase strategy, applying three statistical measures to detect the occurrence of recombination events, and a Bayesian phylogenetic approach in delineating breakpoints of such events in nucleotide sequences. We evaluate the performance of these approaches using simulated data, and demonstrate the applicability of this strategy to empirical data. The two-phase strategy proves to be time-efficient when applied to large datasets, and yields high-confidence results.

Relevance:

20.00%

Publisher:

Abstract:

Despite many successes of conventional DNA sequencing methods, some DNAs remain difficult or impossible to sequence. Unsequenceable regions occur in the genomes of many biologically important organisms, including the human genome. Such regions range in length from tens to millions of bases, and may contain valuable information such as the sequences of important genes. The authors have recently developed a technique that renders a wide range of problematic DNAs amenable to sequencing. The technique is known as sequence analysis via mutagenesis (SAM). This paper presents a number of algorithms for analysing and interpreting data generated by this technique.

Relevance:

20.00%

Publisher:

Abstract:

Despite the success of conventional Sanger sequencing, significant regions of many genomes still present major obstacles to sequencing. Here we propose a novel approach with the potential to alleviate a wide range of sequencing difficulties. The technique involves extracting target DNA sequence from variants generated by introduction of random mutations. The introduction of mutations does not destroy original sequence information, but distributes it amongst multiple variants. Some of these variants lack problematic features of the target and are more amenable to conventional sequencing. The technique has been successfully demonstrated with mutation levels up to an average 18% base substitution and has been used to read previously intractable poly(A), AT-rich and GC-rich motifs.
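The idea that the original sequence information is distributed amongst multiple variants can be illustrated with a small sketch. It assumes the variants are already aligned, have equal length, and carry only substitutions, and it recovers the target by a simple per-position majority vote. This is an illustrative simplification, not the authors' reconstruction algorithm; the function name `consensus` is hypothetical.

```python
from collections import Counter

def consensus(variants):
    """Majority-vote consensus over pre-aligned, equal-length variants.
    Assumption: mutations are random substitutions, so at each position
    the original base is the most frequent one across variants."""
    if not variants:
        return ""
    length = len(variants[0])
    if any(len(v) != length for v in variants):
        raise ValueError("variants must be aligned to equal length")
    # zip(*variants) yields one column of bases per alignment position.
    return "".join(Counter(col).most_common(1)[0][0] for col in zip(*variants))

# Five variants of the hypothetical target "ACGTAC", one substitution each.
variants = ["TCGTAC", "ACCTAC", "ACGTAG", "ACGAAC", "AGGTAC"]
recovered = consensus(variants)
```

With random substitutions at a moderate rate, each position is usually unmutated in most variants, so the vote tends to recover the original base once enough variants are read.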

Relevance:

20.00%

Publisher:

Abstract:

Genetic recombination can produce heterogeneous phylogenetic histories within a set of homologous genes. Delineating recombination events is important in the study of molecular evolution, as inference of such events provides a clearer picture of the phylogenetic relationships among different gene sequences or genomes. Nevertheless, detecting recombination events can be a daunting task, as the performance of different recombination-detecting approaches can vary, depending on evolutionary events that take place after recombination. We previously evaluated the effects of post-recombination events on the prediction accuracy of recombination-detecting approaches using simulated nucleotide sequence data. The main conclusion, supported by other studies, is that one should not depend on a single method when searching for recombination events. In this paper, we introduce a two-phase strategy, applying three statistical measures to detect the occurrence of recombination events, and a Bayesian phylogenetic approach to delineate breakpoints of such events in nucleotide sequences. We evaluate the performance of these approaches using simulated data, and demonstrate the applicability of this strategy to empirical data. The two-phase strategy proves to be time-efficient when applied to large datasets, and yields high-confidence results.

Relevance:

20.00%

Publisher:

Abstract:

We sequenced cDNAs coding for chicken cellular nucleic acid binding protein (CNBP). Two slightly different variations of the open reading frame were found, each of which translates into a protein with seven zinc finger domains. The longest transcript contains an in-frame insert of 3 bp. The sequence conservation between chick CNBP cDNAs and human, rat and mouse CNBP cDNAs is extreme, especially in the coding region, where the deduced amino acid sequence identity with human, rat and mouse CNBP is 99%. CNBP-like transcripts were also found in various tissues from insect, shrimp, fish and lizard. Regions with remarkable nucleotide conservation were also found in the 3' untranslated region, indicating important functions for these regions. Quantitative reverse transcription polymerase chain reaction (RT-PCR) indicated that in the chick, CNBP is present in all tissues examined in approximately equal ratios to total RNA. RT-PCR of total RNA isolated from different phyla indicates that CNBP-like proteins are widespread throughout the animal kingdom. The extraordinary level of conservation suggests an important physiological role for CNBP. (C) 1997 Elsevier Science Inc.

Relevance:

20.00%

Publisher:

Abstract:

The nifH gene sequence of the nitrogen-fixing bacterium Acetobacter diazotrophicus was determined with the use of the polymerase chain reaction and universal degenerate oligonucleotide primers. The gene shows highest pair-wise similarity to the nifH gene of Azospirillum brasilense. The phylogenetic relationships of the nifH gene sequences were compared with those inferred from 16S rRNA gene sequences. Knowledge of the sequence of the nifH gene contributes to the growing database of nifH gene sequences, and will allow the detection of Acet. diazotrophicus from environmental samples with nifH gene-based primers.

Relevance:

20.00%

Publisher:

Abstract:

A clone encoding ovine preprogastrin was isolated from a sheep genomic library. The deduced 104 amino acid sequence of ovine preprogastrin was 92% and 68% identical to the sequences of bovine and human preprogastrin, respectively. While the similarity was greatest in the gastrin-17 sequence, an unexpected similarity was also observed in the N-terminus of mature progastrin.

Relevance:

20.00%

Publisher:

Abstract:

Segregation of mRNAs in the cytoplasm of polar cells has been demonstrated for proteins involved in Xenopus and Drosophila oogenesis, and for some proteins in somatic cells. It is assumed that vectorial transport of the messages is generally responsible for this localization. The mRNA encoding the basic protein of central nervous system myelin is selectively transported to the distal ends of the processes of oligodendrocytes, where it is anchored to the myelin membrane and translated. This transport is dependent on a 21-nucleotide cis-acting segment of the 3'-untranslated region (RTS). Proteins that bind to this cis-acting segment have now been isolated from extracts of rat brain. A group of six 35-42-kDa proteins bind to a 35-base oligoribonucleotide incorporating the RTS, but not to several oligoribonucleotides with the same composition but randomized sequences, thus establishing specificity for the base sequence in the RTS. The most abundant of these proteins has been identified, by Edman sequencing of tryptic peptides and mass spectroscopy, as heterogeneous nuclear ribonucleoprotein (hnRNP) A2, a 36-kDa member of a family of proteins that are primarily, but not solely, intranuclear. This protein was most abundant in samples from rat brain and testis, with lower amounts in other tissues. It was separated from the other polypeptides by using reverse-phase HPLC and shown to retain preferential association with the RTS. In cultured oligodendrocytes, hnRNP A2 was demonstrated by confocal microscopy to be distributed throughout the nucleus, cell soma, and processes.

Relevance:

20.00%

Publisher:

Abstract:

Parkinson's disease (PD) is a neurodegenerative movement disorder primarily due to basal ganglia dysfunction. While much research has been conducted on Parkinsonian deficits in the traditional arena of musculoskeletal limb movement, research in other functional motor tasks is lacking. The present study examined articulation in PD with increasingly complex sequences of articulatory movement. Of interest was whether dysfunction would affect articulation in the same manner as in limb-movement impairment. In particular, since very similar (homogeneous) articulatory sequences (the tongue twister effect) are more difficult for healthy individuals to achieve than dissimilar (heterogeneous) gestures, while the reverse may apply for skeletal movements in PD, we asked which factor would dominate when PD patients articulated various grades of artificial tongue twisters: the influence of disease or a possible difference between the two motor systems. Execution was especially impaired when articulation involved a sequence of motor programs heterogeneous in terms of place of articulation. The results are suggestive of a hypokinesic tendency in complex sequential articulatory movement as in limb movement. It appears that PD patients do show abnormalities in articulatory movement which are similar to those of the musculoskeletal system. The present study suggests that an underlying disease effect modulates movement impairment across different functional motor systems. (C) 1998 Academic Press.

Relevance:

20.00%

Publisher:

Abstract:

Phosphorylation of the tumor suppressor p53 is generally thought to modify the properties of the protein in four of its five independent domains. We used synthetic peptides to directly study the effects of phosphorylation on the non-sequence-specific DNA binding and conformation of the C-terminal, basic domain. The peptides corresponded to amino acids 361-393 and were either nonphosphorylated or phosphorylated at the protein kinase C (PKC) site, Ser378, or the casein kinase II (CKII) site, Ser392, or bis-phosphorylated on both the PKC and the CKII sites. A fluorescence polarization analysis revealed that either the recombinant p53 protein or the synthetic peptides bound to two unrelated target DNA fragments. Phosphorylation of the peptide at the PKC or the CKII sites clearly decreased DNA binding, and addition of a second phosphate group almost completely abolished binding. Circular dichroism spectroscopy showed that the peptides assumed identical unordered structures in aqueous solutions. The unmodified peptide, unlike the Ser378 phosphorylated peptide, changed conformation in the presence of DNA. The inherent ability of the peptides to form an alpha-helix could be detected when circular dichroism and nuclear magnetic resonance spectra were taken in trifluoroethanol-water mixtures. A single or double phosphorylation destabilized the helix around the phosphorylated Ser378 residue but stabilized the helix downstream in the sequence.

Relevance:

20.00%

Publisher:

Abstract:

Monocrotaline is a pyrrolizidine alkaloid known to cause toxicity in humans and animals. Its mechanism of biological action is still unclear, although DNA crosslinking has been suggested to play a role in its activity. In this study we found that an active metabolite of monocrotaline, dehydromonocrotaline (DHM), alkylates guanines at the N7 position of DNA with a preference for 5'-GG and 5'-GA sequences. In addition, it generates piperidine- and heat-resistant multiple DNA crosslinks, as confirmed by electrophoresis and electron microscopy. On the basis of these findings, we propose that DHM undergoes rapid polymerization to a structure which is able to crosslink several fragments of DNA.

Relevance:

20.00%

Publisher:

Abstract:

Liver samples from rabbits killed by RHDV, collected from five States in Australia in 1996 and 1997, were analysed by RT-PCR. A 398 bp fragment of the capsid protein (VP60) gene was amplified by PCR and directly sequenced. The alignment of the nucleotide and amino acid sequences and their comparison with the original strain of the virus released in Australia indicated that genetic changes after two years have been small, with 98.2% to 100% identity. The constructed phylogenetic tree suggests slight differences in nucleotide substitutions in various States, but there is no clear evidence of clustering of sequences according to their geographic origin. In practical terms, sequencing of viral RNA provides a means of testing the efficacy of further releases and subsequent spread of the virus if such a strategy is employed as a means of enhancing RHD as a biological control of the wild rabbit in Australia.
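Identity figures such as those quoted in this abstract are typically computed as the percentage of matching positions between two aligned sequences. The sketch below assumes gap-free sequences of equal length; the function name `percent_identity` and the toy sequences are hypothetical, and the abstract does not state how many aligned positions were actually compared.

```python
def percent_identity(seq_a, seq_b):
    """Percentage of positions at which two aligned sequences agree.
    Assumption: both sequences are already aligned, gap-free and non-empty."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must have equal length")
    if not seq_a:
        raise ValueError("sequences must be non-empty")
    matches = sum(a == b for a, b in zip(seq_a, seq_b))
    return 100.0 * matches / len(seq_a)

# Illustrative only: if all 398 positions were compared, 7 substitutions
# would give an identity that rounds to 98.2%.
reference = "A" * 398
isolate = "A" * 391 + "C" * 7
identity = round(percent_identity(reference, isolate), 1)
```

Under that assumption, the lower bound reported in the abstract would correspond to roughly seven substitutions across the fragment.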

Relevance:

20.00%

Publisher:

Abstract:

Background: The ornamental tobacco Nicotiana alata produces a series of proteinase inhibitors (PIs) that are derived from a 43 kDa precursor protein, NaProPI. NaProPI contains six highly homologous repeats that fold to generate six separate structural domains, each corresponding to one of the native PIs. An unusual feature of NaProPI is that the structural domains lie across adjacent repeats and that the sixth PI domain is generated from fragments of the first and sixth repeats. Although the homology of the repeats suggests that they may have arisen from gene duplication, the observed folding does not appear to support this. This study of the solution structure of a single NaProPI repeat (aPI1) forms a basis for unravelling the mechanism by which this protein may have evolved. Results: The three-dimensional structure of aPI1 closely resembles the triple-stranded antiparallel beta sheet observed in each of the native PIs. The five-residue sequence Glu-Glu-Lys-Lys-Asn, which forms the linker between the six structural domains in NaProPI, exists as a disordered loop in aPI1. The presence of this loop in aPI1 results in a loss of the characteristically flat and disc-like topography of the native inhibitors. Conclusions: A single repeat from NaProPI is capable of folding into a compact globular domain that displays native-like PI activity. Consequently, it is possible that a similar single-domain inhibitor represents the ancestral protein from which NaProPI evolved.