907 resultados para patent sequence
Resumo:
The analysis of sequential data is required in many diverse areas such as telecommunications, stock market analysis, and bioinformatics. A basic problem related to the analysis of sequential data is the sequence segmentation problem. A sequence segmentation is a partition of the sequence into a number of non-overlapping segments that cover all data points, such that each segment is as homogeneous as possible. This problem can be solved optimally using a standard dynamic programming algorithm. In the first part of the thesis, we present a new approximation algorithm for the sequence segmentation problem. This algorithm has smaller running time than the optimal dynamic programming algorithm, while it has bounded approximation ratio. The basic idea is to divide the input sequence into subsequences, solve the problem optimally in each subsequence, and then appropriately combine the solutions to the subproblems into one final solution. In the second part of the thesis, we study alternative segmentation models that are devised to better fit the data. More specifically, we focus on clustered segmentations and segmentations with rearrangements. While in the standard segmentation of a multidimensional sequence all dimensions share the same segment boundaries, in a clustered segmentation the multidimensional sequence is segmented in such a way that dimensions are allowed to form clusters. Each cluster of dimensions is then segmented separately. We formally define the problem of clustered segmentations and we experimentally show that segmenting sequences using this segmentation model, leads to solutions with smaller error for the same model cost. Segmentation with rearrangements is a novel variation to the segmentation problem: in addition to partitioning the sequence we also seek to apply a limited amount of reordering, so that the overall representation error is minimized. We formulate the problem of segmentation with rearrangements and we show that it is an NP-hard problem to solve or even to approximate. We devise effective algorithms for the proposed problem, combining ideas from dynamic programming and outlier detection algorithms in sequences. In the final part of the thesis, we discuss the problem of aggregating results of segmentation algorithms on the same set of data points. In this case, we are interested in producing a partitioning of the data that agrees as much as possible with the input partitions. We show that this problem can be solved optimally in polynomial time using dynamic programming. Furthermore, we show that not all data points are candidates for segment boundaries in the optimal solution.
Resumo:
The H1',H2' and H2″ regions of the 270-MHz PMR spectra of two deoxydinucleotides, d-pTpA and d-pApT, have been analyzed. The coupling constants in the sugar ring indicate that both A and T sugars have a tendency to acquire 2E conformations. There is also a marginal difference in the 2E populations of the T sugar in the two dinucleotides. The trends in the chemical shifts of base protons indicate different stacking of the bases in d-pApT and d-pTpA. The sequence effects on base stacking and pentose conformation are discussed.
Resumo:
While many placental herpesvirus genomes have been fully sequenced, the complete genome of a marsupial herpesvirus has not been described. Here we present the first genome sequence of a metatherian herpesvirus, Macropodid herpesvirus 1 (MaHV-1).
Resumo:
Based upon a stereochemical guideline, two topologically distinct types of helicalduplexes have been deduced for a polynucleotide duplex with alternating purine pyrimidine sequence (PAPP): (a) right-handed uniform (RU) helix and (b) left-handed zig-zag (LZ) helix. Both structures have trinucleoside diphosphate as the basic unit wherein the purine pyrimidine fragment has a different conformation from the pyrimidine-purine fragment. Thus, RU and LZ helices represent two different classes of sequence-dependent molecular conformations for PAPP. The conformationalf eatures of an RU helix of PAPP in B-form and three LZ-helices for B-, D- and Z-forms are discussed.
Resumo:
In an earlier communication[l] we have indicated a general graphical design procedure for a sequence of sparger reactors in which a second order liquid phase reaction proceeds in a stagewise fashion. The prediction of the reactant concentration in each stage and hence the conversion depended on a search procedure initiated along a straight line representing the mass balance equation at the given stage and drawn from the known feed stage located on the abscissa in a E-IU diagram for the given system.
Resumo:
The effect of phenobarbital on the rates of the synthesis of the protein and heme moieties of cytochrome P-450 has been studied. For this purpose, cytochrome P-450 has been partially purified as its P-420 derivative and the labeled amino acid incorporation into the protein has been studied after subjecting a partially purified preparation to sodium dodecyl sulfate gel electrophoresis. The incorporation studies into the protein species after sodium dodecyl sulfate gel electrophoresis reveal that the drug primarily accelerates the rate of apoprotein synthesis followed by an increase in the rate of heme synthesis. The messenger for apocytochrome P-450 appears to be fairly stable.
Resumo:
The sequence distribution studies on the acrylonitrile-methylmethacrylate copolymer of high methylmethacrylate (M) content (30%
Resumo:
Antibodies were raised in rabbits against the bovine serum albumin conjugate of dpApT. Analysis by double diffusion in agar gel and quantitative precipitation test showed the presence of antibodies specific to the hapten in the antisera. Quantitative data on the specificity of the antibodies were obtained by studying the inhibition of the binding of 3H-dpApT to the anti-sera by various nonradioactive mono- and oligonucleotides, using a nitrocellulose membrane binding assay. The antibodies were found to be highly specific for the dinucleotide sequence dpApT. The antibodies were able to bind to synthetic oligonucleotides containing the sequence dpApT and to denatured calf thymus DNA.
Resumo:
Reaction of the bromoketals 3, 7a-g and 11 with tri-n-butyltin chloride and sodium cyanoborohydride in the presence of a catalytic amount of AIBN furnished the ethers 5, 8a-g and 13 via a tandem sequence comprising of a radical cyclisation reaction and tri-n-butylhalostannane and sodium cyanoborohydride mediated reductive demethoxylation of the resulting cyclic ketals.
Resumo:
The 3prime terminal 1255nt sequence of Physalis mottle virus (PhMV) genomic RNA has been determined from a set of overlapping cDNA clones. The open reading frame (ORF) at the 3prime terminus corresponds to the amino acid sequence of the coat protein (CP) determined earlier except for the absence of the dipeptide, Lys-Leu, at position 110-111. In addition, the sequence upstream of the CP gene contains the message coding for 178 amino acid residues of the C-terminus of the putative replicase protein (RP). The sequence downstream of the CP gene contains an untranslated region whose terminal 80 nucleotides can be folded into a characteristic tRNA-like structure. A phylogenetic tree constructed after aligning separately the sequence of the CP, the replicase protein (RP) and the tRNA-like structure determined in this study with the corresponding sequences of other tymoviruses shows that PhMV wrongly named belladonna mottle virus [BDMV(I)] is a separate tymovirus and not another strain of BDMV(E) as originally envisaged. The phylogenetic tree in all the three cases is identical showing that any subset of genomic sequence of sufficient length can be used for establishing evolutionary relationships among tymoviruses.
Resumo:
The nucleotide sequence of a proline tRNA (anticodon UGG) from cucumber chloroplasts has been determined. The sequence is: pAAGGAUGUAGCGCAGCUUCADAGCGCAΨUUGUUUUGGNΨFACAAAAUm7GUCACGGGTΨCAAAUCCUGUCAUCCUUACCAOH. It shows 93% homology with spinach chloroplast tRNAPro (UGG) and 72% homology with bean mitochondrial tRNA Pro (UGG), the other two known plant organellar tRNAsPro.
Resumo:
The incidence of human infections by the fungal pathogen Candida species has been increasing in recent years. Enolase is an essential protein in fungal metabolism. Sequence data is available for human and a number of medically important fungal species. An understanding of the structural and functional features of fungal enolases may provide the structural basis for their use as a target for the development of new anti-fungal drugs. We have obtained the sequence of the enolase of Candida krusei (C. krusei), as it is a significant medically important fungal pathogen. We have then used multiple sequence alignments with various enolase isoforms in order to identify C. krusei specific amino acid residues. The phylogenetic tree of enolases shows that the C. krusei enolase assembles on the tree with the fungal genes. Importantly, C. krusei lacks four amino acids in the active site compared to human enolase, as revealed by multiple sequence alignments. These differences in the substrate binding site may be exploited for the design of new anti-fungal drugs to selectively block this enzyme. The lack of the important amino acids in the active site also indicates that C. krusei enolase might have evolved as a member of a mechanistically diverse enolase superfamily catalying somewhat different reactions.
Resumo:
One of the monoclonal antibodies raised against bovine beta-lactoglobulin reacted with human serum retinol binding protein. The finding that this monoclonal antibody also reacted with the serum retinol binding proteins isolated from other animals, suggested that this epitopic conformation is conserved among these proteins. Using ELISA and various synthetic peptides of defined sequence, we show in this paper that the epitope defined by this monoclonal antibody comprises of the highly conserved core sequence of DTDY present in beta-lactoglobulin and retinol binding proteins.
Resumo:
VP6, the intermediate capsid protein of the virion, specifies subgroup specificity of rotavirus, It is also the most conserved, both at nucleotide and amino acid levels, among group A rotaviruses and is the target of choice for rotavirus detection, In this study we report the sequence of the subgroup I (SGI)-specific VP6 from the serotype G2 strain IS2 isolated from a child suffering from acute diarrhoea in Bangalore ana its comparison with the published VP6 sequences. Interestingly, IS2 gene 6 shared highest homology with that from bovine UK strain and the protein contained substitutions by lysine at amino acid positions 97 and 134, In contrast, the amino acids Met and Glu/Asp at these respective positions are highly conserved in all the other group A rotaviruses sequenced so far, These observations have obvious implications for the evolution of serotype G2 and G2-like strains circulating in India, The SGI VP6, of a human rotavirus, possessing epitopes that are conformationally similar to those found in the native protein in the virion, was successfully expressed in E. coli and purified for the first time by single-step affinity chromatography.