963 results for Multiple sequence alignment


Relevance: 40.00%

Abstract:

Sequence analysis and optimal matching are useful heuristic tools for the descriptive analysis of heterogeneous individual pathways such as educational careers, job sequences or patterns of family formation. However, to date it remains unclear how to handle the inevitable problems caused by missing values with regard to such analysis. Multiple Imputation (MI) offers a possible solution for this problem but it has not been tested in the context of sequence analysis. Against this background, we contribute to the literature by assessing the potential of MI in the context of sequence analyses using an empirical example. Methodologically, we draw upon the work of Brendan Halpin and extend it to additional types of missing value patterns. Our empirical case is a sequence analysis of panel data with substantial attrition that examines the typical patterns and the persistence of sex segregation in school-to-work transitions in Switzerland. The preliminary results indicate that MI is a valuable methodology for handling missing values due to panel mortality in the context of sequence analysis. MI is especially useful in facilitating a sound interpretation of the resulting sequence types.

Relevance: 40.00%

Abstract:

Multiple-complete-digest mapping is a DNA mapping technique based on complete-restriction-digest fingerprints of a set of clones that provides highly redundant coverage of the mapping target. The maps assembled from these fingerprints order both the clones and the restriction fragments. Maps are coordinated across three enzymes in the examples presented. Starting with yeast artificial chromosome contigs from the 7q31.3 and 7p14 regions of the human genome, we have produced cosmid-based maps spanning more than one million base pairs. Each yeast artificial chromosome is first subcloned into cosmids at a redundancy of ×15–30. Complete-digest fragments are electrophoresed on agarose gels, poststained, and imaged on a fluorescent scanner. Aberrant clones that are not representative of the underlying genome are rejected in the map construction process. Almost every restriction fragment is ordered, allowing selection of minimal tiling paths with clone-to-clone overlaps of only a few thousand base pairs. These maps demonstrate the practicality of applying the experimental and software-based steps in multiple-complete-digest mapping to a target of significant size and complexity. We present evidence that the maps are sufficiently accurate to validate both the clones selected for sequencing and the sequence assemblies obtained once these clones have been sequenced by a “shotgun” method.

Relevance: 40.00%

Abstract:

PALI (release 1.2) contains three-dimensional (3-D) structure-dependent sequence alignments as well as structure-based phylogenetic trees of homologous protein domains in various families. The data set of homologous protein structures has been derived by consulting the SCOP database (release 1.50) and the data set comprises 604 families of homologous proteins involving 2739 protein domain structures with each family made up of at least two members. Each member in a family has been structurally aligned with every other member in the same family (pairwise alignment) and all the members in the family are also aligned using simultaneous superposition (multiple alignment). The structural alignments are performed largely automatically, with manual interventions especially in the cases of distantly related proteins, using the program STAMP (version 4.2). Every family is also associated with two dendrograms, calculated using PHYLIP (version 3.5), one based on a structural dissimilarity metric defined for every pairwise alignment and the other based on similarity of topologically equivalent residues. These dendrograms enable easy comparison of sequence and structure-based relationships among the members in a family. Structure-based alignments with the details of structural and sequence similarities, superposed coordinate sets and dendrograms can be accessed conveniently using a web interface. The database can be queried for protein pairs with sequence or structural similarities falling within a specified range. Thus PALI forms a useful resource to help in analysing the relationship between sequence and structure variation at a given level of sequence similarity. PALI also contains over 653 ‘orphans’ (single member families). Using the web interface involving PSI_BLAST and PHYLIP it is possible to associate the sequence of a new protein with one of the families in PALI and generate a phylogenetic tree combining the query sequence and proteins of known 3-D structure. The database with the web interfaced search and dendrogram generation tools can be accessed at http://pauling.mbu.iisc.ernet.in/~pali.

Relevance: 40.00%

Abstract:

STACK is a tool for detection and visualisation of expressed transcript variation in the context of developmental and pathological states. The datasystem organises and reconstructs human transcripts from available public data in the context of expression state. The expression state of a transcript can include developmental state, pathological association, site of expression and isoform of expressed transcript. STACK consensus transcripts are reconstructed from clusters that capture and reflect the growing evidence of transcript diversity. The comprehensive capture of transcript variants is achieved by the use of a novel clustering approach that is tolerant of sub-sequence diversity and does not rely on pairwise alignment. This is in contrast with other gene indexing projects. STACK is generated at least four times a year and represents the exhaustive processing of all publicly available human EST data extracted from GenBank. This processed information can be explored through 15 tissue-specific categories, a disease-related category and a whole-body index and is accessible via WWW at http://www.sanbi.ac.za/Dbases.html. STACK represents a broadly applicable resource, as it is the only reconstructed transcript database for which the tools for its generation are also broadly available (http://www.sanbi.ac.za/CODES).

Relevance: 40.00%

Abstract:

By detailed NMR analysis of a human telomere repeating unit, d(CCCTAA), we have found that three distinct tetramers, each of which consists of four symmetric single-strands, slowly exchange in a slightly acidic solution. Our new finding is a novel i-motif topology (T-form) where T4 is intercalated between C1 and C2 of the other duplex. The other two tetramers have a topology where C1 is intercalated between C2 and C3 of the other parallel duplex, resulting in the non-stacking T4 residues (R-form), and a topology where C1 is stacked between C3 and T4 of the other duplex (S-form). From the NMR denaturation profile, the R-form is the most stable of the three structures in the temperature range of 15–50°C, the S-form the second and the T-form the least stable. The thermodynamic parameters indicate that the T-form is the most enthalpically driven and entropically opposed, and its population is increased with decreasing temperature. The T-form structure determined by restrained molecular dynamics calculation suggests that inter-strand van der Waals contacts in the narrow grooves should contribute to the enthalpic stabilization of the T-form.

Relevance: 40.00%

Abstract:

In this study, we propose a novel method to predict the solvent accessible surface areas of transmembrane residues. For both transmembrane alpha-helix and beta-barrel residues, the correlation coefficients between the predicted and observed accessible surface areas are around 0.65. On the basis of predicted accessible surface areas, residues exposed to the lipid environment or buried inside a protein can be identified by using certain cutoff thresholds. We have extensively examined our approach based on different definitions of accessible surface areas and a variety of sets of control parameters. Given that experimentally determining the structures of membrane proteins is very difficult and membrane proteins are actually abundant in nature, our approach is useful for theoretically modeling membrane protein tertiary structures, particularly for modeling the assembly of transmembrane domains. This approach can be used to annotate the membrane proteins in proteomes to provide extra structural and functional information.

Relevance: 40.00%

Abstract:

The rapidly increasing demand for cellular telephony is placing greater demand on the limited bandwidth resources available. This research is concerned with techniques which enhance the capacity of a Direct-Sequence Code-Division-Multiple-Access (DS-CDMA) mobile telephone network. The capacities of both Private Mobile Radio (PMR) and cellular networks are derived, and the many techniques which are currently available are reviewed. Areas which may be further investigated are identified. One technique which is developed is the sectorisation of a cell into toroidal rings. Splitting the cell into these concentric rings is shown to provide an increased system capacity, and this is compared with cell clustering and other sectorisation schemes. Another technique for increasing the capacity is to add to the amount of inherent randomness within the transmitted signal so that the system is better able to extract the wanted signal. A system model has been produced for a cellular DS-CDMA network and the results are presented for two possible strategies. One of these strategies is the variation of the chip duration over a signal bit period. Several different variation functions are tried and a sinusoidal function is shown to provide the greatest increase in the maximum number of system users for any given signal-to-noise ratio. The other strategy considered is the use of additive amplitude modulation together with data/chip phase-shift-keying. The amplitude variations are determined by a sparse code so that the average system power is held near its nominal level. This strategy is shown to provide no further capacity since the system is sensitive to amplitude variations. When both strategies are employed, however, the sensitivity to amplitude variations is shown to reduce, thus indicating that the first strategy increases both the capacity and the ability to handle fluctuations in the received signal power.

Relevance: 30.00%

Abstract:

The availability of chloroplast genome (cpDNA) sequences of Atropa belladonna, Nicotiana sylvestris, N. tabacum, N. tomentosiformis, Solanum bulbocastanum, S. lycopersicum and S. tuberosum, which are Solanaceae species, allowed us to analyze the organization of cpSSRs in their genic and intergenic regions. In general, the number of cpSSRs in cpDNA ranged from 161 in S. tuberosum to 226 in N. tabacum, and the number of intergenic cpSSRs was higher than that of genic cpSSRs. The mononucleotide repeats were the most frequent in the studied species, but we also identified di-, tri-, tetra-, penta- and hexanucleotide repeats. Multiple alignments of all cpSSR sequences from Solanaceae species made the identification of nucleotide variability possible, and the phylogeny was estimated by maximum parsimony. Our study showed that the plastome database can be exploited for phylogenetic analyses and biotechnological approaches.
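
As an illustration of what counting mono- to hexanucleotide repeats can look like in practice, the following is a minimal sketch of a regex-based SSR scan. It is not the method used in the study: the function name `find_ssrs`, the example sequence and the minimum repeat thresholds are all hypothetical, since the abstract does not state the detection criteria.

```python
import re

def find_ssrs(seq, min_repeats):
    """Return (start, motif, repeat_count) for each simple sequence repeat.

    min_repeats maps motif length -> minimum number of tandem copies.
    These thresholds are hypothetical; the abstract does not give any.
    """
    seq = seq.upper()
    hits = []
    for k, n in sorted(min_repeats.items()):
        # A motif of length k followed by at least n - 1 further copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (k, n - 1))
        for m in pattern.finditer(seq):
            motif = m.group(1)
            # Skip motifs that are themselves repeats of a shorter unit,
            # e.g. "AA" is really a mononucleotide repeat.
            if any(motif == motif[:d] * (k // d)
                   for d in range(1, k) if k % d == 0):
                continue
            hits.append((m.start(), motif, len(m.group(0)) // k))
    return sorted(hits)

# Toy sequence: a run of 10 A's and 5 tandem copies of "AT".
example = "GGC" + "A" * 10 + "TTC" + "AT" * 5 + "GC"
ssrs = find_ssrs(example, {1: 8, 2: 4})
print(ssrs)
```

Extending the `min_repeats` mapping with keys 3 to 6 would cover tri- to hexanucleotide motifs in the same way; splitting hits into genic and intergenic classes would additionally need gene coordinates, which are not part of this sketch.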

Relevance: 30.00%

Abstract:

This study outlines the quantification of low levels of Alicyclobacillus acidoterrestris in pure cultures, since this bacterium is not inactivated by pasteurization and may remain in industrialized foods and beverages. Electroconductive polymer-modified fluorine tin oxide (FTO) electrodes and multiple nanoparticle labels were used for biosensing. The detection of A. acidoterrestris in pure cultures was performed by reverse transcription polymerase chain reaction (RT-PCR) and the sensitivity was further increased by asymmetric nested RT-PCR using electrochemical detection for quantification of the amplicon. The quantification of nested RT-PCR products by Ag/Au-based electrochemical detection was able to detect 2 colony forming units per mL (CFU mL⁻¹) of spores in pure culture and low detection and quantification limits (7.07 and 23.6 nM, respectively) were obtained for the target A. acidoterrestris on the electrochemical detection bioassay.

Relevance: 30.00%

Abstract:

A gap has been identified in the literature on the diagnosis and monitoring of the degree of strategic alignment. The main objective of this article is to diagnose and analyze the strategic alignment profile using the alignment diagnostic profile (ADP) tool, which enables organizations to show visually their degree of strategic alignment. The methodological approach adopted is multiple-case studies, which were conducted at five organizations in the medical diagnostics sector. The results indicate that the ADP enables organizations to understand the steps required to improve their level of alignment and to identify and locate gaps and conflicts.

Relevance: 30.00%

Abstract:

In this paper the continuous Verhulst dynamic model is used to synthesize a new distributed power control algorithm (DPCA) for use in direct sequence code division multiple access (DS-CDMA) systems. The Verhulst model was initially designed to describe the population growth of biological species under food and physical space restrictions. The discretization of the corresponding differential equation is accomplished via the Euler numeric integration (ENI) method. Analytical convergence conditions for the proposed DPCA are also established. Several properties of the proposed recursive algorithm, such as Euclidean distance from the optimum vector after convergence, convergence speed, normalized mean squared error (NSE), average power consumption per user, performance under dynamic channels, and implementation complexity aspects, are analyzed through simulations. The simulation results are compared with two other DPCAs: the classic algorithm derived by Foschini and Miljanic and the sigmoidal algorithm of Uykan and Koivo. Under estimation error conditions, the proposed DPCA exhibits a smaller discrepancy from the optimum power vector solution and better convergence (under fixed and adaptive convergence factors) than the classic and sigmoidal DPCAs. (C) 2010 Elsevier GmbH. All rights reserved.
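
To make the idea of an Euler-discretized Verhulst (logistic) update more concrete, here is a minimal sketch of one plausible form of such a power control recursion. The abstract does not give the update rule, so the form `p[n+1] = p[n] * (1 + alpha * (1 - sinr / target))` is an assumption, as are the function names, the three-user gain matrix, the noise level, the SINR target and the step size `alpha`.

```python
def sinr(p, gains, noise):
    """SINR of each user for transmit powers p and a link-gain matrix."""
    out = []
    for i in range(len(p)):
        interference = sum(gains[i][j] * p[j]
                           for j in range(len(p)) if j != i)
        out.append(gains[i][i] * p[i] / (interference + noise))
    return out

def verhulst_step(p, gains, noise, target, alpha):
    """One Euler step of a logistic-style update per user (assumed form).

    Each user only needs its own measured SINR, so the update is distributed.
    """
    gamma = sinr(p, gains, noise)
    return [p_i * (1.0 + alpha * (1.0 - g_i / target))
            for p_i, g_i in zip(p, gamma)]

# Hypothetical three-user scenario with weak cross-interference.
gains = [[1.0, 0.1, 0.1],
         [0.1, 1.0, 0.1],
         [0.1, 0.1, 1.0]]
noise = 0.1
target = 2.0   # desired SINR for every user
alpha = 0.5    # Euler step size / convergence factor

p = [0.01, 0.02, 0.05]
for _ in range(200):
    p = verhulst_step(p, gains, noise, target, alpha)

final_sinr = sinr(p, gains, noise)
print(p, final_sinr)
```

In this toy setting the fixed point is reached when every user's SINR equals the target, which is the same equilibrium that the classic Foschini and Miljanic iteration aims for; the sketch says nothing about the convergence conditions or error robustness analyzed in the paper.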

Relevance: 30.00%

Abstract:

A proteinase, named BmooMP alpha-I, from the venom of Bothrops moojeni, was purified by DEAE-Sephacel, Sephadex G-75 and heparin-agarose column chromatography. The enzyme was purified to homogeneity as judged by its migration profile in SDS-PAGE stained with Coomassie blue, and showed a molecular mass of about 24.5 kDa. Its complete cDNA was obtained by RT-PCR and the 615 bp coded for a mature protein of 205 amino acid residues. The multiple alignment of its deduced amino acid sequence and those of other snake venom metalloproteinases showed a high structural similarity, mainly among class P-IB proteases. The enzyme cleaves the A alpha-chain of fibrinogen first, followed by the B beta-chain, and shows no effects on the gamma-chain. On fibrin, the enzyme hydrolyzed only the beta-chain, leaving the gamma-dimer apparently untouched. It was devoid of phospholipase A2, hemorrhagic and thrombin-like activities. Like many venom enzymes, it is stable at pH values between 4 and 10 and stable at 70 °C for 15 min. The inhibitory effects of EDTA on the fibrinogenolytic activity suggest that BmooMP alpha-I is a metalloproteinase, and inhibition by beta-mercaptoethanol revealed the important role of the disulfide bonds in the stabilization of the native structure. Aprotinin and benzamidine, specific serine proteinase inhibitors, had no effect on BmooMP alpha-I activity. Since the BmooMP alpha-I enzyme was found to cause defibrinogenation when administered i.p. to mice, it is expected that it may be of medical interest as a therapeutic agent in the treatment and prevention of arterial thrombosis. (C) 2007 Elsevier Ltd. All rights reserved.

Relevance: 30.00%

Abstract:

Despite the success of conventional Sanger sequencing, significant regions of many genomes still present major obstacles to sequencing. Here we propose a novel approach with the potential to alleviate a wide range of sequencing difficulties. The technique involves extracting target DNA sequence from variants generated by introduction of random mutations. The introduction of mutations does not destroy original sequence information, but distributes it amongst multiple variants. Some of these variants lack problematic features of the target and are more amenable to conventional sequencing. The technique has been successfully demonstrated with mutation levels up to an average 18% base substitution and has been used to read previously intractable poly(A), AT-rich and GC-rich motifs.
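
The claim that random mutation distributes the original sequence information across variants, rather than destroying it, can be illustrated with a deliberately simplified sketch: a per-column majority vote over variants that are assumed to be already aligned and to differ from the target by substitutions only. This is not the reconstruction procedure of the study, which the abstract does not describe; the function name `consensus` and the toy sequences are hypothetical.

```python
from collections import Counter

def consensus(variants):
    """Majority-vote base at each column of equal-length, pre-aligned variants."""
    if len({len(v) for v in variants}) != 1:
        raise ValueError("variants must all have the same length")
    return "".join(Counter(col).most_common(1)[0][0]
                   for col in zip(*variants))

# Toy target with a homopolymer run and a GC-rich stretch (illustrative only).
target = "AAAAAAGGCCGG"

# Five hypothetical mutated variants, each carrying one or two substitutions.
variants = [
    "AAGAAAGGCCGG",
    "AAAAACGGTCGG",
    "ATAAAAGGCCGA",
    "AAAAAAGACCGG",
    "CAAATAGGCCGG",
]

recovered = consensus(variants)
print(recovered)
```

Because each position is mutated in only a minority of the variants, the vote recovers the target even though no single variant matches it. Real data would also require aligning the variant reads first, which this sketch leaves out.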

Relevance: 30.00%

Abstract:

Analysis of the structure of the urochordate Herdmania curvata ribosomal DNA intergenic spacer (IGS) and its role in transcription initiation and termination suggests that rRNA gene regulation in this chordate differs from that in vertebrates. A cloned H. curvata IGS is 1881 bp and composed predominantly of two classes of similar repeat sequences that largely alternate in a tandem array. Southern blot hybridization demonstrates that the IGS length variation within an individual and population is largely the result of changes in internal repeat number. Nuclease S1 mapping and primer extension analyses suggest that there are two transcription initiation sites at the 3' end of the most 3' repetitive element; these sites are 6 nucleotides apart. Unlike mouse, Xenopus, and Drosophila, there is no evidence of transcription starting elsewhere in the IGS. Most sequence differences between the promoter repeat and the other internal repeats are in the vicinity of the putative initiation sites. As in Drosophila, nuclease S1 mapping of transcription termination sites suggests that there is not a definitive stop site and that a majority of the pre-rRNAs read through a substantial portion of the IGS. Some transcription appears to proceed completely through the promoter repeat into the adjacent rDNA unit. Analysis of oocyte RNA by reverse transcription-polymerase chain reaction (RT-PCR) confirms that readthrough transcription into the adjacent rDNA unit is occurring in some small IGS length variants; there is no evidence of complete readthrough of IGSs larger than 1.0 kb.