951 results for sequence based alignments


Relevance:

80.00% 80.00%

Publisher:

Abstract:

Motivation: While processing of MHC class II antigens for presentation to helper T-cells is essential for normal immune response, it is also implicated in the pathogenesis of autoimmune disorders and hypersensitivity reactions. Sequence-based computational techniques for predicting HLA-DQ binding peptides have encountered limited success, with few prediction techniques developed using three-dimensional models. Methods: We describe a structure-based prediction model for modeling peptide-DQ3.2 beta complexes. We have developed a rapid and accurate protocol for docking candidate peptides into the DQ3.2 beta receptor and a scoring function to discriminate binders from the background. The scoring function was rigorously trained, tested and validated using experimentally verified DQ3.2 beta binding and non-binding peptides obtained from biochemical and functional studies. Results: Our model predicts DQ3.2 beta binding peptides with high accuracy [area under the receiver operating characteristic (ROC) curve A(ROC) > 0.90], compared with experimental data. We investigated the binding patterns of DQ3.2 beta peptides and illustrate that several registers exist within a candidate binding peptide. Further analysis reveals that peptides with multiple registers occur predominantly for high-affinity binders.
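The A(ROC) figure quoted above can be read as the probability that a randomly chosen binder scores higher than a randomly chosen non-binder. The following is a minimal sketch of that calculation, not the authors' evaluation code; the function name `roc_auc` and the score lists are hypothetical and purely illustrative.

```python
def roc_auc(binder_scores, nonbinder_scores):
    """Area under the ROC curve via pairwise comparison.

    Counts the fraction of (binder, non-binder) pairs in which the binder
    receives the higher score; ties count as half a win.
    """
    wins = 0.0
    for b in binder_scores:
        for n in nonbinder_scores:
            if b > n:
                wins += 1.0
            elif b == n:
                wins += 0.5
    return wins / (len(binder_scores) * len(nonbinder_scores))


# Hypothetical scores (higher = predicted stronger binding), not data from the study.
binders = [0.9, 0.8, 0.7, 0.4]
nonbinders = [0.6, 0.3, 0.2, 0.1]
auc = roc_auc(binders, nonbinders)
print(auc)
```

A value of 0.5 corresponds to random ranking and 1.0 to perfect separation, so a reported A(ROC) above 0.90 means binders outrank non-binders in more than nine of ten such pairs.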

Relevance:

80.00% 80.00%

Publisher:

Abstract:

Pattern discovery in a long temporal event sequence is of great importance in many application domains. Most previous work focuses on identifying positive associations among time-stamped event types. In this paper, we introduce the problem of defining and discovering negative associations that, like positive rules, may also serve as a source of knowledge discovery. In general, an event-oriented pattern is a pattern that is associated with a selected type of event, called a target event. As a counterpart to previous research, we identify patterns that have a negative relationship with the target events. A set of criteria is defined to evaluate the interestingness of patterns associated with such negative relationships. For counting the frequency of a pattern, we propose a new approach, called unique minimal occurrence, which guarantees that the Apriori property holds for all patterns in a long sequence. Based on the interestingness measures, algorithms are proposed to discover potentially interesting patterns for this negative rule problem. Finally, experiments are conducted on a real application.

Relevance:

80.00% 80.00%

Publisher:

Abstract:

Pattern discovery in temporal event sequences is of great importance in many application domains, such as telecommunication network fault analysis. In reality, not every type of event has an accurate timestamp. Some events, defined as inaccurate events, may only have an interval as their possible time of occurrence. The existence of inaccurate events may cause uncertainty in event ordering. The traditional support model cannot deal with this uncertainty, which would cause some interesting patterns to be missed. A new concept, precise support, is introduced to evaluate the probability that a pattern is contained in a sequence. Based on this new metric, we define the uncertainty model and present an algorithm to discover interesting patterns in a sequence database that has one type of inaccurate event. In our model, the number of types of inaccurate events can readily be extended to k, although at the cost of increased computational complexity.

Relevance:

80.00% 80.00%

Publisher:

Abstract:

Based on Bayesian Networks, methods were created that address protein sequence-based bacterial subcellular location prediction. Distinct predictive algorithms for the eight bacterial subcellular locations were created. Several variant methods were explored. These variations included differences in the number of residues considered within the query sequence - which ranged from the N-terminal 10 residues to the whole sequence - and residue representation - which took the form of amino acid composition, percentage amino acid composition, or normalised amino acid composition. The accuracies of the best performing networks were then compared to PSORTB. All individual location methods outperform PSORTB except for the Gram+ cytoplasmic protein predictor, for which accuracies were essentially equal, and for outer membrane protein prediction, where PSORTB outperforms the binary predictor. The method described here is an important new approach to method development for subcellular location prediction. It is also a new, potentially valuable tool for candidate subunit vaccine selection.
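The residue representations mentioned above can be illustrated with a short sketch. It assumes the 20 standard amino acids and shows only raw and percentage composition over an N-terminal window or the whole sequence; the abstract does not define its normalised variant, so that form is not attempted here. The function names and the example sequence are hypothetical, and this is not the authors' feature-extraction code.

```python
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues, in a fixed order


def composition(sequence, n_terminal=None):
    """Raw residue counts over the first n_terminal residues (or the whole sequence)."""
    window = sequence[:n_terminal] if n_terminal else sequence
    counts = Counter(window)
    return [counts.get(aa, 0) for aa in AMINO_ACIDS]


def percentage_composition(sequence, n_terminal=None):
    """Residue counts expressed as percentages of the counted residues."""
    counts = composition(sequence, n_terminal)
    total = sum(counts)
    return [100.0 * c / total if total else 0.0 for c in counts]


# Hypothetical query sequence, for illustration only.
query = "MKKLLAAGLLLAASSA"
first_ten = composition(query, n_terminal=10)
first_ten_pct = percentage_composition(query, n_terminal=10)
print(dict(zip(AMINO_ACIDS, first_ten)))
```

Either vector has a fixed length of 20 regardless of sequence length, which is what makes it convenient as input to a classifier such as a Bayesian network.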

Relevance:

80.00% 80.00%

Publisher:

Abstract:

As torrents of new data now emerge from microbial genomics, bioinformatic prediction of immunogenic epitopes remains challenging but vital. In silico methods often produce paradoxically inconsistent results: good prediction rates on certain test sets but not on others. The inherent complexity of immune presentation and recognition processes complicates epitope prediction. Two encouraging developments – data-driven artificial intelligence sequence-based methods for epitope prediction and molecular modeling methods based on three-dimensional protein structures – offer hope for the future.

Relevance:

80.00% 80.00%

Publisher:

Abstract:

Forty-four species of Colletotrichum are confirmed as present in Australia based on DNA sequencing analyses. Many of these species were identified directly as a result of two workshops organised by the Subcommittee on Plant Health Diagnostics in Australia in 2015 that covered morphological and molecular approaches to identification of Colletotrichum. There are several other species of Colletotrichum reported from Australia that remain to be substantiated by DNA sequence-based methods. This body of work aims to provide a basis from which to critically examine a number of isolates of Colletotrichum deposited in Australian culture collections.

Relevance:

80.00% 80.00%

Publisher:

Abstract:

BACKGROUND Adenoviruses are common pathogens in vertebrates, including humans. In marine mammals, adenovirus has been associated with fatal hepatitis in sea lions. However, only in rare cases have adenoviruses been detected in cetaceans, where no clear correlation was found between presence of the virus and disease status. CASE PRESENTATION A novel adenovirus was identified in four captive bottlenose dolphins with self-limiting gastroenteritis. Viral detection and identification were achieved by PCR amplification from fecal samples; by sequencing of partial adenovirus polymerase (pol) and hexon genes; and by producing the virus in HeLa cells, with PCR and immunofluorescence detection and sequencing of the amplified pol and hexon gene fragments. A causative role of this adenovirus in gastroenteritis was suggested by: 1) the failure to identify other potential etiological agents; 2) the exclusive detection of this novel adenovirus and of seropositivity for canine adenoviruses 1 and 2 in the four sick dolphins, but not in 10 healthy individuals of the same captive population; and 3) the disappearance of the virus from feces after clinical signs receded. The partial sequences of the amplified fragments of the pol and hexon genes were closest to those of adenoviruses identified in sea lions with fatal adenoviral hepatitis, and to a GenBank-deposited sequence obtained from a harbour porpoise. CONCLUSION These data suggest that adenovirus can cause self-limiting gastroenteritis in dolphins. This adenoviral infection can be detected by serology and by PCR detection in fecal material. Lack of signs of hepatitis in sick dolphins may reflect restricted tissue tropism or virulence of this adenovirus compared to those of the adenovirus identified in sea lions. Gene sequence-based phylogenetic analysis supports a common origin of adenoviruses that affect sea mammals. Our findings suggest the need for vigilance against adenoviruses in captive and wild dolphin populations.

Relevance:

80.00% 80.00%

Publisher:

Abstract:

Microbes associated with marine sponges play significant roles in host physiology. Remarkable levels of microbial diversity have been observed in sponges worldwide through both culture-dependent and culture-independent studies. Most studies have focused on the structure of the bacterial communities in sponges and have involved sponges sampled from shallow waters. Here, we used pyrosequencing of 16S rRNA genes to compare the bacterial and archaeal communities associated with two individuals of the marine sponge Inflatella pellicula from the deep sea, sampled from a depth of 2,900 m, a depth which far exceeds any previous sequence-based report of sponge-associated microbial communities. Sponge-microbial communities were also compared to the microbial community in the surrounding seawater. Sponge-associated microbial communities were dominated by archaeal sequencing reads with a single archaeal OTU, comprising ∼60% and ∼72% of sequences, being observed from Inflatella pellicula. Archaeal sequencing reads were less abundant in seawater (∼11% of sequences). Sponge-associated microbial communities were less diverse and less even than any other sponge-microbial community investigated to date with just 210 and 273 OTUs (97% sequence identity) identified in sponges, with 4 and 6 dominant OTUs comprising ∼88% and ∼89% of sequences, respectively. Members of the candidate phyla, SAR406, NC10 and ZB3 are reported here from sponges for the first time, increasing the number of bacterial phyla or candidate divisions associated with sponges to 43. A minor cohort from both sponge samples (∼0.2% and ∼0.3% of sequences) were not classified to phylum level. A single OTU, common to both sponge individuals, dominates these unclassified reads and shares sequence homology with a sponge associated clone which itself has no known close relative and may represent a novel taxon.

Relevance:

80.00% 80.00%

Publisher:

Abstract:

The diagnosis of mixed genotype hepatitis C virus (HCV) infection is rare and information on incidence in the UK, where genotypes 1a and 3 are the most prevalent, is sparse. Considerable variations in the efficacies of direct-acting antivirals (DAAs) for the HCV genotypes have been documented and the ability of DAAs to treat mixed genotype HCV infections remains unclear, with the possibility that genotype switching may occur. In order to estimate the prevalence of mixed genotype 1a/3 infections in Scotland, a cohort of 512 samples was compiled and then screened using a genotype-specific nested PCR assay. Mixed genotype 1a/3 infections were found in 3.8% of samples tested, with a significantly higher prevalence rate of 6.7% (p<0.05) observed in individuals diagnosed with genotype 3 infections than genotype 1a (0.8%). An analysis of the samples using genotype-specific qPCR assays found that in two-thirds of samples tested, the minor strain contributed <1% of the total viral load. The potential of deep sequencing methods for the diagnosis of mixed genotype infections was assessed using two pan-genotypic PCR assays compatible with the Illumina MiSeq platform that were developed targeting the E1-E2 and NS5B regions of the virus. The E1-E2 assay detected 75% of the mixed genotype infections, proving to be more sensitive than the NS5B assay which identified only 25% of the mixed infections. Studies of sequence data and linked patient records also identified significantly more neurological disorders in genotype 3 patients. Evidence of distinctive dinucleotide expression within the genotypes was also uncovered. Taken together these findings raise interesting questions about the evolutionary history of the virus and indicate that there is still more to understand about the different genotypes. In an era where clinical medicine is frequently more personalised, the development of diagnostic methods for HCV providing increased patient stratification is increasingly important.
This project has shown that sequence-based genotyping methods can be highly discriminatory and informative, and their use should be encouraged in diagnostic laboratories. Mixed genotype infections were challenging to identify and current deep sequencing methods were not as sensitive or cost-effective as Sanger-based approaches in this study. More research is needed to evaluate the clinical prognosis of patients with mixed genotype infection and to develop clinical guidelines on their treatment.

Relevance:

80.00% 80.00%

Publisher:

Abstract:

Six hundred twenty-one samples from Portugal, the Cabo Verde archipelago, and Guinea-Bissau were typed for HLA-A, HLA-B, and HLA-DRB1 using the polymerase chain reaction–sequence-specific oligonucleotide probe (PCR-SSOP) method and the sequence-based typing (SBT) method to characterize and compare discrepancies between the two methods. Fifty-three alleles (4.27% of 1,242 chromosomes typed) identified by the PCR-SSOP method were not concordant with the results obtained using the SBT method. Thirty-four (2.74% of total chromosomes typed) PCR-SSOP mistyping results were discrepancies within the same allele group, and the 19 others (1.53% of total chromosomes typed) were nonconcordant results between different groups. PCR-SSOP allele mistyping is the result of interpretation difficulties arising from less intense, absent, or dubious hybridization patterns. Noncommercial PCR-SSOP procedures place high demands on the technicians' experience and on the availability of properly calibrated high-precision equipment.
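The percentages in this abstract follow directly from the counts it reports, taking two chromosomes per sample. The short check below is a sketch of that arithmetic only; the variable names are illustrative and nothing beyond the abstract's own figures is assumed.

```python
samples = 621
chromosomes = 2 * samples  # two chromosomes typed per sample

discordant = 53       # PCR-SSOP alleles not concordant with SBT
within_group = 34     # discrepancies within the same allele group
between_groups = 19   # discrepancies between different allele groups


def percent_of_chromosomes(count):
    """Express a count as a percentage of all chromosomes typed, to two decimals."""
    return round(100.0 * count / chromosomes, 2)


print(chromosomes)
print(percent_of_chromosomes(discordant))
print(percent_of_chromosomes(within_group))
print(percent_of_chromosomes(between_groups))
```

The two sub-categories sum to the 53 discordant alleles, so the within-group and between-group percentages together account for the overall discordance rate.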

Relevance:

80.00% 80.00%

Publisher:

Abstract:

Since the turn of the century, fisheries have maintained a steady growth rate, while aquaculture has experienced a more rapid expansion. Aquaculture can offer EU consumers more diverse, healthy, and sustainable food options, some of which are more popular elsewhere. To develop the sector, the EU is investing heavily. The EU supports innovative projects that promote the sustainable development of seafood sectors and food security. Priority 3 promotes sector development through innovation dissemination. This doctoral dissertation examined innovation transfer in the Italian aquaculture sector, specifically the adoption of innovative tools, using a theoretical model to better understand the complexity of these processes. The work focused on innovation adoption, emphasising that it is the end of a well-defined process. The Awareness Knowledge Adoption Implementation Effectiveness (AKAIE) model was created to better analyse post-adoption phases and to evaluate the implementation and impact of technology adoption. To identify AKAIE drivers and barriers, aquaculture actors were consulted. "Perceived complexity", understood as barriers to adoption that are strongly influenced by contextual factors (i.e. socio-economic, institutional, and cultural ones), was used to examine their perspectives. The new model will contextualise the sequence based on technologies, entrepreneur traits, corporate and institutional contexts, and complexity perception, the sequence's central node. Technology adoption can also be studied by examining complexity perceptions along the AKAIE sequence. This study proposes a new model to evaluate the diffusion of a given technology, giving policy makers the possibility of acting promptly throughout the process. The development of responsible policies for evaluating the effectiveness of innovation is more necessary than ever, especially to orient strategies and interventions in the face of major scenarios of change.

Relevance:

60.00% 60.00%

Publisher:

Abstract:

Conventional methods of gene prediction rely on the recognition of DNA-sequence signals, the coding potential or the comparison of a genomic sequence with a cDNA, EST, or protein database. Reasons for limited accuracy in many circumstances are species-specific training and the incompleteness of reference databases. Lately, comparative genome analysis has attracted increasing attention. Several analysis tools that are based on human/mouse comparisons are already available. Here, we present a program for the prediction of protein-coding genes, termed SGP-1 (Syntenic Gene Prediction), which is based on the similarity of homologous genomic sequences. In contrast to most existing tools, the accuracy of SGP-1 depends little on species-specific properties such as codon usage or the nucleotide distribution. SGP-1 may therefore be applied to nonstandard model organisms in vertebrates as well as in plants, without the need for extensive parameter training. In addition to predicting genes in large-scale genomic sequences, the program may be useful to validate gene structure annotations from databases. To this end, SGP-1 output also contains comparisons between predicted and annotated gene structures in HTML format. The program can be accessed via a Web server at http://soft.ice.mpg.de/sgp-1. The source code, written in ANSI C, is available on request from the authors.

Relevance:

50.00% 50.00%

Publisher:

Abstract:

Intergenic spacers of chloroplast DNA (cpDNA) are very useful in phylogenetic and population genetic studies of plant species, which motivates studying their potential integration in phylogenetic analysis. The non-coding trnE-trnT intergenic spacer of cpDNA was analyzed to assess the nucleotide sequence polymorphism of 16 Solanaceae species and to estimate its ability to contribute to the resolution of phylogenetic studies of this group. Multiple alignments of DNA sequences of the trnE-trnT intergenic spacer made it possible to identify nucleotide variability in this region, and the phylogeny was estimated by maximum parsimony and rooted with Ipomoea batatas (Convolvulaceae), a member of the most closely related family. In addition, this intergenic spacer was tested for its ability to differentiate taxonomic levels. For this purpose, species from four other families were analyzed and compared with Solanaceae species. Results confirmed polymorphism in the trnE-trnT region at different taxonomic levels.
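Identifying nucleotide variability from a multiple alignment amounts to scanning its columns for positions where the sequences differ. The sketch below assumes a gapped alignment given as equal-length strings with `-` as the gap character; the function name `variable_sites` and the toy alignment are hypothetical and are not taken from the study.

```python
def variable_sites(alignment, gap="-"):
    """Return 0-based column indices where aligned sequences differ, ignoring gaps."""
    length = len(alignment[0])
    if any(len(row) != length for row in alignment):
        raise ValueError("all aligned sequences must have the same length")
    sites = []
    for col in range(length):
        # Collect the distinct non-gap characters observed in this column.
        bases = {row[col] for row in alignment if row[col] != gap}
        if len(bases) > 1:
            sites.append(col)
    return sites


# Toy alignment for illustration only, not trnE-trnT data.
toy_alignment = [
    "ACGTAC-TA",
    "ACGTTCGTA",
    "ACCTACGTA",
]
polymorphic = variable_sites(toy_alignment)
print(polymorphic)
```

Columns that contain a gap in some sequences are counted as variable only if the remaining residues disagree; whether to treat indels as informative characters is a separate analysis choice that this sketch leaves aside.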

Relevance:

50.00% 50.00%

Publisher:

Abstract:

This article presents a statistical method for detecting recombination in DNA sequence alignments, which is based on combining two probabilistic graphical models: (1) a taxon graph (phylogenetic tree) representing the relationship between the taxa, and (2) a site graph (hidden Markov model) representing interactions between different sites in the DNA sequence alignments. We adopt a Bayesian approach and sample the parameters of the model from the posterior distribution with Markov chain Monte Carlo, using a Metropolis-Hastings and Gibbs-within-Gibbs scheme. The proposed method is tested on various synthetic and real-world DNA sequence alignments, and we compare its performance with the established detection methods RECPARS, PLATO, and TOPAL, as well as with two alternative parameter estimation schemes.

Relevance:

50.00% 50.00%

Publisher:

Abstract:

In this paper, a new way to think about, and to construct, pairwise as well as multiple alignments of DNA and protein sequences is proposed. Rather than forcing alignments to either align single residues or to introduce gaps by defining an alignment as a path running right from the source up to the sink in the associated dot-matrix diagram, we propose to consider alignments as consistent equivalence relations defined on the set of all positions occurring in all sequences under consideration. We also propose constructing alignments from whole segments exhibiting highly significant overall similarity rather than by aligning individual residues. Consequently, we present an alignment algorithm that (i) is based on segment-to-segment comparison instead of the commonly used residue-to-residue comparison and which (ii) avoids the well-known difficulties concerning the choice of appropriate gap penalties: gaps are not treated explicitly, but remain as those parts of the sequences that do not belong to any of the aligned segments. Finally, we discuss the application of our algorithm to two test examples and compare it with commonly used alignment methods. As a first example, we aligned a set of 11 DNA sequences coding for functional helix-loop-helix proteins. Though the sequences show only low overall similarity, our program correctly aligned all of the 11 functional sites, which was a unique result among the methods tested. As a by-product, the reading frames of the sequences were identified. Next, we aligned a set of ribonuclease H proteins and compared our results with alignments produced by other programs as reported by McClure et al. [McClure, M. A., Vasi, T. K. & Fitch, W. M. (1994) Mol. Biol. Evol. 11, 571-592]. Our program was one of the best scoring programs. However, in contrast to other methods, our protein alignments are independent of user-defined parameters.
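The segment-to-segment idea can be illustrated with a deliberately simplified pairwise sketch. It is not the authors' algorithm: it assumes that a "segment" is a gap-free run of identical residues on one diagonal of the dot matrix, scores segments by length rather than by statistical significance, and selects mutually consistent segments greedily. All function names, the `min_len` parameter, and the example sequences are hypothetical.

```python
def diagonal_segments(a, b, min_len=3):
    """Collect maximal gap-free runs of identical residues as (start_a, start_b, length)."""
    segments = []
    for offset in range(-(len(b) - 1), len(a)):  # offset = i - j identifies a diagonal
        i, j = max(offset, 0), max(-offset, 0)
        run = 0
        while i < len(a) and j < len(b):
            if a[i] == b[j]:
                run += 1
            else:
                if run >= min_len:
                    segments.append((i - run, j - run, run))
                run = 0
            i += 1
            j += 1
        if run >= min_len:  # a run that reaches the end of the diagonal
            segments.append((i - run, j - run, run))
    return segments


def consistent(s, t):
    """Two segments are consistent if one ends before the other starts in both sequences."""
    (ia, ja, la), (ib, jb, lb) = s, t
    return (ia + la <= ib and ja + la <= jb) or (ib + lb <= ia and jb + lb <= ja)


def select_segments(segments):
    """Greedily keep the longest segments that are consistent with those already kept."""
    chosen = []
    for seg in sorted(segments, key=lambda s: -s[2]):
        if all(consistent(seg, other) for other in chosen):
            chosen.append(seg)
    return sorted(chosen)


# Hypothetical sequences for illustration only.
seq_a = "TTACGTGGCCATT"
seq_b = "ACGTACCAT"
candidates = diagonal_segments(seq_a, seq_b)
aligned = select_segments(candidates)
print(aligned)
```

In this toy run the shorter candidate that crosses a longer one is discarded, and the residues between the two retained segments are simply left unaligned. That mirrors the abstract's point that gaps need no explicit penalty: they are whatever remains outside the aligned segments.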