964 results for Sequence analysis with oligonucleotid series


Relevance:

100.00% 100.00%

Publisher:

Abstract:

Variations in different types of genomes have been found to be responsible for a large degree of physical diversity, such as appearance and susceptibility to disease. Identification of genomic variations is difficult and can be facilitated through computational analysis of DNA sequences. Newly available technologies are able to sequence billions of DNA base pairs relatively quickly. These sequences can be used to identify variations within their specific genome but must first be mapped to a reference sequence. To align these sequences to a reference sequence, we require mapping algorithms that make use of approximate string matching and string indexing methods. To date, few mapping algorithms have been tailored to handle the massive amounts of output generated by newly available sequencing technologies. In order to handle this large amount of data, we modified the popular mapping software BWA to run in parallel using OpenMPI. Parallel BWA matches the efficiency of multithreaded BWA functions while providing efficient parallelism for BWA functions that do not currently support multithreading. Parallel BWA shows significant wall time speedup in comparison to multithreaded BWA on high-performance computing clusters, and will thus facilitate the analysis of genome sequencing data.
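
The combination of string indexing and approximate string matching mentioned in this abstract can be illustrated with a minimal seed-and-verify sketch. This is not BWA's method (BWA relies on a Burrows-Wheeler transform index); the k-mer hash index, the Hamming-distance check, and all function names below are assumptions chosen only to keep the example short.

```python
from collections import defaultdict


def build_kmer_index(reference, k):
    """Index every k-mer of the reference by its start positions."""
    index = defaultdict(list)
    for i in range(len(reference) - k + 1):
        index[reference[i:i + k]].append(i)
    return index


def hamming(a, b):
    """Number of mismatching positions between two equal-length strings."""
    return sum(1 for x, y in zip(a, b) if x != y)


def map_read(read, reference, index, k, max_mismatches):
    """Return sorted reference positions where the read aligns with at most
    max_mismatches substitutions (no insertions or deletions)."""
    hits = set()
    # Non-overlapping seeds: an exact seed hit proposes a candidate position.
    for offset in range(0, len(read) - k + 1, k):
        seed = read[offset:offset + k]
        for pos in index.get(seed, []):
            start = pos - offset
            if start < 0 or start + len(read) > len(reference):
                continue
            window = reference[start:start + len(read)]
            # Verify the candidate with an approximate (Hamming) comparison.
            if hamming(read, window) <= max_mismatches:
                hits.add(start)
    return sorted(hits)
```

Because the seeds do not overlap, a read split into s seeds is guaranteed to be found whenever it has fewer than s mismatches, since at least one seed must then match exactly.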

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Affiliation: Département de biochimie, Faculté de médecine, Université de Montréal

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Modern computer systems are plagued with stability and security problems: applications lose data, web servers are hacked, and systems crash under heavy load. Many of these problems or anomalies arise from rare program behavior caused by attacks or errors. A substantial percentage of web-based attacks are due to buffer overflows. Many methods have been devised to detect and prevent anomalous situations that arise from buffer overflows. The current state of the art in anomaly detection systems is relatively primitive and depends mainly on static code checking to handle buffer overflow attacks. For protection, Stack Guards and Heap Guards are also widely used. This dissertation proposes an anomaly detection system based on the frequencies of system calls in the system call trace. System call traces represented as frequency sequences are profiled using sequence sets. A sequence set is identified by the starting sequence and the frequencies of specific system calls. The deviation of the current input sequence from the corresponding normal profile in the frequency pattern of system calls is computed and expressed as an anomaly score. A simple Bayesian model is used for accurate detection. Experimental results are reported which show that the frequency of system calls, represented using sequence sets, captures the behavior of programs under normal conditions of usage. This captured behavior allows the system to detect anomalies with a low rate of false positives. Data are presented which show that a Bayesian network on frequency variations responds effectively to induced buffer overflows. It can also help administrators to detect deviations in program flow introduced by errors.
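
The idea of scoring a trace by how far its system call frequencies deviate from a normal profile can be sketched as follows. This is a simplified illustration, not the dissertation's sequence-set profiling or its Bayesian model: the profile here is just the mean relative frequency of each call over normal traces, and the score is an L1 distance. Function names and the example call names are hypothetical.

```python
from collections import Counter


def frequencies(trace):
    """Relative frequency of each system call in one trace."""
    counts = Counter(trace)
    total = len(trace)
    return {call: n / total for call, n in counts.items()}


def build_profile(normal_traces):
    """Average the per-trace relative frequencies over all normal traces."""
    profile = Counter()
    for trace in normal_traces:
        for call, freq in frequencies(trace).items():
            profile[call] += freq
    return {call: freq / len(normal_traces) for call, freq in profile.items()}


def anomaly_score(trace, profile):
    """L1 distance between observed and profiled frequencies (0 to 2)."""
    observed = frequencies(trace)
    calls = set(observed) | set(profile)
    return sum(abs(observed.get(c, 0.0) - profile.get(c, 0.0)) for c in calls)
```

A trace drawn from normal usage should score near 0, while a trace dominated by calls absent from the profile approaches the maximum of 2. A real detector would still need a threshold or a probabilistic model on top of this score.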

Relevance:

100.00% 100.00%

Publisher:

Abstract:

R (http://www.r-project.org/) is ‘GNU S’, a language and environment for statistical computing and graphics. Many classical and modern statistical techniques have been implemented in the base environment, but many more are supplied as packages. There are 8 standard packages, and many more are available through the CRAN family of Internet sites (http://cran.r-project.org). We started to develop a library of functions in R to support the analysis of mixtures, and our goal is a MixeR package for compositional data analysis that provides support for: operations on compositions (perturbation and power multiplication, subcomposition with or without residuals, centering of the data, computing Aitchison’s, Euclidean and Bhattacharyya distances, compositional Kullback-Leibler divergence, etc.); graphical presentation of compositions in ternary diagrams and tetrahedrons with additional features (barycenter, geometric mean of the data set, percentile lines, marking and coloring of subsets of the data set and their geometric means, notation of individual data in the set, ...); dealing with zeros and missing values in compositional data sets, with R procedures for simple and multiplicative replacement strategies; and time series analysis of compositional data. We’ll present the current status of MixeR development and illustrate its use on selected data sets.
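
Several of the operations on compositions listed in this abstract have short standard definitions, sketched below in Python for illustration. This is not MixeR code and does not reflect its interface; the function names are assumptions, and all parts are assumed strictly positive (zeros would need the replacement strategies the abstract mentions).

```python
import math


def closure(x, total=1.0):
    """Rescale positive parts so that they sum to `total`."""
    s = sum(x)
    return [total * v / s for v in x]


def perturb(x, y):
    """Perturbation: component-wise product followed by closure."""
    return closure([a * b for a, b in zip(x, y)])


def power(x, alpha):
    """Power multiplication: raise each part to `alpha`, then close."""
    return closure([a ** alpha for a in x])


def clr(x):
    """Centered log-ratio transform (parts must be strictly positive)."""
    logs = [math.log(v) for v in x]
    mean_log = sum(logs) / len(logs)
    return [value - mean_log for value in logs]


def aitchison_distance(x, y):
    """Aitchison distance: Euclidean distance between clr vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(clr(x), clr(y))))


def center(data):
    """Center of a data set: closed component-wise geometric mean."""
    n = len(data)
    parts = len(data[0])
    gm = [math.exp(sum(math.log(row[j]) for row in data) / n)
          for j in range(parts)]
    return closure(gm)
```

The Aitchison distance is scale invariant, so `[1, 2, 3]` and `[2, 4, 6]` are at distance zero: they describe the same composition.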

Relevance:

100.00% 100.00%

Publisher:

Abstract:

The objective of this paper is to introduce a different approach, called the ecological-longitudinal, to carrying out pooled analysis in time series ecological studies. Because it gives a larger number of data points and, hence, increases the statistical power of the analysis, this approach, unlike conventional ones, allows the incorporation of aspects such as random effect models, lags, and interactions between pollutants and between pollutants and meteorological variables, which are hard to implement in conventional approaches. Design: The approach is illustrated by providing quantitative estimates of the short-term effects of air pollution on mortality in three Spanish cities, Barcelona, Valencia and Vigo, for the period 1992–1994. Because the dependent variable was a count, a Poisson generalised linear model was first specified. Several modelling issues are worth mentioning. Firstly, because the relations between mortality and explanatory variables were nonlinear, cubic splines were used for covariate control, leading to a generalised additive model, GAM. Secondly, the effects of the predictors on the response were allowed to occur with some lag. Thirdly, the residual autocorrelation, because of imperfect control, was controlled for by means of an autoregressive Poisson GAM. Finally, the longitudinal design demanded the consideration of individual heterogeneity, requiring the use of mixed models. Main results: The estimates of the relative risks obtained from the individual analyses varied across cities, particularly those associated with sulphur dioxide. The highest relative risks corresponded to black smoke in Valencia. These estimates were higher than those obtained from the ecological-longitudinal analysis.
Relative risks estimated from this latter analysis were practically identical across cities, 1.00638 (95% confidence interval 1.0002, 1.0011) for a black smoke increase of 10 μg/m3 and 1.00415 (95% CI 1.0001, 1.0007) for an increase of 10 μg/m3 of sulphur dioxide. Because the statistical power is higher than in the individual analysis, more interactions were statistically significant, especially those among air pollutants and meteorological variables. Conclusions: Air pollutant levels were related to mortality in the three cities of the study, Barcelona, Valencia and Vigo. These results were consistent with similar studies in other cities and with other multicentric studies, and coherent with both previous individual studies for each city and multicentric studies for all three cities.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

The recently described cupin superfamily of proteins includes the germin and germin-like proteins, of which the cereal oxalate oxidase is the best characterized. This superfamily also includes seed storage proteins, in addition to several microbial enzymes and proteins with unknown function. All these proteins are characterized by the conservation of two central motifs, usually containing two or three histidine residues presumed to be involved with metal binding in the catalytic active site. The present study on the coding regions of Synechocystis PCC6803 identifies a previously unknown group of 12 related cupins, each containing the characteristic two-motif signature. This group comprises 11 single-domain proteins, ranging in length from 104 to 289 residues, and includes two phosphomannose isomerases and two epimerases involved in cell wall synthesis, a member of the pirin group of nuclear proteins, a possible transcriptional regulator, and a close relative of a cytochrome c551 from Rhodococcus. Additionally, there is a duplicated, two-domain protein that has close similarity to an oxalate decarboxylase from the fungus Collybia velutipes and that is a putative progenitor of the storage proteins of land plants.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

The nucleotide sequence of a 3 kb region immediately upstream of the sef operon of Salmonella enteritidis was determined. A 1230 base pair insertion sequence which shared sequence identity (> 75%) with members of the IS3 family was revealed. This element, designated IS1230, had terminal inverted repeats almost identical (90% identity) to those of Escherichia coli IS3 but, unlike other IS3-like sequences, lacked the two characteristic open reading frames which encode the putative transposase. S. enteritidis possessed only one copy of this insertion sequence, although Southern hybridisation analysis of restriction digests of genomic DNA revealed another, weakly hybridising fragment located in a region different from the sef operon, which suggested the presence of an IS1230 homologue. The distribution of IS1230 and IS1230-like elements was shown to be widespread amongst salmonellas, and the patterns of hybridising restriction fragments differed significantly between Salmonella serotypes. It is suggested that IS1230 has potential for development as a differential diagnostic tool.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

The genome of Salmonella enterica serovar Enteritidis was shown to possess three IS3-like insertion elements, designated IS1230A, B and C; each was cloned and its deoxynucleotide sequence determined. Mutations in elements IS1230A and B resulted in frameshifts in the open reading frames encoding a putative transposase, which would render it inactive. IS1230C was truncated at nucleotide 774 relative to IS1230B and therefore did not possess the 3' terminal inverted repeat. The three IS1230 derivatives were closely related to each other based on nucleotide sequence similarity. IS1230A was located adjacent to the sef operon, encoding SEF14 fimbriae, at minute 97 of the genome of S. Enteritidis. IS1230B was located adjacent to the umuDC operon at minute 42.5 on the genome, itself located near one terminus of an 815-kb genome inversion of S. Enteritidis relative to S. Typhimurium. IS1230C was located next to attB, the bacteriophage P22 attachment site, and proB, encoding gamma-glutamyl phosphate reductase. A truncated 3' remnant of IS1230, designated IS1230T, was identified in a clinical isolate of S. Typhimurium DT193 strain 2391. This element was located next to attB, adjacent to which were bacteriophage P22-like sequences. Southern hybridisation of total genomic DNA from eighteen phage types of S. Enteritidis and eighteen definitive types of S. Typhimurium showed similar, if not identical, restriction fragment profiles in the respective serovars when probed with IS1230A.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Several protease inhibitors have reached the world market in the last fifteen years, dramatically improving the quality of life and life expectancy of millions of HIV-infected patients. In spite of the tremendous research efforts in this area, resistant HIV-1 variants are constantly decreasing the ability of the drugs to efficiently inhibit the enzyme. As a consequence, inhibitors with novel frameworks are necessary to circumvent resistance to chemotherapy. In the present work, we have created 3D QSAR models for a series of 82 HIV-1 protease inhibitors employing the comparative molecular field analysis (CoMFA) method. Significant correlation coefficients were obtained (q² = 0.82 and r² = 0.97), indicating the internal consistency of the best model, which was then used to evaluate an external test set containing 17 compounds. The predicted values were in good agreement with the experimental results, showing the robustness of the model and its substantial predictive power for untested compounds. The final QSAR model and the information gathered from the CoMFA contour maps should be useful for the design of novel anti-HIV agents with improved potency.
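
The two coefficients quoted in this abstract measure different things: the fitted coefficient of determination describes how well the model reproduces its own training data, while the cross-validated coefficient is computed from predictions for compounds left out of the fit. The sketch below illustrates the distinction with leave-one-out cross-validation on a one-descriptor least-squares line. That model is an assumption made for brevity; CoMFA itself uses partial least squares on field descriptors, and the abstract does not state which cross-validation scheme was used.

```python
def fit_line(xs, ys):
    """Ordinary least-squares slope and intercept for one descriptor."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def total_sum_of_squares(ys):
    mean_y = sum(ys) / len(ys)
    return sum((y - mean_y) ** 2 for y in ys)


def r_squared(xs, ys):
    """Fitted coefficient: 1 - SS_res / SS_tot on the training data."""
    slope, intercept = fit_line(xs, ys)
    ss_res = sum((y - (slope * x + intercept)) ** 2 for x, y in zip(xs, ys))
    return 1.0 - ss_res / total_sum_of_squares(ys)


def q_squared_loo(xs, ys):
    """Leave-one-out coefficient: 1 - PRESS / SS_tot, where each point is
    predicted by a model fitted without it. SS_tot uses the overall mean."""
    press = 0.0
    for i in range(len(xs)):
        train_x = xs[:i] + xs[i + 1:]
        train_y = ys[:i] + ys[i + 1:]
        slope, intercept = fit_line(train_x, train_y)
        press += (ys[i] - (slope * xs[i] + intercept)) ** 2
    return 1.0 - press / total_sum_of_squares(ys)
```

For a least-squares fit the leave-one-out value cannot exceed the fitted value, which is consistent with the cross-validated coefficient being the lower of the two figures reported above.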

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)

Relevance:

100.00% 100.00%

Publisher:

Abstract:

A significant proportion of oral bacteria cannot be cultivated by existing techniques. In this regard, the microbiota from root canals still requires complementary characterization. The present study aimed at the identification of bacteria by sequence analysis of 16S rDNA clone libraries from seven endodontically infected teeth. Samples were collected from the root canals, subjected to PCR with universal 16S rDNA primers, cloned and partially sequenced. Clones were clustered into groups of closely related sequences (phylotypes) and identification to the species level was performed by comparative analysis with the GenBank, EMBL and DDBJ databases, according to a 98 % minimum identity. All samples were positive for bacteria and the number of phylotypes detected per subject varied from two to 14. The majority of taxa (65.2 %) belonged to the phylum Firmicutes of the Gram-positive bacteria, followed by Proteobacteria (10.9 %), Spirochaetes (4.3 %), Bacteroidetes (6.5 %), Actinobacteria (2.2 %) and Deferribacteres (2.2 %). A total of 46 distinct taxonomic units was identified. Four clones with low similarity to sequences previously deposited in the databases were sequenced to nearly full extent and were classified taxonomically as novel representatives of the order Clostridiales, including a putative novel species of Mogibacterium. The identification of novel phylotypes associated with endodontic infections suggests that the endodontium may still harbour a relevant proportion of uncharacterized taxa.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Background: The soil fungus Rhizoctonia solani anastomosis group 3 (AG-3) is an important pathogen of cultivated plants in the family Solanaceae. Isolates of R. solani AG-3 are taxonomically related based on the composition of cellular fatty acids, phylogenetic analysis of nuclear ribosomal DNA (rDNA) and beta-tubulin gene sequences, and somatic hyphal interactions. Despite the close genetic relationship among isolates of R. solani AG-3, field populations from potato and tobacco exhibit differences in their disease biology, dispersal ecology, host specialization, genetic diversity and population structure. However, little information is available on how field populations of R. solani AG-3 on potato and tobacco are shaped by population genetic processes. In this study, two field populations of R. solani AG-3 from potato in North Carolina (NC) and the Northern USA, and two field populations from tobacco in NC and Southern Brazil, were examined using sequence analysis of two cloned regions of nuclear DNA (pP42F and pP89). Results: Populations of R. solani AG-3 from potato were genetically diverse with a high frequency of heterozygosity, while limited or no genetic diversity was observed within the highly homozygous tobacco populations from NC and Brazil. Except for one isolate (TBR24), all NC and Brazilian isolates from tobacco shared the same alleles. No alleles were shared between potato and tobacco populations of R. solani AG-3, indicating no gene flow between them. To infer historical events that influenced current geographical patterns observed for populations of R. solani AG-3 from potato, we performed an analysis of molecular variance (AMOVA) and a nested clade analysis (NCA). Population differentiation was detected for locus pP89 (Φ_ST = 0.257, significant at P < 0.05) but not for locus pP42F (Φ_ST = 0.034, not significant).
Results based on NCA of the pP89 locus suggest that historical restricted gene flow is a plausible explanation for the geographical association of clades. Coalescent-based simulations of genealogical relationships between populations of R. solani AG-3 from potato and tobacco were used to estimate the amount and directionality of historical migration patterns in time, and the ages of mutations of populations. Low rates of historical movement of genes were observed between the potato and tobacco populations of R. solani AG-3. Conclusion: The two sister populations of the basidiomycete fungus R. solani AG-3 from potato and tobacco represent two genetically distinct and historically divergent lineages that have probably evolved within the range of their particular related Solanaceae hosts as sympatric species.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

We report the cloning and characterization of a long interspersed nucleotide element (LINE) from a cichlid fish, Oreochromis niloticus, and show the distribution of this element, called CiLINE2 for cichlid LINE2, in the chromosomes of this species. The identification of an open reading frame in CiLINE2 with amino acid sequence similarity to reverse transcriptases encoded by LINE-like elements in Caenorhabditis elegans, Platemys spixii, Schistosoma mansoni, Gallus gallus (CR1), Drosophila melanogaster (I factor), and Homo sapiens (LINE2), as well as the structure of the element, suggests that it is a member of this family of non-long terminal repeat-containing retrotransposons. A search of a DNA sequence database identified sequences similar to CiLINE2 in four other fish species (Haplotaxodon microlepis, Oreochromis mossambicus, Pseudotropheus zebra, and Fugu rubripes). Southern blot hybridization experiments revealed the presence of sequences similar to CiLINE2 in all Tilapiini species analyzed from the genera Oreochromis, Tilapia, and Sarotherodon, and gave an estimated copy number of about 5500 for the haploid genome of O. niloticus. Fluorescent in situ hybridization showed that CiLINE2 sequences were organized in small clusters dispersed over all chromosomes of O. niloticus, with a higher concentration near chromosome ends. Furthermore, the long arm of chromosome 1 was strikingly enriched with this sequence. The distribution of LINE2-related elements might underlie the difference in chromosome banding patterns observed between cold-blooded vertebrates and mammals.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

OBJECTIVE: To determine the current status of the literature regarding the clinical efficacy and complication rates of cryoablation vs radiofrequency ablation in the treatment of small renal tumours. METHODS: A review of the literature was conducted. There was no language restriction. Studies were obtained from the following sources: MEDLINE, EMBASE and LILACS. Inclusion criteria were (i) case series design with more than one case reported, (ii) use of cryoablation or radiofrequency ablation, (iii) patients with renal cell carcinoma and (iv) outcome reported as clinical efficacy. When available, we also quantified the complication rates from each included study. Proportional meta-analysis was performed on both outcomes with a random-effects model. The 95% confidence intervals were also calculated. RESULTS: Thirty-one case series (20 cryoablation, 11 radiofrequency ablation) met all inclusion criteria. The pooled proportion of clinical efficacy was 89% in cryoablation therapy from a total of 457 cases. There was statistically significant heterogeneity between these studies, showing the inconsistency of clinical and methodological aspects. The pooled proportion of clinical efficacy was 90% in radiofrequency ablation therapy from a total of 426 cases. There was no statistically significant heterogeneity between these studies. There was no statistically significant difference in complication rates between cryoablation and radiofrequency ablation. CONCLUSIONS: This review shows that both ablation therapies have similar efficacy and complication rates. Clinical trials with long-term data are urgently needed to establish which intervention is most suitable for the treatment of small renal masses.