29 resultados para Repeated Fragments
em CentAUR: Central Archive University of Reading - UK
Resumo:
Recently, two approaches have been introduced that distribute the molecular fragment mining problem. The first approach applies a master/worker topology, the second approach, a completely distributed peer-to-peer system, solves the scalability problem due to the bottleneck at the master node. However, in many real world scenarios the participating computing nodes cannot communicate directly due to administrative policies such as security restrictions. Thus, potential computing power is not accessible to accelerate the mining run. To solve this shortcoming, this work introduces a hierarchical topology of computing resources, which distributes the management over several levels and adapts to the natural structure of those multi-domain architectures. The most important aspect is the load balancing scheme, which has been designed and optimized for the hierarchical structure. The approach allows dynamic aggregation of heterogenous computing resources and is applied to wide area network scenarios.
Resumo:
In real world applications sequential algorithms of data mining and data exploration are often unsuitable for datasets with enormous size, high-dimensionality and complex data structure. Grid computing promises unprecedented opportunities for unlimited computing and storage resources. In this context there is the necessity to develop high performance distributed data mining algorithms. However, the computational complexity of the problem and the large amount of data to be explored often make the design of large scale applications particularly challenging. In this paper we present the first distributed formulation of a frequent subgraph mining algorithm for discriminative fragments of molecular compounds. Two distributed approaches have been developed and compared on the well known National Cancer Institute’s HIV-screening dataset. We present experimental results on a small-scale computing environment.
Resumo:
Frequent pattern discovery in structured data is receiving an increasing attention in many application areas of sciences. However, the computational complexity and the large amount of data to be explored often make the sequential algorithms unsuitable. In this context high performance distributed computing becomes a very interesting and promising approach. In this paper we present a parallel formulation of the frequent subgraph mining problem to discover interesting patterns in molecular compounds. The application is characterized by a highly irregular tree-structured computation. No estimation is available for task workloads, which show a power-law distribution in a wide range. The proposed approach allows dynamic resource aggregation and provides fault and latency tolerance. These features make the distributed application suitable for multi-domain heterogeneous environments, such as computational Grids. The distributed application has been evaluated on the well known National Cancer Institute’s HIV-screening dataset.
Resumo:
The aim was to determine the fate of transgenic and endogenous plant DNA fragments in the blood, tissues, and digesta of broilers. Male broiler chicks (n = 24) were allocated at 1 day old to each of four treatment diets designated T1-T4. T1 and T2 contained the near isogenic nongenetically modified (GM) maize grain, whereas T3 and T4 contained GM maize grain [cry1a(b) gene]; T1 and T3 also contained the near isogenic non-GM soybean meal, whereas T2 and T4 contained GM soybean meal (cp4epsps gene). Four days prior to slaughter at 39-42 days old, 50% of the broilers on T2-T4 had the source(s) of GM ingredients replaced by their non-GM counterparts. Detection of specific DNA sequences in feed, tissue, and digesta samples was completed by polymerase chain reaction analysis. Seven primer pairs were used to amplify fragments (similar to 200 bp) from single copy genes (maize high mobility protein, soya lectin, and transgenes in the GM feeds) and multicopy genes (poultry mitochondrial cytochrome b, maize, and soya rubisco). There was no effect of treatment on the measured growth performance parameters. Except for a single detection of lectin (nontransgenic single copy gene; unsubstantiated) in the extracted DNA from one bursa tissue sample, there was no positive detection of any endogenous or transgenic single copy genes in either blood or tissue DNA samples. However, the multicopy rubisco gene was detected in a proportion of samples from all tissue types (23% of total across all tissues studied) and in low numbers in blood. Feed-derived DNA was found to survive complete degradation up to the large intestine. Transgenic DNA was detected in gizzard digesta but not in intestinal digesta 96 h after the last feeding of treatment diets containing a source of GM maize and/or soybean meal.
The sequential analysis of repeated binary responses: a score test for the case of three time points
Resumo:
In this paper a robust method is developed for the analysis of data consisting of repeated binary observations taken at up to three fixed time points on each subject. The primary objective is to compare outcomes at the last time point, using earlier observations to predict this for subjects with incomplete records. A score test is derived. The method is developed for application to sequential clinical trials, as at interim analyses there will be many incomplete records occurring in non-informative patterns. Motivation for the methodology comes from experience with clinical trials in stroke and head injury, and data from one such trial is used to illustrate the approach. Extensions to more than three time points and to allow for stratification are discussed. Copyright © 2005 John Wiley & Sons, Ltd.
Resumo:
The DcuS-DcuR system of Escherichia coli is a two-component sensor-regulator that controls gene expression in response to external C-4-dicarboxylates and citrate. The DcuS protein is particularly interesting since it contains two PAS domains, namely a periplasmic C-4-dicarboxylate-sensing PAS domain (PASp) and a cytosolic PAS domain (PASc) of uncertain function. For a study of the role of the PASc domain, three different fragments of DcuS were overproduced and examined: they were PASc-kinase, PASc, and kinase. The two kinase-domain-containing fragments were autophosphorylated by [gamma-P-32]ATP. The rate was not affected by fumarate or succinate, supporting the role of the PASp domain in C-4-dicarboxylate sensing. Both of the phosphorylated DcuS constructs were able to rapidly pass their phosphoryl groups to DcuR, and after phosphorylation, DcuR dephosphorylated rapidly. No prosthetic group or significant quantity of metal was found associated with either of the PASc-containing proteins. The DNA-binding specificity of DcuR was studied by use of the pure protein. It was found to be converted from a monomer to a dimer upon acetylphosphate treatment, and native polyacrylamide gel electrophoresis suggested that it can oligomerize. DcuR specifically bound to the promoters of the three known DcuSR-regulated genes (dctA, dcuB, and frdA), with apparent K(D)s of 6 to 32 muM for untreated DcuR and less than or equal to1 to 2 muM for the acetylphosphate-treated form. The binding sites were located by DNase I footprinting, allowing a putative DcuR-binding motif [tandemly repeated (T/A)(A/T)(T/C)(A/T)AA sequences] to be identified. The DcuR-binding sites of the dcuB, dctA, and frdA genes were located 27, 94, and 86 bp, respectively, upstream of the corresponding +1 sites, and a new promoter was identified for dcuB that responds to DcuR.
Resumo:
Natural exposure to prion disease is likely to occur throughout successive challenges, yet most experiments focus on single large doses of infectious material. We analyze the results from an experiment in which rodents were exposed to multiple doses of feed contaminated with the scrapie agent. We formally define hypotheses for how the doses combine in terms of statistical models. The competing hypotheses are that only the total dose of infectivity is important (cumulative model), doses act independently, or a general alternative that interaction between successive doses occurs (to raise or lower the risk of infection). We provide sample size calculations to distinguish these hypotheses. In the experiment, a fixed total dose has a significantly reduced probability of causing infection if the material is presented as multiple challenges, and as the time between challenges lengthens. Incubation periods are shorter and less variable if all material is consumed on one occasion. We show that the probability of infection is inconsistent with the hypothesis that each dose acts as a cumulative or independent challenge. The incubation periods are inconsistent with the independence hypothesis. Thus, although a trend exists for the risk of infection with prion disease to increase with repeated doses, it does so to a lesser degree than is expected if challenges combine independently or in a cumulative manner.
Resumo:
The effect of poly(ethylene glycol) PEG crystallization on P-sheet fibril formation is studied for a series of three peptide/PEG conjugates containing fragments modified from the amyloid P peptide, specifically KLVFF, FFKLVFF, and AAKLVFF. These are conjugated to PEG with M-n = 3300 g mol(-1). It is found, via small-angle X-ray scattering,X-ray diffraction, atomic force microscopy, and polarized optical microscopy, that PEG crystallinity in dried samples can disturb fibrillization, in particular cross-P amyloid structure formation, for the conjugate containing the weak fibrillizer KLVFF, whereas this is retained for the conjugates containing the stronger fibrillizers AAKLVFF and FFKLVFF. For these two samples, the alignment of peptide fibrils also drives the orientation of the attached PEG chains. Our results highlight the importance of the antagonistic effects of PEG crystallization and peptide fibril formation in PEG/peptide conjugates.
Resumo:
Ordered nanostructures are observed in the melt and solid state for a series of three peptide/PEG conjugates containing fragments of amyloid beta-peptides. These are conjugated to PEG with (M) over bar (n) = 3 300 g.mol(-1) and a melting temperature T-m = 45-50 degrees C. The morphology at room temperature is examined by AFM and POM. This shows spherulite formation for the weakly fibrillizing KLVFF-PEG sample but fibril formation for FFKLVFF-PEG. The fibrillization tendency of the latter is enhanced by multiple phenylalanine residues. Simultaneous SAXS and WAXS was used to investigate the morphology as a function of temperature. The secondary structure is probed by FTIR.
Resumo:
In our state of centralised control of the curriculum and high-stakes testing an examination subject's assessment objectives have become high profile. Some of the anomalous effects of this profile are shown in the teaching, question-setting, and marking of English literature. Glimpses of earlier times are revealed, all three secondary school key stages are considered, examination performances are discussed, and the views of beginning teachers about teaching to the test are sought.
Resumo:
The self-assembly of a hydrophobically modified fragment of the amyloid beta(A beta) peptide has been studied in methanol. The peptide FFKLVFF is based on A beta(16-20) extended at the N terminus by two phenylalanine residues. The formation of amyloid-type fibrils is confirmed by Congo Red staining, thioflavin T fluorescence and circular dichroism experiments. FTIR points to the formation of beta-sheet structures in solution and in dried films and suggests that aggregation occurs at low concentration and is not strongly affected by further increase in concentration, i.e. the peptide is a strong fibril-former in methanol. UV fluorescence experiments on unstained peptide and CD point to the importance of aromatic interactions between phenylalanine groups in driving aggregation into beta-sheets. The CD spectrum differs from that usually observed for beta-sheet assemblies formed by larger peptides or proteins and this is discussed for solutions in methanol and also trifluoroethanol. The fibril structure is imaged by transmission electron microscopy and scanning electron microscopy on dried samples and is confirmed by small-angle X-ray scattering experiments in solution.
Resumo:
We have been using Virus-Induced Gene Silencing (VIGS) to test the function of genes that are candidates for involvement in floral senescence. Although VIGS is a powerful tool for assaying the effects of gene silencing in plants, relatively few taxa have been studied using this approach, and most that have are in the Solanaceae. We typically use silencing of phytoene desaturase (PDS) in preliminary tests of the feasibility of using VIGS. Silencing this gene, whose product is involved in carotene biosynthesis, results in a characteristic photobleaching phenotype in the leaves. We have found that efficient silencing requires the use of fragments that are more than 90% homologous to the target gene. To simplify testing the effectiveness of VIGS in a range of species, we designed a set of universal primers to a region of the PDS gene that is highly conserved among species, and that therefore allows an investigator to isolate a fragment of the homologous PDS gene from the species of interest. We report the sequences of these primers and the results of VIGS experiments in horticultural species from the Asteraceae, Leguminosae, Balsaminaceae and Solanaceae.