12 results for Re-ranking methods
in CentAUR: Central Archive University of Reading - UK
Abstract:
Background: Selecting the highest quality 3D model of a protein structure from a number of alternatives remains an important challenge in the field of structural bioinformatics. Many Model Quality Assessment Programs (MQAPs) have been developed which adopt various strategies in order to tackle this problem, ranging from the so-called "true" MQAPs capable of producing a single energy score based on a single model, to methods which rely on structural comparisons of multiple models or additional information from meta-servers. However, it is clear that no current method can separate the highest accuracy models from the lowest consistently. In this paper, a number of the top performing MQAP methods are benchmarked in the context of the potential value that they add to protein fold recognition. Two novel methods are also described: ModSSEA, which is based on the alignment of predicted secondary structure elements, and ModFOLD, which combines several true MQAP methods using an artificial neural network. Results: The ModSSEA method is found to be an effective model quality assessment program for ranking multiple models from many servers; however, further accuracy can be gained by using the consensus approach of ModFOLD. The ModFOLD method is shown to significantly outperform the true MQAPs tested and is competitive with methods which make use of clustering or additional information from multiple servers. Several of the true MQAPs are also shown to add value to most individual fold recognition servers by improving model selection, when applied as a post filter in order to re-rank models. Conclusion: MQAPs should be benchmarked appropriately for the practical context in which they are intended to be used. Clustering based methods are the top performing MQAPs where many models are available from many servers; however, they often do not add value to individual fold recognition servers when limited models are available.
Conversely, the true MQAP methods tested can often be used as effective post filters for re-ranking few models from individual fold recognition servers and further improvements can be achieved using a consensus of these methods.
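As a rough illustration of re-ranking a server's models with a consensus of several quality scores, the sketch below standardizes each method's scores and averages them. This is not ModFOLD itself, which the abstract says combines the true MQAP scores with an artificial neural network; a plain mean of z-scores is swapped in here for simplicity. The function names, the model identifiers and the score values are all hypothetical.

```python
from statistics import mean


def zscores(values):
    """Standardize one method's scores so different scales become comparable."""
    m = mean(values)
    sd = (sum((v - m) ** 2 for v in values) / len(values)) ** 0.5
    if sd == 0:
        return [0.0] * len(values)
    return [(v - m) / sd for v in values]


def consensus_rerank(models, score_table):
    """Re-rank models by the mean of standardized scores (higher = better).

    models:      model identifiers in the server's original order
    score_table: dict mapping a method name to one score per model
                 (assumed here: every method scores higher for better models)
    """
    standardized = [zscores(scores) for scores in score_table.values()]
    # One consensus value per model: average across methods.
    consensus = [mean(column) for column in zip(*standardized)]
    order = sorted(range(len(models)), key=lambda i: consensus[i], reverse=True)
    return [(models[i], consensus[i]) for i in order]


# Hypothetical scores from two quality assessment methods for three models.
ranked = consensus_rerank(
    ["model_1", "model_2", "model_3"],
    {"method_a": [0.2, 0.9, 0.5], "method_b": [10.0, 30.0, 20.0]},
)
print([name for name, _ in ranked])
```

Standardizing first keeps a method with a large numeric range from dominating the average. A learned combination, as described in the abstract, would replace the plain mean.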
Abstract:
Motivation: Modelling the 3D structures of proteins can often be enhanced if more than one fold template is used during the modelling process. However, in many cases, this may also result in poorer model quality for a given target or alignment method. There is a need for modelling protocols that can both consistently and significantly improve 3D models and provide an indication of when models might not benefit from the use of multiple target-template alignments. Here, we investigate the use of both global and local model quality prediction scores produced by ModFOLDclust2, to improve the selection of target-template alignments for the construction of multiple-template models. Additionally, we evaluate clustering the resulting population of multi- and single-template models for the improvement of our IntFOLD-TS tertiary structure prediction method. Results: We find that using accurate local model quality scores to guide alignment selection is the most consistent way to significantly improve models for each of the sequence to structure alignment methods tested. In addition, using accurate global model quality for re-ranking alignments, prior to selection, further improves the majority of multi-template modelling methods tested. Furthermore, subsequent clustering of the resulting population of multiple-template models significantly improves the quality of selected models compared with the previous version of our tertiary structure prediction method, IntFOLD-TS.
Abstract:
Web services are among the most fundamental technologies for implementing service-oriented architecture (SOA) based applications. One essential challenge is finding suitable candidates for a web service consumer's requests, which is normally called web service discovery. During web service discovery, the consumer is expected to find it hard to distinguish which candidates in the retrieval set are more suitable, thereby making selection of web services a critical task. In this paper, inspired by the idea that the service composition pattern is a significant hint for service selection, a personal profiling mechanism is proposed to improve ranking and recommendation performance. Since service selection is highly dependent on the composition process, personal knowledge is accumulated from previous service composition processes and shared via collaborative filtering, in which a set of users with similar interests is first identified. Afterwards, a web service re-ranking mechanism is employed for personalised recommendation. Experimental studies are conducted and analysed to demonstrate the promising potential of this research.
Abstract:
BACKGROUND AND AIM: The atherogenic potential of dietary derived lipids, chylomicrons (CM) and their remnants (CMr) is now becoming more widely recognised. To investigate factors affecting levels of CM and CMr and their importance in coronary heart disease risk it is essential to use a specific method of quantification. Two studies were carried out to investigate: (i) effects of increased daily intake of long chain n-3 polyunsaturated fatty acid (LC n-3 PUFA), and (ii) effects of increasing meal monounsaturated fatty acid (MUFA) content on the postprandial response of intestinally-derived lipoproteins. The contribution of the intestinally-derived lipoproteins to total lipaemia was assessed by triacylglycerol-rich lipoprotein (TRL) apolipoprotein B-48 (apo B-48) and retinyl ester (RE) concentrations. METHODS AND RESULTS: In a randomised controlled crossover trial (placebo vs LC n-3 PUFA) a mean daily intake of 1.4 g/day of LC n-3 PUFA failed to reduce fasting and postprandial triacylglycerol (TAG) response in 9 healthy male volunteers. Although the pattern and nature of the apo B-48 response was consistent with the TAG response following the two diets, the postprandial RE response differed on the LC n-3 PUFA diet with a lower early RE response and a delayed and more marked increase in RE in the late postprandial period compared with the control diet, but the differences did not reach levels of statistical significance. In the meal study there was no effect of MUFA/SFA content on the total lipaemic response to the meals nor on the contribution of intestinally derived lipoproteins evaluated as TAG, apo B-48 and RE responses in the TRL fraction. In both studies, the RE and apo B-48 measurements provided broadly similar information with respect to lack of effects of dietary or meal fatty acid composition and the presence of single or multiple peak responses.
However, the apo B-48 and RE measurements differed with respect to the timing of their peak responses, with a delayed RE peak, relative to apo B-48, of approximately 2-3 hours for the LC n-3 PUFA diet study (p = 0.002) and 1-1.5 hours for the meal MUFA/SFA study. CONCLUSIONS: It was concluded that there are limitations of using RE as a specific CM marker; apo B-48 quantitation was found to be a more appropriate method for CM and CMr quantitation. However, it was still considered of value to measure RE as it provided additional information regarding the incorporation of other constituents into the CM particle.
Abstract:
The steadily accumulating literature on technical efficiency in fisheries attests to the importance of efficiency as an indicator of fleet condition and as an object of management concern. In this paper, we extend previous work by presenting a Bayesian hierarchical approach that yields both efficiency estimates and, as a byproduct of the estimation algorithm, probabilistic rankings of the relative technical efficiencies of fishing boats. The estimation algorithm is based on recent advances in Markov Chain Monte Carlo (MCMC) methods—Gibbs sampling, in particular—which have not been widely used in fisheries economics. We apply the method to a sample of 10,865 boat trips in the US Pacific hake (or whiting) fishery during 1987–2003. We uncover systematic differences between efficiency rankings based on sample mean efficiency estimates and those that exploit the full posterior distributions of boat efficiencies to estimate the probability that a given boat has the highest true mean efficiency.
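The contrast between the two kinds of ranking can be sketched in a few lines. The code below assumes that retained MCMC draws of each boat's efficiency are already available, one dict per iteration. It is not the authors' estimation algorithm, and the boat labels and draw values are invented purely for illustration.

```python
from collections import Counter


def posterior_means(draws):
    """Mean efficiency per boat across the retained draws."""
    boats = list(draws[0])
    return {b: sum(d[b] for d in draws) / len(draws) for b in boats}


def prob_highest(draws):
    """Share of draws in which each boat has the highest efficiency."""
    wins = Counter(max(d, key=d.get) for d in draws)
    boats = list(draws[0])
    return {b: wins[b] / len(draws) for b in boats}


# Hypothetical draws: boat B is usually ahead but has one very low draw.
draws = [
    {"A": 0.80, "B": 0.90},
    {"A": 0.80, "B": 0.90},
    {"A": 0.80, "B": 0.90},
    {"A": 0.80, "B": 0.30},
]

means = posterior_means(draws)
probs = prob_highest(draws)
print(max(means, key=means.get))  # boat ranked first by mean efficiency
print(max(probs, key=probs.get))  # boat most often ranked first across draws
```

In this toy input the two criteria disagree: boat A has the higher mean efficiency, while boat B is the most efficient boat in three of the four draws. That is the kind of difference a ranking based on the full posterior distribution can expose.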
Abstract:
Comparative analyses of survival senescence by using life tables have identified generalizations including the observation that mammals senesce faster than similar-sized birds. These generalizations have been challenged because of limitations of life-table approaches and the growing appreciation that senescence is more than an increasing probability of death. Without using life tables, we examine senescence rates in annual individual fitness using 20 individual-based data sets of terrestrial vertebrates with contrasting life histories and body size. We find that senescence is widespread in the wild and equally likely to occur in survival and reproduction. Additionally, mammals senesce faster than birds because they have a faster life history for a given body size. By allowing us to disentangle the effects of two major fitness components our methods allow an assessment of the robustness of the prevalent life-table approach. Focusing on one aspect of life history - survival or recruitment - can provide reliable information on overall senescence.
Abstract:
Physical, cultural and biological methods for weed control have developed largely independently and are often concerned with weed control in different systems: physical and cultural control in annual crops and biocontrol in extensive grasslands. We discuss the strengths and limitations of four physical and cultural methods for weed control: mechanical, thermal, cutting, and intercropping, and the advantages and disadvantages of combining biological control with them. These physical and cultural control methods may increase soil nitrogen levels and alter microclimate at soil level; this may be of benefit to biocontrol agents, although physical disturbance to the soil and plant damage may be detrimental. Some weeds escape control by these methods; we suggest that these weeds may be controlled by biocontrol agents. It will be easiest to combine biological control with fire and cutting in grasslands; within arable systems it would be most promising to combine biological control (especially using seed predators and foliar pathogens) with cover-cropping, and mechanical weeding combined with foliar bacterial and possibly foliar fungal pathogens. We stress the need to consider the timing of application of combined control methods in order to cause least damage to the biocontrol agent, along with maximum damage to the weed, and to consider the wider implications of these different weed control methods.
Abstract:
Species-rich lowland hay meadows are of conservation importance for both plants and invertebrates; however, they have declined in area across Europe as a result of conversion to other land uses and management intensification. The re-creation of these grasslands on ex-arable land provides a valuable approach to increasing the extent and conservation value of this threatened habitat. Over a 3-year period a replicated block design was used to test whether introducing seeds promoted the re-creation of both plant and phytophagous beetle assemblages typical of a target hay meadow. Seeds were harvested from local hay meadows, and applied to experimental plots in the form of either green hay or brush harvesting seeds. Green hay spreading achieved the greatest success in re-creating plant and phytophagous beetle assemblages. While re-creation success increased over time for both taxa, for the phytophagous beetles the greatest increase in re-creation success relative to the establishment year also occurred where green hay was applied. We also considered the phytophagous beetles in terms of functional traits that describe host plant specificity, larval feeding location and dispersal. Phytophagous beetle functional trait composition was most similar to the target hay meadow assemblage where some form of seed addition was used, i.e. hay spreading or brush harvested seeds. This study identified the importance of introducing target plant species as a mechanism to promote the re-creation of phytophagous beetle communities. Seed addition methods (e.g. green hay spreading) are crucial to successful hay meadow re-creation.
Abstract:
This paper presents practical approaches to the problem of sample size re-estimation in the case of clinical trials with survival data when proportional hazards can be assumed. When data are readily available at the time of the review, on a full range of survival experiences across the recruited patients, it is shown that, as expected, performing a blinded re-estimation procedure is straightforward and can help to maintain the trial's pre-specified error rates. Two alternative methods for dealing with the situation where limited survival experiences are available at the time of the sample size review are then presented and compared. In this instance, extrapolation is required in order to undertake the sample size re-estimation. Worked examples, together with results from a simulation study are described. It is concluded that, as in the standard case, use of either extrapolation approach successfully protects the trial error rates. Copyright © 2012 John Wiley & Sons, Ltd.
Abstract:
The European Centre for Medium-Range Weather Forecasts (ECMWF) provides an aerosol re-analysis starting from year 2003 for the Monitoring Atmospheric Composition and Climate (MACC) project. The re-analysis assimilates total aerosol optical depth retrieved by the Moderate Resolution Imaging Spectroradiometer (MODIS) to correct for model departures from observed aerosols. The re-analysis therefore combines satellite retrievals with the full spatial coverage of a numerical model. Re-analysed products are used here to estimate the shortwave direct and first indirect radiative forcing of anthropogenic aerosols over the period 2003–2010, using methods previously applied to satellite retrievals of aerosols and clouds. The best estimate of globally-averaged, all-sky direct radiative forcing is −0.7 ± 0.3 W m⁻². The standard deviation is obtained by a Monte-Carlo analysis of uncertainties, which accounts for uncertainties in the aerosol anthropogenic fraction, aerosol absorption, and cloudy-sky effects. Further accounting for differences between the present-day natural and pre-industrial aerosols provides a direct radiative forcing estimate of −0.4 ± 0.3 W m⁻². The best estimate of globally-averaged, all-sky first indirect radiative forcing is −0.6 ± 0.4 W m⁻². Its standard deviation accounts for uncertainties in the aerosol anthropogenic fraction, and in cloud albedo and cloud droplet number concentration susceptibilities to aerosol changes. The distribution of first indirect radiative forcing is asymmetric and is bounded by −0.1 and −2.0 W m⁻². In order to decrease uncertainty ranges, better observational constraints on aerosol absorption and sensitivity of cloud droplet number concentrations to aerosol changes are required.