36 resultados para TROPOPAUSE FOLD
Resumo:
Background: Selecting the highest quality 3D model of a protein structure from a number of alternatives remains an important challenge in the field of structural bioinformatics. Many Model Quality Assessment Programs (MQAPs) have been developed which adopt various strategies in order to tackle this problem, ranging from the so called "true" MQAPs capable of producing a single energy score based on a single model, to methods which rely on structural comparisons of multiple models or additional information from meta-servers. However, it is clear that no current method can separate the highest accuracy models from the lowest consistently. In this paper, a number of the top performing MQAP methods are benchmarked in the context of the potential value that they add to protein fold recognition. Two novel methods are also described: ModSSEA, which based on the alignment of predicted secondary structure elements and ModFOLD which combines several true MQAP methods using an artificial neural network. Results: The ModSSEA method is found to be an effective model quality assessment program for ranking multiple models from many servers, however further accuracy can be gained by using the consensus approach of ModFOLD. The ModFOLD method is shown to significantly outperform the true MQAPs tested and is competitive with methods which make use of clustering or additional information from multiple servers. Several of the true MQAPs are also shown to add value to most individual fold recognition servers by improving model selection, when applied as a post filter in order to re-rank models. Conclusion: MQAPs should be benchmarked appropriately for the practical context in which they are intended to be used. Clustering based methods are the top performing MQAPs where many models are available from many servers; however, they often do not add value to individual fold recognition servers when limited models are available. Conversely, the true MQAP methods tested can often be used as effective post filters for re-ranking few models from individual fold recognition servers and further improvements can be achieved using a consensus of these methods.
Resumo:
BACKGROUND: In order to maintain the most comprehensive structural annotation databases we must carry out regular updates for each proteome using the latest profile-profile fold recognition methods. The ability to carry out these updates on demand is necessary to keep pace with the regular updates of sequence and structure databases. Providing the highest quality structural models requires the most intensive profile-profile fold recognition methods running with the very latest available sequence databases and fold libraries. However, running these methods on such a regular basis for every sequenced proteome requires large amounts of processing power.In this paper we describe and benchmark the JYDE (Job Yield Distribution Environment) system, which is a meta-scheduler designed to work above cluster schedulers, such as Sun Grid Engine (SGE) or Condor. We demonstrate the ability of JYDE to distribute the load of genomic-scale fold recognition across multiple independent Grid domains. We use the most recent profile-profile version of our mGenTHREADER software in order to annotate the latest version of the Human proteome against the latest sequence and structure databases in as short a time as possible. RESULTS: We show that our JYDE system is able to scale to large numbers of intensive fold recognition jobs running across several independent computer clusters. Using our JYDE system we have been able to annotate 99.9% of the protein sequences within the Human proteome in less than 24 hours, by harnessing over 500 CPUs from 3 independent Grid domains. CONCLUSION: This study clearly demonstrates the feasibility of carrying out on demand high quality structural annotations for the proteomes of major eukaryotic organisms. Specifically, we have shown that it is now possible to provide complete regular updates of profile-profile based fold recognition models for entire eukaryotic proteomes, through the use of Grid middleware such as JYDE.
Resumo:
The hydrothermal reactions of Ni(NO3)(2).6H(2)O, disodium fumarate (fum) and 1,2-bis(4-pyridyl)ethane (bpe)/1,3-bis(4-pyridyl) propane (bpp) in aqueous-methanol medium yield one 3-D and one 2-D metal-organic hybrid material, [Ni(fum)(bpe)] (1) and [Ni(fum)(bpp)(H2O)] (2), respectively. Complex 1 possesses a novel unprecedented structure, the first example of an "unusual mode" of a five-fold distorted interpenetrated network with metal-ligand linkages where the four six-membered windows in each adamantane-type cage are different. The structural characterization of complex 2 evidences a buckled sheet where nickel ions are in a distorted octahedral geometry, with two carboxylic groups, one acting as a bis-chelate, the other as a bis-monodentate ligand. The metal ion completes the coordination sphere through one water molecule and two bpp nitrogens in cis position. Variable-temperature magnetic measurements of complexes 1 and 2 reveal the existence of very weak antiferromagnetic intramolecular interactions and/or the presence of single-ion zero field splitting (D) of isolated Ni-II ions in both the compounds. Experimentally, both the J parameters are close, comparable and very small. Considering zero-field splitting of Ni-II, the calculated D values are in agreement with values reported in the literature for Ni-II ions. Complex 3, [{Co(phen)}(2)(fum)(2)] (phen=1,10-phenanthroline) is obtained by diffusing methanolic solution of 1,10-phenanthroline on an aqueous layer of disodium fumarate and Co(NO3)(2).6H(2)O. It consists of dimeric Co-II(phen) units, doubly bridged by carboxylate groups in a distorted syn-syn fashion. These fumarate anions act as bis-chelates to form corrugated sheets. The 2D layer has a (4,4) topology, with the nodes represented by the centres of the dimers. The magnetic data were fitted ignoring the very weak coupling through the fumarate pathway and using a dimer model.
Resumo:
Recent literature has described a “transition zone” between the average top of deep convection in the Tropics and the stratosphere. Here transport across this zone is investigated using an offline trajectory model. Particles were advected by the resolved winds from the European Centre for Medium-Range Weather Forecasts reanalyses. For each boreal winter clusters of particles were released in the upper troposphere over the four main regions of tropical deep convection (Indonesia, central Pacific, South America, and Africa). Most particles remain in the troposphere, descending on average for every cluster. The horizontal components of 5-day trajectories are strongly influenced by the El Niño–Southern Oscillation (ENSO), but the Lagrangian average descent does not have a clear ENSO signature. Tropopause crossing locations are first identified by recording events when trajectories from the same release regions cross the World Meteorological Organization lapse rate tropopause. Most crossing events occur 5–15 days after release, and 30-day trajectories are sufficiently long to estimate crossing number densities. In a further two experiments slight excursions across the lapse rate tropopause are differentiated from the drift deeper into the stratosphere by defining the “tropopause zone” as a layer bounded by the average potential temperature of the lapse rate tropopause and the profile temperature minimum. Transport upward across this zone is studied using forward trajectories released from the lower bound and back trajectories arriving at the upper bound. Histograms of particle potential temperature (θ) show marked differences between the transition zone, where there is a slow spread in θ values about a peak that shifts slowly upward, and the troposphere below 350 K. There forward trajectories experience slow radiative cooling interspersed with bursts of convective heating resulting in a well-mixed distribution. In contrast θ histograms for back trajectories arriving in the stratosphere have two distinct peaks just above 300 and 350 K, indicating the sharp change from rapid convective heating in the well-mixed troposphere to slow ascent in the transition zone. Although trajectories slowly cross the tropopause zone throughout the Tropics, all three experiments show that most trajectories reaching the stratosphere from the lower troposphere within 30 days do so over the west Pacific warm pool. This preferred location moves about 30°–50° farther east in an El Niño year (1982/83) and about 30° farther west in a La Niña year (1988/89). These results could have important implications for upper-troposphere–lower-stratosphere pollution and chemistry studies.
Resumo:
A new tropopause definition, based on a flow-dependent blending of the traditional thermal tropopause with one based on potential vorticity, has been developed. The benefits of such a blending algorithm are most apparent in regions with synoptic scale fluctuations between tropical and extratropical airmasses. The properties of the local airmass determine the relative contributions to the location of the blended tropopause, rather than this being determined by a specified function of latitude. Global climatologies of tropopause height, temperature, potential temperature and zonal wind, based on European Centre for Medium-Range Weather Forecasts (ECMWF) reanalysis (ERA) ERA-Interim data, are presented for the period 1989-2007. Features of the seasonal-mean tropopause are discussed on a global scale, alongside a focus on selected monthly climatologies for the two high latitude regions and the tropical belt. The height differences between climatologies based on ERA-Interim and ERA-40 data are also presented. Key spatial and temporal features seen in earlier climatologies, based mainly on the World Meteorological Organization thermal tropopause definition, are reproduced with the new definition. Tropopause temperatures are consistent with those from earlier climatologies, despite some differences in height in the extratropics.
Resumo:
A new tropopause definition involving a flow-dependent blending of the traditional thermal tropopause with one based on potential vorticity has been developed and applied to the European Centre for Medium-Range Weather Forecasts (ECMWF) reanalyses (ERA), ERA-40 and ERA-Interim. Global and regional trends in tropopause characteristics for annual and solsticial seasonal means are presented here, with emphasis on significant results for the newer ERA-Interim data for 1989-2007. The global-mean tropopause is rising at a rate of 47 m decade−1 , with pressure falling at 1.0 hPa decade−1 , and temperature falling at 0.18 K decade−1 . The Antarctic tropopause shows decreasing heights,warming,and increasing westerly winds. The Arctic tropopause also shows a warming, but with decreasing westerly winds. In the tropics the trends are small, but at the latitudes of the sub-tropical jets they are almost double the global values. It is found that these changes are mainly concentrated in the eastern hemisphere. Previous and new metrics for the rate of broadening of the tropics, based on both height and wind, give trends in the range 0.9◦ decade−1 to 2.2◦ decade−1 . For ERA-40 the global height and pressure trends for the period 1979-2001 are similar: 39 m decade−1 and -0.8 hPa decade−1. These values are smaller than those found from the thermal tropopause definition with this data set, as was used in most previous studies.
Resumo:
Motivation: The ability of a simple method (MODCHECK) to determine the sequence–structure compatibility of a set of structural models generated by fold recognition is tested in a thorough benchmark analysis. Four Model Quality Assessment Programs (MQAPs) were tested on 188 targets from the latest LiveBench-9 automated structure evaluation experiment. We systematically test and evaluate whether the MQAP methods can successfully detect native-likemodels. Results: We show that compared with the other three methods tested MODCHECK is the most reliable method for consistently performing the best top model selection and for ranking the models. In addition, we show that the choice of model similarity score used to assess a model's similarity to the experimental structure can influence the overall performance of these tools. Although these MQAP methods fail to improve the model selection performance for methods that already incorporate protein three dimension (3D) structural information, an improvement is observed for methods that are purely sequence-based, including the best profile–profile methods. This suggests that even the best sequence-based fold recognition methods can still be improved by taking into account the 3D structural information.
Resumo:
A number of new and newly improved methods for predicting protein structure developed by the Jones–University College London group were used to make predictions for the CASP6 experiment. Structures were predicted with a combination of fold recognition methods (mGenTHREADER, nFOLD, and THREADER) and a substantially enhanced version of FRAGFOLD, our fragment assembly method. Attempts at automatic domain parsing were made using DomPred and DomSSEA, which are based on a secondary structure parsing algorithm and additionally for DomPred, a simple local sequence alignment scoring function. Disorder prediction was carried out using a new SVM-based version of DISOPRED. Attempts were also made at domain docking and “microdomain” folding in order to build complete chain models for some targets.
Resumo:
Motivation: In order to enhance genome annotation, the fully automatic fold recognition method GenTHREADER has been improved and benchmarked. The previous version of GenTHREADER consisted of a simple neural network which was trained to combine sequence alignment score, length information and energy potentials derived from threading into a single score representing the relationship between two proteins, as designated by CATH. The improved version incorporates PSI-BLAST searches, which have been jumpstarted with structural alignment profiles from FSSP, and now also makes use of PSIPRED predicted secondary structure and bi-directional scoring in order to calculate the final alignment score. Pairwise potentials and solvation potentials are calculated from the given sequence alignment which are then used as inputs to a multi-layer, feed-forward neural network, along with the alignment score, alignment length and sequence length. The neural network has also been expanded to accommodate the secondary structure element alignment (SSEA) score as an extra input and it is now trained to learn the FSSP Z-score as a measurement of similarity between two proteins. Results: The improvements made to GenTHREADER increase the number of remote homologues that can be detected with a low error rate, implying higher reliability of score, whilst also increasing the quality of the models produced. We find that up to five times as many true positives can be detected with low error rate per query. Total MaxSub score is doubled at low false positive rates using the improved method.
Resumo:
If secondary structure predictions are to be incorporated into fold recognition methods, an assessment of the effect of specific types of errors in predicted secondary structures on the sensitivity of fold recognition should be carried out. Here, we present a systematic comparison of different secondary structure prediction methods by measuring frequencies of specific types of error. We carry out an evaluation of the effect of specific types of error on secondary structure element alignment (SSEA), a baseline fold recognition method. The results of this evaluation indicate that missing out whole helix or strand elements, or predicting the wrong type of element, is more detrimental than predicting the wrong lengths of elements or overpredicting helix or strand. We also suggest that SSEA scoring is an effective method for assessing accuracy of secondary structure prediction and perhaps may also provide a more appropriate assessment of the “usefulness” and quality of predicted secondary structure, if secondary structure alignments are to be used in fold recognition.
Resumo:
What constitutes a baseline level of success for protein fold recognition methods? As fold recognition benchmarks are often presented without any thought to the results that might be expected from a purely random set of predictions, an analysis of fold recognition baselines is long overdue. Given varying amounts of basic information about a protein—ranging from the length of the sequence to a knowledge of its secondary structure—to what extent can the fold be determined by intelligent guesswork? Can simple methods that make use of secondary structure information assign folds more accurately than purely random methods and could these methods be used to construct viable hierarchical classifications?
Resumo:
The global behavior of the extratropical tropopause transition layer (ExTL) is investigated using O3, H2O, and CO measurements from the Atmospheric Chemistry Experiment Fourier Transform Spectrometer (ACE-FTS) on Canada’s SCISAT-1 satellite obtained between February 2004 and May 2007. The ExTL depth is derived using H2O-O3 and CO-O3 correlations. The ExTL top derived from H2O-O3 shows an increase from roughly 1–1.5 km above the thermal tropopause in the subtropics to 3–4 km (2.5–3.5 km) in the north (south) polar region, implying somewhat weaker tropospherestratosphere- transport in the Southern Hemisphere. The ExTL bottom extends ~1 km below the thermal tropopause, indicating a persistent stratospheric influence on the troposphere at all latitudes. The ExTL top derived from the CO-O3 correlation is lower, at 2 km or ~345 K (1.5 km or ~335 K) in the Northern (Southern) Hemisphere. Its annual mean coincides with the relative temperature maximum just above the thermal tropopause. The vertical CO gradient maximizes at the thermal tropopause, indicating a local minimum in mixing within the tropopause region. The seasonal changes in and the scales of the vertical H2O gradients show a similar pattern as the static stability structure of the tropopause inversion layer (TIL), which provides observational support for the hypothesis that H2O plays a radiative role in forcing and maintaining the structure of the TIL.
Resumo:
The spatial structure and phase velocity of tropopause disturbances localized around the subpolar jet in the Southern Hemisphere are investigated using 6-hourly European Centre for Medium-Range Weather Forecasts reanalysis data covering 15 yr (1979–93). The phase velocity and phase structure of the tropopause disturbances are in good agreement with those of an edge wave vertically trapped at the tropopause. However, the vertical distribution of the ratio of potential to kinetic energy exhibits maxima above and below the tropopause and a minimum around the tropopause, in contradiction to edge wave theory for which the ratio is unity throughout the troposphere and stratosphere. This difference in vertical structure between the observed tropopause disturbances and edge wave theory is attributed to the effects of a finite-depth tropopause together with the next-order corrections in Rossby number to quasigeostrophic dynamics
Resumo:
Recent high-resolution radiosonde climatologies have revealed a tropopause inversion layer (TIL) in the extratropics: temperature strongly increases just above a sharp local cold point tropopause. Here, it is asked to what extent a TIL exists in current general circulation models (GCMs) and meteorological analyses. Only a weak hint of a TIL exists in NCEP/NCAR reanalysis data. In contrast, the Canadian Middle Atmosphere Model (CMAM), a comprehensive GCM, exhibits a TIL of realistic strength. However, in data assimilation mode CMAM exhibits a much weaker TIL, especially in the Southern Hemisphere where only coarse satellite data are available. The discrepancy between the analyses and the GCM is thus hypothesized to be mainly due to data assimilation acting to smooth the observed strong curvature in temperature around the tropopause. This is confirmed in the reanalysis where the stratification around the tropopause exhibits a strong discontinuity at the start of the satellite era.