Background: Affymetrix GeneChip arrays are widely used for transcriptomic studies in a diverse range of species. Each gene is represented on a GeneChip array by a probe-set, consisting of up to 16 probe-pairs. Signal intensities across probe-pairs within a probe-set vary in part due to different physical hybridisation characteristics of individual probes with their target labelled transcripts. We have previously developed a technique to study the transcriptomes of heterologous species based on hybridising genomic DNA (gDNA) to a GeneChip array designed for a different species, and subsequently using only those probes with good homology. Results: Here we have investigated the effects of hybridising homologous species gDNA to study the transcriptomes of species for which the arrays have been designed. Genomic DNA from Arabidopsis thaliana and rice (Oryza sativa) were hybridised to the Affymetrix Arabidopsis ATH1 and Rice Genome GeneChip arrays respectively. Probe selection based on gDNA hybridisation intensity increased the number of genes identified as significantly differentially expressed in two published studies of Arabidopsis development, and optimised the analysis of technical replicates obtained from pooled samples of RNA from rice. Conclusion: This mixed physical and bioinformatics approach can be used to optimise estimates of gene expression when using GeneChip arrays.


High-density oligonucleotide (oligo) arrays are a powerful tool for transcript profiling. Arrays based on GeneChip® technology are amongst the most widely used, although GeneChip® arrays are currently available for only a small number of plant and animal species. Thus, we have developed a method to improve the sensitivity of high-density oligonucleotide arrays when applied to heterologous species and tested the method by analysing the transcriptome of Brassica oleracea L., a species for which no GeneChip® array is available, using a GeneChip® array designed for Arabidopsis thaliana (L.) Heynh. Genomic DNA from B. oleracea was labelled and hybridised to the ATH1-121501 GeneChip® array. Arabidopsis thaliana probe-pairs that hybridised to the B. oleracea genomic DNA on the basis of the perfect-match (PM) probe signal were then selected for subsequent B. oleracea transcriptome analysis using a .cel file parser script to generate probe mask files. The transcriptional response of B. oleracea to a mineral nutrient (phosphorus; P) stress was quantified using probe mask files generated for a wide range of gDNA hybridisation intensity thresholds. An example probe mask file generated with a gDNA hybridisation intensity threshold of 400 removed >68% of the available PM probes from the analysis but retained >96% of available A. thaliana probe-sets. Ninety-nine of these genes were then identified as significantly regulated under P stress in B. oleracea, including the homologues of P stress responsive genes in A. thaliana. Increasing the gDNA hybridisation intensity thresholds up to 500 for probe-selection increased the sensitivity of the GeneChip® array to detect regulation of gene expression in B. oleracea under P stress by up to 13-fold.
Our open-source software to create probe mask files is freely available at http://affymetrix.arabidopsis.info/xspecies/ and may be used to facilitate transcriptomic analyses of a wide range of plant and animal species in the absence of custom arrays.
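
The probe-selection step described in this abstract can be illustrated with a minimal sketch. It assumes the PM intensities from the gDNA hybridisation have already been read from the .cel file into a mapping from probe-set name to a list of PM intensities. The function names, the data layout, the example values and the strict ">" comparison are all hypothetical choices made for illustration; they are not the interface of the published parser script.

```python
from typing import Dict, List, Tuple


def build_probe_mask(
    gdna_pm: Dict[str, List[float]], threshold: float
) -> Dict[str, List[int]]:
    """Keep, per probe-set, the indices of probe-pairs whose gDNA PM
    intensity exceeds the threshold (assumed strict comparison).
    Probe-sets with no retained probe-pair are dropped."""
    mask: Dict[str, List[int]] = {}
    for probe_set, intensities in gdna_pm.items():
        kept = [i for i, value in enumerate(intensities) if value > threshold]
        if kept:
            mask[probe_set] = kept
    return mask


def retention(
    gdna_pm: Dict[str, List[float]], mask: Dict[str, List[int]]
) -> Tuple[float, float]:
    """Return (fraction of PM probes retained, fraction of probe-sets retained)."""
    total_probes = sum(len(values) for values in gdna_pm.values())
    kept_probes = sum(len(indices) for indices in mask.values())
    return kept_probes / total_probes, len(mask) / len(gdna_pm)


# Hypothetical gDNA PM intensities for three probe-sets of four probe-pairs.
example = {
    "set_a": [120.0, 450.0, 980.0, 60.0],
    "set_b": [30.0, 80.0, 75.0, 410.0],
    "set_c": [55.0, 90.0, 10.0, 200.0],
}
mask = build_probe_mask(example, threshold=400)
probe_fraction, set_fraction = retention(example, mask)
```

Raising the threshold in this sketch discards more probes while a probe-set survives as long as one probe-pair passes, which mirrors the pattern reported above of many probes removed but most probe-sets retained.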


Background: Hexaploid wheat is one of the most important cereal crops for human nutrition. Molecular understanding of the biology of the developing grain will assist the improvement of yield and quality traits for different environments. High quality transcriptomics is a powerful method to increase this understanding. Results: The transcriptome of developing caryopses from hexaploid wheat (Triticum aestivum, cv. Hereward) was determined using Affymetrix wheat GeneChip® oligonucleotide arrays which have probes for 55,052 transcripts. Of these, 14,550 showed significant differential regulation in the period between 6 and 42 days after anthesis (daa). Large changes in transcript abundance were observed which were categorised into distinct phases of differentiation (6-10 daa), grain fill (12-21 daa) and desiccation/maturation (28-42 daa) and were associated with specific tissues and processes. A similar experiment on developing caryopses grown with dry and/or hot environmental treatments was also analysed, using the profiles established in the first experiment to show that most environmental treatment effects on transcription were due to acceleration of development, but that a few transcripts were specifically affected. Transcript abundance profiles in both experiments for nine selected known and putative wheat transcription factors were independently confirmed by real time RT-PCR. These expression profiles confirm or extend our knowledge of the roles of the known transcription factors and suggest roles for the unknown ones. Conclusion: This transcriptome data will provide a valuable resource for molecular studies on wheat grain. It has been demonstrated how it can be used to distinguish general developmental shifts from specific effects of treatments on gene expression and to diagnose the probable tissue specificity and role of transcription factors.


Supplementation of diets with plant extracts such as Ginkgo biloba extract (EGb 761®) (for a definition, see editorial) for health and prevention of degenerative diseases is popular. However, it is often difficult to analyse the biological activities of plant extracts due to their complex nature and the possible synergistic and/or antagonistic effects of their components. Genome-wide expression monitoring with high-density oligonucleotide arrays provides one way to examine the molecular targets of plant extracts and may prove a useful tool in evaluating their therapeutic claims. Here, we briefly describe some of our work on the effect of EGb 761® on differential gene expression in relation to its potential anti-carcinogenic, photoprotective and neuromodulatory properties.


Background: There are compelling economic and environmental reasons to reduce our reliance on inorganic phosphate (Pi) fertilisers. Better management of Pi fertiliser applications is one option to improve the efficiency of Pi fertiliser use, whilst maintaining crop yields. Application rates of Pi fertilisers are traditionally determined from analyses of soil or plant tissues. Alternatively, diagnostic genes with altered expression under Pi-limiting conditions that suggest a physiological requirement for Pi fertilisation could be used to manage Pi fertiliser applications, and might be more precise than indirect measurements of soil or tissue samples. Results: We grew potato (Solanum tuberosum L.) plants hydroponically, under glasshouse conditions, to control their nutrient status accurately. Samples of total leaf RNA taken periodically after Pi was removed from the nutrient solution were labelled and hybridised to potato oligonucleotide arrays. A total of 1,659 genes were significantly differentially expressed following Pi withdrawal. These included genes that encode proteins involved in lipid, protein, and carbohydrate metabolism, characteristic of Pi-deficient leaves, and included potential novel roles for genes encoding patatin-like proteins in potatoes. The array data were analysed using a support vector machine algorithm to identify groups of genes that could predict the Pi status of the crop. These groups of diagnostic genes were tested using field-grown potatoes that had been either fertilised or left unfertilised. A group of 200 genes could correctly predict the Pi status of field-grown potatoes. Conclusions: This paper provides a proof-of-concept demonstration for using microarrays and class prediction tools to predict the Pi status of a field-grown potato crop. There is potential to develop this technology for other biotic and abiotic stresses in field-grown crops.
Ultimately, a better understanding of crop stresses may improve our management of the crop, improving the sustainability of agriculture.


Background: Expression microarrays are increasingly used to obtain large scale transcriptomic information on a wide range of biological samples. Nevertheless, there is still much debate on the best ways to process data, to design experiments and analyse the output. Furthermore, many of the more sophisticated mathematical approaches to data analysis in the literature remain inaccessible to much of the biological research community. In this study we examine ways of extracting and analysing a large data set obtained using the Agilent long oligonucleotide transcriptomics platform, applied to a set of human macrophage and dendritic cell samples. Results: We describe and validate a series of data extraction, transformation and normalisation steps which are implemented via a new R function. Analysis of replicate normalised reference data demonstrates that intra-array variability is small (only around 2% of the mean log signal), while inter-array variability from replicate array measurements has a standard deviation (SD) of around 0.5 log2 units (6% of mean). The common practice of working with ratios of Cy5/Cy3 signal offers little further improvement in terms of reducing error. Comparison to expression data obtained using Arabidopsis samples demonstrates that the large number of genes in each sample showing a low level of transcription reflects the real complexity of the cellular transcriptome. Multidimensional scaling is used to show that the processed data identify an underlying structure which reflects some of the key biological variables which define the data set. This structure is robust, allowing reliable comparison of samples collected over a number of years and collected by a variety of operators. Conclusions: This study outlines a robust and easily implemented pipeline for extracting, transforming, normalising and visualising transcriptomic array data from the Agilent expression platform.
The analysis is used to obtain quantitative estimates of the SD arising from experimental (non-biological) intra- and inter-array variability, and for a lower threshold for determining whether an individual gene is expressed. The study provides a reliable basis for further, more extensive studies of the systems biology of eukaryotic cells.
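
As an illustration of the kind of variability estimate quoted in this abstract, the sketch below computes a between-array SD in log2 units from replicate arrays and expresses it as a fraction of the mean log2 signal. It is one plausible calculation under stated assumptions, not the authors' R function: the function name, the input layout (one list of raw signals per replicate array, genes in the same order) and the example values are hypothetical.

```python
import math
import statistics
from typing import List, Tuple


def interarray_variability(replicates: List[List[float]]) -> Tuple[float, float]:
    """Return (mean per-gene SD in log2 units, that SD as a fraction of
    the mean log2 signal) across replicate arrays."""
    # Log-transform every array so that SDs are in log2 units.
    log_arrays = [[math.log2(signal) for signal in array] for array in replicates]
    # Regroup so each tuple holds one gene's values across all arrays.
    per_gene = list(zip(*log_arrays))
    gene_sds = [statistics.stdev(values) for values in per_gene]
    gene_means = [statistics.mean(values) for values in per_gene]
    mean_sd = statistics.mean(gene_sds)
    return mean_sd, mean_sd / statistics.mean(gene_means)


# Two hypothetical replicate arrays measuring the same two genes.
replicate_signals = [[256.0, 1024.0], [512.0, 2048.0]]
sd_log2, sd_fraction = interarray_variability(replicate_signals)
```

Reporting the SD both in log2 units and as a fraction of the mean log signal matches the two ways the variability figures are stated above; the averaging over genes is an assumption of this sketch.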


Turbulence statistics obtained by direct numerical simulations are analysed to investigate spatial heterogeneity within regular arrays of building-like cubical obstacles. Two different array layouts are studied, staggered and square, both at a packing density of $\lambda_p = 0.25$. The flow statistics analysed are mean streamwise velocity ($\overline{u}$), shear stress ($\overline{u'w'}$), turbulent kinetic energy ($k$) and dispersive stress fraction ($\tilde{u}\tilde{w}$). The spatial flow patterns and spatial distribution of these statistics in the two arrays are found to be very different. Local regions of high spatial variability are identified. The overall spatial variances of the statistics are shown to be generally very significant in comparison with their spatial averages within the arrays. Above the arrays the spatial variances as well as dispersive stresses decay rapidly to zero. The heterogeneity is explored further by separately considering six different flow regimes identified within the arrays, described here as: channelling region, constricted region, intersection region, building wake region, canyon region and front-recirculation region. It is found that the flow in the first three regions is relatively homogeneous, but that spatial variances in the latter three regions are large, especially in the building wake and canyon regions. The implication is that, in general, the flow immediately behind (and, to a lesser extent, in front of) a building is much more heterogeneous than elsewhere, even in the relatively dense arrays considered here. Most of the dispersive stress is concentrated in these regions. Considering the experimental difficulties of obtaining enough point measurements to form a representative spatial average, the error incurred by degrading the sampling resolution is investigated. It is found that a good estimate for both area and line averages can be obtained using a relatively small number of strategically located sampling points.
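
To make the dispersive stress and the sampling-error question concrete, here is a minimal sketch. It assumes the usual double-averaging decomposition, in which the dispersive stress is the spatial average of the product of the deviations of the time-mean velocities from their spatial averages; the abstract itself does not state this definition. It also assumes equally weighted sample points in the fluid. Function names and values are hypothetical.

```python
from statistics import mean
from typing import List, Sequence


def dispersive_stress(u_bar: Sequence[float], w_bar: Sequence[float]) -> float:
    """Spatial average of the product of spatial deviations of the
    time-mean streamwise (u_bar) and vertical (w_bar) velocities."""
    u_avg = mean(u_bar)
    w_avg = mean(w_bar)
    return mean([(u - u_avg) * (w - w_avg) for u, w in zip(u_bar, w_bar)])


def sampling_error(values: Sequence[float], indices: List[int]) -> float:
    """Difference between the average over a subset of sampling points
    and the average over all points."""
    return mean([values[i] for i in indices]) - mean(values)


# Hypothetical time-mean velocities at three points on one horizontal plane.
u_bar = [1.0, 2.0, 3.0]
w_bar = [0.5, 0.0, -0.5]
stress = dispersive_stress(u_bar, w_bar)
error_two_points = sampling_error(u_bar, [0, 2])
```

In this toy case the two outer points reproduce the full spatial average exactly, a simple instance of a small, well-placed set of sampling points giving a good estimate.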


The structure of turbulent flow over large roughness consisting of regular arrays of cubical obstacles is investigated numerically under constant pressure gradient conditions. Results are analysed in terms of first- and second-order statistics, by visualization of instantaneous flow fields and by conditional averaging. The accuracy of the simulations is established by detailed comparisons of first- and second-order statistics with wind-tunnel measurements. Coherent structures in the log region are investigated. Structure angles are computed from two-point correlations, and quadrant analysis is performed to determine the relative importance of Q2 and Q4 events (ejections and sweeps) as a function of height above the roughness. Flow visualization shows the existence of low-momentum regions (LMRs) as well as vortical structures throughout the log layer. Filtering techniques are used to reveal instantaneous examples of the association of the vortices with the LMRs, and linear stochastic estimation and conditional averaging are employed to deduce their statistical properties. The conditional averaging results reveal the presence of LMRs and regions of Q2 and Q4 events that appear to be associated with hairpin-like vortices, but a quantitative correspondence between the sizes of the vortices and those of the LMRs is difficult to establish; a simple estimate of the ratio of the vortex width to the LMR width gives a value that is several times larger than the corresponding ratio over smooth walls. The shape and inclination of the vortices and their spatial organization are compared to recent findings over smooth walls. Characteristic length scales are shown to scale linearly with height in the log region. 
Whilst there are striking qualitative similarities with smooth walls, there are also important differences in detail regarding: (i) structure angles and sizes and their dependence on distance from the rough surface; (ii) the flow structure close to the roughness; (iii) the roles of inflows into and outflows from cavities within the roughness; (iv) larger vortices on the rough wall compared to the smooth wall; (v) the effect of the different generation mechanism at the wall in setting the scales of structures.
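
The quadrant analysis mentioned in this abstract can be sketched as follows, using the conventional definitions: Q2 (ejections) has negative streamwise and positive vertical fluctuation, and Q4 (sweeps) has the opposite signs. The function name and the sample time series are hypothetical, and the sketch omits the hole-size threshold that is sometimes applied in such analyses.

```python
from typing import Dict, Sequence


def quadrant_fractions(u: Sequence[float], w: Sequence[float]) -> Dict[int, float]:
    """Fraction of the total u'w' contributed by each quadrant, where
    u' and w' are fluctuations about the series means."""
    n = len(u)
    u_mean = sum(u) / n
    w_mean = sum(w) / n
    totals = {1: 0.0, 2: 0.0, 3: 0.0, 4: 0.0}
    for u_i, w_i in zip(u, w):
        up, wp = u_i - u_mean, w_i - w_mean
        if up > 0 and wp > 0:
            quadrant = 1  # outward interaction
        elif up < 0 and wp > 0:
            quadrant = 2  # ejection
        elif up < 0 and wp < 0:
            quadrant = 3  # inward interaction
        elif up > 0 and wp < 0:
            quadrant = 4  # sweep
        else:
            continue  # a zero fluctuation contributes nothing to u'w'
        totals[quadrant] += up * wp
    stress = sum(totals.values())
    if stress == 0:
        raise ValueError("total u'w' is zero; fractions are undefined")
    return {quadrant: total / stress for quadrant, total in totals.items()}


# Hypothetical velocity samples at one height above the roughness.
u_series = [1.0, 3.0, 0.0, 4.0]
w_series = [1.0, -1.0, 2.0, -2.0]
fractions = quadrant_fractions(u_series, w_series)
```

Repeating the calculation at several heights would give the relative importance of Q2 and Q4 events as a function of height, which is the quantity the abstract refers to.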


The scattering of small amplitude water waves by a finite array of locally axisymmetric structures is considered. Regions of varying quiescent depth are included and their axisymmetric nature, together with a mild-slope approximation, permits an adaptation of well-known interaction theory which ultimately reduces the problem to a simple numerical calculation. Numerical results are given and effects due to regions of varying depth on wave loading and free-surface elevation are presented.


We advocate the use of systolic design techniques to create custom hardware for Custom Computing Machines. We have developed a hardware genetic algorithm based on systolic arrays to illustrate the feasibility of the approach. The architecture is independent of the lengths of chromosomes used and can be scaled in size to accommodate different population sizes. An FPGA prototype design can process 16 million genes per second.


Sulphate-reducing bacteria (SRB) and methanogenic archaea (MA) are important anaerobic terminal oxidisers of organic matter. However, we have little knowledge about the distribution and types of SRB and MA in the environment or the functional role they play in situ. Here we have utilised sediment slurry microcosms amended with ecologically significant substrates, including acetate and hydrogen, and specific functional inhibitors, to identify the important SRB and MA groups in two contrasting sites on a UK estuary. Substrate and inhibitor additions had significant effects on methane production and on acetate and sulphate consumption in the slurries. By using specific 16S-targeted oligonucleotide probes we were able to link specific SRB and MA groups to the use of the added substrates. Acetate consumption in the freshwater-dominated sediments was mediated by Methanosarcinales under low-sulphate conditions and Desulfobacter under the high-sulphate conditions that simulated a tidal incursion. In the marine-dominated sediments, acetate consumption was linked to Desulfobacter. Addition of trimethylamine, a non-competitive substrate for methanogenesis, led to a large increase in Methanosarcinales signal in marine slurries. Desulfobulbus was linked to non-sulphate-dependent H2 consumption in the freshwater sediments. The addition of sulphate to freshwater sediments inhibited methane production and reduced signal from probes targeted to Methanosarcinales and Methanomicrobiales, while the addition of molybdate to marine sediments inhibited Desulfobulbus and Desulfobacterium. These data complement our understanding of the ecophysiology of the organisms detected and make a firm connection between the capabilities of species, as observed in the laboratory, and their roles in the environment. (C) 2003 Federation of European Microbiological Societies. Published by Elsevier Science B.V. All rights reserved.


The synthesis of a selection of multivalent arrays of mannose mono- and disaccharides that are of potential use as anti-infective agents against enterobacterial infections is described. (C) 2003 Elsevier Ltd. All rights reserved.


The synthesis of modified nucleic acids has been the subject of much study ever since the structure of DNA was elucidated by Watson and Crick at Cambridge and Wilkins and Franklin at King's College over half a century ago. This review describes recent developments in the synthesis and application of these artificial nucleic acids, predominantly the phosphoramidites which allow for easy inclusion into oligonucleotides, and is divided into three separate sections. Firstly, modifications to the base portion will be discussed followed secondly by modifications to the sugar portion. Finally, changes in the type of nucleic acid linker will be discussed in the third section. Peptide Nucleic Acids (PNAs) are not discussed in this review as they represent a separate and large area of nucleic acid mimics.


We describe a high-level design method to synthesize multi-phase regular arrays. The method is based on deriving component designs using classical regular (or systolic) array synthesis techniques and composing these separately evolved component designs into a unified global design. Similarity transformations are applied to component designs in the composition stage in order to align data flow between the phases of the computations. Three transformations are considered: rotation, reflection and translation. The technique is aimed at the design of hardware components for high-throughput embedded systems applications and we demonstrate this by deriving a multi-phase regular array for the 2-D DCT algorithm which is widely used in many video communications applications.
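
The three similarity transformations named in this abstract can be illustrated on the grid coordinates of processing elements. This is only a geometric sketch under assumed conventions (2-D integer coordinates, a 90-degree anticlockwise rotation, reflection about the x-axis); the names and the example layout are hypothetical and do not reproduce the paper's design method.

```python
from typing import List, Tuple

Point = Tuple[int, int]
Matrix = Tuple[Tuple[int, int], Tuple[int, int]]

IDENTITY: Matrix = ((1, 0), (0, 1))
ROTATE_90: Matrix = ((0, -1), (1, 0))   # anticlockwise quarter turn
REFLECT_X: Matrix = ((1, 0), (0, -1))   # mirror about the x-axis


def transform(
    points: List[Point], matrix: Matrix, offset: Point = (0, 0)
) -> List[Point]:
    """Apply an integer linear map followed by a translation to each
    processing-element coordinate."""
    (a, b), (c, d) = matrix
    dx, dy = offset
    return [(a * x + b * y + dx, c * x + d * y + dy) for x, y in points]


# Hypothetical component design: three cells in a horizontal row.
row_cells = [(0, 0), (1, 0), (2, 0)]
rotated = transform(row_cells, ROTATE_90)                 # row becomes a column
shifted = transform(rotated, IDENTITY, offset=(3, 0))     # move next to the row
mirrored = transform([(1, 2)], REFLECT_X)
```

Here the rotated and translated column sits where the row's data would exit, which is the sense in which such transformations could align data flow between two phases.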