66 resultados para process data


Relevância:

30.00% 30.00%

Publicador:

Resumo:

BACKGROUND: Tumorigenesis is characterised by changes in transcriptional control. Extensive transcript expression data have been acquired over the last decade and used to classify prostate cancers. Prostate cancer is, however, a heterogeneous multifocal cancer and this poses challenges in identifying robust transcript biomarkers.

METHODS: In this study, we have undertaken a meta-analysis of publicly available transcriptomic data spanning datasets and technologies from the last decade and encompassing laser capture microdissected and macrodissected sample sets.

RESULTS: We identified a 33 gene signature that can discriminate between benign tissue controls and localised prostate cancers irrespective of detection platform or dissection status. These genes were significantly overexpressed in localised prostate cancer versus benign tissue in at least three datasets within the Oncomine Compendium of Expression Array Data. In addition, they were also overexpressed in a recent exon-array dataset as well a prostate cancer RNA-seq dataset generated as part of the The Cancer Genomics Atlas (TCGA) initiative. Biologically, glycosylation was the single enriched process associated with this 33 gene signature, encompassing four glycosylating enzymes. We went on to evaluate the performance of this signature against three individual markers of prostate cancer, v-ets avian erythroblastosis virus E26 oncogene homolog (ERG) expression, prostate specific antigen (PSA) expression and androgen receptor (AR) expression in an additional independent dataset. Our signature had greater discriminatory power than these markers both for localised cancer and metastatic disease relative to benign tissue, or in the case of metastasis, also localised prostate cancer.

CONCLUSION: In conclusion, robust transcript biomarkers are present within datasets assembled over many years and cohorts and our study provides both examples and a strategy for refining and comparing datasets to obtain additional markers as more data are generated.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

As an emerging hole-machining methodology, helical milling process has become increasingly popular in aeromaterials manufacturing research, especially in areas of aircraft structural parts, dies, and molds manufacturing. Helical milling process is highly demanding due to its complex tool geometry and the progressive material failure on the workpiece. This paper outlines the development of a 3D finite element model for helical milling hole of titanium alloy Ti-6Al-4V using commercial FE code ABAQUS/Explicit. The proposed model simulates the helical milling hole process by taking into account the damage initiation and evolution in the workpiece material. A contact model at the interface between end-mill bit and workpiece has been established and the process parameters specified. Furthermore, a simulation procedure is proposed to simulate different cutting processes with the same failure parameters. With this finite element model, a series of FEAs for machined titanium alloy have been carried out and results compared with laboratory experimental data. The effects of machining parameters on helical milling have been elucidated, and the capability and advantage of FE simulation on helical milling process have been well presented.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

We present a robust Dirichlet process for estimating survival functions from samples with right-censored data. It adopts a prior near-ignorance approach to avoid almost any assumption about the distribution of the population lifetimes, as well as the need of eliciting an infinite dimensional parameter (in case of lack of prior information), as it happens with the usual Dirichlet process prior. We show how such model can be used to derive robust inferences from right-censored lifetime data. Robustness is due to the identification of the decisions that are prior-dependent, and can be interpreted as an analysis of sensitivity with respect to the hypothetical inclusion of fictitious new samples in the data. In particular, we derive a nonparametric estimator of the survival probability and a hypothesis test about the probability that the lifetime of an individual from one population is shorter than the lifetime of an individual from another. We evaluate these ideas on simulated data and on the Australian AIDS survival dataset. The methods are publicly available through an easy-to-use R package.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The popularity of tri-axial accelerometer data loggers to quantify animal activity through the analysis of signature traces is increasing. However, there is no consensus on how to process the large data sets that these devices generate when recording at the necessary high sample rates. In addition, there have been few attempts to validate accelerometer traces with specific behaviours in non-domesticated terrestrial mammals.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

This study was conducted to explore the effect of different autoclave heating times (30, 60 and 90 min) on fatty acids supply and molecular stability in Brassica carinata seed. Multivariate spectral analyses and correlation analyses were also carried out in our study. The results showed that autoclaving treatments significantly decreased the total fatty acids content in a linear fashion in B. carinata seed as heating time increased. Reduced concentrations were also observed in C18:3n3, C20:1, C22:1n9, monounsaturated fatty acids (MUFA), polyunsaturated fatty acids (PUFA), omega 3 (ω-3) and 9 (ω-9) fatty acids. Correspondingly, the heated seeds showed dramatic reductions in all the peak intensities within lipid-related spectral regions. Results from agglomerative hierarchical cluster analysis (AHCA) and principal component analysis (PCA) indicated that the raw oilseed had completely different structural make-up from the autoclaved seeds in both CH3 and CH2 asymmetric and symmetric stretching region (ca. 2999–2800 cm−1) and lipid ester Cdouble bond; length as m-dashO carbonyl region (ca. 1787–1706 cm−1). However, the oilseeds heated for 30, 60 and 90 min were not grouped into separate classes or ellipses in all the lipid-related regions, indicating that there still exhibited similarities in lipid biopolymer conformations among autoclaved B. carinata seeds. Moreover, strong correlations between spectral information and fatty acid compositions observed in our study could imply that lipid-related spectral parameters might have a potential to predict some fatty acids content in oilseed samples, i.e. B. carinata. However, more data from large sample size and diverse range would be necessary and helpful to draw up a final conclusion.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The application of chemometrics in food science has revolutionized the field by allowing the creation of models able to automate a broad range of applications such as food authenticity and food fraud detection. In order to create effective and general models able to address the complexity of real life problems, a vast amount of varied training samples are required. Training dataset has to cover all possible types of sample and instrument variability. However, acquiring a varied amount of samples is a time consuming and costly process, in which collecting samples representative of the real world variation is not always possible, specially in some application fields. To address this problem, a novel framework for the application of data augmentation techniques to spectroscopic data has been designed and implemented. This is a carefully designed pipeline of four complementary and independent blocks which can be finely tuned depending on the desired variance for enhancing model's robustness: a) blending spectra, b) changing baseline, c) shifting along x axis, and d) adding random noise.
This novel data augmentation solution has been tested in order to obtain highly efficient generalised classification model based on spectroscopic data. Fourier transform mid-infrared (FT-IR) spectroscopic data of eleven pure vegetable oils (106 admixtures) for the rapid identification of vegetable oil species in mixtures of oils have been used as a case study to demonstrate the influence of this pioneering approach in chemometrics, obtaining a 10% improvement in classification which is crucial in some applications of food adulteration.