954 results for R Environment


Relevance: 60.00%

Abstract:

Matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF MS) has been widely used for the identification and classification of microorganisms based on their proteomic fingerprints. However, the use of MALDI-TOF MS in plant research has been very limited. In the present study, a first protocol is proposed for metabolic fingerprinting by MALDI-TOF MS using three different MALDI matrices, with subsequent multivariate data analysis by in-house algorithms implemented in the R environment, for the taxonomic classification of plants from different genera, families and orders. By merging the data acquired with different matrices and different ionization modes, and by carefully selecting algorithms and parameters, we demonstrate that a close taxonomic classification can be achieved based on plant metabolic fingerprints, with 92% similarity to the taxonomic classifications found in the literature. The present work therefore highlights the great potential of applying MALDI-TOF MS for the taxonomic classification of plants and, furthermore, provides a preliminary foundation for future research.

Relevance: 60.00%

Abstract:

Background: High-density tiling arrays and new sequencing technologies are generating rapidly increasing volumes of transcriptome and protein-DNA interaction data. Visualization and exploration of this data is critical to understanding the regulatory logic encoded in the genome by which the cell dynamically affects its physiology and interacts with its environment. Results: The Gaggle Genome Browser is a cross-platform desktop program for interactively visualizing high-throughput data in the context of the genome. Important features include dynamic panning and zooming, keyword search and open interoperability through the Gaggle framework. Users may bookmark locations on the genome with descriptive annotations and share these bookmarks with other users. The program handles large sets of user-generated data using an in-process database and leverages the facilities of SQL and the R environment for importing and manipulating data. A key aspect of the Gaggle Genome Browser is interoperability. By connecting to the Gaggle framework, the genome browser joins a suite of interconnected bioinformatics tools for analysis and visualization with connectivity to major public repositories of sequences, interactions and pathways. To this flexible environment for exploring and combining data, the Gaggle Genome Browser adds the ability to visualize diverse types of data in relation to its coordinates on the genome. Conclusions: Genomic coordinates function as a common key by which disparate biological data types can be related to one another. In the Gaggle Genome Browser, heterogeneous data are joined by their location on the genome to create information-rich visualizations yielding insight into genome organization, transcription and its regulation and, ultimately, a better understanding of the mechanisms that enable the cell to dynamically respond to its environment.
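The idea of genomic coordinates acting as a common key can be illustrated with a minimal sketch. This is not code from the Gaggle Genome Browser; the `Feature` record, the closed-interval convention, and the example gene and peak names are all assumptions made for illustration.

```python
from collections import namedtuple

# Hypothetical record type: one annotated interval on a chromosome.
Feature = namedtuple("Feature", "chrom start end label")

def overlaps(a, b):
    """True if two features share at least one position (closed intervals assumed)."""
    return a.chrom == b.chrom and a.start <= b.end and b.start <= a.end

def join_by_location(track_a, track_b):
    """Pair up labels from two tracks whose intervals overlap on the genome."""
    return [(a.label, b.label) for a in track_a for b in track_b if overlaps(a, b)]

# Made-up data: a gene track and a protein-DNA binding peak track.
genes = [Feature("chr1", 100, 500, "geneA"), Feature("chr1", 900, 1500, "geneB")]
peaks = [
    Feature("chr1", 450, 520, "peak1"),
    Feature("chr2", 100, 200, "peak2"),
    Feature("chr1", 1000, 1100, "peak3"),
]

pairs = join_by_location(genes, peaks)
print(pairs)
```

The nested loop is quadratic in the number of features; a tool handling large data sets would more plausibly use an indexed database query or an interval tree.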

Relevance: 60.00%

Abstract:

Patients presenting with active systemic lupus erythematosus (SLE) manifestations may exhibit distinct pathogenetic features compared with patients with inactive SLE. Also, cDNA microarrays may potentially discriminate the gene expression profile of a disease or disease variant. Therefore, we evaluated the expression profile of 4500 genes in peripheral blood lymphocytes (PBL) of SLE patients. We studied 11 patients with SLE (seven with active SLE and four with inactive SLE) and eight healthy controls. Total RNA was isolated from PBL, reverse transcribed into cDNA, and postlabeled with Cy3 fluorochrome. These probes were then hybridized to a glass slide cDNA microarray containing 4500 human IMAGE cDNA target sequences. An equimolar amount of total RNA from human cell lines served as reference. The microarray images were quantified, normalized, and analyzed using the R environment (ANOVA, significance analysis of microarrays, and cluster-tree view algorithms). Disease activity was assessed by the SLE disease activity index. Compared to the healthy controls, 104 genes in active SLE patients (80 repressed and 24 induced) and 52 genes in inactive SLE patients (31 induced and 21 repressed) were differentially expressed. The modulation of 12 genes, either induced or repressed, was found in both disease variants; however, each disease variant also had differential expression of genes not shared with the other. Taken together, these results indicate that the two lupus variants studied have common and unique differentially expressed genes. Although the biological significance of the differentially expressed genes discussed above has not been completely understood, they may serve as a platform to further explore the molecular basis of immune deregulation in SLE.

Relevance: 60.00%

Abstract:

Objectives: To evaluate the gene expression profile of fibroblasts from affected and non-affected skin of systemic sclerosis (SSc) patients and from controls. Materials and methods: Labeled cDNA from fibroblast cultures from forearm (affected) and axillary (non-affected) skin from six diffuse SSc patients, from three normal controls, and from MOLT-4/HEp-2/normal fibroblasts (reference pool) was probed in microarrays generated with 4193 human cDNAs from the IMAGE Consortium. Microarray images were converted into numerical data, and gene expression was calculated as the ratio between fibroblast cDNA (Cy5) and reference pool cDNA (Cy3) data and analyzed with the R environment/Aroma, Cluster, Tree View, and SAM software. Differential expression was confirmed by real time PCR for a set of selected genes. Results: Eighty-eight genes were up-regulated and 241 genes were down-regulated in SSc fibroblasts. Gene expression correlation was strong between affected and non-affected fibroblast samples from the same patient (r>0.8), moderate among fibroblasts from all patients (r=0.72) and among fibroblasts from all controls (r=0.70), and modest among fibroblasts from patients and controls (r=0.55). The differential expression was confirmed by real time PCR for all selected genes. Conclusions: Fibroblasts from affected and non-affected skin of SSc patients shared a similar abnormal gene expression profile, suggesting that the widespread molecular disturbance in SSc fibroblasts is more sensitive than histological and clinical alterations. Novel molecular elements potentially involved in SSc pathogenesis were identified.
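The two quantities used in this abstract, the per-gene Cy5/Cy3 expression ratio and the correlation r between samples, can be sketched as follows. The abstract states only that a ratio was used; taking the base-2 logarithm and using Pearson's r are assumptions here, and the intensity values are invented for illustration.

```python
import math

def log2_ratios(cy5, cy3):
    """Per-gene log2(Cy5 / Cy3); all intensities are assumed to be positive."""
    return [math.log2(sample / ref) for sample, ref in zip(cy5, cy3)]

def pearson_r(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

# Made-up spot intensities for five genes.
affected_cy5 = [1200.0, 300.0, 800.0, 150.0, 2000.0]
non_affected_cy5 = [1100.0, 350.0, 900.0, 140.0, 1800.0]
reference_cy3 = [600.0, 600.0, 400.0, 300.0, 500.0]

affected = log2_ratios(affected_cy5, reference_cy3)
non_affected = log2_ratios(non_affected_cy5, reference_cy3)
r = pearson_r(affected, non_affected)
print(round(r, 3))
```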

Relevance: 60.00%

Abstract:

Our package PoweR aims to make it easier to obtain or verify empirical power studies for goodness-of-fit tests. As such, it can be regarded as a computational tool for reproducible research, because it makes it very easy to reproduce (or detect errors in) simulation results already published in the literature. The package also makes it easy to design new simulation studies. Critical values and powers of many test statistics under a wide variety of alternative distributions are obtained very quickly and accurately using a C/C++ and R environment. One can even rely on the R package snow for parallel computation on a multicore processor. Results can be displayed as LaTeX tables or specialized graphs, which can be incorporated directly into your publications. This paper gives an overview of the main objectives and design principles, as well as strategies for adaptation and extension.
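An empirical power study of the kind described can be sketched with plain Monte Carlo simulation. The sketch below does not use the PoweR package or its interface; it assumes a Kolmogorov-Smirnov statistic against a standard normal null and a uniform alternative, chosen only for illustration, and all function names are hypothetical.

```python
import math
import random

def normal_cdf(x):
    """CDF of the standard normal distribution."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))

def ks_statistic(sample):
    """Kolmogorov-Smirnov distance between the sample and N(0, 1)."""
    xs = sorted(sample)
    n = len(xs)
    d = 0.0
    for i, x in enumerate(xs):
        f = normal_cdf(x)
        d = max(d, (i + 1) / n - f, f - i / n)
    return d

def critical_value(n, level, n_sim, rng):
    """Approximate (1 - level) quantile of the statistic, simulated under the null."""
    stats = sorted(
        ks_statistic([rng.gauss(0.0, 1.0) for _ in range(n)]) for _ in range(n_sim)
    )
    return stats[min(n_sim - 1, int((1.0 - level) * n_sim))]

def rejection_rate(n, crit, draw, n_sim, rng):
    """Fraction of simulated samples whose statistic exceeds the critical value."""
    hits = sum(
        ks_statistic([draw(rng) for _ in range(n)]) > crit for _ in range(n_sim)
    )
    return hits / n_sim

rng = random.Random(1)  # fixed seed so the study is reproducible
crit = critical_value(n=30, level=0.05, n_sim=2000, rng=rng)
size = rejection_rate(30, crit, lambda r: r.gauss(0.0, 1.0), 2000, rng)
power = rejection_rate(30, crit, lambda r: r.uniform(-3.0, 3.0), 2000, rng)
print(crit, size, power)
```

The rejection rate under the null should land near the nominal level, while the rate under the alternative is the empirical power. Fixing the seed is what makes a published table of such values checkable by others.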

Relevance: 60.00%

Abstract:

This paper proposes a method for identifying different partial discharge (PD) sources through the analysis of a collection of PD signals acquired with a PD measurement system. The method, robust and sensitive enough to cope with noisy data and external interference, combines the characterization of each signal in the collection with a clustering procedure, the CLARA algorithm. Several features are proposed for the characterization of the signals; the wavelet variances, the frequency estimated with the Prony method, and the energy are the most relevant for the performance of the clustering procedure. The result of the unsupervised classification is a set of clusters, each containing signals that are more similar to each other than to those in other clusters. The analysis of the classification results permits both the identification of different PD sources and the discrimination between original PD signals, reflections, noise and external interference. The methods and graphical tools detailed in this paper have been coded and published as a contributed package of the R environment under a GNU/GPL license.
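The general idea of CLARA (cluster random subsamples with k-medoids, then keep the medoids that fit the whole data set best) can be sketched as below. This is a simplified illustration, not the paper's R package: the swap search is a basic stand-in for PAM, and the two-dimensional feature vectors are synthetic placeholders for features such as wavelet variance or energy.

```python
import math
import random

def total_cost(points, medoids):
    """Sum of distances from each point to its nearest medoid."""
    return sum(min(math.dist(p, m) for m in medoids) for p in points)

def k_medoids(points, k, rng):
    """Greedy swap-based k-medoids on a small sample (simplified stand-in for PAM)."""
    medoids = rng.sample(points, k)
    best = total_cost(points, medoids)
    improved = True
    while improved:
        improved = False
        for i in range(k):
            for cand in points:
                if cand in medoids:
                    continue
                trial = medoids[:i] + [cand] + medoids[i + 1:]
                cost = total_cost(points, trial)
                if cost < best - 1e-12:  # accept only strict improvements
                    medoids, best, improved = trial, cost, True
    return medoids

def clara(points, k, n_samples=5, sample_size=40, seed=0):
    """Run k-medoids on random subsamples; keep the medoids best for ALL points."""
    rng = random.Random(seed)
    best_medoids, best_cost = None, math.inf
    for _ in range(n_samples):
        sample = rng.sample(points, min(sample_size, len(points)))
        medoids = k_medoids(sample, k, rng)
        cost = total_cost(points, medoids)
        if cost < best_cost:
            best_medoids, best_cost = medoids, cost
    labels = [
        min(range(k), key=lambda j: math.dist(p, best_medoids[j])) for p in points
    ]
    return best_medoids, labels

# Synthetic feature vectors: two well-separated groups of 100 signals each.
gen = random.Random(42)
features = [(gen.gauss(0, 1), gen.gauss(0, 1)) for _ in range(100)]
features += [(gen.gauss(10, 1), gen.gauss(10, 1)) for _ in range(100)]

medoids, labels = clara(features, k=2)
print(medoids)
```

Because the expensive medoid search runs only on small subsamples, the cost grows roughly linearly with the size of the full collection, which is the usual reason for choosing CLARA over plain PAM on large signal sets.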

Relevance: 60.00%

Abstract:

Forecasting the AC power output of a PV plant accurately is important both for plant owners and electric system operators. Two main categories of PV modeling are available: parametric and nonparametric. In this paper, a methodology using a nonparametric PV model is proposed, taking as inputs several forecasts of meteorological variables from a Numerical Weather Forecast model and actual AC power measurements of PV plants. The methodology was built upon the R environment and uses Quantile Regression Forests as a machine learning tool to forecast AC power with a confidence interval. Real data from five PV plants were used to validate the methodology, and results show that daily production is predicted with an absolute cvMBE lower than 1.3%.
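The abstract does not define cvMBE. A common convention, assumed here, is the mean bias error divided by the mean of the observations. A minimal sketch with invented daily production values:

```python
def cv_mbe(forecast, observed):
    """Mean bias error normalized by the mean observation (assumed definition)."""
    n = len(observed)
    mbe = sum(f - o for f, o in zip(forecast, observed)) / n
    return mbe / (sum(observed) / n)

# Made-up daily energy production values (arbitrary units).
observed = [100.0, 120.0, 80.0]
forecast = [101.0, 119.0, 82.0]

value = cv_mbe(forecast, observed)
print(f"{100 * value:.2f}%")
```

Under this definition positive and negative errors cancel, so a small absolute cvMBE indicates low systematic bias rather than low error on any individual day.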

Relevance: 60.00%

Abstract:

2000 Mathematics Subject Classification: Primary 62F35; Secondary 62P99

Relevance: 60.00%

Abstract:

Machine Learning techniques are very useful because they make it possible to maximize the use of information in real time. The Random Forests method can be counted among the most recent and best-performing Machine Learning techniques. Exploiting the characteristics and potential of this method, this doctoral thesis addresses two different case studies, which made it possible to develop two different forecasting models. The first case study focused on the main rivers of the Emilia-Romagna region, which are characterized by very short response times. The choice of these rivers was not accidental: in recent years, several flood events, largely of the "flash flood" type, have occurred in these basins. The second case study concerns the main sections of the Po river, where the propagation time of the flood wave is longer than in the watercourses of the first case study. Starting from a large amount of data, the first step was to select and define the input data according to the objectives to be achieved, for both case studies. For the model of the Emilia-Romagna rivers, only observed data were considered; for the Po river basin, by contrast, the observed data were supplemented with forecast data from the Mike11 NAM/HD modeling chain. Exploiting one of the main features of the Random Forests method, a probability of occurrence was estimated: this aspect is fundamental in both the technical phase and the decision-making phase of any civil protection intervention. Data processing and the derived data were produced in the R environment. At the end of the validation phase, the encouraging results obtained made it possible to incorporate the model developed in the first case study into the operational architecture of FEWS.

Relevance: 40.00%

Abstract:

This thesis deals with the creation, maintenance and management of added value in a networked product development environment. Using the theme interview method, the aim is to identify and describe the processes, practices and ways of working in which the case company has succeeded and in which added value has been created. A second central aim is to find the problematic areas in producing added value and to analyze why these areas are problematic. Based on the concepts of value, value chain and value network, and on examples from the reference literature, a theoretical framework is formed, and the beneficial ways of working and practices that create added value when invested in are described. Particularly in the information technology sector, networking and the value network are increasingly significant modes of product development, a result of the growth of horizontal cooperation, globalization and the rapid development of information technology. Key results are the need for a more unified, more process-like way of working and for reshaping business processes according to the requirements of the networked R&D environment. The results also emphasized the need to create better visibility and to manage activities according to the requirements of the new type of value network.