929 resultados para source code analysis
Resumo:
PURPOSE: Although the central role of the immune system for tumor prognosis is generally accepted, a single robust marker is not yet available. EXPERIMENTAL DESIGN: On the basis of receiver operating characteristic analyses, robust markers were identified from a 60-gene B cell-derived metagene and analyzed in gene expression profiles of 1,810 breast cancer; 1,056 non-small cell lung carcinoma (NSCLC); 513 colorectal; and 426 ovarian cancer patients. Protein and RNA levels were examined in paraffin-embedded tissue of 330 breast cancer patients. The cell types were identified with immunohistochemical costaining and confocal fluorescence microscopy. RESULTS: We identified immunoglobulin κ C (IGKC) which as a single marker is similarly predictive and prognostic as the entire B-cell metagene. IGKC was consistently associated with metastasis-free survival across different molecular subtypes in node-negative breast cancer (n = 965) and predicted response to anthracycline-based neoadjuvant chemotherapy (n = 845; P < 0.001). In addition, IGKC gene expression was prognostic in NSCLC and colorectal cancer. No association was observed in ovarian cancer. IGKC protein expression was significantly associated with survival in paraffin-embedded tissues of 330 breast cancer patients. Tumor-infiltrating plasma cells were identified as the source of IGKC expression. CONCLUSION: Our findings provide IGKC as a novel diagnostic marker for risk stratification in human cancer and support concepts to exploit the humoral immune response for anticancer therapy. It could be validated in several independent cohorts and carried out similarly well in RNA from fresh frozen as well as from paraffin tissue and on protein level by immunostaining.
Resumo:
Advanced neuroinformatics tools are required for methods of connectome mapping, analysis, and visualization. The inherent multi-modality of connectome datasets poses new challenges for data organization, integration, and sharing. We have designed and implemented the Connectome Viewer Toolkit - a set of free and extensible open source neuroimaging tools written in Python. The key components of the toolkit are as follows: (1) The Connectome File Format is an XML-based container format to standardize multi-modal data integration and structured metadata annotation. (2) The Connectome File Format Library enables management and sharing of connectome files. (3) The Connectome Viewer is an integrated research and development environment for visualization and analysis of multi-modal connectome data. The Connectome Viewer's plugin architecture supports extensions with network analysis packages and an interactive scripting shell, to enable easy development and community contributions. Integration with tools from the scientific Python community allows the leveraging of numerous existing libraries for powerful connectome data mining, exploration, and comparison. We demonstrate the applicability of the Connectome Viewer Toolkit using Diffusion MRI datasets processed by the Connectome Mapper. The Connectome Viewer Toolkit is available from http://www.cmtk.org/
Resumo:
Introduction: The field of Connectomic research is growing rapidly, resulting from methodological advances in structural neuroimaging on many spatial scales. Especially progress in Diffusion MRI data acquisition and processing made available macroscopic structural connectivity maps in vivo through Connectome Mapping Pipelines (Hagmann et al, 2008) into so-called Connectomes (Hagmann 2005, Sporns et al, 2005). They exhibit both spatial and topological information that constrain functional imaging studies and are relevant in their interpretation. The need for a special-purpose software tool for both clinical researchers and neuroscientists to support investigations of such connectome data has grown. Methods: We developed the ConnectomeViewer, a powerful, extensible software tool for visualization and analysis in connectomic research. It uses the novel defined container-like Connectome File Format, specifying networks (GraphML), surfaces (Gifti), volumes (Nifti), track data (TrackVis) and metadata. Usage of Python as programming language allows it to by cross-platform and have access to a multitude of scientific libraries. Results: Using a flexible plugin architecture, it is possible to enhance functionality for specific purposes easily. Following features are already implemented: * Ready usage of libraries, e.g. for complex network analysis (NetworkX) and data plotting (Matplotlib). More brain connectivity measures will be implemented in a future release (Rubinov et al, 2009). * 3D View of networks with node positioning based on corresponding ROI surface patch. Other layouts possible. * Picking functionality to select nodes, select edges, get more node information (ConnectomeWiki), toggle surface representations * Interactive thresholding and modality selection of edge properties using filters * Arbitrary metadata can be stored for networks, thereby allowing e.g. group-based analysis or meta-analysis. * Python Shell for scripting. Application data is exposed and can be modified or used for further post-processing. * Visualization pipelines using filters and modules can be composed with Mayavi (Ramachandran et al, 2008). * Interface to TrackVis to visualize track data. Selected nodes are converted to ROIs for fiber filtering The Connectome Mapping Pipeline (Hagmann et al, 2008) processed 20 healthy subjects into an average Connectome dataset. The Figures show the ConnectomeViewer user interface using this dataset. Connections are shown that occur in all 20 subjects. The dataset is freely available from the homepage (connectomeviewer.org). Conclusions: The ConnectomeViewer is a cross-platform, open-source software tool that provides extensive visualization and analysis capabilities for connectomic research. It has a modular architecture, integrates relevant datatypes and is completely scriptable. Visit www.connectomics.org to get involved as user or developer.
Resumo:
Introduction: Les études GVvA (Genome-wide association ,-studies) ont identifié et confirmé plus de 20 gènes de susceptibilité au DT2 et ont contribué à mieux comprendre sa physiopathologie. L'hyperglycémie à jeun (GJ), et 2 heures après une HGPO (G2h) sont les deux mesures cliniques du diagnostic du DT2. Nous avons identifié récemment la G6P du pancréas (G6PC2) comme déterminant de la variabilité physiologique de la GJ puis Ie récepteur à la mélatonine (MTNRIB) qui de plus lie la régulation du rythme circadien au DT2. Dans ce travail nous avons étudié la génétique de la G2h à l'aide de l'approche GWA. Résultats: Nous avons réalisé une méta-analyse GWA dans le cadre de MAGIC (Meta-Analysis of Glucose and Insulin related traits Consortium) qui a inclus 9 études GWA (N=15'234). La réplication de 29 loci (N=6958-30 121, P < 10-5 ) a confirmé 5 nouveaux loci; 2 étant connus comme associés avec Ie DT2 (TCF7L2, P = 1,6 X 10-10 ) et la GJ (GCKR, p = 5,6 X 10-10 ); alors que GIPR (p= 5,2 X 10-12), VSP13C (p= 3,9 X 10-8) et ADCY5 (p = 1,11 X 10-15 ) sont inédits. GIPR code Ie récepteur au GIP (gastric inhibitory polypeptide) qui est sécrété par les ceIlules intestinales pour stimuler la sécrétion de l'insuline en réponse au glucose (l'effet incrétine). Les porteurs du variant GIPR qui augmente la G2h ont également un indice insulinogénique plus bas, (p= 1,0 X 10-17) mais ils ne présentent aucune modification de leur glycémie suite à une hyperglycémie provoquée par voie veineuse (p= 0,21). Ces résultats soutiennent un effet incrétine du locus GIPR qui expliquerait ~9,6 % de la variance total de ce trait. La biologie de ADCY5 et VPS13C et son lien avec l'homéostasie du glucose restent à élucider. GIPR n'est pas associé avec le risque de DT2 indiquant qu'il influence la variabilité physiologique de la G2h alors que le locus ADCY5 est associé avec le DT2 (OR = 1,11, P = 1,5 X 10-15). Conclusion: Notre étude démontre que l'étude de la G2h est une approche efficace d'une part pour la compréhension de la base génétique de la physiologie de ce trait clinique important et d'autre part pour identifier de nouveaux gènes de susceptibilité au DT2.
Resumo:
The general strategy to perform anti-doping analyses of urine samples starts with the screening for a wide range of compounds. This step should be fast, generic and able to detect any sample that may contain a prohibited substance while avoiding false negatives and reducing false positive results. The experiments presented in this work were based on ultra-high-pressure liquid chromatography coupled to hybrid quadrupole time-of-flight mass spectrometry. Thanks to the high sensitivity of the method, urine samples could be diluted 2-fold prior to injection. One hundred and three forbidden substances from various classes (such as stimulants, diuretics, narcotics, anti-estrogens) were analysed on a C(18) reversed-phase column in two gradients of 9min (including two 3min equilibration periods) for positive and negative electrospray ionisation and detected in the MS full scan mode. The automatic identification of analytes was based on retention time and mass accuracy, with an automated tool for peak picking. The method was validated according to the International Standard for Laboratories described in the World Anti-Doping Code and was selective enough to comply with the World Anti-Doping Agency recommendations. In addition, the matrix effect on MS response was measured on all investigated analytes spiked in urine samples. The limits of detection ranged from 1 to 500ng/mL, allowing the identification of all tested compounds in urine. When a sample was reported positive during the screening, a fast additional pre-confirmatory step was performed to reduce the number of confirmatory analyses.
Resumo:
The aim of this work is to present a new concept, called on-line desorption of dried blood spots (on-line DBS), allowing the direct analysis of a dried blood spot coupled to liquid chromatography mass spectrometry device (LC/MS). The system is based on an inox cell which can receive a blood sample (10 microL) previously spotted on a filter paper. The cell is then integrated into LC/MS system where the analytes are desorbed out of the paper towards a column switching system ensuring the purification and separation of the compounds before their detection on a single quadrupole MS coupled to atmospheric pressure chemical ionisation (APCI) source. The described procedure implies that no pretreatment is necessary in spite the analysis is based on whole blood sample. To ensure the applicability of the concept, saquinavir, imipramine, and verapamil were chosen. Despite the use of a small sampling volume and a single quadrupole detector, on-line DBS allowed the analyses of these three compounds over their therapeutic concentrations from 50 to 500 ng/mL for imipramine and verapamil and from 100 to 1000 ng/mL for saquinavir. Moreover, the method showed good repeatability with relative standard deviation (RSD) lower than 15% based on two levels of concentration (low and high). Function responses were found to be linear over the therapeutic concentration for each compound and were used to determine the concentrations of real patient samples for saquinavir. Comparison of the founded values with those of a validated method used routinely in a reference laboratory showed a good correlation between the two methods. Moreover, good selectivity was observed ensuring that no endogenous or chemical components interfered with the quantitation of the analytes. This work demonstrates the feasibility and applicability of the on-line DBS procedure for bioanalysis.
Resumo:
AbstractBACKGROUND: Scientists have been trying to understand the molecular mechanisms of diseases to design preventive and therapeutic strategies for a long time. For some diseases, it has become evident that it is not enough to obtain a catalogue of the disease-related genes but to uncover how disruptions of molecular networks in the cell give rise to disease phenotypes. Moreover, with the unprecedented wealth of information available, even obtaining such catalogue is extremely difficult.PRINCIPAL FINDINGS: We developed a comprehensive gene-disease association database by integrating associations from several sources that cover different biomedical aspects of diseases. In particular, we focus on the current knowledge of human genetic diseases including mendelian, complex and environmental diseases. To assess the concept of modularity of human diseases, we performed a systematic study of the emergent properties of human gene-disease networks by means of network topology and functional annotation analysis. The results indicate a highly shared genetic origin of human diseases and show that for most diseases, including mendelian, complex and environmental diseases, functional modules exist. Moreover, a core set of biological pathways is found to be associated with most human diseases. We obtained similar results when studying clusters of diseases, suggesting that related diseases might arise due to dysfunction of common biological processes in the cell.CONCLUSIONS: For the first time, we include mendelian, complex and environmental diseases in an integrated gene-disease association database and show that the concept of modularity applies for all of them. We furthermore provide a functional analysis of disease-related modules providing important new biological insights, which might not be discovered when considering each of the gene-disease association repositories independently. Hence, we present a suitable framework for the study of how genetic and environmental factors, such as drugs, contribute to diseases.AVAILABILITY: The gene-disease networks used in this study and part of the analysis are available at http://ibi.imim.es/DisGeNET/DisGeNETweb.html#Download
Resumo:
A raga is a collective melodic expression consisting of motifs. A raga can be identified using motifs which areunique to it. Motifs can be thought of as signature prosodic phrases. Different ragas may be composed of the same setof notes, or even phrases, but the prosody may be completely different. In this paper, an attempt is made to determinethe characteristic motifs that enable identification of a raga and distinguish between them. To determine this, motifs are first manually marked for a set of five popular raga by a professional musician. The motifs are then normalisedwith respect to the tonic. HMMs are trained for each motif using 80% of the data and about 20% are used for testing. The results do indicate that about 80% of the motifs are identified as belonging to a specific raga accurately.
Resumo:
For computational studies of makam music, it is essential to gather a list of characteristics that constitute a makam and explore corresponding quantitative features for automaticanalysis. This study is such an attempt where we address the characteristics for makams as defined in theory books and deduce a list of quantitative features. The target here is to evoke discussions on some measurable features other than providing complete analysis on thediscriminative potentials of each proposed feature which could be the subject of a few larger studies.
Resumo:
Gene set enrichment (GSE) analysis is a popular framework for condensing information from gene expression profiles into a pathway or signature summary. The strengths of this approach over single gene analysis include noise and dimension reduction, as well as greater biological interpretability. As molecular profiling experiments move beyond simple case-control studies, robust and flexible GSE methodologies are needed that can model pathway activity within highly heterogeneous data sets. To address this challenge, we introduce Gene Set Variation Analysis (GSVA), a GSE method that estimates variation of pathway activity over a sample population in an unsupervised manner. We demonstrate the robustness of GSVA in a comparison with current state of the art sample-wise enrichment methods. Further, we provide examples of its utility in differential pathway activity and survival analysis. Lastly, we show how GSVA works analogously with data from both microarray and RNA-seq experiments. GSVA provides increased power to detect subtle pathway activity changes over a sample population in comparison to corresponding methods. While GSE methods are generally regarded as end points of a bioinformatic analysis, GSVA constitutes a starting point to build pathway-centric models of biology. Moreover, GSVA contributes to the current need of GSE methods for RNA-seq data. GSVA is an open source software package for R which forms part of the Bioconductor project and can be downloaded at http://www.bioconductor.org.
Resumo:
Methods for the extraction of features from physiological datasets are growing needs as clinical investigations of Alzheimer’s disease (AD) in large and heterogeneous population increase. General tools allowing diagnostic regardless of recording sites, such as different hospitals, are essential and if combined to inexpensive non-invasive methods could critically improve mass screening of subjects with AD. In this study, we applied three state of the art multiway array decomposition (MAD) methods to extract features from electroencephalograms (EEGs) of AD patients obtained from multiple sites. In comparison to MAD, spectral-spatial average filter (SSFs) of control and AD subjects were used as well as a common blind source separation method, algorithm for multiple unknown signal extraction (AMUSE). We trained a feed-forward multilayer perceptron (MLP) to validate and optimize AD classification from two independent databases. Using a third EEG dataset, we demonstrated that features extracted from MAD outperformed features obtained from SSFs AMUSE in terms of root mean squared error (RMSE) and reaching up to 100% of accuracy in test condition. We propose that MAD maybe a useful tool to extract features for AD diagnosis offering great generalization across multi-site databases and opening doors to the discovery of new characterization of the disease.
Resumo:
BACKGROUND: Data on the association between subclinical thyroid dysfunction and fractures conflict. PURPOSE: To assess the risk for hip and nonspine fractures associated with subclinical thyroid dysfunction among prospective cohorts. DATA SOURCES: Search of MEDLINE and EMBASE (1946 to 16 March 2014) and reference lists of retrieved articles without language restriction. STUDY SELECTION: Two physicians screened and identified prospective cohorts that measured thyroid function and followed participants to assess fracture outcomes. DATA EXTRACTION: One reviewer extracted data using a standardized protocol, and another verified data. Both reviewers independently assessed methodological quality of the studies. DATA SYNTHESIS: The 7 population-based cohorts of heterogeneous quality included 50,245 participants with 1966 hip and 3281 nonspine fractures. In random-effects models that included the 5 higher-quality studies, the pooled adjusted hazard ratios (HRs) of participants with subclinical hyperthyroidism versus euthyrodism were 1.38 (95% CI, 0.92 to 2.07) for hip fractures and 1.20 (CI, 0.83 to 1.72) for nonspine fractures without statistical heterogeneity (P = 0.82 and 0.52, respectively; I2= 0%). Pooled estimates for the 7 cohorts were 1.26 (CI, 0.96 to 1.65) for hip fractures and 1.16 (CI, 0.95 to 1.42) for nonspine fractures. When thyroxine recipients were excluded, the HRs for participants with subclinical hyperthyroidism were 2.16 (CI, 0.87 to 5.37) for hip fractures and 1.43 (CI, 0.73 to 2.78) for nonspine fractures. For participants with subclinical hypothyroidism, HRs from higher-quality studies were 1.12 (CI, 0.83 to 1.51) for hip fractures and 1.04 (CI, 0.76 to 1.42) for nonspine fractures (P for heterogeneity = 0.69 and 0.88, respectively; I2 = 0%). LIMITATIONS: Selective reporting cannot be excluded. Adjustment for potential common confounders varied and was not adequately done across all studies. CONCLUSION: Subclinical hyperthyroidism might be associated with an increased risk for hip and nonspine fractures, but additional large, high-quality studies are needed. PRIMARY FUNDING SOURCE: Swiss National Science Foundation.
Resumo:
Organic geochemical and stable isotope investigations were performed to provide an insight into the depositional environments, origin and maturity of the organic matter in Jurassic and Cretaceous formations of the External Dinarides. A correlation is made among various parameters acquired from Rock-Eval, gas chromatography-mass spectrometry data and isotope analysis of carbonates and kerogen. Three groups of samples were analysed. The first group includes source rocks derived from Lower Jurassic limestone and Upper Jurassic ``Leme'' beds, the second from Upper Cretaceous carbonates, while the third group comprises oil seeps genetically connected with Upper Cretaceous source rocks. The carbon and oxygen isotopic ratios of all the carbonates display marine isotopic composition. Rock-Eval data and maturity parameter values derived from biomarkers define the organic matter of the Upper Cretaceous carbonates as Type I-S and Type II-S kerogen at the low stage of maturity up to entering the oil-generating window. Lower and Upper Jurassic source rocks contain early mature Type III mixed with Type IV organic matter. All Jurassic and Cretaceous potential source rock extracts show similarity in triterpane and sterane distribution. The hopane and sterane distribution pattern of the studied oil seeps correspond to those from Cretaceous source rocks. The difference between Cretaceous oil seeps and potential source rock extracts was found in the intensity and distribution of n-alkanes, as well as in the abundance of asphaltenes which is connected to their biodegradation stage. In the Jurassic and Cretaceous potential source rock samples a mixture of aromatic hydrocarbons with their alkyl derivatives were indicated, whereas in the oil seep samples extracts only asphaltenes were observed.
Resumo:
US Geological Survey (USGS) based elevation data are the most commonly used data source for highway hydraulic analysis; however, due to the vertical accuracy of USGS-based elevation data, USGS data may be too “coarse” to adequately describe surface profiles of watershed areas or drainage patterns. Additionally hydraulic design requires delineation of much smaller drainage areas (watersheds) than other hydrologic applications, such as environmental, ecological, and water resource management. This research study investigated whether higher resolution LIDAR based surface models would provide better delineation of watersheds and drainage patterns as compared to surface models created from standard USGS-based elevation data. Differences in runoff values were the metric used to compare the data sets. The two data sets were compared for a pilot study area along the Iowa 1 corridor between Iowa City and Mount Vernon. Given the limited breadth of the analysis corridor, areas of particular emphasis were the location of drainage area boundaries and flow patterns parallel to and intersecting the road cross section. Traditional highway hydrology does not appear to be significantly impacted, or benefited, by the increased terrain detail that LIDAR provided for the study area. In fact, hydrologic outputs, such as streams and watersheds, may be too sensitive to the increased horizontal resolution and/or errors in the data set. However, a true comparison of LIDAR and USGS-based data sets of equal size and encompassing entire drainage areas could not be performed in this study. Differences may also result in areas with much steeper slopes or significant changes in terrain. LIDAR may provide possibly valuable detail in areas of modified terrain, such as roads. Better representations of channel and terrain detail in the vicinity of the roadway may be useful in modeling problem drainage areas and evaluating structural surety during and after significant storm events. Furthermore, LIDAR may be used to verify the intended/expected drainage patterns at newly constructed highways. LIDAR will likely provide the greatest benefit for highway projects in flood plains and areas with relatively flat terrain where slight changes in terrain may have a significant impact on drainage patterns.