187 results for data validation
in the Biblioteca Digital da Produção Intelectual da Universidade de São Paulo (BDPI/USP)
Abstract:
Motivation: DNA assembly programs classically perform an all-against-all comparison of reads to identify overlaps, followed by a multiple sequence alignment and generation of a consensus sequence. If the aim is to assemble a particular segment, instead of a whole genome or transcriptome, a target-specific assembly is a more sensible approach. GenSeed is a Perl program that implements a seed-driven recursive assembly consisting of cycles comprising a similarity search, read selection and assembly. The iterative process results in a progressive extension of the original seed sequence. GenSeed was tested and validated on many applications, including the reconstruction of nuclear genes or segments, full-length transcripts, and extrachromosomal genomes. The robustness of the method was confirmed through the use of a variety of DNA and protein seeds, including short sequences derived from SAGE and proteome projects.
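The cycle described above (search for reads similar to the current sequence, select them, assemble, repeat) can be illustrated with a toy sketch. This is not GenSeed's implementation: the function name `extend_seed`, its parameters, and the use of exact suffix-prefix matching in place of a real similarity search and assembler are all assumptions made for illustration, and the sketch only extends the seed to the right.

```python
def extend_seed(seed, reads, min_overlap=4, max_cycles=10):
    """Toy seed-driven extension (hypothetical, not GenSeed itself).

    In each cycle, pick the unused read whose prefix has the longest exact
    match with the suffix of the current contig, and append the rest of it.
    """
    contig = seed
    remaining = list(reads)
    for _ in range(max_cycles):
        best = None  # (overlap_length, read)
        for read in remaining:
            # Longest usable overlap leaves at least one new base in the read.
            max_k = min(len(contig), len(read) - 1)
            for k in range(max_k, min_overlap - 1, -1):
                if contig.endswith(read[:k]):
                    if best is None or k > best[0]:
                        best = (k, read)
                    break  # longest overlap for this read found
        if best is None:
            break  # no read extends the contig: stop iterating
        k, read = best
        contig += read[k:]      # progressive extension of the seed
        remaining.remove(read)  # each read is used at most once
    return contig


# Hypothetical reads; the last one never overlaps and is ignored.
reads = ["GTACGGA", "CGGATTC", "TTTTTTT"]
contig = extend_seed("ACGTAC", reads)
```

With these made-up reads, the seed grows over two cycles and stops at the third, when no remaining read overlaps the contig end.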
Abstract:
The quantification of the available energy in the environment is important because it determines photosynthesis, evapotranspiration and, therefore, the final yield of crops. Instruments for measuring the energy balance are costly, and indirect estimation alternatives are desirable. This study assessed the performance of Deardorff's model during one cycle of a sugarcane crop in Piracicaba, State of São Paulo, Brazil, in comparison to the aerodynamic method. This mechanistic model simulates the energy fluxes (sensible heat, latent heat and net radiation) at three levels (atmosphere, canopy and soil) using only air temperature, relative humidity and wind speed measured at a reference level above the canopy, crop leaf area index, and some pre-calibrated parameters (canopy albedo, soil emissivity, atmospheric transmissivity and hydrological characteristics of the soil). The analysis was made for different time scales, insolation conditions and seasons (spring, summer and autumn). Analyzing all data at 15-minute intervals, the model performed well for net radiation simulation under different insolation conditions and seasons. The latent heat flux in the atmosphere and the sensible heat flux in the atmosphere did not differ from the data obtained with the aerodynamic method during the autumn. The sensible heat flux in the soil was poorly simulated by the model due to the poor performance of the soil water balance method. In general, Deardorff's model improved the flux simulations in comparison to the aerodynamic method when more insolation was available in the environment.
Abstract:
To obtain accurate and reliable gene expression results, it is essential that quantitative real-time RT-PCR (qRT-PCR) data are normalized with appropriate reference genes. The current exponential increase in postgenomic studies on the honey bee, Apis mellifera, makes the standardization of qRT-PCR results an important task for ongoing community efforts. To this end, we selected four candidate reference genes (actin, ribosomal protein 49, elongation factor 1-alpha, tbp-association factor) and used three software-based approaches (geNorm, BestKeeper and NormFinder) to evaluate the suitability of these genes as endogenous controls. Their expression was examined during honey bee development, in different tissues, and after juvenile hormone exposure. Furthermore, the importance of choosing an appropriate reference gene was investigated for two developmentally regulated target genes. The results led us to consider all four candidate genes suitable for normalization in A. mellifera. However, each condition evaluated in this study revealed a specific set of genes as the most appropriate ones.
Abstract:
Background: The inherent complexity of statistical methods and clinical phenomena compels researchers with diverse domains of expertise to work in interdisciplinary teams, where none of them has complete knowledge of their counterpart's field. As a result, knowledge exchange may often be characterized by miscommunication leading to misinterpretation, ultimately resulting in errors in research and even clinical practice. Although communication has a central role in interdisciplinary collaboration and miscommunication can have a negative impact on research processes, to the best of our knowledge, no study has yet explored how data analysis specialists and clinical researchers communicate over time. Methods/Principal Findings: We conducted a qualitative analysis of encounters between clinical researchers and data analysis specialists (epidemiologist, clinical epidemiologist, and data mining specialist). These encounters were recorded and systematically analyzed using a grounded theory methodology for extraction of emerging themes, followed by data triangulation and analysis of negative cases for validation. A policy analysis was then performed using a system dynamics methodology, looking for potential interventions to improve this process. Four major emerging themes were found. Definitions using lay language were frequently employed as a way to bridge the language gap between the specialties. Thought experiments presented a series of "what if" situations that helped clarify how the method or information from the other field would behave if exposed to alternative situations, ultimately aiding in explaining their main objective. Metaphors and analogies were used to translate concepts across fields, from the unfamiliar to the familiar. Prolepsis was used to anticipate study outcomes, thus helping specialists understand the current context based on an understanding of their final goal.
Conclusion/Significance: The communication between clinical researchers and data analysis specialists presents multiple challenges that can lead to errors.
Abstract:
Melanoma is a highly aggressive and therapy-resistant tumor for which the identification of specific markers and therapeutic targets is highly desirable. We describe here the development and use of a bioinformatic pipeline tool, made publicly available under the name of EST2TSE, for the in silico detection of candidate genes with tissue-specific expression. Using this tool we mined the human EST (Expressed Sequence Tag) database for sequences derived exclusively from melanoma. We found 29 UniGene clusters of multiple ESTs with the potential to predict novel genes with melanoma-specific expression. Using a diverse panel of human tissues and cell lines, we validated the expression of a subset of three previously uncharacterized genes (clusters Hs.295012, Hs.518391, and Hs.559350) to be highly restricted to melanoma/melanocytes and named them RMEL1, 2 and 3, respectively. Expression analysis in nevi, primary melanomas, and metastatic melanomas revealed RMEL1 as a novel melanocytic lineage-specific gene up-regulated during melanoma development. RMEL2 expression was restricted to melanoma tissues and glioblastoma. RMEL3 showed strong up-regulation in nevi and was lost in metastatic tumors. Interestingly, we found correlations of RMEL2 and RMEL3 expression with improved patient outcome, suggesting tumor and/or metastasis suppressor functions for these genes. The three genes are composed of multiple exons and map to 2q12.2, 1q25.3, and 5q11.2, respectively. They are well conserved throughout primates, but not other genomes, and were predicted as having no coding potential, although primate-conserved and human-specific short ORFs could be found. Hairpin RNA secondary structures were also predicted. In conclusion, this work offers new melanoma-specific genes for future validation as prognostic markers or as targets for the development of therapeutic strategies to treat melanoma.
Abstract:
The efficacy of fluorescence spectroscopy to detect squamous cell carcinoma is evaluated in an animal model following laser excitation at 442 and 532 nm. Lesions are chemically induced with a topical DMBA application at the left lateral tongue of Golden Syrian hamsters. The animals are investigated every 2 weeks after the 4th week of induction until a total of 26 weeks. The right lateral tongue of each animal is considered as a control site (normal contralateral tissue) and the induced lesions are analyzed as a set of points covering the entire clinically detectable area. Based on fluorescence spectral differences, four indices are determined to discriminate normal and carcinoma tissues, based on intraspectral analysis. The spectral data are also analyzed using a multivariate data analysis and the results are compared with histology as the diagnostic gold standard. The best result achieved is for blue excitation using the KNN (K-nearest neighbor, an interspectral analysis) algorithm, with a sensitivity of 95.7% and a specificity of 91.6%. These high indices indicate that fluorescence spectroscopy may constitute a fast noninvasive auxiliary tool for the diagnosis of cancer within the oral cavity. (C) 2008 Society of Photo-Optical Instrumentation Engineers.
Abstract:
Laser induced breakdown spectrometry (LIBS) was applied for the determination of macronutrients (P, K, Ca, Mg) and micronutrients (B, Cu, Fe, Mn and Zn) in leaves of sugar cane, one of the most economically important crops in Brazil. Operational conditions were previously optimized by a neuro-genetic approach, using an Nd:YAG laser at 1064 nm with 110 mJ per pulse focused on the surface of a pellet prepared with ground plant samples. Emission intensities were measured after a 2.0 µs delay time, with a 4.5 µs integration time gate and 25 accumulated laser pulses. Measurements of LIBS spectra were made in triplicate, and each replicate consisted of an average of ten spectra collected at different sites (craters) of the pellet. Quantitative determinations were carried out by using univariate calibration and chemometric methods, such as PLSR and iPLS. The calibration models were obtained by using 26 laboratory samples and the validation was carried out by using 15 test samples. For comparison, these samples were also microwave-assisted digested and further analyzed by ICP OES. In general, most results obtained by LIBS did not differ significantly from ICP OES data by applying a t-test at 95% confidence level. Both LIBS multivariate and univariate calibration methods produced similar results, except for Fe, where better results were achieved by the multivariate approach. Repeatability precision varied from 0.7 to 15% and 1.3 to 20% for measurements obtained by multivariate and univariate calibration, respectively. It is demonstrated that LIBS is a powerful tool for analysis of pellets of plant materials for determination of macro and micronutrients by choosing calibration and validation samples with similar matrix composition.
Abstract:
Cooling towers, whose thermal performance is of vital importance, are widely used in many industrial and utility plants as a cooling medium. Despite the wide interest in cooling tower design, rating and its importance in energy conservation, there are few investigations concerning the integrated analysis of cooling systems. This work presents an approach for the systemic performance analysis of a cooling water system. The approach combines experimental design with mathematical modeling. An experimental investigation was carried out to characterize the mass transfer in the packing of the cooling tower as a function of the liquid and gas flow rates, whose results were within the range of the measurement accuracy. Then, an integrated model was developed that relies on the mass and heat transfer of the cooling tower, as well as on the hydraulic and thermal interactions with a heat exchanger network. The integrated model for the cooling water system was simulated and the temperature results agree with the experimental data of the real operation of the pilot plant. A case study illustrates the interaction in the system and the need for a systemic analysis of cooling water systems. The proposed mathematical and experimental analysis should be useful for performance analysis of real-world cooling water systems. (C) 2009 Elsevier Ltd. All rights reserved.
Abstract:
A risk score model was developed based on a sample of 1,224 individuals aged 35 years or older, without known diabetes, from an urban Brazilian general population, in order to select individuals who should be screened in subsequent testing and improve the efficacy of public health assurance. External validation was performed in a second, independent population from a different city ascertained through a similar epidemiological protocol. The risk score was developed by multiple logistic regression, and model performance and cutoff values were derived from a receiver operating characteristic curve. The model's capacity to predict fasting blood glucose levels was tested by analyzing data from a 5-year follow-up protocol conducted in the general population. Items independently and significantly associated with diabetes were age, BMI and known hypertension. Sensitivity, specificity and proportion of further testing necessary for the best cutoff value were 75.9, 66.9 and 37.2%, respectively. External validation confirmed the model's adequacy (AUC equal to 0.72). Finally, the model score was also capable of predicting fasting blood glucose progression in non-diabetic individuals over a 5-year follow-up period. In conclusion, this simple diabetes risk score was able to identify individuals with an increased likelihood of having diabetes, and it can be used to stratify subpopulations in which subsequent testing is necessary and probably cost-effective.
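As a rough illustration of the three figures reported for the best cutoff (sensitivity, specificity and proportion needing further testing), the sketch below computes them for a given score threshold. The function name, the rule that a score at or above the cutoff counts as a positive screen, and the sample data are all assumptions; none of them come from the study.

```python
def screening_metrics(scores, has_diabetes, cutoff):
    """Sensitivity, specificity and fraction flagged for further testing.

    A score >= cutoff is treated as a positive screen (an assumption;
    the abstract does not state the direction of the rule).
    """
    tp = fp = tn = fn = 0
    for score, diseased in zip(scores, has_diabetes):
        flagged = score >= cutoff
        if diseased and flagged:
            tp += 1
        elif diseased and not flagged:
            fn += 1
        elif not diseased and flagged:
            fp += 1
        else:
            tn += 1
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    flagged_fraction = (tp + fp) / len(scores)
    return sensitivity, specificity, flagged_fraction


# Hypothetical scores and outcomes, for illustration only.
scores = [1, 2, 3, 4, 5, 6, 7, 8]
has_diabetes = [False, False, False, True, False, True, True, True]
sens, spec, flagged = screening_metrics(scores, has_diabetes, cutoff=4)
```

Sweeping `cutoff` over all observed scores and recording each (1 - specificity, sensitivity) pair traces out the receiver operating characteristic curve from which a cutoff can be chosen.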
Abstract:
This study aimed to validate the Brazilian version of the Brief Pain Inventory (BPI-B) scale and to determine the optimal cutpoints for mild, moderate, and severe pain based on patients' rating of their worst pain. One hundred forty-three outpatients with cancer were recruited in Hospital das Clinicas-University of Sao Paulo, Brazil. Confirmatory factor analysis confirmed two underlying dimensions, pain severity and pain interference, with Cronbach's alpha of 0.91 and 0.87, respectively. Convergent validity was shown by the correlation observed between the BPI dimensions and the EORTC-QLQ-C30 pain scale and the McGill Pain Questionnaire. The BPI-B detected significant differences in the two dimensions by disease and performance status, supporting known-group validity. For the worst pain, the optimal cutpoints were 4 and 7 (1-4 = mild pain, 5-7 = moderate, and 8-10 = severe). Our data show that BPI-B is a brief, useful, and valid tool for assessing pain and its impact on patients' lives.
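For readers unfamiliar with the internal-consistency statistic quoted above, Cronbach's alpha for k items is k/(k-1) multiplied by (1 minus the sum of the item variances divided by the variance of the total score). A minimal sketch follows; the function name and the sample responses are hypothetical and unrelated to the BPI-B data.

```python
from statistics import variance


def cronbach_alpha(responses):
    """Cronbach's alpha for a list of respondents, each a list of item scores."""
    k = len(responses[0])  # number of items in the scale
    # Sample variance of each item across respondents.
    item_vars = [variance([row[i] for row in responses]) for i in range(k)]
    # Sample variance of each respondent's total score.
    total_var = variance([sum(row) for row in responses])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)


# Hypothetical answers from four respondents to a three-item scale.
responses = [
    [2, 3, 3],
    [4, 4, 5],
    [1, 2, 1],
    [5, 4, 5],
]
alpha = cronbach_alpha(responses)
```

When every item gives identical answers the statistic equals 1; values near 0.9, like those reported here, indicate that the items within a dimension vary together closely.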
Abstract:
Introduction: Although obsessions and compulsions comprise the main features of obsessive-compulsive disorder (OCD), many patients report that their compulsions are preceded by a sense of "incompleteness" or other unpleasant feelings such as premonitory urges or a need to perform actions until feeling "just right." These manifestations have been characterized as Sensory Phenomena (SP). The current study presents initial psychometric data for a new scale designed to measure SP. Methods: Seventy-six adult OCD subjects were probed twice. Patients were assessed with an open clinical interview (considered as the "gold standard") and with the following standardized instruments: Structured Clinical Interview for Diagnostic and Statistical Manual of Mental Disorders, Fourth Edition Axis I Disorders, Yale-Brown Obsessive-Compulsive Scale, Dimensional Yale-Brown Obsessive-Compulsive Scale, Yale Global Tic Severity Scale, Beck Anxiety Inventory, and Beck Depression Inventory. Results: SP were present in 51 OCD patients (67.1%). Tics were present in 16 (21.1%) of the overall sample. The presence of SP was significantly higher in early-onset OCD patients. There were no significant differences in the presence of SP according to comorbidity with tics or gender. The comparison between the results from the open clinical interviews and the University of Sao Paulo Sensory Phenomena Scale (USP-SPS) showed an excellent concordance between them, with no significant differences between interviewers. The inter-rater reliability between the expert raters for the USP-SPS was high, with K = .92. The Pearson correlation coefficient between the SP severity scores given by the two raters was .89. Conclusion: Preliminary results suggest that the USP-SPS is a valid and reliable instrument for assessing the presence and severity of SP in OCD subjects. CNS Spectr. 2009;14(6):315-323
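The abstract does not say which agreement statistic K denotes. Assuming it is Cohen's kappa for two raters, the sketch below shows how such a value is computed: observed agreement is corrected for the agreement expected by chance from each rater's category frequencies. The function name and the ratings are hypothetical.

```python
from collections import Counter


def cohen_kappa(rater_a, rater_b):
    """Cohen's kappa for two raters scoring the same subjects."""
    n = len(rater_a)
    # Observed proportion of subjects on which the raters agree.
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Chance agreement from each rater's marginal category counts.
    counts_a, counts_b = Counter(rater_a), Counter(rater_b)
    categories = counts_a.keys() | counts_b.keys()
    expected = sum(counts_a[c] * counts_b[c] for c in categories) / n**2
    return (observed - expected) / (1 - expected)


# Hypothetical presence (1) / absence (0) ratings for four subjects.
kappa = cohen_kappa([1, 1, 0, 0], [1, 0, 0, 0])
```

In this made-up case the raters agree on three of four subjects (0.75 observed) against 0.5 expected by chance, giving a kappa of 0.5.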
Abstract:
Background: The aim of this study was to validate a biomagnetic method (alternate current biosusceptometry, ACB) for monitoring gastric wall contractions in rats. Methods: In vitro data were obtained to establish the relationship between ACB and the strain-gauge (SG) signal amplitude. In vivo experiments were performed in pentobarbital-anesthetized rats with SG and magnetic markers previously implanted under the gastric serosa or after ingestion of magnetic material. Gastric motility was quantified from the tracing amplitudes and frequency profiles obtained by Fast Fourier Transform. Key Results: The correlation between in vitro signal amplitudes was strong (R = 0.989). The temporal cross-correlation coefficient between the ACB and SG signal amplitude was higher (P < 0.0001) in the postprandial (88.3 ± 9.1 V) than in the fasting state (31.0 ± 16.9 V). Irregular signal profiles, low contraction amplitudes, and smaller signal-to-noise ratios explained the poor correlation between techniques for fasting-state recordings. When a magnetic material was ingested, there was also strong correlation in the frequency and signal amplitude and a small phase difference between the techniques. The contraction frequencies using ACB were 0.068 ± 0.007 Hz (postprandial) and 0.058 ± 0.007 Hz (fasting) (P < 0.002) and those using SG were 0.066 ± 0.006 Hz (postprandial) and 0.059 ± 0.008 Hz (fasting) (P < 0.005). Conclusions & Inferences: In summary, ACB is reliable for monitoring gastric wall contractions using both implanted and ingested magnetic materials, and may serve as an accurate and sensitive technique for gastrointestinal motility studies.
Abstract:
Objective. To validate a core set of outcome measures for the evaluation of response to treatment in patients with juvenile dermatomyositis (DM). Methods. In 2001, a preliminary consensus-derived core set for evaluating response to therapy in juvenile DM was established. In the present study, the core set was validated through an evidence-based, large-scale data collection that led to the enrollment of 294 patients from 36 countries. Consecutive patients with active disease were assessed at baseline and after 6 months. The validation procedures included assessment of feasibility, responsiveness, discriminant and construct ability, concordance in the evaluation of response to therapy between physicians and parents, redundancy, internal consistency, and ability to predict a therapeutic response. Results. The following clinical measures were found to be feasible, and to have good construct validity, discriminative ability, and internal consistency; furthermore, they were not redundant, proved responsive to clinically important changes in disease activity, and were associated strongly with treatment outcome and thus were included in the final core set: 1) physician's global assessment of disease activity, 2) muscle strength, 3) global disease activity measure, 4) parent's global assessment of patient's well-being, 5) functional ability, and 6) health-related quality of life. Conclusion. The members of the Paediatric Rheumatology International Trials Organisation, with the endorsement of the American College of Rheumatology and the European League Against Rheumatism, propose a core set of criteria for the evaluation of response to therapy that is scientifically and clinically relevant and statistically validated. The core set will help standardize the conduct and reporting of clinical trials and assist practitioners in deciding whether a child with juvenile DM has responded adequately to therapy.
Abstract:
Quality of life (QOL) has been extensively studied in clinical trials and in research on chronic degenerative diseases and dementia. The aim of this study was to assess the reliability and construct validity of the Brazilian version of the QOL scale in Alzheimer's disease (AD; QOL-AD). The QOL-AD was administered to 60 patients with mild or moderate AD and to their caregivers. The construct validation was accomplished through correlations amongst total scores of patients' and caregivers' reports on patients' quality of life (PQOL and C-PQOL, respectively), and data related to cognitive impairment, depressive symptoms, functional performance, behavioral disturbances and a generic instrument of quality of life (WHOQOL-brief), as well as correlation of total score of caregivers' reports on their own quality of life (CQOL) with the measurements cited above, QOL-AD patient reports, and depressive symptoms. The reliability was high for PQOL, C-PQOL, and CQOL versions (Cronbach's alpha = 0.80, 0.83, and 0.86, respectively). We observed significant correlations in the construct validity of all three versions regarding the variables associated with the disease and also with WHOQOL-brief. The scale took, on average, six minutes for each version. The results indicate reliability and construct validity of the Brazilian version of the QOL-AD in the studied sample.
Abstract:
There is a family of well-known external clustering validity indexes to measure the degree of compatibility or similarity between two hard partitions of a given data set, including partitions with different numbers of categories. A unified, fully equivalent set-theoretic formulation for an important class of such indexes was derived and extended to the fuzzy domain in a previous work by the author [Campello, R.J.G.B., 2007. A fuzzy extension of the Rand index and other related indexes for clustering and classification assessment. Pattern Recognition Lett., 28, 833-841]. However, the proposed fuzzy set-theoretic formulation is not valid as a general approach for comparing two fuzzy partitions of data. Instead, it is an approach for comparing a fuzzy partition against a hard referential partition of the data into mutually disjoint categories. In this paper, generalized external indexes for comparing two data partitions with overlapping categories are introduced. These indexes can be used as general measures for comparing two partitions of the same data set into overlapping categories. An important issue that is seldom touched on in the literature is also addressed in the paper, namely, how to compare two partitions of different subsamples of data. A number of pedagogical examples and three simulation experiments are presented and analyzed in detail. A review of recent related work compiled from the literature is also provided. (c) 2010 Elsevier B.V. All rights reserved.
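As background for the hard-partition case that the paper generalizes, the classical Rand index counts the fraction of object pairs on which two partitions agree, that is, pairs placed together in both or apart in both. The sketch below covers only this standard hard-partition index, not the generalized indexes for overlapping categories introduced in the paper; the function name and labels are illustrative.

```python
from itertools import combinations


def rand_index(labels_a, labels_b):
    """Classical Rand index between two hard partitions of the same objects."""
    agreements = 0
    pairs = 0
    for i, j in combinations(range(len(labels_a)), 2):
        together_a = labels_a[i] == labels_a[j]
        together_b = labels_b[i] == labels_b[j]
        # The pair counts as an agreement when both partitions put the
        # two objects together, or both put them apart.
        if together_a == together_b:
            agreements += 1
        pairs += 1
    return agreements / pairs


# Hypothetical partitions of four objects; label names carry no meaning.
same_grouping = rand_index([0, 0, 1, 1], [1, 1, 0, 0])
different_grouping = rand_index([0, 0, 1, 1], [0, 1, 1, 1])
```

Because only co-membership matters, the two partitions may use different label names and different numbers of categories, as the first example with swapped labels shows.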