12 results for Moretti, Franco: Graphs, Maps, Trees. Abstract models for a literary theory
in Biblioteca Digital da Produção Intelectual da Universidade de São Paulo
Abstract:
We review recent visualization techniques aimed at supporting tasks that require the analysis of text documents, from approaches targeted at visually summarizing the relevant content of a single document to those aimed at assisting exploratory investigation of whole collections of documents. Techniques are organized according to their target input material (either single texts or collections of texts) and their focus, which may be displaying content, emphasizing relevant relationships, highlighting the temporal evolution of a document or collection, or helping users handle the results of a query posed to a search engine. We describe the approaches adopted by distinct techniques and briefly review the strategies they employ to obtain meaningful text models. We also discuss how they extract the information required to produce representative visualizations, the tasks they intend to support, the interaction issues involved, and their strengths and limitations. Finally, we present a summary of the techniques, highlighting their goals and distinguishing characteristics, and briefly discuss some open problems and research directions in visual text mining and text analytics.
Abstract:
Species distribution models (SDMs) can be useful for different conservation purposes. We discuss the importance of fitting the spatial scale and of using current records and relevant predictors when modelling for conservation. We chose the jaguar (Panthera onca) as the target species and Brazil and the Atlantic Forest biome as study areas. We tested two extents (continent and biome) and two resolutions (~4 km and ~1 km) in Maxent with 186 records and 11 predictors (bioclimatic, elevation, land-use and landscape structure). All models presented satisfactory AUC values (>0.70) and low omission errors (<23%). SDMs were scale-sensitive: reducing the extent yielded significant gains in model performance and generated more constrained and realistic predicted distribution maps. Continental-scale models performed poorly in predicting the potential current jaguar distribution, but they recovered the historic distribution. Specificity increased significantly from coarse- to finer-scale models due to the reduction of overprediction. The variability of environmental space (E-space) differed between the continental and biome scales for most climatic variables, and the representation of the E-space by predictors differed significantly (t = 2.42, d.f. = 9, P < 0.05). Refining spatial scale, incorporating landscape variables and improving the quality of biological data are essential for improving model prediction for conservation purposes.
Abstract:
Background: Smear-negative pulmonary tuberculosis (SNPT) accounts for 30% of pulmonary tuberculosis cases reported yearly in Brazil. This study aimed to develop a prediction model for SNPT for outpatients in areas with scarce resources. Methods: The study enrolled 551 patients with clinical-radiological suspicion of SNPT in Rio de Janeiro, Brazil. The original data were divided into two equivalent samples for generation and validation of the prediction models. Symptoms, physical signs and chest X-rays were used to construct logistic regression and classification and regression tree models. From the logistic regression, we generated a clinical and radiological prediction score. The area under the receiver operating characteristic curve, sensitivity, and specificity were used to evaluate the models' performance in both the generation and validation samples. Results: It was possible to generate predictive models for SNPT with sensitivity ranging from 64% to 71% and specificity ranging from 58% to 76%. Conclusion: The results suggest that these models might be useful as screening tools for estimating the risk of SNPT, optimizing the use of more expensive tests, and avoiding the costs of unnecessary anti-tuberculosis treatment. These models might be cost-effective tools in a health care network with hierarchical distribution of scarce resources.
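The sensitivity and specificity figures quoted in this abstract come from a confusion matrix. As a minimal sketch of how the two measures are computed, the Python below uses made-up counts; the function name and the numbers are illustrative assumptions and are not taken from the study.

```python
def sensitivity_specificity(tp, fn, tn, fp):
    """Return (sensitivity, specificity) from confusion-matrix counts."""
    sensitivity = tp / (tp + fn)  # share of true cases the model flags
    specificity = tn / (tn + fp)  # share of non-cases the model clears
    return sensitivity, specificity


# Hypothetical counts for illustration only (not data from the study).
sens, spec = sensitivity_specificity(tp=70, fn=30, tn=60, fp=40)
print(f"sensitivity={sens:.2f}, specificity={spec:.2f}")
```

A screening tool of the kind described would typically be tuned towards higher sensitivity, accepting more false positives that a confirmatory test can then rule out.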
Abstract:
The Asteraceae, one of the largest families among angiosperms, is chemically characterised by the production of sesquiterpene lactones (SLs). A total of 1,111 SLs, extracted from 658 species, 161 genera, 63 subtribes and 15 tribes of Asteraceae, were represented and registered in two dimensions in SISTEMATX, an in-house software system, and were associated with their botanical sources. Eleven blocks of descriptors (Constitutional, Functional groups, BCUT, Atom-centred, 2D autocorrelations, Topological, Geometrical, RDF, 3D-MoRSE, GETAWAY and WHIM) were used as input data to separate the botanical occurrences through self-organising maps (SOMs). The maps generated with each descriptor block separated the Asteraceae tribes, with total index values between 66.7% and 83.6%. The analysis of the results shows evident similarities among the Heliantheae, Helenieae and Eupatorieae tribes as well as between the Anthemideae and Inuleae tribes. These observations agree with the systematic classifications proposed by Bremer, which use mainly morphological and molecular data; chemical markers therefore partially corroborate these classifications. The results demonstrate that the atom-centred and RDF descriptors can be used as a tool for taxonomic classification at low hierarchical levels, such as tribes. Descriptors obtained from fragments or from the two-dimensional representation of the SL structures were sufficient to obtain significant results, and better results were not achieved by using descriptors derived from three-dimensional representations of SLs. Such models, based on physico-chemical properties, can project newly designed SLs, similar structures from the literature or even unreported structures into two-dimensional chemical space. Therefore, the generated SOMs can predict the most probable tribe in which a biologically active molecule can be found, according to Bremer's classification.
Abstract:
This paper addresses the functional reliability and the complexity of reconfigurable antennas using graph models. The correlation between complexity and reliability for any given reconfigurable antenna is defined. Two methods are proposed to reduce failures and improve the reliability of reconfigurable antennas. The failures are caused by the reconfiguration technique or by the surrounding environment. The proposed failure-reduction methods are tested, and examples are given that verify them.
Abstract:
Site-specific height-diameter models may be used to improve biomass estimates for forest inventories where only diameter at breast height (DBH) measurements are available. In this study, we fit height-diameter models for vegetation types of a tropical Atlantic forest using field measurements of height across plots along an altitudinal gradient. To fit height-diameter models, we sampled trees by DBH class and measured tree height within 13 one-hectare permanent plots established at four altitude classes. To select the best model, we tested the performance of 11 height-diameter models using the Akaike Information Criterion (AIC). The Weibull and Chapman-Richards height-diameter models performed better than other models, and regional site-specific models performed better than the general model. In addition, there is a slight variation in height-diameter relationships across the altitudinal gradient and a large difference in stature between the Atlantic and Amazon forests. The results show the effect of altitude on tree height estimates and emphasize the need for altitude-specific models, which produce more accurate results than a general model that encompasses all altitudes. To improve biomass estimation, the development of regional height-diameter models that estimate tree height using a subset of randomly sampled trees presents an approach to supplement surveys where only diameter has been measured.
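Model selection by AIC, as described in this abstract, can be sketched as follows. The abstract does not give the model parameterisations, so the Chapman-Richards form below (with a 1.3 m breast-height offset), the least-squares version of AIC, the straight-line comparison model and all data and parameter values are assumptions made for illustration only.

```python
import math


def chapman_richards(dbh, a, b, c):
    """Assumed Chapman-Richards form: height (m) from DBH (cm)."""
    return 1.3 + a * (1.0 - math.exp(-b * dbh)) ** c


def aic_least_squares(observed, predicted, n_params):
    """AIC for a least-squares fit with Gaussian errors (up to a constant)."""
    n = len(observed)
    rss = sum((o - p) ** 2 for o, p in zip(observed, predicted))
    # The +1 counts the estimated error variance as a parameter.
    return n * math.log(rss / n) + 2 * (n_params + 1)


# Hypothetical measurements and already-fitted parameters (not from the study).
dbh = [5.0, 10.0, 20.0, 40.0, 60.0]
height = [6.1, 9.8, 15.2, 21.0, 24.3]

cr_pred = [chapman_richards(d, a=25.0, b=0.04, c=1.0) for d in dbh]
lin_pred = [5.0 + 0.35 * d for d in dbh]  # straight-line model for comparison

aic_cr = aic_least_squares(height, cr_pred, n_params=3)
aic_lin = aic_least_squares(height, lin_pred, n_params=2)
print("preferred model:", "Chapman-Richards" if aic_cr < aic_lin else "linear")
```

The model with the lower AIC is preferred; the penalty term keeps a model with more parameters from winning on goodness of fit alone.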
Abstract:
We review symplectic nontwist maps that we have introduced to describe Lagrangian transport properties in magnetically confined plasmas in tokamaks. These nontwist maps are suitable to describe the formation and destruction of transport barriers in the shearless region (i.e., near the curve where the twist condition does not hold). The maps can be used to investigate two kinds of problems in plasmas with non-monotonic field profiles: the first is the chaotic magnetic field line transport in plasmas with external resonant perturbations. The second problem is the chaotic particle drift motion caused by electrostatic drift waves. The presented analytical maps, derived from plasma models with equilibrium field profiles and control parameters that are commonly measured in plasma discharges, can be used to investigate long-term transport properties. (C) 2011 Elsevier B.V. All rights reserved.
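The abstract does not state the form of the maps it reviews. As an illustrative assumption, the sketch below iterates the standard nontwist map widely used in the literature, x' = x + a(1 − y'²), y' = y − b sin(2πx); the function names and the parameter values are hypothetical.

```python
import math


def standard_nontwist_map(x, y, a, b):
    """One iteration of the standard nontwist map (x taken modulo 1)."""
    y_next = y - b * math.sin(2.0 * math.pi * x)
    x_next = (x + a * (1.0 - y_next ** 2)) % 1.0
    return x_next, y_next


def orbit(x0, y0, a, b, steps):
    """Return the list of points visited from (x0, y0), start included."""
    points = [(x0, y0)]
    for _ in range(steps):
        points.append(standard_nontwist_map(*points[-1], a, b))
    return points


# Hypothetical parameters; an orbit started on y = 0, where the twist
# condition d(x')/d(y') = -2*a*y' vanishes when b = 0.
trajectory = orbit(x0=0.1, y0=0.0, a=0.35, b=0.2, steps=1000)
print(len(trajectory), trajectory[-1])
```

Plotting many such orbits for different initial conditions is the usual way to see whether a transport barrier near the shearless curve survives or is destroyed as b grows.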
Abstract:
This study uses several measures derived from the error matrix to compare two thematic maps generated with the same sample set. The reference map was generated with all the sample elements, and the model map was generated without the two points detected as influential by local influence diagnostics. The data analyzed refer to wheat productivity in an agricultural area of 13.55 ha, sampled on a 50 x 50 m grid comprising 50 georeferenced sample elements. The comparison measures derived from the error matrix indicated that, despite some similarity between the maps, they are different. The difference between the production estimated by the reference map and the actual production was 350 kg. The same difference calculated with the model map was 50 kg, indicating that the study of influential points is of fundamental importance for obtaining a more reliable estimate, and that measures obtained from the error matrix are a good option for comparing thematic maps.
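The abstract does not name the error-matrix measures it uses. Two common ones are overall accuracy and Cohen's kappa, sketched below in Python under that assumption; the 2 x 2 matrix is invented for illustration and is not data from the study.

```python
def overall_accuracy_and_kappa(matrix):
    """Overall accuracy and Cohen's kappa from a square error matrix.

    Rows are classes on the model map, columns are classes on the
    reference map, and each cell counts the locations in that pair.
    """
    n = sum(sum(row) for row in matrix)
    observed = sum(matrix[i][i] for i in range(len(matrix))) / n
    row_totals = [sum(row) for row in matrix]
    col_totals = [sum(col) for col in zip(*matrix)]
    # Agreement expected by chance from the marginal totals.
    expected = sum(r * c for r, c in zip(row_totals, col_totals)) / n ** 2
    kappa = (observed - expected) / (1.0 - expected)
    return observed, kappa


# Hypothetical error matrix for two productivity classes.
accuracy, kappa = overall_accuracy_and_kappa([[20, 5], [10, 15]])
print(f"overall accuracy={accuracy:.2f}, kappa={kappa:.2f}")
```

Kappa discounts the agreement two maps would reach by chance, so it is usually lower than the overall accuracy for the same matrix.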
Abstract:
Spatial linear models have been applied in numerous fields such as agriculture, geoscience and environmental sciences, among many others. Modelling the spatial dependence structure with a geostatistical approach is an indispensable tool for estimating the parameters that define this structure. However, this estimation may be greatly affected by the presence of atypical observations in the sampled data. The purpose of this paper is to use diagnostic techniques to assess the sensitivity of the maximum-likelihood estimators, covariance functions and linear predictor to small perturbations in the data and/or the spatial linear model assumptions. The methodology is illustrated with two real data sets. The results allow us to conclude that the presence of atypical values in the sample data has a strong influence on thematic maps, changing the spatial dependence structure.
Abstract:
Background: Signaling by the vitamin A-derived morphogen retinoic acid (RA) is required at multiple steps of cardiac development. Since conversion of retinaldehyde to RA by retinaldehyde dehydrogenase type II (ALDH1A2, a.k.a. RALDH2) is critical for cardiac development, we screened patients with congenital heart disease (CHD) for genetic variation at the ALDH1A2 locus. Methods: One hundred and thirty-three CHD patients were screened for genetic variation at the ALDH1A2 locus through bi-directional sequencing. In addition, six SNPs (rs2704188, rs1441815, rs3784259, rs1530293, rs1899430) at the same locus were studied using a TDT-based association approach in 101 CHD trios. Observed mutations were modeled through molecular mechanics (MM) simulations using the Sander and Pmemd programs of the AMBER 9 package. Sequence conservation of observed mutations was evaluated through phylogenetic tree construction from ungapped alignments containing ALDH8s, ALDH1Ls, ALDH1s and ALDH2s. Trees were generated by the Neighbor Joining method. Variations potentially affecting splicing mechanisms were cloned, and functional assays were designed to test splicing alterations using the pSPL3 splicing assay. Results: In tetralogy of Fallot (TOF), we describe the mutations Ala151Ser and Ile157Thr, which change non-polar to polar residues at exon 4. Exon 4 encodes part of the highly conserved tetramerization domain, a structural motif required for ALDH oligomerization. Molecular mechanics simulation studies of the two mutations indicate that they hinder tetramerization. We determined that the SNP rs16939660, previously associated with spina bifida and observed in patients with TOF, does not affect splicing. Moreover, association studies performed with classical models and with the transmission disequilibrium test (TDT) design, using single-marker genotype or haplotype information, do not show differences between cases and controls.
Conclusion: In summary, our screen indicates that ALDH1A2 genetic variation is present in TOF patients, suggesting a possible causal role for this gene in rare cases of human CHD, but does not support the hypothesis that variation at the ALDH1A2 locus is a significant modifier of the risk for CHD in humans.
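The transmission disequilibrium test (TDT) mentioned in this abstract compares how often heterozygous parents transmit, versus do not transmit, a given allele to affected offspring. A minimal sketch of the classical single-marker statistic follows; the counts are invented and the function name is an illustrative assumption, not the study's own analysis code.

```python
import math


def tdt_statistic(transmitted, not_transmitted):
    """Classical TDT chi-square (1 d.f.) and its p-value.

    Counts refer to heterozygous parents who did or did not transmit
    the allele of interest to an affected child.
    """
    b, c = transmitted, not_transmitted
    chi2 = (b - c) ** 2 / (b + c)
    # Upper tail of a chi-square with 1 d.f.: P(X > x) = erfc(sqrt(x / 2)).
    p_value = math.erfc(math.sqrt(chi2 / 2.0))
    return chi2, p_value


# Hypothetical transmission counts (not data from the study).
chi2, p_value = tdt_statistic(transmitted=60, not_transmitted=40)
print(f"chi2={chi2:.2f}, p={p_value:.4f}")
```

Under the null hypothesis of no linkage or association, the two counts are expected to be equal, so a statistic near zero is consistent with the abstract's report of no differences.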
Abstract:
Background: The main objective of the Brazilian Study on the Practice of Diabetes Care was to provide an epidemiological profile of individuals with type 1 and type 2 diabetes mellitus (DM) in Brazil, concerning therapy and adherence to international guidelines in medical practice. Methods: This observational, cross-sectional, multicenter study collected and analyzed data from individuals with type 1 and type 2 DM attending public or private clinics in Brazil. Each investigator included the first 10 patients with type 2 DM who visited his/her office, and the first 5 patients with type 1 DM. Results: A total of 1,358 patients were analyzed; 375 (27.6%) had type 1 and 983 (72.4%) had type 2 DM. Most individuals were women, Caucasian, and private health care users. High prevalence rates of hypertension, dyslipidemia and central obesity were observed, particularly in type 2 DM. Only 7.3% and 5.1% of the individuals with type 1 and type 2 DM, respectively, had optimal control of blood pressure, plasma glucose and lipids. The absence of hypertension and female sex were associated with better control of type 1 DM and other cardiovascular risk factors. In type 2 DM, older age was also associated with better control. Conclusions: Female sex, older age, and absence of hypertension were associated with better metabolic control. Optimal control of plasma glucose and other cardiovascular risk factors is achieved in only a minority of individuals with diabetes. Local figures are worse than those from other countries.
Abstract:
Background: The involvement of post-transcriptional regulation by microRNAs in the molecular mechanisms underlying cancer is well documented. However, their interference at the cellular level is not fully explored. Functional in vitro studies are fundamental for understanding their role; nevertheless, results are highly dependent on the adopted cellular model. Next-generation small RNA transcriptomic sequencing data from a tumor cell line and from keratinocytes derived from primary culture were generated in order to characterize the microRNA content of these systems and thus help in understanding them. Both constitute cell models for functional studies of microRNAs in head and neck squamous cell carcinoma (HNSCC), a smoking-related cancer. Known microRNAs were quantified and analyzed in the context of gene regulation. New microRNAs were investigated using similarity and structural search, ab initio classification, and prediction of the location of mature microRNAs within would-be precursor sequences. Results were compared with small RNA transcriptomic sequences from HNSCC samples in order to assess the applicability of these cell models for understanding the cancer phenotype and for novel molecule discovery. Results: Ten miRNAs represented over 70% of the mature molecules present in each of the cell types. The most expressed molecules were miR-21, miR-24 and miR-205. Accordingly, miR-21 and miR-205 have previously been shown to play a role in epithelial cell biology. Although miR-21 has been implicated in cancer development and evaluated as a biomarker of HNSCC progression, no significant expression differences were seen between cell types. We demonstrate that differentially expressed mature miRNAs target biological processes related to cell differentiation and apoptosis, indicating that they might represent, with acceptable accuracy, the genetic context from which they derive.
Most miRNAs identified in the cancer cell line and in keratinocytes were present in tumor samples and cancer-free samples, respectively, with miR-21, miR-24 and miR-205 still among the most prevalent molecules in all instances. Thirteen miRNA-like structures, containing reads identified by deep sequencing, were predicted from putative miRNA precursor sequences. Strong evidence suggests that one of them could be a new miRNA. This molecule was mostly expressed in the tumor cell line and HNSCC samples, indicating a possible biological function in cancer. Conclusions: Critical biological features of cells must be fully understood before they can be chosen as models for functional studies. Expression levels of miRNAs relate to cell type and tissue context. This study provides insights into the miRNA content of two cell models used for cancer research. Pathways commonly deregulated in HNSCC might be targeted by the most expressed and also by differentially expressed miRNAs. Results indicate that the use of cell models for cancer research demands careful assessment of underlying molecular characteristics for proper data interpretation. Additionally, one new miRNA-like molecule with a potential role in cancer was identified in the cell lines and clinical samples.