30 resultados para Syntactic derivation
em Biblioteca Digital da Produção Intelectual da Universidade de São Paulo
Resumo:
The realization that statistical physics methods can be applied to analyze written texts represented as complex networks has led to several developments in natural language processing, including automatic summarization and evaluation of machine translation. Most importantly, so far only a few metrics of complex networks have been used and therefore there is ample opportunity to enhance the statistics-based methods as new measures of network topology and dynamics are created. In this paper, we employ for the first time the metrics betweenness, vulnerability and diversity to analyze written texts in Brazilian Portuguese. Using strategies based on diversity metrics, a better performance in automatic summarization is achieved in comparison to previous work employing complex networks. With an optimized method the Rouge score (an automatic evaluation method used in summarization) was 0.5089, which is the best value ever achieved for an extractive summarizer with statistical methods based on complex networks for Brazilian Portuguese. Furthermore, the diversity metric can detect keywords with high precision, which is why we believe it is suitable to produce good summaries. It is also shown that incorporating linguistic knowledge through a syntactic parser does enhance the performance of the automatic summarizers, as expected, but the increase in the Rouge score is only minor. These results reinforce the suitability of complex network methods for improving automatic summarizers in particular, and treating text in general. (C) 2011 Elsevier B.V. All rights reserved.
Resumo:
Fetal tissues are frequently discarded before (amniocentesis) or after birth, which both facilitates stem cell access and helps to overcome ethical concerns. In the present study, we aimed to isolate and characterize stem cells from the allantoic and amniotic fluids (ALF; AMF) of third trimester canine fetuses. This gestation age has not been previously explored for stem cells isolation. The gestational age, cell culture conditions and method of isolation used in this study allowed for the establishment and efficient expansion of ALF and AMF cells. We showed that the majority of ALF and ALF cells express the stem cell markers, such as vimentin, nestin and cytokeratin 18 (CK18). Under appropriate culture conditions AMF derived cells can undergo differentiation into osteogenic, adipogenic, chondrogenic and neuron-like lineages. ALF derived cells showed adipogenic, and chondrogenic potential. Therefore, ALF and AMF cells derived at the third gestation trimester can be qualified as progenitor stem cells, accordingly referred as (alantoic fluid progenitor/stem) ALF PS cells and (amniotic fluid progenitor/stem) AMF PS cells. (C) 2012 Elsevier Ltd. All rights reserved.
Resumo:
Dimensional analysis was employed to develop a predictive formula for the terminal velocity for a magnet dropped down a metallic tube. In this particular application, the technique succeeded in generating the same formula theoretically derived and that has been published by others. The analysis thus presented suggests other applications that can be developed for motivating in the use of the technique.
Resumo:
The availability and uptake of Cd by lettuce (Lactuca sativa L.) in two common tropical soils (before and after liming) were studied in order to derive human health-based risk soil concentration. Cadmium concentrations ranging from 1 to 12 mg kg(-1) were added to samples from a clayey Oxisol and a sandy-loam Ultisol under glasshouse conditions. After incubation, a soil sample was taken from each pot, the concentration of Cd in the soil was determined, lettuce was grown during 36 d, and the edible parts were harvested and analyzed for Cd. A positive linear correlation was observed between total soil Cd and the Cd concentration in lettuce. The amount of Cd absorbed by lettuce grown in the Ultisol was about twice the amount absorbed in the Oxisol. Liming increased the soil pH and slightly reduced Cd availability and uptake. CaCl2 extraction was better than DTPA to reflect differences in binding strength of Cd between limed and unlimed soils. Risk Cd concentrations in the Ultisol were lower than in the Oxisol, reflecting the greater degree of uptake from the Ultisol. The derived risk Cd values were dependent on soil type and the exposure scenario.
Resumo:
We aimed to develop site-specific sediment quality guidelines (SQGs) for two estuarine and port zones in Southeastern Brazil (Santos Estuarine System and Paranagua Estuarine System) and three in Southern Spain (Ria of Huelva, Bay of Cadiz, and Bay of Algeciras), and compare these values against national and traditionally used international benchmark values. Site-specific SQGs were derived based on sediment physical-chemical, toxicological, and benthic community data integrated through multivariate analysis. This technique allowed the identification of chemicals of concern and the establishment of effects range correlatively to individual concentrations of contaminants for each site of study. The results revealed that sediments from Santos channel, as well as inner portions of the SES, are considered highly polluted (exceeding SQGs-high) by metals, PAHs and PCBs. High pollution by PAHs and some metals was found in Sao Vicente channel. In PES, sediments from inner portions (proximities of the Ponta do Mix port`s terminal and the Port of Paranagua) are highly polluted by metals and PAHs, including one zone inside the limits of an environmental protection area. In Gulf of Cadiz, SQGs exceedences were found in Ria of Huelva (all analysed metals and PAHs), in the surroundings of the Port of CAdiz (Bay of CAdiz) (metals), and in Bay of Algeciras (Ni and PAHs). The site-specific SQGs derived in this study are more restricted than national SQGs applied in Brazil and Spain, as well as international guidelines. This finding confirms the importance of the development of site-specific SQGs to support the characterisation of sediments and dredged material. The use of the same methodology to derive SQGs in Brazilian and Spanish port zones confirmed the applicability of this technique with an international scope and provided a harmonised methodology for site-specific SQGs derivation. (C) 2009 Elsevier B.V. All rights reserved.
Resumo:
Buteonine hawks represent one of the most diverse groups in the Accipitridae, with 58 species distributed in a variety of habitats on almost all continents. Variations in migratory behavior, remarkable dispersal capability, and unusual diversity in Central and South America make buteonine hawks an excellent model for studies in avian evolution. To evaluate the history of their global radiation, we used an integrative approach that coupled estimation of the phylogeny using a large sequence database (based on 6411 bp of mitochondrial markers and one nuclear intron from 54 species), divergence time estimates, and ancestral state reconstructions. Our findings suggest that Neotropical buteonines resulted from a long evolutionary process that began in the Miocene and extended to the Pleistocene. Colonization of the Nearctic, and eventually the Old World, occurred from South America, promoted by the evolution of seasonal movements and development of land bridges. Migratory behavior evolved several times and may have contributed not only to colonization of the Holarctic, but also derivation of insular species. In the Neotropics, diversification of the buteonines included four disjunction events across the Andes. Adaptation of monophyletic taxa to wet environments occurred more than once, and some relationships indicate an evolutionary connection among mangroves, coastal and varzea environments. On the other hand, groups occupying the same biome, forest, or open vegetation habitats are not monophyletic. Refuges or sea-level changes or a combination of both was responsible for recent speciation in Amazonian taxa. In view of the lack of concordance between phylogeny and classification, we propose numerous taxonomic changes. (C) 2009 Elsevier Inc. All rights reserved.
Resumo:
The north-western sector of the Gharyan volcanic field (northern Libya) consists of trachytic-phonolitic domes emplaced between similar to 41 and 38 Ma, and small-volume mafic alkaline volcanic centres (basanites, tephrites. alkali basalts. hawaiites and rare benmoreites) of Middle Miocene-Pliocene age (similar to 12-2 Ma). Two types of trachytes and phonolites have been recognized on the basis of petrography, mineralogy and geochemistry. Type-1 trachytes and phonolites display a smooth spoon-shaped REE pattern without negative Europium anomalies. Type-2 trachytes and phonolites show a remarkable Eu negative anomaly, higher concentration in HFSE (Nb-Ta-Zr-Hf), REE and Ti than Type-1 rocks. The origin of Type-1 trachytes and phonolites is compatible with removal of clinopyroxene, plagioclase, alkali feldspar, amphibole. magnetite and titanite starting from benmoreitic magmas. found in the same outcrops. Type-2 trachytes and phonolites could be the result of extensive fractional crystallization starting from mafic alkaline magma, without removal of titanite. In primitive mantle-normalized diagrams, the mafic rocks (Mg#= 62-68, Cr up to 514 ppm, Ni up to 425 ppm) show peaks at Nb and Ta and troughs at K. These characteristics, coupled with low Sr-87/Sr-86(i) (0.7033-0.7038) and positive epsilon(Nd) (from +4.2 to + 5.3) features typical of the mafic anorogenic magmas of the northern African plate and of HIMU-OIB-like magma in general. The origin of the mafic rocks is compatible from a derivation from low degree partial melting (3-9%) shallow mantle sources in the spinel/gamet facies. placed just below the rigid plate in the uppermost low-velocity zone. The origin of the igneous activity is considered linked to passive lithospheric thinning related to the development of continental rifts like those of Sicily Channel (e.g.. Pantelleria and Linosa) and Sardinia (e.g., Campidano Graben) in the Central-Western Mediterranean Sea. (C) 2012 Elsevier B.V. All rights reserved.
Resumo:
Metalinguistic skill is the ability to reflect upon language as an object of thought. Amongst metalinguistic skills, two seem to be associated with reading and spelling: morphological awareness and phonological awareness. Phonological awareness is the ability of reflecting upon the phonemes that compose words, and morphological awareness is the ability of reflecting upon the morphemes that compose the words. The latter seems to be particularly important for reading comprehension and contextual reading, as beyond phonological information, syntactic and semantic information are required. This study is set to investigate - with a longitudinal design - the relation between those abilities and contextual reading measured by the Cloze test. The first part of the study explores the relationship between morphological awareness tasks and Cloze scores through simple correlations and, in the second part, the specificity of such relationship was inquired using multiple regressions. The results give some support to the hypothesis that morphological awareness offers an independent contribution regarding phonological awareness to contextual reading in Brazilian Portuguese.
Resumo:
The classification of texts has become a major endeavor with so much electronic material available, for it is an essential task in several applications, including search engines and information retrieval. There are different ways to define similarity for grouping similar texts into clusters, as the concept of similarity may depend on the purpose of the task. For instance, in topic extraction similar texts mean those within the same semantic field, whereas in author recognition stylistic features should be considered. In this study, we introduce ways to classify texts employing concepts of complex networks, which may be able to capture syntactic, semantic and even pragmatic features. The interplay between various metrics of the complex networks is analyzed with three applications, namely identification of machine translation (MT) systems, evaluation of quality of machine translated texts and authorship recognition. We shall show that topological features of the networks representing texts can enhance the ability to identify MT systems in particular cases. For evaluating the quality of MT texts, on the other hand, high correlation was obtained with methods capable of capturing the semantics. This was expected because the golden standards used are themselves based on word co-occurrence. Notwithstanding, the Katz similarity, which involves semantic and structure in the comparison of texts, achieved the highest correlation with the NIST measurement, indicating that in some cases the combination of both approaches can improve the ability to quantify quality in MT. In authorship recognition, again the topological features were relevant in some contexts, though for the books and authors analyzed good results were obtained with semantic features as well. Because hybrid approaches encompassing semantic and topological features have not been extensively used, we believe that the methodology proposed here may be useful to enhance text classification considerably, as it combines well-established strategies. (c) 2012 Elsevier B.V. All rights reserved.
Resumo:
Fundamental principles of mechanics were primarily conceived for constant mass systems. Since the pioneering works of Meshcherskii (see historical review in Mikhailov (Mech. Solids 10(5):32-40, 1975), efforts have been made in order to elaborate an adequate mathematical formalism for variable mass systems. This is a current research field in theoretical mechanics. In this paper, attention is focused on the derivation of the so-called 'generalized canonical equations of Hamilton' for a variable mass particle. The applied technique consists in the consideration of the mass variation process as a dissipative phenomenon. Kozlov's (Stek. Inst. Math 223:178-184, 1998) method, originally devoted to the derivation of the generalized canonical equations of Hamilton for dissipative systems, is accordingly extended to the scenario of variable mass systems. This is done by conveniently writing the flux of kinetic energy from or into the variable mass particle as a 'Rayleigh-like dissipation function'. Cayley (Proc. R Soc. Lond. 8:506-511, 1857) was the first scholar to propose such an analogy. A deeper discussion on this particular subject will be left for a future paper.
Resumo:
We address the spherical accretion of generic fluids onto black holes. We show that, if the black hole metric satisfies certain conditions, in the presence of a test fluid it is possible to derive a fully relativistic prescription for the black hole mass variation. Although the resulting equation may seem obvious due to a form of it appearing as a step in the derivation of the Schwarzschild metric, this geometrical argument is necessary to fix the added degree of freedom one gets for allowing the mass to vary with time. This result has applications on cosmological accretion models and provides a derivation from first principles to serve as a basis to the accretion equations already in use in the literature.
Resumo:
Background: Tuberculosis (TB) remains a public health issue worldwide. The lack of specific clinical symptoms to diagnose TB makes the correct decision to admit patients to respiratory isolation a difficult task for the clinician. Isolation of patients without the disease is common and increases health costs. Decision models for the diagnosis of TB in patients attending hospitals can increase the quality of care and decrease costs, without the risk of hospital transmission. We present a predictive model for predicting pulmonary TB in hospitalized patients in a high prevalence area in order to contribute to a more rational use of isolation rooms without increasing the risk of transmission. Methods: Cross sectional study of patients admitted to CFFH from March 2003 to December 2004. A classification and regression tree (CART) model was generated and validated. The area under the ROC curve (AUC), sensitivity, specificity, positive and negative predictive values were used to evaluate the performance of model. Validation of the model was performed with a different sample of patients admitted to the same hospital from January to December 2005. Results: We studied 290 patients admitted with clinical suspicion of TB. Diagnosis was confirmed in 26.5% of them. Pulmonary TB was present in 83.7% of the patients with TB (62.3% with positive sputum smear) and HIV/AIDS was present in 56.9% of patients. The validated CART model showed sensitivity, specificity, positive predictive value and negative predictive value of 60.00%, 76.16%, 33.33%, and 90.55%, respectively. The AUC was 79.70%. Conclusions: The CART model developed for these hospitalized patients with clinical suspicion of TB had fair to good predictive performance for pulmonary TB. The most important variable for prediction of TB diagnosis was chest radiograph results. Prospective validation is still necessary, but our model offer an alternative for decision making in whether to isolate patients with clinical suspicion of TB in tertiary health facilities in countries with limited resources.
Resumo:
We present a study of the stellar parameters and iron abundances of 18 giant stars in six open clusters. The analysis was based on high-resolution and high-S/N spectra obtained with the UVES spectrograph (VLT-UT2). The results complement our previous study where 13 clusters were already analyzed. The total sample of 18 clusters is part of a program to search for planets around giant stars. The results show that the 18 clusters cover a metallicity range between -0.23 and +0.23 dex. Together with the derivation of the stellar masses, these metallicities will allow the metallicity and mass effects to be disentangled when analyzing the frequency of planets as a function of these stellar parameters.
Resumo:
The leaf area index (LAI) is a key characteristic of forest ecosystems. Estimations of LAI from satellite images generally rely on spectral vegetation indices (SVIs) or radiative transfer model (RTM) inversions. We have developed a new and precise method suitable for practical application, consisting of building a species-specific SVI that is best-suited to both sensor and vegetation characteristics. Such an SVI requires calibration on a large number of representative vegetation conditions. We developed a two-step approach: (1) estimation of LAI on a subset of satellite data through RTM inversion; and (2) the calibration of a vegetation index on these estimated LAI. We applied this methodology to Eucalyptus plantations which have highly variable LAI in time and space. Previous results showed that an RTM inversion of Moderate Resolution Imaging Spectroradiometer (MODIS) near-infrared and red reflectance allowed good retrieval performance (R-2 = 0.80, RMSE = 0.41), but was computationally difficult. Here, the RTM results were used to calibrate a dedicated vegetation index (called "EucVI") which gave similar LAI retrieval results but in a simpler way. The R-2 of the regression between measured and EucVI-simulated LAI values on a validation dataset was 0.68, and the RMSE was 0.49. The additional use of stand age and day of year in the SVI equation slightly increased the performance of the index (R-2 = 0.77 and RMSE = 0.41). This simple index opens the way to an easily applicable retrieval of Eucalyptus LAI from MODIS data, which could be used in an operational way.
Resumo:
The use of statistical methods to analyze large databases of text has been useful in unveiling patterns of human behavior and establishing historical links between cultures and languages. In this study, we identified literary movements by treating books published from 1590 to 1922 as complex networks, whose metrics were analyzed with multivariate techniques to generate six clusters of books. The latter correspond to time periods coinciding with relevant literary movements over the last five centuries. The most important factor contributing to the distinctions between different literary styles was the average shortest path length, in particular the asymmetry of its distribution. Furthermore, over time there has emerged a trend toward larger average shortest path lengths, which is correlated with increased syntactic complexity, and a more uniform use of the words reflected in a smaller power-law coefficient for the distribution of word frequency. Changes in literary style were also found to be driven by opposition to earlier writing styles, as revealed by the analysis performed with geometrical concepts. The approaches adopted here are generic and may be extended to analyze a number of features of languages and cultures.