955 results for Temporal Information Extraction
Abstract:
Conventional topic models are ineffective for topic extraction from microblog messages, since the lack of structure and context among the posts yields poor message-level word co-occurrence patterns. In this work, we organize microblog posts as conversation trees based on reposting and replying relations, which enrich context information to alleviate data sparseness. Our model generates words according to topic dependencies derived from the conversation structures. Specifically, we differentiate between leader messages, which initiate key aspects of previously focused topics or shift the focus to different topics, and follower messages, which do not introduce any new information but simply echo topics from the messages that they repost or reply to. Our model captures the different extents to which leader and follower messages may contain the key topical words, which further enhances the quality of the induced topics. The results of thorough experiments demonstrate the effectiveness of our proposed model.
Abstract:
In this paper we discuss the temporal aspects of indexing and classification in information systems. We base this discussion on three sources of research on scheme change in indexing: (1) analytical research on the types of scheme change, (2) empirical data on scheme change in systems, and (3) evidence of cataloguer decision-making in the context of scheme change. From this general discussion we propose two constructs along which we might craft metrics to measure scheme change: collocative integrity and semantic gravity. The paper closes with a discussion of these constructs.
Abstract:
Crop monitoring and, more generally, land use change detection are of primary importance for analyzing spatio-temporal dynamics and their impacts on the environment. This is especially true in a region such as the State of Mato Grosso (south of the Brazilian Amazon Basin), which hosts an intensive pioneer front. Deforestation in this region has often been explained by soybean expansion over the last three decades. Remote sensing techniques may now offer an efficient and objective way to quantify, through crop mapping studies, how far crop expansion really is a factor of deforestation. Given the special characteristics of soybean farms in Mato Grosso (areas varying between 1,000 and 40,000 hectares, with individual fields often larger than 100 hectares), Moderate Resolution Imaging Spectroradiometer (MODIS) data, with a near-daily temporal resolution and 250 m spatial resolution, can be considered adequate for crop mapping. In particular, multitemporal vegetation index (VI) studies have been used for this task [1] [2]. In this study, 16-day composites of EVI data (MOD13Q1 product) are used. However, although these data are already processed, multitemporal VI profiles remain noisy because of cloudiness (which is extremely frequent in a tropical region such as the southern Amazon Basin), sensor problems, errors in atmospheric correction, or BRDF effects. Many studies have therefore tried to develop algorithms that smooth the multitemporal VI profiles in order to improve subsequent classification. The goal of this study is to compare and test different smoothing algorithms in order to select the one best suited to classifying crop classes. These classes correspond to 6 different agricultural management systems observed in Mato Grosso through intensive field work, which resulted in the mapping of more than 1,000 individual fields.
The agricultural management systems mentioned above are based on combinations of soy, cotton, corn, millet and sorghum crops sown in single or double crop systems. Because certain classes are difficult to separate owing to overly similar agricultural calendars, the classification is reduced to 3 classes: cotton (single crop), soy and cotton (double crop), and soy (single crop or double crop with corn, millet or sorghum). The classification uses training data obtained in the 2005-2006 harvest and is then tested on the 2006-2007 harvest. In a first step, four smoothing techniques are presented and critiqued: Best Index Slope Extraction (BISE) [3], Mean Value Iteration (MVI) [4], Weighted Least Squares (WLS) [5] and the Savitzky-Golay filter (SG) [6] [7]. These techniques are then implemented and visually compared on a few individual pixels to allow a first selection among the studied techniques. The WLS and SG techniques are selected according to the criteria proposed by [8]: the ability to eliminate frequent noise, to conserve the upper values of the VI profiles, and to preserve the timing of the profiles. The selected algorithms are then programmed and applied to the MODIS/TERRA EVI data (16-day composite periods). Separability tests based on the Jeffries-Matusita distance are carried out to see whether the algorithms improve the potential for differentiating between the classes. These tests are carried out on the overall profile (comprising 23 MODIS images) as well as on each MODIS sub-period of the profile [1]. This last test serves two purposes: it allows the smoothing techniques to be compared, and it makes it possible to select a set of images that carries more information on the separability between the classes. The selected dates can then be used to perform a supervised classification.
Here, three different classifiers are tested to evaluate whether the smoothing techniques have a particular effect on the classification depending on the classifier used: the Maximum Likelihood classifier, the Spectral Angle Mapper (SAM) classifier and the CHAID Improved Decision Tree. The separability tests on the overall profile show that the smoothed profiles do not appreciably improve the potential for discrimination between classes compared with the original data. However, the same tests carried out on the MODIS sub-periods show better results with the smoothed profiles. The classification results confirm this first analysis. The Kappa coefficients are always better with the smoothing techniques, and the results obtained with the WLS and SG smoothed profiles are nearly equal. However, the results differ depending on the classifier used. The impact of the smoothing algorithms is much greater with the decision tree model, where it yields a gain of 0.1 in the Kappa coefficient. With the Maximum Likelihood and SAM models, the gain remains positive but is much lower (Kappa improved by only 0.02). Thus, this work supports the utility of smoothing the VI profiles in order to improve the final results. However, the choice of smoothing algorithm has to take into account the original data and the classifier model used. In this case the Savitzky-Golay filter gave the better results.
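The separability test named in this abstract can be illustrated with a minimal sketch. It assumes each class is summarized, for a single MODIS date, by the mean and variance of its EVI values and is treated as a univariate normal distribution; the abstract does not say which variant the authors used. The function names and the convention JM = 2(1 - exp(-B)), which ranges from 0 (identical classes) to 2 (fully separable), are assumptions made for illustration.

```python
import math


def class_stats(values):
    """Return the sample mean and unbiased variance of one class's EVI values."""
    n = len(values)
    mean = sum(values) / n
    variance = sum((v - mean) ** 2 for v in values) / (n - 1)
    return mean, variance


def bhattacharyya_1d(mean_a, var_a, mean_b, var_b):
    """Bhattacharyya distance between two univariate normal distributions."""
    mean_term = 0.25 * (mean_a - mean_b) ** 2 / (var_a + var_b)
    var_term = 0.5 * math.log((var_a + var_b) / (2.0 * math.sqrt(var_a * var_b)))
    return mean_term + var_term


def jeffries_matusita_1d(mean_a, var_a, mean_b, var_b):
    """Jeffries-Matusita distance, using the convention JM = 2 * (1 - exp(-B))."""
    b = bhattacharyya_1d(mean_a, var_a, mean_b, var_b)
    return 2.0 * (1.0 - math.exp(-b))


# Hypothetical EVI samples for two crop classes on one date (made-up numbers).
cotton = [0.31, 0.35, 0.33, 0.30, 0.34]
soy = [0.62, 0.66, 0.60, 0.64, 0.65]
jm = jeffries_matusita_1d(*class_stats(cotton), *class_stats(soy))
print(f"JM distance: {jm:.3f}")
```

Computing this distance date by date, before and after smoothing, is one way to compare how much each sub-period contributes to class separability; a multivariate version would use class mean vectors and covariance matrices instead.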
Abstract:
Introduction: Brazil is one of the main agricultural producers in the world, ranking 1st in the production of sugarcane, coffee and oranges. It is also the world's 2nd largest producer of soybeans and a leader in the harvested yields of many other crops. The annual consumption of mineral fertilizers exceeds 20 million mt, 30% of which corresponds to potash fertilizers (ANDA, 2006). From this statistic it might be supposed that fertilizer application in Brazil is rather high compared with many other countries. However, even if it is assumed that only one fourth of this enormous 8.5 million km² territory is used for agriculture, average levels of fertilizer application per hectare of arable land are not high enough for sustainable production. One of the major constraints is the relatively low natural fertility of the soils, which contain excessive Fe and Al oxides. Agriculture is also often practised on sandy soils, so that heavy rainfall causes large losses of nutrients through leaching. In general, nutrient removal by crops such as sugarcane and tropical fruits far exceeds the average nutrient application via fertilization, especially in regions with a long history of agricultural production. In the recently developed areas, especially in the Cerrado (Brazilian savanna), where agriculture has expanded since 1980, soils are even poorer than in the "old" agricultural regions, and the high cost of mineral fertilizers has become a significant input factor in determining soybean, maize and cotton planting. The consumption of mineral fertilizers throughout Brazil is very uneven.
According to the 1995/96 Agricultural Census, in only eight of the 26 Brazilian states were 50 per cent or more of the farms treated "systematically" with mineral fertilizers; in many states the figure was less than 25 per cent, and in five states even less than 12 per cent (Brazilian Institute for Geography and Statistics; Censo Agropecuário 1995/96, Instituto Brasileiro de Geografia e Estatística; IBGE, www.ibge.gov.br). The geographical distribution pattern of mineral fertilizer application may be considered an important field of research. Understanding geographical disparities in fertilization level requires a complex approach. This includes evaluating the availability of nutrients in the soil (and related soil properties, e.g. CEC and texture), the input of nutrients with fertilizer application, and the removal of nutrients by harvested yields. When all these data are compiled, it is possible to evaluate the balance of particular nutrients for certain areas and draw conclusions as to where agricultural practices should be optimized. This kind of research is somewhat complicated because it relies on completely different, usually incomparable, data sources, e.g. soil characteristics attributed to soil type areas, in contrast to yields by administrative regions or farms. A priority tool in this case is the Geographical Information System (GIS), which enables data from different fields to be attributed to the same territorial units and makes it possible to integrate these data in an "input-output" model, where "input" is the natural availability of a nutrient in the soil plus fertilization, and "output" is the export of the same nutrient with the removed harvested yield.
Abstract:
The year 14,226 BP marks an important boundary in the current radiocarbon (14C) calibration curve: the high resolution and precision characterising the first part of the curve (0 – 14,226 BP) are due to tree-ring datasets, which directly provide the atmospheric 14C content at the time of tree-ring formation. Resolution and precision systematically decrease further back in time, where only a few floating tree-ring chronologies alternate with other low-resolution records. The lack of resolution in the dating procedure before 14,226 BP leads to significant difficulties in interpreting and untangling complex events of our past in the field of Human Evolution. Research on sub-fossil trees and the construction of new Glacial tree-ring chronologies can significantly improve the temporal resolution and precision of radiocarbon dating back to 55,000 BP and so help resolve puzzles in the history of Human Evolution. This thesis presents the dendrochronological study, the radiocarbon dating and the extraction of environmental and climate information from sub-fossil trees found on the Portuguese foreshore, remnants of a Glacial lagoonal forest. Careful sampling, dendrochronological measurement and cross-dating, the application of the most suitable cellulose extraction protocol and the most advanced technologies of the MICADAS system at ETH-Zurich led to the construction of a new 220-year-long tree-ring site chronology and to high-resolution, highly reliable radiocarbon ages with a tight error range. At present, it is not possible to date this radiocarbon sequence absolutely by comparing the Δ14C of the trees with 10Be fluctuations from the ice cores. For this reason, tree growth analysis, comparisons with a living pine stand and reconstruction of forest-fire history were used to hypothesize site and climate characteristics that help constrain the position in time of the obtained radiocarbon sequence.
Abstract:
Teeth, with their high mineralisation, incremental growth, and lack of remodelling, serve as biological archives that document an individual's development. This project aims to utilise the potential of teeth in bioarchaeological studies to achieve three primary objectives: 1) to investigate the application of histological and histochemical methods in reconstructing developmental bio-chronologies and early life histories; 2) to refine the temporal precision of isotopic analysis of dentine collagen by developing a novel protocol that integrates micro-sampling techniques with high-resolution histomorphometrics; and 3) to synthesise data from enamel and dentine for a comprehensive understanding of early life development and dietary transitions. This study adopts an integrated multidisciplinary bioarchaeological approach, conducting histomorphometric analysis on enamel and dentine across deciduous and permanent dentitions. It applies high-temporal resolution trace element analysis to enamel using LA-ICPMS and δ13C and δ15N isotope analyses through sequential micro-sampling to dentine of permanent teeth. Samples were selected from diverse archaeological contexts across the Italian peninsula, covering the Upper Palaeolithic, Copper Age, and Early Medieval periods, providing insight into diachronic variations in infant development and life history. Findings highlight the efficacy of histological and histochemical techniques in accurately determining growth rates, physiological stress, dietary shifts (particularly timing of weaning), and age at death in infant remains. The consistency and comparison between enamel and dentine underscores the enhanced insight obtained from integrating information from both tissues. Importantly, the newly proposed protocol significantly improves the temporal accuracy of dentine collagen analysis, facilitating precise chronological placement of the results over broad developmental associations. 
This study reaffirms the significance of teeth as valuable bioarchaeological instruments. By introducing and testing multidisciplinary methods, it provides deeper insights into early life history and cultural practices across diverse chronological contexts, highlighting the importance of advanced methodologies in extracting detailed, accurate, and nuanced information from past populations.
Abstract:
Most cognitive functions require the encoding and routing of information across distributed networks of brain regions. Information propagation is typically attributed to physical connections existing between brain regions, and contributes to the formation of spatially correlated activity patterns, known as functional connectivity. While structural connectivity provides the anatomical foundation for neural interactions, the exact manner in which it shapes functional connectivity is complex and not yet fully understood. Additionally, traditional measures of directed functional connectivity only capture the overall correlation between neural activity, and provide no insight on the content of transmitted information, limiting their ability in understanding neural computations underlying the distributed processing of behaviorally-relevant variables. In this work, we first study the relationship between structural and functional connectivity in simulated recurrent spiking neural networks with spike timing dependent plasticity. We use established measures of time-lagged correlation and overall information propagation to infer the temporal evolution of synaptic weights, showing that measures of dynamic functional connectivity can be used to reliably reconstruct the evolution of structural properties of the network. Then, we extend current methods of directed causal communication between brain areas, by deriving an information-theoretic measure of Feature-specific Information Transfer (FIT) quantifying the amount, content and direction of information flow. We test FIT on simulated data, showing its key properties and advantages over traditional measures of overall propagated information. We show applications of FIT to several neural datasets obtained with different recording methods (magneto and electro-encephalography, spiking activity, local field potentials) during various cognitive functions, ranging from sensory perception to decision making and motor learning. 
Overall, these analyses demonstrate the ability of FIT to advance the investigation of communication between brain regions, uncovering the previously unaddressed content of directed information flow.
Abstract:
To detect the presence of male DNA in vaginal samples collected from survivors of sexual violence and stored on filter paper. A pilot study was conducted to evaluate 10 vaginal samples spotted on sterile filter paper: 6 collected at random in April 2009 and 4 in October 2010. Time between sexual assault and sample collection was 4-48 hours. After drying at room temperature, the samples were placed in a sterile envelope and stored for 2-3 years until processing. DNA extraction was confirmed by polymerase chain reaction for human β-globin, and the presence of prostate-specific antigen (PSA) was quantified. The presence of the Y chromosome was detected using primers for sequences in the TSPY (Y7/Y8 and DYS14) and SRY genes. β-Globin was detected in all 10 samples, while 2 samples were positive for PSA. Half of the samples amplified the Y7/Y8 and DYS14 sequences of the TSPY gene and 30% amplified the SRY gene sequence of the Y chromosome. Four male samples and 1 female sample served as controls. Filter-paper spots stored for periods of up to 3 years proved adequate for preserving genetic material from vaginal samples collected following sexual violence.
Abstract:
One of the great challenges for the scientific community working on theories of genetic information, genetic communication and genetic coding is to determine a mathematical structure related to DNA sequences. In this paper we propose a model of an intra-cellular transmission system of genetic information, similar to a model of a power- and bandwidth-efficient digital communication system, in order to identify a mathematical structure in DNA sequences where such sequences are biologically relevant. The model of a transmission system of genetic information is concerned with the identification, reproduction and mathematical classification of the nucleotide sequence of single-stranded DNA by the genetic encoder. Hence, a genetic encoder is devised in which labelings and cyclic codes are established. Establishing the algebraic structure of the corresponding code alphabets, mappings, labelings, primitive polynomials (p(x)) and code generator polynomials (g(x)) is important in characterizing the error-correcting code subclasses of G-linear codes. These latter codes are useful for the identification, reproduction and mathematical classification of DNA sequences. The characterization of this model may contribute to the development of a methodology that can be applied in mutational analysis and polymorphisms, production of new drugs and genetic improvement, among other things, resulting in a reduction of time and laboratory costs.
Abstract:
Originally from Asia, Dovyalis hebecarpa is a dark purple/red exotic berry now also produced in Brazil. However, no reports were found in the literature on phenolic extraction or characterisation of this berry. In this study we evaluate the optimisation of anthocyanin and total phenolic extraction from D. hebecarpa berries, aiming at the development of a simple and mild analytical technique. Multivariate analysis was used to optimise the extraction variables (ethanol:water:acetone solvent proportions, times, and acid concentrations) at different levels. Acetone/water (20/80 v/v) gave the highest anthocyanin extraction yield, but pure water and different proportions of acetone/water or acetone/ethanol/water (with >50% water) were also effective. Neither acid concentration nor time had a significant effect on extraction efficiency, which allowed the recommended parameters to be fixed at the lowest values tested (0.35% formic acid v/v, and 17.6 min). Under optimised conditions, extraction efficiencies increased by 31.5% and 11% for anthocyanins and total phenolics, respectively, compared with traditional methods that use more solvent and time. Thus, the optimised methodology increased yields while being less hazardous and time-consuming than traditional methods. Finally, freeze-dried D. hebecarpa showed a high content of the target phytochemicals (319 mg/100 g and 1,421 mg/100 g of total anthocyanin and total phenolic content, respectively).
Abstract:
The Atlantic rainforest species Ocotea catharinensis, Ocotea odorifera, and Ocotea porosa have been extensively harvested in the past for timber and oil extraction and are currently listed as threatened due to overexploitation. To investigate the genetic diversity and population structure of these species, we developed 8 polymorphic microsatellite markers for O. odorifera from an enriched microsatellite library by using 2 dinucleotide repeats. The microsatellite markers were tested for cross-amplification in O. catharinensis and O. porosa. The average number of alleles per locus was 10.2, considering all loci over 2 populations of O. odorifera. Observed and expected heterozygosities for O. odorifera ranged from 0.39 to 0.93 and from 0.41 to 0.92 across populations, respectively. Cross-amplification of all loci was successfully observed in O. catharinensis and O. porosa, except for 1 locus that was found to lack polymorphism in O. porosa. Combined probabilities of identity in the studied Ocotea species were very low, ranging from 1.0 × 10^-24 to 7.7 × 10^-24. The probability of exclusion over all loci estimated for O. odorifera indicated a 99.9% chance of correctly excluding a random nonparent individual. The microsatellite markers described in this study have high information content and will be useful for further investigations of genetic diversity within these species and for subsequent conservation purposes.
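The diversity statistics reported in this abstract can be illustrated with a short sketch based on allele frequencies. It assumes the textbook definitions: expected heterozygosity as 1 minus the sum of squared allele frequencies (without a small-sample correction), and the probability of identity for unrelated individuals as the sum of p_i^4 plus the sum over allele pairs of (2 p_i p_j)^2, multiplied across loci. The abstract does not state which estimators or software the authors used, and the function names and frequencies below are hypothetical.

```python
from itertools import combinations
from math import prod


def expected_heterozygosity(freqs):
    """Expected heterozygosity at one locus: 1 - sum of squared allele frequencies."""
    return 1.0 - sum(p * p for p in freqs)


def probability_of_identity(freqs):
    """Chance that two unrelated individuals share a genotype at one locus."""
    homozygous = sum(p ** 4 for p in freqs)
    heterozygous = sum((2.0 * p * q) ** 2 for p, q in combinations(freqs, 2))
    return homozygous + heterozygous


def combined_probability_of_identity(loci):
    """Multiply per-locus probabilities, assuming the loci are independent."""
    return prod(probability_of_identity(freqs) for freqs in loci)


# Hypothetical allele frequencies for two loci (made-up numbers).
loci = [
    [0.4, 0.3, 0.2, 0.1],
    [0.25, 0.25, 0.25, 0.25],
]
for freqs in loci:
    print(f"He = {expected_heterozygosity(freqs):.3f}, "
          f"PI = {probability_of_identity(freqs):.4f}")
print(f"Combined PI = {combined_probability_of_identity(loci):.6f}")
```

Because the per-locus values are multiplied, adding polymorphic loci drives the combined probability of identity down quickly, which is why a small marker set can still discriminate individuals.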
Abstract:
The poison frog genus Ameerega (Dendrobatidae) currently contains 32 species. They are distributed from central Brazil through western Amazonia to the lower Andean versant. In addition, three trans-Andean species have been allocated to Ameerega (Andrade et al. 2013; Frost 2014). Ameerega berohoka (Vaz-Silva & Maciel 2011) was described based on specimens from central Brazil (type locality: Arenópolis, GO) and is assumed to occur in parts of the western and southwestern state of Goiás (Frost 2014). More recently, Andrade et al. (2013) extended its distribution to the state of Mato Grosso. Here we re-describe the advertisement call of A. berohoka, providing additional information on its temporal structure and spectral traits. Our observations also constitute a new distribution record for this species in the state of Mato Grosso.
Abstract:
Extraction processes are widely used in the chemical, biotechnological and pharmaceutical industries to recover bioactive compounds from medicinal plants. To replace conventional extraction techniques, new techniques such as high-pressure extraction processes that use environmentally friendly solvents have been developed. However, these techniques are sometimes associated with a low extraction rate. Ultrasound can be used effectively to improve the extraction rate by increasing mass transfer and possibly rupturing the cell wall through the formation of microcavities, leading to higher product yields with reduced processing time and solvent consumption. This review presents a brief survey of the mechanism of ultrasound-assisted extraction and the factors that affect it, focusing on the use of ultrasound irradiation to intensify high-pressure extraction processes.
Abstract:
Purified genomic DNA can be difficult to obtain from some plant species because of the presence of impurities such as polysaccharides, which are often co-extracted with DNA. In this study, we developed a fast, simple, and low-cost protocol for extracting DNA from plants containing high levels of secondary metabolites. This protocol does not require the use of volatile toxic reagents such as mercaptoethanol, chloroform, or phenol and allows the extraction of high-quality DNA from wild and cultivated tropical species.
Abstract:
To assess the completeness and reliability of the Information System on Live Births (Sinasc) data. A cross-sectional analysis of the reliability and completeness of Sinasc data was performed using a sample of Live Birth Certificates (LBCs) from 2009 relating to births in Campinas, Southeast Brazil. For data analysis, hospitals were grouped according to category of service (Unified National Health System, private or both); 600 LBCs were randomly selected, and the data were re-collected on LBC copies from mothers' and newborns' hospital records and by telephone interview. Completeness was evaluated by calculating the percentage of blank fields, and agreement between the original LBCs and the copies was evaluated by Kappa and intraclass correlation coefficients. Completeness of the LBCs ranged from 99.8% to 100%. For most items, agreement was excellent. However, agreement was acceptable for marital status, maternal education and newborn infants' race/color, low for prenatal visits and presence of birth defects, and very low for the number of deceased children. The results showed that the municipality's Sinasc data are reliable for most of the studied variables. Investment in training professionals is suggested in order to improve the system's capacity to support the planning and implementation of health activities for the benefit of the maternal and child population.
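The agreement measure used in this abstract for categorical fields can be sketched as follows. This is a minimal, unweighted Cohen's Kappa between the value recorded on the original certificate and the value on the re-collected copy; the abstract does not say whether a weighted variant was applied to ordinal fields, and the function name and sample data are hypothetical.

```python
from collections import Counter


def cohen_kappa(ratings_a, ratings_b):
    """Unweighted Cohen's Kappa for two equal-length lists of categorical values."""
    if len(ratings_a) != len(ratings_b) or not ratings_a:
        raise ValueError("both lists must be non-empty and of equal length")
    n = len(ratings_a)
    # Proportion of items on which the two sources agree.
    observed = sum(a == b for a, b in zip(ratings_a, ratings_b)) / n
    # Agreement expected by chance from each source's marginal frequencies.
    counts_a = Counter(ratings_a)
    counts_b = Counter(ratings_b)
    expected = sum(counts_a[c] * counts_b[c] for c in counts_a) / (n * n)
    if expected == 1.0:
        # Both sources use a single identical category; treat as full agreement.
        return 1.0
    return (observed - expected) / (1.0 - expected)


# Hypothetical marital-status values from originals and copies (made-up data).
original = ["single", "married", "married", "single", "married", "single"]
copy = ["single", "married", "single", "single", "married", "married"]
print(f"Kappa = {cohen_kappa(original, copy):.3f}")
```

Kappa corrects raw percentage agreement for the agreement expected by chance, so a field dominated by one category can show high raw agreement and still score a low Kappa.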