952 resultados para Dynamic data set visualization


Relevância:

100.00% 100.00%

Publicador:

Resumo:

Idiosyncratic markers are features of genes and genomes that are so unusual that it is unlikely that they evolved more than once in a lineage of organisms. Here we explore further the potential of idiosyncratic markers and changes to typically conserved tRNA sequences for phylogenetic inference. Hard ticks were chosen as the model group because their phylogeny has been studied extensively. Fifty-eight candidate markers from hard ticks ( family Ixodidae) and 22 markers from the subfamily Rhipicephalinae sensu lato were mapped onto phylogenies of these groups. Two of the most interesting markers, features of the secondary structure of two different tRNAs, gave strong support to the hypothesis that species of the Prostriata ( Ixodes spp.) are monophyletic. Previous analyses of genes and morphology did not strongly support this relationship, instead suggesting that the Prostriata is paraphyletic with respect to the Metastriata ( the rest of the hard ticks). Parallel or convergent evolution was not found in the arrangements of mitochondrial genes in ticks nor were there any reversals to the ancestral arthropod character state. Many of the markers identified were phylogenetically informative, whereas others should be informative with study of additional taxa. Idiosyncratic markers and changes to typically conserved nucleotides in tRNAs that are phylogenetically informative were common in this data set, and thus these types of markers might be found in other organisms.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The entire internal transcribed spacer ( ITS) region, including the 5.8S subunit of the nuclear ribosomal DNA ( rDNA), was sequenced by direct double-stranded sequencing of polymerase chain reaction (PCR) amplified fragments. The study included 40 Sporobolus ( Family Poaceae, subfamily Chloridoideae) seed collections from 14 putative species ( all 11 species from the S. indicus complex and three Australian native species). These sequences, along with those from two out-group species [ Pennisetum alopecuroides ( L.) Spreng. and Heteropogon contortus ( L.) P. Beauv. ex Roemer & Schultes, Poaceae, subfamily Panicoideae], were analysed by the parsimony method (PAUP; version 4.0b4a) to infer phylogenetic relationships among these species. The length of the ITS1, 5.8S subunit and ITS2 region were 222, 164 and 218 base pairs ( bp), respectively, in all species of the S. indicus complex, except for the ITS2 region of S. diandrus P. Beauv. individuals, which was 217 bp long. Of the 624 characters included in the analysis, 245 ( 39.3%) of the 330 variable sites contained potential phylogenetic information. Differences in sequences among the members of the S. pyramidalis P. Beauv., S. natalensis (Steud.) Dur & Schinz and S. jacquemontii Kunth. collections were 0%, while differences ranged from 0 to 2% between these and other species of the complex. Similarly, differences in sequences among collections of S. laxus B. K. Simon, S. sessilis B. K. Simon, S. elongatus R. Br. and S. creber De Nardi were 0%, compared with differences of 1-2% between these four species and the rest of the complex. When comparing S. fertilis ( Steud.) Clayton and S. africanus (Poir.) Robyns & Tourney, differences between collections ranged from 0 to 1%. Parsimony analysis grouped all 11 species of the S. indicus complex together, indicating a monophyletic origin. For the entire data set, pair-wise distances among members of the S. indicus complex varied from 0.00 to 1.58%, compared with a range of 20.08-21.44% among species in the complex and the Australian native species studied. A strict consensus phylogenetic tree separated 11 species of the S. indicus complex into five major clades. The phylogeny, based on ITS sequences, was found to be congruent with an earlier study on the taxonomic relationship of the weedy Sporobolus grasses revealed from random amplified polymorphic DNA ( RAPD). However, this cladistic analysis of the complex was not in agreement with that created on past morphological analyses and therefore gives a new insight into the phylogeny of the S. indicus complex.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Fertilizer recommendation to most agricultural crops is based on response curves. Such curves are constructed from field experimental data, obtained for a particular condition and may not be reliable to be applied to other regions. The aim of this study was to develop a Lime and Fertilizer Recommendation System for Coconut Crop based on the nutritional balance. The System considers the expected productivity and plant nutrient use efficiency to estimate nutrient demand, and effective rooting layer, soil nutrient availability, as well as any other nutrient input to estimate the nutrient supply. Comparing the nutrient demand with the nutrient supply the System defines the nutrient balance. If the balance for a given nutrient is negative, lime and, or, fertilization is recommended. On the other hand, if the balance is positive, no lime or fertilizer is needed. For coconut trees, the fertilization regime is divided in three stages: fertilization at the planting spot, band fertilization and fertilization at the production phase. The data set for the development of the System for coconut trees was obtained from the literature. The recommendations generated by the System were compared to those derived from recommendation tables used for coconut crop in Brazil. The main differences between the two procedures were for the P rate applied in the planting hole, which was higher in the proposed System because the tables do not pay heed to the pit volume, whereas the N and K rates were lower. The crop demand for K is very high, and the rates recommended by the System are superior to the table recommendations for the formation and initial production stage. The fertilizer recommendations by the System are higher for the phase of coconut tree growth as compared to the production phase, because greater amount of biomass is produced in the first phase.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The objective of this paper is to provide empirical evidence on the determinants of gender wage inequality in the Portuguese tourism industry. Relying on firm level wage equations and production functions, gender wage and productivity differentials are estimated and then compared in order to infer whether observed gender disparities are justifiable on the grounds that women are relatively less productive than men, or instead disparities are due to gender wage discrimination. This approach is applied to tourism industry data gathered in the matched employer-employee data set Quadros de Pessoal (Employee Records). The main findings indicate that female employees in the tourism industry in Portugal are less productive than their male colleagues and that gender differences in wages are fully explained by gender differences in productivity.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Tourism represents a major economic activity in Portugal, with an enormous wealth and employment growth potential. A significant proportion of jobs in the industry tourism are occupied by women, given that this industry is characterized by a relatively higher percentage of female employees. Despite the evidence of female progress with regard to their role in the Portuguese labor market, women continue to earn less than their male counterparts. This is clearly the case of the tourism industry, where statistics reveal a persistent gender wage gap. The objective of this paper is to provide empirical evidence on the determinants of gender wage inequality in the tourism industry in northern Portugal. Relying on firm-level wage equations and production functions, gender wage and productivity differentials are estimated and then compared. The comparison of these differentials allows inferring whether observed wage disparities are attributable to relatively lower female productivity, or instead disparities are due to gender wage discrimination. This approach is applied to tourism industry data gathered in the matched employer-employee data set Quadros de Pessoal (Employee Records). The main findings indicate that female employees in the tourism industry in northern Portugal are less productive than their male colleagues and that gender differences in wages are fully explained by gender differences in productivity.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

In the last years, it has become increasingly clear that neurodegenerative diseases involve protein aggregation, a process often used as disease progression readout and to develop therapeutic strategies. This work presents an image processing tool to automatic segment, classify and quantify these aggregates and the whole 3D body of the nematode Caenorhabditis Elegans. A total of 150 data set images, containing different slices, were captured with a confocal microscope from animals of distinct genetic conditions. Because of the animals’ transparency, most of the slices pixels appeared dark, hampering their body volume direct reconstruction. Therefore, for each data set, all slices were stacked in one single 2D image in order to determine a volume approximation. The gradient of this image was input to an anisotropic diffusion algorithm that uses the Tukey’s biweight as edge-stopping function. The image histogram median of this outcome was used to dynamically determine a thresholding level, which allows the determination of a smoothed exterior contour of the worm and the medial axis of the worm body from thinning its skeleton. Based on this exterior contour diameter and the medial animal axis, random 3D points were then calculated to produce a volume mesh approximation. The protein aggregations were subsequently segmented based on an iso-value and blended with the resulting volume mesh. The results obtained were consistent with qualitative observations in literature, allowing non-biased, reliable and high throughput protein aggregates quantification. This may lead to a significant improvement on neurodegenerative diseases treatment planning and interventions prevention

Relevância:

100.00% 100.00%

Publicador:

Resumo:

One Plus Sequential Air Sampler—Partisol was placed in a small village (Foros de Arrão) in central Portugal to collect PM10 (particles with an aerodynamic diameter below 10 μm), during the winter period for 3 months (December 2009–March 2010). Particles masses were gravimetrically determined and the filters were analyzed by instrumental neutron activation analysis to assess their chemical composition. The water-soluble ion compositions of the collected particles were determined by Ion-exchange Chromatography. Principal component analysis was applied to the data set of chemical elements and soluble ions to assess the main sources of the air pollutants. The use of both analytical techniques provided information about elemental solubility, such as for potassium, which was important to differentiate sources.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

A organização automática de mensagens de correio electrónico é um desafio actual na área da aprendizagem automática. O número excessivo de mensagens afecta cada vez mais utilizadores, especialmente os que usam o correio electrónico como ferramenta de comunicação e trabalho. Esta tese aborda o problema da organização automática de mensagens de correio electrónico propondo uma solução que tem como objectivo a etiquetagem automática de mensagens. A etiquetagem automática é feita com recurso às pastas de correio electrónico anteriormente criadas pelos utilizadores, tratando-as como etiquetas, e à sugestão de múltiplas etiquetas para cada mensagem (top-N). São estudadas várias técnicas de aprendizagem e os vários campos que compõe uma mensagem de correio electrónico são analisados de forma a determinar a sua adequação como elementos de classificação. O foco deste trabalho recai sobre os campos textuais (o assunto e o corpo das mensagens), estudando-se diferentes formas de representação, selecção de características e algoritmos de classificação. É ainda efectuada a avaliação dos campos de participantes através de algoritmos de classificação que os representam usando o modelo vectorial ou como um grafo. Os vários campos são combinados para classificação utilizando a técnica de combinação de classificadores Votação por Maioria. Os testes são efectuados com um subconjunto de mensagens de correio electrónico da Enron e um conjunto de dados privados disponibilizados pelo Institute for Systems and Technologies of Information, Control and Communication (INSTICC). Estes conjuntos são analisados de forma a perceber as características dos dados. A avaliação do sistema é realizada através da percentagem de acerto dos classificadores. Os resultados obtidos apresentam melhorias significativas em comparação com os trabalhos relacionados.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

This paper seeks to study the persistence in the G7’s stock market volatility, which is carried out using the GARCH, IGARCH and FIGARCH models. The data set consists of the daily returns of the S&P/TSX 60, CAC 40, DAX 30, MIB 30, NIKKEI 225, FTSE 100 and S&P 500 indexes over the period 1999-2009. The results evidences long memory in volatility, which is more pronounced in Germany, Italy and France. On the other hand, Japan appears as the country where this phenomenon is less obvious; nevertheless, the persistence prevails but with minor intensity.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The portfolio generating the iTraxx EUR index is modeled by coupled Markov chains. Each of the industries of the portfolio evolves according to its own Markov transition matrix. Using a variant of the method of moments, the model parameters are estimated from a data set of Standard and Poor's. Swap spreads are evaluated by Monte-Carlo simulations. Along with an actuarially fair spread, at least squares spread is considered.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Financial literature and financial industry use often zero coupon yield curves as input for testing hypotheses, pricing assets or managing risk. They assume this provided data as accurate. We analyse implications of the methodology and of the sample selection criteria used to estimate the zero coupon bond yield term structure on the resulting volatility of spot rates with different maturities. We obtain the volatility term structure using historical volatilities and Egarch volatilities. As input for these volatilities we consider our own spot rates estimation from GovPX bond data and three popular interest rates data sets: from the Federal Reserve Board, from the US Department of the Treasury (H15), and from Bloomberg. We find strong evidence that the resulting zero coupon bond yield volatility estimates as well as the correlation coefficients among spot and forward rates depend significantly on the data set. We observe relevant differences in economic terms when volatilities are used to price derivatives.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

This study explores a large set of OC and EC measurements in PM(10) and PM(2.5) aerosol samples, undertaken with a long term constant analytical methodology, to evaluate the capability of the OC/EC minimum ratio to represent the ratio between the OC and EC aerosol components resulting from fossil fuel combustion (OC(ff)/EC(ff)). The data set covers a wide geographical area in Europe, but with a particular focus upon Portugal, Spain and the United Kingdom, and includes a great variety of sites: urban (background, kerbside and tunnel), industrial, rural and remote. The highest minimum ratios were found in samples from remote and rural sites. Urban background sites have shown spatially and temporally consistent minimum ratios, of around 1.0 for PM(10) and 0.7 for PM(2.5).The consistency of results has suggested that the method could be used as a tool to derive the ratio between OC and EC from fossil fuel combustion and consequently to differentiate OC from primary and secondary sources. To explore this capability, OC and EC measurements were performed in a busy roadway tunnel in central Lisbon. The OC/EC ratio, which reflected the composition of vehicle combustion emissions, was in the range of 03-0.4. Ratios of OC/EC in roadside increment air (roadside minus urban background) in Birmingham, UK also lie within the range 03-0.4. Additional measurements were performed under heavy traffic conditions at two double kerbside sites located in the centre of Lisbon and Madrid. The OC/EC minimum ratios observed at both sites were found to be between those of the tunnel and those of urban background air, suggesting that minimum values commonly obtained for this parameter in open urban atmospheres over-predict the direct emissions of OC(ff) from road transport. Possible reasons for this discrepancy are explored. (C) 2011 Elsevier Ltd. All rights reserved.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

This work describes a methodology to extract symbolic rules from trained neural networks. In our approach, patterns on the network are codified using formulas on a Lukasiewicz logic. For this we take advantage of the fact that every connective in this multi-valued logic can be evaluated by a neuron in an artificial network having, by activation function the identity truncated to zero and one. This fact simplifies symbolic rule extraction and allows the easy injection of formulas into a network architecture. We trained this type of neural network using a back-propagation algorithm based on Levenderg-Marquardt algorithm, where in each learning iteration, we restricted the knowledge dissemination in the network structure. This makes the descriptive power of produced neural networks similar to the descriptive power of Lukasiewicz logic language, minimizing the information loss on the translation between connectionist and symbolic structures. To avoid redundance on the generated network, the method simplifies them in a pruning phase, using the "Optimal Brain Surgeon" algorithm. We tested this method on the task of finding the formula used on the generation of a given truth table. For real data tests, we selected the Mushrooms data set, available on the UCI Machine Learning Repository.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Mestrado em Engenharia Informática

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Audiometer systems provide enormous amounts of detailed TV watching data. Several relevant and interdependent factors may influence TV viewers' behavior. In this work we focus on the time factor and derive Temporal Patterns of TV watching, based on panel data. Clustering base attributes are originated from 1440 binary minute-related attributes, capturing the TV watching status (watch/not watch). Since there are around 2500 panel viewers a data reduction procedure is first performed. K-Means algorithm is used to obtain daily clusters of viewers. Weekly patterns are then derived which rely on daily patterns. The obtained solutions are tested for consistency and stability. Temporal TV watching patterns provide new insights concerning Portuguese TV viewers' behavior.