968 results for Audio Data set
Abstract:
Idiosyncratic markers are features of genes and genomes that are so unusual that it is unlikely that they evolved more than once in a lineage of organisms. Here we explore further the potential of idiosyncratic markers and changes to typically conserved tRNA sequences for phylogenetic inference. Hard ticks were chosen as the model group because their phylogeny has been studied extensively. Fifty-eight candidate markers from hard ticks (family Ixodidae) and 22 markers from the subfamily Rhipicephalinae sensu lato were mapped onto phylogenies of these groups. Two of the most interesting markers, features of the secondary structure of two different tRNAs, gave strong support to the hypothesis that species of the Prostriata (Ixodes spp.) are monophyletic. Previous analyses of genes and morphology did not strongly support this relationship, instead suggesting that the Prostriata is paraphyletic with respect to the Metastriata (the rest of the hard ticks). Parallel or convergent evolution was not found in the arrangements of mitochondrial genes in ticks, nor were there any reversals to the ancestral arthropod character state. Many of the markers identified were phylogenetically informative, whereas others should be informative with study of additional taxa. Idiosyncratic markers and changes to typically conserved nucleotides in tRNAs that are phylogenetically informative were common in this data set, and thus these types of markers might be found in other organisms.
Abstract:
The entire internal transcribed spacer (ITS) region, including the 5.8S subunit of the nuclear ribosomal DNA (rDNA), was sequenced by direct double-stranded sequencing of polymerase chain reaction (PCR) amplified fragments. The study included 40 Sporobolus (family Poaceae, subfamily Chloridoideae) seed collections from 14 putative species (all 11 species from the S. indicus complex and three Australian native species). These sequences, along with those from two out-group species [Pennisetum alopecuroides (L.) Spreng. and Heteropogon contortus (L.) P. Beauv. ex Roemer & Schultes, Poaceae, subfamily Panicoideae], were analysed by the parsimony method (PAUP; version 4.0b4a) to infer phylogenetic relationships among these species. The lengths of the ITS1, 5.8S subunit and ITS2 regions were 222, 164 and 218 base pairs (bp), respectively, in all species of the S. indicus complex, except for the ITS2 region of S. diandrus P. Beauv. individuals, which was 217 bp long. Of the 624 characters included in the analysis, 245 (39.3%) of the 330 variable sites contained potential phylogenetic information. Differences in sequences among the members of the S. pyramidalis P. Beauv., S. natalensis (Steud.) Dur & Schinz and S. jacquemontii Kunth. collections were 0%, while differences ranged from 0 to 2% between these and other species of the complex. Similarly, differences in sequences among collections of S. laxus B. K. Simon, S. sessilis B. K. Simon, S. elongatus R. Br. and S. creber De Nardi were 0%, compared with differences of 1-2% between these four species and the rest of the complex. When comparing S. fertilis (Steud.) Clayton and S. africanus (Poir.) Robyns & Tourney, differences between collections ranged from 0 to 1%. Parsimony analysis grouped all 11 species of the S. indicus complex together, indicating a monophyletic origin. For the entire data set, pair-wise distances among members of the S. indicus complex varied from 0.00 to 1.58%, compared with a range of 20.08-21.44% among species in the complex and the Australian native species studied. A strict consensus phylogenetic tree separated the 11 species of the S. indicus complex into five major clades. The phylogeny, based on ITS sequences, was found to be congruent with an earlier study on the taxonomic relationship of the weedy Sporobolus grasses revealed from random amplified polymorphic DNA (RAPD). However, this cladistic analysis of the complex was not in agreement with that based on past morphological analyses and therefore gives a new insight into the phylogeny of the S. indicus complex.
Abstract:
Fertilizer recommendations for most agricultural crops are based on response curves. Such curves are constructed from field experimental data obtained for a particular condition and may not be reliable when applied to other regions. The aim of this study was to develop a Lime and Fertilizer Recommendation System for Coconut Crop based on the nutritional balance. To estimate nutrient demand, the System considers the expected productivity and the plant nutrient use efficiency; to estimate nutrient supply, it considers the effective rooting layer, soil nutrient availability and any other nutrient input. By comparing the nutrient demand with the nutrient supply, the System defines the nutrient balance. If the balance for a given nutrient is negative, liming and/or fertilization is recommended. On the other hand, if the balance is positive, no lime or fertilizer is needed. For coconut trees, the fertilization regime is divided into three stages: fertilization at the planting spot, band fertilization and fertilization at the production phase. The data set for the development of the System for coconut trees was obtained from the literature. The recommendations generated by the System were compared to those derived from recommendation tables used for coconut crop in Brazil. The main differences between the two procedures were for the P rate applied in the planting hole, which was higher in the proposed System because the tables do not account for the pit volume, whereas the N and K rates were lower. The crop demand for K is very high, and the rates recommended by the System are higher than the table recommendations for the formation and initial production stage. The fertilizer recommendations by the System are higher for the phase of coconut tree growth than for the production phase, because a greater amount of biomass is produced in the first phase.
Abstract:
The objective of this paper is to provide empirical evidence on the determinants of gender wage inequality in the Portuguese tourism industry. Relying on firm level wage equations and production functions, gender wage and productivity differentials are estimated and then compared in order to infer whether observed gender disparities are justifiable on the grounds that women are relatively less productive than men, or instead disparities are due to gender wage discrimination. This approach is applied to tourism industry data gathered in the matched employer-employee data set Quadros de Pessoal (Employee Records). The main findings indicate that female employees in the tourism industry in Portugal are less productive than their male colleagues and that gender differences in wages are fully explained by gender differences in productivity.
Abstract:
Tourism represents a major economic activity in Portugal, with an enormous wealth and employment growth potential. A significant proportion of jobs in the tourism industry are occupied by women, given that this industry is characterized by a relatively higher percentage of female employees. Despite the evidence of female progress with regard to their role in the Portuguese labor market, women continue to earn less than their male counterparts. This is clearly the case in the tourism industry, where statistics reveal a persistent gender wage gap. The objective of this paper is to provide empirical evidence on the determinants of gender wage inequality in the tourism industry in northern Portugal. Relying on firm-level wage equations and production functions, gender wage and productivity differentials are estimated and then compared. Comparing these differentials makes it possible to infer whether observed wage disparities are attributable to relatively lower female productivity, or are instead due to gender wage discrimination. This approach is applied to tourism industry data gathered in the matched employer-employee data set Quadros de Pessoal (Employee Records). The main findings indicate that female employees in the tourism industry in northern Portugal are less productive than their male colleagues and that gender differences in wages are fully explained by gender differences in productivity.
Abstract:
In recent years, it has become increasingly clear that neurodegenerative diseases involve protein aggregation, a process often used as a readout of disease progression and as a basis for developing therapeutic strategies. This work presents an image processing tool to automatically segment, classify and quantify these aggregates and the whole 3D body of the nematode Caenorhabditis elegans. A total of 150 image data sets, each containing different slices, were captured with a confocal microscope from animals of distinct genetic conditions. Because of the animals' transparency, most of the slice pixels appeared dark, hampering direct reconstruction of the body volume. Therefore, for each data set, all slices were stacked into a single 2D image in order to determine a volume approximation. The gradient of this image was input to an anisotropic diffusion algorithm that uses Tukey's biweight as the edge-stopping function. The median of the image histogram of this outcome was used to dynamically determine a thresholding level, which allows the determination of a smoothed exterior contour of the worm and, by thinning its skeleton, the medial axis of the worm body. Based on the diameter of this exterior contour and the medial axis of the animal, random 3D points were then calculated to produce a volume mesh approximation. The protein aggregates were subsequently segmented based on an iso-value and blended with the resulting volume mesh. The results obtained were consistent with qualitative observations in the literature, allowing unbiased, reliable and high-throughput quantification of protein aggregates. This may lead to a significant improvement in treatment planning and preventive interventions for neurodegenerative diseases.
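As an illustration of the edge-stopping idea mentioned in this abstract, the following is a minimal sketch of diffusion with Tukey's biweight on a 1D signal. It is not the authors' tool: the function names, the 0.5 scaling factor (one common formulation), the step size and the reduction to one dimension are all assumptions made here for brevity. The point it shows is that differences larger than `sigma` receive zero weight, so smoothing stops at strong edges.

```python
def tukey_biweight(x, sigma):
    """Tukey's biweight edge-stopping weight; zero when |x| exceeds sigma."""
    if abs(x) > sigma:
        return 0.0
    return 0.5 * (1.0 - (x / sigma) ** 2) ** 2


def diffuse_1d(signal, sigma, step=0.25, iterations=10):
    """Explicit diffusion on a 1D signal (hypothetical helper).

    Each sample moves toward its neighbours, weighted by the
    edge-stopping function, so jumps larger than sigma are preserved.
    """
    u = list(signal)
    for _ in range(iterations):
        new = u[:]
        for i in range(len(u)):
            left = u[i - 1] - u[i] if i > 0 else 0.0
            right = u[i + 1] - u[i] if i < len(u) - 1 else 0.0
            new[i] = u[i] + step * (
                tukey_biweight(left, sigma) * left
                + tukey_biweight(right, sigma) * right
            )
        u = new
    return u


# Small bump next to a large step: the bump is smoothed, the step is kept.
smoothed = diffuse_1d([0.0, 0.1, 0.0, 10.0, 10.1, 10.0], sigma=1.0)
```

In a 2D setting the same weight would be applied to the differences toward each neighbouring pixel; the choice of `sigma` controls which gradients count as edges.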
Abstract:
One Partisol Plus sequential air sampler was placed in a small village (Foros de Arrão) in central Portugal to collect PM10 (particles with an aerodynamic diameter below 10 μm) during the winter period for 3 months (December 2009 to March 2010). Particle masses were determined gravimetrically and the filters were analyzed by instrumental neutron activation analysis to assess their chemical composition. The water-soluble ion composition of the collected particles was determined by ion-exchange chromatography. Principal component analysis was applied to the data set of chemical elements and soluble ions to assess the main sources of the air pollutants. The use of both analytical techniques provided information about elemental solubility, such as for potassium, which was important to differentiate sources.
Abstract:
The automatic organization of e-mail messages is a current challenge in the field of machine learning. The excessive number of messages affects more and more users, especially those who use e-mail as a communication and work tool. This thesis addresses the problem of automatically organizing e-mail messages by proposing a solution aimed at the automatic tagging of messages. Automatic tagging relies on the e-mail folders previously created by users, which are treated as tags, and on the suggestion of multiple tags for each message (top-N). Several learning techniques are studied, and the various fields that make up an e-mail message are analysed to determine their suitability as classification elements. The focus of this work is on the textual fields (the subject and body of the messages), for which different forms of representation, feature selection and classification algorithms are studied. The participant fields are also evaluated using classification algorithms that represent them with the vector space model or as a graph. The various fields are combined for classification using the Majority Voting classifier combination technique. The tests are carried out with a subset of Enron e-mail messages and a private data set provided by the Institute for Systems and Technologies of Information, Control and Communication (INSTICC). These data sets are analysed in order to understand the characteristics of the data. The system is evaluated using the accuracy of the classifiers. The results obtained show significant improvements compared with related work.
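The Majority Voting combination (Votação por Maioria) named in this abstract can be sketched as follows. This is only an illustration under assumptions: the abstract does not say how ties are broken or how the top-N suggestions are merged across fields, so the function names and the tie rule (first label seen wins) are hypothetical choices made here.

```python
from collections import Counter


def majority_vote(predictions):
    """Return the label chosen by the most per-field classifiers.

    Ties go to the label seen first, an arbitrary rule assumed here.
    """
    return Counter(predictions).most_common(1)[0][0]


def top_n_labels(predictions, n):
    """Return the n most frequently predicted labels, most frequent first."""
    return [label for label, _ in Counter(predictions).most_common(n)]


# Hypothetical predictions from subject, body and participants classifiers.
folder = majority_vote(["inbox/work", "inbox/work", "personal"])
```

Each entry in `predictions` stands for the folder suggested by one field-specific classifier; the combined suggestion is the folder that most of them agree on.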
Abstract:
The crustal and lithospheric mantle structure at the south segment of the west Iberian margin was investigated along a 370 km long seismic transect. The transect goes from unthinned continental crust onshore to oceanic crust, crossing the ocean-continent transition (OCT) zone. The wide-angle data set includes recordings from 6 OBSs and 2 inland seismic stations. Kinematic and dynamic modeling provided a 2D velocity model that proved to be consistent with the modeled free-air anomaly data. The interpretation of coincident multi-channel near-vertical and wide-angle reflection data sets allowed the identification of four main crustal domains: (i) continental (east of 9.4°W); (ii) continental thinning (9.4°W to 9.7°W); (iii) transitional (9.7°W to ~10.5°W); and (iv) oceanic (west of ~10.5°W). In the continental domain the complete crustal section of slightly thinned continental crust is present. The upper (UCC, 5.1-6.0 km/s) and the lower continental crust (LCC, 6.9-7.2 km/s) are seismically reflective and have intermediate to low P-wave velocity gradients. The middle continental crust (MCC, 6.35-6.45 km/s) is generally unreflective with a low velocity gradient. The main thinning of the continental crust occurs in the thinning domain by attenuation of the UCC and the LCC. Major thinning of the MCC starts to the west of the LCC pinchout point, where it rests directly upon the mantle. In the thinning domain the Moho slope is at least 13 degrees and the continental crust thickness decreases seaward from 22 to 11 km over a distance of ~35 km, stretched by a factor of 1.5 to 3. In the oceanic domain a two-layer high-gradient igneous crust (5.3-6.0 km/s; 6.5-7.4 km/s) was modeled. The intra-crustal interface correlates with prominent mid-basement, 10-15 km long reflections in the multi-channel seismic profile. Strong secondary reflected PmP phases require a first-order discontinuity at the Moho. The sedimentary cover can be as thick as 5 km and the igneous crustal thickness varies from 4 to 11 km in the west, where the profile reaches the Madeira-Tore Rise. In the transitional domain the crust has a complex structure that varies both horizontally and vertically. Beneath the continental slope it includes exhumed continental crust (6.15-6.45 km/s). Strong diffractions were modeled to originate at the lower interface of this layer. The western segment of this transitional domain is highly reflective at all levels, probably due to dykes and sills, according to the high apparent susceptibility and density modeled at this location. Sub-Moho mantle velocity is found to be 8.0 km/s, but velocities smaller than 8.0 km/s confined to short segments are not excluded by the data. Strong P-wave wide-angle reflections are modeled to originate at a depth of 20 km within the lithospheric mantle, under the eastern segment of the oceanic domain, or even deeper at the transitional domain, suggesting a layered structure for the lithospheric mantle. Both interface depths and velocities of the continental section are in good agreement with the conjugate Newfoundland margin. A ~40 km wide OCT having a geophysical signature distinct from the OCT to the north favors a two-pulse continental breakup.
Abstract:
This paper studies the persistence of stock market volatility in the G7 countries using the GARCH, IGARCH and FIGARCH models. The data set consists of the daily returns of the S&P/TSX 60, CAC 40, DAX 30, MIB 30, NIKKEI 225, FTSE 100 and S&P 500 indexes over the period 1999-2009. The results show evidence of long memory in volatility, which is more pronounced in Germany, Italy and France. Japan, on the other hand, is the country where this phenomenon is least evident; persistence is still present there, but with lower intensity.
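To make the notion of volatility persistence concrete, here is a minimal sketch of the standard GARCH(1,1) conditional variance recursion, sigma2[t] = omega + alpha * r[t-1]**2 + beta * sigma2[t-1], where alpha + beta measures persistence and alpha + beta = 1 corresponds to the IGARCH case. The parameter values, the function name and the initialisation rule are assumptions for illustration; the paper's estimates and its FIGARCH specification are not reproduced here.

```python
def garch11_variance(returns, omega, alpha, beta):
    """Conditional variances of a GARCH(1,1) model for given parameters.

    The first variance is the unconditional variance omega / (1 - alpha - beta)
    when alpha + beta < 1, otherwise the sample mean of squared returns
    (an initialisation assumed here).
    """
    persistence = alpha + beta
    if persistence < 1.0:
        var = omega / (1.0 - persistence)
    else:
        var = sum(r * r for r in returns) / len(returns)
    variances = [var]
    for r in returns[:-1]:
        var = omega + alpha * r * r + beta * var
        variances.append(var)
    return variances


# Hypothetical parameters: a large return raises the next day's variance.
path = garch11_variance([2.0, 0.0, 0.0], omega=0.1, alpha=0.1, beta=0.8)
```

With alpha + beta close to 1, the effect of a shock on the variance decays slowly, which is the sense in which volatility is described as persistent.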
Abstract:
The portfolio generating the iTraxx EUR index is modeled by coupled Markov chains. Each of the industries of the portfolio evolves according to its own Markov transition matrix. Using a variant of the method of moments, the model parameters are estimated from a Standard and Poor's data set. Swap spreads are evaluated by Monte Carlo simulations. Along with an actuarially fair spread, a least squares spread is considered.
Abstract:
The financial literature and the financial industry often use zero coupon yield curves as input for testing hypotheses, pricing assets or managing risk. They assume the data provided to be accurate. We analyse the implications of the methodology and of the sample selection criteria used to estimate the zero coupon bond yield term structure for the resulting volatility of spot rates with different maturities. We obtain the volatility term structure using historical volatilities and EGARCH volatilities. As input for these volatilities we consider our own spot rate estimates from GovPX bond data and three popular interest rate data sets: from the Federal Reserve Board, from the US Department of the Treasury (H15), and from Bloomberg. We find strong evidence that the resulting zero coupon bond yield volatility estimates, as well as the correlation coefficients among spot and forward rates, depend significantly on the data set. We observe relevant differences in economic terms when volatilities are used to price derivatives.
Abstract:
This study explores a large set of OC and EC measurements in PM(10) and PM(2.5) aerosol samples, undertaken with a long-term constant analytical methodology, to evaluate the capability of the OC/EC minimum ratio to represent the ratio between the OC and EC aerosol components resulting from fossil fuel combustion (OC(ff)/EC(ff)). The data set covers a wide geographical area in Europe, but with a particular focus upon Portugal, Spain and the United Kingdom, and includes a great variety of sites: urban (background, kerbside and tunnel), industrial, rural and remote. The highest minimum ratios were found in samples from remote and rural sites. Urban background sites have shown spatially and temporally consistent minimum ratios, of around 1.0 for PM(10) and 0.7 for PM(2.5). The consistency of results has suggested that the method could be used as a tool to derive the ratio between OC and EC from fossil fuel combustion and consequently to differentiate OC from primary and secondary sources. To explore this capability, OC and EC measurements were performed in a busy roadway tunnel in central Lisbon. The OC/EC ratio, which reflected the composition of vehicle combustion emissions, was in the range of 0.3-0.4. Ratios of OC/EC in roadside increment air (roadside minus urban background) in Birmingham, UK also lie within the range 0.3-0.4. Additional measurements were performed under heavy traffic conditions at two double kerbside sites located in the centre of Lisbon and Madrid. The OC/EC minimum ratios observed at both sites were found to be between those of the tunnel and those of urban background air, suggesting that minimum values commonly obtained for this parameter in open urban atmospheres over-predict the direct emissions of OC(ff) from road transport. Possible reasons for this discrepancy are explored. © 2011 Elsevier Ltd. All rights reserved.
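The way a minimum OC/EC ratio can be used to separate primary from secondary OC can be sketched as below. The abstract does not state the formula, so this follows the commonly used EC-tracer form, secondary OC = OC - EC * (OC/EC)min, as an assumption; the function names and the sample values are hypothetical.

```python
def min_oc_ec_ratio(samples):
    """Smallest OC/EC ratio over (oc, ec) pairs with non-zero EC."""
    return min(oc / ec for oc, ec in samples if ec > 0)


def secondary_oc(samples, primary_ratio):
    """OC not accounted for by EC * primary_ratio, floored at zero."""
    return [max(0.0, oc - ec * primary_ratio) for oc, ec in samples]


# Hypothetical (OC, EC) concentrations, in the same mass units.
data = [(4.0, 2.0), (3.0, 3.0), (6.0, 2.0)]
ratio = min_oc_ec_ratio(data)
secondary = secondary_oc(data, ratio)
```

If the minimum ratio observed in open urban air is larger than the ratio of direct vehicle emissions, as the abstract suggests, this estimate would attribute too much OC to primary sources.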
Abstract:
This work describes a methodology to extract symbolic rules from trained neural networks. In our approach, patterns in the network are codified using formulas of a Lukasiewicz logic. For this we take advantage of the fact that every connective in this multi-valued logic can be evaluated by a neuron in an artificial network whose activation function is the identity truncated to zero and one. This fact simplifies symbolic rule extraction and allows the easy injection of formulas into a network architecture. We trained this type of neural network using a back-propagation algorithm based on the Levenberg-Marquardt algorithm, where in each learning iteration we restricted the knowledge dissemination in the network structure. This makes the descriptive power of the produced neural networks similar to the descriptive power of the Lukasiewicz logic language, minimizing the information loss in the translation between connectionist and symbolic structures. To avoid redundancy in the generated networks, the method simplifies them in a pruning phase, using the "Optimal Brain Surgeon" algorithm. We tested this method on the task of finding the formula used in the generation of a given truth table. For real data tests, we selected the Mushrooms data set, available from the UCI Machine Learning Repository.
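The statement that each Lukasiewicz connective can be evaluated by a single neuron with a truncated identity activation can be checked with a small sketch. The weights and biases below follow the standard definitions of the connectives; the function names are hypothetical and this is not the authors' network code.

```python
def truncated_identity(x):
    """Identity activation clipped to the interval [0, 1]."""
    return max(0.0, min(1.0, x))


def neuron(weights, bias, inputs):
    """Weighted sum plus bias, passed through the truncated identity."""
    return truncated_identity(sum(w * v for w, v in zip(weights, inputs)) + bias)


def luk_and(x, y):
    """Strong conjunction: max(0, x + y - 1)."""
    return neuron([1, 1], -1, [x, y])


def luk_or(x, y):
    """Strong disjunction: min(1, x + y)."""
    return neuron([1, 1], 0, [x, y])


def luk_implies(x, y):
    """Implication: min(1, 1 - x + y)."""
    return neuron([-1, 1], 1, [x, y])


def luk_not(x):
    """Negation: 1 - x."""
    return neuron([-1], 1, [x])
```

Because every connective is a neuron with integer weights, a formula corresponds to a small feed-forward network, and reading a trained network back as a formula amounts to recognising these weight patterns.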
Abstract:
Master's in Informatics Engineering