909 resultados para EXPLORATORY DATA ANALYSIS


Relevância:

100.00% 100.00%

Publicador:

Resumo:

LHC experiments produce an enormous amount of data, estimated of the order of a few PetaBytes per year. Data management takes place using the Worldwide LHC Computing Grid (WLCG) grid infrastructure, both for storage and processing operations. However, in recent years, many more resources are available on High Performance Computing (HPC) farms, which generally have many computing nodes with a high number of processors. Large collaborations are working to use these resources in the most efficient way, compatibly with the constraints imposed by computing models (data distributed on the Grid, authentication, software dependencies, etc.). The aim of this thesis project is to develop a software framework that allows users to process a typical data analysis workflow of the ATLAS experiment on HPC systems. The developed analysis framework shall be deployed on the computing resources of the Open Physics Hub project and on the CINECA Marconi100 cluster, in view of the switch-on of the Leonardo supercomputer, foreseen in 2023.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Il rilevatore Probe for LUminosity MEasurement (PLUME) è un luminometro per l’esperimento LHCb al CERN. Fornirà misurazioni istantanee della luminosità per LHCb durante la Run 3 a LHC. L’obiettivo di questa tesi è di valutare, con dati simulati, le prestazioni attese di PLUME, come l’occupanza dei PMT che compongono il rivelatore, e riportare l’analisi dei primi dati ottenuti da PLUME durante uno scan di Van der Meer. In particolare, sono state ottenuti tre misure del valore della sezione d’urto, necessarie per tarare il rivelatore, ovvero σ1Da = (1.14 ± 0.11) mb, σ1Db = (1.13 ± 0.10) mb, σ2D = (1.20 ± 0.02) mb, dove i pedici 1D e 2D corrispondono a uno scan di Van der Meer unidimensionale e bidimensionale. Tutti i risultati sono in accordo tra loro.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The thesis is the result of work conducted during a period of six months at the Strategy department of Automobili Lamborghini S.p.A. in Sant'Agata Bolognese (BO) and concerns the study and analysis of Big Data relating to Lamborghini's connected cars. The Big Data is a project of Connected Car Project House, that is an inter-departmental team which works toward the definition of the Lamborghini corporate connectivity strategy and its implementation in the product portfolio. The Data of the connected cars is one of the hottest topics right now in the automotive industry; in fact, all the largest automotive companies are investi,ng a lot in this direction, in order to derive the greatest advantages both from a purely economic point of view, because from these data you can understand a lot the behaviors and habits of each driver, and from a technological point of view because it will increasingly promote the development of 5G that will be an important enabler for the future of connectivity. The main purpose of the work by Lamborghini prospective is to analyze the data of the connected cars, in particular a data-set referred to connected Huracans that had been already placed on the market, and, starting from that point, derive valuable Key Performance Indicators (KPIs) on which the company could partly base the decisions to be made in the near future. The key result that we have obtained at the end of this period was the creation of a Dashboard, in which is possible to visualize many parameters and indicators both related to driving habits and the use of the vehicle itself, which has brought great insights on the huge potential and value that is present behind the study of these data. The final Demo of the project has received great interest, not only from the whole strategy department but also from all the other business areas of Lamborghini, making mostly a great awareness that this will be the road to follow in the coming years.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

I principi Agile, pubblicati nell’omonimo Manifesto più di 20 anni fa, al giorno d’oggi sono declinati in una moltitudine di framework: Scrum, XP, Kanban, Lean, Adaptive, Crystal, etc. Nella prima parte della tesi (Capitoli 1 e 2) sono stati descritti alcuni di questi framework e si è analizzato come un approccio Agile è utilizzato nella pratica in uno specifico caso d’uso: lo sviluppo di una piattaforma software a supporto di un sistema di e-grocery da parte di un team di lab51. Si sono verificate le differenze e le similitudini rispetto alcuni metodi Agile formalizzati in letteratura spiegando le motivazioni che hanno portato a differenziarsi da questi framework illustrando i vantaggi per il team. Nella seconda parte della tesi (Capitoli 3 e 4) è stata effettuata un’analisi dei dati raccolti dal supermercato online negli ultimi anni con l’obiettivo di migliorare l’algoritmo di riordino. In particolare, per prevedere le vendite dei singoli prodotti al fine di avere degli ordini più adeguati in quantità e frequenza, sono stati studiati vari approcci: dai modelli statistici di time series forecasting, alle reti neurali, fino ad una metodologia sviluppata ad hoc.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

There are many natural events that can negatively affect the urban ecosystem, but weather-climate variations are certainly among the most significant. The history of settlements has been characterized by extreme events like earthquakes and floods, which repeat themselves at different times, causing extensive damage to the built heritage on a structural and urban scale. Changes in climate also alter various climatic subsystems, changing rainfall regimes and hydrological cycles, increasing the frequency and intensity of extreme precipitation events (heavy rainfall).  From an hydrological risk perspective, it is crucial to understand future events that could occur and their magnitude in order to design safer infrastructures. Unfortunately, it is not easy to understand future scenarios as the complexity of climate is enormous.  For this thesis, precipitation and discharge extremes were primarily used as data sources. It is important to underline that the two data sets are not separated: changes in rainfall regime, due to climate change, could significantly affect overflows into receiving water bodies. It is imperative that we understand and model climate change effects on water structures to support the development of adaptation strategies.   The main purpose of this thesis is to search for suitable water structures for a road located along the Tione River. Therefore, through the analysis of the area from a hydrological point of view, we aim to guarantee the safety of the infrastructure over time.   The observations made have the purpose to underline how models such as a stochastic one can improve the quality of an analysis for design purposes, and influence choices.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

OBJECTIVE: Previous experiments showed that caffeine blocks the development of Aedes aegypti (Diptera, Culicidae) in the larval stage, consequently inhibiting the production of adults. This study aimed at obtaining data suggestive of caffeine resistance by these mosquitoes. METHODS: Experiments were carried out in successive generations to assess adult production from eggs laid in previous generation and oviposition rate in every generation using 200 and 500 µg/mL caffeine. Tap water was used as control. Experiments were conducted in the city of São José do Rio Preto, Southeastern Brazil between 2002 and 2005. Statistical tests consisted of exploratory data analysis and smoothing algorithms. RESULTS: Increasing reduction in productivity of adults occurred among generations at both caffeine concentrations but the differences were only significant at 200µg/mL caffeine. As for the oviposition rate, there was a decrease in the mean number of eggs per female over generations at both caffeine concentrations. CONCLUSIONS: There was no evidence of caffeine resistance over generations. The study results corroborate caffeine as an alternative as an important Ae. Aegypti control agent to avoid resistance.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Tese de Doutoramento em Engenharia Civil.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Adding Omega fatty acids (ω) 3 to the diet of stud bucks, the quality of sperm and their resistance to cryopreservation could increase. The aim of this study is to determine the effect of supplementation with ω3 on the metabolic state, sperm quality and resistance to freezing, in bucks kept in confinement under natural photoperiod. The experiment will be conducted in the facilities of the Faculty of Agronomy and Veterinary, UNRC (National University of Río Cuarto). Ten Anglo Nubian adult bucks, trained for semen collection with artificial vagina will be used. Males will be randomly allocated into 2 groups (5 animals each): control (C) and treatment (T). During the breeding season, group C will be fed with a ration of alfalfa and ground corn, according to the requirements for each category and sex (NRC, 2007). Group T will receive the same diet with the addition of linseeds. Both will have free access to water. Every week, semen of each buck, will be collected, evaluated and frozen. Sperm quality “in vitro” after thawing will be studied with a digital image analyzer. To assess oxidative stress in fresh and cryopreserved semen, levels of thiobarbituric acid reactive substances (TBARS) and quantification of the activity of superoxide dismutase (SOD) and catalase (CAT) will be determined. To establish the metabolic state, blood samples will be collected every two weeks. The statistical analysis will include an exploratory data analysis, multivariate analysis of multiple correspondences on a completely randomized design, analysis of variance and Fisher post-test. The level of significance will be set at P <0.05 and all results will be expressed as means ± SEM.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

A methodology of exploratory data analysis investigating the phenomenon of orographic precipitation enhancement is proposed. The precipitation observations obtained from three Swiss Doppler weather radars are analysed for the major precipitation event of August 2005 in the Alps. Image processing techniques are used to detect significant precipitation cells/pixels from radar images while filtering out spurious effects due to ground clutter. The contribution of topography to precipitation patterns is described by an extensive set of topographical descriptors computed from the digital elevation model at multiple spatial scales. Additionally, the motion vector field is derived from subsequent radar images and integrated into a set of topographic features to highlight the slopes exposed to main flows. Following the exploratory data analysis with a recent algorithm of spectral clustering, it is shown that orographic precipitation cells are generated under specific flow and topographic conditions. Repeatability of precipitation patterns in particular spatial locations is found to be linked to specific local terrain shapes, e.g. at the top of hills and on the upwind side of the mountains. This methodology and our empirical findings for the Alpine region provide a basis for building computational data-driven models of orographic enhancement and triggering of precipitation. Copyright (C) 2011 Royal Meteorological Society .

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Casos de fraudes têm ocorrido, frequentemente no mercado mundial. Diversos são os profissionais envolvidos nesses casos, inclusive os da contabilidade. Os escândalos contabilísticos, especialmente os mais famosos, como os incidido nas empresas Enron e Wordcom, acenderam para uma maior preocupação em relação a conduta ética dos profissionais da contabilidade. Como consequência há uma maior exigência quanto a transparência e a fidedignidade das informações prestadas por estes profissionais. Esta preocupação visa, sobretudo, manter a confiança das empresas, investidores, fornece-dores e sociedade em geral, de entre outras, na responsabilidade ética do contabilista, de-negrida pelo envolvimento nas fraudes detectadas. Desta forma, o presente estudo teve como objectivo verificar a conduta ética dos contabilistas, quando, no exercício da sua profissão, depararem com questões relacionadas a fraudes. Nesse sentido considerou-se factores que podem vir a influenciar o processo decisório ético de um indivíduo, demonstrados através do modelo de tomada de decisão, desenvolvido por Alves, quanto a motivar um indivíduo a cometer uma fraude, evidenciada através do modelo desenvolvido por Cressey. Tentando responder a questão norteadora desta pesquisa, executou-se a análise descritiva e estatística dos dados. Em relação a análise descritiva, foram elaboradas tabelas de frequência. Para a análise estatística dos dados foi utilizado o teste não paramétrico de Spearman. Os resultados demonstraram que a maioria dos contabilistas, da amostra pesquisada, reconhece a questão moral inserida nos cenários, e discordam dos actos dos agentes de cada cenário, e, ainda os classificam como graves ou muito graves. A pesquisa revelou maior aproximação desses profissionais a corrente teleológica, uma vez que a intenção de agir é mais influenciada por alguns factores como a oportunidade, a racionalização e principalmente a pressão. Alguns factores individuais apresentam influências sob o posicionamento ético dos contabilistas entrevistados nesta pesquisa. Cases of fraud have occurred, in the word market. Several are involved in these cases, including the accounting class. The accounting scandals, especially the most famous, such as focusing on companies and Enron Word Com, kindled to greater concern about the ethical conduct of professional accounting. As a result there is a greater demand on the transparency and reliability of information provide by these professionals This concern is aimed, primarily, to maintain the confidence of businesses, investor, suppliers and society, among others, the ethical responsibility of the meter, denigrated, by involvement in the fraud detected. Thus, this study aimed to verify the ethical conduct of accounts in when, in the exercise of their professional activities, is confronted with issues related to fraud. This is considered some factors that can both come to influence the ethical decision making of an individual, demonstrated by the model of decision making, developed by Alves, as a motivated individual to commit a fraudulent act, developed by Cressey. Seeking to answer question, guiding this study, performed to exploratory and confirmatory analysis of data. For exploratory data analysis were made table of frequencies. For confirmatory analysis of data, were used non parametric tests of Spearman. The results showed that the majority of accountings professionals, the sample, recognizing the moral issue included in the scenarios, disagrees the acts of agents of each scenario, and also classifies such acts as serious and very serious. However, we found that these accounting professionals tend to have a position more toward the teleological theory, since the intention to act is influenced by factors as opportunity, rationalization and particularly the pressure. Some individual factors also had influence on the ethical position of the professional interviewed is this research.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The present study deals with the analysis and mapping of Swiss franc interest rates. Interest rates depend on time and maturity, defining term structure of the interest rate curves (IRC). In the present study IRC are considered in a two-dimensional feature space - time and maturity. Exploratory data analysis includes a variety of tools widely used in econophysics and geostatistics. Geostatistical models and machine learning algorithms (multilayer perceptron and Support Vector Machines) were applied to produce interest rate maps. IR maps can be used for the visualisation and pattern perception purposes, to develop and to explore economical hypotheses, to produce dynamic asset-liability simulations and for financial risk assessments. The feasibility of an application of interest rates mapping approach for the IRC forecasting is considered as well. (C) 2008 Elsevier B.V. All rights reserved.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

In the context of the evidence-based practices movement, the emphasis on computing effect sizes and combining them via meta-analysis does not preclude the demonstration of functional relations. For the latter aim, we propose to augment the visual analysis to add consistency to the decisions made on the existence of a functional relation without losing sight of the need for a methodological evaluation of what stimuli and reinforcement or punishment are used to control the behavior. Four options for quantification are reviewed, illustrated, and tested with simulated data. These quantifications include comparing the projected baseline with the actual treatment measurements, on the basis of either parametric or nonparametric statistics. The simulated data used to test the quantifications include nine data patterns in terms of the presence and type of effect and comprising ABAB and multiple baseline designs. Although none of the techniques is completely flawless in terms of detecting a functional relation only when it is present but not when it is absent, an option based on projecting split-middle trend and considering data variability as in exploratory data analysis proves to be the best performer for most data patterns. We suggest that the information on whether a functional relation has been demonstrated should be included in meta-analyses. It is also possible to use as a weight the inverse of the data variability measure used in the quantification for assessing the functional relation. We offer an easy to use code for open-source software for implementing some of the quantifications.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The objective of this work was to develop a free access exploratory data analysis software application for academic use that is easy to install and can be handled without user-level programming due to extensive use of chemometrics and its association with applications that require purchased licenses or routines. The developed software, called Chemostat, employs Hierarchical Cluster Analysis (HCA), Principal Component Analysis (PCA), intervals Principal Component Analysis (iPCA), as well as correction methods, data transformation and outlier detection. The data can be imported from the clipboard, text files, ASCII or FT-IR Perkin-Elmer “.sp” files. It generates a variety of charts and tables that allow the analysis of results that can be exported in several formats. The main features of the software were tested using midinfrared and near-infrared spectra in vegetable oils and digital images obtained from different types of commercial diesel. In order to validate the software results, the same sets of data were analyzed using Matlab© and the results in both applications matched in various combinations. In addition to the desktop version, the reuse of algorithms allowed an online version to be provided that offers a unique experience on the web. Both applications are available in English.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Molecular orbital calculations were carried out on a set of 28 non-imidazole H(3) antihistamine compounds using the Hartree-Fock method in order to investigate the possible relationships between electronic structural properties and binding affinity for H3 receptors (pK(i)). It was observed that the frontier effective-for-reaction molecular orbital (FERMO) energies were better correlated with pK(i) values than highest occupied molecular orbital (HOMO) and lowest unoccupied molecular orbital (LUMO) energy values. Exploratory data analysis through hierarchical cluster (HCA) and principal component analysis (PCA) showed a separation of the compounds in two sets, one grouping the molecules with high pK(i) values, the other gathering low pK(i) value compounds. This separation was obtained with the use of the following descriptors: FERMO energies (epsilon(FERMO)), charges derived from the electrostatic potential on the nitrogen atom (N(1)), electronic density indexes for FERMO on the N(1) atom (Sigma((FERMO))c(i)(2)). and electrophilicity (omega`). These electronic descriptors were used to construct a quantitative structure-activity relationship (QSAR) model through the partial least-squares (PLS) method with three principal components. This model generated Q(2) = 0.88 and R(2) = 0.927 values obtained from a training set and external validation of 23 and 5 molecules, respectively. After the analysis of the PLS regression equation and the values for the selected electronic descriptors, it is suggested that high values of FERMO energies and of Sigma((FERMO))c(i)(2), together with low values of electrophilicity and pronounced negative charges on N(1) appear as desirable properties for the conception of new molecules which might have high binding affinity. 2010 Elsevier Inc. All rights reserved.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

We report the results of an exploratory data analysis of the Brazilian securities lending market. The analysis is performed over the full historical data set of each individual loan offer and loan contract negotiated between January 2007 and August 2013. We give a quantitative description of volume and loan fee trends and fee dependence on asset characteristics. We also unveil new stylized facts specific to the Brazilian market on market access asymmetries between different types of investors. The emerging picture is that the Brazilian securities lending market is a complex environment with specific frictions and strong asymmetries among players. In particular, we describe a tax arbitrage operation performed by domestic mutual funds which generates a significant distortion in the data. In one such event, we estimate additional aggregate profits of 24.25 million Reais (around 10 million Dollars).