97 resultados para Data-Intensive Science
Resumo:
In this paper, we present an algorithm for cluster analysis that integrates aspects from cluster ensemble and multi-objective clustering. The algorithm is based on a Pareto-based multi-objective genetic algorithm, with a special crossover operator, which uses clustering validation measures as objective functions. The algorithm proposed can deal with data sets presenting different types of clusters, without the need of expertise in cluster analysis. its result is a concise set of partitions representing alternative trade-offs among the objective functions. We compare the results obtained with our algorithm, in the context of gene expression data sets, to those achieved with multi-objective Clustering with automatic K-determination (MOCK). the algorithm most closely related to ours. (C) 2009 Elsevier B.V. All rights reserved.
Resumo:
Most multidimensional projection techniques rely on distance (dissimilarity) information between data instances to embed high-dimensional data into a visual space. When data are endowed with Cartesian coordinates, an extra computational effort is necessary to compute the needed distances, making multidimensional projection prohibitive in applications dealing with interactivity and massive data. The novel multidimensional projection technique proposed in this work, called Part-Linear Multidimensional Projection (PLMP), has been tailored to handle multivariate data represented in Cartesian high-dimensional spaces, requiring only distance information between pairs of representative samples. This characteristic renders PLMP faster than previous methods when processing large data sets while still being competitive in terms of precision. Moreover, knowing the range of variation for data instances in the high-dimensional space, we can make PLMP a truly streaming data projection technique, a trait absent in previous methods.
Resumo:
This paper describes a novel template-based meshing approach for generating good quality quadrilateral meshes from 2D digital images. This approach builds upon an existing image-based mesh generation technique called Imeshp, which enables us to create a segmented triangle mesh from an image without the need for an image segmentation step. Our approach generates a quadrilateral mesh using an indirect scheme, which converts the segmented triangle mesh created by the initial steps of the Imesh technique into a quadrilateral one. The triangle-to-quadrilateral conversion makes use of template meshes of triangles. To ensure good element quality, the conversion step is followed by a smoothing step, which is based on a new optimization-based procedure. We show several examples of meshes generated by our approach, and present a thorough experimental evaluation of the quality of the meshes given as examples.
Resumo:
In interval-censored survival data, the event of interest is not observed exactly but is only known to occur within some time interval. Such data appear very frequently. In this paper, we are concerned only with parametric forms, and so a location-scale regression model based on the exponentiated Weibull distribution is proposed for modeling interval-censored data. We show that the proposed log-exponentiated Weibull regression model for interval-censored data represents a parametric family of models that include other regression models that are broadly used in lifetime data analysis. Assuming the use of interval-censored data, we employ a frequentist analysis, a jackknife estimator, a parametric bootstrap and a Bayesian analysis for the parameters of the proposed model. We derive the appropriate matrices for assessing local influences on the parameter estimates under different perturbation schemes and present some ways to assess global influences. Furthermore, for different parameter settings, sample sizes and censoring percentages, various simulations are performed; in addition, the empirical distribution of some modified residuals are displayed and compared with the standard normal distribution. These studies suggest that the residual analysis usually performed in normal linear regression models can be straightforwardly extended to a modified deviance residual in log-exponentiated Weibull regression models for interval-censored data. (C) 2009 Elsevier B.V. All rights reserved.
Resumo:
In this paper, we develop a flexible cure rate survival model by assuming the number of competing causes of the event of interest to follow the Conway-Maxwell Poisson distribution. This model includes as special cases some of the well-known cure rate models discussed in the literature. Next, we discuss the maximum likelihood estimation of the parameters of this cure rate survival model. Finally, we illustrate the usefulness of this model by applying it to a real cutaneous melanoma data. (C) 2009 Elsevier B.V. All rights reserved.
Resumo:
In survival analysis applications, the failure rate function may frequently present a unimodal shape. In such case, the log-normal or log-logistic distributions are used. In this paper, we shall be concerned only with parametric forms, so a location-scale regression model based on the Burr XII distribution is proposed for modeling data with a unimodal failure rate function as an alternative to the log-logistic regression model. Assuming censored data, we consider a classic analysis, a Bayesian analysis and a jackknife estimator for the parameters of the proposed model. For different parameter settings, sample sizes and censoring percentages, various simulation studies are performed and compared to the performance of the log-logistic and log-Burr XII regression models. Besides, we use sensitivity analysis to detect influential or outlying observations, and residual analysis is used to check the assumptions in the model. Finally, we analyze a real data set under log-Buff XII regression models. (C) 2008 Published by Elsevier B.V.
Resumo:
Conventional procedures employed in the modeling of viscoelastic properties of polymer rely on the determination of the polymer`s discrete relaxation spectrum from experimentally obtained data. In the past decades, several analytical regression techniques have been proposed to determine an explicit equation which describes the measured spectra. With a diverse approach, the procedure herein introduced constitutes a simulation-based computational optimization technique based on non-deterministic search method arisen from the field of evolutionary computation. Instead of comparing numerical results, this purpose of this paper is to highlight some Subtle differences between both strategies and focus on what properties of the exploited technique emerge as new possibilities for the field, In oder to illustrate this, essayed cases show how the employed technique can outperform conventional approaches in terms of fitting quality. Moreover, in some instances, it produces equivalent results With much fewer fitting parameters, which is convenient for computational simulation applications. I-lie problem formulation and the rationale of the highlighted method are herein discussed and constitute the main intended contribution. (C) 2009 Wiley Periodicals, Inc. J Appl Polym Sci 113: 122-135, 2009
Resumo:
The TCABR data analysis and acquisition system has been upgraded to support a joint research programme using remote participation technologies. The architecture of the new system uses Java language as programming environment. Since application parameters and hardware in a joint experiment are complex with a large variability of components, requirements and specification solutions need to be flexible and modular, independent from operating system and computer architecture. To describe and organize the information on all the components and the connections among them, systems are developed using the extensible Markup Language (XML) technology. The communication between clients and servers uses remote procedure call (RPC) based on the XML (RPC-XML technology). The integration among Java language, XML and RPC-XML technologies allows to develop easily a standard data and communication access layer between users and laboratories using common software libraries and Web application. The libraries allow data retrieval using the same methods for all user laboratories in the joint collaboration, and the Web application allows a simple graphical user interface (GUI) access. The TCABR tokamak team in collaboration with the IPFN (Instituto de Plasmas e Fusao Nuclear, Instituto Superior Tecnico, Universidade Tecnica de Lisboa) is implementing this remote participation technologies. The first version was tested at the Joint Experiment on TCABR (TCABRJE), a Host Laboratory Experiment, organized in cooperation with the IAEA (International Atomic Energy Agency) in the framework of the IAEA Coordinated Research Project (CRP) on ""Joint Research Using Small Tokamaks"". (C) 2010 Elsevier B.V. All rights reserved.
Resumo:
This work describes two similar methods for calculating gamma transition intensities from multidetector coincidence measurements. In the first one, applicable to experiments where the angular correlation function is explicitly fitted, the normalization parameter from this fit is used to determine the gamma transition intensities. In the second, that can be used both in angular correlation or DCO measurements, the spectra obtained for all the detector pairs are summed up, in order to get the best detection statistics possible, and the analysis of the resulting bidimensional spectrum is used to calculate the transition intensities; in this method, the summation of data corresponding to different angles minimizes the influence of the angular correlation coefficient. Both methods are then tested in the calculation of intensities for well-known transitions from a (152)Eu standard source, as well as in the calculation of intensities obtained in beta-decay experiments with (193)Os and (155)Sm sources, yielding excellent results in all these cases. (C) 2009 Elsevier B.V. All rights reserved.
Resumo:
In this paper a new parametric method to deal with discrepant experimental results is developed. The method is based on the fit of a probability density function to the data. This paper also compares the characteristics of different methods used to deduce recommended values and uncertainties from a discrepant set of experimental data. The methods are applied to the (137)Cs and (90)Sr published half-lives and special emphasis is given to the deduced confidence intervals. The obtained results are analyzed considering two fundamental properties expected from an experimental result: the probability content of confidence intervals and the statistical consistency between different recommended values. The recommended values and uncertainties for the (137)Cs and (90)Sr half-lives are 10,984 (24) days and 10,523 (70) days, respectively. (C) 2009 Elsevier B.V. All rights reserved.
Resumo:
Specific choices about how to represent complex networks can have a substantial impact on the execution time required for the respective construction and analysis of those structures. In this work we report a comparison of the effects of representing complex networks statically by adjacency matrices or dynamically by adjacency lists. Three theoretical models of complex networks are considered: two types of Erdos-Renyi as well as the Barabasi-Albert model. We investigated the effect of the different representations with respect to the construction and measurement of several topological properties (i.e. degree, clustering coefficient, shortest path length, and betweenness centrality). We found that different forms of representation generally have a substantial effect on the execution time, with the sparse representation frequently resulting in remarkably superior performance. (C) 2011 Elsevier B.V. All rights reserved.
Resumo:
A radiometric zircon age of 285.4 +/- 8.6 Ma (IDTIMS U-Pb) is reported from a tonstein layer interbedded with coal seams in the Faxinal coalfield, Rio Grande do Sul, Brazil. Calibration of palynostratigraphic data with the absolute age shows that the coal depositional interval in the southern Parana Basin is constrained to the Sakmarian. Consequently, the basal Gondwana sequence in the southern part of the basin should lie at the Carboniferous-Permian boundary, not within the Sakmarian as previously considered. The new results are significant for correlations between the Parana Basin and the Argentinian Paganzo Basin (302 +/- 6 Ma and 288 +/- 7 Ma) and with the Karoo Basin, specifically with the top of the Dwyka Tillite (302 +/- 3 Ma and 299.2 +/- 3.2 Ma) and the lowermost Ecca Group (288 +/- 3 Ma and 289.6 +/- 3.8 Ma). The evidence signifies widespread latest Carboniferous volcanic activity in western Gondwana. (C) 2007 Elsevier Ltd. All rights reserved.
Resumo:
The Ibituruna quartz-syenite was emplaced as a sill in the Ribeira-Aracuai Neoproterozoic belt (Southeastern Brazil) during the last stages of the Gondwana supercontinent amalgamation. We have measured the Anisotropy of Magnetic Susceptibility (AMS) in samples from the Ibituruna sill to unravel its magnetic fabric that is regarded as a proxy for its magmatic fabric. A large magnetic anisotropy, dominantly due to magnetite, and a consistent magnetic fabric have been determined over the entire Ibituruna massif. The magmatic foliation and lineation are strikingly parallel to the solid-state mylonitic foliation and lineation measured in the country-rock. Altogether, these observations suggest that the Ibituruna sill was emplaced during the high temperature (similar to 750 degrees C) regional deformation and was deformed before full solidification coherently with its country-rock. Unexpectedly, geochronological data suggest a rather different conclusion. LA-ICP-MS and SHRIMP ages of zircons from the Ibituruna quartz-syenite are in the range 530-535 Ma and LA-ICP-MS ages of zircons and monazites from synkinematic leucocratic veins in the country-rocks suggest a crystallization at similar to 570-580 Ma, i.e., an HT deformation >35My older than the emplacement of the Ibituruna quartz-syenite. Conclusions from the structural and the geochronological studies are therefore conflicting. A possible explanation arises from (40)Ar-(39)Ar thermochronology. We have dated amphiboles from the quartz-syenite, and amphiboles and biotites from the country-rock. Together with the ages of monazites and zircons in the country-rock, (40)Ar-(39)Ar mineral ages suggest a very low cooling rate: <3 degrees C/My between 570 and similar to 500 Ma and similar to 5 degrees C/My between 500 and 460 Ma. Assuming a protracted regional deformation consistent over tens of My, under such stable thermal conditions the fabric and microstructure of deformed rocks may remain almost unchanged even if they underwent and recorded strain pulses separated by long periods of time. This may be a characteristic of slow cooling ""hot orogens"" that rocks deformed at significantly different periods during the orogeny, but under roughly unchanged temperature conditions, may display almost indiscernible microstructure and fabric. (C) 2009 Elsevier B.V. All rights reserved.
Resumo:
We report 6 K-Ar ages and paleomagnetic data from 28 sites collected in Jurassic, Lower Cretaceous and Paleocene rocks of the Santa Marta massif, to test previous hypothesis of rotations and translations of this massif, whose rock assemblage differs from other basement-cored ranges adjacent to the Guyana margin. Three magnetic components were identified in this study. A first component has a direction parallel to the present magnetic field and was uncovered in all units (D 352, I = 25.6, k = 57.35, a95 = 5.3, N = 12). A second component was isolated in Cretaceous limestone and Jurassic volcaniclastic rocks (D = 8.8, I = 8.3, k = 24.71, a95 = 13.7, N = 6), and it was interpreted as of Early Cretaceous age. In Jurassic sites with this component, Early Cretaceous K-Ar ages obtained from this and previous studies are interpreted as reset ages. The third component was uncovered in eight sites of Jurassic volcaniclastic rocks, and its direction indicates negative shallow to moderate inclinations and northeastward declinations. K-Ar ages in these sites are of Early (196.5 +/- 4.9 Ma) to early Late Jurassic age (156.6 +/- 8.9 Ma). Due to local structural complexity and too few Cretaceous outcrops to perform a reliable unconformity test, we only used two sites with (1) K-Ar ages, (2) less structural complexity, and (3) reliable structural data for Jurassic and Cretaceous rocks. The mean direction of the Jurassic component is (D = 20.4, I = -18.2, k = 46.9, a95 = 5.1, n = 18 specimens from two sites). These paleomagnetic data support previous models of northward along-margin translations of Grenvillian-cored massifs. Additionally, clockwise vertical-axis rotation of this massif, with respect to the stable craton, is also documented; the sense of rotation is similar to that proposed for the Perija Range and other ranges of the southern Caribbean margin. More data is needed to confirm the magnitudes of rotations and translations. (C) 2009 Elsevier Ltd. All rights reserved.
Resumo:
Lycopodiopsis derbyi Renault was analyzed on the basis of compressed silicified stems from four Guadalupian outcrops of the Parana Basin (Corumbatai Formation) in the State of Sao Paulo, Southern Brazil. Dichotomous stems have been recorded, and three different branch regions related to apoxogenesis are described. The most proximal region has larger, clearly rhomboidal leaf cushions, with protruding upper edges; the intermediate transitional region also has rhombic leaf cushions, but they are smaller and less elongated than the lower in the same axis; finally, the most distal region reveals only incipient cushions, with inconspicuous infrafoliar bladders; interspersed microphylls were still attached. A well preserved branch representative of this most distal region was sectioned; it has a siphonostelic cylinder similar to that previously described for L derbyi. The cortex, however, shows new traits, such as a short portion of elongated cells between the periderm and the external cortex (or leaf cushion tissue). The stems were apparently silicified prior to their final burial but were probably not transported for long distances. Their final burial may have taken place during storm events, which were common during the deposition of the Corumbatai Formation. These stems are commonly deformed due to compression, mainly because the internal cortical portions rapidly decayed prior to silicification due to their thin-walled tissue, and are therefore not preserved. The common alkalinity of a shallow marine environment such as that in which the Corumbatai Formation was deposited, should mobilize the silica and favors petrifaction. Based on the new data, an emended diagnosis is proposed and a modification of the identification key published by Thomas and Meyen in 1984 for Upper Paleozoic Lycopsida is suggested. (C) 2009 Elsevier B.V. All rights reserved.