3 resultados para Large Data
em Biblioteca Digital da Produção Intelectual da Universidade de São Paulo (BDPI/USP)
Resumo:
Astronomy has evolved almost exclusively by the use of spectroscopic and imaging techniques, operated separately. With the development of modern technologies, it is possible to obtain data cubes in which one combines both techniques simultaneously, producing images with spectral resolution. To extract information from them can be quite complex, and hence the development of new methods of data analysis is desirable. We present a method of analysis of data cube (data from single field observations, containing two spatial and one spectral dimension) that uses Principal Component Analysis (PCA) to express the data in the form of reduced dimensionality, facilitating efficient information extraction from very large data sets. PCA transforms the system of correlated coordinates into a system of uncorrelated coordinates ordered by principal components of decreasing variance. The new coordinates are referred to as eigenvectors, and the projections of the data on to these coordinates produce images we will call tomograms. The association of the tomograms (images) to eigenvectors (spectra) is important for the interpretation of both. The eigenvectors are mutually orthogonal, and this information is fundamental for their handling and interpretation. When the data cube shows objects that present uncorrelated physical phenomena, the eigenvector`s orthogonality may be instrumental in separating and identifying them. By handling eigenvectors and tomograms, one can enhance features, extract noise, compress data, extract spectra, etc. We applied the method, for illustration purpose only, to the central region of the low ionization nuclear emission region (LINER) galaxy NGC 4736, and demonstrate that it has a type 1 active nucleus, not known before. Furthermore, we show that it is displaced from the centre of its stellar bulge.
Resumo:
Most multidimensional projection techniques rely on distance (dissimilarity) information between data instances to embed high-dimensional data into a visual space. When data are endowed with Cartesian coordinates, an extra computational effort is necessary to compute the needed distances, making multidimensional projection prohibitive in applications dealing with interactivity and massive data. The novel multidimensional projection technique proposed in this work, called Part-Linear Multidimensional Projection (PLMP), has been tailored to handle multivariate data represented in Cartesian high-dimensional spaces, requiring only distance information between pairs of representative samples. This characteristic renders PLMP faster than previous methods when processing large data sets while still being competitive in terms of precision. Moreover, knowing the range of variation for data instances in the high-dimensional space, we can make PLMP a truly streaming data projection technique, a trait absent in previous methods.
Resumo:
A new method to measure the epicycle frequency kappa in the Galactic disc is presented. We make use of the large data base on open clusters completed by our group to derive the observed velocity vector (amplitude and direction) of the clusters in the Galactic plane. In the epicycle approximation, this velocity is equal to the circular velocity given by the rotation curve, plus a residual or perturbation velocity, of which the direction rotates as a function of time with the frequency kappa. Due to the non-random direction of the perturbation velocity at the birth time of the clusters, a plot of the present-day direction angle of this velocity as a function of the age of the clusters reveals systematic trends from which the epicycle frequency can be obtained. Our analysis considers that the Galactic potential is mainly axis-symmetric, or in other words, that the effect of the spiral arms on the Galactic orbits is small; in this sense, our results do not depend on any specific model of the spiral structure. The values of kappa that we obtain provide constraints on the rotation velocity of the in particular, V(0) is found to be 230 +/- 15 km s(-1) even if the scale (R(0) = 7.5 kpc) of the Galaxy is adopted. The measured kappa at the solar radius is 43 +/- 5 km s(-1) kpc(-1). The distribution of initial velocities of open clusters is discussed.