150 resultados para Maximum Degree Proximity algorithm (MAX-DPA)
Resumo:
We study the star/galaxy classification efficiency of 13 different decision tree algorithms applied to photometric objects in the Sloan Digital Sky Survey Data Release Seven (SDSS-DR7). Each algorithm is defined by a set of parameters which, when varied, produce different final classification trees. We extensively explore the parameter space of each algorithm, using the set of 884,126 SDSS objects with spectroscopic data as the training set. The efficiency of star-galaxy separation is measured using the completeness function. We find that the Functional Tree algorithm (FT) yields the best results as measured by the mean completeness in two magnitude intervals: 14 <= r <= 21 (85.2%) and r >= 19 (82.1%). We compare the performance of the tree generated with the optimal FT configuration to the classifications provided by the SDSS parametric classifier, 2DPHOT, and Ball et al. We find that our FT classifier is comparable to or better in completeness over the full magnitude range 15 <= r <= 21, with much lower contamination than all but the Ball et al. classifier. At the faintest magnitudes (r > 19), our classifier is the only one that maintains high completeness (> 80%) while simultaneously achieving low contamination (similar to 2.5%). We also examine the SDSS parametric classifier (psfMag - modelMag) to see if the dividing line between stars and galaxies can be adjusted to improve the classifier. We find that currently stars in close pairs are often misclassified as galaxies, and suggest a new cut to improve the classifier. Finally, we apply our FT classifier to separate stars from galaxies in the full set of 69,545,326 SDSS photometric objects in the magnitude range 14 <= r <= 21.
Resumo:
Background: The malaria parasite Plasmodium falciparum exhibits abundant genetic diversity, and this diversity is key to its success as a pathogen. Previous efforts to study genetic diversity in P. falciparum have begun to elucidate the demographic history of the species, as well as patterns of population structure and patterns of linkage disequilibrium within its genome. Such studies will be greatly enhanced by new genomic tools and recent large-scale efforts to map genomic variation. To that end, we have developed a high throughput single nucleotide polymorphism (SNP) genotyping platform for P. falciparum. Results: Using an Affymetrix 3,000 SNP assay array, we found roughly half the assays (1,638) yielded high quality, 100% accurate genotyping calls for both major and minor SNP alleles. Genotype data from 76 global isolates confirm significant genetic differentiation among continental populations and varying levels of SNP diversity and linkage disequilibrium according to geographic location and local epidemiological factors. We further discovered that nonsynonymous and silent (synonymous or noncoding) SNPs differ with respect to within-population diversity, interpopulation differentiation, and the degree to which allele frequencies are correlated between populations. Conclusions: The distinct population profile of nonsynonymous variants indicates that natural selection has a significant influence on genomic diversity in P. falciparum, and that many of these changes may reflect functional variants deserving of follow-up study. Our analysis demonstrates the potential for new high-throughput genotyping technologies to enhance studies of population structure, natural selection, and ultimately enable genome-wide association studies in P. falciparum to find genes underlying key phenotypic traits.
Resumo:
In this paper, we initially present an algorithm for automatic composition of melodies using chaotic dynamical systems. Afterward, we characterize chaotic music in a comprehensive way as comprising three perspectives: musical discrimination, dynamical influence on musical features, and musical perception. With respect to the first perspective, the coherence between generated chaotic melodies (continuous as well as discrete chaotic melodies) and a set of classical reference melodies is characterized by statistical descriptors and melodic measures. The significant differences among the three types of melodies are determined by discriminant analysis. Regarding the second perspective, the influence of dynamical features of chaotic attractors, e.g., Lyapunov exponent, Hurst coefficient, and correlation dimension, on melodic features is determined by canonical correlation analysis. The last perspective is related to perception of originality, complexity, and degree of melodiousness (Euler's gradus suavitatis) of chaotic and classical melodies by nonparametric statistical tests. (c) 2010 American Institute of Physics. [doi: 10.1063/1.3487516]
Resumo:
A great part of the interest in complex networks has been motivated by the presence of structured, frequently nonuniform, connectivity. Because diverse connectivity patterns tend to result in distinct network dynamics, and also because they provide the means to identify and classify several types of complex network, it becomes important to obtain meaningful measurements of the local network topology. In addition to traditional features such as the node degree, clustering coefficient, and shortest path, motifs have been introduced in the literature in order to provide complementary descriptions of the network connectivity. The current work proposes a different type of motif, namely, chains of nodes, that is, sequences of connected nodes with degree 2. These chains have been subdivided into cords, tails, rings, and handles, depending on the type of their extremities (e.g., open or connected). A theoretical analysis of the density of such motifs in random and scale-free networks is described, and an algorithm for identifying these motifs in general networks is presented. The potential of considering chains for network characterization has been illustrated with respect to five categories of real-world networks including 16 cases. Several interesting findings were obtained, including the fact that several chains were observed in real-world networks, especially the world wide web, books, and the power grid. The possibility of chains resulting from incompletely sampled networks is also investigated.
Resumo:
We describe an estimation technique for biomass burning emissions in South America based on a combination of remote-sensing fire products and field observations, the Brazilian Biomass Burning Emission Model (3BEM). For each fire pixel detected by remote sensing, the mass of the emitted tracer is calculated based on field observations of fire properties related to the type of vegetation burning. The burnt area is estimated from the instantaneous fire size retrieved by remote sensing, when available, or from statistical properties of the burn scars. The sources are then spatially and temporally distributed and assimilated daily by the Coupled Aerosol and Tracer Transport model to the Brazilian developments on the Regional Atmospheric Modeling System (CATT-BRAMS) in order to perform the prognosis of related tracer concentrations. Three other biomass burning inventories, including GFEDv2 and EDGAR, are simultaneously used to compare the emission strength in terms of the resultant tracer distribution. We also assess the effect of using the daily time resolution of fire emissions by including runs with monthly-averaged emissions. We evaluate the performance of the model using the different emission estimation techniques by comparing the model results with direct measurements of carbon monoxide both near-surface and airborne, as well as remote sensing derived products. The model results obtained using the 3BEM methodology of estimation introduced in this paper show relatively good agreement with the direct measurements and MOPITT data product, suggesting the reliability of the model at local to regional scales.
Resumo:
As a contribution to the Large-Scale Biosphere-Atmosphere Experiment in Amazonia - Cooperative LBA Airborne Regional Experiment (LBA-CLAIRE-2001) field campaign in the heart of the Amazon Basin, we analyzed the temporal and spatial dynamics of the urban plume of Manaus City during the wet-to-dry season transition period in July 2001. During the flights, we performed vertical stacks of crosswind transects in the urban outflow downwind of Manaus City, measuring a comprehensive set of trace constituents including O(3), NO, NO(2), CO, VOC, CO(2), and H(2)O. Aerosol loads were characterized by concentrations of total aerosol number (CN) and cloud condensation nuclei (CCN), and by light scattering properties. Measurements over pristine rainforest areas during the campaign showed low levels of pollution from biomass burning or industrial emissions, representative of wet season background conditions. The urban plume of Manaus City was found to be joined by plumes from power plants south of the city, all showing evidence of very strong photochemical ozone formation. One episode is discussed in detail, where a threefold increase in ozone mixing ratios within the atmospheric boundary layer occurred within a 100 km travel distance downwind of Manaus. Observation-based estimates of the ozone production rates in the plume reached 15 ppb h(-1). Within the plume core, aerosol concentrations were strongly enhanced, with Delta CN/Delta CO ratios about one order of magnitude higher than observed in Amazon biomass burning plumes. Delta CN/Delta CO ratios tended to decrease with increasing transport time, indicative of a significant reduction in particle number by coagulation, and without substantial new particle nucleation occurring within the time/space observed. While in the background atmosphere a large fraction of the total particle number served as CCN (about 60-80% at 0.6% supersaturation), the CCN/CN ratios within the plume indicated that only a small fraction (16 +/- 12 %) of the plume particles were CCN. The fresh plume aerosols showed relatively weak light scattering efficiency. The CO-normalized CCN concentrations and light scattering coefficients increased with plume age in most cases, suggesting particle growth by condensation of soluble organic or inorganic species. We used a Single Column Chemistry and Transport Model (SCM) to infer the urban pollution emission fluxes of Manaus City, implying observed mixing ratios of CO, NO(x) and VOC. The model can reproduce the temporal/spatial distribution of ozone enhancements in the Manaus plume, both with and without accounting for the distinct (high NO(x)) contribution by the power plants; this way examining the sensitivity of ozone production to changes in the emission rates of NO(x). The VOC reactivity in the Manaus region was dominated by a high burden of biogenic isoprene from the background rainforest atmosphere, and therefore NO(x) control is assumed to be the most effective ozone abatement strategy. Both observations and models show that the agglomeration of NO(x) emission sources, like power plants, in a well-arranged area can decrease the ozone production efficiency in the near field of the urban populated cores. But on the other hand remote areas downwind of the city then bear the brunt, being exposed to increased ozone production and N-deposition. The simulated maximum stomatal ozone uptake fluxes were 4 nmol m(-2) s(-1) close to Manaus, and decreased only to about 2 nmol m(-2) s(-1) within a travel distance >1500 km downwind from Manaus, clearly exceeding the critical threshold level for broadleaf trees. Likewise, the simulated N deposition close to Manaus was similar to 70 kg N ha(-1) a(-1) decreasing only to about 30 kg N ha(-1) a(-1) after three days of simulation.
Resumo:
Aerosol samples were collected at a pasture site in the Amazon Basin as part of the project LBA-SMOCC-2002 (Large-Scale Biosphere-Atmosphere Experiment in Amazonia - Smoke Aerosols, Clouds, Rainfall and Climate: Aerosols from Biomass Burning Perturb Global and Regional Climate). Sampling was conducted during the late dry season, when the aerosol composition was dominated by biomass burning emissions, especially in the submicron fraction. A 13-stage Dekati low-pressure impactor (DLPI) was used to collect particles with nominal aerodynamic diameters (D(p)) ranging from 0.03 to 0.10 mu m. Gravimetric analyses of the DLPI substrates and filters were performed to obtain aerosol mass concentrations. The concentrations of total, apparent elemental, and organic carbon (TC, EC(a), and OC) were determined using thermal and thermal-optical analysis (TOA) methods. A light transmission method (LTM) was used to determine the concentration of equivalent black carbon (BC(e)) or the absorbing fraction at 880 nm for the size-resolved samples. During the dry period, due to the pervasive presence of fires in the region upwind of the sampling site, concentrations of fine aerosols (D(p) < 2.5 mu m: average 59.8 mu g m(-3)) were higher than coarse aerosols (D(p) > 2.5 mu m: 4.1 mu g m(-3)). Carbonaceous matter, estimated as the sum of the particulate organic matter (i.e., OC x 1.8) plus BC(e), comprised more than 90% to the total aerosol mass. Concentrations of EC(a) (estimated by thermal analysis with a correction for charring) and BC(e) (estimated by LTM) averaged 5.2 +/- 1.3 and 3.1 +/- 0.8 mu g m(-3), respectively. The determination of EC was improved by extracting water-soluble organic material from the samples, which reduced the average light absorption Angstrom exponent of particles in the size range of 0.1 to 1.0 mu m from >2.0 to approximately 1.2. The size-resolved BC(e) measured by the LTM showed a clear maximum between 0.4 and 0.6 mu m in diameter. The concentrations of OC and BC(e) varied diurnally during the dry period, and this variation is related to diurnal changes in boundary layer thickness and in fire frequency.
Resumo:
We construct an invisible quantum barrier which represents the phenomenon of quantum reflection using available data on atom-wall and Bose-Einstein-condensate-wall reflection. We use the Abel equation to invert the data. The resulting invisible quantum barrier is double valued in both axes. We study this invisible barrier in the case of atom and Bose-Einstein condensate (BEC) reflection from a solid silicon surface. A time-dependent, one-spatial-dimension Gross-Pitaevskii equation is solved for the BEC case. We found that the BEC behaves very similarly to the single atom except for size effects, which manifest themselves in a maximum in the reflectivity at small distances from the wall. The effect of the atom-atom interaction on the BEC reflection and correspondingly on the invisible barrier is found to be appreciable at low velocities and comparable to the finite-size effect. The trapping of an ultracold atom or BEC between two walls is discussed.
Resumo:
We investigate a conjecture on the cover times of planar graphs by means of large Monte Carlo simulations. The conjecture states that the cover time tau (G(N)) of a planar graph G(N) of N vertices and maximal degree d is lower bounded by tau (G(N)) >= C(d)N(lnN)(2) with C(d) = (d/4 pi) tan(pi/d), with equality holding for some geometries. We tested this conjecture on the regular honeycomb (d = 3), regular square (d = 4), regular elongated triangular (d = 5), and regular triangular (d = 6) lattices, as well as on the nonregular Union Jack lattice (d(min) = 4, d(max) = 8). Indeed, the Monte Carlo data suggest that the rigorous lower bound may hold as an equality for most of these lattices, with an interesting issue in the case of the Union Jack lattice. The data for the honeycomb lattice, however, violate the bound with the conjectured constant. The empirical probability distribution function of the cover time for the square lattice is also briefly presented, since very little is known about cover time probability distribution functions in general.
Resumo:
The photoluminescence (PL) technique as a function of temperature and excitation intensity was used to study the optical properties of multiquantum wells (MQWs) of GaAs/Al(x)Ga(1-x)As grown by molecular beam epitaxy on GaAs substrates oriented in the [100], [311]A, and [311]B directions. The asymmetry presented by the PL spectra of the MQWs with an apparent exponential tail in the lower-energy side and the unusual behavior of the PL peak energy versus temperature (blueshift) at low temperatures are explained by the exciton localization in the confinement potential fluctuations of the heterostructures. The PL peak energy dependence with temperature was fitted by the expression proposed by Passler [Phys. Status Solidi B 200, 155 (1997)] by subtracting the term sigma(2)(E)/k(B)T, which considers the presence of potential fluctuations. It can be verified from the PL line shape, the full width at half maximum of PL spectra, the sigma(E) values obtained from the adjustment of experimental points, and the blueshift maximum values that the samples grown in the [311]A/B directions have higher potential fluctuation amplitude than the sample grown in the [100] direction. This indicates a higher degree of the superficial corrugations for the MQWs grown in the [311] direction. (C) 2008 American Institute of Physics.
Resumo:
A density-functional formalism for superconductivity and magnetism is presented. The resulting relations unify previously derived Kohn-Sham equations for superconductors and for noncollinear magnetism. The formalism, which discriminates Cooper-pair singlets from triplets, is applied to two quantum liquids coupled by tunneling through a barrier. An exact expression is derived, relating the eigenstates and eigenvalues of the Kohn-Sham equations, unperturbed by tunneling, on one side of the barrier to the proximity-induced ordering potential on the other.
Resumo:
A planar k-restricted structure is a simple graph whose blocks are planar and each has at most k vertices. Planar k-restricted structures are used by approximation algorithms for Maximum Weight Planar Subgraph, which motivates this work. The planar k-restricted ratio is the infimum, over simple planar graphs H, of the ratio of the number of edges in a maximum k-restricted structure subgraph of H to the number edges of H. We prove that, as k tends to infinity, the planar k-restricted ratio tends to 1/2. The same result holds for the weighted version. Our results are based on analyzing the analogous ratios for outerplanar and weighted outerplanar graphs. Here both ratios tend to 1 as k goes to infinity, and we provide good estimates of the rates of convergence, showing that they differ in the weighted from the unweighted case.
Resumo:
Context tree models have been introduced by Rissanen in [25] as a parsimonious generalization of Markov models. Since then, they have been widely used in applied probability and statistics. The present paper investigates non-asymptotic properties of two popular procedures of context tree estimation: Rissanen's algorithm Context and penalized maximum likelihood. First showing how they are related, we prove finite horizon bounds for the probability of over- and under-estimation. Concerning overestimation, no boundedness or loss-of-memory conditions are required: the proof relies on new deviation inequalities for empirical probabilities of independent interest. The under-estimation properties rely on classical hypotheses for processes of infinite memory. These results improve on and generalize the bounds obtained in Duarte et al. (2006) [12], Galves et al. (2008) [18], Galves and Leonardi (2008) [17], Leonardi (2010) [22], refining asymptotic results of Buhlmann and Wyner (1999) [4] and Csiszar and Talata (2006) [9]. (C) 2011 Elsevier B.V. All rights reserved.
Resumo:
Efficient automatic protein classification is of central importance in genomic annotation. As an independent way to check the reliability of the classification, we propose a statistical approach to test if two sets of protein domain sequences coming from two families of the Pfam database are significantly different. We model protein sequences as realizations of Variable Length Markov Chains (VLMC) and we use the context trees as a signature of each protein family. Our approach is based on a Kolmogorov-Smirnov-type goodness-of-fit test proposed by Balding et at. [Limit theorems for sequences of random trees (2008), DOI: 10.1007/s11749-008-0092-z]. The test statistic is a supremum over the space of trees of a function of the two samples; its computation grows, in principle, exponentially fast with the maximal number of nodes of the potential trees. We show how to transform this problem into a max-flow over a related graph which can be solved using a Ford-Fulkerson algorithm in polynomial time on that number. We apply the test to 10 randomly chosen protein domain families from the seed of Pfam-A database (high quality, manually curated families). The test shows that the distributions of context trees coming from different families are significantly different. We emphasize that this is a novel mathematical approach to validate the automatic clustering of sequences in any context. We also study the performance of the test via simulations on Galton-Watson related processes.
Resumo:
Changes in the oxygen isotopic composition of the planktonic foraminifer Globigerinoides ruber and in the foraminifera faunal composition in a core retrieved from the southeastern Brazilian continental margin were used to infer past changes in the hydrological balance and monsoon precipitation in the western South Atlantic since the Last Glacial Maximum (LGM). The results suggest a first-order orbital (precessional) control on the South American Monsoon precipitation. This agrees with previous studies based on continental proxies except for LGM estimates provided by pollen records. The causes for this disagreement are discussed.