905 resultados para Classification Methods
Resumo:
This paper presents a new statistical algorithm to estimate rainfall over the Amazon Basin region using the Tropical Rainfall Measuring Mission (TRMM) Microwave Imager (TMI). The algorithm relies on empirical relationships derived for different raining-type systems between coincident measurements of surface rainfall rate and 85-GHz polarization-corrected brightness temperature as observed by the precipitation radar (PR) and TMI on board the TRMM satellite. The scheme includes rain/no-rain area delineation (screening) and system-type classification routines for rain retrieval. The algorithm is validated against independent measurements of the TRMM-PR and S-band dual-polarization Doppler radar (S-Pol) surface rainfall data for two different periods. Moreover, the performance of this rainfall estimation technique is evaluated against well-known methods, namely, the TRMM-2A12 [ the Goddard profiling algorithm (GPROF)], the Goddard scattering algorithm (GSCAT), and the National Environmental Satellite, Data, and Information Service (NESDIS) algorithms. The proposed algorithm shows a normalized bias of approximately 23% for both PR and S-Pol ground truth datasets and a mean error of 0.244 mm h(-1) ( PR) and -0.157 mm h(-1)(S-Pol). For rain volume estimates using PR as reference, a correlation coefficient of 0.939 and a normalized bias of 0.039 were found. With respect to rainfall distributions and rain area comparisons, the results showed that the formulation proposed is efficient and compatible with the physics and dynamics of the observed systems over the area of interest. The performance of the other algorithms showed that GSCAT presented low normalized bias for rain areas and rain volume [0.346 ( PR) and 0.361 (S-Pol)], and GPROF showed rainfall distribution similar to that of the PR and S-Pol but with a bimodal distribution. Last, the five algorithms were evaluated during the TRMM-Large-Scale Biosphere-Atmosphere Experiment in Amazonia (LBA) 1999 field campaign to verify the precipitation characteristics observed during the easterly and westerly Amazon wind flow regimes. The proposed algorithm presented a cumulative rainfall distribution similar to the observations during the easterly regime, but it underestimated for the westerly period for rainfall rates above 5 mm h(-1). NESDIS(1) overestimated for both wind regimes but presented the best westerly representation. NESDIS(2), GSCAT, and GPROF underestimated in both regimes, but GPROF was closer to the observations during the easterly flow.
Resumo:
Context. It is debated whether the Milky Way bulge has characteristics more similar to those of a classical bulge than those of a pseudobulge. Detailed abundance studies of bulge stars are important when investigating the origin, history, and classification of the bulge. These studies provide constraints on the star-formation history, initial mass function, and differences between stellar populations. Not many similar studies have been completed because of the large distance and high variable visual extinction along the line-of-sight towards the bulge. Therefore, near-IR investigations can provide superior results. Aims. To investigate the origin of the bulge and study its chemical abundances determined from near-IR spectra for bulge giants that have already been investigated with optical spectra. The optical spectra also provide the stellar parameters that are very important to the present study. In particular, the important CNO elements are determined more accurately in the near-IR. Oxygen and other alpha elements are important for investigating the star-formation history. The C and N abundances are important for determining the evolutionary stage of the giants and the origin of C in the bulge. Methods. High-resolution, near-infrared spectra in the H band were recorded using the CRIRES spectrometer mounted on the Very Large Telescope. The CNO abundances are determined from the numerous molecular lines in the wavelength range observed. Abundances of the alpha elements Si, S, and Ti are also determined from the near-IR spectra. Results. The abundance ratios [O/Fe], [Si/Fe], and [S/Fe] are enhanced to metallicities of at least [Fe/H] = -0.3, after which they decline. This suggests that the Milky Way bulge experienced a rapid and early burst of star formation similar to that of a classical bulge. However, a similarity between the bulge trend and the trend of the local thick disk seems to be present. This similarity suggests that the bulge could have had a pseudobulge origin. The C and N abundances suggest that our giants are first-ascent red-giants or clump stars, and that the measured oxygen abundances are those with which the stars were born. Our [C/Fe] trend does not show any increase with [Fe/H], which is expected if W-R stars contributed substantially to the C abundances. No ""cosmic scatter"" can be traced around our observed abundance trends: the measured scatter is expected, given the observational uncertainties.
Resumo:
Aims. To detect line effects using spectropolarimetry in order to find evidence of rotating disks and their respective symmetry axes in T Tauri stars. Methods. We used the IAGPOL imaging polarimeter along with the Eucalyptus-IFU to obtain spectropolarimetric measurements of the T Tauri stars RY Tau (two epochs) and PX Vul (one epoch). Evidence of line effects showing a loop in the Q-U diagram favors a compact rather than an extended source for the line photons in a rotating disk. In addition, the polarization position angle (PA) obtained using the line effect can constrain the symmetry axis of the disk. Results. RY Tau shows a variable H alpha double peak in 2004-2005 data. A polarization line effect is evident in the Q-U diagram for both epochs confirming a clockwise rotating disk. A single loop is evident in 2004 changing to a linear excursion plus a loop in 2005. Interestingly, the intrinsic PA calculated using the line effect is consistent between our two epochs (similar to 167 degrees). An alternative intrinsic PA computed from the interstellar polarization-corrected continuum and averaged between 2001-2005 yielded a PA similar to 137 degrees. This last value is closer to perpendicular to the observed disk direction (similar to 25 degrees), as expected from single scattering in an optically thin disk. For PX Vul, we detected spectral variability in H alpha along with non-variable continuum polarization when compared with previous data. The Q-U diagram shows a well-defined loop in H alpha associated with a counter-clockwise rotating disk. The symmetry axis inferred from the line effect has a PA similar to 91 degrees (with an ambiguity of 90 degrees). Our results confirm previous evidence that the emission line in T Tauri stars has its origin in a compact source scattered off a rotating accretion disk.
Resumo:
We study the star/galaxy classification efficiency of 13 different decision tree algorithms applied to photometric objects in the Sloan Digital Sky Survey Data Release Seven (SDSS-DR7). Each algorithm is defined by a set of parameters which, when varied, produce different final classification trees. We extensively explore the parameter space of each algorithm, using the set of 884,126 SDSS objects with spectroscopic data as the training set. The efficiency of star-galaxy separation is measured using the completeness function. We find that the Functional Tree algorithm (FT) yields the best results as measured by the mean completeness in two magnitude intervals: 14 <= r <= 21 (85.2%) and r >= 19 (82.1%). We compare the performance of the tree generated with the optimal FT configuration to the classifications provided by the SDSS parametric classifier, 2DPHOT, and Ball et al. We find that our FT classifier is comparable to or better in completeness over the full magnitude range 15 <= r <= 21, with much lower contamination than all but the Ball et al. classifier. At the faintest magnitudes (r > 19), our classifier is the only one that maintains high completeness (> 80%) while simultaneously achieving low contamination (similar to 2.5%). We also examine the SDSS parametric classifier (psfMag - modelMag) to see if the dividing line between stars and galaxies can be adjusted to improve the classifier. We find that currently stars in close pairs are often misclassified as galaxies, and suggest a new cut to improve the classifier. Finally, we apply our FT classifier to separate stars from galaxies in the full set of 69,545,326 SDSS photometric objects in the magnitude range 14 <= r <= 21.
Resumo:
Aerosol samples were collected at a pasture site in the Amazon Basin as part of the project LBA-SMOCC-2002 (Large-Scale Biosphere-Atmosphere Experiment in Amazonia - Smoke Aerosols, Clouds, Rainfall and Climate: Aerosols from Biomass Burning Perturb Global and Regional Climate). Sampling was conducted during the late dry season, when the aerosol composition was dominated by biomass burning emissions, especially in the submicron fraction. A 13-stage Dekati low-pressure impactor (DLPI) was used to collect particles with nominal aerodynamic diameters (D(p)) ranging from 0.03 to 0.10 mu m. Gravimetric analyses of the DLPI substrates and filters were performed to obtain aerosol mass concentrations. The concentrations of total, apparent elemental, and organic carbon (TC, EC(a), and OC) were determined using thermal and thermal-optical analysis (TOA) methods. A light transmission method (LTM) was used to determine the concentration of equivalent black carbon (BC(e)) or the absorbing fraction at 880 nm for the size-resolved samples. During the dry period, due to the pervasive presence of fires in the region upwind of the sampling site, concentrations of fine aerosols (D(p) < 2.5 mu m: average 59.8 mu g m(-3)) were higher than coarse aerosols (D(p) > 2.5 mu m: 4.1 mu g m(-3)). Carbonaceous matter, estimated as the sum of the particulate organic matter (i.e., OC x 1.8) plus BC(e), comprised more than 90% to the total aerosol mass. Concentrations of EC(a) (estimated by thermal analysis with a correction for charring) and BC(e) (estimated by LTM) averaged 5.2 +/- 1.3 and 3.1 +/- 0.8 mu g m(-3), respectively. The determination of EC was improved by extracting water-soluble organic material from the samples, which reduced the average light absorption Angstrom exponent of particles in the size range of 0.1 to 1.0 mu m from >2.0 to approximately 1.2. The size-resolved BC(e) measured by the LTM showed a clear maximum between 0.4 and 0.6 mu m in diameter. The concentrations of OC and BC(e) varied diurnally during the dry period, and this variation is related to diurnal changes in boundary layer thickness and in fire frequency.
Resumo:
Multispectral widefield optical imaging has the potential to improve early detection of oral cancer. The appropriate selection of illumination and collection conditions is required to maximize diagnostic ability. The goals of this study were to (i) evaluate image contrast between oral cancer/precancer and non-neoplastic mucosa for a variety of imaging modalities and illumination/collection conditions, and (ii) use classification algorithms to evaluate and compare the diagnostic utility of these modalities to discriminate cancers and precancers from normal tissue. Narrowband reflectance, autofluorescence, and polarized reflectance images were obtained from 61 patients and 11 normal volunteers. Image contrast was compared to identify modalities and conditions yielding greatest contrast. Image features were extracted and used to train and evaluate classification algorithms to discriminate tissue as non-neoplastic, dysplastic, or cancer; results were compared to histologic diagnosis. Autofluorescence imaging at 405-nm excitation provided the greatest image contrast, and the ratio of red-to-green fluorescence intensity computed from these images provided the best classification of dysplasia/cancer versus non-neoplastic tissue. A sensitivity of 100% and a specificity of 85% were achieved in the validation set. Multispectral widefield images can accurately distinguish neoplastic and non-neoplastic tissue; however, the ability to separate precancerous lesions from cancers with this technique was limited. (C) 2010 Society of Photo-Optical Instrumentation Engineers. [DOI: 10.1117/1.3516593]
Resumo:
Efficient automatic protein classification is of central importance in genomic annotation. As an independent way to check the reliability of the classification, we propose a statistical approach to test if two sets of protein domain sequences coming from two families of the Pfam database are significantly different. We model protein sequences as realizations of Variable Length Markov Chains (VLMC) and we use the context trees as a signature of each protein family. Our approach is based on a Kolmogorov-Smirnov-type goodness-of-fit test proposed by Balding et at. [Limit theorems for sequences of random trees (2008), DOI: 10.1007/s11749-008-0092-z]. The test statistic is a supremum over the space of trees of a function of the two samples; its computation grows, in principle, exponentially fast with the maximal number of nodes of the potential trees. We show how to transform this problem into a max-flow over a related graph which can be solved using a Ford-Fulkerson algorithm in polynomial time on that number. We apply the test to 10 randomly chosen protein domain families from the seed of Pfam-A database (high quality, manually curated families). The test shows that the distributions of context trees coming from different families are significantly different. We emphasize that this is a novel mathematical approach to validate the automatic clustering of sequences in any context. We also study the performance of the test via simulations on Galton-Watson related processes.
Resumo:
Background -: Beta-2 adrenergic receptor gene polymorphisms Gln27Glu, Arg16Gly and Thr164Ile were suggested to have an effect in heart failure. We evaluated these polymorphisms relative to clinical characteristics and prognosis of alarge cohort of patients with heart failure of different etiologies. Methods -: We studied 501 patients with heart failure of different etiologies. Mean age was 58 years (standard deviation 14.4 years), 298 (60%) were men. Polymorphisms were identified by polymerase chain reaction-restriction fragment length polymorphism. Results -: During the mean follow-up of 12.6 months (standard deviation 10.3 months), 188 (38%) patients died. Distribution of genotypes of polymorphism Arg16Gly was different relative to body mass index (chi(2) = 9.797; p = 0.04). Overall the probability of survival was not significantly predicted by genotypes of Gln27Glu, Arg16Gly, or Thr164Ile. Allele and haplotype analysis also did not disclose any significant difference regarding mortality. Exploratory analysis through classification trees pointed towards a potential association between the Gln27Glu polymorphism and mortality in older individuals. Conclusion -: In this study sample, we were not able to demonstrate an overall influence of polymorphisms Gln27Glu and Arg16Gly of beta-2 receptor gene on prognosis. Nevertheless, Gln27Glu polymorphism may have a potential predictive value in older individuals.
Resumo:
The problem of semialgebraic Lipschitz classification of quasihomogeneous polynomials on a Holder triangle is studied. For this problem, the ""moduli"" are described completely in certain combinatorial terms.
Resumo:
We study induced modules of nonzero central charge with arbitrary multiplicities over affine Lie algebras. For a given pseudo parabolic subalgebra P of an affine Lie algebra G, our main result establishes the equivalence between a certain category of P-induced G-modules and the category of weight P-modules with injective action of the central element of G. In particular, the induction functor preserves irreducible modules. If P is a parabolic subalgebra with a finite-dimensional Levi factor then it defines a unique pseudo parabolic subalgebra P(ps), P subset of P(ps). The structure of P-induced modules in this case is fully determined by the structure of P(ps)-induced modules. These results generalize similar reductions in particular cases previously considered by V. Futorny, S. Konig, V. Mazorchuk [Forum Math. 13 (2001), 641-661], B. Cox [Pacific J. Math. 165 (1994), 269-294] and I. Dimitrov, V. Futorny, I. Penkov [Comm. Math. Phys. 250 (2004), 47-63].
Resumo:
Background: Mutations in TP53 are common events during carcinogenesis. In addition to gene mutations, several reports have focused on TP53 polymorphisms as risk factors for malignant disease. Many studies have highlighted that the status of the TP53 codon 72 polymorphism could influence cancer susceptibility. However, the results have been inconsistent and various methodological features can contribute to departures from Hardy-Weinberg equilibrium, a condition that may influence the disease risk estimates. The most widely accepted method of detecting genotyping error is to confirm genotypes by sequencing and/or via a separate method. Results: We developed two new genotyping methods for TP53 codon 72 polymorphism detection: Denaturing High Performance Liquid Chromatography (DHPLC) and Dot Blot hybridization. These methods were compared with Restriction Fragment Length Polymorphism (RFLP) using two different restriction enzymes. We observed high agreement among all methodologies assayed. Dot-blot hybridization and DHPLC results were more highly concordant with each other than when either of these methods was compared with RFLP. Conclusions: Although variations may occur, our results indicate that DHPLC and Dot Blot hybridization can be used as reliable screening methods for TP53 codon 72 polymorphism detection, especially in molecular epidemiologic studies, where high throughput methodologies are required.
Resumo:
In the present research, we studied wines from three different south Brazilian winemaking regions with the purpose of differentiating them by geographical origin of the grapes. Brazil`s wide territory and climate diversity allow grape cultivation and winemaking in many regions of different and unique characteristics. The wine grape cultivation for winemaking concentrates in the South Region, mainly in the Serra GaA(0)cha, the mountain area of the state of Rio Grande do Sul, which is responsible for 90% of the domestic wine production. However, in recent years, two new production regions have developed: the Campanha, the plains to the south and the Serra do Sudeste, the hills to the southeast of the state. Analysis of isotopic ratios of (18)O/(16)O of wine water, (13)C/(12)C of ethanol, and of minerals were used to characterize wines from different regions. The isotope analysis of delta(18)O of wine water and minerals Mg and Rb were the most efficient to differentiate the regions. By using isotope and mineral analysis, and discrimination analysis, it was possible to classify the wines from south Brazil.
Resumo:
It has been demonstrated that laser induced breakdown spectrometry (LIBS) can be used as an alternative method for the determination of macro (P, K. Ca, Mg) and micronutrients (B, Fe, Cu, Mn, Zn) in pellets of plant materials. However, information is required regarding the sample preparation for plant analysis by LIBS. In this work, methods involving cryogenic grinding and planetary ball milling were evaluated for leaves comminution before pellets preparation. The particle sizes were associated to chemical sample properties such as fiber and cellulose contents, as well as to pellets porosity and density. The pellets were ablated at 30 different sites by applying 25 laser pulses per site (Nd:YAG@1064 nm, 5 ns, 10 Hz, 25J cm(-2)). The plasma emission collected by lenses was directed through an optical fiber towards a high resolution echelle spectrometer equipped with an ICCD. Delay time and integration time gate were fixed at 2.0 and 4.5 mu s, respectively. Experiments carried out with pellets of sugarcane, orange tree and soy leaves showed a significant effect of the plant species for choosing the most appropriate grinding conditions. By using ball milling with agate materials, 20 min grinding for orange tree and soy, and 60 min for sugarcane leaves led to particle size distributions generally lower than 75 mu m. Cryogenic grinding yielded similar particle size distributions after 10 min for orange tree, 20 min for soy and 30 min for sugarcane leaves. There was up to 50% emission signal enhancement on LIBS measurements for most elements by improving particle size distribution and consequently the pellet porosity. (C) 2011 Elsevier B.V. All rights reserved.
Resumo:
Quality control of toys for avoiding children exposure to potentially toxic elements is of utmost relevance and it is a common requirement in national and/or international norms for health and safety reasons. Laser-induced breakdown spectroscopy (LIBS) was recently evaluated at authors` laboratory for direct analysis of plastic toys and one of the main difficulties for the determination of Cd. Cr and Pb was the variety of mixtures and types of polymers. As most norms rely on migration (lixiviation) protocols, chemometric classification models from LIBS spectra were tested for sampling toys that present potential risk of Cd, Cr and Pb contamination. The classification models were generated from the emission spectra of 51 polymeric toys and by using Partial Least Squares - Discriminant Analysis (PLS-DA), Soft Independent Modeling of Class Analogy (SIMCA) and K-Nearest Neighbor (KNN). The classification models and validations were carried out with 40 and 11 test samples, respectively. Best results were obtained when KNN was used, with corrected predictions varying from 95% for Cd to 100% for Cr and Pb. (C) 2011 Elsevier B.V. All rights reserved.
Resumo:
Today several different unsupervised classification algorithms are commonly used to cluster similar patterns in a data set based only on its statistical properties. Specially in image data applications, self-organizing methods for unsupervised classification have been successfully applied for clustering pixels or group of pixels in order to perform segmentation tasks. The first important contribution of this paper refers to the development of a self-organizing method for data classification, named Enhanced Independent Component Analysis Mixture Model (EICAMM), which was built by proposing some modifications in the Independent Component Analysis Mixture Model (ICAMM). Such improvements were proposed by considering some of the model limitations as well as by analyzing how it should be improved in order to become more efficient. Moreover, a pre-processing methodology was also proposed, which is based on combining the Sparse Code Shrinkage (SCS) for image denoising and the Sobel edge detector. In the experiments of this work, the EICAMM and other self-organizing models were applied for segmenting images in their original and pre-processed versions. A comparative analysis showed satisfactory and competitive image segmentation results obtained by the proposals presented herein. (C) 2008 Published by Elsevier B.V.