966 resultados para Hierarchical Spatial Classification
Resumo:
Genetic models of sex and caste determination in eusocial stingless bees suggest specific patterns of male, worker and gyne cell distribution in the brood comb. Conflict between queen and laying workers over male parentage and center-periphery gradients of conditions, such as food and temperature, could also contribute to non-random spatial configuration. We converted the positions of the hexagonal cells in a brood comb to Cartesian coordinates, labeled by sex or caste of the individuals inside. To detect and locate clustered patterns, the mapped brood combs were evaluated by indexes of dispersion (MMC, mean distance of cells of a given category from their centroid) and eccentricity (DMB, distance between this centroid and the overall brood comb centroid) that we developed. After randomizing the labels and recalculating the indexes, we calculated probabilities that the original values had been generated by chance. We created sets of binary brood combs in which males were aggregated, regularly or randomly distributed among females. These stylized maps were used to describe the power of MMC and DMB, and they were applied to evaluate the male distribution in the sampled Nannotrigona testaceicornis brood combs. MMC was very sensitive to slight deviations from a perfectly rounded clump; DMB detected any asymmetry in the location of these compact to fuzzy clusters. Six of the 82 brood combs of N. testaceicornis that we analyzed had more than nine males, distributed according to variations in spatial patterns, as indicated by the two indexes.
Resumo:
Positional information in developing embryos is specified by spatial gradients of transcriptional regulators. One of the classic systems for studying this is the activation of the hunchback (hb) gene in early fruit fly (Drosophila) segmentation by the maternally-derived gradient of the Bicoid (Bcd) protein. Gene regulation is subject to intrinsic noise which can produce variable expression. This variability must be constrained in the highly reproducible and coordinated events of development. We identify means by which noise is controlled during gene expression by characterizing the dependence of hb mRNA and protein output noise on hb promoter structure and transcriptional dynamics. We use a stochastic model of the hb promoter in which the number and strength of Bcd and Hb (self-regulatory) binding sites can be varied. Model parameters are fit to data from WT embryos, the self-regulation mutant hb(14F), and lacZ reporter constructs using different portions of the hb promoter. We have corroborated model noise predictions experimentally. The results indicate that WT (self-regulatory) Hb output noise is predominantly dependent on the transcription and translation dynamics of its own expression, rather than on Bcd fluctuations. The constructs and mutant, which lack self-regulation, indicate that the multiple Bcd binding sites in the hb promoter (and their strengths) also play a role in buffering noise. The model is robust to the variation in Bcd binding site number across a number of fly species. This study identifies particular ways in which promoter structure and regulatory dynamics reduce hb output noise. Insofar as many of these are common features of genes (e. g. multiple regulatory sites, cooperativity, self-feedback), the current results contribute to the general understanding of the reproducibility and determinacy of spatial patterning in early development.
Resumo:
Online music databases have increased significantly as a consequence of the rapid growth of the Internet and digital audio, requiring the development of faster and more efficient tools for music content analysis. Musical genres are widely used to organize music collections. In this paper, the problem of automatic single and multi-label music genre classification is addressed by exploring rhythm-based features obtained from a respective complex network representation. A Markov model is built in order to analyse the temporal sequence of rhythmic notation events. Feature analysis is performed by using two multi-variate statistical approaches: principal components analysis (unsupervised) and linear discriminant analysis (supervised). Similarly, two classifiers are applied in order to identify the category of rhythms: parametric Bayesian classifier under the Gaussian hypothesis (supervised) and agglomerative hierarchical clustering (unsupervised). Qualitative results obtained by using the kappa coefficient and the obtained clusters corroborated the effectiveness of the proposed method.
Resumo:
A combined analytical and numerical study is performed of the mapping between strongly interacting fermions and weakly interacting spins, in the framework of the Hubbard, t-J, and Heisenberg models. While for spatially homogeneous models in the thermodynamic limit the mapping is thoroughly understood, we here focus on aspects that become relevant in spatially inhomogeneous situations, such as the effect of boundaries, impurities, superlattices, and interfaces. We consider parameter regimes that are relevant for traditional applications of these models, such as electrons in cuprates and manganites, and for more recent applications to atoms in optical lattices. The rate of the mapping as a function of the interaction strength is determined from the Bethe-Ansatz for infinite systems and from numerical diagonalization for finite systems. We show analytically that if translational symmetry is broken through the presence of impurities, the mapping persists and is, in a certain sense, as local as possible, provided the spin-spin interaction between two sites of the Heisenberg model is calculated from the harmonic mean of the onsite Coulomb interaction on adjacent sites of the Hubbard model. Numerical calculations corroborate these findings also in interfaces and superlattices, where analytical calculations are more complicated.
Resumo:
Efficient automatic protein classification is of central importance in genomic annotation. As an independent way to check the reliability of the classification, we propose a statistical approach to test if two sets of protein domain sequences coming from two families of the Pfam database are significantly different. We model protein sequences as realizations of Variable Length Markov Chains (VLMC) and we use the context trees as a signature of each protein family. Our approach is based on a Kolmogorov-Smirnov-type goodness-of-fit test proposed by Balding et at. [Limit theorems for sequences of random trees (2008), DOI: 10.1007/s11749-008-0092-z]. The test statistic is a supremum over the space of trees of a function of the two samples; its computation grows, in principle, exponentially fast with the maximal number of nodes of the potential trees. We show how to transform this problem into a max-flow over a related graph which can be solved using a Ford-Fulkerson algorithm in polynomial time on that number. We apply the test to 10 randomly chosen protein domain families from the seed of Pfam-A database (high quality, manually curated families). The test shows that the distributions of context trees coming from different families are significantly different. We emphasize that this is a novel mathematical approach to validate the automatic clustering of sequences in any context. We also study the performance of the test via simulations on Galton-Watson related processes.
Resumo:
The problem of semialgebraic Lipschitz classification of quasihomogeneous polynomials on a Holder triangle is studied. For this problem, the ""moduli"" are described completely in certain combinatorial terms.
Resumo:
Epilithic biofilm on rocky shores is regulated by physico-chemical and biological factors and is important as a source of food for benthic organisms. The influences of environmental and grazing pressure on spatial variability of biomass of biofilm were evaluated on shores on the north coast of Sao Paulo State (SE Brazil). A general trend of greater abundance of microalgae was observed lower on the shore, but neither of the environmental factors evaluated (wave exposure and shore level) showed consistent effects, and differences were found among specific shores or times (September 2007 and March 2008). The abundance of slow-moving grazers (limpets and littorinids) showed a negative correlation with chlorophyll a concentration on shores. However, experimental exclusion of these grazers failed to show consistent results at small spatial scales. Observations of divergent abundances of the isopod Ligia exotica and biomass of biofilm on isolated boulders on shores led to a short exclusion experiment, where the grazing pressure by L. exotica significantly decreased microalgal biomass. The result suggests that grazing activities of this fast-moving consumer probably mask the influence of slow-moving grazers at small spatial scales, while both have an additive effect at larger scales that masks environmental influences. This is the first evaluation of the impact of the fast-moving herbivore L. exotica on microalgal biomass on rocky shores and opens an interesting discussion about the role of these organisms in subtropical coastal environments.
Resumo:
Nitrogen variations at different spatial scales and integrated across functional groups were addressed for lowland tropical forests in the Brazilian Amazon as follows: (1) how does N availability vary across the region over different spatial scales (regional x landscape scale); ( 2) how are these variations in N availability integrated across plant functional groups ( legume 9 non-legume trees). Leaf N, P, and Ca concentrations as well the leaf N isotope ratios (delta(15)N) from a large set of legume and non-legume tree species were measured. Legumes had higher foliar N/Ca ratios than non-legumes, consistent with the high energetic costs in plant growth associated with higher foliar P/Ca ratios found in legumes than in non-legumes. At the regional level, foliar delta(15)N decreased with increasing rainfall. At the landscape level, N availability was higher in the forests on clayey soils on the plateau than in forests on sandier soils. The isotope as well as the non-isotope data relationships here documented, explain to a large extent the variation in delta(15)N signatures across gradients of rainfall and soil. Although at the regional level, the precipitation regime is a major determinant of differences in N availability, at the landscape level, under the same precipitation regime, soil type seems to be a major factor influencing the availability of N in the Brazilian Amazon forest.
Resumo:
Psecas chapoda, a neotropical jumping spider strictly associated with the terrestrial bromeliad Bromelia balansae in cerrados and semi-deciduous forests in South America, effectively contributes to plant nutrition and growth. In this study, our goal was to investigate if spider density caused spatial variations in the strength of this spider-plant mutualism. We found a positive significant relationship between spider density and delta N-15 values for bromeliad leaves in different forest fragments. Open grassland Bromeliads were associated with spiders and had higher delta N-15 values compared to forest bromeliads. Although forest bromeliads had no association with spiders their total N concentrations were higher. These results suggest that bromeliad nutrition is likely more litter-based in forests and more spider-based in open grasslands. This study is one of the few to show nutrient provisioning and conditionality in a spider-plant system. (c) 2008 Elsevier Masson SAS. All rights reserved.
Resumo:
Quality control of toys for avoiding children exposure to potentially toxic elements is of utmost relevance and it is a common requirement in national and/or international norms for health and safety reasons. Laser-induced breakdown spectroscopy (LIBS) was recently evaluated at authors` laboratory for direct analysis of plastic toys and one of the main difficulties for the determination of Cd. Cr and Pb was the variety of mixtures and types of polymers. As most norms rely on migration (lixiviation) protocols, chemometric classification models from LIBS spectra were tested for sampling toys that present potential risk of Cd, Cr and Pb contamination. The classification models were generated from the emission spectra of 51 polymeric toys and by using Partial Least Squares - Discriminant Analysis (PLS-DA), Soft Independent Modeling of Class Analogy (SIMCA) and K-Nearest Neighbor (KNN). The classification models and validations were carried out with 40 and 11 test samples, respectively. Best results were obtained when KNN was used, with corrected predictions varying from 95% for Cd to 100% for Cr and Pb. (C) 2011 Elsevier B.V. All rights reserved.
Microsatellite Polymorphisms in Cassava Landraces from the Cerrado Biome, Mato Grosso do Sul, Brazil
Resumo:
Using nine microsatellite loci, we investigated genetic structure and diversity in 83 Brazilian cassava accessions, including several landraces, in the Cerrado biome in Mato Grosso do Sul, Brazil. All nine loci were polymorphic, averaging 6.00 alleles per locus. Treating each of seven municipalities as a cassava group or population, they averaged 3.5 alleles per locus, with 97% polymorphic loci, high values for observed heterozygosity (0.32) and gene diversity (0.56). Total genetic variability was high (0.668), and most of this genetic variability was concentrated within municipalities (0.577). Cluster and structure analyses divided accessions into two major clusters or populations (K = 2). Also, a significant genetic versus geographic correlation was found (r = 0.4567; P < 0.0260). Migratory routes in the Cerrado are considered main contributors to the region`s high cassava diversity and spatial genetic structure, amplifying interactions between traditional farmers and the evolutionary dynamics of this crop.
Resumo:
Objective: We carry out a systematic assessment on a suite of kernel-based learning machines while coping with the task of epilepsy diagnosis through automatic electroencephalogram (EEG) signal classification. Methods and materials: The kernel machines investigated include the standard support vector machine (SVM), the least squares SVM, the Lagrangian SVM, the smooth SVM, the proximal SVM, and the relevance vector machine. An extensive series of experiments was conducted on publicly available data, whose clinical EEG recordings were obtained from five normal subjects and five epileptic patients. The performance levels delivered by the different kernel machines are contrasted in terms of the criteria of predictive accuracy, sensitivity to the kernel function/parameter value, and sensitivity to the type of features extracted from the signal. For this purpose, 26 values for the kernel parameter (radius) of two well-known kernel functions (namely. Gaussian and exponential radial basis functions) were considered as well as 21 types of features extracted from the EEG signal, including statistical values derived from the discrete wavelet transform, Lyapunov exponents, and combinations thereof. Results: We first quantitatively assess the impact of the choice of the wavelet basis on the quality of the features extracted. Four wavelet basis functions were considered in this study. Then, we provide the average accuracy (i.e., cross-validation error) values delivered by 252 kernel machine configurations; in particular, 40%/35% of the best-calibrated models of the standard and least squares SVMs reached 100% accuracy rate for the two kernel functions considered. Moreover, we show the sensitivity profiles exhibited by a large sample of the configurations whereby one can visually inspect their levels of sensitiveness to the type of feature and to the kernel function/parameter value. Conclusions: Overall, the results evidence that all kernel machines are competitive in terms of accuracy, with the standard and least squares SVMs prevailing more consistently. Moreover, the choice of the kernel function and parameter value as well as the choice of the feature extractor are critical decisions to be taken, albeit the choice of the wavelet family seems not to be so relevant. Also, the statistical values calculated over the Lyapunov exponents were good sources of signal representation, but not as informative as their wavelet counterparts. Finally, a typical sensitivity profile has emerged among all types of machines, involving some regions of stability separated by zones of sharp variation, with some kernel parameter values clearly associated with better accuracy rates (zones of optimality). (C) 2011 Elsevier B.V. All rights reserved.
Resumo:
Traditionally, chronotype classification is based on the Morningness-Eveningness Questionnaire (MEQ). It is implicit in the classification that intermediate individuals get intermediate scores to most of the MEQ questions. However, a small group of individuals has a different pattern of answers. In some questions, they answer as ""morning-types"" and in some others they answer as ""evening-types,"" resulting in an intermediate total score. ""Evening-type"" and ""Morning-type"" answers were set as A(1) and A(4), respectively. Intermediate answers were set as A(2) and A(3). The following algorithm was applied: Bimodality Index = (Sigma A(1) x Sigma A(4))(2) - (Sigma A(2) x Sigma A(3))(2). Neither-types that had positive bimodality scores were classified as bimodal. If our hypothesis is validated by objective data, an update of chronotype classification will be required. (Author correspondence: brunojm@ymail.com)
Resumo:
Oropharyngeal dysphagia is characterized by any alteration in swallowing dynamics which may lead to malnutrition and aspiration pneumonia. Early diagnosis is crucial for the prognosis of patients with dysphagia, and the best method for swallowing dynamics assessment is swallowing videofluoroscopy, an exam performed with X-rays. Because it exposes patients to radiation, videofluoroscopy should not be performed frequently nor should it be prolonged. This study presents a non-invasive method for the pre-diagnosis of dysphagia based on the analysis of the swallowing acoustics, where the discrete wavelet transform plays an important role to increase sensitivity and specificity in the identification of dysphagic patients. (C) 2008 Elsevier Inc. All rights reserved.
Resumo:
The properties of recycled aggregate produced from mixed (masonry and concrete) construction and demolition (C&D) waste are highly variable, and this restricts the use of such aggregate in structural concrete production. The development of classification techniques capable of reducing this variability is instrumental for quality control purposes and the production of high quality C&D aggregate. This paper investigates how the classification of C&D mixed coarse aggregate according to porosity influences the mechanical performance of concrete. Concretes using a variety of C&D aggregate porosity classes and different water/cement ratios were produced and the mechanical properties measured. For concretes produced with constant volume fractions of water, cement, natural sand and coarse aggregate from recycled mixed C&D waste, the compressive strength and Young modulus are direct exponential functions of the aggregate porosity. Sink and float technique is a simple laboratory density separation tool that facilitates the separation of cement particles with lower porosity, a difficult task when done only by visual sorting. For this experiment, separation using a 2.2 kg/dmA(3) suspension produced recycled aggregate (porosity less than 17%) which yielded good performance in concrete production. Industrial gravity separators may lead to the production of high quality recycled aggregate from mixed C&D waste for structural concrete applications.