926 resultados para Multidimensional scaling
Resumo:
Ocean acidification may stimulate primary production through increased availability of inorganic carbon in the photic zone, which may in turn change the biogenic flux of dissolved organic carbon (DOC) and the growth potential of heterotrophic bacteria. To investigate the effects of ocean acidification on marine bacterial assemblages, a two-by-three factorial mescosom experiment was conducted using surface sea water from the East Greenland Current in Fram Strait. Pyrosequencing of the V1-V2 region of bacterial 16S ribosomal RNA genes was used to investigate differences in the endpoint (Day 9) composition of bacterial assemblages in mineral nutrient-replete mesocosms amended with glucose (0 µm, 5.3 µm and 15.9 µm) under ambient (250 µatm) or acidified (400 µatm) partial pressures of CO2 (pCO2). All mesocosms showed low richness and diversity by Chao1 estimator and Shannon index, respectively, with general dominance by Gammaproteobacteria and Flavobacteria. Nonmetric multidimensional scaling analysis and two-way analysis of variance of the Jaccard dissimilarity matrix (97% similarity cut-off) demonstrated that the significant community shift between 0 µm and 15.9 µm glucose addition at 250 µatm pCO2 was eliminated at 400 µatm pCO2. These results suggest that the response potential of marine bacteria to DOC input may be altered under acidified conditions.
Resumo:
DNA extraction was carried out as described on the MICROBIS project pages (http://icomm.mbl.edu/microbis ) using a commercially available extraction kit. We amplified the hypervariable regions V4-V6 of archaeal and bacterial 16S rRNA genes using PCR and several sets of forward and reverse primers (http://vamps.mbl.edu/resources/primers.php). Massively parallel tag sequencing of the PCR products was carried out on a 454 Life Sciences GS FLX sequencer at Marine Biological Laboratory, Woods Hole, MA, following the same experimental conditions for all samples. Sequence reads were submitted to a rigorous quality control procedure based on mothur v30 (doi:10.1128/AEM.01541-09) including denoising of the flow grams using an algorithm based on PyroNoise (doi:10.1038/nmeth.1361), removal of PCR errors and a chimera check using uchime (doi:10.1093/bioinformatics/btr381). The reads were taxonomically assigned according to the SILVA taxonomy (SSURef v119, 07-2014; doi:10.1093/nar/gks1219) implemented in mothur and clustered at 98% ribosomal RNA gene V4-V6 sequence identity. V4-V6 amplicon sequence abundance tables were standardized to account for unequal sampling effort using 1000 (Archaea) and 2300 (Bacteria) randomly chosen sequences without replacement using mothur and then used to calculate inverse Simpson diversity indices and Chao1 richness (doi:10.2307/4615964). Bray-Curtis dissimilarities (doi:10.2307/1942268) between all samples were calculated and used for 2-dimensional non metric multidimensional scaling (NMDS) ordinations with 20 random starts (doi:10.1007/BF02289694). Stress values below 0.2 indicated that the multidimensional dataset was well represented by the 2D ordination. NMDS ordinations were compared and tested using Procrustes correlation analysis (doi:10.1007/BF02291478). All analyses were carried out with the R statistical environment and the packages vegan (available at: http://cran.r-project.org/package=vegan), labdsv (available at: http://cran.r-project.org/package=labdsv), as well as with custom R scripts. Operational taxonomic units at 98% sequence identity (OTU0.03) that occurred only once in the whole dataset were termed absolute single sequence OTUs (SSOabs; doi:10.1038/ismej.2011.132). OTU0.03 sequences that occurred only once in at least one sample, but may occur more often in other samples were termed relative single sequence OTUs (SSOrel). SSOrel are particularly interesting for community ecology, since they comprise rare organisms that might become abundant when conditions change.16S rRNA amplicons and metagenomic reads have been stored in the sequence read archive under SRA project accession number SRP042162.
Resumo:
The genetic history of a group of populations is usually analyzed by reconstructing a tree of their origins. Reliability of the reconstruction depends on the validity of the hypothesis that genetic differentiation of the populations is mostly due to population fissions followed by independent evolution. If necessary, adjustment for major population admixtures can be made. Dating the fissions requires comparisons with paleoanthropological and paleontological dates, which are few and uncertain. A method of absolute genetic dating recently introduced uses mutation rates as molecular clocks; it was applied to human evolution using microsatellites, which have a sufficiently high mutation rate. Results are comparable with those of other methods and agree with a recent expansion of modern humans from Africa. An alternative method of analysis, useful when there is adequate geographic coverage of regions, is the geographic study of frequencies of alleles or haplotypes. As in the case of trees, it is necessary to summarize data from many loci for conclusions to be acceptable. Results must be independent from the loci used. Multivariate analyses like principal components or multidimensional scaling reveal a number of hidden patterns and evaluate their relative importance. Most patterns found in the analysis of human living populations are likely to be consequences of demographic expansions, determined by technological developments affecting food availability, transportation, or military power. During such expansions, both genes and languages are spread to potentially vast areas. In principle, this tends to create a correlation between the respective evolutionary trees. The correlation is usually positive and often remarkably high. It can be decreased or hidden by phenomena of language replacement and also of gene replacement, usually partial, due to gene flow.
Resumo:
Efficient and reliable classification of visual stimuli requires that their representations reside a low-dimensional and, therefore, computationally manageable feature space. We investigated the ability of the human visual system to derive such representations from the sensory input-a highly nontrivial task, given the million or so dimensions of the visual signal at its entry point to the cortex. In a series of experiments, subjects were presented with sets of parametrically defined shapes; the points in the common high-dimensional parameter space corresponding to the individual shapes formed regular planar (two-dimensional) patterns such as a triangle, a square, etc. We then used multidimensional scaling to arrange the shapes in planar configurations, dictated by their experimentally determined perceived similarities. The resulting configurations closely resembled the original arrangements of the stimuli in the parameter space. This achievement of the human visual system was replicated by a computational model derived from a theory of object representation in the brain, according to which similarities between objects, and not the geometry of each object, need to be faithfully represented.
Resumo:
To examine population affinities in light of the ‘dual structure model’, frequencies of 21 nonmetric cranial traits were analyzed in 17 prehistoric to recent samples from Japan and five from continental northeast Asia. Eight bivariate plots, each representing a different bone or region of the skull, as well as cluster analysis of 21-trait mean measures of divergence using multidimensional scaling and additive tree techniques, revealed good discrimination between the Jomon-Ainu indigenous lineage and that of the immigrants who arrived from continental Asia after 300 BC. In Hokkaido, in agreement with historical records, Ainu villages of Hidaka province were least, and those close to the Japan Sea coast were most, hybridized with Wajin. In the central islands, clines were identified among Wajin skeletal samples whereby those from Kyushu most resembled continental northeast Asians, while those from the northernmost prefectures of Tohoku apparently retained the strongest indigenous heritage. In the more southerly prefectures of Tohoku, stronger traces of Jomon ancestry prevailed in the cohort born during the latest Edo period than in the one born after 1870. Thus, it seems that increased inter-regional mobility and gene flow following the Meiji Restoration initiated the most recent episode in the long process of demic diffusion that has helped to shape craniofacial change in Japan.
Resumo:
We have used microarray gene expression pro. ling and machine learning to predict the presence of BRAF mutations in a panel of 61 melanoma cell lines. The BRAF gene was found to be mutated in 42 samples (69%) and intragenic mutations of the NRAS gene were detected in seven samples (11%). No cell line carried mutations of both genes. Using support vector machines, we have built a classifier that differentiates between melanoma cell lines based on BRAF mutation status. As few as 83 genes are able to discriminate between BRAF mutant and BRAF wild-type samples with clear separation observed using hierarchical clustering. Multidimensional scaling was used to visualize the relationship between a BRAF mutation signature and that of a generalized mitogen-activated protein kinase ( MAPK) activation ( either BRAF or NRAS mutation) in the context of the discriminating gene list. We observed that samples carrying NRAS mutations lie somewhere between those with or without BRAF mutations. These observations suggest that there are gene-specific mutation signals in addition to a common MAPK activation that result from the pleiotropic effects of either BRAF or NRAS on other signaling pathways, leading to measurably different transcriptional changes.
Resumo:
Genetic diversity and population structure were investigated across the core range of Tasmanian devils (Sarcophilus laniarius; Dasyuridae), a wide-ranging marsupial carnivore restricted to the island of Tasmania. Heterozygosity (0.386-0.467) and allelic diversity (2.7-3.3) were low in all subpopulations and allelic size ranges were small and almost continuous, consistent with a founder effect. Island effects and repeated periods of low population density may also have contributed to the low variation. Within continuous habitat, gene flow appears extensive up to 50 km (high assignment rates to source or close neighbour populations; nonsignificant values of pairwise F-ST), in agreement with movement data. At larger scales (150-250 km), gene flow is reduced (significant pairwise F-ST) but there is no evidence for isolation by distance. The most substantial genetic structuring was observed for comparisons spanning unsuitable habitat, implying limited dispersal of devils between the well-connected, eastern populations and a smaller northwestern population. The genetic distinctiveness of the northwestern population was reflected in all analyses: unique alleles; multivariate analyses of gene frequency (multidimensional scaling, minimum spanning tree, nearest neighbour); high self-assignment (95%); two distinct populations for Tasmania were detected in isolation by distance and in Bayesian model-based clustering analyses. Marsupial carnivores appear to have stronger population subdivisions than their placental counterparts.
Resumo:
This study presents results from an experimental 10-day research charter that was designed to quantify the effects of (a) a turtle excluder device (TED), (b) a radial escape section bycatch reduction device (BRD) and (c) both devices together, on bycatch and prawn catch rates in the Queensland shallow water eastern king prawn (Penaeus plebejus) trawl fishery. The bycatch was comprised of 250 taxa, mainly gurnards, whiting, lizard fish, flathead, dragonets, portunid crabs, turretfish and flounders. The observed mean catch rates of bycatch and marketable eastern king prawns from the standard trawl net (i.e., net with no TED or BRD) used during the charter were 11.06 kg/hectare (ha(-1)) (S.E. 0.90) swept by the trawl gear and 0.94 kg ha(-1), respectively. For the range of depths sampled (20.1-90.7 m), bycatch rates declined significantly at a rate of 0.14 kg ha-1 for every 1 m increase in depth, while prawn catch rates were unaffected. When both the TED and radial escape section BRD were used together, the bycatch rate declined by 24% compared to a standard net, but at a 20% reduction in marketable prawn catch rate. The largest reductions were achieved for stout whiting Sillago robusta (57% reduction) and yellowtail scad Trachurus novaezelandiae (32% reduction). Multidimensional scaling and analysis of similarities revealed that bycatch assemblages differed significantly between depths and latitude, but not between the different combinations of bycatch reduction devices. Despite the lowered prawn catch rates, the reduced bycatch rates are promising, particularly for S. robusta, which is targeted in another fishery. Prawn trawl operators are not permitted to retain S. robusta and the devices examined herein offer the potential to significantly reduce the incidental fishing mortality that this species experiences. (c) 2006 Elsevier B.V. All rights reserved.
Resumo:
This thesis is a study of the generation of topographic mappings - dimension reducing transformations of data that preserve some element of geometric structure - with feed-forward neural networks. As an alternative to established methods, a transformational variant of Sammon's method is proposed, where the projection is effected by a radial basis function neural network. This approach is related to the statistical field of multidimensional scaling, and from that the concept of a 'subjective metric' is defined, which permits the exploitation of additional prior knowledge concerning the data in the mapping process. This then enables the generation of more appropriate feature spaces for the purposes of enhanced visualisation or subsequent classification. A comparison with established methods for feature extraction is given for data taken from the 1992 Research Assessment Exercise for higher educational institutions in the United Kingdom. This is a difficult high-dimensional dataset, and illustrates well the benefit of the new topographic technique. A generalisation of the proposed model is considered for implementation of the classical multidimensional scaling (¸mds}) routine. This is related to Oja's principal subspace neural network, whose learning rule is shown to descend the error surface of the proposed ¸mds model. Some of the technical issues concerning the design and training of topographic neural networks are investigated. It is shown that neural network models can be less sensitive to entrapment in the sub-optimal global minima that badly affect the standard Sammon algorithm, and tend to exhibit good generalisation as a result of implicit weight decay in the training process. It is further argued that for ideal structure retention, the network transformation should be perfectly smooth for all inter-data directions in input space. Finally, there is a critique of optimisation techniques for topographic mappings, and a new training algorithm is proposed. A convergence proof is given, and the method is shown to produce lower-error mappings more rapidly than previous algorithms.
Resumo:
This thesis seeks to describe the development of an inexpensive and efficient clustering technique for multivariate data analysis. The technique starts from a multivariate data matrix and ends with graphical representation of the data and pattern recognition discriminant function. The technique also results in distances frequency distribution that might be useful in detecting clustering in the data or for the estimation of parameters useful in the discrimination between the different populations in the data. The technique can also be used in feature selection. The technique is essentially for the discovery of data structure by revealing the component parts of the data. lhe thesis offers three distinct contributions for cluster analysis and pattern recognition techniques. The first contribution is the introduction of transformation function in the technique of nonlinear mapping. The second contribution is the us~ of distances frequency distribution instead of distances time-sequence in nonlinear mapping, The third contribution is the formulation of a new generalised and normalised error function together with its optimal step size formula for gradient method minimisation. The thesis consists of five chapters. The first chapter is the introduction. The second chapter describes multidimensional scaling as an origin of nonlinear mapping technique. The third chapter describes the first developing step in the technique of nonlinear mapping that is the introduction of "transformation function". The fourth chapter describes the second developing step of the nonlinear mapping technique. This is the use of distances frequency distribution instead of distances time-sequence. The chapter also includes the new generalised and normalised error function formulation. Finally, the fifth chapter, the conclusion, evaluates all developments and proposes a new program. for cluster analysis and pattern recognition by integrating all the new features.
Resumo:
This thesis presents an investigation of the structure of people's occupational perceptions. The questionnaires used In this study collected both descriptive information about people's perceptions of occupations and also pair comparison similarities data. The data were collected both in the United States of America and England from samples of subjects who differed in terms of age and sex. This provided, therefore, both cross-cultural and developmental dimensions to the study. A cognitive orientation to the study of vocational behaviour is developed and multidimensional scaling procedures are used to analyze the data. A prime concern of the thesis is to examine the appropriateness of this approach and these techniques to this subject area. The results of this study show that a considerable range of individuaI differences exist in occupational perceptions.0lder subjects have a more complex structure to their perceptions and showed greater consensus as to how they perceived occupations to relate to each other. Younger subjects exhibited a greater range of individual differences in occupational perceptions but had, on average, a simpler subjective occupational structure. The multidimensional scaling procedures used in this study were able to reveal how occupational perceptions were structured, to relate these occupational perceptions to occupational preferences and other evaluative data, and to show that the groupings and structure of occupational perceptions ore similar to the dimensions used in occupational classification schemes. ImpIications of these resultts to vocationaI guidance theory and practice are discussed. The resuIts reported here strongly support both the use of the cognitive approach adopted here and demonstrate the potential of multidimensional scaling techniques for further:research in the field of vocational psychology.
Resumo:
Excessive consumption of dietary fat is acknowledged to be a widespread problem linked to a range of medical conditions. Despite this, little is known about the specific sensory appeal held by fats and no previous published research exists concerning human perception of non-textural taste qualities in fats. This research aimed to address whether a taste component can be found in sensory perception of pure fats. It also examined whether individual differences existed in human taste responses to fat, using both aggregated data analysis methods and multidimensional scaling. Results indicated that individuals were able to detect both the primary taste qualities of sweet, salty, sour and bitter in pure processed oils and reliably ascribe their own individually-generated taste labels, suggested that a taste component may be present in human responses to fat. Individual variation appeared to exist, both in the perception of given taste qualities and in perceived intensity and preferences. A number of factors were examined in relation to such individual differences in taste perception, including age, gender, genetic sensitivity to 6-n-propylthiouracil, body mass, dietary preferences and intake, dieting behaviours and restraint. Results revealed that, to varying extents, gender, age, sensitivity to 6-n-propylthiouracil, dietary preferences, habitual dietary intake and restraint all appeared to be related to individual variation in taste responses to fat. However, in general, these differences appeared to exist in the form of differing preferences and levels of intensity with which taste qualities detected in fat were perceived, as opposed to the perception of specific taste qualities being associated with given traits or states. Equally, each of these factors appeared to exert only a limited influence upon variation in sensory responses and thus the potential for using taste responses to fats as a marker for issues such as over-consumption, obesity or eating disorder is at present limited.
Resumo:
A recent novel approach to the visualisation and analysis of datasets, and one which is particularly applicable to those of a high dimension, is discussed in the context of real applications. A feed-forward neural network is utilised to effect a topographic, structure-preserving, dimension-reducing transformation of the data, with an additional facility to incorporate different degrees of associated subjective information. The properties of this transformation are illustrated on synthetic and real datasets, including the 1992 UK Research Assessment Exercise for funding in higher education. The method is compared and contrasted to established techniques for feature extraction, and related to topographic mappings, the Sammon projection and the statistical field of multidimensional scaling.
Resumo:
In this paper, we investigate the use of manifold learning techniques to enhance the separation properties of standard graph kernels. The idea stems from the observation that when we perform multidimensional scaling on the distance matrices extracted from the kernels, the resulting data tends to be clustered along a curve that wraps around the embedding space, a behavior that suggests that long range distances are not estimated accurately, resulting in an increased curvature of the embedding space. Hence, we propose to use a number of manifold learning techniques to compute a low-dimensional embedding of the graphs in an attempt to unfold the embedding manifold, and increase the class separation. We perform an extensive experimental evaluation on a number of standard graph datasets using the shortest-path (Borgwardt and Kriegel, 2005), graphlet (Shervashidze et al., 2009), random walk (Kashima et al., 2003) and Weisfeiler-Lehman (Shervashidze et al., 2011) kernels. We observe the most significant improvement in the case of the graphlet kernel, which fits with the observation that neglecting the locational information of the substructures leads to a stronger curvature of the embedding manifold. On the other hand, the Weisfeiler-Lehman kernel partially mitigates the locality problem by using the node labels information, and thus does not clearly benefit from the manifold learning. Interestingly, our experiments also show that the unfolding of the space seems to reduce the performance gap between the examined kernels.
Resumo:
The quantum Jensen-Shannon divergence kernel [1] was recently introduced in the context of unattributed graphs where it was shown to outperform several commonly used alternatives. In this paper, we study the separability properties of this kernel and we propose a way to compute a low-dimensional kernel embedding where the separation of the different classes is enhanced. The idea stems from the observation that the multidimensional scaling embeddings on this kernel show a strong horseshoe shape distribution, a pattern which is known to arise when long range distances are not estimated accurately. Here we propose to use Isomap to embed the graphs using only local distance information onto a new vectorial space with a higher class separability. The experimental evaluation shows the effectiveness of the proposed approach. © 2013 Springer-Verlag.