783 resultados para grid, clustering, statistical, clustering


Relevância:

30.00% 30.00%

Publicador:

Resumo:

A probabilistic method is proposed to evaluate voltage quality of grid-connected photovoltaic (PV) power systems. The random behavior of solar irradiation is described in statistical terms and the resulting voltage fluctuation probability distribution is then derived. Reactive power capabilities of the PV generators are then analyzed and their operation under constant power factor mode is examined. By utilizing the reactive power capability of the PV-generators to the full, it is shown that network voltage quality can be greatly enhanced.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Given the drawbacks for using geo-political areas in mapping outcomes unrelated to geo-politics, a compromise is to aggregate and analyse data at the grid level. This has the advantage of allowing spatial smoothing and modelling at a biologically or physically relevant scale. This article addresses two consequent issues: the choice of the spatial smoothness prior and the scale of the grid. Firstly, we describe several spatial smoothness priors applicable for grid data and discuss the contexts in which these priors can be employed based on different aims. Two such aims are considered, i.e., to identify regions with clustering and to model spatial dependence in the data. Secondly, the choice of the grid size is shown to depend largely on the spatial patterns. We present a guide on the selection of spatial scales and smoothness priors for various point patterns based on the two aims for spatial smoothing.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Provenance studies of iron-age pottery specimens originating from the Mngeni river area in South Africa was carried out by applying XRF spectrometry. A total of sixteen major and trace elements were analysed in a batch of 107 potsherds, excavated from four different archaeological sites in the aforementioned area. A multivariate statistical programme Correspondence Analysis was used in this study to obtain the relevant clustering patterns according to the similarity of the elemental distributions. Differences and similarities in the clusters obtained for the majors and trace elements are discussed.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

PURPOSE The purpose of this study was to demonstrate the potential of near infrared (NIR) spectroscopy for characterizing the health and degenerative state of articular cartilage based on the components of the Mankin score. METHODS Three models of osteoarthritic degeneration induced in laboratory rats by anterior cruciate ligament (ACL) transection, meniscectomy (MSX), and intra-articular injection of monoiodoacetate (1 mg) (MIA) were used in this study. Degeneration was induced in the right knee joint; each model group consisted of 12 rats (N = 36). After 8 weeks, the animals were euthanized and knee joints were collected. A custom-made diffuse reflectance NIR probe of 5-mm diameter was placed on the tibial and femoral surfaces, and spectral data were acquired from each specimen in the wave number range of 4,000 to 12,500 cm(-1). After spectral data acquisition, the specimens were fixed and safranin O staining (SOS) was performed to assess disease severity based on the Mankin scoring system. Using multivariate statistical analysis, with spectral preprocessing and wavelength selection technique, the spectral data were then correlated to the structural integrity (SI), cellularity (CEL), and matrix staining (SOS) components of the Mankin score for all the samples tested. RESULTS ACL models showed mild cartilage degeneration, MSX models had moderate degeneration, and MIA models showed severe cartilage degenerative changes both morphologically and histologically. Our results reveal significant linear correlations between the NIR absorption spectra and SI (R(2) = 94.78%), CEL (R(2) = 88.03%), and SOS (R(2) = 96.39%) parameters of all samples in the models. In addition, clustering of the samples according to their level of degeneration, with respect to the Mankin components, was also observed. CONCLUSIONS NIR spectroscopic probing of articular cartilage can potentially provide critical information about the health of articular cartilage matrix in early and advanced stages of osteoarthritis (OA). CLINICAL RELEVANCE This rapid nondestructive method can facilitate clinical appraisal of articular cartilage integrity during arthroscopic surgery.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Description of a patient's injuries is recorded in narrative text form by hospital emergency departments. For statistical reporting, this text data needs to be mapped to pre-defined codes. Existing research in this field uses the Naïve Bayes probabilistic method to build classifiers for mapping. In this paper, we focus on providing guidance on the selection of a classification method. We build a number of classifiers belonging to different classification families such as decision tree, probabilistic, neural networks, and instance-based, ensemble-based and kernel-based linear classifiers. An extensive pre-processing is carried out to ensure the quality of data and, in hence, the quality classification outcome. The records with a null entry in injury description are removed. The misspelling correction process is carried out by finding and replacing the misspelt word with a soundlike word. Meaningful phrases have been identified and kept, instead of removing the part of phrase as a stop word. The abbreviations appearing in many forms of entry are manually identified and only one form of abbreviations is used. Clustering is utilised to discriminate between non-frequent and frequent terms. This process reduced the number of text features dramatically from about 28,000 to 5000. The medical narrative text injury dataset, under consideration, is composed of many short documents. The data can be characterized as high-dimensional and sparse, i.e., few features are irrelevant but features are correlated with one another. Therefore, Matrix factorization techniques such as Singular Value Decomposition (SVD) and Non Negative Matrix Factorization (NNMF) have been used to map the processed feature space to a lower-dimensional feature space. Classifiers with these reduced feature space have been built. In experiments, a set of tests are conducted to reflect which classification method is best for the medical text classification. The Non Negative Matrix Factorization with Support Vector Machine method can achieve 93% precision which is higher than all the tested traditional classifiers. We also found that TF/IDF weighting which works well for long text classification is inferior to binary weighting in short document classification. Another finding is that the Top-n terms should be removed in consultation with medical experts, as it affects the classification performance.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

We propose the use of optical flow information as a method for detecting and describing changes in the environment, from the perspective of a mobile camera. We analyze the characteristics of the optical flow signal and demonstrate how robust flow vectors can be generated and used for the detection of depth discontinuities and appearance changes at key locations. To successfully achieve this task, a full discussion on camera positioning, distortion compensation, noise filtering, and parameter estimation is presented. We then extract statistical attributes from the flow signal to describe the location of the scene changes. We also employ clustering and dominant shape of vectors to increase the descriptiveness. Once a database of nodes (where a node is a detected scene change) and their corresponding flow features is created, matching can be performed whenever nodes are encountered, such that topological localization can be achieved. We retrieve the most likely node according to the Mahalanobis and Chi-square distances between the current frame and the database. The results illustrate the applicability of the technique for detecting and describing scene changes in diverse lighting conditions, considering indoor and outdoor environments and different robot platforms.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

This project aimed to identify novel genetic risk variants associated with migraine in the Norfolk Island population. Statistical analysis and bioinformatics approaches such as polygenic modeling and gene clustering methods were carried out to explore genotypic and expression data from high-throughput techniques. This project had a particular focus on hormonal genes and other genetic variants and identified a modest effect size on the migraine phenotype.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The article describes a generalized estimating equations approach that was used to investigate the impact of technology on vessel performance in a trawl fishery during 1988-96, while accounting for spatial and temporal correlations in the catch-effort data. Robust estimation of parameters in the presence of several levels of clustering depended more on the choice of cluster definition than on the choice of correlation structure within the cluster. Models with smaller cluster sizes produced stable results, while models with larger cluster sizes, that may have had complex within-cluster correlation structures and that had within-cluster covariates, produced estimates sensitive to the correlation structure. The preferred model arising from this dataset assumed that catches from a vessel were correlated in the same years and the same areas, but independent in different years and areas. The model that assumed catches from a vessel were correlated in all years and areas, equivalent to a random effects term for vessel, produced spurious results. This was an unexpected finding that highlighted the need to adopt a systematic strategy for modelling. The article proposes a modelling strategy of selecting the best cluster definition first, and the working correlation structure (within clusters) second. The article discusses the selection and interpretation of the model in the light of background knowledge of the data and utility of the model, and the potential for this modelling approach to apply in similar statistical situations.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

A pressed-plate Fe electrode for alkalines storage batteries, designed using a statistical method (fractional factorial technique), is described. Parameters such as the configuration of the base grid, electrode compaction temperature and pressure, binder composition, mixing time, etc. have been optimised using this method. The optimised electrodes have a capacity of 300 plus /minus 5 mA h/g of active material (mixture of Fe and magnetite) at 7 h rate to a cut-off voltage of 8.86V vs. Hg/HgO, OH exp 17 ref.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Large fruited spotted gum eucalypt Corymbia henryi occurs sympatrically with small fruited spotted gum Corymbia citriodora subspecies variegata over a large portion of its range on the east coast of Australia. The two taxa are interfertile, have overlapping flowering times and share a common set of insect and vertebrate pollinators. Previous genetic analysis of both taxa from two geographically remote sites suggested that the two were morphotypes rather than genetically distinct species. In this study we further explore this hypothesis of genic species by expanding sampling broadly through their sympatric locations and examine local-scale spatial genetic structure in stands that differ in species and age composition. Delineation of populations at five microsatellite loci, using an individual-based approach and Bayesian modelling, as well as clustering of individuals based on allele frequencies showed the two species to be molecularly homogeneous. Genetic structure aligned largely with geographic areas of origin, and followed an isolation-by-distance model, where proximal populations were generally less differentiated than more distant ones. At the stand level, spotted gums also generally showed little structure consistent with the high levels of gene flow inferred across the species range. Disturbances in the uniformity of structuring were detected, however, and attributed to localised events giving rise to even aged stands, probably due to regeneration from a few individuals following fire.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

This thesis studies human gene expression space using high throughput gene expression data from DNA microarrays. In molecular biology, high throughput techniques allow numerical measurements of expression of tens of thousands of genes simultaneously. In a single study, this data is traditionally obtained from a limited number of sample types with a small number of replicates. For organism-wide analysis, this data has been largely unavailable and the global structure of human transcriptome has remained unknown. This thesis introduces a human transcriptome map of different biological entities and analysis of its general structure. The map is constructed from gene expression data from the two largest public microarray data repositories, GEO and ArrayExpress. The creation of this map contributed to the development of ArrayExpress by identifying and retrofitting the previously unusable and missing data and by improving the access to its data. It also contributed to creation of several new tools for microarray data manipulation and establishment of data exchange between GEO and ArrayExpress. The data integration for the global map required creation of a new large ontology of human cell types, disease states, organism parts and cell lines. The ontology was used in a new text mining and decision tree based method for automatic conversion of human readable free text microarray data annotations into categorised format. The data comparability and minimisation of the systematic measurement errors that are characteristic to each lab- oratory in this large cross-laboratories integrated dataset, was ensured by computation of a range of microarray data quality metrics and exclusion of incomparable data. The structure of a global map of human gene expression was then explored by principal component analysis and hierarchical clustering using heuristics and help from another purpose built sample ontology. A preface and motivation to the construction and analysis of a global map of human gene expression is given by analysis of two microarray datasets of human malignant melanoma. The analysis of these sets incorporate indirect comparison of statistical methods for finding differentially expressed genes and point to the need to study gene expression on a global level.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Conformational studies have been carried out on hydrogenbonded all-trans cyclic pentapeptide backbone. Application of a combination of grid search and energy minimization on this system has resulted in obtaining 23 minimum energy conformations, which are characterized by unique patterns of hydrogen bonding comprising of β- and γ-turns. A study of the minimum energy conformationsvis-a-vis non-planar deviation of the peptide units reveals that non-planarity is an inherent feature in many cases. A study on conformational clustering of minimum energy conformations shows that the minimum energy conformations fall into 6 distinct conformational families. Preliminary comparison with available X-ray structures of cyclic pentapeptide indicates that only some of the minimum energy conformations have formed crystal structures. The set of minimum energy conformations worked out in the present study can form a consolidated database of prototypes for hydrogen bonded backbone and be useful for modelling cyclic pentapeptides both synthetic and bioactive in nature.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The main purpose of the Master Thesis was to find out what kind of attitudes the pupils in the 9th grade of Finnish comprehensive school have towards music as a school subject and compare it to the attitudes of the principals at a school level. The theoretical context of the research is based on the former studies of the significance of music education in the comprehensive school, the connection between learning and attitudes and the motivational factors towards the study motivation of music. In addition to this, I have analysed the role of the evaluation and the assessment from the point of view of developing the educational system and what is the role of management and leadership in relation to the pupils` behaviour and attitudes. The data of the research is the Finnish National Board of Education`s collected data of the assessment of the learning outcomes of arts education and it is nationally representative (N=5056 I phase and n=1570 II phase), both the Finnish-language and the Swedish-language pupil data. I have especially concentrated on the items of measuring the attitudes, the certain background variables and the questionnaire of the principals. The numerical data was analyzed using the multivariate statistical methods. The results of the research prove that in general the pupils and the principals think that music is quite significant as a school subject. The girls valued music on average more than the boys when comparing all the dimensions. The differences were systematic but the effect sizes were under 10 %. There were not statistically significant differences between the Finnish-language and the Swedish-language pupils. Comparing the grades of music in the 7th grade, the differences were growing linearly and the effect size was 15.7 %. There was a positive statistically significant correlation between the Significance of music and music as a hobby (Active interest in music, Informal interest in music, Taking part of music activities in the school) during free time. The strongest correlation were with the Active interest in music variable (r= 0.53, p= .000). Also the principals thought that music is important as a school subject considering the development of the pupil and the function of the school. The answers of the pupils were not clustering at a school level and there were no strong correlations between the attitudes of the pupils and the principals. A statistically nearly significant and a slight correlation (r= 0.21, p= .011) was found between the principals valuing the Significance of the music for school function and the pupils valuing the Benefits and hobbyism. The role of a well-motivated and active music teacher can be important from this point of view. The most important conclusion of the research was that the significance of music is a very personal individual level phenomenon. The results highlight also that in the pupils` opinion the most important thing about music lessons is to musical activity and learning as an experience.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The primary objective of the paper is to make use of statistical digital human model to better understand the nature of reach probability of points in the taskspace. The concept of task-dependent boundary manikin is introduced to geometrically characterize the extreme individuals in the given population who would accomplish the task. For a given point of interest and task, the map of the acceptable variation in anthropometric parameters is superimposed with the distribution of the same parameters in the given population to identify the extreme individuals. To illustrate the concept, the task space mapping is done for the reach probability of human arms. Unlike the boundary manikins, who are completely defined by the population, the dimensions of these manikins will vary with task, say, a point to be reached, as in the present case. Hence they are referred to here as the task-dependent boundary manikins. Simulations with these manikins would help designers to visualize how differently the extreme individuals would perform the task. Reach probability at the points in a 3D grid in the operational space is computed; for objects overlaid in this grid, approximate probabilities are derived from the grid for rendering them with colors indicating the reach probability. The method may also help in providing a rational basis for selection of personnel for a given task.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Parallel sub-word recognition (PSWR) is a new model that has been proposed for language identification (LID) which does not need elaborate phonetic labeling of the speech data in a foreign language. The new approach performs a front-end tokenization in terms of sub-word units which are designed by automatic segmentation, segment clustering and segment HMM modeling. We develop PSWR based LID in a framework similar to the parallel phone recognition (PPR) approach in the literature. This includes a front-end tokenizer and a back-end language model, for each language to be identified. Considering various combinations of the statistical evaluation scores, it is found that PSWR can perform as well as PPR, even with broad acoustic sub-word tokenization, thus making it an efficient alternative to the PPR system.