36 results for CLUSTER ANALYSIS
Abstract:
The article examines the structure of the collaboration networks of research groups in which Slovenian and Spanish PhD students are pursuing their doctorates. The units of analysis are student-supervisor dyads. We use duocentred networks, a novel network structure appropriate for networks which are centred around a dyad. A cluster analysis reveals three typical clusters of research groups. Those which are large and belong to several institutions are labelled bridging social capital groups. Those which are small and centred in a single institution but have high cohesion are labelled bonding social capital groups. Those which are small and have low cohesion are called weak social capital groups. The academic performance of both PhD students and supervisors is highest in bridging groups and lowest in weak groups. Other variables are also found to differ according to the type of research group. Finally, some recommendations regarding academic and research policy are drawn.
Abstract:
This article aims to identify the key groups of regions with respect to farms oriented to fruit and citrus production. Twenty variables of fruit and citrus oriented farms corresponding to forty-one regions of the European Union were analyzed. Seven groups emerged from cluster analysis. Only two of them showed good prospects. Regions in the South of the Community need an important modernisation and restructuring process, which entails serious social consequences.
Abstract:
In applied regional analysis, statistical information is usually published at different territorial levels with the aim of providing information of interest to different potential users. When using this information, there are two different choices: first, to use normative regions (towns, provinces, etc.) or, second, to design analytical regions directly related to the analysed phenomena. In this paper, provincial time series of unemployment rates in Spain are used to compare the results obtained by applying two analytical regionalisation models (a two-stage procedure based on cluster analysis and a procedure based on mathematical programming) with the normative regions available at two different scales: NUTS II and NUTS I. The results show that more homogeneous regions were designed when applying both analytical regionalisation tools. Two other interesting results are that the analytical regions were also more stable over time, and that scale has an effect on the regionalisation process.
Abstract:
This paper aims at reconsidering some analytical measures to best encapsulate the interlanguage, in writing, of young beginner learners of English as a foreign language in the light of previous and work-in-progress research conducted within the BAF project, and in particular, whether clause and sentence length should be best viewed as a fluency or syntactic complexity measure or as part of a different construct. In the light of a factor analysis (Navés, forthcoming) and multivariate and correlation studies (Navés et al. 2003, Navés, 2006, Torres et al. 2006) it becomes clear that the relationship between different analytical measures is also dependent on the learner's cognitive maturity (age) and proficiency (amount of instruction). Finally, clause and sentence length should not be viewed as either a fluency or syntactic complexity measure but as part of a different construct. It is concluded that further research using regression analysis and cluster analysis is needed in order to identify and validate the constructs of the writing components and their measurements.
Abstract:
Drought leads to a loss of longitudinal and lateral hydrologic connectivity, which causes direct or indirect changes in stream ecosystem properties. Changes in macrohabitat availability from a riffle-pool sequence to isolated pools are among the most conspicuous consequences of connectivity loss. Macroinvertebrate assemblages were compared among 3 distinct stream macrohabitats (riffles [R], pools connected to riffles [Pc], disconnected pools [Pd]) of 19 Mediterranean-climate sites in northern California to examine the influence of loss of habitat resulting from drought disturbance. At the time of sampling, 10 sites were perennial and included R and Pc macrohabitats, whereas 9 sites were intermittent and included only Pd macrohabitats. Taxa richness was more variable in Pd, and taxa richness was significantly lower in Pd than in Pc but not R. These results suggested a decline in richness between Pc and Pd that might be associated with loss of connectivity. Lower Ephemeroptera, Plecoptera, and Trichoptera (EPT) richness relative to Odonata, Coleoptera, and Heteroptera (OCH) richness was observed for Pd than for R and Pc macrohabitats. Family composition was more similar between R and Pc than between R or Pc and Pd macrohabitats. This similarity may be associated with greater connectivity between R and Pc macrohabitats. Correspondence analysis indicated that macroinvertebrate composition changed along a gradient from R to Pc and Pd that was related to a perennial-intermittent gradient across sites. High variability among macroinvertebrate assemblages in Pd could have been related to variability in the duration of intermittency. In cluster analysis, macroinvertebrate assemblages were grouped by macrohabitat first and then by site, suggesting that the macrohabitat filter had a greater influence on macroinvertebrate assemblages than did local site characteristics. Few taxa were found exclusively in Pc, and this macrohabitat shared numerous taxa with R and Pd, indicating that Pc may act as a bridge between R and Pd during drought. Drought is regarded as a ramp disturbance, but our results suggest that the response of macroinvertebrate assemblages to the loss of hydrological connectivity among macrohabitats is gradual, at least in Mediterranean-climate streams where drying is gradual. However, the changes may be more dramatic in arid and semiarid streams or in Mediterranean-climate streams if drying is rapid.
Abstract:
The aim of this research was to investigate the effects of high pressure processing (HPP) on consumer acceptance of chilled ready meals manufactured using a low-value beef cut. Three hundred consumers evaluated chilled ready meals subjected to 4 pressure treatments and a non-treated control monadically on a 9-point scale for liking of beef tenderness and juiciness, overall flavour, overall liking, and purchase intent. Data were also collected on consumers' food consumption patterns, their attitudes towards food by means of the reduced food-related lifestyle (FRL) instrument, and socio-demographics. The results indicated that a pressure treatment of 200 MPa was acceptable to most consumers. K-means cluster analysis identified 4 consumer groups with similar preferences, and the optimal pressure treatments acceptable to specific consumer groups were identified for those firms that wish to target attitudinally differentiated consumer segments.
Abstract:
Background: Differences in the distribution of genotypes between individuals of the same ethnicity are an important confounding factor commonly undervalued in typical association studies conducted in radiogenomics. Objective: To evaluate the genotypic distribution of SNPs in a wide set of Spanish prostate cancer patients in order to determine the homogeneity of the population and to disclose potential bias. Design, Setting, and Participants: A total of 601 prostate cancer patients from Andalusia, the Basque Country, the Canary Islands and Catalonia were genotyped for 10 SNPs located in 6 different genes associated with DNA repair: XRCC1 (rs25487, rs25489, rs1799782), ERCC2 (rs13181), ERCC1 (rs11615), LIG4 (rs1805388, rs1805386), ATM (rs17503908, rs1800057) and P53 (rs1042522). The SNP genotyping was performed in a Biotrove OpenArray NT Cycler. Outcome Measurements and Statistical Analysis: Comparisons of genotypic and allelic frequencies among populations, as well as haplotype analyses, were carried out using the web-based environment SNPator. Principal component analysis was performed using the SnpMatrix and XSnpMatrix classes and methods implemented as an R package. Unsupervised hierarchical clustering of SNPs was performed using MultiExperiment Viewer. Results and Limitations: We observed that the genotype distribution of 4 out of 10 SNPs was statistically different among the studied populations, with the greatest differences between Andalusia and Catalonia. These observations were confirmed in cluster analysis, principal component analysis and in the differential distribution of haplotypes among the populations. Because tumor characteristics have not been taken into account, it is possible that some polymorphisms may influence tumor characteristics in the same way that they may pose a risk factor for other disease characteristics. Conclusion: Differences in the distribution of genotypes within different populations of the same ethnicity could be an important confounding factor responsible for the lack of validation of SNPs associated with radiation-induced toxicity, especially when extensive meta-analyses with subjects from different countries are carried out.
Abstract:
Background: Chronic obstructive pulmonary disease (COPD) is increasingly considered a heterogeneous condition. It was hypothesised that COPD, as currently defined, includes different clinically relevant subtypes. Methods: To identify and validate COPD subtypes, 342 subjects hospitalised for the first time because of a COPD exacerbation were recruited. Three months after discharge, when clinically stable, symptoms and quality of life, lung function, exercise capacity, nutritional status, biomarkers of systemic and bronchial inflammation, sputum microbiology, CT of the thorax and echocardiography were assessed. COPD groups were identified by partitioning cluster analysis and validated prospectively against cause-specific hospitalisations and all-cause mortality during a 4-year follow-up. Results: Three COPD groups were identified: group 1 (n = 126, 67 years) was characterised by severe airflow limitation (postbronchodilator forced expiratory volume in 1 s (FEV1) 38% predicted) and worse performance in most of the respiratory domains of the disease; group 2 (n = 125, 69 years) showed milder airflow limitation (FEV1 63% predicted); and group 3 (n = 91, 67 years) combined a similarly milder airflow limitation (FEV1 58% predicted) with a high proportion of obesity, cardiovascular disorders, diabetes and systemic inflammation. During follow-up, group 1 had more frequent hospitalisations due to COPD (HR 3.28, p < 0.001) and higher all-cause mortality (HR 2.36, p = 0.018) than the other two groups, whereas group 3 had more admissions due to cardiovascular disease (HR 2.87, p = 0.014). Conclusions: In patients with COPD recruited at their first hospitalisation, three different COPD subtypes were identified and prospectively validated: "severe respiratory COPD", "moderate respiratory COPD", and "systemic COPD".
Abstract:
This paper explores the distinctive characteristics of mobile telephone use among the elderly population using the most recent European country-level data on individual use of mobile telephony and advanced mobile services, Eurostat 2008. A cluster analysis of mobile phone use data across 30 countries confirms that the mobile telephone occupies a peripheral position for the elderly in Europe.
Abstract:
To determine the farm management factors related to postpartum ovarian reactivation in suckler cows, a global analysis was carried out of a series of production indicators and the duration of postpartum anoestrus (PPA) in 549 cows managed under extensive conditions. Because of the multifactorial nature of the process under study, multivariate statistical methods were chosen (Multiple Correspondence Factor Analysis and Cluster Analysis). The duration of PPA was associated with four factors that explained 59% of the initial heterogeneity of the sample and were defined as: "Prepartum feeding" (19% of the inertia), "Postpartum feeding-Age" (16.4%), "Calf management" (13%) and "Calving difficulty" (10.5%). These factors were entered into a Cluster Analysis that identified five groups of cows with different productive and reproductive characteristics, which we named: "Primiparous", "Restricted access", "Free access-Parda de Montaña", "Autumn calving" and "Spring calving". Breed was not related to the duration of PPA, although the Cluster Analysis associated the long PPA induced by free suckling with the Parda de Montaña breed. In the Parda de Montaña breed, PPA lasted longer in spring than in autumn because of nutritional differences rather than a seasonal effect in itself. Autumn calving was better adapted to dry mountain conditions.
Abstract:
Zonal management in vineyards requires the prior delineation of stable yield zones within the parcel. Among the different methodologies used for zone delineation, cluster analysis of yield data from several years is one of the possibilities cited in the scientific literature. However, there are reasonable doubts concerning the clustering algorithm to be used and the number of zones that have to be delineated within a field. In this paper two different clustering algorithms (k-means and fuzzy c-means) have been compared using the grape yield data from three successive years (2002, 2003 and 2004) for a ‘Pinot Noir’ vineyard parcel. The final choice of the most recommendable algorithm has been linked to obtaining a stable pattern of spatial yield distribution and to allowing for the delineation of compact and average-sized areas. The general recommendation is to use reclassified maps of two clusters or yield classes (low yield zone and high yield zone) and, consequently, site-specific vineyard management should be based on the prior delineation of just two different zones or sub-parcels. The two tested algorithms are good options for this purpose. However, the fuzzy c-means algorithm allows for a better zoning of the parcel, forming more compact areas with more balanced zonal differences over time.