971 resultados para Genetic clustering analysis
Resumo:
In this paper, a continuation of a variable radius niche technique called Dynamic Niche Clustering developed by (Gan & Warwick, 1999) is presented. The technique employs a separate dynamic population of overlapping niches that coexists alongside the normal population. An empirical analysis of the updated methodology on a large group of standard optimisation test-bed functions is also given. The technique is shown to perform almost as well as standard fitness sharing with regards to stability and the accuracy of peak identification, but it outperforms standard fitness sharing with regards to time complexity. It is also shown that the technique is capable of forming niches of varying size depending on the characteristics of the underlying peak that the niche is populating.
Resumo:
In this study, differences at the genetic level of 37 Salmonella Enteritidis strains from five phage types (PTs) were compared using comparative genomic hybridization (CGH) to assess differences between PTs. There were approximately 400 genes that differentiated prevalent (4, 6, 8 and 13a) and sporadic (11) PTs, of which 35 were unique to prevalent PTs, including six plasmid-borne genes, pefA, B, C, D, srgC and rck, and four chromosomal genes encoding putative amino acid transporters. Phenotype array studies also demonstrated that strains from prevalent PTs were less susceptible to urea stress and utilized L-histidine, L-glutamine, L-proline, L-aspartic acid, gly-asn and gly-gln more efficiently than PT11 strains. Complementation of a PT11 strain with the transporter genes from PT4 resulted in a significant increase in utilization of the amino acids and reduced susceptibility to urea stress. In epithelial cell association assays, PT11 strains were less invasive than other prevalent PTs. Most strains from prevalent PTs were better biofilm formers at 37 degrees C than at 28 degrees C, whilst the converse was true for PT11 strains. Collectively, the results indicate that genetic and corresponding phenotypic differences exist between strains of the prevalent PTs 4, 6, 8 and 13a and non-prevalent PT11 strains that are likely to provide a selective advantage for strains from the former PTs and could help them to enter the food chain and cause salmonellosis.
Resumo:
Potassium (K) fertilizers are used in intensive and extensive agricultural systems to maximize production. However, there are both financial and environmental costs to K-fertilization. It is therefore important to optimize the efficiency with which K-fertilizers are used. Cultivating crops that acquire and/or utilize K more effectively can reduce the use of K-fertilizers. The aim of the present study was to determine the genetic factors affecting K utilization efficiency (KUtE), defined as the reciprocal of shoot K concentration (1/K(shoot)), and K acquisition efficiency (KUpE), defined as shoot K content, in Brassica oleracea. Genetic variation in K(shoot) was estimated using a structured diversity foundation set (DFS) of 376 accessions and in 74 commercial genotypes grown in glasshouse and field experiments that included phosphorus (P) supply as a treatment factor. Chromosomal quantitative trait loci (QTL) associated with K(shoot) and KUpE were identified using a genetic mapping population grown in the glasshouse and field. Putative QTL were tested using recurrent backcross substitution lines in the glasshouse. More than two-fold variation in K(shoot) was observed among DFS accessions grown in the glasshouse, a significant proportion of which could be attributed to genetic factors. Several QTL associated with K(shoot) were identified, which, despite a significant correlation in K(shoot) among genotypes grown in the glasshouse and field, differed between these two environments. A QTL associated with K(shoot) in glasshouse-grown plants (chromosome C7 at 62 center dot 2 cM) was confirmed using substitution lines. This QTL corresponds to a segment of arabidopsis chromosome 4 containing genes encoding the K(+) transporters AtKUP9, AtAKT2, AtKAT2 and AtTPK3. There is sufficient genetic variation in B. oleracea to breed for both KUtE and KUpE. However, as QTL associated with these traits differ between glasshouse and field environments, marker-assisted breeding programmes must consider carefully the conditions under which the crop will be grown.
Resumo:
The genetics of the stipule spot pigmentation (SSP) in faba bean (Vicia faba L.) was studied using four inbred lines, of which Disco/2 was zero-tannin (zt2) with colourless stipule spots, ILB938/2 was normal-tannin (ZT2) with colourless stipule spots, and both Aurora/2 and Mélodie/2 were ZT2 with coloured stipule spots. Crosses Mélodie/2 × ILB 938/2, Mélodie/2 × Disco/2, ILB 938/2 × Aurora/2 and ILB 938/2 × Disco/2 (A, B, C and D, respectively) were prepared, along with reciprocals and backcrosses, and advanced through single-seed descent. All F1 hybrid plants had pigmented stipule spots, and in the F2 generation, the segregation ratio fit 3 coloured:1 colourless in crosses A, B and C and 9:7 in cross D. In the F3 generation, the ratio fit 5:3 in crosses A and C and 25:39 in cross D, and in the F4 generation, 9:7 in cross A. SSP was linked to the zero-tannin characteristics (white flower) only in cross B. The results show that coloured stipule spot is dominant to colourless and that colouration is determined by two unlinked complementary recessive genes. We propose the symbols ssp2 for the gene associated with zt2 in Disco/2 and ssp1 for the gene not associated with tannin content in ILB938/2. The novel ssp1 locus was mapped at F5 in cross ‘A’ using Medicago truncatula-derived single-nucleotide polymorphism and was on chromosome 1 of faba bean, in a well-conserved region of M. truncatula chromosome 5 containing some candidate Myb and basic helix–loop–helix transcription factor genes.
Resumo:
Background: The validity of ensemble averaging on event-related potential (ERP) data has been questioned, due to its assumption that the ERP is identical across trials. Thus, there is a need for preliminary testing for cluster structure in the data. New method: We propose a complete pipeline for the cluster analysis of ERP data. To increase the signalto-noise (SNR) ratio of the raw single-trials, we used a denoising method based on Empirical Mode Decomposition (EMD). Next, we used a bootstrap-based method to determine the number of clusters, through a measure called the Stability Index (SI). We then used a clustering algorithm based on a Genetic Algorithm (GA)to define initial cluster centroids for subsequent k-means clustering. Finally, we visualised the clustering results through a scheme based on Principal Component Analysis (PCA). Results: After validating the pipeline on simulated data, we tested it on data from two experiments – a P300 speller paradigm on a single subject and a language processing study on 25 subjects. Results revealed evidence for the existence of 6 clusters in one experimental condition from the language processing study. Further, a two-way chi-square test revealed an influence of subject on cluster membership.
Resumo:
Clustering methods are increasingly being applied to residential smart meter data, providing a number of important opportunities for distribution network operators (DNOs) to manage and plan the low voltage networks. Clustering has a number of potential advantages for DNOs including, identifying suitable candidates for demand response and improving energy profile modelling. However, due to the high stochasticity and irregularity of household level demand, detailed analytics are required to define appropriate attributes to cluster. In this paper we present in-depth analysis of customer smart meter data to better understand peak demand and major sources of variability in their behaviour. We find four key time periods in which the data should be analysed and use this to form relevant attributes for our clustering. We present a finite mixture model based clustering where we discover 10 distinct behaviour groups describing customers based on their demand and their variability. Finally, using an existing bootstrapping technique we show that the clustering is reliable. To the authors knowledge this is the first time in the power systems literature that the sample robustness of the clustering has been tested.
Resumo:
Background Autism spectrum conditions (ASC) are a group of neurodevelopmental conditions characterized by difficulties in social interaction and communication alongside repetitive and stereotyped behaviours. ASC are heritable, and common genetic variants contribute substantial phenotypic variability. More than 600 genes have been implicated in ASC to date. However, a comprehensive investigation of candidate gene association studies in ASC is lacking. Methods In this study, we systematically reviewed the literature for association studies for 552 genes associated with ASC. We identified 58 common genetic variants in 27 genes that have been investigated in three or more independent cohorts and conducted a meta-analysis for 55 of these variants. We investigated publication bias and sensitivity and performed stratified analyses for a subset of these variants. Results We identified 15 variants nominally significant for the mean effect size, 8 of which had P values below a threshold of significance of 0.01. Of these 15 variants, 11 were re-investigated for effect sizes and significance in the larger Psychiatric Genomics Consortium dataset, and none of them were significant. Effect direction for 8 of the 11 variants were concordant between both the datasets, although the correlation between the effect sizes from the two datasets was poor and non-significant. Conclusions This is the first study to comprehensively examine common variants in candidate genes for ASC through meta-analysis. While for majority of the variants, the total sample size was above 500 cases and 500 controls, the total sample size was not large enough to accurately identify common variants that contribute to the aetiology of ASC.
Resumo:
In this study, we describe the first survey in Thailand of Trypanosoma theileri, a widespread and prevalent parasite of cattle that is transmitted by tabanid flies. Investigation of 210 bovine blood samples of Thai cattle from six farms by hematocrit centrifuge technique (HCT) revealed 14 samples with trypanosomes morphologically compatible to T. theileri. Additional animals were positive for T. theileri by PCR based on the Cathepsin L-like sequence (TthCATL-PCR) despite negative by HCT, indicating cryptic infections. Results revealed a prevalence of 26 +/- 15% (95% CI) of T. theileri infection. Additionally, 12 samples positive for T. theileri were detected in cattle from other 11 farms. From a total of 30 blood samples positive by HCT and/or PCR from 17 farms, seven were characterized to evaluate the genetic polymorphism of T. theileri through sequence analysis of PCR-amplified CATL DNA sequences. All CATL sequences of T. theileri from Thai cattle clustered with sequences of the previously described phylogenetic lineages TthI and TthII, supporting only two major lineages of T. theileri in cattle around the world. However, 11 of the 29 CATL sequences analyzed showed to be different, disclosing an unexpectedly large polymorphic genetic repertoire, with multiple genotypes of T. theileri not previously described in other countries circulating in Thai cattle. (C) 2011 Elsevier B.V. All rights reserved.
Resumo:
Plasmodium falciparum, the causative agent of human malaria, invades host erythrocytes using several proteins on the surface of the invasive merozoite, which have been proposed as potential vaccine candidates. Members of the multi-gene PfRh family are surface antigens that have been shown to play a central role in directing merozoites to alternative erythrocyte receptors for invasion. Recently, we identified a large structural polymorphism, a 0.58 Kb deletion, in the C-terminal region of the PfRh2b gene, present at a high frequency in parasite populations from Senegal. We hypothesize that this region is a target of humoral immunity. Here, by analyzing 371 P. falciparum isolates we show that this major allele is present at varying frequencies in different populations within Senegal, Africa, and throughout the world. For allelic dimorphisms in the asexual stage antigens, Msp-2 and EBA-175, we find minimal geographic differentiation among parasite populations from Senegal and other African localities, suggesting extensive gene flow among these populations and/or immune-mediated frequency-dependent balancing selection. In contrast, we observe a higher level of inter-population divergence (as measured by F(st)) for the PfRh2b deletion, similar to that observed for SNPs from the sexual stage Pfs45/48 loci, which is postulated to be under directional selection. We confirm that the region containing the PfRh2b polymorphism is a target of humoral immune responses by demonstrating antibody reactivity of endemic sera. Our analysis of inter-population divergence suggests that in contrast to the large allelic dimorphisms in EBA-175 and Msp-2, the presence or absence of the large PfRh2b deletion may not elicit frequency-dependent immune selection, but may be under positive immune selection, having important implications for the development of these proteins as vaccine candidates. (C) 2009 Elsevier B.V. All rights reserved.
Resumo:
Genetic diversity and population structure of Plasmodium viva-V parasites call predict the origin and Spread of novel Variants Within a population enabling Population specific malaria control measures. We analyzed the genetic diversity and population Structure of 425 P. vivax isolates from Sri Lanka, Myanmar, and Ethiopia using 12 trinucleotide and tetranucleotide microsatellite markers. All three parasite populations were highly polymorphic with 3-44 alleles per locus. Approximately 65% were multiple-clone infections. Mean genetic diversity (H(E)) was 0.7517 in Ethiopia, 0.8450 in Myanmar, and 0.8610 in Sri Lanka. Significant linkage disequilibrium Was maintained. Population structure showed two clusters (Asian and African) according to geography and ancestry Strong clustering of outbreak isolates from Sri Lanka and Ethiopia was observed. Predictive power of ancestry using two-thirds of the isolates as a model identified 78.2% of isolates accurately as being African or Asian. Microsatellite analysis is a useful tool for mapping short-term outbreaks of malaria and for predicting ancestry.
Resumo:
Clustering is a difficult task: there is no single cluster definition and the data can have more than one underlying structure. Pareto-based multi-objective genetic algorithms (e.g., MOCK Multi-Objective Clustering with automatic K-determination and MOCLE-Multi-Objective Clustering Ensemble) were proposed to tackle these problems. However, the output of such algorithms can often contains a high number of partitions, becoming difficult for an expert to manually analyze all of them. In order to deal with this problem, we present two selection strategies, which are based on the corrected Rand, to choose a subset of solutions. To test them, they are applied to the set of solutions produced by MOCK and MOCLE in the context of several datasets. The study was also extended to select a reduced set of partitions from the initial population of MOCLE. These analysis show that both versions of selection strategy proposed are very effective. They can significantly reduce the number of solutions and, at the same time, keep the quality and the diversity of the partitions in the original set of solutions. (C) 2010 Elsevier B.V. All rights reserved.
Resumo:
J.A. Ferreira Neto, E.C. Santos Junior, U. Fra Paleo, D. Miranda Barros, and M.C.O. Moreira. 2011. Optimal subdivision of land in agrarian reform projects: an analysis using genetic algorithms. Cien. Inv. Agr. 38(2): 169-178. The objective of this manuscript is to develop a new procedure to achieve optimal land subdivision using genetic algorithms (GA). The genetic algorithm was tested in the rural settlement of Veredas, located in Minas Gerais, Brazil. This implementation was based on the land aptitude and its productivity index. The sequence of tests in the study was carried out in two areas with eight different agricultural aptitude classes, including one area of 391.88 ha subdivided into 12 lots and another of 404.1763 ha subdivided into 14 lots. The effectiveness of the method was measured using the shunting line standard value of a parceled area lot`s productivity index. To evaluate each parameter, a sequence of 15 calculations was performed to record the best individual fitness average (MMI) found for each parameter variation. The best parameter combination found in testing and used to generate the new parceling with the GA was the following: 320 as the generation number, a population of 40 individuals, 0.8 mutation tax, and a 0.3 renewal tax. The solution generated rather homogeneous lots in terms of productive capacity.
Resumo:
Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES)
Resumo:
Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq)