997 resultados para complete linkage clustering


Relevância:

50.00% 50.00%

Publicador:

Resumo:

Background: The development of sugarcane as a sustainable crop has unlimited applications. The crop is one of the most economically viable for renewable energy production, and CO2 balance. Linkage maps are valuable tools for understanding genetic and genomic organization, particularly in sugarcane due to its complex polyploid genome of multispecific origins. The overall objective of our study was to construct a novel sugarcane linkage map, compiling AFLP and EST-SSR markers, and to generate data on the distribution of markers anchored to sequences of scIvana_1, a complete sugarcane transposable element, and member of the Copia superfamily. Results: The mapping population parents ('IAC66-6' and 'TUC71-7') contributed equally to polymorphisms, independent of marker type, and generated markers that were distributed into nearly the same number of co-segregation groups (or CGs). Bi-parentally inherited alleles provided the integration of 19 CGs. The marker number per CG ranged from two to 39. The total map length was 4,843.19 cM, with a marker density of 8.87 cM. Markers were assembled into 92 CGs that ranged in length from 1.14 to 404.72 cM, with an estimated average length of 52.64 cM. The greatest distance between two adjacent markers was 48.25 cM. The scIvana_1-based markers (56) were positioned on 21 CGs, but were not regularly distributed. Interestingly, the distance between adjacent scIvana_1-based markers was less than 5 cM, and was observed on five CGs, suggesting a clustered organization. Conclusions: Results indicated the use of a NBS-profiling technique was efficient to develop retrotransposon-based markers in sugarcane. The simultaneous maximum-likelihood estimates of linkage and linkage phase based strategies confirmed the suitability of its approach to estimate linkage, and construct the linkage map. Interestingly, using our genetic data it was possible to calculate the number of retrotransposonscIvana_1 (similar to 60) copies in the sugarcane genome, confirming previously reported molecular results. In addition, this research possibly will have indirect implications in crop economics e. g., productivity enhancement via QTL studies, as the mapping population parents differ in response to an important fungal disease.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

Migraine is a painful disorder for which the etiology remains obscure. Diagnosis is largely based on International Headache Society criteria. However, no feature occurs in all patients who meet these criteria, and no single symptom is required for diagnosis. Consequently, this definition may not accurately reflect the phenotypic heterogeneity or genetic basis of the disorder. Such phenotypic uncertainty is typical for complex genetic disorders and has encouraged interest in multivariate statistical methods for classifying disease phenotypes. We applied three popular statistical phenotyping methods—latent class analysis, grade of membership and grade of membership “fuzzy” clustering (Fanny)—to migraine symptom data, and compared heritability and genome-wide linkage results obtained using each approach. Our results demonstrate that different methodologies produce different clustering structures and non-negligible differences in subsequent analyses. We therefore urge caution in the use of any single approach and suggest that multiple phenotyping methods be used.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

We present some additions to a fuzzy variable radius niche technique called Dynamic Niche Clustering (DNC) (Gan and Warwick, 1999; 2000; 2001) that enable the identification and creation of niches of arbitrary shape through a mechanism called Niche Linkage. We show that by using this mechanism it is possible to attain better feature extraction from the underlying population.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

OBJECTIVE(S): An individual's risk of developing cardiovascular disease (CVD) is influenced by genetic factors. This study focussed on mapping genetic loci for CVD-risk traits in a unique population isolate derived from Norfolk Island. METHODS: This investigation focussed on 377 individuals descended from the population founders. Principal component analysis was used to extract orthogonal components from 11 cardiovascular risk traits. Multipoint variance component methods were used to assess genome-wide linkage using SOLAR to the derived factors. A total of 285 of the 377 related individuals were informative for linkage analysis. RESULTS: A total of 4 principal components accounting for 83% of the total variance were derived. Principal component 1 was loaded with body size indicators; principal component 2 with body size, cholesterol and triglyceride levels; principal component 3 with the blood pressures; and principal component 4 with LDL-cholesterol and total cholesterol levels. Suggestive evidence of linkage for principal component 2 (h(2) = 0.35) was observed on chromosome 5q35 (LOD = 1.85; p = 0.0008). While peak regions on chromosome 10p11.2 (LOD = 1.27; p = 0.005) and 12q13 (LOD = 1.63; p = 0.003) were observed to segregate with principal components 1 (h(2) = 0.33) and 4 (h(2) = 0.42), respectively. CONCLUSION(S): This study investigated a number of CVD risk traits in a unique isolated population. Findings support the clustering of CVD risk traits and provide interesting evidence of a region on chromosome 5q35 segregating with weight, waist circumference, HDL-c and total triglyceride levels.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Norfolk Island is a human genetic isolate, possessing unique population characteristics that could be utilized for complex disease gene localization. Our intention was to evaluate the extent and strength of linkage disequilibrium (LD) in the Norfolk isolate by investigating markers within Xq13.3 and the NOS2A gene encoding the inducible nitric oxide synthase. A total of six microsatellite markers spanning approximately 11 Mb were assessed on chromosome Xq13.3 in a group of 56 men from Norfolk Island. Additionally, three single nucleotide polymorphisms (SNPs) localizing to the NOS2A gene were analyzed in a subset of the complex Norfolk pedigree. With the exception of two of the marker pairs, one of which is the most distantly spaced marker, all the Xq13.3 marker pairs were found to be in significant LD indicating that LD extends up to 9.5-11.5 Mb in the Norfolk Island population. Also, all SNPs studied showed significant LD in both Norfolk Islanders and Australian Caucasians, with two of the marker pairs in complete LD in the Norfolk population only. The Norfolk Island study population possesses a unique set of characteristics including founder effect, geographical isolation, exhaustive genealogical information and phenotypic data of use to cardiovascular disease risk traits. With LD extending up to 9.5-11 Mb, the Norfolk isolate should be a powerful resource for the localization of complex disease genes.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Essential hypertension is a highly hereditable disorder in which genetic influences predominate over environmental factors. The molecular genetic profiles which predispose to essential hypertension are not known. In rats with genetic hypertension, there is some recent evidence pointing to linkage of renin gene alleles with blood pressure. The genes for renin and antithrombin III belong to a conserved synteny group which, in humans, spans the q21.3-32.3 region of chromosome I and, in rats, is linkage group X on chromosome 13. The present study examined the association of particular human renin gene (REN) and antithrombin III gene (AT3) polymorphisms with essential hypertension by comparing the frequency of specific alleles for each of these genes in 50 hypertensive offspring of hypertensive parents and 91 normotensive offspring of normotensive parents. In addition, linkage relationships were examined in hypertensive pedigrees with multiple affected individuals. Alleles of a REN HindIII restriction fragment length polymorphism (RFLP) were detected using a genomic clone, λHR5, to probe Southern blots of HindIII-cut leucocyte DNA, and those for an AT3 Pstl RFLP were detected by phATIII 113 complementary DNA probe. The frequencies of each REN allele in the hypertensive group were 0.76 and 0.24 compared with 0.74 and 0.26 in the normotensive group. For AT3, hypertensive allele frequencies were 0.49 and 0.51 compared with normotensive values of 0.54 and 0.46. These differences were not significant by χ2 analysis (P > 0.2). Linkage analysis of a family (data from 16 family members, 10 of whom were hypertensive), informative for both markers, without an age-of-onset correction, and assuming dominant inheritance of hypertension, complete penetrance and a disease frequency of 20%, did not indicate linkage of REN with hypertension, but gave a positive, although not significant, logarithm of the odds for linkage score of 0.784 at a recombination fraction of 0 for AT3 linkage to hypertension. In conclusion, the present study could find no evidence for an association of a REN HindIII RFLP with essential hypertension or for a linkage of the locus defined by this RFLP in a family segregating for hypertension. In the case of an AT3 Pstl RFLP, although association analysis was negative, linkage analysis suggested possible involvement (odds of 6:1 in favour) of a gene located near the 1q23 locus with hypertension in one informative family.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Rubus yellow net virus (RYNV) was cloned and sequenced from a red raspberry (Rubus idaeus L.) plant exhibiting symptoms of mosaic and mottling in the leaves. Its genomic sequence indicates that it is a distinct member of the genus Badnavirus, with 7932. bp and seven ORFs, the first three corresponding in size and location to the ORFs found in the type member Commelina yellow mottle virus. Bioinformatic analysis of the genomic sequence detected several features including nucleic acid binding motifs, multiple zinc finger-like sequences and domains associated with cellular signaling. Subsequent sequencing of the small RNAs (sRNAs) from RYNV-infected R. idaeus leaf tissue was used to determine any RYNV sequences targeted by RNA silencing and identified abundant virus-derived small RNAs (vsRNAs). The majority of the vsRNAs were 22-nt in length. We observed a highly uneven genome-wide distribution of vsRNAs with strong clustering to small defined regions distributed over both strands of the RYNV genome. Together, our data show that sequences of the aphid-transmitted pararetrovirus RYNV are targeted in red raspberry by the interfering RNA pathway, a predominant antiviral defense mechanism in plants. © 2013.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

High-Order Co-Clustering (HOCC) methods have attracted high attention in recent years because of their ability to cluster multiple types of objects simultaneously using all available information. During the clustering process, HOCC methods exploit object co-occurrence information, i.e., inter-type relationships amongst different types of objects as well as object affinity information, i.e., intra-type relationships amongst the same types of objects. However, it is difficult to learn accurate intra-type relationships in the presence of noise and outliers. Existing HOCC methods consider the p nearest neighbours based on Euclidean distance for the intra-type relationships, which leads to incomplete and inaccurate intra-type relationships. In this paper, we propose a novel HOCC method that incorporates multiple subspace learning with a heterogeneous manifold ensemble to learn complete and accurate intra-type relationships. Multiple subspace learning reconstructs the similarity between any pair of objects that belong to the same subspace. The heterogeneous manifold ensemble is created based on two-types of intra-type relationships learnt using p-nearest-neighbour graph and multiple subspaces learning. Moreover, in order to make sure the robustness of clustering process, we introduce a sparse error matrix into matrix decomposition and develop a novel iterative algorithm. Empirical experiments show that the proposed method achieves improved results over the state-of-art HOCC methods for FScore and NMI.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Relative geometric arrangements of the sample points, with reference to the structure of the imbedding space, produce clusters. Hence, if each sample point is imagined to acquire a volume of a small M-cube (called pattern-cell), depending on the ranges of its (M) features and number (N) of samples; then overlapping pattern-cells would indicate naturally closer sample-points. A chain or blob of such overlapping cells would mean a cluster and separate clusters would not share a common pattern-cell between them. The conditions and an analytic method to find such an overlap are developed. A simple, intuitive, nonparametric clustering procedure, based on such overlapping pattern-cells is presented. It may be classified as an agglomerative, hierarchical, linkage-type clustering procedure. The algorithm is fast, requires low storage and can identify irregular clusters. Two extensions of the algorithm, to separate overlapping clusters and to estimate the nature of pattern distributions in the sample space, are also indicated.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Clustering techniques which can handle incomplete data have become increasingly important due to varied applications in marketing research, medical diagnosis and survey data analysis. Existing techniques cope up with missing values either by using data modification/imputation or by partial distance computation, often unreliable depending on the number of features available. In this paper, we propose a novel approach for clustering data with missing values, which performs the task by Symmetric Non-Negative Matrix Factorization (SNMF) of a complete pair-wise similarity matrix, computed from the given incomplete data. To accomplish this, we define a novel similarity measure based on Average Overlap similarity metric which can effectively handle missing values without modification of data. Further, the similarity measure is more reliable than partial distances and inherently possesses the properties required to perform SNMF. The experimental evaluation on real world datasets demonstrates that the proposed approach is efficient, scalable and shows significantly better performance compared to the existing techniques.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

As an endangered animal group, musk deer (genus Moschus) are not only a great concern of wildlife conservation, but also of special interest to evolutionary studies due to long-standing arguments on the taxonomic and phylogenetic associations in this group. Using museum samples, we sequenced complete mitochondrial cytochrome b genes (1140 bp) of all suggested species of musk deer in order to reconstruct their phylogenetic history through molecular information. Our results showed that the cytochrome b gene tree is rather robust and concurred for all the algorithms employed (parsimony, maximum likelihood, and distance methods). Further, the relative rate test indicated a constant sequence substitution rate among all the species, permitting the dating of divergence events by molecular clock. According to the molecular topology, M. moschiferus branched off the earliest from a common ancestor of musk deer (about 700,000 years ago); then followed the bifurcation forming the M. berezouskii lineage and the lineage clustering M. fuscus, M. chrysogaster, and M. leucogaster (around 370,000 years before present), interestingly the most recent speciation event in musk deer happened rather recently (140,000 years ago), which might have resulted from the diversified habitats and geographic barriers in southwest China caused by gigantic movements of the Qinghai-Tibetan Plateau in history. Combining the data of current distributions, fossil records, and molecular data of this study, we suggest that the historical dispersion of musk deer might be from north to south in China. Additionally, in our further analyses involving other pecora species, musk deer was strongly supported as a monophyletic group and a valid family in Artiodactyla, closely related to Cervidae. (C) 1999 Academic Press.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Complete achromatopsia is a rare autosomal recessive disease associated with CNGA3, CNGB3, GNAT2 and PDE6C mutations. This retinal disorder is characterized by complete loss of color discrimination due to the absence or alteration of the cones function. The purpose of the present study was the clinical and the genetic characterization of achromatopsia in a large consanguineous Tunisian family. Ophthalmic evaluation included a full clinical examination, color vision testing and electroretinography. Linkage analysis using microsatellite markers flanking CNGA3, CNGB3, GNAT2 and PDE6C genes was performed. Mutations were screened by direct sequencing. A total of 12 individuals were diagnosed with congenital complete achromatopsia. They are members of six nuclear consanguineous families belonging to the same large consanguineous family. Linkage analysis revealed linkage to GNAT2. Mutational screening of GNAT2 revealed three intronic variations c.119-69G>C, c.161+66A>T and c.875-31G>C that co-segregated with a novel mutation p.R313X. An identical GNAT2 haplotype segregating with this mutation was identified, indicating a founder mutation. All patients were homozygous for the p.R313X mutation. This is the first report of the clinical and genetic investigation of complete achromatopsia in North Africa and the largest family with recessive achromatopsia involving GNAT2; thus, providing a unique opportunity for genotype-phenotype correlation for this extremely rare condition.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

El trastorno de hiperactividad y déficit de atención (THDA), es definido clínicamente como una alteración en el comportamiento, caracterizada por inatención, hiperactividad e impulsividad. Estos aspectos son clasificados en tres subtipos, que son: Inatento, hiperactivo impulsivo y mixto. Clínicamente se describe un espectro amplio que incluye desordenes académicos, trastornos de aprendizaje, déficit cognitivo, trastornos de conducta, personalidad antisocial, pobres relaciones interpersonales y aumento de la ansiedad, que pueden continuar hasta la adultez. A nivel global se ha estimado una prevalencia entre el 1% y el 22%, con amplias variaciones, dadas por la edad, procedencia y características sociales. En Colombia, se han realizado estudios en Bogotá y Antioquia, que han permitido establecer una prevalencia del 5% y 15%, respectivamente. La causa específica no ha sido totalmente esclarecida, sin embargo se ha calculado una heredabilidad cercana al 80% en algunas poblaciones, demostrando el papel fundamental de la genética en la etiología de la enfermedad. Los factores genéticos involucrados se relacionan con cambios neuroquímicos de los sistemas dopaminérgicos, serotoninérgicos y noradrenérgicos, particularmente en los sistemas frontales subcorticales, corteza cerebral prefrontal, en las regiones ventral, medial, dorsolateral y la porción anterior del cíngulo. Basados en los datos de estudios previos que sugieren una herencia poligénica multifactorial, se han realizado esfuerzos continuos en la búsqueda de genes candidatos, a través de diferentes estrategias. Particularmente los receptores Alfa 2 adrenérgicos, se encuentran en la corteza cerebral, cumpliendo funciones de asociación, memoria y es el sitio de acción de fármacos utilizados comúnmente en el tratamiento de este trastorno, siendo esta la principal evidencia de la asociación de este receptor con el desarrollo del THDA. Hasta la fecha se han descrito más de 80 polimorfismos en el gen (ADRA2A), algunos de los cuales se han asociado con la entidad. Sin embargo, los resultados son controversiales y varían según la metodología diagnóstica empleada y la población estudiada, antecedentes y comorbilidades. Este trabajo pretende establecer si las variaciones en la secuencia codificante del gen ADRA2A, podrían relacionarse con el fenotipo del Trastorno de Hiperactividad y el Déficit de Atención.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

In this work a new method for clustering and building a topographic representation of a bacteria taxonomy is presented. The method is based on the analysis of stable parts of the genome, the so-called “housekeeping genes”. The proposed method generates topographic maps of the bacteria taxonomy, where relations among different type strains can be visually inspected and verified. Two well known DNA alignement algorithms are applied to the genomic sequences. Topographic maps are optimized to represent the similarity among the sequences according to their evolutionary distances. The experimental analysis is carried out on 147 type strains of the Gammaprotebacteria class by means of the 16S rRNA housekeeping gene. Complete sequences of the gene have been retrieved from the NCBI public database. In the experimental tests the maps show clusters of homologous type strains and present some singular cases potentially due to incorrect classification or erroneous annotations in the database.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Diploid Fragaria provide a potential model for genomic studies in the Rosaceae. To develop a genetic linkage map of diploid Fragaria, we scored 78 markers (68 microsatellites, one sequence-characterised amplified region, six gene-specific markers and three morphological traits) in an interspecific F2 population of 94 plants generated from a cross of F.vesca f. semperflorens × F. nubicola. Co-segregation analysis arranged 76 markers into seven discrete linkage groups covering 448 cM, with linkage group sizes ranging from 100.3 cM to 22.9 cM. Marker coverage was generally good; however some clustering of markers was observed on six of the seven linkage groups. Segregation distortion was observed at a high proportion of loci (54%), which could reflect the interspecific nature of the progeny and, in some cases, the self-incompatibility of F. nubicola. Such distortion may also account for some of the marker clustering observed in the map. One of the morphological markers, pale-green leaf (pg) has not previously been mapped in Fragaria and was located to the mid-point of linkage group VI. The transferable nature of the markers used in this study means that the map will be ideal for use as a framework for additional marker incorporation aimed at enhancing and resolving map coverage of the diploid Fragaria genome. The map also provides a sound basis for linkage map transfer to the cultivated octoploid strawberry.