765 resultados para Grouping, clustering, campi, associazione
Resumo:
BACKGROUND The insertion element IS630 found in Aeromonas salmonicida belongs to the IS630-Tc1-mariner superfamily of transposons. It is present in multiple copies and represents approximately half of the IS present in the genome of A. salmonicida subsp. salmonicida A449. RESULTS By using High Copy Number IS630 Restriction Fragment Length Polymorphism (HCN-IS630-RFLP), strains of various subspecies of Aeromonas salmonicida showed conserved or clustering patterns, thus allowing their differentiation from each other. Fingerprints of A. salmonicida subsp. salmonicida showed the highest homogeneity while 'atypical' A. salmonicida strains were more heterogeneous. IS630 typing also differentiated A. salmonicida from other Aeromonas species. The copy number of IS630 in Aeromonas salmonicida ranges from 8 to 35 and is much lower in other Aeromonas species. CONCLUSIONS HCN-IS630-RFLP is a powerful tool for subtyping of A. salmonicida. The high stability of IS630 insertions in A. salmonicida subsp. salmonicida indicates that it might have played a role in pathoadaptation of A. salmonicida which has reached an optimal configuration in the highly virulent and specific fish pathogen A. salmonicida subsp. salmonicida.
Resumo:
We consider the problem of fitting a union of subspaces to a collection of data points drawn from one or more subspaces and corrupted by noise and/or gross errors. We pose this problem as a non-convex optimization problem, where the goal is to decompose the corrupted data matrix as the sum of a clean and self-expressive dictionary plus a matrix of noise and/or gross errors. By self-expressive we mean a dictionary whose atoms can be expressed as linear combinations of themselves with low-rank coefficients. In the case of noisy data, our key contribution is to show that this non-convex matrix decomposition problem can be solved in closed form from the SVD of the noisy data matrix. The solution involves a novel polynomial thresholding operator on the singular values of the data matrix, which requires minimal shrinkage. For one subspace, a particular case of our framework leads to classical PCA, which requires no shrinkage. For multiple subspaces, the low-rank coefficients obtained by our framework can be used to construct a data affinity matrix from which the clustering of the data according to the subspaces can be obtained by spectral clustering. In the case of data corrupted by gross errors, we solve the problem using an alternating minimization approach, which combines our polynomial thresholding operator with the more traditional shrinkage-thresholding operator. Experiments on motion segmentation and face clustering show that our framework performs on par with state-of-the-art techniques at a reduced computational cost.
Resumo:
An integrated approach for multi-spectral segmentation of MR images is presented. This method is based on the fuzzy c-means (FCM) and includes bias field correction and contextual constraints over spatial intensity distribution and accounts for the non-spherical cluster's shape in the feature space. The bias field is modeled as a linear combination of smooth polynomial basis functions for fast computation in the clustering iterations. Regularization terms for the neighborhood continuity of intensity are added into the FCM cost functions. To reduce the computational complexity, the contextual regularizations are separated from the clustering iterations. Since the feature space is not isotropic, distance measure adopted in Gustafson-Kessel (G-K) algorithm is used instead of the Euclidean distance, to account for the non-spherical shape of the clusters in the feature space. These algorithms are quantitatively evaluated on MR brain images using the similarity measures.
Resumo:
In numerous intervention studies and education field trials, random assignment to treatment occurs in clusters rather than at the level of observation. This departure of random assignment of units may be due to logistics, political feasibility, or ecological validity. Data within the same cluster or grouping are often correlated. Application of traditional regression techniques, which assume independence between observations, to clustered data produce consistent parameter estimates. However such estimators are often inefficient as compared to methods which incorporate the clustered nature of the data into the estimation procedure (Neuhaus 1993).1 Multilevel models, also known as random effects or random components models, can be used to account for the clustering of data by estimating higher level, or group, as well as lower level, or individual variation. Designing a study, in which the unit of observation is nested within higher level groupings, requires the determination of sample sizes at each level. This study investigates the design and analysis of various sampling strategies for a 3-level repeated measures design on the parameter estimates when the outcome variable of interest follows a Poisson distribution. ^ Results study suggest that second order PQL estimation produces the least biased estimates in the 3-level multilevel Poisson model followed by first order PQL and then second and first order MQL. The MQL estimates of both fixed and random parameters are generally satisfactory when the level 2 and level 3 variation is less than 0.10. However, as the higher level error variance increases, the MQL estimates become increasingly biased. If convergence of the estimation algorithm is not obtained by PQL procedure and higher level error variance is large, the estimates may be significantly biased. In this case bias correction techniques such as bootstrapping should be considered as an alternative procedure. For larger sample sizes, those structures with 20 or more units sampled at levels with normally distributed random errors produced more stable estimates with less sampling variance than structures with an increased number of level 1 units. For small sample sizes, sampling fewer units at the level with Poisson variation produces less sampling variation, however this criterion is no longer important when sample sizes are large. ^ 1Neuhaus J (1993). “Estimation efficiency and Tests of Covariate Effects with Clustered Binary Data”. Biometrics , 49, 989–996^
Resumo:
SUMMARY There is interest in the potential of companion animal surveillance to provide data to improve pet health and to provide early warning of environmental hazards to people. We implemented a companion animal surveillance system in Calgary, Alberta and the surrounding communities. Informatics technologies automatically extracted electronic medical records from participating veterinary practices and identified cases of enteric syndrome in the warehoused records. The data were analysed using time-series analyses and a retrospective space-time permutation scan statistic. We identified a seasonal pattern of reports of occurrences of enteric syndromes in companion animals and four statistically significant clusters of enteric syndrome cases. The cases within each cluster were examined and information about the animals involved (species, age, sex), their vaccination history, possible exposure or risk behaviour history, information about disease severity, and the aetiological diagnosis was collected. We then assessed whether the cases within the cluster were unusual and if they represented an animal or public health threat. There was often insufficient information recorded in the medical record to characterize the clusters by aetiology or exposures. Space-time analysis of companion animal enteric syndrome cases found evidence of clustering. Collection of more epidemiologically relevant data would enhance the utility of practice-based companion animal surveillance.
Resumo:
Polydnaviruses (genera Ichnovirus and Bracovirus) have a segmented genome of circular double-stranded DNA molecules, replicate in the ovary of parasitic wasps and are essential for successful parasitism of the host. Here we show the first detailed analysis of various segments of a bracovirus, the Chelonus inanitus virus (CiV). Four segments were sequenced and two of them, CiV12 and CiV14, were found to be closely related while CiV14.5 and CiV16.8 were unrelated. CiV12, CiV14.5 and CiV16.8 are unique while CiV14 occurs also nested in another larger segment. All four segments are predicted to contain genes and predictions could be substantiated in most cases. Comparison with databases revealed no significant similarities at either the nucleotide or amino acid level. Inverted repeats with identities between 77% and 92% and lengths between 26 bp and 100 bp were found on all segments outside of predicted genes. Hybridization experiments indicate that CiV12 and CiV14 are both flanked by other virus segments, suggesting that proviral CiV segments are clustered in the genome of the wasp. The integration/excision site of CiV14 was analysed and compared to that of CiV12. On both termini of proviral CiV12 and CiV14 as well as in the excised circular molecule and the rejoined DNA a very similar repeat of 14 bp was found. A model to illustrate where the terminal repeats might recombine to yield the circular molecule is presented. Excision of CiV12 and CiV14 is restricted to the female and sets in at a very specific time-point in pupal-adult development.
Resumo:
The aetiology of childhood cancers remains largely unknown. It has been hypothesized that infections may be involved and that mini-epidemics thereof could result in space-time clustering of incident cases. Most previous studies support spatio-temporal clustering for leukaemia, while results for other diagnostic groups remain mixed. Few studies have corrected for uneven regional population shifts which can lead to spurious detection of clustering. We examined whether there is space-time clustering of childhood cancers in Switzerland identifying cases diagnosed at age <16 years between 1985 and 2010 from the Swiss Childhood Cancer Registry. Knox tests were performed on geocoded residence at birth and diagnosis separately for leukaemia, acute lymphoid leukaemia (ALL), lymphomas, tumours of the central nervous system, neuroblastomas and soft tissue sarcomas. We used Baker's Max statistic to correct for multiple testing and randomly sampled time-, sex- and age-matched controls from the resident population to correct for uneven regional population shifts. We observed space-time clustering of childhood leukaemia at birth (Baker's Max p = 0.045) but not at diagnosis (p = 0.98). Clustering was strongest for a spatial lag of <1 km and a temporal lag of <2 years (Observed/expected close pairs: 124/98; p Knox test = 0.003). A similar clustering pattern was observed for ALL though overall evidence was weaker (Baker's Max p = 0.13). Little evidence of clustering was found for other diagnostic groups (p > 0.2). Our study suggests that childhood leukaemia tends to cluster in space-time due to an etiologic factor present in early life.
Resumo:
Lipid rafts are small laterally mobile cell membrane structures that are highly enriched in lymphocyte signaling molecules. Lipid rafts can form from the assembly of specialized lipids and proteins through hydrophobic associations from saturated acyl chains. GM1 gangliosides are a common lipid raft component and have been shown to be essential in many T cell functions. Current lipid raft theory hypothesizes that certain aspects of T cell signaling can be initiated from the coalescence of these signaling-enriched lipid rafts to sites of receptor engagement. We have described how the specific aggregation of GM1 lipid rafts can cause a reorganization of cell surface molecular associations which include dynamic associations of β1 integrins with GM1 lipid rafts. These associations had pronounced effects on T cell adhesive and migratory states. We show that GM1 lipid raft aggregation can dramatically inhibit T cell migration and chemotaxis on the extracellular matrix constituent fibronectin. This inhibition of migration function was shown to be dependent on the src kinase Lck and PKC-regulated F-actin polymerization to extending pseudopods. Furthermore, GM1 lipid raft clustering could activate T cell adhesion-strengthening mechanisms. These include an increase in cellular rigidity, the creation of polymerized cortical F-actin structures, the induction of high affinity integrin states, an increase in surface area and symmetry of the contact plane, and resistance to shear flow detachment while adherent to fibronectin. This indicates that GM1 lipid raft aggregation defines a novel stimulus to regulate lymphocyte motility and cellular adhesion which could have important implications in T cell homing mechanisms. ^
Resumo:
The small leucine-rich repeat proteoglycans (or SLRPs) are a group of extracellular proteins (ECM) that belong to the leucine-rich repeat (LRR) superfamily of proteins. The LRR is a protein folding motif composed of 20–30 amino acids with leucines in conserved positions. LRR-containing proteins are present in a broad spectrum of organisms and possess diverse cellular functions and localization. In mammals, the SLRPs are abundant in connective tissues, such as bones, cartilage, tendons, skin, and blood vessels. We have discovered a new member of the class I small leucine rich repeat proteoglycan (SLRP) family which is distinct from the other class I SLRPs since it possesses a unique stretch of aspartate residues at its N-terminus. For this reason, we called the molecule asporin. The deduced amino acid sequence is about 50% identical (and 70% similar) to decorin and biglycan. However, asporin does not contain a serine/glycine dipeptide sequence required for the assembly of O-linked glycosaminoglycans and is probably not a proteoglycan. The tissue expression of asporin partially overlaps with the expression of decorin and biglycan. During mouse embryonic development, asporin mRNA expression was detected primarily in the skeleton and other specialized connective tissues; very little asporin message was detected in the major parenchymal organs. The mouse asporin gene structure is similar to that of biglycan and decorin with 8 exons. The asporin gene is localized to human chromosome 9q22-9g21.3 where asporin is part of a SLRP gene cluster that includes ECM2, osteoadherin, and osteoglycin. This gene cluster of four LRR-encoding genes is embedded in a 238 kilobase intron of another novel gene named Tes9orf that is expressed primarily in the testes of the adult mouse. The SLRP genes are not present in Drosophila or C. elegans , but reside in three separate gene clusters in the puffer fish, mice and humans. Targeted disruption of individual mouse SLRP genes display minor connective tissue defects such as skin fragility, tendon laxity, minor growth plate defects, and mild osteoporosis. However, double and triple knockouts of SLRP genes exacerbate these phenotypes. Both the double epiphycan/biglycan and the triple PRELP/fibromodulin/biglycan knockout mice exhibit premature osteoarthritis. ^
Resumo:
We have performed quantitative X-ray diffraction (qXRD) analysis of 157 grab or core-top samples from the western Nordic Seas between (WNS) ~57°-75°N and 5° to 45° W. The RockJock Vs6 analysis includes non-clay (20) and clay (10) mineral species in the <2 mm size fraction that sum to 100 weight %. The data matrix was reduced to 9 and 6 variables respectively by excluding minerals with low weight% and by grouping into larger groups, such as the alkali and plagioclase feldspars. Because of its potential dual origins calcite was placed outside of the sum. We initially hypothesized that a combination of regional bedrock outcrops and transport associated with drift-ice, meltwater plumes, and bottom currents would result in 6 clusters defined by "similar" mineral compositions. The hypothesis was tested by use of a fuzzy k-mean clustering algorithm and key minerals were identified by step-wise Discriminant Function Analysis. Key minerals in defining the clusters include quartz, pyroxene, muscovite, and amphibole. With 5 clusters, 87.5% of the observations are correctly classified. The geographic distributions of the five k-mean clusters compares reasonably well with the original hypothesis. The close spatial relationship between bedrock geology and discrete cluster membership stresses the importance of this variable at both the WNS-scale and at a more local scale in NE Greenland.