948 results for Clustering Analysis
Abstract:
ABSTRACT - Background/Objectives: Oncological diseases are the second leading cause of death in Portugal and have a profound psychosocial impact, not only because of their high incidence and mortality but also because of the enormous costs involved in their prevention, treatment and rehabilitation. According to previous studies, there are geographical disparities in the incidence of oncological disease. It is therefore essential to characterize and analyse its distribution in time and space, in order to control the disease and promote health while contributing to a better understanding of its aetiology. This project comprises three main objectives: characterizing the spatio-temporal distribution of lung cancer and of stomach cancer, separately and jointly, in the southern region of mainland Portugal (covered by ROR-Sul) between 2000 and 2008, seeking to identify potential risk areas for the development of these tumours. Methodology: In a first phase, a descriptive study was carried out of the incidence rates of these tumours by age, sex, year and district. Subsequently, in order to identify areas of high incidence, a spatio-temporal clustering analysis of incidence rates was performed at the municipality level in the study region for 2000-2008. Results: The descriptive analysis showed that both tumours are more incident in men than in women and are also more incident in people over 75 years of age. The spatio-temporal clustering analysis revealed a heterogeneous geographical pattern in the incidence of both tumours, yielding 3 clusters for stomach cancer and 2 clusters for lung cancer (p < 0.001). The stomach cancer clusters lie mostly in the Alentejo region and the lung cancer clusters in the Greater Lisbon region.
Conclusions: The clustering analysis showed a heterogeneous pattern in the distribution of the incidence of the two cancers in the study region and period. The areas identified as high risk differ between the two tumours. The region with the highest risk of developing stomach cancer is the Alentejo, and for lung cancer it is the district of Lisbon.
Abstract:
Propolis is a chemically complex biomass produced by honeybees (Apis mellifera) from plant resins mixed with salivary enzymes, beeswax, and pollen. The biological activities described for propolis have also been identified in the resins of donor plants, but a major challenge for standardizing the chemical composition and biological effects of propolis remains a better understanding of how seasonality influences the chemical constituents of that raw material. Since propolis quality depends, among other variables, on the local flora, which is strongly influenced by (a)biotic factors over the seasons, unravelling the effect of harvest season on the propolis chemical profile is an issue of recognized importance. Fast, cheap, and robust analytical techniques seem to be the best choice for large-scale quality control processes in the most demanding markets, e.g., human health applications. To that end, UV-Visible (UV-Vis) scanning spectrophotometry of hydroalcoholic extracts (HE) of seventy-three propolis samples, collected over the seasons in 2014 (summer, spring, autumn, and winter) and 2015 (summer and autumn) in Southern Brazil, was adopted. Machine learning and chemometrics techniques were then applied to the UV-Vis dataset to gain insights into the effect of seasonality on the claimed chemical heterogeneity of propolis samples, which is determined by changes in the flora of the geographic region under study. Descriptive and classification models were built following a chemometric approach, i.e., principal component analysis (PCA) and hierarchical clustering analysis (HCA), supported by scripts written in the R language. The UV-Vis profiles, combined with chemometric analysis, allowed a typical pattern to be identified in propolis samples collected in the summer.
Importantly, the discrimination based on PCA could be improved by using the dataset of the fingerprint region of phenolic compounds (λ = 280-400 nm), suggesting that, besides their biological activities, those secondary metabolites also play a relevant role in the discrimination and classification of that complex matrix through bioinformatics tools. Finally, a series of machine learning approaches, e.g., partial least squares-discriminant analysis (PLS-DA), k-Nearest Neighbors (kNN), and Decision Trees, proved complementary to PCA and HCA, yielding relevant information for sample discrimination.
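As a rough illustration of the hierarchical clustering step mentioned in this abstract, the sketch below groups a few short absorbance vectors by average-linkage agglomeration. It is not the authors' R workflow: the function names (`euclidean`, `average_linkage`, `agglomerate`), the choice of average linkage and Euclidean distance, and the four toy "spectra" are all assumptions made for the example.

```python
import math


def euclidean(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def average_linkage(c1, c2, points):
    """Mean pairwise distance between the members of two clusters."""
    total = sum(euclidean(points[i], points[j]) for i in c1 for j in c2)
    return total / (len(c1) * len(c2))


def agglomerate(points, n_clusters):
    """Merge the two closest clusters until n_clusters remain."""
    clusters = [[i] for i in range(len(points))]  # start with singletons
    while len(clusters) > n_clusters:
        best = None
        for a in range(len(clusters)):
            for b in range(a + 1, len(clusters)):
                d = average_linkage(clusters[a], clusters[b], points)
                if best is None or d < best[0]:
                    best = (d, a, b)
        _, a, b = best
        clusters[a] = clusters[a] + clusters[b]  # merge b into a
        del clusters[b]
    return [sorted(c) for c in clusters]


# Hypothetical absorbance readings at three wavelengths (made-up values).
spectra = [
    (0.80, 0.60, 0.30),  # sample 0
    (0.82, 0.58, 0.31),  # sample 1
    (0.20, 0.25, 0.10),  # sample 2
    (0.22, 0.27, 0.12),  # sample 3
]
groups = agglomerate(spectra, n_clusters=2)
```

With real spectra one would normally rely on an established implementation and inspect the full dendrogram rather than fixing the number of clusters in advance.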
Abstract:
The relationships between environmental factors and temporal and spatial variations of benthic communities of three rocky shores of the state of Espírito Santo, Southeast Brazil, were studied. Sampling was conducted every three months, from August 2006 to May 2007, using intersection points. Chthamalus bisinuatus (Pilsbry, 1916) (Crustacea) and Brachidontes spp. (Mollusca) were the most abundant taxa, occupying the upper level of the intertidal zone of the rocky shore. The species richness was higher at the lower levels. The invasive species Isognomon bicolor (C. B. Adams, 1845) (Mollusca) occurred at low densities in the studied areas. The clustering analysis dendrogram indicated a separation of communities based on exposed and sheltered areas. According to the variance analyses, the communities were significantly different among the studied areas and seasons. The extent of wave exposure and shore slope influenced the species variability. The Setibão site showed the highest diversity and richness, most likely due to greater wave exposure. The communities showed greater variation in the lower levels where environmental conditions were less severe, relative to the other levels.
Abstract:
The great expansion in the number of genome sequencing projects has revealed the importance of computational methods to speed up the characterization of unknown genes. These studies have been improved by the use of three-dimensional information from the predicted proteins generated by molecular modeling techniques. In this work, we disclose the structure-function relationship of a gene product from Leishmania amazonensis by applying molecular modeling and bioinformatics techniques. The analyzed sequence encodes a 159-amino-acid polypeptide (estimated 18 kDa) and was denoted LaPABP for its high homology with poly-A binding proteins from trypanosomatids. The domain structure, clustering analysis and a three-dimensional model of LaPABP, obtained essentially by homology modeling on the structure of the human poly-A binding protein, are described. Based on the analysis of the electrostatic potential mapped on the model's surface and the conservation of intramolecular contacts responsible for folding stabilization, we hypothesize that this protein may have less avidity for RNA than its L. major counterpart but still account for a significant functional activity in the parasite. The model obtained will help in the design of mutagenesis experiments aimed at elucidating the mechanism of gene expression in trypanosomatids and will serve as a starting point for its exploration as a potential source of targets for rational chemotherapy.
Abstract:
The immobile location-allocation (LA) problem is a type of LA problem that consists in determining the service each facility should offer in order to optimize some criterion (such as the global demand), given the positions of the facilities and the customers. Because of the complexity of the problem, i.e., it is a combinatorial problem (whose size depends on the number of possible services and the number of facilities) with a non-convex search space and several sub-optima, traditional methods cannot be applied directly to optimize it. We therefore propose the use of clustering analysis to convert the initial problem into several smaller sub-problems. In this way, we present and analyze the suitability of some clustering methods for partitioning the aforementioned LA problem. We then explore the use of metaheuristic techniques such as genetic algorithms, simulated annealing or cuckoo search to solve the sub-problems obtained after the clustering analysis.
Abstract:
BACKGROUND & AIMS: Regulation of gene expression in the follicle-associated epithelium (FAE) over Peyer's patches is largely unknown. CCL20, a chemokine that recruits immature dendritic cells, is one of the few FAE-specific markers described so far. Lymphotoxin beta (LTalpha1beta2) expressed on the membrane of immune cells triggers CCL20 expression in enterocytes. In this study, we measured expression profiles of LTalpha1beta2-treated intestinal epithelial cells and selected CCL20-coregulated genes to identify new FAE markers. METHODS: Genomic profiles of T84 and Caco-2 cell lines treated with either LTalpha1beta2, flagellin, or tumor necrosis factor alpha were measured using the Affymetrix GeneChip U133A. Clustering analysis was used to select CCL20-coregulated genes, and laser dissection microscopy and real-time polymerase chain reaction on human biopsy specimens were used to assess the expression of the selected markers. RESULTS: Applying a 2-way analysis of variance, we identified genes regulated by the different treatments. A subset of genes involved in inflammation and related to the nuclear factor kappaB pathway was coregulated with CCL20. Among these genes, the antiapoptotic factor TNFAIP3 was highly expressed in the FAE. CCL23, which was not coregulated in vitro with CCL20, was also specifically expressed in the FAE. CONCLUSIONS: We have identified 2 novel genes specifically expressed in the human FAE. Most of the CCL20-coregulated genes did not show FAE-specific expression, suggesting that other signaling pathways are critical to modulate FAE-specific gene expression.
Abstract:
The Culex pipiens complex includes two widespread mosquito vector species, Cx. pipiens and Cx. quinquefasciatus. The distribution of these species varies in latitude, with the former being present in temperate regions and the latter in tropical and subtropical regions. However, their distribution range overlaps in certain areas and interspecific hybridization has been documented. Genetic introgression between these species may have epidemiological repercussions for West Nile virus (WNV) transmission. Bayesian clustering analysis based on multilocus genotypes of 12 microsatellites was used to determine levels of hybridization between these two species in Macaronesian islands, the only contact zone described in West Africa. The distribution of the two species reflects both the islands’ biogeography and historical aspects of human colonization. Madeira Island displayed a homogenous population of Cx. pipiens, whereas Cape Verde showed a more intriguing scenario with extensive hybridization. In the islands of Brava and Santiago, only Cx. quinquefasciatus was found, while in Fogo and Maio high hybrid rates (~40%) between the two species were detected. Within the admixed populations, second-generation hybrids (~50%) were identified suggesting a lack of isolation mechanisms. The observed levels of hybridization may locally potentiate the transmission to humans of zoonotic arboviruses such as WNV.
Abstract:
This work proposes an original contribution to the understanding of fishermen spatial behavior, based on the behavioral ecology and movement ecology paradigms. Through the analysis of Vessel Monitoring System (VMS) data, we characterized the spatial behavior of Peruvian anchovy fishermen at different scales: (1) the behavioral modes within fishing trips (i.e., searching, fishing and cruising); (2) the behavioral patterns among fishing trips; (3) the behavioral patterns by fishing season conditioned by ecosystem scenarios; and (4) the computation of maps of anchovy presence proxy from the spatial patterns of behavioral mode positions. At the first scale considered, we compared several Markovian (hidden Markov and semi-Markov models) and discriminative models (random forests, support vector machines and artificial neural networks) for inferring the behavioral modes associated with VMS tracks. The models were trained under a supervised setting and validated using tracks for which behavioral modes were known (from on-board observers' records). Hidden semi-Markov models performed better, and were retained for inferring the behavioral modes on the entire VMS dataset. At the second scale considered, each fishing trip was characterized by several features, including the time spent within each behavioral mode. Using a clustering analysis, fishing trip patterns were classified into groups associated with management zones, fleet segments and skippers' personalities. At the third scale considered, we analyzed how ecological conditions shaped fishermen behavior. By means of co-inertia analyses, we found significant associations between fishermen, anchovy and environmental spatial dynamics, and fishermen behavioral responses were characterized according to contrasted environmental scenarios. At the fourth scale considered, we investigated whether the spatial behavior of fishermen reflected to some extent the spatial distribution of anchovy.
Finally, this work provides a wider view of fishermen behavior: fishermen are not only economic agents, but they are also foragers, constrained by ecosystem variability. To conclude, we discuss how these findings may be of importance for fisheries management, collective behavior analyses and end-to-end models.
Abstract:
The giant hogweed (Heracleum mantegazzianum) has successfully invaded 19 European countries as well as parts of North America. It has become a problematic species due to its ability to displace native flora and to cause public health hazards. Applying population genetics to species invasion can help reconstruct invasion history and may promote more efficient management practice. We thus analysed levels of genetic variation and population genetic structure of H. mantegazzianum in an invaded area of the western Swiss Alps as well as in its native range (the Caucasus), using eight nuclear microsatellite loci together with plastid DNA markers and sequences. On both nuclear and plastid genomes, native populations exhibited significantly higher levels of genetic diversity compared to invasive populations, confirming an important founder event during the invasion process. Invasive populations were also significantly more differentiated than native populations. Bayesian clustering analysis identified five clusters in the native range that corresponded to geographically and ecologically separated groups. In the invaded range, 10 clusters occurred. Unlike native populations, invasive clusters were characterized by a mosaic pattern in the landscape, possibly caused by anthropogenic dispersal of the species via roads and direct collection for ornamental purposes. Lastly, our analyses revealed four main divergent groups in the western Swiss Alps, likely as a consequence of multiple independent establishments of H. mantegazzianum.
Abstract:
Microarray gene expression profiles of fresh clinical samples of chronic myeloid leukaemia in chronic phase, acute promyelocytic leukaemia and acute monocytic leukaemia were compared with profiles from cell lines representing the corresponding types of leukaemia (K562, NB4, HL60). In a hierarchical clustering analysis, all clinical samples clustered separately from the cell lines, regardless of leukaemic subtype. Gene ontology analysis showed that cell lines chiefly overexpressed genes related to macromolecular metabolism, whereas in clinical samples genes related to the immune response were abundantly expressed. These findings must be taken into consideration when conclusions from cell line-based studies are extrapolated to patients.
Abstract:
In this paper we present the results of applying several visual methods to a group of sites, dated between the 6th and 1st centuries BC, in the ager Tarraconensis (Tarragona, Spain), the hinterland of the Roman colony of Tarraco. The difficulty of interpreting the diverse results in a combined way has been resolved by means of statistical methods, such as Principal Components Analysis (PCA) and K-means clustering analysis. These methods have allowed us to classify the sites according to the visual structure of the landscape that contains them and the visual relationships that could exist among them.
Abstract:
We have compared the phylogenetic diversity of methicillin-resistant Staphylococcus aureus (MRSA) strains from Switzerland and their phylogenetic relationships with European epidemic clones, using multiprimer random amplification polymorphic DNA (RAPD). Strains included 24 European epidemic clones (59 strains), 66 sporadic strains isolated in Switzerland in 1996-1997, and 15 reference strains of five other Staphylococcus species. Similarity and clustering analysis using Jaccard's coefficient showed that the maximum genetic distance between MRSA strains was 0.43, whereas the minimum genetic distance between the six Staphylococcus species was 0.97, indicating that the method permits phylogenetic hierarchization. The 24 MRSA clones reported to be epidemic in European countries during the 1990s were distributed into seven different genetic clusters with a maximum distance of 0.29 among them. This clustering pattern was confirmed by the analysis of a subset of MRSA strains by multilocus enzyme electrophoresis at 12 loci. Most of the sporadic Swiss strains were distributed into these seven different genetic clusters, together with the epidemic MRSA clones. This suggests that there is no phylogenetic cluster specific to epidemic clones of MRSA.
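The genetic distances quoted in this abstract are derived from Jaccard's coefficient. As a minimal sketch, assuming each strain is summarized by the set of band positions present in its RAPD profile, the distance can be computed as one minus the ratio of shared bands to all bands observed in either strain. The function name `jaccard_distance` and the band sets below are hypothetical and do not come from the study.

```python
def jaccard_distance(bands_a, bands_b):
    """Return 1 - |A ∩ B| / |A ∪ B| for two sets of band positions."""
    a, b = set(bands_a), set(bands_b)
    union = a | b
    if not union:
        return 0.0  # two empty profiles are treated as identical
    return 1.0 - len(a & b) / len(union)


# Hypothetical band profiles (band positions are made-up labels).
strain_x = {1, 2, 3}
strain_y = {2, 3, 4}
distance_xy = jaccard_distance(strain_x, strain_y)  # 2 shared of 4 total -> 0.5
```

A distance of 0 means identical band patterns and 1 means no shared bands, which is the scale on which values such as 0.43 and 0.97 would be read.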
Abstract:
A new issue, once again a bouquet of attractive papers. First of all the paper by Droit-Dupré et al. (10.1007/s00428-015-1724-9). The group studied colonic adenocarcinomas, not otherwise specified, by immunohistochemistry for the expression of markers of intestinal epithelial cell differentiation. Hierarchical clustering analysis identified a major cluster of two thirds of the case series, expressing cytokeratin 20, CDX2 and MUC2 and invariably mismatch repair competent, which they called crypt-like. In stage III colon cancer, the crypt-like cluster had a better prognosis. The paper is a relatively simple example of what is happening in cancer classification beyond morphology: multiparameter differentiation and (epi)genomic markers defining new subtypes of cancer with potential clinical significance in clinical decision making.
Abstract:
OBJECTIVE: In contrast to conventional (CONV) neuromuscular electrical stimulation (NMES), the use of "wide-pulse, high-frequencies" (WPHF) can generate higher forces than expected by the direct activation of motor axons alone. We aimed at investigating the occurrence, magnitude, variability and underlying neuromuscular mechanisms of these "Extra Forces" (EF). METHODS: Electrically-evoked isometric plantar flexion force was recorded in 42 healthy subjects. Additionally, twitch potentiation, H-reflex and M-wave responses were assessed in 13 participants. CONV (25Hz, 0.05ms) and WPHF (100Hz, 1ms) NMES consisted of five stimulation trains (20s on-90s off). RESULTS: K-means clustering analysis disclosed a responder rate of almost 60%. Within this group of responders, force significantly increased from 4% to 16% of the maximal voluntary contraction force and H-reflexes were depressed after WPHF NMES. In contrast, non-responders showed neither EF nor H-reflex depression. Twitch potentiation and resting EMG data were similar between groups. Interestingly, a large inter- and intrasubject variability of EF was observed. CONCLUSION: The responder percentage was overestimated in previous studies. SIGNIFICANCE: This study proposes a novel methodological framework for unraveling the neurophysiological mechanisms involved in EF and provides further evidence for a central contribution to EF in responders.
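To make the responder/non-responder split concrete, the sketch below runs k-means with k = 2 on a single made-up variable. The abstract does not state which features were clustered or how the algorithm was initialized, so the one-dimensional input, the min/max initialization, the function name `kmeans_1d_two_groups`, and the sample values are all assumptions for illustration only.

```python
def kmeans_1d_two_groups(values, max_iter=100):
    """Split 1-D values into a low (0) and a high (1) cluster with k-means."""
    lo, hi = min(values), max(values)  # deterministic initial centroids
    labels = [0] * len(values)
    for _ in range(max_iter):
        # Assignment step: each value joins the nearer centroid.
        labels = [0 if abs(v - lo) <= abs(v - hi) else 1 for v in values]
        low = [v for v, g in zip(values, labels) if g == 0]
        high = [v for v, g in zip(values, labels) if g == 1]
        # Update step: each centroid becomes the mean of its cluster.
        new_lo = sum(low) / len(low)
        new_hi = sum(high) / len(high) if high else hi
        if (new_lo, new_hi) == (lo, hi):
            break  # converged: centroids no longer move
        lo, hi = new_lo, new_hi
    return labels, (lo, hi)


# Hypothetical force increases (% of maximal voluntary contraction).
force_increase = [4.0, 5.0, 6.0, 14.0, 16.0, 18.0]
labels, centroids = kmeans_1d_two_groups(force_increase)
responder_rate = sum(labels) / len(labels)
```

In this toy case the high cluster would be read as responders. A real analysis would typically use several features and repeated random initializations.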
Abstract:
Conventional (CONV) neuromuscular electrical stimulation (NMES) (i.e., short pulse duration, low frequencies) induces a higher energetic response than voluntary contractions (VOL). In contrast, wide-pulse, high-frequency (WPHF) NMES might elicit, at least in some subjects (i.e., responders), a different motor unit recruitment compared to CONV that resembles the physiological muscle activation pattern of VOL. We therefore hypothesized that for these responder subjects, the metabolic demand of WPHF would be lower than CONV and comparable to VOL. Eighteen healthy subjects performed isometric plantar flexions at 10% of their maximal voluntary contraction force for CONV (25 Hz, 0.05 ms), WPHF (100 Hz, 1 ms) and VOL protocols. For each protocol, force time integral (FTI) was quantified and subjects were classified as responders and non-responders to WPHF based on k-means clustering analysis. Furthermore, a fatigue index based on FTI loss at the end of each protocol compared with the beginning of the protocol was calculated. Phosphocreatine depletion (ΔPCr) was assessed using 31P magnetic resonance spectroscopy. Responders developed four times higher FTIs during WPHF (99 ± 37 × 10^3 N.s) than non-responders (26 ± 12 × 10^3 N.s). For both responders and non-responders, CONV was metabolically more demanding than VOL when ΔPCr was expressed relative to the FTI. Only for the responder group, the ΔPCr/FTI ratio of WPHF (0.74 ± 0.19 M/N.s) was significantly lower compared to CONV (1.48 ± 0.46 M/N.s) but similar to VOL (0.65 ± 0.21 M/N.s). Moreover, the fatigue index was not different between WPHF (-16%) and CONV (-25%) for the responders. WPHF could therefore be considered the less demanding NMES modality, at least in this subgroup of subjects, possibly because it exhibits a muscle activation pattern similar to VOL contractions.