973 results for Molecular classification
Abstract:
Calls from 14 species of bat were classified to genus and species using discriminant function analysis (DFA), support vector machines (SVM) and ensembles of neural networks (ENN). Both SVMs and ENNs outperformed DFA for every species while ENNs (mean identification rate – 97%) consistently outperformed SVMs (mean identification rate – 87%). Correct classification rates produced by the ENNs varied from 91% to 100%; calls from six species were correctly identified with 100% accuracy. Calls from the five species of Myotis, a genus whose species are considered difficult to distinguish acoustically, had correct identification rates that varied from 91 – 100%. Five parameters were most important for classifying calls correctly while seven others contributed little to classification performance.
Abstract:
A novel combined near- and mid-infrared (NIR and MIR) spectroscopic method has been researched and developed for the analysis of complex substances such as the Traditional Chinese Medicine (TCM), Illicium verum Hook. F. (IVHF), and its noxious adulterant, Illicium lanceolatum A.C. Smith (ILACS). Three types of spectral matrix were submitted for classification with the use of the linear discriminant analysis (LDA) method. The data were pretreated with either the successive projections algorithm (SPA) or the discrete wavelet transform (DWT) method. The SPA method performed somewhat better, principally because it required fewer spectral features for its pretreatment model. Thus, the NIR and MIR matrices, as well as the combined NIR/MIR matrix, were pretreated by the SPA method and then analysed by LDA. This approach enabled the prediction and classification of the IVHF, ILACS and mixed samples. The MIR spectral data produced somewhat better classification rates than the NIR data. However, the best results were obtained from the combined NIR/MIR data matrix with 95–100% correct classifications for calibration, validation and prediction. Principal component analysis (PCA) of the three types of spectral data supported the results obtained with the LDA classification method.
Abstract:
The Indo-West Pacific (IWP), from South Africa in the western Indian Ocean to the western Pacific Ocean, contains some of the most biologically diverse marine habitats on earth, including the greatest biodiversity of chondrichthyan fishes. The region encompasses various densities of human habitation leading to contrasts in the levels of exploitation experienced by chondrichthyans, which are targeted for local consumption and export. The demersal chondrichthyan, the zebra shark, Stegostoma fasciatum, is endemic to the IWP and has two current regional International Union for the Conservation of Nature (IUCN) Red List classifications that reflect differing levels of exploitation: ‘Least Concern’ and ‘Vulnerable’. In this study, we employed mitochondrial ND4 sequence data and 13 microsatellite loci to investigate the population genetic structure of 180 zebra sharks from 13 locations throughout the IWP to test the concordance of IUCN zones with demographic units that have conservation value. Mitochondrial and microsatellite data sets from samples collected throughout northern Australia and Southeast Asia concord with the regional IUCN classifications. However, we found evidence of genetic subdivision within these regions, including subdivision between locations connected by habitat suitable for migration. Furthermore, parametric FST analyses and Bayesian clustering analyses indicated that the primary genetic break within the IWP is not represented by the IUCN classifications but rather is congruent with the Indonesian throughflow current. Our findings indicate that recruitment to areas of high exploitation from nearby healthy populations in zebra sharks is likely to be minimal, and that severe localized depletions are predicted to occur in zebra shark populations throughout the IWP region.
Abstract:
Microarrays have a wide range of applications in the biomedical field. From the beginning, arrays have mostly been utilized in cancer research, including classification of tumors into different subgroups and identification of clinical associations. In the microarray format, a collection of small features, such as different oligonucleotides, is attached to a solid support. The advantage of microarray technology is the ability to simultaneously measure changes in the levels of multiple biomolecules. Because many diseases, including cancer, are complex, involving an interplay between various genes and environmental factors, the detection of only a single marker molecule is usually insufficient for determining disease status. Thus, a technique that simultaneously collects information on multiple molecules allows better insights into a complex disease. Since microarrays can be custom-manufactured or obtained from a number of commercial providers, understanding data quality and comparability between different platforms is important to enable the use of the technology in areas beyond basic research. When standardized, integrated array data could ultimately help to offer a complete profile of the disease, illuminating mechanisms and genes behind disorders as well as facilitating disease diagnostics. In the first part of this work, we aimed to elucidate the comparability of gene expression measurements from different oligonucleotide and cDNA microarray platforms. We compared three different gene expression microarrays; one was a commercial oligonucleotide microarray and the others were commercial and custom-made cDNA microarrays. The filtered gene expression data from the commercial platforms correlated better across experiments (r=0.78-0.86) than the expression data between the custom-made and either of the two commercial platforms (r=0.62-0.76).
Although the results from different platforms correlated reasonably well, combining and comparing the measurements was not straightforward. The clone errors on the custom-made array and annotation and technical differences between the platforms introduced variability in the data. In conclusion, the different gene expression microarray platforms provided results sufficiently concordant for the research setting, but the variability represents a challenge for developing diagnostic applications for the microarrays. In the second part of the work, we performed an integrated high-resolution microarray analysis of gene copy number and expression in 38 laryngeal and oral tongue squamous cell carcinoma cell lines and primary tumors. Our aim was to pinpoint genes for which expression was impacted by changes in copy number. The data revealed that amplifications in particular had a clear impact on gene expression. Across the genome, 14-32% of genes in the highly amplified regions (copy number ratio >2.5) had associated overexpression. The impact of decreased copy number on gene underexpression was less clear. Using statistical analysis across the samples, we systematically identified hundreds of genes for which an increased copy number was associated with increased expression. For example, our data implied that FADD and PPFIA1 were frequently overexpressed at the 11q13 amplicon in HNSCC. The 11q13 amplicon, including known oncogenes such as CCND1 and CTTN, is well-characterized in different types of cancer, but the roles of FADD and PPFIA1 remain obscure. Taken together, the integrated microarray analysis revealed a number of known as well as novel target genes in altered regions in HNSCC. The identified genes provide a basis for functional validation and may eventually lead to the identification of novel candidates for targeted therapy in HNSCC.
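The copy-number figure quoted in this abstract (the share of highly amplified genes that also show overexpression) can be illustrated with a minimal sketch. It is not the authors' statistical analysis: the 2.5 copy-number cutoff comes from the abstract, while the expression cutoff of 2.0, the field names and the toy gene records are hypothetical and chosen only for illustration.

```python
def overexpressed_fraction(genes, cn_threshold=2.5, expr_threshold=2.0):
    """Fraction of highly amplified genes that are also overexpressed.

    cn_threshold follows the abstract (copy number ratio > 2.5);
    expr_threshold is a hypothetical cutoff, not taken from the source.
    Returns None when no gene passes the amplification cutoff.
    """
    amplified = [g for g in genes if g["cn_ratio"] > cn_threshold]
    if not amplified:
        return None
    overexpressed = [g for g in amplified if g["expr_ratio"] > expr_threshold]
    return len(overexpressed) / len(amplified)


# Toy records (invented values, for illustration only)
genes = [
    {"name": "gene1", "cn_ratio": 3.1, "expr_ratio": 4.0},
    {"name": "gene2", "cn_ratio": 2.8, "expr_ratio": 1.1},
    {"name": "gene3", "cn_ratio": 1.0, "expr_ratio": 3.0},
    {"name": "gene4", "cn_ratio": 2.6, "expr_ratio": 0.9},
    {"name": "gene5", "cn_ratio": 2.9, "expr_ratio": 1.0},
]
fraction = overexpressed_fraction(genes)
print(fraction)
```

In this toy input four genes pass the amplification cutoff and one of them passes the expression cutoff, so the fraction is 0.25; gene3 is overexpressed but not amplified and is therefore not counted.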
Abstract:
Background: The Ewing sarcoma family of tumors (ESFT) are rare but highly malignant neoplasms that occur mainly in bone but also in soft tissue. ESFT affects patients typically in their second decade of life, whereby children and adolescents bear the heaviest incidence burden. Despite recent advances in the clinical management of ESFT patients, their prognosis and survival are still disappointingly poor, especially in cases with metastasis. No targeted therapy for ESFT patients is currently available. Moreover, based merely on current clinical and biological characteristics, accurate classification of ESFT patients often fails at the time of diagnosis. Therefore, there is a constant need for novel molecular biomarkers to be applied in tandem with conventional parameters to further intensify ESFT risk-stratification and treatment selection, and ultimately to develop novel targeted therapies. In this context, a greater understanding of the genetics and immune characteristics of ESFT is needed. Aims: This study sought to provide novel insights into gene copy number changes and gene expression in ESFT and, further, to shed light on the role of inflammation in ESFT. For this purpose, microarrays were used to provide gene-level information on a genome-wide scale. In addition, this study focused on screening of 9p21.3 deletion sizes and frequencies in ESFT and in another pediatric cancer, acute lymphocytic leukemia (ALL), in order to define more exact criteria for high-risk patient selection and to provide data for developing a more reliable diagnostic method to detect CDKN2A deletions. Results: In study I, 20 novel ESFT-associated suppressor genes and oncogenes were pinpointed using combined array CGH and expression analysis. In addition, interesting chromosomal rearrangements were identified: (1) Duplication of derivative chromosome der(22)(11;22) was detected in three ESFT patients.
This duplication included the EWSR1-FLI1 fusion gene leading to increase in its copy number; (2) Cryptic amplifications on chromosomes 20 and 22 were detected, suggesting a novel translocation between chromosomes 20 and 22, which most probably produces a fusion between EWSR1 and NFATC2. In study II, bioinformatic analysis of ESFT expression profiles showed that inflammatory gene activation is detectable in ESFT patient samples and that the activation is characterized by macrophage gene expression. Most interestingly, ESFT patient samples were shown to express certain inflammatory genes that were prognostically significant. High local expression of C5 and JAK1 at the tumor site was shown to associate with favorable clinical outcome, whereas high local expression of IL8 was shown to be detrimental. Studies III and IV showed that the smallest overlapping region of deletion in 9p21.3 includes CDKN2A in all cases and that the length of this region is 12.2 kb in both Ewing sarcoma and ALL. Furthermore, our results showed that the most widely used commercial CDKN2A FISH probe creates false negative results in the narrowest microdeletion cases (<190 kb). Therefore, more accurate methods should be developed for the detection of deletions in the CDKN2A locus. Conclusions: This study provides novel insights into the genetic changes involved in the biology of ESFT, in the interaction between ESFT cells and immune system, and in the inactivation of CDKN2A. Novel ESFT biomarker genes identified in this study serve as a useful resource for future studies and in developing novel therapeutic strategies to improve the survival of patients with ESFT.
Abstract:
Many of the genes predisposing to highly penetrant colorectal cancer (CRC) syndromes, including hereditary non-polyposis colorectal cancer (MLH1, MSH2, MSH6, PMS2), familial adenomatous polyposis (APC), Peutz-Jeghers syndrome (LKB1), juvenile polyposis (SMAD4, BMPR1A), MYH-associated polyposis (MYH), and Cowden syndrome (PTEN) have already been discovered. Identification of these genes has allowed a more precise classification of the hereditary CRC syndromes and provided a means for predictive genetic testing and surveillance. Some of the genes are also involved in sporadic cancer forms, and therefore the investigation of the rare CRC syndromes has been a breakthrough for general cancer research. Despite the accumulating knowledge on hereditary cancer syndromes, a significant number of familial CRCs remain molecularly unexplained after genetic testing, reflecting the possibility of other predisposing genes or existence of novel syndromes. Moreover, genetic variants conferring low-penetrance risk are still largely unknown. In this study, we examined the role of some new high- and low-penetrance alleles on CRC predisposition. We identified disease-causing MYH mutations in a subset (9%) of patients with APC and AXIN2 mutation-negative adenomatous polyposis. Due to differences in the pattern of inheritance and clinical manifestation, screening for mutations in MYH is beneficial in view of genetic counselling and surveillance. A novel functionally deficient MYH founder mutation A459D was identified in the Finnish population, and this finding had immediate clinical implications for genetic counselling of at-risk families. Many patients with hamartomatous polyposis remain without molecular diagnosis due to atypical phenotypes. We therefore sought to classify 49 patients with unexplained hamartomatous or hyperplastic/mixed polyposis by extensive molecular analyses of PTEN, LKB1, BMPR1A, SMAD4, ENG, BRAF, MYH, and BHD along with revision of polyp histology.
Mutations were identified in 11/49 (22%) of the patients. In 6 cases the molecular diagnosis was re-classified guiding surveillance and decisions for prophylactic surgery. Re-evaluation of polyp histology with subsequent more accurate selection of candidate gene analyses is beneficial and can be recommended for patients with unexplained polyposis. Furthermore, germline mutations in ENG underlying juvenile polyposis were described for the first time, characterizing a possible novel genetically defined form of hereditary CRC. Association analyses on two putative low-penetrance alleles, NOD2 3020insC and MDM2 SNP309 were performed in a population-based series of 1042 Finnish CRC patients and in cancer-free controls. In contrast to previous results, NOD2 3020insC did not associate with CRC or age at disease onset in the Finnish population. These data suggest that NOD2 3020insC alone might not be sufficient for CRC predisposition. MDM2 SNP309 was as common in the CRC cohort as in the healthy controls. Interesting trends, however, were observed, which after correction for multiple testing did not reach statistical significance. SNP309 was more common in female CRC patients and a trend towards an earlier age at disease onset was observed in women with SNP309. Subsequent studies have supported this observation and SNP309 could affect gender- or hormone-related tumorigenesis. Finally, a large-scale unbiased effort was designed to characterize the complete mutatome of CRC with microsatellite instability (MSI). Using an approach combining expression microarray and genome database searches, we were able to identify putative MSI target genes. Further characterization of one of the genes suggested that it might play a role also in microsatellite stable CRC and Peutz-Jeghers syndrome pathogenesis.
Abstract:
Laboratory confirmation methods are important in bovine cysticercosis diagnosis because other pathologies can produce morphologically similar lesions, leading to false identifications. We developed a probe-based real-time PCR assay to identify Taenia saginata in suspect cysts encountered at meat inspection and compared its use with the traditional method of identification, histology, as well as a published nested PCR. The assay simultaneously detects T. saginata DNA and a bovine internal control using the cytochrome c oxidase subunit 1 gene of each species and shows specificity against parasites causing lesions morphologically similar to those of T. saginata. The assay was sufficiently sensitive to detect 1 fg (Ct 35.09 +/- 0.95) of target DNA using serially-diluted plasmid DNA in reactions spiked with bovine DNA as well as in all viable and caseated positive control cysts. A loss in PCR sensitivity was observed with increasing cyst degeneration, as seen in other molecular methods. In comparison to histology, the assay offered greater sensitivity and accuracy with 10/19 (53%) T. saginata positives detected by real-time PCR and none by histology. When the results were compared with the reference PCR, the assay was less sensitive but offered advantages of faster turnaround times and reduced contamination risk. Estimates of the assay's repeatability and reproducibility showed the assay is highly reliable with reliability coefficients greater than 0.94. Crown Copyright (C) 2013 Published by Elsevier B.V. All rights reserved.
Abstract:
We have analysed Protein Kinase-Like Non-kinases (PKLNKs), which are closely related to protein kinases but lack the crucial catalytic aspartate in the catalytic loop and hence cannot function as protein kinases. Using various sensitive sequence analysis methods, we have recognized 82 PKLNKs from four higher eukaryotic organisms, namely, Homo sapiens, Mus musculus, Rattus norvegicus, and Drosophila melanogaster. On the basis of their domain combination and function, PKLNKs have been classified mainly into four categories: (1) Ligand binding PKLNKs, (2) PKLNKs with extracellular protein-protein interaction domain, (3) PKLNKs involved in dimerization, and (4) PKLNKs with cytoplasmic protein-protein interaction module. While members of the first two classes of PKLNKs have a transmembrane domain tethered to the PKLNK domain, members of the other two classes of PKLNKs are cytoplasmic in nature. The current classification scheme is intended to provide a convenient framework for classifying PKLNKs from other eukaryotes, which would be helpful in deciphering their roles in cellular processes.
Abstract:
The first part of this work investigates the molecular epidemiology of a human enterovirus (HEV), echovirus 30 (E-30). This project is part of a series of studies performed in our research team analyzing the molecular epidemiology of HEV-B viruses. A total of 129 virus strains had been isolated in different parts of Europe. The sequence analysis was performed in three different genomic regions: 420 nucleotides (nt) in the VP4/VP2 capsid protein coding region, the entire VP1 capsid protein coding gene of 876 nt, and 150 nt in the VP1/2A junction region. The analysis revealed a succession of dominant sublineages within a major genotype. The temporally earlier genotypes had been replaced by a genetically homogenous lineage that has been circulating in Europe since the late 1970s. The same genotype was found by other research groups in North America and Australia. Globally, other cocirculating genetic lineages also exist. The prevalence of a dominant genotype makes E-30 different from other previously studied HEVs, such as polioviruses and coxsackieviruses B4 and B5, for which several coexisting genetic lineages have been reported. The second part of this work deals with molecular epidemiology of human rhinoviruses (HRVs). A total of 61 field isolates were studied in the 420-nt stretch in the capsid coding region of VP4/VP2. The isolates were collected from children under two years of age in Tampere, Finland. Sequences from the clinical isolates clustered in the two previously known phylogenetic clades. Seasonal clustering was found. Also, several distinct serotype-like clusters were found to co-circulate during the same epidemic season. Reappearance of a cluster after disappearing for a season was observed. The molecular epidemiology of the analyzed strains turned out to be complex, and we decided to continue our studies of HRV. Only five previously published complete genome sequences of HRV prototype strains were available for analysis. 
Therefore, all designated HRV prototype strains (n=102) were sequenced in the VP4/VP2 region, and the possibility of genetic typing of HRV was evaluated. Seventy-six of the 102 prototype strains clustered in HRV genetic group A (HRV-A) and 25 in group B (HRV-B). Serotype 87 clustered separately from other HRVs with HEV species D. The field strains of HRV represented as many as 19 different genotypes, as judged with an approximate demarcation of a 20% nt difference in the VP4/VP2 region. The interserotypic differences of HRV were generally similar to those reported between different HEV serotypes (i.e. about 20%), but smaller differences, less than 10%, were also observed. Because some HRV serotypes are genetically so closely related, we suggest that the genetic typing be performed using the criterion "the closest prototype strain". This study is the first systematic genetic characterization of all known HRV prototype strains, providing a further taxonomic proposal for classification of HRV. We proposed to divide the genus Human rhinoviruses into HRV-A and HRV-B. The final part of the work comprises a phylogenetic analysis of a subset (48) of HRV prototype strains and field isolates (12) in the nonstructural part of the genome coding for the RNA-dependent RNA polymerase (3D). The proposed division of the HRV strains in the species HRV-A and HRV-B was also supported by 3D region. HRV-B clustered closer to HEV species B, C, and also to polioviruses than to HRV-A. Intraspecies variation within both HRV-A and HRV-B was greater in the 3D coding region than in the VP4/VP2 coding region, in contrast to HEV. Moreover, the diversity of HRV in 3D exceeded that of HEV. One group of HRV-A, designated HRV-A', formed a separate cluster outside other HRV-A in the 3D region. It formed a cluster also in the capsid region, but located within HRV-A. This may reflect a different evolutionary history of distinct genomic regions among HRV-A. 
Furthermore, the tree topology within HRV-A in the 3D region differed from that in the VP4/VP2, suggesting possible recombination events in the evolution of the strains. No conflicting phylogenies were observed in any of the 12 field isolates. Possible recombination was further studied using the Similarity and Bootscanning analyses of the complete genome sequences of HRV available in public databases. Evidence for recombination among HRV-A was found, as HRV2 and HRV39 showed higher similarity in the nonstructural part of the genome. Whether HRV2 and HRV39 strains - and perhaps also some other HRV-A strains not yet completely sequenced - are recombinants remains to be determined.
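The typing rule proposed in this abstract (assign a field strain to "the closest prototype strain", with roughly 20% nucleotide difference in VP4/VP2 as an approximate demarcation between types) can be sketched as follows. This is a minimal illustration, not the authors' pipeline: it assumes sequences are already aligned to equal length, uses a plain uncorrected p-distance, and the function names, prototype labels and 10-nt toy sequences are all hypothetical.

```python
def p_distance(a: str, b: str) -> float:
    """Proportion of differing sites between two aligned, equal-length sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    if not a:
        raise ValueError("sequences must not be empty")
    diffs = sum(1 for x, y in zip(a, b) if x != y)
    return diffs / len(a)


def closest_prototype(query, prototypes, demarcation=0.20):
    """Return (name, distance, within_demarcation) for the nearest prototype.

    demarcation=0.20 mirrors the approximate 20% nt difference mentioned in
    the abstract; treating it as a hard cutoff is a simplification.
    """
    name, dist = min(
        ((n, p_distance(query, s)) for n, s in prototypes.items()),
        key=lambda item: item[1],
    )
    return name, dist, dist < demarcation


# Toy 10-nt sequences (invented, for illustration only)
prototypes = {
    "proto_A": "ACGTACGTAC",
    "proto_B": "TTTTACGGGG",
}
result = closest_prototype("ACGTACGTAA", prototypes)
print(result)
```

Reporting the distance alongside the assigned prototype keeps the closely related serotypes (differences below 10%) visible rather than hiding them behind a single yes/no cutoff.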
Abstract:
Mutation and recombination are the fundamental processes leading to genetic variation in natural populations. This variation forms the raw material for evolution through natural selection and drift. Therefore, studying mutation rates may reveal information about evolutionary histories as well as phylogenetic interrelationships of organisms. In this thesis two molecular tools, DNA barcoding and the molecular clock, were examined. In the first part, the efficiency of mutations to delineate closely related species was tested and the implications for conservation practices were assessed. The second part investigated the proposition that a constant mutation rate exists within invertebrates, in the form of a metabolic-rate dependent molecular clock, which can be applied to accurately date speciation events. DNA barcoding aspires to be an efficient technique to not only distinguish between species but also reveal population-level variation, relying solely on mutations found on a short stretch of a single gene. In this thesis barcoding was applied to discriminate between Hylochares populations from Russian Karelia and new Hylochares findings from the greater Helsinki region in Finland. Although barcoding failed to delineate the two reproductively isolated groups, their distinct morphological features and differing life-history traits led to their classification as two closely related, although separate, species. The lack of genetic differentiation appears to be due to a recent divergence event not yet reflected in the beetles' molecular make-up. Thus, the Russian Hylochares was described as a new species. The Finnish species, previously considered as locally extinct, was recognized as endangered. Even if, due to their identical genetic make-up, the populations had been regarded as conspecific, conservation strategies based on prior knowledge from Russia would not have guaranteed the survival of the Finnish beetle.
Therefore, new conservation actions based on detailed studies of the biology and life-history of the Finnish Hylochares were conducted to protect this endemic rarity in Finland. The idea behind the strict molecular clock is that mutation rates are constant over evolutionary time and may thus be used to infer species divergence dates. However, one of the most recent theories argues that a strict clock does not tick per unit of time but that it has a constant substitution rate per unit of mass-specific metabolic energy. Therefore, according to this hypothesis, molecular clocks have to be recalibrated taking body size and temperature into account. This thesis tested the temperature effect on mutation rates in equally sized invertebrates. For the first dataset (family Eucnemidae, Coleoptera) the phylogenetic interrelationships and evolutionary history of the genus Arrhipis had to be inferred before the influence of temperature on substitution rates could be studied. Further, a second, larger invertebrate dataset (family Syrphidae, Diptera) was employed. Several methodological approaches, a number of genes and multiple molecular clock models revealed that there was no consistent relationship between temperature and mutation rate for the taxa under study. Thus, the body size effect, observed in vertebrates but controversial for invertebrates, rather than temperature may be the underlying driving force behind the metabolic-rate dependent molecular clock. Therefore, the metabolic-rate dependent molecular clock does not hold for the invertebrate groups studied here. This thesis emphasizes that molecular techniques relying on mutation rates have to be applied with caution. Whereas they may work satisfactorily under certain conditions for specific taxa, they may fail for others. The molecular clock as well as DNA barcoding should incorporate all the information and data available to obtain comprehensive estimations of the existing biodiversity and its evolutionary history.
Abstract:
Diffuse large B-cell lymphoma (DLBCL) is the most common of the non-Hodgkin lymphomas. As DLBCL is characterized by heterogeneous clinical and biological features, its prognosis varies. To date, the International Prognostic Index has been the strongest predictor of outcome for DLBCL patients. However, it does not take the biological characteristics of the disease into account. Gene expression profiling studies have identified two major cell-of-origin phenotypes in DLBCL with different prognoses, the favourable germinal centre B-cell-like (GCB) and the unfavourable activated B-cell-like (ABC) phenotypes. However, results of the prognostic impact of the immunohistochemically defined GCB and non-GCB distinction are controversial. Furthermore, since the addition of the CD20 antibody rituximab to chemotherapy has been established as the standard treatment of DLBCL, all molecular markers need to be evaluated in the post-rituximab era. In this study, we aimed to evaluate the predictive value of immunohistochemically defined cell-of-origin classification in DLBCL patients. The GCB and non-GCB phenotypes were defined according to the Hans algorithm (CD10, BCL6 and MUM1/IRF4) among 90 immunochemotherapy- and 104 chemotherapy-treated DLBCL patients. In the chemotherapy group, we observed a significant difference in survival between GCB and non-GCB patients, with a good and a poor prognosis, respectively. However, in the rituximab group, no prognostic value of the GCB phenotype was observed. Likewise, among 29 high-risk de novo DLBCL patients receiving high-dose chemotherapy and autologous stem cell transplantation, the survival of non-GCB patients was improved, but no difference in outcome was seen between GCB and non-GCB subgroups. Since the results suggested that the Hans algorithm was not applicable in immunochemotherapy-treated DLBCL patients, we aimed to further focus on algorithms based on ABC markers.
We examined the modified activated B-cell-like algorithm (based on MUM1/IRF4 and FOXP1), as well as a previously reported Muris algorithm (BCL2, CD10 and MUM1/IRF4), among 88 DLBCL patients uniformly treated with immunochemotherapy. Both algorithms distinguished the unfavourable ABC-like subgroup with a significantly inferior failure-free survival relative to the GCB-like DLBCL patients. Similarly, the results of the individual predictive molecular markers transcription factor FOXP1 and anti-apoptotic protein BCL2 have been inconsistent and should be assessed in immunochemotherapy-treated DLBCL patients. The markers were evaluated in a cohort of 117 patients treated with rituximab and chemotherapy. FOXP1 expression could not distinguish between patients with favourable and those with poor outcomes. In contrast, BCL2-negative DLBCL patients had significantly superior survival relative to BCL2-positive patients. Our results indicate that the immunohistochemically defined cell-of-origin classification in DLBCL has a prognostic impact in the immunochemotherapy era, when the identifying algorithms are based on ABC-associated markers. We also propose that BCL2 negativity is predictive of a favourable outcome. Further investigational efforts are, however, warranted to identify the molecular features of DLBCL that could enable individualized cancer therapy in routine patient care.
Abstract:
The prevalence of obesity is increasing at an alarming rate in all age groups worldwide. Obesity is a serious health problem due to increased risk of morbidity and mortality. Although environmental factors play a major role in the development of obesity, the identification of rare monogenic defects in human genes has confirmed that obesity has a strong genetic component. Mutations have been identified in genes encoding proteins of the leptin-melanocortin signaling system, which has an important role in the regulation of appetite and energy balance. The present study aimed at identifying mutations and genetic variations in the melanocortin receptors 2-5 and other genes active on the same signaling pathway accounting for severe early-onset obesity in children and morbid obesity in adults. The main achievement of this thesis was the identification of melanocortin-4 receptor (MC4R) mutations in Finnish patients. Six pathogenic MC4R mutations (308delT, P299H, two S127L and two -439delGC mutations) were identified, corresponding to a prevalence of 3% in severe early-onset obesity. No obesity-causing MC4R mutations were found among patients with adult-onset morbid obesity. The MC4R 308delT deletion is predicted to result in a grossly truncated nonfunctional receptor of only 107 amino acids. The C-terminal residues, which are important in MC4R cell surface targeting, are totally absent from the mutant 308delT receptor. In vitro functional studies supported a pathogenic role for the S127L mutation since agonist-induced signaling of the receptor was impaired. Cell membrane localization of the S127L receptor did not differ from that of the wild-type receptor, confirming that impaired function of the S127L receptor was due to reduced signaling properties. The P299H mutation leads to intracellular retention of the receptor. The -439delGC deletion is situated at a potential nescient helix-loop-helix 2 (NHLH2) -binding site in the MC4R promoter.
It was demonstrated that the transcription factor NHLH2 binds to the consensus sequence at the -439delGC site in vitro, possibly resulting in altered promoter activity. Several genetic variants were identified in the melanocortin-3 receptor (MC3R) and pro-opiomelanocortin (POMC) genes. These polymorphisms do not explain morbid obesity, but the results indicate that some of these genetic variations may be modifying factors in obesity, resulting in subtle changes in obesity-related traits. A risk haplotype for obesity was identified in the ectonucleotide pyrophosphatase phosphodiesterase 1 (ENPP1) gene through a candidate gene single nucleotide polymorphism (SNP) genotyping approach. An ENPP1 haplotype, composed of SNPs rs1800949 and rs943003, was shown to be significantly associated with morbid obesity in adults. Accordingly, the MC3R, POMC and ENPP1 genes represent examples of susceptibility genes in which genetic variants predispose to obesity. In conclusion, pathogenic mutations in the MC4R gene were shown to account for 3% of cases with severe early-onset obesity in Finland. This is in line with results from other populations demonstrating that mutations in the MC4R gene underlie 1-6% of morbid obesity worldwide. MC4R deficiency thus represents the most common monogenic defect causing human obesity reported so far. The severity of the MC4-receptor defect appears to be associated with time of onset and the degree of obesity. Classification of MC4R mutations may provide a useful tool when predicting the outcome of the disease. In addition, several other genetic variants conferring susceptibility to obesity were detected in the MC3R, MC4R, POMC and ENPP1 genes.
Molecular phylogeny and biogeography of langurs and leaf monkeys of South Asia (Primates: Colobinae)
Resumo:
The two recently proposed taxonomies of the langurs and leaf monkeys (Subfamily Colobinae) have different implications for our understanding of the evolution of Nilgiri and purple-faced langurs. Groves (2001) [Groves, C.P., 2001. Primate Taxonomy. Smithsonian Institution Press, Washington] placed Nilgiri and purple-faced langurs in the genus Trachypithecus, thereby suggesting a disjunct distribution of the genus Trachypithecus. Brandon-Jones et al. (2003) [Brandon-Jones, D., Eudey, A.A., Geissmann, T., Groves, C.P., Melnick, D.J., Morales, J.C., Shekelle, M., Stewart, C.-B., 2003. Asian primate classification. Int. J. Primatol. 25, 97–162] placed these langurs in the genus Semnopithecus, which suggests convergence of morphological characters in Nilgiri and purple-faced langurs with Trachypithecus. To test these scenarios, we sequenced and analyzed the mitochondrial cytochrome b gene and two nuclear DNA-encoded genes, lysozyme and protamine P1, from a variety of colobine species. All three markers support the clustering of Nilgiri and purple-faced langurs with the Hanuman langur (Semnopithecus), while the leaf monkeys of Southeast Asia (Trachypithecus) form a distinct clade. The phylogenetic position of capped and golden leaf monkeys is still unresolved. This species group may have evolved through past hybridization between the Semnopithecus and Trachypithecus clades.
Resumo:
Background: An overwhelming majority of the Serine/Threonine protein kinases identified by gleaning archaeal and eubacterial genomes could not be classified into any of the well-known Hanks and Hunter subfamilies of protein kinases. This is because the Hanks and Hunter classification scheme was developed from eukaryotic protein kinases, which are highly divergent from their prokaryotic homologues. A large dataset of prokaryotic Serine/Threonine protein kinases recognized from genomes of prokaryotes has been used to develop a classification framework for prokaryotic Ser/Thr protein kinases. Methodology/Principal Findings: We have used traditional sequence alignment and phylogenetic approaches and clustered the prokaryotic kinases into 72 subfamilies with at least 4 members in each. Such a clustering enables classification of prokaryotic Ser/Thr kinases, and it can be used as a framework to classify newly identified prokaryotic Ser/Thr kinases. After a series of searches in a comprehensive sequence database, we recognized that 38 subfamilies of prokaryotic protein kinases are associated with a specific taxonomic level. For example, 4, 6 and 3 subfamilies have been identified that are currently specific to the phyla Proteobacteria, Cyanobacteria and Actinobacteria, respectively. Similarly, subfamilies which are specific to an order, sub-order, class, family and genus have also been identified. In addition to these, we also identify organism-diverse subfamilies. Members of these clusters are from organisms of different taxonomic levels, such as archaea, bacteria, eukaryotes and viruses. Conclusion/Significance: Interestingly, the occurrence of several taxonomic-level-specific subfamilies of prokaryotic kinases contrasts with the classification of eukaryotic protein kinases, in which most of the popular subfamilies occur diversely in several eukaryotes.
Many prokaryotic Ser/Thr kinases exhibit a wide variety of modular organizations, which indicates a degree of complexity and protein-protein interactions in the signaling pathways in these microbes.
Resumo:
Background: Protein phosphorylation is a generic way to regulate signal transduction pathways in all kingdoms of life. In many organisms, it is achieved by the large family of Ser/Thr/Tyr protein kinases, which are traditionally classified into groups and subfamilies on the basis of the amino acid sequence of their catalytic domains. Many protein kinases are multidomain in nature, but the diversity of the accessory domains and their organization are usually not taken into account while classifying kinases into groups or subfamilies. Methodology: Here, we present an approach which considers amino acid sequences of complete gene products, in order to suggest refinements in sets of pre-classified sequences. The strategy is based on alignment-free similarity scores and iterative Area Under the Curve (AUC) computation. Similarity scores are computed by detecting common patterns between two sequences and scoring them using a substitution matrix, with a consistent normalization scheme. This allows us to handle full-length sequences, and implicitly takes into account domain diversity and domain shuffling. We quantitatively validate our approach on a subset of 212 human protein kinases. We then employ it on the complete repertoire of human protein kinases and suggest a few qualitative refinements in the subfamily assignment stored in the KinG database, which is based on catalytic domains only. Based on our new measure, we delineate 37 cases of potential hybrid kinases: sequences for which classical classification based entirely on catalytic domains is inconsistent with the full-length similarity scores computed here, which implicitly consider the multi-domain nature and regions outside the catalytic kinase domain. We also provide some examples of hybrid kinases of the protozoan parasite Entamoeba histolytica. Conclusions: The implicit consideration of multi-domain architectures is a valuable addition that complements other classification schemes.
The proposed algorithm may also be employed to classify other families of enzymes with multidomain architecture.
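As a rough illustration of the idea in the abstract above (scoring full-length sequences without alignment, then using AUC to check whether a sequence agrees with its assigned subfamily), here is a minimal Python sketch. It is not the authors' algorithm. The similarity function is a plain Jaccard index over shared k-mers, swapped in for their pattern detection with substitution-matrix scoring and normalization. The AUC is computed once rather than iteratively. The function names and the toy sequences are hypothetical.

```python
def kmer_set(seq, k=3):
    """Return the set of all overlapping k-mers in a sequence."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def similarity(a, b, k=3):
    """Alignment-free similarity: Jaccard index of the two k-mer sets.

    Assumption: a simple stand-in for the substitution-matrix-based
    score described in the abstract.
    """
    ka, kb = kmer_set(a, k), kmer_set(b, k)
    union = ka | kb
    return len(ka & kb) / len(union) if union else 0.0


def auc(pos_scores, neg_scores):
    """Probability that a random positive score exceeds a random
    negative score, counting ties as one half."""
    if not pos_scores or not neg_scores:
        raise ValueError("both score lists must be non-empty")
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in pos_scores
        for n in neg_scores
    )
    return wins / (len(pos_scores) * len(neg_scores))


def membership_auc(query_id, seqs, labels):
    """AUC of the query's similarity to members of its own subfamily
    (positives) versus members of all other subfamilies (negatives)."""
    own = labels[query_id]
    pos, neg = [], []
    for other_id, other_seq in seqs.items():
        if other_id == query_id:
            continue
        score = similarity(seqs[query_id], other_seq)
        (pos if labels[other_id] == own else neg).append(score)
    return auc(pos, neg)


# Toy data, invented for illustration only.
toy_seqs = {
    "a1": "MKVLAAGIVK",
    "a2": "MKVLAAGIVR",
    "b1": "GSPEELLRQW",
    "b2": "GSPEELLRQY",
}
toy_labels = {"a1": "A", "a2": "A", "b1": "B", "b2": "B"}
```

In this sketch, a value of `membership_auc` near 1 means the sequence is consistently closer to its own subfamily than to the others, while a low value would flag it as a candidate for reassignment or as a possible hybrid.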