93 resultados para Large-scale Analysis

em Université de Lausanne, Switzerland


Relevância:

100.00% 100.00%

Publicador:

Resumo:

Despite the high prevalence of colon cancer in the world and the great interest in targeted anti-cancer therapy, only few tumor-specific gene products have been identified that could serve as targets for the immunological treatment of colorectal cancers. The aim of our study was therefore to identify frequently expressed colon cancer-specific antigens. We performed a large-scale analysis of genes expressed in normal colon and colon cancer tissues isolated from colorectal cancer patients using massively parallel signal sequencing (MPSS). Candidates were additionally subjected to experimental evaluation by semi-quantitative RT-PCR on a cohort of colorectal cancer patients. From a pool of more than 6000 genes identified unambiguously in the analysis, we found 2124 genes that were selectively expressed in colon cancer tissue and 147 genes that were differentially expressed to a significant degree between normal and cancer cells. Differential expression of many genes was confirmed by RT-PCR on a cohort of patients. Despite the fact that deregulated genes were involved in many different cellular pathways, we found that genes expressed in the extracellular space were significantly over-represented in colorectal cancer. Strikingly, we identified a transcript from a chromosome X-linked member of the human endogenous retrovirus (HERV) H family that was frequently and selectively expressed in colon cancer but not in normal tissues. Our data suggest that this sequence should be considered as a target of immunological interventions against colorectal cancer.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Functional divergence between homologous proteins is expected to affect amino acid sequences in two main ways, which can be considered as proxies of biochemical divergence: a "covarion-like" pattern of correlated changes in evolutionary rates, and switches in conserved residues ("conserved but different"). Although these patterns have been used in case studies, a large-scale analysis is needed to estimate their frequency and distribution. We use a phylogenomic framework of animal genes to answer three questions: 1) What is the prevalence of such patterns? 2) Can we link such patterns at the amino acid level with selection inferred at the codon level? 3) Are patterns different between paralogs and orthologs? We find that covarion-like patterns are more frequently detected than "constant but different," but that only the latter are correlated with signal for positive selection. Finally, there is no obvious difference in patterns between orthologs and paralogs.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

High-throughput technologies are now used to generate more than one type of data from the same biological samples. To properly integrate such data, we propose using co-modules, which describe coherent patterns across paired data sets, and conceive several modular methods for their identification. We first test these methods using in silico data, demonstrating that the integrative scheme of our Ping-Pong Algorithm uncovers drug-gene associations more accurately when considering noisy or complex data. Second, we provide an extensive comparative study using the gene-expression and drug-response data from the NCI-60 cell lines. Using information from the DrugBank and the Connectivity Map databases we show that the Ping-Pong Algorithm predicts drug-gene associations significantly better than other methods. Co-modules provide insights into possible mechanisms of action for a wide range of drugs and suggest new targets for therapy

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Coronary artery disease (CAD) has a significant genetic contribution that is incompletely characterized. To complement genome-wide association (GWA) studies, we conducted a large and systematic candidate gene study of CAD susceptibility, including analysis of many uncommon and functional variants. We examined 49,094 genetic variants in ∼2,100 genes of cardiovascular relevance, using a customised gene array in 15,596 CAD cases and 34,992 controls (11,202 cases and 30,733 controls of European descent; 4,394 cases and 4,259 controls of South Asian origin). We attempted to replicate putative novel associations in an additional 17,121 CAD cases and 40,473 controls. Potential mechanisms through which the novel variants could affect CAD risk were explored through association tests with vascular risk factors and gene expression. We confirmed associations of several previously known CAD susceptibility loci (eg, 9p21.3:p<10(-33); LPA:p<10(-19); 1p13.3:p<10(-17)) as well as three recently discovered loci (COL4A1/COL4A2, ZC3HC1, CYP17A1:p<5×10(-7)). However, we found essentially null results for most previously suggested CAD candidate genes. In our replication study of 24 promising common variants, we identified novel associations of variants in or near LIPA, IL5, TRIB1, and ABCG5/ABCG8, with per-allele odds ratios for CAD risk with each of the novel variants ranging from 1.06-1.09. Associations with variants at LIPA, TRIB1, and ABCG5/ABCG8 were supported by gene expression data or effects on lipid levels. Apart from the previously reported variants in LPA, none of the other ∼4,500 low frequency and functional variants showed a strong effect. Associations in South Asians did not differ appreciably from those in Europeans, except for 9p21.3 (per-allele odds ratio: 1.14 versus 1.27 respectively; P for heterogeneity = 0.003). This large-scale gene-centric analysis has identified several novel genes for CAD that relate to diverse biochemical and cellular functions and clarified the literature with regard to many previously suggested genes.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Initial topography and inherited structural discontinuities are known to play a dominant role in rock slope stability. Previous 2-D physical modeling results demonstrated that even if few preexisting fractures are activated/propagated during gravitational failure all of those heterogeneities had a great influence on mobilized volume and its kinematics. The question we address in the present study is to determine if such a result is also observed in 3-D. As in 2-D previous models we examine geologically stable model configuration, based upon the well documented landslide at Randa, Switzerland. The 3-D models consisted of a homogeneous material in which several fracture zones were introduced in order to study simplified but realistic configurations of discontinuities (e.g. based on natural example rather than a parametric study). Results showed that the type of gravitational failure (deep-seated landslide or sequential failure) and resulting slope morphology evolution are the result of the interplay of initial topography and inherited preexisting fractures (orientation and density). The three main results are i) the initial topography exerts a strong control on gravitational slope failure. Indeed in each tested configuration (even in the isotropic one without fractures) the model is affected by a rock slide, ii) the number of simulated fracture sets greatly influences the volume mobilized and its kinematics, and iii) the failure zone involved in the 1991 event is smaller than the results produced by the analog modeling. This failure may indicate that the zone mobilized in 1991 is potentially only a part of a larger deep-seated landslide and/or wider deep seated gravitational slope deformation.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

In the vast majority of bottom-up proteomics studies, protein digestion is performed using only mammalian trypsin. Although it is clearly the best enzyme available, the sole use of trypsin rarely leads to complete sequence coverage, even for abundant proteins. It is commonly assumed that this is because many tryptic peptides are either too short or too long to be identified by RPLC-MS/MS. We show through in silico analysis that 20-30% of the total sequence of three proteomes (Schizosaccharomyces pombe, Saccharomyces cerevisiae, and Homo sapiens) is expected to be covered by Large post-Trypsin Peptides (LpTPs) with M(r) above 3000 Da. We then established size exclusion chromatography to fractionate complex yeast tryptic digests into pools of peptides based on size. We found that secondary digestion of LpTPs followed by LC-MS/MS analysis leads to a significant increase in identified proteins and a 32-50% relative increase in average sequence coverage compared to trypsin digestion alone. Application of the developed strategy to analyze the phosphoproteomes of S. pombe and of a human cell line identified a significant fraction of novel phosphosites. Overall our data indicate that specific targeting of LpTPs can complement standard bottom-up workflows to reveal a largely neglected portion of the proteome.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

BACKGROUND: The recent large randomized controlled trial of glutamine and antioxidant supplementation suggested that high-dose glutamine is associated with increased mortality in critically ill patients with multiorgan failure. The objectives of the present analyses were to reevaluate the effect of supplementation after controlling for baseline covariates and to identify potentially important subgroup effects. MATERIALS AND METHODS: This study was a post hoc analysis of a prospective factorial 2 × 2 randomized trial conducted in 40 intensive care units in North America and Europe. In total, 1223 mechanically ventilated adult patients with multiorgan failure were randomized to receive glutamine, antioxidants, both glutamine and antioxidants, or placebo administered separate from artificial nutrition. We compared each of the 3 active treatment arms (glutamine alone, antioxidants alone, and glutamine + antioxidants) with placebo on 28-day mortality. Post hoc, treatment effects were examined within subgroups defined by baseline patient characteristics. Logistic regression was used to estimate treatment effects within subgroups after adjustment for baseline covariates and to identify treatment-by-subgroup interactions (effect modification). RESULTS: The 28-day mortality rates in the placebo, glutamine, antioxidant, and combination arms were 25%, 32%, 29%, and 33%, respectively. After adjusting for prespecified baseline covariates, the adjusted odds ratio of 28-day mortality vs placebo was 1.5 (95% confidence interval, 1.0-2.1, P = .05), 1.2 (0.8-1.8, P = .40), and 1.4 (0.9-2.0, P = .09) for glutamine, antioxidant, and glutamine plus antioxidant arms, respectively. In the post hoc subgroup analysis, both glutamine and antioxidants appeared most harmful in patients with baseline renal dysfunction. No subgroups suggested reduced mortality with supplements. CONCLUSIONS: After adjustment for baseline covariates, early provision of high-dose glutamine administered separately from artificial nutrition was not beneficial and may be associated with increased mortality in critically ill patients with multiorgan failure. For both glutamine and antioxidants, the greatest potential for harm was observed in patients with multiorgan failure that included renal dysfunction upon study enrollment.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

PURPOSE: The Cancer Vaccine Consortium of the Cancer Research Institute (CVC-CRI) conducted a multicenter HLA-peptide multimer proficiency panel (MPP) with a group of 27 laboratories to assess the performance of the assay. EXPERIMENTAL DESIGN: Participants used commercially available HLA-peptide multimers and a well characterized common source of peripheral blood mononuclear cells (PBMC). The frequency of CD8+ T cells specific for two HLA-A2-restricted model antigens was measured by flow cytometry. The panel design allowed for participants to use their preferred staining reagents and locally established protocols for both cell labeling, data acquisition and analysis. RESULTS: We observed significant differences in both the performance characteristics of the assay and the reported frequencies of specific T cells across laboratories. These results emphasize the need to identify the critical variables important for the observed variability to allow for harmonization of the technique across institutions. CONCLUSIONS: Three key recommendations emerged that would likely reduce assay variability and thus move toward harmonizing of this assay. (1) Use of more than two colors for the staining (2) collect at least 100,000 CD8 T cells, and (3) use of a background control sample to appropriately set the analytical gates. We also provide more insight into the limitations of the assay and identified additional protocol steps that potentially impact the quality of data generated and therefore should serve as primary targets for systematic analysis in future panels. Finally, we propose initial guidelines for harmonizing assay performance which include the introduction of standard operating protocols to allow for adequate training of technical staff and auditing of test analysis procedures.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Although age-dependent effects on blood pressure (BP) have been reported, they have not been systematically investigated in large-scale genome-wide association studies (GWASs). We leveraged the infrastructure of three well-established consortia (CHARGE, GBPgen, and ICBP) and a nonstandard approach (age stratification and metaregression) to conduct a genome-wide search of common variants with age-dependent effects on systolic (SBP), diastolic (DBP), mean arterial (MAP), and pulse (PP) pressure. In a two-staged design using 99,241 individuals of European ancestry, we identified 20 genome-wide significant (p ≤ 5 × 10(-8)) loci by using joint tests of the SNP main effect and SNP-age interaction. Nine of the significant loci demonstrated nominal evidence of age-dependent effects on BP by tests of the interactions alone. Index SNPs in the EHBP1L1 (DBP and MAP), CASZ1 (SBP and MAP), and GOSR2 (PP) loci exhibited the largest age interactions, with opposite directions of effect in the young versus the old. The changes in the genetic effects over time were small but nonnegligible (up to 1.58 mm Hg over 60 years). The EHBP1L1 locus was discovered through gene-age interactions only in whites but had DBP main effects replicated (p = 8.3 × 10(-4)) in 8,682 Asians from Singapore, indicating potential interethnic heterogeneity. A secondary analysis revealed 22 loci with evidence of age-specific effects (e.g., only in 20 to 29-year-olds). Age can be used to select samples with larger genetic effect sizes and more homogenous phenotypes, which may increase statistical power. Age-dependent effects identified through novel statistical approaches can provide insight into the biology and temporal regulation underlying BP associations.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Functionally relevant large scale brain dynamics operates within the framework imposed by anatomical connectivity and time delays due to finite transmission speeds. To gain insight on the reliability and comparability of large scale brain network simulations, we investigate the effects of variations in the anatomical connectivity. Two different sets of detailed global connectivity structures are explored, the first extracted from the CoCoMac database and rescaled to the spatial extent of the human brain, the second derived from white-matter tractography applied to diffusion spectrum imaging (DSI) for a human subject. We use the combination of graph theoretical measures of the connection matrices and numerical simulations to explicate the importance of both connectivity strength and delays in shaping dynamic behaviour. Our results demonstrate that the brain dynamics derived from the CoCoMac database are more complex and biologically more realistic than the one based on the DSI database. We propose that the reason for this difference is the absence of directed weights in the DSI connectivity matrix.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Aim. To predict the fate of alpine interactions involving specialized species, using a monophagous beetle and its host-plant as a case study. Location. The Alps. Methods. We investigated genetic structuring of the herbivorous beetle Oreina gloriosa and its specific host-plant Peucedanum ostruthium. We used genome fingerprinting (in the insect and the plant) and sequence data (in the insect) to compare the distribution of the main gene pools in the two associated species and to estimate divergence time in the insect, a proxy for the temporal origin of the interaction. We quantified the similarity in spatial genetic structures by performing a Procrustes analysis, a tool from the shape theory. Finally, we simulated recolonization of an empty space analogous to the deglaciated Alps just after ice retreat by two lineages from two species showing unbalanced dependence, to examine how timing of the recolonization process, as well as dispersal capacities of associated species, could explain the observed pattern. Results. Contrasting with expectations based on their asymmetrical dependence, patterns in the beetle and plant were congruent at a large scale. Exceptions occurred at a regional scale in areas of admixture, matching known suture zones in Alpine plants. Simulations using a lattice-based model suggested these empirical patterns arose during or soon after recolonization, long after the estimated origin of the interaction c. 0.5 million years ago. Main conclusions. Species-specific interactions are scarce in alpine habitats because glacial cycles have limited opportunities for coevolution. Their fate, however, remains uncertain under climate change. Here we show that whereas most dispersal routes are paralleled at large scale, regional incongruence implies that the destinies of the species might differ under changing climate. This may be a consequence of the host-dependence of the beetle that locally limits the establishment of dispersing insects.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

AbstractIn addition to genetic changes affecting the function of gene products, changes in gene expression have been suggested to underlie many or even most of the phenotypic differences among mammals. However, detailed gene expression comparisons were, until recently, restricted to closely related species, owing to technological limitations. Thus, we took advantage of the latest technologies (RNA-Seq) to generate extensive qualitative and quantitative transcriptome data for a unique collection of somatic and germline tissues from representatives of all major mammalian lineages (placental mammals, marsupials and monotremes) and birds, the evolutionary outgroup.In the first major project of my thesis, we performed global comparative analyses of gene expression levels based on these data. Our analyses provided fundamental insights into the dynamics of transcriptome change during mammalian evolution (e.g., the rate of expression change across species, tissues and chromosomes) and allowed the exploration of the functional relevance and phenotypic implications of transcription changes at a genome-wide scale (e.g., we identified numerous potentially selectively driven expression switches).In a second project of my thesis, which was also based on the unique transcriptome data generated in the context of the first project we focused on the evolution of alternative splicing in mammals. Alternative splicing contributes to transcriptome complexity by generating several transcript isoforms from a single gene, which can, thus, perform various functions. To complete the global comparative analysis of gene expression changes, we explored patterns of alternative splicing evolution. This work uncovered several general and unexpected patterns of alternative splicing evolution (e.g., we found that alternative splicing evolves extremely rapidly) as well as a large number of conserved alternative isoforms that may be crucial for the functioning of mammalian organs.Finally, the third and final project of my PhD consisted in analyzing in detail the unique functional and evolutionary properties of the testis by exploring the extent of its transcriptome complexity. This organ was previously shown to evolve rapidly both at the phenotypic and molecular level, apparently because of the specific pressures that act on this organ and are associated with its reproductive function. Moreover, my analyses of the amniote tissue transcriptome data described above, revealed strikingly widespread transcriptional activity of both functional and nonfunctional genomic elements in the testis compared to the other organs. To elucidate the cellular source and mechanisms underlying this promiscuous transcription in the testis, we generated deep coverage RNA-Seq data for all major testis cell types as well as epigenetic data (DNA and histone methylation) using the mouse as model system. The integration of these complete dataset revealed that meiotic and especially post-meiotic germ cells are the major contributors to the widespread functional and nonfunctional transcriptome complexity of the testis, and that this "promiscuous" spermatogenic transcription is resulting, at least partially, from an overall transcriptionally permissive chromatin state. We hypothesize that this particular open state of the chromatin results from the extensive chromatin remodeling that occurs during spermatogenesis which ultimately leads to the replacement of histones by protamines in the mature spermatozoa. Our results have important functional and evolutionary implications (e.g., regarding new gene birth and testicular gene expression evolution).Generally, these three large-scale projects of my thesis provide complete and massive datasets that constitute valuables resources for further functional and evolutionary analyses of mammalian genomes.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

MOTIVATION: Analysis of millions of pyro-sequences is currently playing a crucial role in the advance of environmental microbiology. Taxonomy-independent, i.e. unsupervised, clustering of these sequences is essential for the definition of Operational Taxonomic Units. For this application, reproducibility and robustness should be the most sought after qualities, but have thus far largely been overlooked. RESULTS: More than 1 million hyper-variable internal transcribed spacer 1 (ITS1) sequences of fungal origin have been analyzed. The ITS1 sequences were first properly extracted from 454 reads using generalized profiles. Then, otupipe, cd-hit-454, ESPRIT-Tree and DBC454, a new algorithm presented here, were used to analyze the sequences. A numerical assay was developed to measure the reproducibility and robustness of these algorithms. DBC454 was the most robust, closely followed by ESPRIT-Tree. DBC454 features density-based hierarchical clustering, which complements the other methods by providing insights into the structure of the data. AVAILABILITY: An executable is freely available for non-commercial users at ftp://ftp.vital-it.ch/tools/dbc454. It is designed to run under MPI on a cluster of 64-bit Linux machines running Red Hat 4.x, or on a multi-core OSX system. CONTACT: dbc454@vital-it.ch or nicolas.guex@isb-sib.ch.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Through genome-wide association meta-analyses of up to 133,010 individuals of European ancestry without diabetes, including individuals newly genotyped using the Metabochip, we have increased the number of confirmed loci influencing glycemic traits to 53, of which 33 also increase type 2 diabetes risk (q < 0.05). Loci influencing fasting insulin concentration showed association with lipid levels and fat distribution, suggesting impact on insulin resistance. Gene-based analyses identified further biologically plausible loci, suggesting that additional loci beyond those reaching genome-wide significance are likely to represent real associations. This conclusion is supported by an excess of directionally consistent and nominally significant signals between discovery and follow-up studies. Functional analysis of these newly discovered loci will further improve our understanding of glycemic control.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Les larves aquatiques d'éphémères (Ephemeroptera) colonisent toutes les eaux douces du monde et sont couramment utilisées comme bio-indicateurs de la qualité de l'eau. Le genre Rhithrogena (Heptageniidae) est le deuxième plus diversifié chez les éphémères, et plusieurs espèces européennes ont une distribution restreinte dans des environnements alpins sensibles. Les espèces de Rhithrogena ont été classées en "groupes d'espèces" faciles à identifier. Cependant, malgré leur importance écologique et en terme de conservation, beaucoup d'espèces présentent des différences morphologiques ambiguës, suggérant que lataxonomie actuelle ne refléterait pas correctement leur diversité évolutive. De plus, aucune information sur leurs relations, leur origine, le taux de spéciation ou les mécanismes ayant provoqué leur remarquable diversification dans les Alpes n'est disponible. Nous avons d'abord examiné le statut spécifique d'environ 50% des espèces européennes de Rhithrogena en utilisant un large échantillonnage de populations alpines incluant 22 localités typiques, ainsi qu'une analyse basée sur le modèle général mixte de Yule et de coalescence (GMYC) appliqué à un gène mitochondrial standard (coxl) et à un gène nucléaire développé spécifiquement pour cette étude. Nous avons observé un regroupement significatif des séquences coxl en 31 espèces potentielles, et nos résultats ont fortement suggéré la présence d'espèces cryptiques et de fractionnements taxonomiques excessifs chez les Rhithrogena. Nos analyses phylogénétiques ont démontré la monophylie de quatre des six groupes d'espèces reconnus présents dans notre échantillonnage. La taxonomie ADN développée dans cette étude pose les bases d'une future révision de ce genre important mais cryptique en Europe. Puis nous avons mené une étude phylogénétique multi-gènes entre les espèces européennes de Rhithrogena. Les données provenant de trois gènes nucléaires et de deux gènes mitochondriaux ont été largement concordantes, et les relations entre les espèces bien résolues au sein de la plupart des groupes d'espèces dans une analyse combinant tous les gènes. En l'absence de points de calibration extérieurs tels que des fossiles, nous avons appliqué à nos données mitochondriales une horloge moléculaire standard pour les insectes, suggérant une origine des Rhithrogena alpins à la limite Oligocène / Miocène. Nos résultats ont montré le rôle prépondérant qu'ont joué les glaciations du quaternaire dans leur diversification, favorisant la spéciation d'au moins la moitié des espèces actuelle dans les Alpes. La biodiversité et le taux d'endémisme à Madagascar, notamment au niveau de la faune des eaux douces, sont parmi les plus extraordinaires et les plus menacés au monde. On pense que beaucoup d'espèces d'éphémères sont restreintes à un seul bassin versant (microendémisme) dans les zones forestières, ce qui les rendrait particulièrement sensibles à la réduction et à la dégradation de leur habitat. Mis à part deux espèces décrites, Afronurus matitensis et Compsoneuria josettae, les Heptageniidae sont pratiquement inconnus à Madagascar. Les deux genres ont une distribution discontinue en Afrique, à Madagascar et en Asie du Sud-Est, et leur taxonomie complexe est régulièrement révisée. L'approche standard pour comprendre leur diversité, leur endémisme et leur origine requerrait un échantillonnage étendu sur plusieurs continents et des années de travaux taxonomiques. Pour accélérer le processus, nous avons utilisé des collections de musées ainsi que des individus fraîchement collectés, et appliqué une approche combinant taxonomie ADN et phylogénie. L'analyses GMYC du gène coxl a délimité 14 espèces potentielles à Madagascar, dont 70% vraisemblablement microendémiques. Une analyse phylogénique incluant des espèces africaines et asiatiques portant sur deux gènes mitochondriaux et quatre gènes nucléaires a montré que les Heptageniidae malgaches sont monophylétiques et groupe frère des Compsoneuria africains. L'existence de cette lignée unique, ainsi qu'un taux élevé de microendémisme, mettent en évidence leur importance en terme de conservation. Nos résultats soulignent également le rôle important que peuvent jouer les collections de musées dans les études moléculaires et en conservation. - Aquatic nymphs of mayflies (Ephemeroptera) colonize all types of freshwaters throughout the world and are extensively used as bio-indicators of water quality. Rhithrogena (Heptageniidae) is the second most species-rich genus of mayflies, and several European species have restricted distributions in sensitive Alpine environments and therefore are of conservation interest. The European Rhithrogena species are arranged into "species groups" that are easily identifiable. However, despite their ecological and conservation importance, ambiguous morphological differences among many species suggest that the current taxonomy may not accurately reflect their evolutionary diversity. Moreover, no information about their relationships, origin, timing of speciation and mechanisms promoting their successful diversification in the Alps is available. We first examined the species status of ca. 50% of European Rhithrogena diversity using a widespread sampling scheme of Alpine species that included 22 type localities, general mixed Yule- coalescent (GMYC) model analysis of one standard mitochondrial (coxl) and one newly developed nuclear marker. We observed significant clustering of coxl into 31 GMYC species, and our results strongly suggest the presence of both cryptic diversity and taxonomic oversplitting in Rhithrogena. Phylogenetic analyses recovered four of the six recognized species groups in our samples as monophyletic. The DNA taxonomy developed here lays the groundwork for a future revision of this important but cryptic genus in Europe. Then we conducted a species-level, multiple-gene phylogenetic study of European Rhithrogena. Data from three nuclear and two mitochondrial loci were broadly congruent, and species-level relationships were well resolved within most species groups in a combined analysis. In the absence of external calibration points like fossils, we applied a standard insect molecular clock hypothesis to our mitochondrial data, suggesting an origin of Alpine Rhithrogena in the Oligocene / Miocene boundary. Our results highlighted the preponderant role that quaternary glaciations played in their diversification, promoting speciation of at least half of the current diversity in the Alps. Madagascar's biodiversity and endemism are among the most extraordinary and endangered in the world. This includes the island's freshwater biodiversity, although detailed knowledge of the diversity, endemism, and biogeographic origin of freshwater invertebrates is lacking. Many mayfly species are thought to be restricted to single river basins (microendemic species) in forested areas, making them particularly sensitive to habitat reduction and degradation. The Heptageniidae are practically unknown in Madagascar except for two described species, Afronurus matitensis and Compsoneuria josettae. Both genera have a disjunct distribution in Africa, Madagascar and Southeast Asia, and a complex taxonomic status still in flux. The standard approach to understanding their diversity, endemism, and origin would require extensive field sampling on several continents and years of taxonomic work. Here we circumvent this using museum collections and freshly collected individuals in a combined approach of DNA taxonomy and phylogeny. The cox/-based GMYC analysis revealed 14 putative species on Madagascar, 70% of which potentially microendemics. A phylogenetic analysis that included African and Asian species and data from two mitochondrial and four nuclear loci indicated the Malagasy Heptageniidae are monophyletic and sister to African Compsoneuria. The observed monophyly and high microendemism highlight their conservation importance. Our results also underline the important role that museum collections can play in molecular studies, especially in critically endangered biodiversity hotspots like Madagascar.