928 resultados para genomics


Relevância:

10.00% 10.00%

Publicador:

Resumo:

Cloud computing provides a promising solution to the genomics data deluge problem resulting from the advent of next-generation sequencing (NGS) technology. Based on the concepts of “resources-on-demand” and “pay-as-you-go”, scientists with no or limited infrastructure can have access to scalable and cost-effective computational resources. However, the large size of NGS data causes a significant data transfer latency from the client’s site to the cloud, which presents a bottleneck for using cloud computing services. In this paper, we provide a streaming-based scheme to overcome this problem, where the NGS data is processed while being transferred to the cloud. Our scheme targets the wide class of NGS data analysis tasks, where the NGS sequences can be processed independently from one another. We also provide the elastream package that supports the use of this scheme with individual analysis programs or with workflow systems. Experiments presented in this paper show that our solution mitigates the effect of data transfer latency and saves both time and cost of computation.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

BACKGROUND: Enterococcus faecalis has emerged as a major hospital pathogen. To explore its diversity, we sequenced E. faecalis strain OG1RF, which is commonly used for molecular manipulation and virulence studies. RESULTS: The 2,739,625 base pair chromosome of OG1RF was found to contain approximately 232 kilobases unique to this strain compared to V583, the only publicly available sequenced strain. Almost no mobile genetic elements were found in OG1RF. The 64 areas of divergence were classified into three categories. First, OG1RF carries 39 unique regions, including 2 CRISPR loci and a new WxL locus. Second, we found nine replacements where a sequence specific to V583 was substituted by a sequence specific to OG1RF. For example, the iol operon of OG1RF replaces a possible prophage and the vanB transposon in V583. Finally, we found 16 regions that were present in V583 but missing from OG1RF, including the proposed pathogenicity island, several probable prophages, and the cpsCDEFGHIJK capsular polysaccharide operon. OG1RF was more rapidly but less frequently lethal than V583 in the mouse peritonitis model and considerably outcompeted V583 in a murine model of urinary tract infections. CONCLUSION: E. faecalis OG1RF carries a number of unique loci compared to V583, but the almost complete lack of mobile genetic elements demonstrates that this is not a defining feature of the species. Additionally, OG1RF's effects in experimental models suggest that mediators of virulence may be diverse between different E. faecalis strains and that virulence is not dependent on the presence of mobile genetic elements.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

In chronic lymphocytic leukemia (CLL), one of the best predictors of outcome is the somatic mutation status of the immunoglobulin heavy chain variable region (IGHV) genes. Patients whose CLL cells have unmutated IGHV genes have a median survival of 8 years; those with mutated IGHV genes have a median survival of 25 years. To identify new prognostic biomarkers and molecular targets for therapy in untreated CLL patients, we reanalyzed the raw data from four published gene expression profiling microarray studies. Of 88 candidate biomarkers associated with IGHV somatic mutation status, we identified LDOC1 (Leucine Zipper, Down-regulated in Cancer 1), as one of the most significantly differentially expressed genes that distinguished mutated from unmutated CLL cases. LDOC1 is a putative transcription factor of unknown function in B-cell development and CLL pathophysiology. Using a highly sensitive quantitative RT-PCR (QRT-PCR) assay, we confirmed that LDOC1 mRNA was dramatically down-regulated in mutated compared to unmutated CLL cases. Expression of LDOC1 mRNA was also vii strongly associated with other markers of poor prognosis, including ZAP70 protein and cytogenetic abnormalities of poor prognosis (deletions of chromosomes 6q21, 11q23, and 17p13.1, and trisomy 12). CLL cases positive for LDOC1 mRNA had significantly shorter overall survival than negative cases. Moreover, in a multivariate model, LDOC1 mRNA expression predicted overall survival better than IGHV mutation status or ZAP70 protein, among the best markers of prognosis in CLL. We also discovered LDOC1S, a new LDOC1 splice variant. Using isoform-specific QRT-PCR assays that we developed, we found that both isoforms were expressed in normal B cells (naïve > memory), unmutated CLL cells, and in B-cell non-Hodgkin lymphomas with unmutated IGHV genes. To investigate pathways in which LDOC1 is involved, we knocked down LDOC1 in HeLa cells and performed global gene expression profiling. GFI1 (Growth Factor-Independent 1) emerged as a significantly up-regulated gene in both HeLa cells and CLL cells that expressed high levels of LDOC1. GFI1 oncoprotein is implicated in hematopoietic stem cell maintenance, lymphocyte development, and lymphomagenesis. Our findings indicate that LDOC1 mRNA is an excellent biomarker of overall survival in CLL, and may contribute to B-cell differentiation and malignant transformation.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

The past decade has seen the rise of high resolution datasets. One of the main surprises of analysing such data has been the discovery of a large genetic, phenotypic and behavioural variation and heterogeneous metabolic rates among individuals within natural populations. A parallel discovery from theory and experiments has shown a strong temporal convergence between evolutionary and ecological dynamics, but a general framework to analyse from individual-level processes the convergence between ecological and evolutionary dynamics and its implications for patterns of biodiversity in food webs has been particularly lacking. Here, as a first approximation to take into account intraspecific variability and the convergence between the ecological and evolutionary dynamics in large food webs, we develop a model from population genomics and microevolutionary processes that uses sexual reproduction, genetic-distance-based speciation and trophic interactions. We confront the model with the prey consumption per individual predator, species-level connectance and prey–predator diversity in several environmental situations using a large food web with approximately 25,000 sampled prey and predator individuals. We show higher than expected diversity of abundant species in heterogeneous environmental conditions and strong deviations from the observed distribution of individual prey consumption (i.e. individual connectivity per predator) in all the environmental conditions. The observed large variance in individual prey consumption regardless of the environmental variability collapsed species-level connectance after small increases in sampling effort. These results suggest (1) intraspecific variance in prey–predator interactions has a strong effect on the macroscopic properties of food webs and (2) intraspecific variance is a potential driver regulating the speed of the convergence between ecological and evolutionary dynamics in species-rich food webs. These results also suggest that genetic–ecological drift driven by sexual reproduction, equal feeding rate among predator individuals, mutations and genetic-distance-based speciation can be used as a neutral food web dynamics test to detect the ecological and microevolutionary processes underlying the observed patterns of individual and species-based food webs at local and macroecological scales.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

CONTRIBUTION OF ECTODOMAIN MUTATIONS IN EPIDERMAL GROWTH FACTOR RECEPTOR TO SIGNALING IN GLIOBLASTOMA MULTIFORME Publication No._________ Marta Rojas, M.S. Supervisory Professor: Oliver Bögler, Ph.D. The Cancer Genome Atlas (TCGA) has conducted a comprehensive analysis of a large tumor cohort and has cataloged genetic alterations involving primary sequence variations and copy number aberrations of genes involved in key signaling pathways in glioblastoma (GBM). This dataset revealed missense ectodomain point mutations in epidermal growth factor receptor (EGFR), but the biological and clinical significance of these mutations is not well defined in the context of gliomas. In our study, we focused on understanding and defining the molecular mechanisms underlying the functions of EGFR ectodomain mutants. Using proteomic approaches to broadly analyze cell signaling, including antibody array and mass spectrometry-based methods, we found a differential spectrum of tyrosine phosphorylation across the EGFR ectodomain mutations that enabled us to stratify them into three main groups that correlate with either wild type EGFR (EGFR) or the long-studied mutant, EGFRvIII. Interestingly, one mutant shared characteristics of both groups suggesting a continuum of behaviors along which different mutants fall. Surprisingly, no substantial differences were seen in activation of classical downstream signaling pathways such as Akt and S6 pathways between these classes of mutants. Importantly, we demonstrated that ectodomain mutations lead to differential tumor growth capabilities in both in vitro (anchorage independent colony formation) and in vivo conditions (xenografts). Our data from the biological characterization allowed us to categorize the mutants into three main groups: the first group typified by EGFRvIII are mutations with a more aggressive phenotype including R108K and A289T; a second group characterized by a less aggressive phenotype exemplified by EGFR and the T263P mutation; and a third group which shared characteristics from both groups and is exemplified by the mutation A289D. In addition, we treated cells overexpressing the mutants with various agents employed in the clinic including temozolomide, cisplatin and tarceva. We found that cells overexpressing the mutants in general displayed resistance to the treatments. Our findings yield insights that help with the molecular characterization of these mutants. In addition, our results from the drug studies might be valuable in explaining differential responses to specific treatments in GBM patients.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Clinical peptidomics and metabolomics are two emerging "-omics" technologies with the potential not only to detect disease-specific markers, but also to give insight into the disease dependency of degradation processes and metabolic pathway alterations. However, despite their rapid evolution and major investments, a clinical breakthrough, such as the approval of a major cancer biomarker, is still out of sight. What are the reasons for this failure? In this review we focus on three important factors: sensitivity, specificity and the avoidance of bias. The way to clinical implementation of peptidomics and metabolomics is still hampered by many of the problems that had to be solved for genomics and proteomics in the past, as well as new ones that require the creation of new analytic, computational and interpretative techniques. The greatest challenge, however, will be the integration of information from different "-omics" subdisciplines into straightforward answers to clinical questions, for example, in the form of new, superior "meta-markers".

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Vector control is the mainstay of malaria control programmes. Successful vector control profoundly relies on accurate information on the target mosquito populations in order to choose the most appropriate intervention for a given mosquito species and to monitor its impact. An impediment to identify mosquito species is the existence of morphologically identical sibling species that play different roles in the transmission of pathogens and parasites. Currently PCR diagnostics are used to distinguish between sibling species. PCR based methods are, however, expensive, time-consuming and their development requires a priori DNA sequence information. Here, we evaluated an inexpensive molecular proteomics approach for Anopheles species: matrix assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF MS). MALDI-TOF MS is a well developed protein profiling tool for the identification of microorganisms but so far has received little attention as a diagnostic tool in entomology. We measured MS spectra from specimens of 32 laboratory colonies and 2 field populations representing 12 Anopheles species including the A. gambiae species complex. An important step in the study was the advancement and implementation of a bioinformatics approach improving the resolution over previously applied cluster analysis. Borrowing tools for linear discriminant analysis from genomics, MALDI-TOF MS accurately identified taxonomically closely related mosquito species, including the separation between the M and S molecular forms of A. gambiae sensu stricto. The approach also classifies specimens from different laboratory colonies; hence proving also very promising for its use in colony authentication as part of quality assurance in laboratory studies. While being exceptionally accurate and robust, MALDI-TOF MS has several advantages over other typing methods, including simple sample preparation and short processing time. As the method does not require DNA sequence information, data can also be reviewed at any later stage for diagnostic or functional patterns without the need for re-designing and re-processing biological material.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Metabolomics is the global and unbiased survey of the complement of small molecules (say, <1 kDa) in a biofluid, tissue, organ or organism and measures the end-products of the cellular metabolism of both endogenous and exogenous substrates. Many drug candidates fail during Phase II and III clinical trials at an enormous cost to the pharmaceutical industry in terms of both time lost and of financial resources. The constantly evolving model of drug development now dictates that biomarkers should be employed in preclinical development for the early detection of likely-to-fail candidates. Biomarkers may also be useful in the preselection of patients and through the subclassification of diseases in clinical drug development. Here we show with examples how metabolomics can assist in the preclinical development phases of discovery, pharmacology, toxicology, and ADME. Although not yet established as a clinical trial patient prescreening procedure, metabolomics shows considerable promise in this regard. We can be certain that metabolomics will join genomics and transcriptomics in lubricating the wheels of clinical drug development in the near future.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

DNA-based parentage determination accelerates genetic improvement in sheep by increasing pedigree accuracy. Single nucleotide polymorphism (SNP) markers can be used for determining parentage and to provide unique molecular identifiers for tracing sheep products to their source. However, the utility of a particular "parentage SNP" varies by breed depending on its minor allele frequency (MAF) and its sequence context. Our aims were to identify parentage SNPs with exceptional qualities for use in globally diverse breeds and to develop a subset for use in North American sheep. Starting with genotypes from 2,915 sheep and 74 breed groups provided by the International Sheep Genomics Consortium (ISGC), we analyzed 47,693 autosomal SNPs by multiple criteria and selected 163 with desirable properties for parentage testing. On average, each of the 163 SNPs was highly informative (MAF≥0.3) in 48±5 breed groups. Nearby polymorphisms that could otherwise confound genetic testing were identified by whole genome and Sanger sequencing of 166 sheep from 54 breed groups. A genetic test with 109 of the 163 parentage SNPs was developed for matrix-assisted laser desorption/ionization-time-of-flight mass spectrometry. The scoring rates and accuracies for these 109 SNPs were greater than 99% in a panel of North American sheep. In a blinded set of 96 families (sire, dam, and non-identical twin lambs), each parent of every lamb was identified without using the other parent's genotype. In 74 ISGC breed groups, the median estimates for probability of a coincidental match between two animals (PI), and the fraction of potential adults excluded from parentage (PE) were 1.1×10(-39) and 0.999987, respectively, for the 109 SNPs combined. The availability of a well-characterized set of 163 parentage SNPs facilitates the development of high-throughput genetic technologies for implementing accurate and economical parentage testing and traceability in many of the world's sheep breeds.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

The diversity of populations in domestic species offers great opportunities to study genome response to selection. The recently published Sheep HapMap dataset is a great example of characterization of the world wide genetic diversity in sheep. In this study, we re-analyzed the Sheep HapMap dataset to identify selection signatures in worldwide sheep populations. Compared to previous analyses, we made use of statistical methods that (i) take account of the hierarchical structure of sheep populations, (ii) make use of linkage disequilibrium information and (iii) focus specifically on either recent or older selection signatures. We show that this allows pinpointing several new selection signatures in the sheep genome and distinguishing those related to modern breeding objectives and to earlier post-domestication constraints. The newly identified regions, together with the ones previously identified, reveal the extensive genome response to selection on morphology, color and adaptation to new environments.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

BACKGROUND A novel Gram-negative, non-haemolytic, non-motile, rod-shaped bacterium was discovered in the lungs of a dead parakeet (Melopsittacus undulatus) that was kept in captivity in a petshop in Basel, Switzerland. The organism is described with a chemotaxonomic profile and the nearly complete genome sequence obtained through the assembly of short sequence reads. RESULTS Genome sequence analysis and characterization of respiratory quinones, fatty acids, polar lipids, and biochemical phenotype is presented here. Comparison of gene sequences revealed that the most similar species is Pelistega europaea, with BLAST identities of only 93% to the 16S rDNA gene, 76% identity to the rpoB gene, and a similar GC content (~43%) as the organism isolated from the parakeet, DSM 24701 (40%). The closest full genome sequences are those of Bordetella spp. and Taylorella spp. High-throughput sequencing reads from the Illumina-Solexa platform were assembled with the Edena de novo assembler to form 195 contigs comprising the ~2 Mb genome. Genome annotation with RAST, construction of phylogenetic trees with the 16S rDNA (rrs) gene sequence and the rpoB gene, and phylogenetic placement using other highly conserved marker genes with ML Tree all suggest that the bacterial species belongs to the Alcaligenaceae family. Analysis of samples from cages with healthy parakeets suggested that the newly discovered bacterial species is not widespread in parakeet living quarters. CONCLUSIONS Classification of this organism in the current taxonomy system requires the formation of a new genus and species. We designate the new genus Basilea and the new species psittacipulmonis. The type strain of Basilea psittacipulmonis is DSM 24701 (= CIP 110308 T, 16S rDNA gene sequence Genbank accession number JX412111 and GI 406042063).

Relevância:

10.00% 10.00%

Publicador:

Resumo:

BACKGROUND The free-living amoeba Naegleria fowleri is the causative agent of the rapidly progressing and typically fatal primary amoebic meningoencephalitis (PAM) in humans. Despite the devastating nature of this disease, which results in > 97% mortality, knowledge of the pathogenic mechanisms of the amoeba is incomplete. This work presents a comparative proteomic approach based on an experimental model in which the pathogenic potential of N. fowleri trophozoites is influenced by the compositions of different media. RESULTS As a scaffold for proteomic analysis, we sequenced the genome and transcriptome of N. fowleri. Since the sequence similarity of the recently published genome of Naegleria gruberi was far lower than the close taxonomic relationship of these species would suggest, a de novo sequencing approach was chosen. After excluding cell regulatory mechanisms originating from different media compositions, we identified 22 proteins with a potential role in the pathogenesis of PAM. Functional annotation of these proteins revealed, that the membrane is the major location where the amoeba exerts its pathogenic potential, possibly involving actin-dependent processes such as intracellular trafficking via vesicles. CONCLUSION This study describes for the first time the 30 Mb-genome and the transcriptome sequence of N. fowleri and provides the basis for the further definition of effective intervention strategies against the rare but highly fatal form of amoebic meningoencephalitis.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Tef Eragrostis tef (Zucc.) Trotter is a cereal crop resilient to adverse climatic and soil conditions, and possessing desirable storage properties. Although tef provides high quality food and grows under marginal conditions unsuitable for other cereals, it is considered to be an orphan crop because it has benefited little from genetic improvement. Hence, unlike other cereals such as maize and wheat, the productivity of tef is extremely low. In spite of the low productivity, tef is widely cultivated by over six million small-scale farmers in Ethiopia where it is annually grown on more than three million hectares of land, accounting for over 30% of the total cereal acreage. Tef, a tetraploid with 40 chromosomes (2n=4x=40), belongs to the Family Poaceae and, together with finger millet (Eleusine coracana Gaertn), to the Subfamily Chloridoideae. It was believed to have originated in Ethiopia. There are about 350 Eragrostis species of which E. tef is the only species cultivated for human consumption. At the present time, the gene bank in Ethiopia holds over five thousand tef accessions collected from geographical regions diverse in terms of climate and elevation. These germplasm accessions appear to have huge variability with regard to key agronomic and nutritional traits. In order to properly utilize the variability in developing new tef cultivars, various techniques have been implemented to catalog the extent and unravel the patterns of genetic diversity. In this review, we show some recent initiatives investigating the diversity of tef using genomics, transcriptomics and proteomics and discuss the prospect of these efforts in providing molecular resources that can aid modern tef breeding.