146 resultados para COPY NUMBER
Resumo:
Abstract : Gene duplication is an essential source of material for the origin of genetic novelties. The reverse transcription of source gene mRNA followed by the genomic insertion of the resulting cDNA - retroposition - has provided the human genome with at least ~3600 detectable retrocopies. We find that ~30% of these retrocopies are transcribed, generally in testes. Their transcription often relies on preexisting regulatory elements (or open chromatin) close to their insertion site, which is illustrated by mRNA molecules containing retrocopies fused to their neighboring genes. Retrocopies appear to have been profoundly shaped by selection. Consistently, human retrocopies with an intact open reading (ORF) are more often transcribed than retropseudogenes, which leads to a minimal estimate of 120 functional retrogenes present in our genome. We also performed an analysis of Ka/Ks for human retrocopies. This analysis demonstrates that several intact retrocopies evolved under purifying selection and yields an estimated formation rate of ~1 retrogene per million year in the primate lineage. Using DNA sequencing and evolutionary simulations, we have identified 7 such primate-specific retrogenes that emerged on the lineage leading to humans In therian genomes, we found an excess of retrogenes with X-linked parents. Expression analyses support the idea that this "out of X" movement was driven by natural selection to produce autosomal functional counterparts for X-linked genes, which are silenced during male meiosis. Phylogenetic dating of this "out of X" movement suggests that our sex chromosomes arose about 180 MYA ago and are thus much younger than previously thought. Finally, we have also analyzed young gene duplications (and deletions) that arose by non allelic-homologous recombination and are not fixed in species. Using wild-caught and laboratory animals, we detected thousands of DNA segments that are polymorphic in copy number in mice. These copy number variants were found to profoundly alter the transcriptome of several mouse tissues. Strikingly, their influence on gene expression is not limited to the gene they contain but seems to extend to genes located up to 1.5 million bases away.
Resumo:
The complete mitochondrial DNA (mtDNA) control region was amplified and directly sequenced in two species of shrew, Crocidura russula and Sorex araneus (Insectivora, Mammalia). The general organization is similar to that found in other mammals: a central conserved region surrounded by two more variable domains. However, we have found in shrews the simultaneous presence of arrays of tandem repeats in potential locations where repeats tend to occur separately in other mammalian species. These locations correspond to regions which are associated with a possible interruption of the replication processes, either at the end of the three-stranded D-loop structure or toward the end of the heavy-strand replication. In the left domain the repeated sequences (R1 repeats) are 78 bp long, whereas in the right domain the repeats are 12 bp long in C. russula and 14 bp long in S. araneus (R2 repeats). Variation in the copy number of these repeated sequences results in mtDNA control region length differences. Southern blot analysis indicates that level of heteroplasmy (more than one mtDNA form within an individual) differs between species. A comparative study of the R2 repeats in 12 additional species representing three shrew subfamilies provides useful indications for the understanding of the origin and the evolution of these homologous tandemly repeated sequences. An asymmetry in the distribution of variants within the arrays, as well as the constant occurrence of shorter repeated sequences flanking only one side of the R2 arrays, could be related to asymmetry in the replication of each strand of the mtDNA molecule. The pattern of sequence and length variation within and between species, together with the capability of the arrays to form stable secondary structures, suggests that the dominant mechanism involved in the evolution of these arrays in unidirectional replication slippage.
Resumo:
Transfection with polyethylenimine (PEI) was evaluated as a method for the generation of recombinant Chinese hamster ovary (CHO DG44) cell lines by direct comparison with calcium phosphate-DNA coprecipitation (CaPO4) using both green fluorescent protein (GFP) and a monoclonal antibody as reporter proteins. Following transfection with a GFP expression vector, the proportion of GFP-positive cells as determined by flow cytometry was fourfold higher for the PEI transfection as compared to the CaPO4 transfection. However, the mean level of transient GFP expression for the cells with the highest level of fluorescence was twofold greater for the CaPO4 transfection. Fluorescence in situ hybridization on metaphase chromosomes from pools of cells grown under selective pressure demonstrated that plasmid integration always occurred at a single site regardless of the transfection method. Importantly, the copy number of integrated plasmids was measurably higher in cells transfected with CaPO4. The efficiency of recombinant cell line recovery under selective pressure was fivefold higher following PEI transfection, but the average specific productivity of a recombinant antibody was about twofold higher for the CaPO4-derived cell lines. Nevertheless, no difference between the two transfection methods was observed in terms of the stability of protein production. These results demonstrated the feasibility of generating recombinant CHO-derived cell lines by PEI transfection. However, this method appeared inferior to CaPO4 transfection with regard to the specific productivity of the recovered cell lines.
Resumo:
The goals of the human genome project did not include sequencing of the heterochromatic regions. We describe here an initial sequence of 1.1 Mb of the short arm of human chromosome 21 (HSA21p), estimated to be 10% of 21p. This region contains extensive euchromatic-like sequence and includes on average one transcript every 100 kb. These transcripts show multiple inter- and intrachromosomal copies, and extensive copy number and sequence variability. The sequencing of the "heterochromatic" regions of the human genome is likely to reveal many additional functional elements and provide important evolutionary information.
Resumo:
Epigenetic silencing of the DNA repair protein O(6)-methylguanine-DNA methyltransferase (MGMT) by promoter methylation predicts successful alkylating agent therapy, such as with temozolomide, in glioblastoma patients. Stratified therapy assignment of patients in prospective clinical trials according to tumor MGMT status requires a standardized diagnostic test, suitable for high-throughput analysis of small amounts of formalin-fixed, paraffin-embedded tumor tissue. A direct, real-time methylation-specific PCR (MSP) assay was developed to determine methylation status of the MGMT gene promoter. Assay specificity was obtained by selective amplification of methylated DNA sequences of sodium bisulfite-modified DNA. The copy number of the methylated MGMT promoter, normalized to the beta-actin gene, provides a quantitative test result. We analyzed 134 clinical glioma samples, comparing the new test with the previously validated nested gel-based MSP assay, which yields a binary readout. A cut-off value for the MGMT methylation status was suggested by fitting a bimodal normal mixture model to the real-time results, supporting the hypothesis that there are two distinct populations within the test samples. Comparison of the tests showed high concordance of the results (82/91 [90%]; Cohen's kappa = 0.80; 95% confidence interval, 0.82-0.95). The direct, real-time MSP assay was highly reproducible (Pearson correlation 0.996) and showed valid test results for 93% (125/134) of samples compared with 75% (94/125) for the nested, gel-based MSP assay. This high-throughput test provides an important pharmacogenomic tool for individualized management of alkylating agent chemotherapy.
Resumo:
Growing evidence suggests that a novel member of the Chlamydiales order, Waddlia chondrophila, is a potential agent of miscarriage in humans and abortion in ruminants. Due to the lack of genetic tools to manipulate chlamydia, genomic analysis is proving to be the most incisive tool in stimulating investigations into the biology of these obligate intracellular bacteria. 454/Roche and Solexa/Illumina technologies were thus used to sequence and assemble de novo the full genome of the first representative of the Waddliaceae family, W. chondrophila. The bacteria possesses a 2'116'312 bp chromosome and a 15'593 bp low-copy number plasmid that might integrate into the bacterial chromosome. The Waddlia genome displays numerous repeated sequences indicating different genome dynamics from classical chlamydia which almost completely lack repetitive elements. Moreover, W. chondrophila exhibits many virulence factors also present in classical chlamydia, including a functional type III secretion system, but also a large complement of specific factors for resistance to host or environmental stresses. Large families of outer membrane proteins were identified indicating that these highly immunogenic proteins are not Chlamydiaceae specific and might have been present in their last common ancestor. Enhanced metabolic capability for the synthesis of nucleotides, amino acids, lipids and other co-factors suggests that the common ancestor of the modern Chlamydiales may have been less dependent on their eukaryotic host. The fine-detailed analysis of biosynthetic pathways brings us closer to possibly developing a synthetic medium to grow W. chondrophila, a critical step in the development of genetic tools. As a whole, the availability of the W. chondrophila genome opens new possibilities in Chlamydiales research, providing new insights into the evolution of members of the order Chlamydiales and the biology of the Waddliaceae.
Resumo:
Structural variation, whether it is caused by copy number variants or present in a balanced form, such as reciprocal translocations and inversions, can have a profound and dramatic effect on the expression of genes mapping within and close to the rearrangement, as well as affecting others genome wide. These effects can be caused by altering the copy number of one or more genes or regulatory elements (dosage effect) or from physical disruption of links between regulatory elements and their associated gene or genes, resulting in perturbation of expression. Similarly, large-scale structural variants can result in genome-wide expression changes by altering the positions that chromosomes occupy within the nucleus, potentially disrupting not only local cis interactions, but also trans interactions that occur throughout the genome. Structural variation is, therefore, a significant factor in the study of gene expression and is discussed here in more detail.
Resumo:
OBJECTIVE: To establish the genetic basis of Landau-Kleffner syndrome (LKS) in a cohort of two discordant monozygotic (MZ) twin pairs and 11 isolated cases. METHODS: We used a multifaceted approach to identify genetic risk factors for LKS. Array comparative genomic hybridization (CGH) was performed using the Agilent 180K array. Whole genome methylation profiling was undertaken in the two discordant twin pairs, three isolated LKS cases, and 12 control samples using the Illumina 27K array. Exome sequencing was undertaken in 13 patients with LKS including two sets of discordant MZ twins. Data were analyzed with respect to novel and rare variants, overlapping genes, variants in reported epilepsy genes, and pathway enrichment. RESULTS: A variant (cG1553A) was found in a single patient in the GRIN2A gene, causing an arginine to histidine change at site 518, a predicted glutamate binding site. Following copy number variation (CNV), methylation, and exome sequencing analysis, no single candidate gene was identified to cause LKS in the remaining cohort. However, a number of interesting additional candidate variants were identified including variants in RELN, BSN, EPHB2, and NID2. SIGNIFICANCE: A single mutation was identified in the GRIN2A gene. This study has identified a number of additional candidate genes including RELN, BSN, EPHB2, and NID2. A PowerPoint slide summarizing this article is available for download in the Supporting Information section here.
Resumo:
The recognition that colorectal cancer (CRC) is a heterogeneous disease in terms of clinical behaviour and response to therapy translates into an urgent need for robust molecular disease subclassifiers that can explain this heterogeneity beyond current parameters (MSI, KRAS, BRAF). Attempts to fill this gap are emerging. The Cancer Genome Atlas (TGCA) reported two main CRC groups, based on the incidence and spectrum of mutated genes, and another paper reported an EMT expression signature defined subgroup. We performed a prior free analysis of CRC heterogeneity on 1113 CRC gene expression profiles and confronted our findings to established molecular determinants and clinical, histopathological and survival data. Unsupervised clustering based on gene modules allowed us to distinguish at least five different gene expression CRC subtypes, which we call surface crypt-like, lower crypt-like, CIMP-H-like, mesenchymal and mixed. A gene set enrichment analysis combined with literature search of gene module members identified distinct biological motifs in different subtypes. The subtypes, which were not derived based on outcome, nonetheless showed differences in prognosis. Known gene copy number variations and mutations in key cancer-associated genes differed between subtypes, but the subtypes provided molecular information beyond that contained in these variables. Morphological features significantly differed between subtypes. The objective existence of the subtypes and their clinical and molecular characteristics were validated in an independent set of 720 CRC expression profiles. Our subtypes provide a novel perspective on the heterogeneity of CRC. The proposed subtypes should be further explored retrospectively on existing clinical trial datasets and, when sufficiently robust, be prospectively assessed for clinical relevance in terms of prognosis and treatment response predictive capacity. Original microarray data were uploaded to the ArrayExpress database (http://www.ebi.ac.uk/arrayexpress/) under Accession Nos E-MTAB-990 and E-MTAB-1026. © 2013 Swiss Institute of Bioinformatics. Journal of Pathology published by John Wiley & Sons Ltd on behalf of Pathological Society of Great Britain and Ireland.
Resumo:
Many types of tumors exhibit characteristic chromosomal losses or gains, as well as local amplifications and deletions. Within any given tumor type, sample specific amplifications and deletions are also observed. Typically, a region that is aberrant in more tumors, or whose copy number change is stronger, would be considered as a more promising candidate to be biologically relevant to cancer. We sought for an intuitive method to define such aberrations and prioritize them. We define V, the "volume" associated with an aberration, as the product of three factors: (a) fraction of patients with the aberration, (b) the aberration's length and (c) its amplitude. Our algorithm compares the values of V derived from the real data to a null distribution obtained by permutations, and yields the statistical significance (p-value) of the measured value of V. We detected genetic locations that were significantly aberrant, and combine them with chromosomal arm status (gain/loss) to create a succinct fingerprint of the tumor genome. This genomic fingerprint is used to visualize the tumors, highlighting events that are co-occurring or mutually exclusive. We apply the method on three different public array CGH datasets of Medulloblastoma and Neuroblastoma, and demonstrate its ability to detect chromosomal regions that were known to be altered in the tested cancer types, as well as to suggest new genomic locations to be tested. We identified a potential new subtype of Medulloblastoma, which is analogous to Neuroblastoma type 1.
Resumo:
Obesity is globally prevalent and highly heritable, but its underlying genetic factors remain largely elusive. To identify genetic loci for obesity susceptibility, we examined associations between body mass index and ∼ 2.8 million SNPs in up to 123,865 individuals with targeted follow up of 42 SNPs in up to 125,931 additional individuals. We confirmed 14 known obesity susceptibility loci and identified 18 new loci associated with body mass index (P < 5 × 10⁻⁸), one of which includes a copy number variant near GPRC5B. Some loci (at MC4R, POMC, SH2B1 and BDNF) map near key hypothalamic regulators of energy balance, and one of these loci is near GIPR, an incretin receptor. Furthermore, genes in other newly associated loci may provide new insights into human body weight regulation.
Resumo:
We present the application of a real-time quantitative PCR assay, previously developed to measure relative telomere length in humans and mice, to two bird species, the zebra finch Taeniopygia guttata and the Alpine swift Apus melba. This technique is based on the PCR amplification of telomeric (TTAGGG)(n) sequences using specific oligonucleotide primers. Relative telomere length is expressed as the ratio (T/S) of telomere repeat copy number (T) to control single gene copy number (S). This method is particularly useful for comparisons of individuals within species, or where the same individuals are followed longitudinally. We used glyceraldehyde-3-phosphate dehydrogenase (GAPDH) as a single control gene. In both species, we validated our PCR measurements of relative telomere length against absolute measurements of telomere length determined by the conventional method of quantifying telomere terminal restriction fragment (TRF) lengths using both the traditional Southern blot analysis (Alpine swifts) and in gel hybridization (zebra finches). As found in humans and mice, telomere lengths in the same sample measured by TRF and PCR were well correlated in both the Alpine swift and the zebra finch.. Hence, this PCR assay for measurement of bird telomeres, which is fast and requires only small amounts of genomic DNA, should open new avenues in the study of environmental factors influencing variation in telomere length, and how this variation translates into variation in cellular and whole organism senescence.
Resumo:
Pigs are very often colonized by Staphylococcus aureus and transmission of such pig-associated S. aureus to humans can cause serious medical, hygiene, and economic problems. The transmission route of zoonotic pathogens colonizing farm animals to humans is not well established and bioaerosols could play an important role. The aim of this study was to assess the potential occupational risk of working with S. aureus-colonized pigs in Switzerland. We estimated the airborne contamination by S. aureus in 37 pig farms (20 nursery and 17 fattening units; 25 in summer, 12 in winter). Quantification of total airborne bacterial DNA, airborne Staphylococcus sp. DNA, fungi, and airborne endotoxins was also performed. In this experiment, the presence of cultivable airborne methicillin-resistant S. aureus (MRSA) CC398 in a pig farm in Switzerland was reported for the first time. Airborne methicillin-sensitive S. aureus (MSSA) was found in ~30% of farms. The average airborne concentration of DNA copy number of total bacteria and Staphylococcus sp. measured by quantitative polymerase chain reaction was very high, respectively reaching values of 75 (± 28) × 10(7) and 35 (± 9.8) × 10(5) copy numbers m(-3) in summer and 96 (± 19) × 10(8) and 40 (± 12) × 10(6) copy numbers m(-3) in winter. Total mean airborne concentrations of endotoxins (1298 units of endotoxin m(-3)) and fungi (5707 colony-forming units m(-3)) exceeded the Swiss recommended values and were higher in winter than in summer. In conclusion, Swiss pig farmers will have to tackle a new emerging occupational risk, which could also have a strong impact on public health. The need to inform pig farmers about biological occupational risks is therefore crucial.
Resumo:
Molecular monitoring of BCR/ABL transcripts by real time quantitative reverse transcription PCR (qRT-PCR) is an essential technique for clinical management of patients with BCR/ABL-positive CML and ALL. Though quantitative BCR/ABL assays are performed in hundreds of laboratories worldwide, results among these laboratories cannot be reliably compared due to heterogeneity in test methods, data analysis, reporting, and lack of quantitative standards. Recent efforts towards standardization have been limited in scope. Aliquots of RNA were sent to clinical test centers worldwide in order to evaluate methods and reporting for e1a2, b2a2, and b3a2 transcript levels using their own qRT-PCR assays. Total RNA was isolated from tissue culture cells that expressed each of the different BCR/ABL transcripts. Serial log dilutions were prepared, ranging from 100 to 10-5, in RNA isolated from HL60 cells. Laboratories performed 5 independent qRT-PCR reactions for each sample type at each dilution. In addition, 15 qRT-PCR reactions of the 10-3 b3a2 RNA dilution were run to assess reproducibility within and between laboratories. Participants were asked to run the samples following their standard protocols and to report cycle threshold (Ct), quantitative values for BCR/ABL and housekeeping genes, and ratios of BCR/ABL to housekeeping genes for each sample RNA. Thirty-seven (n=37) participants have submitted qRT-PCR results for analysis (36, 37, and 34 labs generated data for b2a2, b3a2, and e1a2, respectively). The limit of detection for this study was defined as the lowest dilution that a Ct value could be detected for all 5 replicates. For b2a2, 15, 16, 4, and 1 lab(s) showed a limit of detection at the 10-5, 10-4, 10-3, and 10-2 dilutions, respectively. For b3a2, 20, 13, and 4 labs showed a limit of detection at the 10-5, 10-4, and 10-3 dilutions, respectively. For e1a2, 10, 21, 2, and 1 lab(s) showed a limit of detection at the 10-5, 10-4, 10-3, and 10-2 dilutions, respectively. Log %BCR/ABL ratio values provided a method for comparing results between the different laboratories for each BCR/ABL dilution series. Linear regression analysis revealed concordance among the majority of participant data over the 10-1 to 10-4 dilutions. The overall slope values showed comparable results among the majority of b2a2 (mean=0.939; median=0.9627; range (0.399 - 1.1872)), b3a2 (mean=0.925; median=0.922; range (0.625 - 1.140)), and e1a2 (mean=0.897; median=0.909; range (0.5174 - 1.138)) laboratory results (Fig. 1-3)). Thirty-four (n=34) out of the 37 laboratories reported Ct values for all 15 replicates and only those with a complete data set were included in the inter-lab calculations. Eleven laboratories either did not report their copy number data or used other reporting units such as nanograms or cell numbers; therefore, only 26 laboratories were included in the overall analysis of copy numbers. The median copy number was 348.4, with a range from 15.6 to 547,000 copies (approximately a 4.5 log difference); the median intra-lab %CV was 19.2% with a range from 4.2% to 82.6%. While our international performance evaluation using serially diluted RNA samples has reinforced the fact that heterogeneity exists among clinical laboratories, it has also demonstrated that performance within a laboratory is overall very consistent. Accordingly, the availability of defined BCR/ABL RNAs may facilitate the validation of all phases of quantitative BCR/ABL analysis and may be extremely useful as a tool for monitoring assay performance. Ongoing analyses of these materials, along with the development of additional control materials, may solidify consensus around their application in routine laboratory testing and possible integration in worldwide efforts to standardize quantitative BCR/ABL testing.
Resumo:
Macrophage migration inhibitory factor (MIF) is an abundantly expressed proinflammatory cytokine playing a critical role in innate immunity and sepsis and other inflammatory diseases. We examined whether functional MIF gene polymorphisms (-794 CATT(5-8) microsatellite and -173 G/C SNP) were associated with the occurrence and outcome of meningococcal disease in children. The CATT(5) allele was associated with the probability of death predicted by the Pediatric Index of Mortality 2 (P=0.001), which increased in correlation with the CATT(5) copy number (P=0.04). The CATT(5) allele, but not the -173 G/C alleles, was also associated with the actual mortality from meningoccal sepsis [OR 2.72 (1.2-6.4), P=0.02]. A family-based association test (i.e., transmission disequilibrium test) performed in 240 trios with 1 afflicted offspring indicated that CATT(5) was a protective allele (P=0.02) for the occurrence of meningococcal disease. At baseline and after stimulation with Neisseria meningitidis in THP-1 monocytic cells or in a whole-blood assay, CATT(5) was found to be a low-expression MIF allele (P=0.005 and P=0.04 for transcriptional activity; P=0.09 and P=0.09 for MIF production). Taken together, these data suggest that polymorphisms of the MIF gene affecting MIF expression are associated with the occurrence, severity, and outcome of meningococcal disease in children.