29 results for Repeat
Abstract:
Background: Non-long terminal repeat (non-LTR) retrotransposons have contributed to shaping the structure and function of genomes. In silico and experimental approaches have been used to identify the non-LTR elements of the urochordate Ciona intestinalis. Knowledge of the types and abundance of non-LTR elements in urochordates is a key step in understanding their contribution to the structure and function of vertebrate genomes. Results: Consensus elements phylogenetically related to the I, LINE1, LINE2, LOA and R2 elements of the 14 eukaryotic non-LTR clades are described from C. intestinalis. The ascidian elements showed conservation of both the reverse transcriptase coding sequence and the overall structural organization seen in each clade. The apurinic/apyrimidinic endonuclease and nucleic-acid-binding domains encoded upstream of the reverse transcriptase, and the RNase H and the restriction enzyme-like endonuclease motifs encoded downstream of the reverse transcriptase were identified in the corresponding Ciona families. Conclusions: The genome of C. intestinalis harbors representatives of at least five clades of non-LTR retrotransposons. The copy number per haploid genome of each element is low, less than 100, far below the values reported for vertebrate counterparts but within the range for protostomes. Genomic and sequence analysis shows that the ascidian non-LTR elements are unmethylated and flanked by genomic segments with a gene density lower than average for the genome. The analysis provides valuable data for understanding the evolution of early chordate genomes and enlarges the view on the distribution of the non-LTR retrotransposons in eukaryotes.
Abstract:
Background: Amino acid tandem repeats are found in nearly one-fifth of human proteins. Abnormal expansion of these regions is associated with several human disorders. To gain further insight into the mutational mechanisms that operate in this type of sequence, we have analyzed a large number of mutation variants derived from human expressed sequence tags (ESTs). Results: We identified 137 polymorphic variants in 115 different amino acid tandem repeats. Of these, 77 contained amino acid substitutions and 60 contained gaps (expansions or contractions of the repeat unit). The analysis showed that at least about 21% of the repeats might be polymorphic in humans. We compared the mutations found in different types of amino acid repeats and in adjacent regions. Overall, repeats showed a five-fold increase in the number of gap mutations compared to adjacent regions, reflecting the action of slippage within the repetitive structures. Gap and substitution mutations were very differently distributed between different amino acid repeat types. Among repeats containing gap variants we identified several disease and candidate disease genes. Conclusion: This is the first report at a genome-wide scale of the types of mutations occurring in the amino acid repeat component of the human proteome. We show that the mutational dynamics of different amino acid repeat types are very diverse. We provide a list of loci with highly variable repeat structures, some of which may be involved in disease.
Abstract:
Amino acid tandem repeats, also called homopolymeric tracts, are extremely abundant in eukaryotic proteins. To gain insight into the genome-wide evolution of these regions in mammals, we analyzed the repeat content in a large data set of rat-mouse-human orthologs. Our results show that human proteins contain more amino acid repeats than rodent proteins and that trinucleotide repeats are also more abundant in human coding sequences. Using the human species as an outgroup, we were able to address differences in repeat loss and repeat gain in the rat and mouse lineages. In this data set, mouse proteins contain substantially more repeats than rat proteins, which can be at least partly attributed to a higher repeat loss in the rat lineage. The data are consistent with a role for trinucleotide slippage in the generation of novel amino acid repeats. We confirm the previously observed functional bias of proteins with repeats, with overrepresentation of transcription factors and DNA-binding proteins. We show that genes encoding amino acid repeats tend to have an unusually high GC content, and that differences in coding GC content among orthologs are directly related to the presence/absence of repeats. We propose that the different GC content isochore structure in rodents and humans may result in an increased amino acid repeat prevalence in the human lineage.
Abstract:
Low-complexity regions (LCRs) in proteins are tracts that are highly enriched in one or a few amino acids. Given their high abundance, and their capacity to expand in relatively short periods of time through replication slippage, they can contribute greatly to increasing protein sequence space and generating novel protein functions. However, little is known about the global impact of LCRs on protein evolution. We have traced back the evolutionary history of 2,802 LCRs from a large set of homologous protein families from H. sapiens, M. musculus, G. gallus, D. rerio and C. intestinalis. Transcription factors and other regulatory functions are overrepresented in proteins containing LCRs. We have found that the gain of novel LCRs is frequently associated with repeat expansion whereas the loss of LCRs is more often due to accumulation of amino acid substitutions as opposed to deletions. This dichotomy results in net protein sequence gain over time. We have detected a significant increase in the rate of accumulation of novel LCRs in the ancestral Amniota and mammalian branches, and a reduction in the chicken branch. Alanine and/or glycine-rich LCRs are overrepresented in recently emerged LCR sets from all branches, suggesting that their expansion is better tolerated than for other LCR types. LCRs enriched in positively charged amino acids show the contrary pattern, indicating an important effect of purifying selection in their maintenance. We have performed the first large-scale study on the evolutionary dynamics of LCRs in protein families. The study has shown that the composition of an LCR is an important determinant of its evolutionary pattern.
Abstract:
Huntington's disease (HD) is an autosomal dominantly inherited disorder caused by the expansion of CAG repeats in the Huntingtin (HTT) gene. The abnormally extended polyglutamine in the HTT protein encoded by the CAG repeats has toxic effects. Here, we provide evidence that the mutant HTT CAG repeats interfere with cell viability at the RNA level. In human neuronal cells, expanded HTT exon-1 mRNA with CAG repeat lengths above the threshold for complete penetrance (40 or greater) induced cell death and increased levels of small CAG-repeated RNAs (sCAGs) of ≈21 nucleotides in a Dicer-dependent manner. The severity of the toxic effect of HTT mRNA and sCAG generation correlated with CAG expansion length. Small RNAs obtained from cells expressing mutant HTT and from HD human brains significantly decreased neuronal viability through an Ago2-dependent mechanism. In both cases, the use of anti-miRs specific for sCAGs efficiently blocked the toxic effect, supporting a key role of sCAGs in HTT-mediated toxicity. Luciferase-reporter assays showed that expanded HTT silences the expression of CTG-containing genes that are down-regulated in HD. These results suggest a possible link between HD and sCAG expression with an aberrant activation of the siRNA/miRNA gene silencing machinery, which may trigger a detrimental response. The identification of the specific cellular processes affected by sCAGs may provide insights into the pathogenic mechanisms underlying HD, offering opportunities to develop new therapeutic approaches.
Abstract:
Fungi are a large group of eukaryotes found in nearly all ecosystems. More than 250 fungal genomes have already been sequenced, greatly improving our understanding of fungal evolution, physiology, and development. However, for the Pezizomycetes, an early-diverging lineage of filamentous ascomycetes, there is so far only one genome available, namely that of the black truffle, Tuber melanosporum, a mycorrhizal species with unusual subterranean fruiting bodies. To help close the sequence gap among basal filamentous ascomycetes, and to allow conclusions about the evolution of fungal development, we sequenced the genome and assayed transcriptomes during development of Pyronema confluens, a saprobic Pezizomycete with a typical apothecium as fruiting body. With a size of 50 Mb and ~13,400 protein-coding genes, the genome is more characteristic of higher filamentous ascomycetes than the large, repeat-rich truffle genome; however, some typical features are different in the P. confluens lineage, e.g. the genomic environment of the mating type genes that is conserved in higher filamentous ascomycetes, but only partly conserved in P. confluens. On the other hand, P. confluens has a full complement of fungal photoreceptors, and expression studies indicate that light perception might be similar to distantly related ascomycetes and, thus, represent a basic feature of filamentous ascomycetes. Analysis of spliced RNA-seq sequence reads allowed the detection of natural antisense transcripts for 281 genes. The P. confluens genome contains an unusually high number of predicted orphan genes, many of which are upregulated during sexual development, consistent with the idea of rapid evolution of sex-associated genes. Comparative transcriptomics identified the transcription factor gene pro44 that is upregulated during development in P. confluens and the Sordariomycete Sordaria macrospora. The P. confluens pro44 gene (PCON_06721) was used to complement the S. macrospora pro44 deletion mutant, showing functional conservation of this developmental regulator.
Abstract:
Cooperative transmission can be seen as a "virtual" MIMO system, where the multiple transmit antennas are in fact implemented in a distributed fashion by the antennas at both the source and the relay terminal. Depending on the system design, diversity/multiplexing gains are achievable. This design involves the definition of the type of retransmission (incremental redundancy, repetition coding), the design of the distributed space-time codes, the error correcting scheme, the operation of the relay (decode&forward or amplify&forward) and the number of antennas at each terminal. Proposed schemes are evaluated in different conditions in combination with forward error correcting codes (FEC), both for linear and near-optimum (sphere decoder) receivers, for their possible implementation in downlink high speed packet services of cellular networks. Results show the benefits of coded cooperation over direct transmission in terms of increased throughput. It is shown that multiplexing gains are observed even if the mobile station features a single antenna, provided that cell wide reuse of the relay radio resource is possible.
Abstract:
Background: In Catalonia (Spain) breast cancer mortality has declined since the beginning of the 1990s. The dissemination of early detection by mammography and the introduction of adjuvant treatments are among the possible causes of this decrease, and both were almost coincident in time. Thus, understanding how these procedures were incorporated into use in the general population and in women diagnosed with breast cancer is very important for assessing their contribution to the reduction in breast cancer mortality. In this work we have modeled the dissemination of periodic mammography and described repeat mammography behavior in Catalonia from 1975 to 2006. Methods: Cross-sectional data from three Catalan Health Surveys for the calendar years 1994, 2002 and 2006 were used. The dissemination of mammography by birth cohort was modeled using a mixed effects model and repeat mammography behavior was described by age and survey year. Results: For women born from 1938 to 1952, mammography clearly had a period effect, meaning that they started to have periodic mammograms in the same calendar years but at different ages. The age at which approximately 50% of the women were receiving periodic mammograms went from 57.8 years of age for women born in 1938–1942 to 37.3 years of age for women born in 1963–1967. Women in all age groups experienced an increase in periodic mammography use over time, although women in the 50–69 age group have experienced the highest increase. Currently, the target population of the Catalan Breast Cancer Screening Program, 50–69 years of age, is the group that self-reports the highest utilization of periodic mammograms, followed by the 40–49 age group. A higher proportion of women of all age groups have annual mammograms rather than biennial or irregular ones. Conclusion: Mammography in Catalonia became more widely implemented during the 1990s. We estimated when cohorts initiated periodic mammograms and how frequently women are receiving them. These two pieces of information will be entered into a cost-effectiveness model of early detection in Catalonia.
Abstract:
Background: The 22q11.2 deletion syndrome is the most frequent genomic disorder with an estimated frequency of 1/4000 live births. The majority of patients (90%) have the same deletion of 3 Mb (Typically Deleted Region, TDR) that results from aberrant recombination at meiosis between region specific low-copy repeats (LCRs). Methods: As a first step towards the characterization of recombination rates and breakpoints within the 22q11.2 region we have constructed a high resolution recombination breakpoint map based on pedigree analysis and a population-based historical recombination map based on LD analysis. Results: Our pedigree map allows the location of recombination breakpoints with a high resolution (potential recombination hotspots), and this approach has led to the identification of 5 breakpoint segments of 50 kb or less (8.6 kb the smallest), that coincide with historical hotspots. It has been suggested that aberrant recombination leading to deletion (and duplication) is caused by low rates of Allelic Homologous Recombination (AHR) within the affected region. However, recombination rate estimates for the 22q11.2 region show that neither the average recombination rates in the 22q11.2 region nor those within LCR22-2 (the LCR implicated in most deletions and duplications) are significantly below chromosome 22 averages. Furthermore, LCR22-2, the repeat most frequently implicated in rearrangements, is also the LCR22 with the highest levels of AHR. In addition, we find recombination events in the 22q11.2 region to cluster within families. Within this context, the same chromosome recombines twice in one family; first by AHR and in the next generation by NAHR resulting in an individual affected with the del22q11.2 syndrome. Conclusion: We show in the context of a first high resolution pedigree map of the 22q11.2 region that NAHR within LCR22 leading to duplications and deletions cannot be explained exclusively under a hypothesis of low AHR rates. In addition, we find that AHR recombination events cluster within families. If normal and aberrant recombination are mechanistically related, the fact that LCR22s undergo frequent AHR and that we find familial differences in recombination rates within the 22q11.2 region would have obvious health-related implications.
Abstract:
The recently discovered apolipoprotein AV (apoAV) gene has been reported to be a key player in modulating plasma triglyceride levels. Here we identify the hepatocyte nuclear factor-4α (HNF-4α) as a novel regulator of the human apoAV gene. Inhibition of HNF-4α expression by small interfering RNA resulted in down-regulation of apoAV. Deletion, mutagenesis, and binding assays revealed that HNF-4α directly regulates the human apoAV promoter through DR1 [a direct repeat separated by one nucleotide (nt)], and via a novel element for HNF-4α consisting of an inverted repeat separated by 8 nt (IR8). In addition, we show that the coactivator peroxisome proliferator-activated receptor-γ coactivator-1α was capable of stimulating the HNF-4α-dependent transactivation of the apoAV promoter. Furthermore, analyses in human hepatic cells demonstrated that AMP-activated protein kinase (AMPK) and the MAPK signaling pathway regulate human apoAV expression and suggested that this regulation may be mediated, at least in part, by changes in HNF-4α. Intriguingly, EMSAs and mice with a liver-specific disruption of the HNF-4α gene revealed a species-distinct regulation of apoAV by HNF-4α, which resembles that of a subset of HNF-4α target genes. Taken together, our data provide new insights into the binding properties and the modulation of HNF-4α and underscore the role of HNF-4α in regulating triglyceride metabolism.
Abstract:
Background: Dopamine is believed to be a key neurotransmitter in the development of attention-deficit/hyperactivity disorder (ADHD). Several recent studies point to an association of the dopamine D4 receptor (DRD4) gene and this condition. More specifically, the 7 repeat variant of a variable number of tandem repeats (VNTR) polymorphism in exon III of this gene is suggested to bear a higher risk for ADHD. In the present study, we investigated the role of this polymorphism in the modulation of neurophysiological correlates of response inhibition (Go/Nogo task) in a healthy, high-functioning sample. Results: Homozygous 7 repeat carriers showed a tendency for more accurate behavior in the Go/Nogo task compared to homozygous 4 repeat carriers. Moreover, 7 repeat carriers presented an increased nogo-related theta band response together with a reduced go-related beta decrease. Conclusions: These data point to improved cognitive functions and prefrontal control in the 7 repeat carriers, probably due to the D4 receptor's modulatory role in prefrontal areas. The results are discussed with respect to previous behavioral data on this polymorphism and animal studies on the impact of the D4 receptor on cognitive functions.
Abstract:
Myotonic dystrophy 1 (DM1) is caused by a CTG expansion in the 3′-untranslated region of the DMPK gene, which encodes a serine/threonine protein kinase. One of the common clinical features of DM1 patients is insulin resistance, which has been associated with a pathogenic effect of the repeat expansions. Here we show that DMPK itself is a positive modulator of insulin action. DMPK-deficient (dmpk−/−) mice exhibit impaired insulin signaling in muscle tissues but not in adipocytes and liver, tissues in which DMPK is not expressed. Dmpk−/− mice display metabolic derangements such as abnormal glucose tolerance, reduced glucose uptake and impaired insulin-dependent GLUT4 trafficking in muscle. Using DMPK mutants, we show that DMPK is required for correct intracellular trafficking of insulin and IGF-1 receptors, providing a mechanism to explain the molecular and metabolic phenotype of dmpk−/− mice. Taken together, these findings indicate that reduced DMPK expression may directly influence the onset of insulin resistance in DM1 patients and point to dmpk as a new candidate gene for susceptibility to type 2 diabetes.
Abstract:
Mechanisms underlying speciation in plants include detrimental (incompatible) genetic interactions between parental alleles that incur a fitness cost in hybrids. We reported on recessive hybrid incompatibility between an Arabidopsis thaliana strain from Poland, Landsberg erecta (Ler), and many Central Asian A. thaliana strains. The incompatible interaction is determined by a polymorphic cluster of Toll/interleukin-1 receptor-nucleotide binding-leucine rich repeat (TNL) RPP1 (Recognition of Peronospora parasitica1)-like genes in Ler and alleles of the receptor-like kinase Strubbelig Receptor Family 3 (SRF3) in Central Asian strains Kas-2 or Kond, causing temperature-dependent autoimmunity and loss of growth and reproductive fitness. Here, we genetically dissected the RPP1-like Ler locus to determine contributions of individual RPP1-like Ler (R1–R8) genes to the incompatibility. In a neutral background, expression of most RPP1-like Ler genes, except R3, has no effect on growth or pathogen resistance. Incompatibility involves increased R3 expression, and engineered R3 overexpression in a neutral background induces dwarfism and sterility. However, no individual RPP1-like Ler gene is sufficient for incompatibility between Ler and Kas-2 or Kond, suggesting that co-action of at least two RPP1-like members underlies this epistatic interaction. We find that the RPP1-like Ler haplotype is frequent and occurs with other Ler RPP1-like alleles in a local population in Gorzów Wielkopolski (Poland). Only Gorzów individuals carrying the RPP1-like Ler haplotype are incompatible with Kas-2 and Kond, whereas other RPP1-like alleles in the population are compatible. Therefore, the RPP1-like Ler haplotype has been maintained in genetically different individuals at a single site, allowing exploration of forces shaping the evolution of RPP1-like genes at local and regional population scales.