9 resultados para REPEATS
em Consorci de Serveis Universitaris de Catalunya (CSUC), Spain
Resumo:
Background: Amino acid tandem repeats are found in nearly one-fifth of human proteins. Abnormal expansion of these regions is associated with several human disorders. To gain further insight into the mutational mechanisms that operate in this type of sequence, we have analyzed a large number of mutation variants derived from human expressed sequence tags (ESTs).Results: We identified 137 polymorphic variants in 115 different amino acid tandem repeats. Of these, 77 contained amino acid substitutions and 60 contained gaps (expansions or contractions of the repeat unit). The analysis showed that at least about 21% of the repeats might be polymorphic in humans. We compared the mutations found in different types of amino acid repeats and in adjacent regions. Overall, repeats showed a five-fold increase in the number of gap mutations compared to adjacent regions, reflecting the action of slippage within the repetitive structures. Gap and substitution mutations were very differently distributed between different amino acid repeat types. Among repeats containing gap variants we identified several disease and candidate disease genes.Conclusion: This is the first report at a genome-wide scale of the types of mutations occurring in the amino acid repeat component of the human proteome. We show that the mutational dynamics of different amino acid repeat types are very diverse. We provide a list of loci with highly variable repeat structures, some of which may be potentially involved in disease.
Resumo:
Amino acid tandem repeats, also called homopolymeric tracts, are extremely abundant in eukaryotic proteins. To gain insight into the genome-wide evolution of these regions in mammals, we analyzed the repeat content in a large data set of rat-mouse-human orthologs. Our results show that human proteins contain more amino acid repeats than rodent proteins and that trinucleotide repeats are also more abundant in human coding sequences. Using the human species as an outgroup, we were able to address differences in repeat loss and repeat gain in the rat and mouse lineages. In this data set, mouse proteins contain substantially more repeats than rat proteins, which can be at least partly attributed to a higher repeat loss in the rat lineage. The data are consistent with a role for trinucleotide slippage in the generation of novel amino acid repeats. We confirm the previously observed functional bias of proteins with repeats, with overrepresentation of transcription factors and DNA-binding proteins. We show that genes encoding amino acid repeats tend to have an unusually high GC content, and that differences in coding GC content among orthologs are directly related to the presence/absence of repeats. We propose that the different GC content isochore structure in rodents and humans may result in an increased amino acid repeat prevalence in the human lineage.
Resumo:
The main information sources to study a particular piece of music are symbolic scores and audio recordings. These are complementary representations of the piece and it isvery useful to have a proper linking between the two of the musically meaningful events. For the case of makam music of Turkey, linking the available scores with the correspondingaudio recordings requires taking the specificities of this music into account, such as the particular tunings, the extensive usage of non-notated expressive elements, and the way in which the performer repeats fragmentsof the score. Moreover, for most of the pieces of the classical repertoire, there is no score written by the original composer. In this paper, we propose a methodology to pair sections of a score to the corresponding fragments of audio recording performances. The pitch information obtained from both sources is used as the common representationto be paired. From an audio recording, fundamental frequency estimation and tuning analysis is done to compute a pitch contour. From the corresponding score, symbolic note names and durations are converted to a syntheticpitch contour. Then, a linking operation is performed between these pitch contours in order to find the best correspondences.The method is tested on a dataset of 11 compositions spanning 44 audio recordings, which are mostly monophonic. An F3-score of 82% and 89% are obtained with automatic and semi-automatic karar detection respectively,showing that the methodology may give us a needed tool for further computational tasks such as form analysis, audio-score alignment and makam recognition.
Resumo:
The gibbon genome exhibits extensive karyotypic diversity with an increased rate of chromosomal rearrangements during evolution. In an effort to understand the mechanistic origin and implications of these rearrangement events, we sequenced 24 synteny breakpoint regions in the white-cheeked gibbon (Nomascus leucogenys, NLE) in the form of high-quality BAC insert sequences (4.2 Mbp). While there is a significant deficit of breakpoints in genes, we identified seven human gene structures involved in signaling pathways (DEPDC4, GNG10), phospholipid metabolism (ENPP5, PLSCR2), beta-oxidation (ECH1), cellular structure and transport (HEATR4), and transcription (ZNF461), that have been disrupted in the NLE gibbon lineage. Notably, only three of these genes show the expected evolutionary signatures of pseudogenization. Sequence analysis of the breakpoints suggested both nonclassical nonhomologous end-joining (NHEJ) and replication-based mechanisms of rearrangement. A substantial number (11/24) of human-NLE gibbon breakpoints showed new insertions of gibbon-specific repeats and mosaic structures formed from disparate sequences including segmental duplications, LINE, SINE, and LTR elements. Analysis of these sites provides a model for a replication-dependent repair mechanism for double-strand breaks (DSBs) at rearrangement sites and insights into the structure and formation of primate segmental duplications at sites of genomic rearrangements during evolution.
Resumo:
Huntington's disease (HD) is an autosomal dominantly inherited disorder caused by the expansion of CAG repeats in the Huntingtin (HTT) gene. The abnormally extended polyglutamine in the HTT protein encoded by the CAG repeats has toxic effects. Here, we provide evidence to support that the mutant HTT CAG repeats interfere with cell viability at the RNA level. In human neuronal cells, expanded HTT exon-1 mRNA with CAG repeat lengths above the threshold for complete penetrance (40 or greater) induced cell death and increased levels of small CAG-repeated RNAs (sCAGs), of ≈21 nucleotides in a Dicer-dependent manner. The severity of the toxic effect of HTT mRNA and sCAG generation correlated with CAG expansion length. Small RNAs obtained from cells expressing mutant HTT and from HD human brains significantly decreased neuronal viability, in an Ago2-dependent mechanism. In both cases, the use of anti-miRs specific for sCAGs efficiently blocked the toxic effect, supporting a key role of sCAGs in HTT-mediated toxicity. Luciferase-reporter assays showed that expanded HTT silences the expression of CTG-containing genes that are down-regulated in HD. These results suggest a possible link between HD and sCAG expression with an aberrant activation of the siRNA/miRNA gene silencing machinery, which may trigger a detrimental response. The identification of the specific cellular processes affected by sCAGs may provide insights into the pathogenic mechanisms underlying HD, offering opportunities to develop new therapeutic approaches
Resumo:
Thirty two microsatellites were optimized from 454 pyrosequencing libraries for three Atlanto-Mediterranean echinoderms: Coscinasterias tenuispina, Echinaster sepositus and Arbacia lixula. We observed different frequency of microsatellite types (di-, tri-, tetra- and pentanucleotide) throughout the genome of the species, but no significant differences were observed in allele richness among different microsatellite repeats. No loci showed linkage disequilibrium. Heterozygosity deficit and departure from Hardy Weinberg equilibrium were observed for some loci, in two species, probably due to high levels of inbreeding. Heterozygosity excess observed in C. tenuispina could be explained by selection against homozygotes and/or outcrossing.
Resumo:
Background: The 22q11.2 deletion syndrome is the most frequent genomic disorder with an estimated frequency of 1/4000 live births. The majority of patients (90%) have the same deletion of 3 Mb (Typically Deleted Region, TDR) that results from aberrant recombination at meiosis between region specific low-copy repeats (LCRs). Methods: As a first step towards the characterization of recombination rates and breakpoints within the 22q11.2 region we have constructed a high resolution recombination breakpoint map based on pedigree analysis and a population-based historical recombination map based on LD analysis. Results: Our pedigree map allows the location of recombination breakpoints with a high resolution (potential recombination hotspots), and this approach has led to the identification of 5 breakpoint segments of 50 kb or less (8.6 kb the smallest), that coincide with historical hotspots. It has been suggested that aberrant recombination leading to deletion (and duplication) is caused by low rates of Allelic Homologous Recombination (AHR) within the affected region. However, recombination rate estimates for 22q11.2 region show that neither average recombination rates in the 22q11.2 region or within LCR22-2 (the LCR implicated in most deletions and duplications), are significantly below chromosome 22 averages. Furthermore, LCR22-2, the repeat most frequently implicated in rearrangements, is also the LCR22 with the highest levels of AHR. In addition, we find recombination events in the 22q11.2 region to cluster within families. Within this context, the same chromosome recombines twice in one family; first by AHR and in the next generation by NAHR resulting in an individual affected with the del22q11.2 syndrome. Conclusion: We show in the context of a first high resolution pedigree map of the 22q11.2 region that NAHR within LCR22 leading to duplications and deletions cannot be explained exclusively under a hypothesis of low AHR rates. In addition, we find that AHR recombination events cluster within families. If normal and aberrant recombination are mechanistically related, the fact that LCR22s undergo frequent AHR and that we find familial differences in recombination rates within the 22q11.2 region would have obvious health-related implications.
Resumo:
Background Dopamine is believed to be a key neurotransmitter in the development of attention-deficit/ hyperactivity disorder (ADHD). Several recent studies point to an association of the dopamine D4 receptor (DRD4) gene and this condition. More specifically, the 7 repeat variant of a variable number of tandem repeats (VNTR) polymorphism in exon III of this gene is suggested to bear a higher risk for ADHD. In the present study, we investigated the role of this polymorphism in the modulation of neurophysiological correlates of response inhibition (Go/Nogo task) in a healthy, high-functioning sample. Results Homozygous 7 repeat carriers showed a tendency for more accurate behavior in the Go/Nogo task compared to homozygous 4 repeat carriers. Moreover, 7 repeat carriers presented an increased nogo-related theta band response together with a reduced go-related beta decrease. Conclusions These data point to improved cognitive functions and prefrontal control in the 7 repeat carriers, probably due to the D4 receptor's modulatory role in prefrontal areas. The results are discussed with respect to previous behavioral data on this polymorphism and animal studies on the impact of the D4 receptor on cognitive functions.
Resumo:
Thirty two microsatellites were optimized from 454 pyrosequencing libraries for three Atlanto-Mediterranean echinoderms: Coscinasterias tenuispina, Echinaster sepositus and Arbacia lixula. We observed different frequency of microsatellite types (di-, tri-, tetra- and pentanucleotide) throughout the genome of the species, but no significant differences were observed in allele richness among different microsatellite repeats. No loci showed linkage disequilibrium. Heterozygosity deficit and departure from Hardy Weinberg equilibrium were observed for some loci, in two species, probably due to high levels of inbreeding. Heterozygosity excess observed in C. tenuispina could be explained by selection against homozygotes and/or outcrossing.