51 results for Repetitive DNA sequences


Abstract:

Species identification based on short sequences of DNA markers, that is, DNA barcoding, has emerged as an integral part of modern taxonomy. However, software for the analysis of large and multilocus barcoding data sets is scarce. The Basic Local Alignment Search Tool (BLAST) is currently the fastest tool capable of handling large databases (e.g. >5000 sequences), but its accuracy is a concern, and it has been criticized for its local optimization. More accurate current software, however, requires sequence alignment or complex calculations, which are time-consuming for large data sets during either data preprocessing or the search stage. Therefore, it is imperative to develop a practical program for both accurate and scalable species identification for DNA barcoding. In this context, we present VIP Barcoding: user-friendly software with a graphical user interface for rapid DNA barcoding. It adopts a hybrid, two-stage algorithm. First, an alignment-free composition vector (CV) method is used to reduce the search space by screening a reference database. The alignment-based K2P distance nearest-neighbour method is then employed to analyse the smaller data set generated in the first stage. In comparison with other software, we demonstrate that VIP Barcoding has (i) higher accuracy than Blastn and several alignment-free methods and (ii) higher scalability than alignment-based distance methods and character-based methods. These results suggest that this platform can handle both large-scale and multilocus barcoding data accurately and can contribute to DNA barcoding for modern taxonomy. VIP Barcoding is free and available at http://msl.sls.cuhk.edu.hk/vipbarcoding/.
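
The two-stage idea described in this abstract (alignment-free screening followed by a K2P nearest-neighbour search) can be illustrated with a minimal Python sketch. This is not VIP Barcoding's implementation: stage 1 uses plain k-mer counts with cosine similarity as a simplified stand-in for the composition vector method, stage 2 assumes the query and references are already aligned and of equal length, and the names `kmer_vector`, `cosine_similarity`, `k2p_distance`, `identify`, `k` and `shortlist` are all hypothetical.

```python
import math
from collections import Counter

PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def kmer_vector(seq, k=3):
    """Count overlapping k-mers (simplified stand-in for a composition vector)."""
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))


def cosine_similarity(u, v):
    """Cosine similarity between two k-mer count vectors."""
    dot = sum(u[key] * v[key] for key in u.keys() & v.keys())
    norm_u = math.sqrt(sum(c * c for c in u.values()))
    norm_v = math.sqrt(sum(c * c for c in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def k2p_distance(a, b):
    """Kimura 2-parameter distance for two pre-aligned, equal-length sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    compared = transitions = transversions = 0
    for x, y in zip(a, b):
        if x not in "ACGT" or y not in "ACGT":
            continue  # skip gaps and ambiguous bases
        compared += 1
        if x == y:
            continue
        if {x, y} <= PURINES or {x, y} <= PYRIMIDINES:
            transitions += 1      # A<->G or C<->T
        else:
            transversions += 1    # purine<->pyrimidine
    if compared == 0:
        raise ValueError("no comparable sites")
    p = transitions / compared
    q = transversions / compared
    arg1, arg2 = 1 - 2 * p - q, 1 - 2 * q
    if arg1 <= 0 or arg2 <= 0:
        return math.inf  # sequences too divergent for the K2P correction
    return -0.5 * math.log(arg1) - 0.25 * math.log(arg2)


def identify(query, references, k=3, shortlist=2):
    """Stage 1: keep the `shortlist` most similar references by k-mer profile.
    Stage 2: return the (label, distance) pair with the smallest K2P distance."""
    query_vec = kmer_vector(query, k)
    ranked = sorted(
        references.items(),
        key=lambda item: cosine_similarity(query_vec, kmer_vector(item[1], k)),
        reverse=True,
    )[:shortlist]
    return min(
        ((label, k2p_distance(query, seq)) for label, seq in ranked),
        key=lambda pair: pair[1],
    )
```

Calling `identify(query, {"sp_A": seq_a, "sp_B": seq_b, ...})` returns the label of the nearest shortlisted reference and its K2P distance. The shortlist size trades speed against the risk of discarding the true match in stage 1.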

Abstract:

Copy number variations (CNVs) as described in the healthy population are purported to contribute significantly to genetic heterogeneity. Recent studies have described CNVs using lymphoblastoid cell lines or by applying specifically developed algorithms to interrogate previously described data. However, the full extent of CNVs remains unclear. Using a high-density SNP array, we have undertaken a comprehensive investigation of chromosome 18 to discover CNVs and to characterise their distribution and association with chromosome architecture. We identified 399 CNVs, of which loss represents 98%, 58% are less than 2.5 kb in size and 71% are intergenic. Intronic deletions account for the majority of copy number changes with gene involvement. Furthermore, one-third of CNVs do not have putative breakpoints within repetitive sequences. We conclude that replicative processes, mediated either by repetitive elements or microhomology, account for the majority of CNVs in the healthy population. Genomic instability involving the formation of a non-B structure is demonstrated in one region.

Abstract:

MicroRNAs (miRNAs) are small non-coding RNAs of about 20 nt in length that are capable of modulating gene expression post-transcriptionally. Although miRNAs have been implicated in cancer, including breast cancer, the regulation of miRNA transcription and the role of defects in this process in cancer are not well understood. In this study we have mapped the promoters of 93 breast cancer-associated miRNAs, and then looked for associations between DNA methylation of 15 of these promoters and miRNA expression in breast cancer cells. The miRNA promoters with the clearest association between DNA methylation and expression included a previously described and a novel promoter of the Hsa-mir-200b cluster. The novel promoter of the Hsa-mir-200b cluster, denoted P2, is located 2 kb upstream of the 5′ stem-loop and maps within a CpG island. P2 has comparable promoter activity to the previously reported promoter (P1), and is able to drive the expression of miR-200b in its endogenous genomic context. DNA methylation of both P1 and P2 was inversely associated with miR-200b expression in eight out of nine breast cancer cell lines, and in vitro methylation of both promoters repressed their activity in reporter assays. In clinical samples, P1 and P2 were differentially methylated, with methylation inversely associated with miR-200b expression. P1 was hypermethylated in metastatic lymph nodes compared with matched primary breast tumours, whereas P2 hypermethylation was associated with loss of either oestrogen receptor or progesterone receptor. Hypomethylation of P2 was associated with gain of HER2 and androgen receptor expression. These data suggest an association between miR-200b regulation and breast cancer subtype and a potential use of DNA methylation of miRNA promoters as a component of a suite of breast cancer biomarkers.

Abstract:

1. The low density lipoprotein receptor is an important regulator of serum cholesterol which may have implications for the development of both hypertension and obesity. In this study, genotypes for a low density lipoprotein receptor gene (LDLR) dinucleotide polymorphism were determined in both lean and obese normotensive populations. 2. In previous cross-sectional association studies an ApaLI and a HincII polymorphism for LDLR were shown to be associated with obesity in essential hypertensives. However, these polymorphisms did not show an association with obesity in normotensives. 3. In contrast, this study reports that preliminary results for an LDLR microsatellite marker, located more towards the 3' end of the gene, show a significant association with obesity in the normotensive population studied. These results indicate that LDLR could play an important role in the development of obesity, which might be independent of hypertension.

Abstract:

CONTEXT: Polyalanine tract variations in transcription factors have been identified for a wide spectrum of developmental disorders. The thyroid transcription factor forkhead factor E1 (FOXE1) contains a polymorphic polyalanine tract with 12-22 alanines. Single-nucleotide polymorphisms (SNPs) close to this locus are associated with papillary thyroid cancer (PTC), and a strong linkage disequilibrium block extends across this region. OBJECTIVE: The objective of the study was to assess whether the FOXE1 polyalanine repeat region was associated with PTC and to assess the effect of polyalanine repeat region variants on protein expression, DNA binding, and transcriptional function on FOXE1-responsive promoters. DESIGN: This was a case-control study. SETTING: The study was conducted at a tertiary referral hospital. PATIENTS AND METHODS: The FOXE1 polyalanine repeat region and tag SNPs were genotyped in 70 PTC cases, with replication in a further 92 PTC cases, and compared with genotypes in 5767 healthy controls (including 5667 samples from the Wellcome Trust Case Control Consortium). In vitro studies were performed to examine the protein expression, DNA binding, and transcriptional function for FOXE1 variants of different polyalanine tract lengths. RESULTS: All the genotyped SNPs were in tight linkage disequilibrium, including the FOXE1 polyalanine repeat region. We confirmed the strong association of rs1867277 with PTC (overall P = 1 × 10⁻⁷, odds ratio 1.84, confidence interval 1.31-2.57). rs1867277 was in tight linkage disequilibrium with the FOXE1 polyalanine repeat region (r² = 0.95). FOXE1(16Ala) was associated with PTC with an odds ratio of 2.23 (confidence interval 1.42-3.50; P = 0.0005). Functional studies in vitro showed that FOXE1(16Ala) was transcriptionally impaired compared with FOXE1(14Ala), which was not due to differences in protein expression or DNA binding. CONCLUSIONS: We have confirmed the previous association of FOXE1 with PTC. Our data suggest that the coding polyalanine expansion in FOXE1 may be responsible for the observed association between FOXE1 and PTC.

Abstract:

We present the complete mitochondrial genome (accession number: LK995454) of an iconic Australian species, the eastern grey kangaroo (Macropus giganteus). The mitogenomic organization is consistent with other marsupials, encoding 13 protein-coding genes, 22 tRNA genes, 2 ribosomal RNA genes, an origin of light strand replication and a control region (D-loop). No repetitive sequences were detected in the control region. The M. giganteus mitogenome exemplifies a combination of tRNA gene order and structural peculiarities that appear to be unique to marsupials. We present a maximum likelihood phylogeny based on complete mitochondrial protein and RNA coding sequences that confirms the phylogenetic position of the grey kangaroo among macropodids.