Biblioteca Digital

Alignment-free methods, in which shared properties of sub-sequences (e.g. identity or match length) are extracted and used to compute a distance matrix, have recently been explored for phylogenetic inference. However, the scalability and robustness of these methods to key evolutionary processes remain to be investigated. Here, using simulated sequence sets of various sizes in both nucleotides and amino acids, we systematically assess the accuracy of phylogenetic inference using an alignment-free approach, based on D2 statistics, under different evolutionary scenarios. We find that compared to a multiple sequence alignment approach, D2 methods are more robust against among-site rate heterogeneity, compositional biases, genetic rearrangements and insertions/deletions, but are more sensitive to recent sequence divergence and sequence truncation. Across diverse empirical datasets, the alignment-free methods perform well for sequences sharing low divergence, at greater computation speed. Our findings provide strong evidence for the scalability and the potential use of alignment-free methods in large-scale phylogenomics.

Veja mais

Computational analysis of epigenetic information in human DNA sequences

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Over the last few years, investigations of human epigenetic profiles have identified key elements of change to be Histone Modifications, stable and heritable DNA methylation and Chromatin remodeling. These factors determine gene expression levels and characterise conditions leading to disease. In order to extract information embedded in long DNA sequences, data mining and pattern recognition tools are widely used, but efforts have been limited to date with respect to analyzing epigenetic changes, and their role as catalysts in disease onset. Useful insight, however, can be gained by investigation of associated dinucleotide distributions. The focus of this paper is to explore specific dinucleotides frequencies across defined regions within the human genome, and to identify new patterns between epigenetic mechanisms and DNA content. Signal processing methods, including Fourier and Wavelet Transformations, are employed and principal results are reported.

Veja mais

Complete genome sequences of Helicoverpa armigera Single Nucleopolyhedrovirus Strains AC53 and H25EA1 from Australia

Relevância:

20.00% 20.00%

Publicador:

Resumo:

We report here the genome sequences of two alphabaculoviruses of Helicoverpa spp. from Australia: AC53, used in the biopesticides ViVUS and ViVUS Max, and H25EA1, used in in vitro production studies.

Veja mais

Automated categorisation of patent claims that reference human genome sequences

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Debates on gene patents have necessitated the analysis of patents that disclose and reference human sequences. In this study, we built an automated classifier that assigns sequences to one of nine predefined categories according to their functional roles in patent claims by applying natural language processing and supervised learning techniques. To improve its correctness, we experimented with various feature mappings, resulting in the maximal accuracy of 79%.

Veja mais

Mapping the regulatory sequences controlling 93 breast cancer-associated miRNA genes leads to the identification of two functional promoters of the Hsa-mir-200b cluster, methylation of which is associated with metastasis or hormone receptor status in advanced breast cancer

Relevância:

20.00% 20.00%

Publicador:

Resumo:

MicroRNAs (miRNAs) are small non-coding RNAs of 20 nt in length that are capable of modulating gene expression post-transcriptionally. Although miRNAs have been implicated in cancer, including breast cancer, the regulation of miRNA transcription and the role of defects in this process in cancer is not well understood. In this study we have mapped the promoters of 93 breast cancer-associated miRNAs, and then looked for associations between DNA methylation of 15 of these promoters and miRNA expression in breast cancer cells. The miRNA promoters with clearest association between DNA methylation and expression included a previously described and a novel promoter of the Hsa-mir-200b cluster. The novel promoter of the Hsa-mir-200b cluster, denoted P2, is located 2 kb upstream of the 5′ stemloop and maps within a CpG island. P2 has comparable promoter activity to the previously reported promoter (P1), and is able to drive the expression of miR-200b in its endogenous genomic context. DNA methylation of both P1 and P2 was inversely associated with miR-200b expression in eight out of nine breast cancer cell lines, and in vitro methylation of both promoters repressed their activity in reporter assays. In clinical samples, P1 and P2 were differentially methylated with methylation inversely associated with miR-200b expression. P1 was hypermethylated in metastatic lymph nodes compared with matched primary breast tumours whereas P2 hypermethylation was associated with loss of either oestrogen receptor or progesterone receptor. Hypomethylation of P2 was associated with gain of HER2 and androgen receptor expression. These data suggest an association between miR-200b regulation and breast cancer subtype and a potential use of DNA methylation of miRNA promoters as a component of a suite of breast cancer biomarkers.

Veja mais

Draft genome and plasmid sequences of Chlamydia pneumoniae strain B21 from an Australian endangered marsupial, the western barred bandicoot

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Chlamydia pneumoniae is a ubiquitous intracellular pathogen, first associated with human respiratory disease and subsequently detected in a range of mammals, amphibians, and reptiles. Here we report the draft genome sequence for strain B21 of C. pneumoniae, isolated from the endangered Australian marsupial the western barred bandicoot.

Veja mais

Public disclosure of biological sequences in global patent practice

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Biological sequences are an important part of global patenting, with unique challenges for their effective and equitable use in practice and in policy. Because their function can only be determined with computer-aided technology, the form in which sequences are disclosed matters greatly. Similarly, the scope of patent rights sought and granted requires computer readable data and tools for comparison. Critically, the primary data provided to the national patent offices and thence to the public, must be comprehensive, standardized, timely and meaningful. It is not yet. The proposed global Patent Sequence (PatSeq) Data platform can enable national and regional jurisdictions meet the desired standards.

Veja mais

x-ray studies on crystalline complexes involving amino-acids and peptides .10. head-to-tail sequences in the crystal-structure of l-lysine acetate

Relevância:

20.00% 20.00%

Publicador:

Resumo:

l-Lysine acetate crystallises in the monoclinic space group P21 with a = 5.411 (1), b = 7.562(1), c= l2.635(2) Å and β = 91.7(1). The crystal structure was solved by direct methods and refined to an R value of 0.049 using the full matrix least squares method. The conformation and the aggregation of lysine molecules in the structure are similar to those found in the crystal structure of l-lysine l-aspartate. A conspicuous similarity between the crystal structures of l-arginine acetate and l-lysine acetate is that in both cases the strongly basic side chain, although having the largest pK value, interacts with the weakly acidic acetate group leaving the α-amino and the α-carboxylate groups to take part in head-to-tail sequences. These structures thus indicate that electrostatic effects are strongly modulated by other factors so as to give rise to head-to-tail sequences which have earlier been shown to be an almost universal feature of amino acid aggregation in the solid state.

Veja mais

Solution conformations of penta and heptapeptides containing repetitive α-aminoisobutyryl-Image -alanyl and α -aminoisobutyryl-Image -valyl sequences

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The presence of folded solution conformations in the peptides Boc-Ala-(Aib-Ala)2-OMe, Boc-Val-(Aib-Val) 2-OMe, Boc-Ala-(Aib-Ala)3-OMe and Boc-Val-(Aib-Val)3-OMe has been established by 270MHz 1H NMR. Intramolecularly H-bonded NH groups have been identified using temperature and solvent dependence of NH chemical shifts and paramagnetic radical induced broadening of NH resonances. Both pentapeptides adopt 310 helical conformations possessing 3 intramolecular H-bonds in CDCl3 and (CD3)2SO. The heptapeptides favour helical structures with 5 H-bonds in CDCl3. In (CD3)2SO only 4 H-bonds are readily detected.

Veja mais

Peanut stripe potyvirus resistance in peanut (Arachis hypogaea L.) plants carrying viral coat protein gene sequences

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Peanut (Arachis hypogaea L.) lines exhibiting high levels of resistance to peanut stripe virus (PStV) were obtained following microprojectile bombardment of embryogenic callus derived from mature seeds. Fertile plants of the commercial cultivars Gajah and NC7 were regenerated following co-bombardmentwith the hygromycin resistance gene and one of two forms of the PStV coat protein (CP) gene, an untranslatable, full length sequence (CP2) or a translatable gene encoding a CP with an N-terminal truncation (CP4). High level resistance to PStV was observed for both transgenes when plants were challenged with the homologous virus isolate. The mechanism of resistance appears to be RNA-mediated, since plants carrying either the untranslatable CP2 or CP4 had no detectable protein expression, but were resistant or immune (no virus replication). Furthermore, highly resistant, but not susceptible CP2 T0 plants contained transgene-specific small RNAs. These plants now provide important germplasm for peanut breeding, particularly in countries where PStV is endemic and poses a major constraint to peanut production.

Veja mais

Identification of small juvenile scombrids from northwest tropical Australia using mitochondrial DNA cytochrome b sequences

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Small juveniles of the nine species of scombrids in Australian waters are morphologically similar to one another and, consequently, difficult to identify to species level. We show that the sequence of the mitochondrial DNA cytochrome b gene region is a powerful tool for identification of these young fish. Using this method, we identified 50 juvenile scombrids collected from Exmouth Bay, Western Australia. Six species of scombrids were apparent in this sample of fish: narrow-barred Spanish mackerel (Scomberomorus commerson), Indian mackerel (Rastrelliger kanagurta), frigate tuna (Auxis thazard), bullet tuna (Auxis rochei), leaping bonito (Cybiosarda elegans), and kawakawa (Euthynnus affinis). The presence of Indian mackerel, frigate tuna, leaping bonito, and kawakawa is the first indication that coastal waters may be an important spawning habitat for these species, although offshore spawning may also occur. The occurrence of small juvenile S. commerson was predicted from the known spawning patterns of that species, but other mackerel species (Scomberomorus munroi, Scomberomorus queenslandicus, Scomberomorus semifasiciatus) likely to be spawning during the sampling period were not detected among the 50 small juveniles analyzed here.

Veja mais

Effects Of 5' Flanking Sequences And Changes In The 5' Internal Control Region On The Transcription Of Rice Transfer-Rna Gcc Cly Gene

Relevância:

20.00% 20.00%

Publicador:

Resumo:

A stretch of 71 nucleotides in a 1.2 kilobase pair Pst I fragment of rice DNA was identified as tRNA~ gene by hybridization and nucleotide sequence analyses. The hybridization of genomic DNA with the tRNA gene showed that there are about 10 glycine tRNA genes per diploid rice genome. The 3' and 5' internal control regions, where RNA polymerase III and transcription factors bind, were found to be present in the coding sequence. The gene was transcribed into a 4S product in an yeast cell-free extract. The substitution of 5' internal control region with analogous sequences from either M13mpl9 or M13mpl8 DNA did not affect the transcription of the gene in vitro. The changes in three highly conserved nucleotides in the consensus 5' internal control region (RGYNNARYGG; R = purine, Y = pyrimidine, N = any nucleotide) did not affect transcription showing that these nucleotides are not essential for promotion of transcription. There were two 16 base pair repeats, 'TGTTTGTTTCAGCTTA' at - 130 and - 375 positions upstream from the start of the gene. Deletion of 5' flanking sequences including the 16 base pair repeat at - 375 showed increased transcription indicating that these sequences negatively modulate the expression of the gene.

Veja mais

999 resultados para CHECKING SEQUENCES

Filtro por publicador