966 results for de novo genome assembly


Relevance:

20.00% 20.00%

Publisher:

Abstract:

Siamese mud carp (Henicorhynchus siamensis) is a freshwater teleost of high economic importance in the Mekong River Basin. However, genetic data relevant for delineating wild stocks for management purposes are currently limited for this species. Here, we used 454 pyrosequencing to generate a partial genome survey sequence (GSS) dataset to develop simple sequence repeat (SSR) markers from H. siamensis genomic DNA. Data generated included a total of 65,954 sequence reads with an average length of 264 nucleotides, of which 2.79% contain SSR motifs. Based on GSS-BLASTx results, 10.5% of contigs and 8.1% of singletons possessed significant similarity (E value < 10⁻⁵), with the majority matching well to reported fish sequences. KEGG analysis identified several metabolic pathways that provide insights into specific potential roles and functions of sequences involved in molecular processes in H. siamensis. Top protein domains detected included reverse transcriptase, and the top putative functional transcript identified was an ORF2-encoded protein. A total of 1,837 sequences containing SSR motifs were identified, of which 422 qualified for primer design, and eight polymorphic loci have been tested, with average observed and expected heterozygosity estimated at 0.75 and 0.83, respectively. Regardless of their relative levels of polymorphism and heterozygosity, the microsatellite loci developed here are suitable for further population genetic studies in H. siamensis and may also be applicable to other related taxa.
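As a rough illustration of what "reads containing SSR motifs" means in practice, the sketch below scans a sequence for perfect tandem repeats with a regular expression. It is not the pipeline used in the study; the function name `find_ssrs` and the thresholds (unit length 2 to 6, at least 4 copies) are assumptions chosen only for the example.

```python
import re

def find_ssrs(seq, min_unit=2, max_unit=6, min_repeats=4):
    """Return (start, unit, repeat_count) for each perfect tandem repeat.

    Hypothetical helper with illustrative thresholds; real SSR-mining tools
    use per-unit-length thresholds and also handle imperfect repeats.
    Note: a homopolymer run such as AAAAAAAA is reported with unit "AA".
    """
    seq = seq.upper()
    # Group 2 captures the repeat unit (shortest unit tried first); the
    # backreference \2 then requires at least (min_repeats - 1) more copies.
    pattern = re.compile(
        r"(([ACGT]{%d,%d}?)\2{%d,})" % (min_unit, max_unit, min_repeats - 1)
    )
    hits = []
    for match in pattern.finditer(seq):
        unit = match.group(2)
        hits.append((match.start(), unit, len(match.group(1)) // len(unit)))
    return hits

# Example read with five copies of the dinucleotide AC starting at index 3.
read = "TTG" + "AC" * 5 + "AGGT"
print(find_ssrs(read))
```

Applying such a scan to every read and counting the reads with at least one hit would give a figure comparable in kind to the percentage quoted above, although the exact value depends on the thresholds chosen.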

Relevance:

20.00% 20.00%

Publisher:

Abstract:

Genetic recombination is a fundamental evolutionary mechanism promoting biological adaptation. Using engineered recombinants of the small single-stranded DNA plant virus, Maize streak virus (MSV), we experimentally demonstrate that fragments of genetic material only function optimally if they reside within genomes similar to those in which they evolved. The degree of similarity necessary for optimal functionality is correlated with the complexity of intragenomic interaction networks within which genome fragments must function. There is a striking correlation between our experimental results and the types of MSV recombinants that are detectable in nature, indicating that obligatory maintenance of intragenome interaction networks strongly constrains the evolutionary value of recombination for this virus and probably for genomes in general.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

Circoviruses lack an autonomous DNA polymerase and are dependent on the replication machinery of the host cell for de novo DNA synthesis. Accordingly, the viral DNA needs to cross both the plasma membrane and the nuclear envelope before replication can occur. Here we report on the subcellular distribution of the beak and feather disease virus (BFDV) capsid protein (CP) and replication-associated protein (Rep) expressed via recombinant baculoviruses in an insect cell system and test the hypothesis that the CP is responsible for transporting the viral genome, as well as Rep, across the nuclear envelope. The intracellular localization of the BFDV CP was found to be directed by three partially overlapping bipartite nuclear localization signals (NLSs) situated between residues 16 and 56 at the N terminus of the protein. Moreover, a DNA binding region was also mapped to the N terminus of the protein and falls within the region containing the three putative NLSs. The ability of CP to bind DNA, coupled with the karyophilic nature of this protein, strongly suggests that it may be responsible for nuclear targeting of the viral genome. Interestingly, whereas Rep expressed on its own in insect cells is restricted to the cytoplasm, coexpression with CP alters the subcellular localization of Rep to the nucleus, strongly suggesting that an interaction with CP facilitates movement of Rep into the nucleus. Copyright © 2006, American Society for Microbiology. All Rights Reserved.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

Exponential growth of genomic data in the last two decades has made manual analyses impractical for all but trial studies. As genomic analyses have become more sophisticated, and have moved toward comparisons across large datasets, computational approaches have become essential. One of the most important biological questions is to understand the mechanisms underlying gene regulation. Genetic regulation is commonly investigated and modelled through the use of transcriptional regulatory network (TRN) structures. These model the regulatory interactions between two key components: transcription factors (TFs) and the target genes (TGs) they regulate. Transcriptional regulatory networks have proven to be invaluable scientific tools in bioinformatics. When used in conjunction with comparative genomics, they have provided substantial insights into the evolution of regulatory interactions. Current approaches to regulatory network inference, however, omit two additional key entities: promoters and transcription factor binding sites (TFBSs). In this study, we attempted to explore the relationships among these regulatory components in bacteria. Our primary goal was to identify relationships that can assist in reducing the high false positive rates associated with transcription factor binding site predictions and thereupon enhance the reliability of the inferred transcription regulatory networks. In our preliminary exploration of relationships between the key regulatory components in Escherichia coli transcription, we discovered a number of potentially useful features. The combination of location score and sequence dissimilarity scores increased de novo binding site prediction accuracy by 13.6%. Another important observation concerned the relationship between transcription factors grouped by their regulatory role and corresponding promoter strength.
Our study of E. coli σ70 promoters found support at the 0.1 significance level for our hypothesis that weak promoters are preferentially associated with activator binding sites to enhance gene expression, whilst strong promoters have more repressor binding sites to repress or inhibit gene transcription. Although the observations were specific to σ70, they nevertheless strongly encourage additional investigations when more experimentally confirmed data are available. In our preliminary exploration of relationships between the key regulatory components in E. coli transcription, we discovered a number of potentially useful features, some of which proved successful in reducing the number of false positives when applied to re-evaluate binding site predictions. Of chief interest was the relationship observed between promoter strength and TFs with respect to their regulatory role. Based on the common assumption that promoter homology positively correlates with transcription rate, we hypothesised that weak promoters would have more transcription factors that enhance gene expression, whilst strong promoters would have more repressor binding sites. The t-tests performed on E. coli σ70 promoters returned a p-value of 0.072, which at the 0.1 significance level suggested support for our (alternative) hypothesis, although this trend may only be present for promoters where the corresponding TFBSs are either all repressors or all activators. Nevertheless, such suggestive results strongly encourage additional investigations when more experimentally confirmed data become available. Much of the remainder of the thesis concerns a machine learning study of binding site prediction, using the SVM and kernel methods, principally the spectrum kernel.
Spectrum kernels have been successfully applied in previous studies of protein classification [91, 92], as well as the related problem of promoter predictions [59], and we have here successfully applied the technique to refining TFBS predictions. The advantages provided by the SVM classifier were best seen in 'moderately'-conserved transcription factor binding sites, as represented by our E. coli CRP case study. Inclusion of additional position feature attributes further increased accuracy by 9.1%, but more notable was the considerable decrease in false positive rate from 0.8 to 0.5 while retaining 0.9 sensitivity. Improved prediction of transcription factor binding sites is in turn extremely valuable in improving inference of regulatory relationships, a problem notoriously prone to false positive predictions. Here, the number of false regulatory interactions inferred using the conventional two-component model was substantially reduced when we integrated de novo transcription factor binding site predictions as an additional criterion for acceptance in a case study of inference in the Fur regulon. This initial work was extended to a comparative study of the iron regulatory system across 20 Yersinia strains. This work revealed interesting, strain-specific differences, especially between pathogenic and non-pathogenic strains. Such differences were made clear through interactive visualisations using the TRNDiff software developed as part of this work, and would have remained undetected using conventional methods. This approach led to the nomination of the Yfe iron-uptake system as a candidate for further wet-lab experimentation due to its potential active functionality in non-pathogens and its known participation in full virulence of the bubonic plague strain. Building on this work, we introduced novel structures we have labelled as 'regulatory trees', inspired by the phylogenetic tree concept.
Instead of using gene or protein sequence similarity, the regulatory trees were constructed based on the number of similar regulatory interactions. While the common phylogenetic trees convey information regarding changes in gene repertoire, which we might regard as analogous to 'hardware', the regulatory tree informs us of the changes in regulatory circuitry, in some respects analogous to 'software'. In this context, we explored the 'pan-regulatory network' for the Fur system, the entire set of regulatory interactions found for the Fur transcription factor across a group of genomes. In the pan-regulatory network, emphasis is placed on how the regulatory network for each target genome is inferred from multiple sources instead of a single source, as is the common approach. The benefit of using multiple reference networks is a more comprehensive survey of the relationships, and increased confidence in the regulatory interactions predicted. In the present study, we distinguish between relationships found across the full set of genomes, the 'core-regulatory-set', and interactions found only in a subset of the genomes explored, the 'sub-regulatory-set'. We found nine Fur target gene clusters present across the four genomes studied, this core set potentially identifying basic regulatory processes essential for survival. Species-level differences are seen at the sub-regulatory-set level; for example, the known virulence factors YbtA and PchR were found in Y. pestis and P. aeruginosa respectively, but were not present in either E. coli or B. subtilis. Such factors, and the iron-uptake systems they regulate, are ideal candidates for wet-lab investigation to determine whether or not they are pathogen-specific. In this study, we employed a broad range of approaches to address our goals and assessed these methods using the Fur regulon as our initial case study.
We identified a set of promising feature attributes; demonstrated their success in increasing transcription factor binding site prediction specificity while retaining sensitivity, and showed the importance of binding site predictions in enhancing the reliability of regulatory interaction inferences. Most importantly, these outcomes led to the introduction of a range of visualisations and techniques, which are applicable across the entire bacterial spectrum and can be utilised in studies beyond the understanding of transcriptional regulatory networks.
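The spectrum kernel named in the abstract above compares two sequences by the k-mers they share. The sketch below is a minimal, unnormalised version written for illustration only; the abstract does not state the value of k, any normalisation, or the SVM configuration used in the thesis, so the names `spectrum` and `spectrum_kernel` and the default `k=3` are assumptions.

```python
from collections import Counter

def spectrum(seq, k):
    """Count every overlapping k-mer in seq (the sequence's k-spectrum)."""
    seq = seq.upper()
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))

def spectrum_kernel(a, b, k=3):
    """Unnormalised k-spectrum kernel: dot product of the two k-mer count vectors."""
    spec_a, spec_b = spectrum(a, k), spectrum(b, k)
    # Counter returns 0 for k-mers absent from spec_b, so only shared k-mers contribute.
    return sum(count * spec_b[kmer] for kmer, count in spec_a.items())

# Two short, made-up candidate binding-site sequences.
print(spectrum_kernel("TGTGATCTAG", "TGTGACGTAG", k=3))
```

A matrix of such kernel values over a set of candidate sites is the kind of input a kernel-based classifier such as an SVM can be trained on.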

Relevance:

20.00% 20.00%

Publisher:

Abstract:

In the past few years, remarkable progress has been made in unveiling novel and unique optical properties of strongly coupled plasmonic nanostructures. However, application of such plasmonic nanostructures in biomedicine remains challenging due to the lack of facile and robust assembly methods for producing stable nanostructures. Previous attempts to achieve plasmonic nano-assemblies using molecular ligands were limited due to the lack of flexibility that could be exercised in forming them. Here, we report the utilization of tailor-made hyperbranched polymers (HBP) as linkers to assemble gold nanoparticles (NPs) into nano-assemblies. The ease and flexibility in tuning the particle size and number of branch ends of an HBP make it an ideal candidate as a linker, as opposed to DNA, small organic molecules and linear or dendrimeric polymers. We report a strong correlation of polymer (HBP) concentration with the size of the hybrid nano-assemblies and “hot-spot” density. We have shown that such solutions of stable HBP-gold nano-assemblies can be barcoded with various Raman tags to provide improved surface-enhanced Raman scattering (SERS) compared with non-aggregated NP systems. These Raman barcoded hybrid nano-assemblies, with further optimization of NP shape, size and “hot-spot” density, may find application as diagnostic tools in nanomedicine.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

Androgen-dependent pathways regulate maintenance and growth of normal and malignant prostate tissues. Androgen deprivation therapy (ADT) exploits this dependence and is used to treat metastatic prostate cancer; however, regression initially seen with ADT gives way to development of incurable castration-resistant prostate cancer (CRPC). Although ADT generates a therapeutic response, it is also associated with a pattern of metabolic alterations consistent with metabolic syndrome, including elevated circulating insulin. Because CRPC cells are capable of synthesizing androgens de novo, we hypothesized that insulin may also influence steroidogenesis in CRPC. In this study, we examined this hypothesis by evaluating the effect of insulin on steroid synthesis in prostate cancer cell lines. Treatment with 10 nmol/L insulin increased mRNA and protein expression of steroidogenesis enzymes and upregulated insulin receptor substrate 2 (IRS-2). Similarly, insulin treatment upregulated intracellular testosterone levels and secreted androgens, with the concentrations of steroids observed similar to the levels reported in prostate cancer patients. With similar potency to dihydrotestosterone, insulin treatment resulted in increased mRNA expression of prostate-specific antigen. CRPC progression also correlated with increased expression of IRS-2 and insulin receptor in vivo. Taken together, our findings support the hypothesis that the elevated insulin levels associated with therapeutic castration may exacerbate progression of prostate cancer to incurable CRPC in part by enhancing steroidogenesis.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

Genome-wide association studies (GWAS) have identified multiple common genetic variants associated with an increased risk of prostate cancer (PrCa), but these explain less than one-third of the heritability. To identify further susceptibility alleles, we conducted a meta-analysis of four GWAS including 5953 cases of aggressive PrCa and 11 463 controls (men without PrCa). We computed association tests for approximately 2.6 million SNPs and followed up the most significant SNPs by genotyping 49 121 samples in 29 studies through the international PRACTICAL and BPC3 consortia. We not only confirmed the association of a PrCa susceptibility locus, rs11672691 on chromosome 19, but also showed an association with aggressive PrCa [odds ratio = 1.12 (95% confidence interval 1.03-1.21), P = 1.4 × 10⁻⁸]. This report describes a genetic variant associated with aggressive PrCa, a type of PrCa with a poorer prognosis.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

Twenty-first-century learners operate in organic, immersive environments. A pedagogy of student-centred learning is not a recipe for rooms. A contemporary learning environment is like a landscape that grows, morphs, and responds to the pressures of the context and micro-culture. There is no single adaptable solution, nor a suite of off-the-shelf answers; propositions must be customisable and infinitely variable. They must be indeterminate and changeable, based on the creation of learning places, not restrictive or constraining spaces. A sustainable solution will be un-fixed, responsive to the life cycle of the components and materials, able to be manipulated by the users; it will create and construct its own history. Learning occurs as formal education with situational knowledge structures, but also as informal learning, active learning, blended learning, social learning, incidental learning, and unintended learning. These are not spatial concepts but socio-cultural patterns of discovery. Individual learning requirements must run free and need to be accommodated as the learner sees fit. The spatial solution must accommodate and enable a full array of learning situations. It is a system, not an object.
Three major components:
1. The determinate landscape: an in-situ concrete 'plate' that is permanent. It predates the other components of the system and remains as a remnant/imprint/fossil after the other components of the system have been relocated. It is a functional learning landscape in its own right, enabling a variety of experiences and activities.
2. The indeterminate landscape: a kit of pre-fabricated 2-D panels assembled in a unique manner at each site to suit the client and context. Manufactured to the principles of design-for-disassembly. A symbiotic, barnacle-like system that attaches itself to the existing infrastructure through the determinate landscape, which acts as a fast-growth rhizome. A carapace of protective panels, infinitely variable to create enclosed, semi-enclosed, and open learning places.
3. The stations: pre-fabricated packages of highly-serviced space connected through the determinate landscape. Four main types of stations: wet-room learning centres, dry-room learning centres, ablutions, and low-impact building services. Entirely customised at the factory and delivered to site. The stations can be retro-fitted to suit a new context during relocation.
Principles of design for disassembly:
Material principles
• use recycled and recyclable materials
• minimise the number of types of materials
• no toxic materials
• use lightweight materials
• avoid secondary finishes
• provide identification of material types
Component principles
• minimise/standardise the number of types of components
• use mechanical not chemical connections
• design for use of common tools and equipment
• provide easy access to all components
• make component size to suit means of handling
• provide built-in means of handling
• design to realistic tolerances
• use a minimum number of connectors and a minimum number of types
System principles
• design for durability and repeated use
• use prefabrication and mass production
• provide spare components on site
• sustain all assembly and material information

Relevance:

20.00% 20.00%

Publisher:

Abstract:

Approximately 2500 fly species comprise the Sarcophagidae family worldwide. The complete mitochondrial genome of the carrion-breeding, forensically important Sarcophaga impatiens Walker (Diptera: Sarcophagidae) from Australia was sequenced. The 15,169 bp circular genome contains the 37 genes found in a typical Metazoan genome: 13 protein-coding genes, 2 ribosomal RNA genes and 22 transfer RNA genes. It also contains one non-coding A+T-rich region. The arrangement of the genes was the same as that found in the ancestral insect. All the protein initiation codons are ATN, except for cox1 that begins with TCG (encoding S). The 22 tRNA anticodons of S. impatiens are consistent with those observed in Drosophila yakuba, and all form the typical cloverleaf structure, except for tRNA-Ser(AGN) that lacks the DHU arm. The mitochondrial genome of Sarcophaga presented will be valuable for resolving phylogenetic relationships within the family Sarcophagidae and the order Diptera, and could be used to identify favourable genetic markers for species identifications for forensic purposes.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

A genome-wide search for markers associated with BSE incidence was performed by using Transmission-Disequilibrium Tests (TDTs). Significant segregation distortion, i.e., unequal transmission probabilities of alleles within a locus, was found for three marker loci on Chromosomes (Chrs) 5, 10, and 20. Although TDTs are robust to false associations owing to hidden population substructures, they cannot distinguish segregation distortion caused by a true association between a marker and bovine spongiform encephalopathy (BSE) from a population-wide distortion. An interaction test and a segregation distortion analysis in half-sib controls were used to disentangle these two alternative hypotheses. None of the markers showed any significant interaction between allele transmission rates and disease status, and only the marker on Chr 10 showed a significant segregation distortion in control individuals. Nevertheless, the control group may have been a mixture of resistant and susceptible but unchallenged individuals. When new genotypes were generated in the vicinity of these three markers, evidence for an association with BSE was confirmed for the locus on Chr 5.
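For readers unfamiliar with the test, the sketch below computes the standard biallelic TDT statistic, a McNemar-type chi-square on how often heterozygous parents transmit a given allele to affected offspring. It is a textbook form and not necessarily the exact variant used in the study; the function names and the counts in the example are invented for illustration.

```python
import math

def tdt_statistic(transmitted, untransmitted):
    """Chi-square statistic (1 degree of freedom) for the basic TDT.

    transmitted / untransmitted: number of times heterozygous parents did or
    did not pass the allele of interest to affected offspring.
    """
    total = transmitted + untransmitted
    if total == 0:
        raise ValueError("no informative (heterozygous-parent) transmissions")
    return (transmitted - untransmitted) ** 2 / total

def chi2_1df_pvalue(statistic):
    """Upper-tail p-value of a chi-square variable with 1 degree of freedom."""
    return math.erfc(math.sqrt(statistic / 2.0))

# Hypothetical counts: the allele was transmitted 60 times and withheld 40 times.
stat = tdt_statistic(60, 40)
print(stat, chi2_1df_pvalue(stat))
```

Under no association and no segregation distortion, each allele is transmitted with probability 0.5, which is why a significant statistic alone cannot separate the two explanations discussed in the abstract.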

Relevance:

20.00% 20.00%

Publisher:

Abstract:

Chromium oxyhydroxide nanomaterials with a narrow size distribution were synthesised through a simple hydrothermal method. Experimental conditions, such as reaction duration and the pH values of the precipitation process and hydrothermal treatment, played important roles in determining the nature of the final chromium oxyhydroxide nanomaterials. The effects of these synthesis parameters were studied with the assistance of X-ray diffraction, scanning electron microscopy, X-ray photoelectron spectroscopy and thermogravimetric analyses. This research has developed a controllable synthesis of chromium oxyhydroxide nanomaterials from chromium oxide colloids.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

Phylogenetic inference from sequences can be misled by both sampling (stochastic) error and systematic error (nonhistorical signals where reality differs from our simplified models). A recent study of eight yeast species using 106 concatenated genes from complete genomes showed that even small internal edges of a tree received 100% bootstrap support. This effective negation of stochastic error from large data sets is important, but longer sequences exacerbate the potential for biases (systematic error) to be positively misleading. Indeed, when we analyzed the same data set using minimum evolution optimality criteria, an alternative tree received 100% bootstrap support. We identified a compositional bias as responsible for this inconsistency and showed that it is reduced effectively by coding the nucleotides as purines and pyrimidines (RY-coding), reinforcing the original tree. Thus, a comprehensive exploration of potential systematic biases is still required, even though genome-scale data sets greatly reduce sampling error.
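RY-coding, as described in the abstract above, collapses the four nucleotides into two classes, purines (R: A, G) and pyrimidines (Y: C, T), so that only transversions remain visible and base-composition differences within each class are masked. The sketch below is a minimal illustration; the function name `ry_code` is hypothetical, and treating U as a pyrimidine and leaving gaps or ambiguity codes unchanged are assumptions of this example.

```python
# Translation table: purines (A, G) -> R; pyrimidines (C, T, U) -> Y.
_RY_TABLE = str.maketrans("AGCTU", "RRYYY")

def ry_code(seq):
    """Recode a nucleotide sequence as purines (R) and pyrimidines (Y).

    Characters outside A, C, G, T, U (gaps, N, other ambiguity codes) are
    left unchanged.
    """
    return seq.upper().translate(_RY_TABLE)

# A transition (A<->G or C<->T) becomes invisible; a transversion is kept.
print(ry_code("ACGTTA"))
print(ry_code("GCATTA"))
```

The two example sequences differ only by transitions and therefore recode to the same string, which is the sense in which the recoding discards the compositionally biased part of the signal.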

Relevance:

20.00% 20.00%

Publisher:

Abstract:

Complementary sequences at the 5′ and 3′ ends of the dengue virus RNA genome are essential for viral replication, and are believed to cyclise the genome through long-range base pairing in cis. Although consistent with evidence in the literature, this view neglects possible biologically active multimeric forms that are equally consistent with the data. Here, we propose alternative multimeric structures, and suggest that multigenome noncovalent concatemers are more likely to exist under cellular conditions than single cyclised monomers. Concatemers provide a plausible mechanism for the dengue virus to overcome the single-stranded (+)-sense RNA virus dilemma, and can potentially assist genome transport from the virus-induced vesicles into the cytosol.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

Migraine is a common neurological disease with a complex genetic aetiology. The disease affects ~12% of the Caucasian population and females are three times more likely than males to be diagnosed. In an effort to identify loci involved in migraine susceptibility, we performed a pedigree-based genome-wide association study of the isolated population of Norfolk Island, which has a high prevalence of migraine. This unique population originates from a small number of British and Polynesian founders who are descendants of the Bounty mutiny and forms a very large multigenerational pedigree (Bellis et al.; Human Genetics, 124(5):543-5542, 2008). These population genetic features may facilitate disease gene mapping strategies (Peltonen et al.; Nat Rev Genet, 1(3):182-90, 2000). In this study, we identified a high heritability of migraine in the Norfolk Island population (h² = 0.53, P = 0.016). We performed a pedigree-based GWAS and utilised a statistical and pathological prioritisation approach to implicate a number of variants in migraine. An SNP located in the zinc finger protein 555 (ZNF555) gene (rs4807347) showed evidence of statistical association in our Norfolk Island pedigree (P = 9.6 × 10⁻⁶) as well as replication in a large independent and unrelated cohort with >500 migraineurs. In addition, we utilised a biological prioritisation to implicate four SNPs within the ADARB2 gene, two SNPs within the GRM7 gene and a single SNP in close proximity to the HTR7 gene. Association of SNPs within these neurotransmitter-related genes suggests a disrupted serotoninergic system that is perhaps specific to the Norfolk Island pedigree, but that might provide clues to understanding migraine more generally.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

The high risk of metabolic disease traits in Polynesians may be partly explained by elevated prevalence of genetic variants involved in energy metabolism. The genetics of Polynesian populations has been shaped by island-hopping migration events, which have possibly favoured thrifty genes. The aim of this study was to sequence the mitochondrial genome in a group of Maori in an effort to characterise genome variation in this Polynesian population for use in future disease association studies. We sequenced the complete mitochondrial genomes of 20 non-admixed Maori subjects using Affymetrix technology. DNA diversity analyses showed the Maori group exhibited reduced mitochondrial genome diversity compared to other worldwide populations, which is consistent with historical bottleneck and founder effects. Global phylogenetic analysis positioned these Maori subjects specifically within mitochondrial haplogroup B4a1a1. Interestingly, we identified several novel variants that collectively form new and unique Maori motifs: B4a1a1c, B4a1a1a3 and B4a1a1a5. Compared to ancestral populations, we observed an increased frequency of non-synonymous coding variants of several mitochondrial genes in the Maori group, which may be a result of positive selection and/or genetic drift effects. In conclusion, this study reports the first complete mitochondrial genome sequence data for a Maori population. Overall, these new data reveal novel mitochondrial genome signatures in this Polynesian population and enhance the phylogenetic picture of maternal ancestry in Oceania. The increased frequency of several mitochondrial coding variants makes them good candidates for future studies aimed at assessment of metabolic disease risk in Polynesian populations.