183 resultados para genomic sequence


Relevância:

20.00% 20.00%

Publicador:

Resumo:

Homologous recombination (HR) repairs chromosome damage and is indispensable for tumor suppression in humans. RAD51 mediates the DNA strand-pairing step in HR. RAD51 associated protein 1 (RAD51AP1) is a RAD51-interacting protein whose function has remained elusive. Knockdown of RAD51AP1 in human cells by RNA interference engenders sensitivity to different types of genotoxic stress, and RAD51AP1 is epistatic to the HR protein XRCC3. Moreover, RAD51AP1-depleted cells are impaired for the recombinational repair of a DNA double-strand break and exhibit chromatid breaks both spontaneously and upon DNA-damaging treatment. Purified RAD51AP1 binds both dsDNA and a D loop structure and, only when able to interact with RAD51, greatly stimulates the RAD51-mediated D loop reaction. Biochemical and cytological results show that RAD51AP1 functions at a step subsequent to the assembly of the RAD51-ssDNA nucleoprotein filament. Our findings provide evidence that RAD51AP1 helps maintain genomic integrity via RAD51 recombinase enhancement.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The 3′ UTRs of eukaryotic genes participate in a variety of post-transcriptional (and some transcriptional) regulatory interactions. Some of these interactions are well characterised, but an undetermined number remain to be discovered. While some regulatory sequences in 3′ UTRs may be conserved over long evolutionary time scales, others may have only ephemeral functional significance as regulatory profiles respond to changing selective pressures. Here we propose a sensitive segmentation methodology for investigating patterns of composition and conservation in 3′ UTRs based on comparison of closely related species. We describe encodings of pairwise and three-way alignments integrating information about conservation, GC content and transition/transversion ratios and apply the method to three closely related Drosophila species: D. melanogaster, D. simulans and D. yakuba. Incorporating multiple data types greatly increased the number of segment classes identified compared to similar methods based on conservation or GC content alone. We propose that the number of segments and number of types of segment identified by the method can be used as proxies for functional complexity. Our main finding is that the number of segments and segment classes identified in 3′ UTRs is greater than in the same length of protein-coding sequence, suggesting greater functional complexity in 3′ UTRs. There is thus a need for sustained and extensive efforts by bioinformaticians to delineate functional elements in this important genomic fraction. C code, data and results are available upon request.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

In this paper, the complete mitochondrial genome of Acraea issoria (Lepidoptera: Nymphalidae: Heliconiinae: Acraeini) is reported; a circular molecule of 15,245 bp in size. For A. issoria, genes are arranged in the same order and orientation as the complete sequenced mitochondrial genomes of the other lepidopteran species, except for the presence of an extra copy of tRNAIle(AUR)b in the control region. All protein-coding genes of A. issoria mitogenome start with a typical ATN codon and terminate in the common stop codon TAA, except that COI gene uses TTG as its initial codon and terminates in a single T residue. All tRNA genes possess the typical clover leaf secondary structure except for tRNASer(AGN), which has a simple loop with the absence of the DHU stem. The sequence, organization and other features including nucleotide composition and codon usage of this mitochondrial genome were also reported and compared with those of other sequenced lepidopterans mitochondrial genomes. There are some short microsatellite-like repeat regions (e.g., (TA)9, polyA and polyT) scattered in the control region, however, the conspicuous macro-repeats units commonly found in other insect species are absent.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The generation of a correlation matrix for set of genomic sequences is a common requirement in many bioinformatics problems such as phylogenetic analysis. Each sequence may be millions of bases long and there may be thousands of such sequences which we wish to compare, so not all sequences may fit into main memory at the same time. Each sequence needs to be compared with every other sequence, so we will generally need to page some sequences in and out more than once. In order to minimize execution time we need to minimize this I/O. This paper develops an approach for faster and scalable computing of large-size correlation matrices through the maximal exploitation of available memory and reducing the number of I/O operations. The approach is scalable in the sense that the same algorithms can be executed on different computing platforms with different amounts of memory and can be applied to different bioinformatics problems with different correlation matrix sizes. The significant performance improvement of the approach over previous work is demonstrated through benchmark examples.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Background Accurate diagnosis is essential for prompt and appropriate treatment of malaria. While rapid diagnostic tests (RDTs) offer great potential to improve malaria diagnosis, the sensitivity of RDTs has been reported to be highly variable. One possible factor contributing to variable test performance is the diversity of parasite antigens. This is of particular concern for Plasmodium falciparum histidine-rich protein 2 (PfHRP2)-detecting RDTs since PfHRP2 has been reported to be highly variable in isolates of the Asia-Pacific region. Methods The pfhrp2 exon 2 fragment from 458 isolates of P. falciparum collected from 38 countries was amplified and sequenced. For a subset of 80 isolates, the exon 2 fragment of histidine-rich protein 3 (pfhrp3) was also amplified and sequenced. DNA sequence and statistical analysis of the variation observed in these genes was conducted. The potential impact of the pfhrp2 variation on RDT detection rates was examined by analysing the relationship between sequence characteristics of this gene and the results of the WHO product testing of malaria RDTs: Round 1 (2008), for 34 PfHRP2-detecting RDTs. Results Sequence analysis revealed extensive variations in the number and arrangement of various repeats encoded by the genes in parasite populations world-wide. However, no statistically robust correlation between gene structure and RDT detection rate for P. falciparum parasites at 200 parasites per microlitre was identified. Conclusions The results suggest that despite extreme sequence variation, diversity of PfHRP2 does not appear to be a major cause of RDT sensitivity variation.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Epigenetic silencing mediated by CpG methylation is a common feature of many cancers. Characterizing aberrant DNA methylation changes associated with tumor progression may identify potential prognostic markers for prostate cancer (PCa). We treated two PCa cell lines, 22Rv1 and DU-145 with the demethylating agent 5-Aza 2’–deoxycitidine (DAC) and global methylation status was analyzed by performing methylation-sensitive restriction enzyme based differential methylation hybridization strategy followed by genome-wide CpG methylation array profiling. In addition, we examined gene expression changes using a custom microarray. Gene Set Enrichment Analysis (GSEA) identified the most significantly dysregulated pathways. In addition, we assessed methylation status of candidate genes that showed reduced CpG methylation and increased gene expression after DAC treatment, in Gleason score (GS) 8 vs. GS6 patients using three independent cohorts of patients; the publically available The Cancer Genome Atlas (TCGA) dataset, and two separate patient cohorts. Our analysis, by integrating methylation and gene expression in PCa cell lines, combined with patient tumor data, identified novel potential biomarkers for PCa patients. These markers may help elucidate the pathogenesis of PCa and represent potential prognostic markers for PCa patients.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Thraustochytrids have become of considerable industrial and scientific interest in the past decade due to their health benefits. They have been proven to be the principle source in marine and estuarine fish diets with high percentage of long chain (LC) or polyunsaturated fatty acids (PUFA). Therefore, the oil extracted from fish for human document.forms[0].elements[13].select();consumption is rich in PUFA with high omega-3 fatty acid content. Docosahexaenoic acid (DHA) and eicosapentaenoic acid (EPA) of all of the omega-3 fatty acids, are considered beneficial essential oils for humans with a wide range of health benefits. These include brain and neural development in infants, general wellbeing of adults and drug delivery through precursor molecules. They have become one of the most extensively studied organisms for industrial oil preparations as PUFA extraction from fish becomes less profitable. Many forms of these Thraustochytrid oils are being trialled for human consumption all over the world. In Australia, there has been little research performed on these organisms in the past ten years. A few Australian studies have been conducted in the form of comparative studies related to PUFA production within the related genera, but not focussed on their identification or cellular and genomic characterisation. Therefore, the main aim of this study was to investigate the morphological and genetic characteristics of Australian Thraustochytrids in order to aid in their identification and characterisation, as well as to better understand the effect of environmental conditions in the regulation of PUFA production. It was also noted that there was a knowledge gap in the preservation and total genomic DNA extraction of these organisms for the purposes of scientific research. The cryopreservation of these organisms for studies around the world follows existing generic methods. However, it is well understood that many of these generic methods attract not only high costs for chemicals, but also uses considerable storage space and other resources, all of which can be improved with new or modified approaches. In this context, a simple and inexpensive bead preservation method is described, without compromising the storage shelf life. We also describe, for the first time, the effects of culture age on the successful cryopreservation of Thraustochytrids. It was evident in the literature that DNA and RNA extractions for molecular and genetic studies of Thraustochytrids follow the classical phenol-chloroform extraction methods. It was also observed that modern protocols failed to avoid the use of phenol-chloroform rather than improving preparation and cell disruption. In order to provide a high quantity and quality DNA extraction, a modified protocol has been introduced that employs the use of modern commercial extraction kits and standard laboratory equipment. Thraustochytrids have been shown to be highly conserved in their 18S rDNA gene sequences, which is used as the current standard for identification. It was demonstrated that the 18S rDNA gene sequence limits the recognition of closely related genera or within the genera from each member. Therefore, it was proposed that another profile, such as a randomly amplified polymorphic DNA (RAPD) based profiling system, be tested for use in the characterisation of Thraustochytrids. The RAPD profiles were shown to provide a unique DNA fingerprint for each isolate and small variations in their genome were able to be detected. This method involved the use of a minimum number of standard arbitrary primers and with an increase in the number of different primers used, a very high discrimination between organisms could be achieved. However, the method was not suitable for taxonomic purposes because the results did not correlate with other taxonomic features such as morphology. Another knowledge gap was found with respect to Australian Thraustochytrid growth characteristics, in that these had not been recorded and published. In order to rectify this, a record of colony and microscopic features of 12 selected isolates was performed. The results of preliminary studies indicated that further microbiological and biochemical studies are needed for full characterisation of these organisms. This information is of great importance to bio-prospecting of new Thraustochytrids from Australian ecosystems and would allow for their accurate identification, and so permit the prediction of their PUFA capability by comparison with related genera/species. It was well recognized that environmental stress plays a role in the PUFA production and is mainly due to the reactive oxygen species as abiotic stress (Chiou et al., 2001; Okuyama et al., 2008; Shabala et al., 2009; Shabala et al., 2001). In this aspect, this study makes the first attempt towards better understanding of this phenomenon by way of the use of real-time PCR for the detection of environmental effects on the regulation of PUFA production. Three main environmental conditions including temperature, pH and oxygen availability were monitored as stress inducers. In summary, this study provides novel approaches for the preservation and handling of Thraustochytrids, their molecular biological features, taxonomy, characterisation and responses to environmental factors with respect to their oil production enzymes. The information produced from this study will prove to be vital for both industrial and scientific investigations in the future.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Autotransporter (AT) proteins are found in all Escherichia coli pathotypes and are often associated with virulence. In this study we took advantage of the large number of available E. coli genome sequences to perform an in-depth bioinformatic analysis of AT-encoding genes. Twenty-eight E. coli genome sequences were probed using an iterative approach, which revealed a total of 215 AT-encoding sequences that represented three major groups of distinct domain architecture: (i) serine protease AT proteins, (ii) trimeric AT adhesins and (iii) AIDA-I-type AT proteins. A number of subgroups were identified within each broad category, and most subgroups contained at least one characterized AT protein; however, seven subgroups contained no previously described proteins. The AIDA-I-type AT proteins represented the largest and most diverse group, with up to 16 subgroups identified from sequence-based comparisons. Nine of the AIDA-I-type AT protein subgroups contained at least one protein that possessed functional properties associated with aggregation and/or biofilm formation, suggesting a high degree of redundancy for this phenotype. The Ag43, YfaL/EhaC, EhaB/UpaC and UpaG subgroups were found in nearly all E. coli strains. Among the remaining subgroups, there was a tendency for AT proteins to be associated with individual E. coli pathotypes, suggesting that they contribute to tissue tropism or symptoms specific to different disease outcomes.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Background Designing novel proteins with site-directed recombination has enormous prospects. By locating effective recombination sites for swapping sequence parts, the probability that hybrid sequences have the desired properties is increased dramatically. The prohibitive requirements for applying current tools led us to investigate machine learning to assist in finding useful recombination sites from amino acid sequence alone. Results We present STAR, Site Targeted Amino acid Recombination predictor, which produces a score indicating the structural disruption caused by recombination, for each position in an amino acid sequence. Example predictions contrasted with those of alternative tools, illustrate STAR'S utility to assist in determining useful recombination sites. Overall, the correlation coefficient between the output of the experimentally validated protein design algorithm SCHEMA and the prediction of STAR is very high (0.89). Conclusion STAR allows the user to explore useful recombination sites in amino acid sequences with unknown structure and unknown evolutionary origin. The predictor service is available from http://pprowler.itee.uq.edu.au/star.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Determination of sequence similarity is a central issue in computational biology, a problem addressed primarily through BLAST, an alignment based heuristic which has underpinned much of the analysis and annotation of the genomic era. Despite their success, alignment-based approaches scale poorly with increasing data set size, and are not robust under structural sequence rearrangements. Successive waves of innovation in sequencing technologies – so-called Next Generation Sequencing (NGS) approaches – have led to an explosion in data availability, challenging existing methods and motivating novel approaches to sequence representation and similarity scoring, including adaptation of existing methods from other domains such as information retrieval. In this work, we investigate locality-sensitive hashing of sequences through binary document signatures, applying the method to a bacterial protein classification task. Here, the goal is to predict the gene family to which a given query protein belongs. Experiments carried out on a pair of small but biologically realistic datasets (the full protein repertoires of families of Chlamydia and Staphylococcus aureus genomes respectively) show that a measure of similarity obtained by locality sensitive hashing gives highly accurate results while offering a number of avenues which will lead to substantial performance improvements over BLAST..

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Deoxyribonucleic acid (DNA) extraction has considerably evolved since it was initially performed back in 1869. It is the first step required for many of the available downstream applications used in the field of molecular biology. Whole blood samples are one of the main sources used to obtain DNA, and there are many different protocols available to perform nucleic acid extraction on such samples. These methods vary from very basic manual protocols to more sophisticated methods included in automated DNA extraction protocols. Based on the wide range of available options, it would be ideal to determine the ones that perform best in terms of cost-effectiveness and time efficiency. We have reviewed DNA extraction history and the most commonly used methods for DNA extraction from whole blood samples, highlighting their individual advantages and disadvantages. We also searched current scientific literature to find studies comparing different nucleic acid extraction methods, to determine the best available choice. Based on our research, we have determined that there is not enough scientific evidence to support one particular DNA extraction method from whole blood samples. Choosing a suitable method is still a process that requires consideration of many different factors, and more research is needed to validate choices made at facilities around the world.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Background The koala, Phascolarctos cinereus, is a biologically unique and evolutionarily distinct Australian arboreal marsupial. The goal of this study was to sequence the transcriptome from several tissues of two geographically separate koalas, and to create the first comprehensive catalog of annotated transcripts for this species, enabling detailed analysis of the unique attributes of this threatened native marsupial, including infection by the koala retrovirus. Results RNA-Seq data was generated from a range of tissues from one male and one female koala and assembled de novo into transcripts using Velvet-Oases. Transcript abundance in each tissue was estimated. Transcripts were searched for likely protein-coding regions and a non-redundant set of 117,563 putative protein sequences was produced. In similarity searches there were 84,907 (72%) sequences that aligned to at least one sequence in the NCBI nr protein database. The best alignments were to sequences from other marsupials. After applying a reciprocal best hit requirement of koala sequences to those from tammar wallaby, Tasmanian devil and the gray short-tailed opossum, we estimate that our transcriptome dataset represents approximately 15,000 koala genes. The marsupial alignment information was used to look for potential gene duplications and we report evidence for copy number expansion of the alpha amylase gene, and of an aldehyde reductase gene. Koala retrovirus (KoRV) transcripts were detected in the transcriptomes. These were analysed in detail and the structure of the spliced envelope gene transcript was determined. There was appreciable sequence diversity within KoRV, with 233 sites in the KoRV genome showing small insertions/deletions or single nucleotide polymorphisms. Both koalas had sequences from the KoRV-A subtype, but the male koala transcriptome has, in addition, sequences more closely related to the KoRV-B subtype. This is the first report of a KoRV-B-like sequence in a wild population. Conclusions This transcriptomic dataset is a useful resource for molecular genetic studies of the koala, for evolutionary genetic studies of marsupials, for validation and annotation of the koala genome sequence, and for investigation of koala retrovirus. Annotated transcripts can be browsed and queried at http://koalagenome.org

Relevância:

20.00% 20.00%

Publicador:

Resumo:

We undertook analyses of mitochondrial DNA gene sequences and echolocation calls to resolve phylogenetic relationships among the related bat taxa Rhinolophus pusillus (sampled across China), R. monoceros (Taiwan), R. cornutus (main islands of Japan), and R. c. pumilus (Okinawa, Japan), Phylogenetic trees and genetic divergence analyses were constructed by combining new complete mitochondrial cytochrome-b gene sequences and partial mitochondrial control region sequences with published sequences. Our work showed that these 4 taxa formed monophyletic groups in the phylogenetic tree. However, low levels of sequence divergence among the taxa, together with similarities in body size and overlapping echolocation call frequencies, point to a lack of taxonomic distinctiveness. We therefore suggest that these taxa are better considered as geographical subspecies rather than distinct species, although this should not diminish the conservation importance of these island populations, which are important evolutionarily significant units. Based on our findings, we suggest that the similarities in body size and echolocation call frequency in these rhinolophids result from their recent common ancestry, whereas similarities in body size and call frequency with R. hipposideros of Europe are the result of convergent evolution.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Background Chlamydia pecorum is an important pathogen of domesticated livestock including sheep, cattle and pigs. This pathogen is also a key factor in the decline of the koala in Australia. We sequenced the genomes of three koala C. pecorum strains, isolated from the urogenital tracts and conjunctiva of diseased koalas. The genome of the C. pecorum VR629 (IPA) strain, isolated from a sheep with polyarthritis, was also sequenced. Results Comparisons of the draft C. pecorum genomes against the complete genomes of livestock C. pecorum isolates revealed that these strains have a conserved gene content and order, sharing a nucleotide sequence similarity > 98%. Single nucleotide polymorphisms (SNPs) appear to be key factors in understanding the adaptive process. Two regions of the chromosome were found to be accumulating a large number of SNPs within the koala strains. These regions include the Chlamydia plasticity zone, which contains two cytotoxin genes (toxA and toxB), and a 77 kbp region that codes for putative type III effector proteins. In one koala strain (MC/MarsBar), the toxB gene was truncated by a premature stop codon but is full-length in IPTaLE and DBDeUG. Another five pseudogenes were also identified, two unique to the urogenital strains C. pecorum MC/MarsBar and C. pecorum DBDeUG, respectively, while three were unique to the koala C. pecorum conjunctival isolate IPTaLE. An examination of the distribution of these pseudogenes in C. pecorum strains from a variety of koala populations, alongside a number of sheep and cattle C. pecorum positive samples from Australian livestock, confirmed the presence of four predicted pseudogenes in koala C. pecorum clinical samples. Consistent with our genomics analyses, none of these pseudogenes were observed in the livestock C. pecorum samples examined. Interestingly, three SNPs resulting in pseudogenes identified in the IPTaLE isolate were not found in any other C. pecorum strain analysed, raising questions over the origin of these point mutations. Conclusions The genomic data revealed that variation between C. pecorum strains were mainly due to the accumulation of SNPs, some of which cause gene inactivation. The identification of these genetic differences will provide the basis for further studies to understand the biology and evolution of this important animal pathogen. Keywords: Chlamydia pecorum; Single nucleotide polymorphism; Pseudogene; Cytotoxin