984 results for Data Coding.
Abstract:
We present and validate BlastR, a method for efficiently and accurately searching non-coding RNAs. Our approach relies on the comparison of di-nucleotides using BlosumR, a new log-odd substitution matrix. To use BlosumR for comparison, we recoded RNA sequences into protein-like sequences. We then showed that BlosumR can be used along with the BlastP algorithm to search non-coding RNA sequences. Using Rfam as a gold standard, we benchmarked this approach and showed BlastR to be more sensitive than BlastN. We also show that BlastR is both faster and more sensitive than BlastP used with a single-nucleotide log-odd substitution matrix. BlastR, when used in combination with WU-BlastP, is about 5% more accurate than WU-BlastN and about 50 times slower. The approach shown here is equally effective when combined with the NCBI-Blast package. The software is open-source freeware available from www.tcoffee.org/blastr.html.
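The recoding step described in this abstract can be illustrated with a minimal Python sketch. It is not the BlastR implementation: the letter assigned to each di-nucleotide, the use of overlapping di-nucleotides, and the function name `recode_rna` are all assumptions made for illustration.

```python
from itertools import product

NUCLEOTIDES = "ACGU"
# Hypothetical alphabet: 16 arbitrary amino-acid letters, one per di-nucleotide.
# The real BlastR/BlosumR assignment may differ.
AMINO_LETTERS = "ACDEFGHIKLMNPQRS"

# Pair each di-nucleotide (AA, AC, ..., UU) with one letter.
DINUC_TO_LETTER = {
    a + b: letter
    for (a, b), letter in zip(product(NUCLEOTIDES, repeat=2), AMINO_LETTERS)
}


def recode_rna(seq: str) -> str:
    """Recode an RNA sequence into a protein-like string.

    Assumption: overlapping di-nucleotides are used, so a sequence of
    length n yields n - 1 letters. Di-nucleotides containing an
    ambiguous base are written as 'X'.
    """
    seq = seq.upper().replace("T", "U")  # accept DNA-style input as well
    return "".join(
        DINUC_TO_LETTER.get(seq[i:i + 2], "X")
        for i in range(len(seq) - 1)
    )


if __name__ == "__main__":
    print(recode_rna("GGAUCC"))
```

A string recoded this way can be passed to a protein search tool together with a 16-letter substitution matrix, which is the general idea the abstract describes.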
Abstract:
Assessing the impact of cultural change on parasitism has been a central goal in archaeoparasitology. The influence of civilization and the development of empires on parasitism has not been evaluated. Presented here is a preliminary analysis of the change in human parasitism associated with the Inca conquest of the Lluta Valley in Northern Chile. Changes in parasite prevalence are described. The change in way of life imposed on the inhabitants of the Lluta Valley by the Incas caused an increase in parasitism.
Biased gene conversion and GC-content evolution in the coding sequences of reptiles and vertebrates.
Abstract:
Mammalian and avian genomes are characterized by a substantial spatial heterogeneity of GC-content, which is often interpreted as reflecting the effect of local GC-biased gene conversion (gBGC), a meiotic repair bias that favors G and C over A and T alleles in high-recombining genomic regions. Surprisingly, the first fully sequenced nonavian sauropsid (i.e., reptile), the green anole Anolis carolinensis, revealed a highly homogeneous genomic GC-content landscape, suggesting the possibility that gBGC might not be at work in this lineage. Here, we analyze GC-content evolution at third-codon positions (GC3) in 44 vertebrate species, including eight newly sequenced transcriptomes, with a specific focus on nonavian sauropsids. We report that reptiles, including the green anole, have a genome-wide distribution of GC3 similar to that of mammals and birds, and we infer a strong GC3-heterogeneity to be already present in the tetrapod ancestor. We further show that the dynamics of coding sequence GC-content are largely governed by karyotypic features in vertebrates, notably in the green anole, in agreement with the gBGC hypothesis. The discrepancy between third-codon positions and noncoding DNA regarding GC-content dynamics in the green anole could not be explained by the activity of transposable elements or selection on codon usage. This analysis highlights the unique value of third-codon positions as an insertion/deletion-free marker of nucleotide substitution biases that ultimately affect the evolution of proteins.
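For readers unfamiliar with GC3, the quantity analyzed above, the following minimal Python sketch shows how it can be computed for a single coding sequence. The function name `gc3`, the assumption that the sequence starts in frame, and the handling of ambiguous bases are illustrative choices, not the authors' pipeline.

```python
def gc3(cds: str) -> float:
    """Return the fraction of G or C at third-codon positions.

    Assumptions: `cds` is a DNA coding sequence that starts in frame;
    trailing bases that do not complete a codon are ignored; third
    positions holding an ambiguous base (anything outside A, C, G, T)
    are excluded from the count.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3      # drop an incomplete final codon
    third = cds[2:usable:3]               # every third base: indices 2, 5, 8, ...
    counted = [base for base in third if base in "ACGT"]
    if not counted:
        raise ValueError("no unambiguous third-codon positions")
    return sum(base in "GC" for base in counted) / len(counted)


if __name__ == "__main__":
    # Codons ATG, GCC, AAA have third positions G, C, A.
    print(gc3("ATGGCCAAA"))
```

Comparing the distribution of this value across genes, within and between species, is the kind of analysis the abstract summarizes.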
Abstract:
SUMMARY: Large sets of data, such as expression profiles from many samples, require analytic tools to reduce their complexity. The Iterative Signature Algorithm (ISA) is a biclustering algorithm. It was designed to decompose a large set of data into so-called 'modules'. In the context of gene expression data, these modules consist of subsets of genes that exhibit a coherent expression profile only over a subset of microarray experiments. Genes and arrays may be attributed to multiple modules and the level of required coherence can be varied resulting in different 'resolutions' of the modular mapping. In this short note, we introduce two BioConductor software packages written in GNU R: The isa2 package includes an optimized implementation of the ISA and the eisa package provides a convenient interface to run the ISA, visualize its output and put the biclusters into biological context. Potential users of these packages are all R and BioConductor users dealing with tabular (e.g. gene expression) data. AVAILABILITY: http://www.unil.ch/cbg/ISA CONTACT: sven.bergmann@unil.ch
Abstract:
Knowledge of the physiological parameters of laboratory animals used for biomedical research is crucial for following several experimental procedures. With the intent to establish baseline biologic parameters for non-human primates held in closed colonies, hematological and morphometric data of captive monkeys were determined. Data of clinically healthy rhesus macaques (Macaca mulatta), cynomolgus monkeys (Macaca fascicularis), and squirrel monkeys (Saimiri sciureus) were collected over a period of five years. Animals were separated according to sex and divided into five age groups. Hematological data were compared with those in the literature using Student's t test. Discrepancies with significance levels of 0.1, 1 or 5% were found in the hematological studies. Growth curves showed that the sexual dimorphism of rhesus monkeys appeared at an age of four years. At earlier ages, the differences between sexes could not be distinguished (p < 0.05). Sexual dimorphism in both squirrel monkeys and cynomolgus monkeys occurred at an age of about 32 months. Data presented in this paper could be useful for comparative studies using primates under similar conditions.