953 resultados para Molecular sequence data


Relevância:

80.00% 80.00%

Publicador:

Resumo:

Abstract Background The CACTA (also called En/Spm) superfamily of DNA-only transposons contain the core sequence CACTA in their Terminal Inverted Repeats (TIRs) and so far have only been described in plants. Large transcriptome and genome sequence data have recently become publicly available for Schistosoma mansoni, a digenetic blood fluke that is a major causative agent of schistosomiasis in humans, and have provided a comprehensive repository for the discovery of novel genes and repetitive elements. Despite the extensive description of retroelements in S. mansoni, just a single DNA-only transposon belonging to the Merlin family has so far been reported in this organism. Results We describe a novel S. mansoni transposon named SmTRC1, for S. mansoni Transposon Related to CACTA 1, an element that shares several characteristics with plant CACTA transposons. Southern blotting indicates approximately 30–300 copies of SmTRC1 in the S. mansoni genome. Using genomic PCR followed by cloning and sequencing, we amplified and characterized a full-length and a truncated copy of this element. RT-PCR using S. mansoni mRNA followed by cloning and sequencing revealed several alternatively spliced transcripts of this transposon, resulting in distinct ORFs coding for different proteins. Interestingly, a survey of complete genomes from animals and fungi revealed several other novel TRC elements, indicating new families of DNA transposons belonging to the CACTA superfamily that have not previously been reported in these kingdoms. The first three bases in the S. mansoni TIR are CCC and they are identical to those in the TIRs of the insects Aedes aegypti and Tribolium castaneum, suggesting that animal TRCs may display a CCC core sequence. Conclusion The DNA-only transposable element SmTRC1 from S. mansoni exhibits various characteristics, such as generation of multiple alternatively-spliced transcripts, the presence of terminal inverted repeats at the extremities of the elements flanked by direct repeats and the presence of a Transposase_21 domain, that suggest a distant relationship to CACTA transposons from Magnoliophyta. Several sequences from other Metazoa and Fungi code for proteins similar to those encoded by SmTRC1, suggesting that such elements have a common ancestry, and indicating inheritance through vertical transmission before separation of the Eumetazoa, Fungi and Plants.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Abstract Background The mitochondrial DNA of kinetoplastid flagellates is distinctive in the eukaryotic world due to its massive size, complex form and large sequence content. Comprised of catenated maxicircles that contain rRNA and protein-coding genes and thousands of heterogeneous minicircles encoding small guide RNAs, the kinetoplast network has evolved along with an extreme form of mRNA processing in the form of uridine insertion and deletion RNA editing. Many maxicircle-encoded mRNAs cannot be translated without this post-transcriptional sequence modification. Results We present the complete sequence and annotation of the Trypanosoma cruzi maxicircles for the CL Brener and Esmeraldo strains. Gene order is syntenic with Trypanosoma brucei and Leishmania tarentolae maxicircles. The non-coding components have strain-specific repetitive regions and a variable region that is unique for each strain with the exception of a conserved sequence element that may serve as an origin of replication, but shows no sequence identity with L. tarentolae or T. brucei. Alternative assemblies of the variable region demonstrate intra-strain heterogeneity of the maxicircle population. The extent of mRNA editing required for particular genes approximates that seen in T. brucei. Extensively edited genes were more divergent among the genera than non-edited and rRNA genes. Esmeraldo contains a unique 236-bp deletion that removes the 5'-ends of ND4 and CR4 and the intergenic region. Esmeraldo shows additional insertions and deletions outside of areas edited in other species in ND5, MURF1, and MURF2, while CL Brener has a distinct insertion in MURF2. Conclusion The CL Brener and Esmeraldo maxicircles represent two of three previously defined maxicircle clades and promise utility as taxonomic markers. Restoration of the disrupted reading frames might be accomplished by strain-specific RNA editing. Elements in the non-coding region may be important for replication, transcription, and anchoring of the maxicircle within the kinetoplast network.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

The Poxviruses are a family of double stranded DNA (dsDNA) viruses that cause disease in many species, both vertebrate and invertebrate. Their genomes range in size from 135 to 365 kbp and show conservation in both organization and content. In particular, the central genomic regions of the chordopoxvirus subfamily (those capable of infecting vertebrates) contain 88 genes which are present in all the virus species characterised to date and which mostly occur in the same order and orientation. In contrast, however, the terminal regions of the genomes frequently contain genes that are species or genera-specific and that are not essential for the growth of the virus in vitro but instead often encode factors with important roles in vivo including modulation of the host immune response to infection and determination of the host range of the virus. The Parapoxviruses (PPV), of which Orf virus is the prototypic species, represent a genus within the chordopoxvirus subfamily of Poxviridae and are characterised by their ability to infect ruminants and humans. The genus currently contains four recognised species of virus, bovine papular stomatitis virus (BPSV) and pseudocowpox virus (PCPV) both of which infect cattle, orf virus (OV) that infects sheep and goats, and parapoxvirus of red deer in New Zealand (PVNZ). The ORFV genome has been fully sequenced, as has that of BPSV, and is ~138 kb in length encoding ~132 genes. The vast majority of these genes allow the virus to replicate in the cytoplasm of the infected host cell and therefore encode proteins involved in replication, transcription and metabolism of nucleic acids. These genes are well conserved between all known genera of poxviruses. There is however another class of genes, located at either end of the linear dsDNA genome, that encode proteins which are non-essential for replication and generally dictate host range and virulence of the virus. The non-essential genes are often the most variable within and between species of virus and therefore are potentially useful for diagnostic purposes. Given their role in subverting the host-immune response to infection they are also targets for novel therapeutics. The function of only a relatively small number of these proteins has been elucidated and there are several genes whose function still remains obscure principally because there is little similarity between them and proteins of known function in current sequence databases. It is thought that by selectively removing some of the virulence genes, or at least neutralising the proteins in some way, current vaccines could be improved. The evolution of poxviruses has been proposed to be an adaptive process involving frequent events of gene gain and loss, such that the virus co-evolves with its specific host. Gene capture or horizontal gene transfer from the host to the virus is considered an important source of new viral genes including those likely to be involved in host range and those enabling the virus to interfere with the host immune response to infection. Given the low rate of nucleotide substitution, recombination can be seen as an essential evolutionary driving force although it is likely underestimated. Recombination in poxviruses is intimately linked to DNA replication with both viral and cellular proteins participate in this recombination-dependent replication. It has been shown, in other poxvirus genera, that recombination between isolates and perhaps even between species does occur, thereby providing another mechanism for the acquisition of new genes and for the rapid evolution of viruses. Such events may result in viruses that have a selective advantage over others, for example in re-infections (a characteristic of the PPV), or in viruses that are able to jump the species barrier and infect new hosts. Sequence data related to viral strains isolated from goats suggest that possible recombination events may have occurred between OV and PCPV (Ueda et al. 2003). The recombination events are frequent during poxvirus replication and comparative genomic analysis of several poxvirus species has revealed that recombinations occur frequently on the right terminal region. Intraspecific recombination can occur between strains of the same PPV species, but also interspecific recombination can happen depending on enough sequence similarity to enable recombination between distinct PPV species. The most important pre-requisite for a successful recombination is the coinfection of the individual host by different virus strains or species. Consequently, the following factors affecting the distribution of different viruses to shared target cells need to be considered: dose of inoculated virus, time interval between inoculation of the first and the second virus, distance between the marker mutations, genetic homology. At present there are no available data on the replication dynamics of PPV in permissive and non permissive hosts and reguarding co-infetions there are no information on the interference mechanisms occurring during the simultaneous replication of viruses of different species. This work has been carried out to set up permissive substrates allowing the replication of different PPV species, in particular keratinocytes monolayers and organotypic skin cultures. Furthermore a method to isolate and expand ovine skin stem cells was has been set up to indeep further aspects of viral cellular tropism during natural infection. The study produced important data to elucidate the replication dynamics of OV and PCPV virus in vitro as well as the mechanisms of interference that can arise during co-infection with different viral species. Moreover, the analysis carried on the genomic right terminal region of PCPV 1303/05 contributed to a better knowledge of the viral genes involved in host interaction and pathogenesis as well as to locate recombination breakpoints and genetic homologies between PPV species. Taken together these data filled several crucial gaps for the study of interspecific recombinations of PPVs which are thought to be important for a better understanding of the viral evolution and to improve the biosafety of antiviral therapy and PPV-based vectors.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Hämocyanine sind große, multimere Sauerstofftransport- proteine, die frei gelöst in der Hämolymphe von Arthropoden und Mollusken vorkommen.Zur Charakterisierung verschiedener Arthropoden-hämocyanine wurden deren molare Massen bestimmt. Die mit einer Vielwinkel-Laser-Lichtstreuapparatur ermittelten Molekulargewichte zeigten eine grosse Schwankungsbreite. Dies konnte auf Ungenauigkeiten der zur Berechnung der Molekulargewichte verwendeten spezifischen Extinktions- koeffizienten und Brechungsindex-Inkremente zurückgeführt werden.Mit der Methode der Massenspektrometrie (MALDI-TOF) bestimmte Molekulargewichte einzelner Untereinheiten des Hämocyanins der Vogelspinne Eurypelma californicum zeigten eine sehr gute Übereinstimmung mit aus der Sequenz errechneten Werten.Für das 24-mere Spinnenhämocyanin von Eurypelma californicum wurde die Stabilität gegenüber GdnHCl und der Temperatur auf den verschiedenen strukturellen Ebenen des Proteins untersucht.Viele Stabilitätsuntersuchungen werden an kleinen Proteinen durchgeführt, deren Entfaltung kooperativerfolgt. Bei größeren Proteinen mit unterschiedlichen strukturellen Bereichen (Domänen) ist der Entfaltungs-prozess weitaus komplexer. Ziel war es, durch die Denaturierung des Spinnen-Hämocyanins Erkenntnisse über die Stabilität und Entfaltung der verschiedenen strukturellen Ebenen eines so großen Proteinkomplexes zu gewinnen.Ein wichtiges Charakteristikum für die Interpretation der Entfaltungsexperimente ist die starke Löschung der Tryptophanfluoreszenz im oxygenierten Spinnen-Hämocyanin. Die Löschung kann vollständig durch Förster-Transfer erklärt werden kann. Sie bleibt auf die einzelnen Untereinheiten beschränkt und stellt somit ein reines O2-Beladungssignal dar.Unter Einwirkung von GdnHCl dissoziiert das native, 24-mere Spinnen-Hämocyanin ohne die Entstehung langlebiger Inter- mediate. Die Untereinheiten werden durch das Oligomer stabilisiert. Die Entfaltung eines Monomers, der Unter- einheit e, folgt einer Hierarchie der verschiedenen strukturellen Ebenen des Moleküls. Die Entfaltung beginnt zunächst von außen mit der Auflockerung der Tertiärstruktur. Der Kern von Domäne II mit dem aktiven Zentrum weist hingegen eine besondere Stabilität auf.Die ausgeprägte Hitzestabilität des Eurypelma-Hämocyanins hängt vom Oligomerisierungsgrad, dem verwendeten Puffer und dessen Ausgangs-pH-Wert ab und spiegelt offensichtlich die extremen Lebensbedingungen im Habitat wider.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

ABSTRACTDie vorliegende Arbeit befasste sich mit der Reinigung,heterologen Expression, Charakterisierung, molekularenAnalyse, Mutation und Kristallisation des EnzymsVinorin-Synthase. Das Enzym spielt eine wichtige Rolle inder Ajmalin-Biosynthese, da es in einerAcetyl-CoA-abhängigen Reaktion die Umwandlung desSarpagan-Alkaloids 16-epi-Vellosimin zu Vinorin unterBildung des Ajmalan-Grundgerüstes katalysiert. Nach der Reinigung der Vinorin-Synthase ausHybrid-Zellkulturen von Rauvolfia serpentina/Rhazya strictamit den fünf chromatographischen TrennmethodenAnionenaustauschchromatographie an SOURCE 30Q, HydrophobeInteraktionen Chromatographie an SOURCE 15PHE,Chromatographie an MacroPrep Ceramic Hydroxyapatit,Anionenaustauschchromatographie an Mono Q undGrößenausschlußchromatographie an Superdex 75 konnte dieVinorin-Synthase aus 2 kg Zellkulturgewebe 991fachangereichert werden.Das nach der Reinigung angefertigte SDS-Gel ermöglichte eineklare Zuordnung der Protein-Bande als Vinorin-Synthase.Der Verdau der Enzymbande mit der Endoproteinase LysC unddie darauffolgende Sequenzierung der Spaltpeptide führte zuvier Peptidsequenzen. Der Datenbankvergleich (SwissProt)zeigte keinerlei Homologien zu Sequenzen bekannterPflanzenenzyme. Mit degenerierten Primern, abgeleitet voneinem der erhaltenen Peptidfragmente und einer konserviertenRegion bekannter Acetyltransferasen gelang es, ein erstescDNA-Fragment der Vinorin-Synthase zu amplifizieren. Mit derMethode der RACE-PCR wurde die Nukleoidsequenzvervollständigt, was zu einem cDNA-Vollängenklon mit einerGröße von 1263 bp führte, der für ein Protein mit 421Aminosäuren (46 kDa) codiert.Das Vinorin-Synthase-Gen wurde in den pQE2-Expressionsvektorligiert, der für einen N-terminalen 6-fachen His-tagcodiert. Anschließend wurde sie erstmals erfolgreich in E.coli im mg-Maßstab exprimiert und bis zur Homogenitätgereinigt. Durch die erfolgreiche Überexpression konnte dieVinorin-Synthase eingehend charakterisiert werden. DerKM-Wert für das Substrat Gardneral wurde mit 20 µM, bzw.41.2 µM bestimmt und Vmax betrug 1 pkat, bzw. 1.71 pkat.Nach erfolgreicher Abspaltung des His-tags wurden diekinetischen Parameter erneut bestimmt (KM- Wert 7.5 µM, bzw.27.52 µM, Vmax 0.7 pkat, bzw. 1.21 pkat). Das Co-Substratzeigt einen KM- Wert von 60.5 µM (Vmax 0.6 pkat). DieVinorin-Synthase besitzt ein Temperatur-Optimum von 35 °Cund ein pH-Optimum bei 7.8.Homologievergleiche mit anderen Enzymen zeigten, dass dieVinorin-Synthase zu einer noch kleinen Familie von bisher 10Acetyltransferasen gehört. Alle Enzyme der Familie haben einHxxxD und ein DFGWG-Motiv zu 100 % konserviert. Basierendauf diesen Homologievergleichen und Inhibitorstudien wurden11 in dieser Proteinfamilie konservierte Aminosäuren gegenAlanin ausgetauscht, um so die Aminosäuren einer in derLiteratur postulierten katalytischen Triade(Ser/Cys-His-Asp) zu identifizieren.Die Mutation aller vorhandenen konservierten Serine undCysteine resultierte in keiner Mutante, die zumvollständigen Aktivitätsverlust des Enzyms führte. Nur dieMutationen H160A und D164A resultierten in einemvollständigen Aktivitätsverlust des Enzyms. Dieses Ergebniswiderlegt die Theorie einer katalytischen Triade und zeigte,dass die Aminosäuren H160A und D164A exklusiv an derkatalytischen Reaktion beteiligt sind.Zur Überprüfung dieser Ergebnisse und zur vollständigenAufklärung des Reaktionsmechanismus wurde dieVinorin-Synthase kristallisiert. Die bis jetzt erhaltenenKristalle (Kristallgröße in µm x: 150, y: 200, z: 200)gehören der Raumgruppe P212121 (orthorhombisch primitiv) anund beugen bis 3.3 Å. Da es bis jetzt keine Kristallstruktureines zur Vinorin-Synthase homologen Proteins gibt, konntedie Struktur noch nicht vollständig aufgeklärt werden. ZurLösung des Phasenproblems wird mit der Methode der multiplenanomalen Dispersion (MAD) jetzt versucht, die ersteKristallstruktur in dieser Enzymfamilie aufzuklären.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Different types of proteins exist with diverse functions that are essential for living organisms. An important class of proteins is represented by transmembrane proteins which are specifically designed to be inserted into biological membranes and devised to perform very important functions in the cell such as cell communication and active transport across the membrane. Transmembrane β-barrels (TMBBs) are a sub-class of membrane proteins largely under-represented in structure databases because of the extreme difficulty in experimental structure determination. For this reason, computational tools that are able to predict the structure of TMBBs are needed. In this thesis, two computational problems related to TMBBs were addressed: the detection of TMBBs in large datasets of proteins and the prediction of the topology of TMBB proteins. Firstly, a method for TMBB detection was presented based on a novel neural network framework for variable-length sequence classification. The proposed approach was validated on a non-redundant dataset of proteins. Furthermore, we carried-out genome-wide detection using the entire Escherichia coli proteome. In both experiments, the method significantly outperformed other existing state-of-the-art approaches, reaching very high PPV (92%) and MCC (0.82). Secondly, a method was also introduced for TMBB topology prediction. The proposed approach is based on grammatical modelling and probabilistic discriminative models for sequence data labeling. The method was evaluated using a newly generated dataset of 38 TMBB proteins obtained from high-resolution data in the PDB. Results have shown that the model is able to correctly predict topologies of 25 out of 38 protein chains in the dataset. When tested on previously released datasets, the performances of the proposed approach were measured as comparable or superior to the current state-of-the-art of TMBB topology prediction.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Erkrankungen des Skelettapparats wie beispielsweise die Osteoporose oder Arthrose gehören neben den Herz-Kreislauferkrankungen und Tumoren zu den Häufigsten Erkrankungen des Menschen. Ein besseres Verständnis der Bildung und des Erhalts von Knochen- oder Knorpelgewebe ist deshalb von besonderer Bedeutung. Viele bisherige Ansätze zur Identifizierung hierfür relevanter Gene, deren Produkte und Interaktionen beruhen auf der Untersuchung pathologischer Situationen. Daher ist die Funktion vieler Gene nur im Zusammenhang mit Krankheiten beschrieben. Untersuchungen, die die Genaktivität bei der Normalentwicklung von knochen- und knorpelbildenden Geweben zum Ziel haben, sind dagegen weit weniger oft durchgeführt worden. rnEines der entwicklungsphysiologisch interessantesten Gewebe ist die Epiphysenfuge der Röhrenknochen. In dieser sogenannten Wachstumsfuge ist insbesondere beim fötalen Gewebe eine sehr hohe Aktivität derjenigen Gene zu erwarten, die an der Knochen- und Knorpelbildung beteiligt sind. In der vorliegenden Arbeit wurde daher aus der Epiphysenfuge von Kälberknochen RNA isoliert und eine cDNA-Bibliothek konstruiert. Von dieser wurden ca. 4000 Klone im Rahmen eines klassischen EST-Projekts sequenziert. Durch die Analyse konnte ein ungefähr 900 Gene umfassendes Expressionsprofil erstellt werden und viele Transkripte für Komponenten der regulatorischen und strukturbildenden Bestandteile der Knochen- und Knorpelentwicklung identifiziert werden. Neben den typischen Genen für Komponenten der Knochenentwicklung sind auch deutlich Bestandteile für embryonale Entwicklungsprozesse vertreten. Zu ersten gehören in erster Linie die Kollagene, allen voran Kollagen II alpha 1, das mit Abstand höchst exprimierte Gen in der fötalen Wachstumsfuge. Nach den ribosomalen Proteinen stellen die Kollagene mit ca. 10 % aller auswertbaren Sequenzen die zweitgrößte Gengruppe im erstellten Expressionsprofil dar. Proteoglykane und andere niedrig exprimierte regulatorische Elemente, wie Transkriptionsfaktoren, konnten im EST-Projekt aufgrund der geringen Abdeckung nur in sehr geringer Kopienzahl gefunden werden. Allerdings förderte die EST-Analyse mehrere interessante, bisher nicht bekannte Transkripte zutage, die detaillierter untersucht wurden. Dazu gehören Transkripte die, die dem LOC618319 zugeordnet werden konnten. Neben den bisher beschriebenen drei Exonbereichen konnte ein weiteres Exon im 3‘-UTR identifiziert werden. Im abgeleiteten Protein, das mindestens 121 AS lang ist, wurden ein Signalpeptid und eine Transmembrandomäne nachgewiesen. In Verbindung mit einer möglichen Glykosylierung ist das Genprodukt in die Gruppe der Proteoglykane einzuordnen. Leicht abweichend von den typischen Strukturen knochen- und knorpelspezifischer Proteoglykane ist eine mögliche Funktion dieses Genprodukts bei der Interaktion mit Integrinen und der Zell-Zellinteraktion, aber auch bei der Signaltransduktion denkbar. rnDie EST-Sequenzierungen von ca. 4000 cDNA-Klonen können aber in der Regel nur einen Bruchteil der möglichen Transkripte des untersuchten Gewebes abdecken. Mit den neuen Sequenziertechnologien des „Next Generation Sequencing“ bestehen völlig neue Möglichkeiten, komplette Transkriptome mit sehr hoher Abdeckung zu sequenzieren und zu analysieren. Zur Unterstützung der EST-Daten und zur deutlichen Verbreiterung der Datenbasis wurde das Transkriptom der bovinen fötalen Wachstumsfuge sowohl mit Hilfe der Roche-454/FLX- als auch der Illumina-Solexa-Technologie sequenziert. Bei der Auswertung der ca. 40000 454- und 75 Millionen Illumina-Sequenzen wurden Verfahren zur allgemeinen Handhabung, der Qualitätskontrolle, dem „Clustern“, der Annotation und quantitativen Auswertung von großen Mengen an Sequenzdaten etabliert. Beim Vergleich der Hochdurchsatz Blast-Analysen im klassischen „Read-Count“-Ansatz mit dem erstellten EST-Expressionsprofil konnten gute Überstimmungen gezeigt werden. Abweichungen zwischen den einzelnen Methoden konnten nicht in allen Fällen methodisch erklärt werden. In einigen Fällen sind Korrelationen zwischen Transkriptlänge und „Read“-Verteilung zu erkennen. Obwohl schon simple Methoden wie die Normierung auf RPKM („reads per kilo base transkript per million mappable reads“) eine Verbesserung der Interpretation ermöglichen, konnten messtechnisch durch die Art der Sequenzierung bedingte systematische Fehler nicht immer ausgeräumt werden. Besonders wichtig ist daher die geeignete Normalisierung der Daten beim Vergleich verschieden generierter Datensätze. rnDie hier diskutierten Ergebnisse aus den verschiedenen Analysen zeigen die neuen Sequenziertechnologien als gute Ergänzung und potentiellen Ersatz für etablierte Methoden zur Genexpressionsanalyse.rn

Relevância:

80.00% 80.00%

Publicador:

Resumo:

The intestinal ecosystem is formed by a complex, yet highly characteristic microbial community. The parameters defining whether this community permits invasion of a new bacterial species are unclear. In particular, inhibition of enteropathogen infection by the gut microbiota ( = colonization resistance) is poorly understood. To analyze the mechanisms of microbiota-mediated protection from Salmonella enterica induced enterocolitis, we used a mouse infection model and large scale high-throughput pyrosequencing. In contrast to conventional mice (CON), mice with a gut microbiota of low complexity (LCM) were highly susceptible to S. enterica induced colonization and enterocolitis. Colonization resistance was partially restored in LCM-animals by co-housing with conventional mice for 21 days (LCM(con21)). 16S rRNA sequence analysis comparing LCM, LCM(con21) and CON gut microbiota revealed that gut microbiota complexity increased upon conventionalization and correlated with increased resistance to S. enterica infection. Comparative microbiota analysis of mice with varying degrees of colonization resistance allowed us to identify intestinal ecosystem characteristics associated with susceptibility to S. enterica infection. Moreover, this system enabled us to gain further insights into the general principles of gut ecosystem invasion by non-pathogenic, commensal bacteria. Mice harboring high commensal E. coli densities were more susceptible to S. enterica induced gut inflammation. Similarly, mice with high titers of Lactobacilli were more efficiently colonized by a commensal Lactobacillus reuteri(RR) strain after oral inoculation. Upon examination of 16S rRNA sequence data from 9 CON mice we found that closely related phylotypes generally display significantly correlated abundances (co-occurrence), more so than distantly related phylotypes. Thus, in essence, the presence of closely related species can increase the chance of invasion of newly incoming species into the gut ecosystem. We provide evidence that this principle might be of general validity for invasion of bacteria in preformed gut ecosystems. This might be of relevance for human enteropathogen infections as well as therapeutic use of probiotic commensal bacteria.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

The Alpine lake whitefish (Coregonus lavaretus) species complex is a classic example of a recent radiation, associated with colonization of the Alpine lakes following the glacial retreat (less than 15 kyr BP). They have formed a unique array of endemic lake flocks, each with one to six described sympatric species differing in morphology, diet and reproductive ecology. Here, we present a genomic investigation of the relationships between and within the lake flocks. Comparing the signal between over 1000 AFLP loci and mitochondrial control region sequence data, we use phylogenetic tree-based and population genetic methods to reconstruct the phylogenetic history of the group and to delineate the principal centres of genetic diversity within the radiation. We find significant cytonuclear discordance showing that the genomically monophyletic Alpine whitefish clade arose from a hybrid swarm of at least two glacial refugial lineages. Within this radiation, we find seven extant genetic clusters centred on seven lake systems. Most interestingly, we find evidence of sympatric speciation within and parallel evolution of equivalent phenotypes among these lake systems. However, we also find the genetic signature of human-mediated gene flow and diversity loss within many lakes, highlighting the fragility of recent radiations.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Background The estimation of demographic parameters from genetic data often requires the computation of likelihoods. However, the likelihood function is computationally intractable for many realistic evolutionary models, and the use of Bayesian inference has therefore been limited to very simple models. The situation changed recently with the advent of Approximate Bayesian Computation (ABC) algorithms allowing one to obtain parameter posterior distributions based on simulations not requiring likelihood computations. Results Here we present ABCtoolbox, a series of open source programs to perform Approximate Bayesian Computations (ABC). It implements various ABC algorithms including rejection sampling, MCMC without likelihood, a Particle-based sampler and ABC-GLM. ABCtoolbox is bundled with, but not limited to, a program that allows parameter inference in a population genetics context and the simultaneous use of different types of markers with different ploidy levels. In addition, ABCtoolbox can also interact with most simulation and summary statistics computation programs. The usability of the ABCtoolbox is demonstrated by inferring the evolutionary history of two evolutionary lineages of Microtus arvalis. Using nuclear microsatellites and mitochondrial sequence data in the same estimation procedure enabled us to infer sex-specific population sizes and migration rates and to find that males show smaller population sizes but much higher levels of migration than females. Conclusion ABCtoolbox allows a user to perform all the necessary steps of a full ABC analysis, from parameter sampling from prior distributions, data simulations, computation of summary statistics, estimation of posterior distributions, model choice, validation of the estimation procedure, and visualization of the results.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Different codons encoding the same amino acid are not used equally in protein-coding sequences. In bacteria, there is a bias towards codons with high translation rates. This bias is most pronounced in highly expressed proteins, but a recent study of synthetic GFP-coding sequences did not find a correlation between codon usage and GFP expression, suggesting that such correlation in natural sequences is not a simple property of translational mechanisms. Here, we investigate the effect of evolutionary forces on codon usage. The relation between codon bias and protein abundance is quantitatively analyzed based on the hypothesis that codon bias evolved to ensure the efficient usage of ribosomes, a precious commodity for fast growing cells. An explicit fitness landscape is formulated based on bacterial growth laws to relate protein abundance and ribosomal load. The model leads to a quantitative relation between codon bias and protein abundance, which accounts for a substantial part of the observed bias for E. coli. Moreover, by providing an evolutionary link, the ribosome load model resolves the apparent conflict between the observed relation of protein abundance and codon bias in natural sequences and the lack of such dependence in a synthetic gfp library. Finally, we show that the relation between codon usage and protein abundance can be used to predict protein abundance from genomic sequence data alone without adjustable parameters.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Different codons encoding the same amino acid are not used equally in protein-coding sequences. In bacteria, there is a bias towards codons with high translation rates. This bias is most pronounced in highly expressed proteins, but a recent study of synthetic GFP-coding sequences did not find a correlation between codon usage and GFP expression, suggesting that such correlation in natural sequences is not a simple property of translational mechanisms. Here, we investigate the effect of evolutionary forces on codon usage. The relation between codon bias and protein abundance is quantitatively analyzed based on the hypothesis that codon bias evolved to ensure the efficient usage of ribosomes, a precious commodity for fast growing cells. An explicit fitness landscape is formulated based on bacterial growth laws to relate protein abundance and ribosomal load. The model leads to a quantitative relation between codon bias and protein abundance, which accounts for a substantial part of the observed bias for E. coli. Moreover, by providing an evolutionary link, the ribosome load model resolves the apparent conflict between the observed relation of protein abundance and codon bias in natural sequences and the lack of such dependence in a synthetic gfp library. Finally, we show that the relation between codon usage and protein abundance can be used to predict protein abundance from genomic sequence data alone without adjustable parameters.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

BACKGROUND: Production of native antigens for serodiagnosis of helminthic infections is laborious and hampered by batch-to-batch variation. For serodiagnosis of echinococcosis, especially cystic disease, most screening tests rely on crude or purified Echinococcus granulosus hydatid cyst fluid. To resolve limitations associated with native antigens in serological tests, the use of standardized and highly pure antigens produced by chemical synthesis offers considerable advantages, provided appropriate diagnostic sensitivity and specificity is achieved. METHODOLOGY/PRINCIPAL FINDINGS: Making use of the growing collection of genomic and proteomic data, we applied a set of bioinformatic selection criteria to a collection of protein sequences including conceptually translated nucleotide sequence data of two related tapeworms, Echinococcus multilocularis and Echinococcus granulosus. Our approach targeted alpha-helical coiled-coils and intrinsically unstructured regions of parasite proteins potentially exposed to the host immune system. From 6 proteins of E. multilocularis and 5 proteins of E. granulosus, 45 peptides between 24 and 30 amino acids in length were designed. These peptides were chemically synthesized, spotted on microarrays and screened for reactivity with sera from infected humans. Peptides reacting above the cut-off were validated in enzyme-linked immunosorbent assays (ELISA). Peptides identified failed to differentiate between E. multilocularis and E. granulosus infection. The peptide performing best reached 57% sensitivity and 94% specificity. This candidate derived from Echinococcus multilocularis antigen B8/1 and showed strong reactivity to sera from patients infected either with E. multilocularis or E. granulosus. CONCLUSIONS/SIGNIFICANCE: This study provides proof of principle for the discovery of diagnostically relevant peptides by bioinformatic selection complemented with screening on a high-throughput microarray platform. Our data showed that a single peptide cannot provide sufficient diagnostic sensitivity whereas pooling several peptide antigens improved sensitivity; thus combinations of several peptides may lead the way to new diagnostic tests that replace, or at least complement conventional immunodiagnosis of echinococcosis. Our strategy could prove useful for diagnostic developments in other pathogens.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Fungi are important members of soil microbial communities with a crucial role in biogeochemical processes. Although soil fungi are known to be highly diverse, little is known about factors influencing variations in their diversity and community structure among forests dominated by the same tree species but spread over different regions and under different managements. We analyzed the soil fungal diversity and community composition of managed and unmanaged European beech dominated forests located in three German regions, the Schwäbische Alb in Southwestern, the Hainich-Dün in Central and the Schorfheide Chorin in the Northeastern Germany, using internal transcribed spacer (ITS) rDNA pyrotag sequencing. Multiple sequence quality filtering followed by sequence data normalization revealed 1655 fungal operational taxonomic units. Further analysis based on 722 abundant fungal OTUs revealed the phylum Basidiomycota to be dominant (54%) and its community to comprise 71.4% of ectomycorrhizal taxa. Fungal community structure differed significantly (p≤0.001) among the three regions and was characterized by non-random fungal OTUs co-occurrence. Soil parameters, herbaceous understory vegetation, and litter cover affected fungal community structure. However, within each study region we found no difference in fungal community structure between management types. Our results also showed region specific significant correlation patterns between the dominant ectomycorrhizal fungal genera. This suggests that soil fungal communities are region-specific but nevertheless composed of functionally diverse and complementary taxa.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Alboluxin, a potent platelet activator, was purified from Trimeresurus albolabris venom with a mass of 120 kDa non-reduced and, after reduction, subunits of 17 and 24 kDa. Alboluxin induced a tyrosine phosphorylation profile in platelets that resembles those produced by collagen and convulxin, involving the time dependent tyrosine phosphorylation of Fc receptor gamma chain (Fc gamma), phospholipase Cgamma2 (PLCgamma2), LAT and p72SYK. Antibodies against both GPIb and GPVI inhibited platelet aggregation induced by alboluxin, whereas antibodies against alpha2beta1 had no effect. Inhibition of alphaIIb beta3 reduced the aggregation response to alboluxin, as well as tyrosine phosphorylation of platelet proteins, showing that activation of alphaIIb beta3 and binding of fibrinogen are involved in alboluxin-induced platelet aggregation and it is not simply agglutination. N-terminal sequence data from the beta-subunit of alboluxin indicates that it belongs to the snake C-type lectin family. The C-type lectin subunits are larger than usual possibly due to post-translational modifications such as glycosylation. Alboluxin is a hexameric (alphabeta)3 snake C-type lectin which activates platelets via both GPIb and GPVI.