939 resultados para Molecular Sequence Annotation


Relevância:

50.00% 50.00%

Publicador:

Resumo:

Thèse numérisée par la Direction des bibliothèques de l'Université de Montréal.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

Chlamydia pecorum is a significant pathogen of domestic livestock and wildlife. We have developed a C. pecorum-specific multilocus sequence analysis (MLSA) scheme to examine the genetic diversity of and relationships between Australian sheep, cattle, and koala isolates. An MLSA of seven concatenated housekeeping gene fragments was performed using 35 isolates, including 18 livestock isolates (11 Australian sheep, one Australian cow, and six U.S. livestock isolates) and 17 Australian koala isolates. Phylogenetic analyses showed that the koala isolates formed a distinct clade, with limited clustering with C. pecorum isolates from Australian sheep. We identified 11 MLSA sequence types (STs) among Australian C. pecorum isolates, 10 of them novel, with koala and sheep sharing at least one identical ST (designated ST2013Aa). ST23, previously identified in global C. pecorum livestock isolates, was observed here in a subset of Australian bovine and sheep isolates. Most notably, ST23 was found in association with multiple disease states and hosts, providing insights into the transmission of this pathogen between livestock hosts. The complexity of the epidemiology of this disease was further highlighted by the observation that at least two examples of sheep were infected with different C. pecorum STs in the eyes and gastrointestinal tract. We have demonstrated the feasibility of our MLSA scheme for understanding the host relationship that exists between Australian C. pecorum strains and provide the first molecular epidemiological data on infections in Australian livestock hosts.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

Background The koala, Phascolarctos cinereus, is a biologically unique and evolutionarily distinct Australian arboreal marsupial. The goal of this study was to sequence the transcriptome from several tissues of two geographically separate koalas, and to create the first comprehensive catalog of annotated transcripts for this species, enabling detailed analysis of the unique attributes of this threatened native marsupial, including infection by the koala retrovirus. Results RNA-Seq data was generated from a range of tissues from one male and one female koala and assembled de novo into transcripts using Velvet-Oases. Transcript abundance in each tissue was estimated. Transcripts were searched for likely protein-coding regions and a non-redundant set of 117,563 putative protein sequences was produced. In similarity searches there were 84,907 (72%) sequences that aligned to at least one sequence in the NCBI nr protein database. The best alignments were to sequences from other marsupials. After applying a reciprocal best hit requirement of koala sequences to those from tammar wallaby, Tasmanian devil and the gray short-tailed opossum, we estimate that our transcriptome dataset represents approximately 15,000 koala genes. The marsupial alignment information was used to look for potential gene duplications and we report evidence for copy number expansion of the alpha amylase gene, and of an aldehyde reductase gene. Koala retrovirus (KoRV) transcripts were detected in the transcriptomes. These were analysed in detail and the structure of the spliced envelope gene transcript was determined. There was appreciable sequence diversity within KoRV, with 233 sites in the KoRV genome showing small insertions/deletions or single nucleotide polymorphisms. Both koalas had sequences from the KoRV-A subtype, but the male koala transcriptome has, in addition, sequences more closely related to the KoRV-B subtype. This is the first report of a KoRV-B-like sequence in a wild population. Conclusions This transcriptomic dataset is a useful resource for molecular genetic studies of the koala, for evolutionary genetic studies of marsupials, for validation and annotation of the koala genome sequence, and for investigation of koala retrovirus. Annotated transcripts can be browsed and queried at http://koalagenome.org

Relevância:

40.00% 40.00%

Publicador:

Resumo:

Using DNA markers in plant breeding with marker-assisted selection (MAS) could greatly improve the precision and efficiency of selection, leading to the accelerated development of new crop varieties. The numerous examples of MAS in rice have prompted many breeding institutes to establish molecular breeding labs. The last decade has produced an enormous amount of genomics research in rice, including the identification of thousands of QTLs for agronomically important traits, the generation of large amounts of gene expression data, and cloning and characterization of new genes, including the detection of single nucleotide polymorphisms. The pinnacle of genomics research has been the completion and annotation of genome sequences for indica and japonica rice. This information-coupled with the development of new genotyping methodologies and platforms, and the development of bioinformatics databases and software tools-provides even more exciting opportunities for rice molecular breeding in the 21st century. However, the great challenge for molecular breeders is to apply genomics data in actual breeding programs. Here, we review the current status of MAS in rice, current genomics projects and promising new genotyping methodologies, and evaluate the probable impact of genomics research. We also identify critical research areas to "bridge the application gap" between QTL identification and applied breeding that need to be addressed to realize the full potential of MAS, and propose ideas and guidelines for establishing rice molecular breeding labs in the postgenome sequence era to integrate molecular breeding within the context of overall rice breeding and research programs.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

Fusarium oxysporum f. sp. cubense (Foc), causal agent of fusarium wilt of banana, is among the most destructive pathogens of banana and plantain. The development of a molecular diagnostic capable of reliably distinguishing between the various races of the pathogen is of key importance to disease management. However, attempts to distinguish isolates using the standard molecular loci typically used for fungal phylogenetics have been complicated by a poor correlation between phylogeny and pathogenicity. Among the available alternative loci are several putative effector genes, known as SIX genes, which have been successfully used to differentiate the three races of F. oxysporum f. sp. lycopersici. In this study, an international collection of Foc isolates was screened for the presence of the putative effector SIX8. Using a PCR and sequencing approach, variation in Foc-SIX8 was identified which allowed race 4 to be differentiated from race 1 and 2 isolates, and tropical and subtropical race 4 isolates to be distinguished from one another.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

Based upon a stereochemical guideline, two topologically distinct types of helicalduplexes have been deduced for a polynucleotide duplex with alternating purine pyrimidine sequence (PAPP): (a) right-handed uniform (RU) helix and (b) left-handed zig-zag (LZ) helix. Both structures have trinucleoside diphosphate as the basic unit wherein the purine pyrimidine fragment has a different conformation from the pyrimidine-purine fragment. Thus, RU and LZ helices represent two different classes of sequence-dependent molecular conformations for PAPP. The conformationalf eatures of an RU helix of PAPP in B-form and three LZ-helices for B-, D- and Z-forms are discussed.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

Visual pigments of different animal species must have evolved at some stage to match the prevailing light environments, since all visual functions depend on their ability to absorb available photons and transduce the event into a reliable neural signal. There is a large literature on correlation between the light environment and spectral sensitivity between different fish species. However, little work has been done on evolutionary adaptation between separated populations within species. More generally, little is known about the rate of evolutionary adaptation to changing spectral environments. The objective of this thesis is to illuminate the constraints under which the evolutionary tuning of visual pigments works as evident in: scope, tempo, available molecular routes, and signal/noise trade-offs. Aquatic environments offer Nature s own laboratories for research on visual pigment properties, as naturally occurring light environments offer an enormous range of variation in both spectral composition and intensity. The present thesis focuses on the visual pigments that serve dim-light vision in two groups of model species, teleost fishes and mysid crustaceans. The geographical emphasis is in the brackish Baltic Sea area with its well-known postglacial isolation history and its aquatic fauna of both marine and fresh-water origin. The absorbance spectrum of the (single) dim-light visual pigment were recorded by microspectrophotometry (MSP) in single rods of 26 fish species and single rhabdoms of 8 opossum shrimp populations of the genus Mysis inhabiting marine, brackish or freshwater environments. Additionally, spectral sensitivity was determined from six Mysis populations by electroretinogram (ERG) recording. The rod opsin gene was sequenced in individuals of four allopatric populations of the sand goby (Pomatoschistus minutus). Rod opsins of two other goby species were investigated as outgroups for comparison. Rod absorbance spectra of the Baltic subspecies or populations of the primarily marine species herring (Clupea harengus membras), sand goby (P. minutus), and flounder (Platichthys flesus) were long-wavelength-shifted compared to their marine populations. The spectral shifts are consistent with adaptation for improved quantum catch (QC) as well as improved signal-to-noise ratio (SNR) of vision in the Baltic light environment. Since the chromophore of the pigment was pure A1 in all cases, this has apparently been achieved by evolutionary tuning of the opsin visual pigment. By contrast, no opsin-based differences were evident between lake and sea populations of species of fresh-water origin, which can tune their pigment by varying chromophore ratios. A more detailed analysis of differences in absorbance spectra and opsin sequence between and within populations was conducted using the sand goby as model species. Four allopatric populations from the Baltic Sea (B), Swedish west coast (S), English Channel (E), and Adriatic Sea (A) were examined. Rod absorbance spectra, characterized by the wavelength of maximum absorbance (λmax), differed between populations and correlated with differences in the spectral light transmission of the respective water bodies. The greatest λmax shift as well as the greatest opsin sequence difference was between the Baltic and the Adriatic populations. The significant within-population variation of the Baltic λmax values (506-511 nm) was analyzed on the level of individuals and was shown to correlate well with opsin sequence substitutions. The sequences of individuals with λmax at shorter wavelengths were identical to that of the Swedish population, whereas those with λmax at longer wavelengths additionally had substitution F261F/Y in the sixth transmembrane helix of the protein. This substitution (Y261) was also present in the Baltic common gobies and is known to redshift spectra. The tuning mechanism of the long-wavelength type Baltic sand gobies is assumed to be the co-expression of F261 and Y261 in all rods to produce ≈ 5 nm redshift. The polymorphism of the Baltic sand goby population possibly indicates ambiguous selection pressures in the Baltic Sea. The visual pigments of all lake populations of the opossum shrimp (Mysis relicta) were red-shifted by 25 nm compared with all Baltic Sea populations. This is calculated to confer a significant advantage in both QC and SNR in many humus-rich lakes with reddish water. Since only A2 chromophore was present, the differences obviously reflect evolutionary tuning of the visual protein, the opsin. The changes have occurred within the ca. 9000 years that the lakes have been isolated from the Sea after the most recent glaciation. At present, it seems that the mechanism explaining the spectral differences between lake and sea populations is not an amino acid substitution at any other conventional tuning site, but the mechanism is yet to be found.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

The rapid increase in genome sequence information has necessitated the annotation of their functional elements, particularly those occurring in the non-coding regions, in the genomic context. Promoter region is the key regulatory region, which enables the gene to be transcribed or repressed, but it is difficult to determine experimentally. Hence an in silico identification of promoters is crucial in order to guide experimental work and to pin point the key region that controls the transcription initiation of a gene. In this analysis, we demonstrate that while the promoter regions are in general less stable than the flanking regions, their average free energy varies depending on the GC composition of the flanking genomic sequence. We have therefore obtained a set of free energy threshold values, for genomic DNA with varying GC content and used them as generic criteria for predicting promoter regions in several microbial genomes, using an in-house developed tool `PromPredict'. On applying it to predict promoter regions corresponding to the 1144 and 612 experimentally validated TSSs in E. coli (50.8% GC) and B. subtilis (43.5% GC) sensitivity of 99% and 95% and precision values of 58% and 60%, respectively, were achieved. For the limited data set of 81 TSSs available for M. tuberculosis (65.6% GC) a sensitivity of 100% and precision of 49% was obtained.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

Recent experimental studies have shown that the Rec-A mediated homologous recombination reaction involves a triple helical intermediate, in which the third strand base forms hydrogen bonds with both the bases in the major groove of the Watson-Crick duplex. Such 'mixed' hydrogen bonds allow formation of sequence independent triplexes. DNA triple helices involving 'mixed' hydrogen bonds have been studied, using model building, molecular mechanics (MM) and molecular dynamics (MD). Models were built for a tripler comprising all four possible triplets viz., G.C*C, C.G*G, A.T*T and T.A*A. To check the stability of all the 'mixed' hydrogen bonds in such triplexes and the conformational preferences of such tripler structures, MD studies were carried out starting from two structures with 30 degrees and 36 degrees twist between the basepairs. It was observed that though the two triplexes converged towards a similar structure, the various hydrogen bonds between the WC duplex and the third strand showed differential stabilities. An MD simulation with restrained hydrogen bonds showed that the resulting structure was stable and remained close to the starting structure. These studies help us in defining stable hydrogen bond geometries involving the third strand and the WC duplex. It was observed that in the C.G*G triplets the N7 atom of the second strand is always involved in hydrogen bonding. In the G.C*C triplets, either N3 or O2 in the third strand cytosine can interchangeably act as a hydrogen bond acceptor.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

DNA triple helices containing two purine strands and one pyrimidine strand (C.G*G and T.A*A) have been studied, using model building followed by energy minimisation, for different orientations of the third strand resulting from variation in the hydrogen bonding between the Watson-Crick duplex and the third strand and the glycosidic torsion angle in the third strand. Our results show that in the C.G*G case the structure with a parallel orientation of the third strand, resulting from Hoogsteen hydrogen bonds between the third strand and the Watson-Crick duplex, is energetically the most favourable while in the T.A*A case the antiparallel orientation of the third strand, resulting from reverse Hoogsteen hydrogen bonds, is energetically the most favourable. These studies when extended to the mixed sequence triplexes, in which the second strand is a mixture of G and A, correspondingly the third strand is a mixture of G and APT, show that though the parallel orientation is still energetically more favourable, the antiparallel orientation becomes energetically comparable with an increasing number of thymines in the third strand. Structurally, for the mixed triplexes containing G and T in the third strand, it is seen that the basepair non-isomorphism between the C.G*G and the T.A*T triplets can be overcome with some changes in the base pair parameters without much distortion of either the backbone or the hydrogen bonds.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

Protein functional annotation relies on the identification of accurate relationships, sequence divergence being a key factor. This is especially evident when distant protein relationships are demonstrated only with three-dimensional structures. To address this challenge, we describe a computational approach to purposefully bridge gaps between related protein families through directed design of protein-like ``linker'' sequences. For this, we represented SCOP domain families, integrated with sequence homologues, as multiple profiles and performed HMM-HMM alignments between related domain families. Where convincing alignments were achieved, we applied a roulette wheel-based method to design 3,611,010 protein-like sequences corresponding to 374 SCOP folds. To analyze their ability to link proteins in homology searches, we used 3024 queries to search two databases, one containing only natural sequences and another one additionally containing designed sequences. Our results showed that augmented database searches showed up to 30% improvement in fold coverage for over 74% of the folds, with 52 folds achieving all theoretically possible connections. Although sequences could not be designed between some families, the availability of designed sequences between other families within the fold established the sequence continuum to demonstrate 373 difficult relationships. Ultimately, as a practical and realistic extension, we demonstrate that such protein-like sequences can be ``plugged-into'' routine and generic sequence database searches to empower not only remote homology detection but also fold recognition. Our richly statistically supported findings show that complementary searches in both databases will increase the effectiveness of sequence-based searches in recognizing all homologues sharing a common fold. (C) 2013 Elsevier Ltd. All rights reserved.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

The availability of the genome sequence of Mycobacterium tuberculosis H37Rv has encouraged determination of large numbers of protein structures and detailed definition of the biological information encoded therein; yet, the functions of many proteins in M. tuberculosis remain unknown. The emergence of multidrug resistant strains makes it a priority to exploit recent advances in homology recognition and structure prediction to re-analyse its gene products. Here we report the structural and functional characterization of gene products encoded in the M. tuberculosis genome, with the help of sensitive profile-based remote homology search and fold recognition algorithms resulting in an enhanced annotation of the proteome where 95% of the M. tuberculosis proteins were identified wholly or partly with information on structure or function. New information includes association of 244 proteins with 205 domain families and a separate set of new association of folds to 64 proteins. Extending structural information across uncharacterized protein families represented in the M. tuberculosis proteome, by determining superfamily relationships between families of known and unknown structures, has contributed to an enhancement in the knowledge of structural content. In retrospect, such superfamily relationships have facilitated recognition of probable structure and/or function for several uncharacterized protein families, eventually aiding recognition of probable functions for homologous proteins corresponding to such families. Gene products unique to mycobacteria for which no functions could be identified are 183. Of these 18 were determined to be M. tuberculosis specific. Such pathogen-specific proteins are speculated to harbour virulence factors required for pathogenesis. A re-annotated proteome of M. tuberculosis, with greater completeness of annotated proteins and domain assigned regions, provides a valuable basis for experimental endeavours designed to obtain a better understanding of pathogenesis and to accelerate the process of drug target discovery. (C) 2014 Elsevier Ltd. All rights reserved.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

Albacore and Atlantic Bluefin tuna are two pelagic fish. Atlantic Bluefin tuna is included in the IUCN red list of threatened species and albacore is considered to be near threatened, so conservation plans are needed. However, no genomic resources are available for any of them. In this study, to better understand their transcriptome we functionally annotated orthologous genes. In all, 159 SNPs distributed in 120 contigs of the muscle transcriptome were analyzed. Genes were predicted for 98 contigs (81.2%) using the bioinformatics tool BLAST. In addition, another bioinformatics tool, BLAST2GO was used in order to achieve GO terms for the genes, in which 41 sequences were given a biological process, and 39 sequences were given a molecular process. The most repeated biological process was metabolism and it is important that no cellular process was given in any of the sequences. The most abundant molecular process was binding and very few catalytic activity processes were given. From the initial 159 SNPs, 40 were aligned with a sequence in the database after BLAST2GO was run, and were polymorphic in Atlantic Bluefin tuna and monomorphic in albacore. From these 40 SNPs, 24 were located in an open reading frame of which four were non-synonymous and 20 were synonymous and 16 were not located in a known open reading frame,. This study provides information for better understanding the ecology and evolution of these species and this is important in order to establish a proper conservation plan and an appropriate management.