956 results for Genomic sequence database
Abstract:
Staphylococcus aureus is one of the most important causative agents of infectious mastitis in small ruminants. To determine the distribution of Staph. aureus strains associated with infectious mastitis in sheep flocks in the northeast of Brazil, and to establish whether these clones are related to strains distributed internationally, this study analysed the genetic diversity of Staph. aureus isolates from cases of clinical and subclinical mastitis in ewes by pulsed-field gel electrophoresis (PFGE) and multilocus sequence typing (MLST). In this research, 135 ewes with mastitis from 31 sheep flocks distributed in 15 districts were examined. Staph. aureus was isolated from sheep milk in 9 (29%) of the 31 flocks, located in 47% of the districts surveyed. MLST analysis identified four STs (ST750, ST1728, ST1729 and ST1730). The last three, with their respective novel alleles (glp-220, pta-182 and yqiL-180), were recently reported in the Staph. aureus MLST database (http://www.mlst.net). Each novel allele differed by a single nucleotide from those already described. The occurrence of CC133 (ST750 and ST1729) in this study agrees with other reports that only a few clones of Staph. aureus seem to be responsible for most cases of mastitis in dairy farms and that some of these clones may have a broad geographic distribution. However, the prevalence of CC5 (ST1728 and ST1730), an important group related to cases of colonization or infection in humans, differs from previous studies in its widespread occurrence and may suggest human contamination followed by selective pressures that led to the allelic diversification observed in these STs.
Abstract:
Enteropathogenic Escherichia coli (EPEC) infections are a leading cause of infantile diarrhea in developing nations. Multilocus sequence typing (MLST) characterizes bacterial strains based on the sequences of internal fragments in housekeeping genes. Little is known about strains of EPEC analyzed by MLST from Brazil. In this study, a diverse collection of 29 EPEC strains isolated from patients with diarrhea, admitted to the University Hospital of Ribeirao Preto, was characterized by MLST. Strain analysis demonstrated 22 different sequence types (STs), of which almost half (48%) were new, indicating a high genotype diversity. The 22 STs were divided by eBURST into 12 clonal complexes. It was not possible to correlate typical and atypical EPEC with other strains in the MLST database. This is the first study that analyzed EPEC strains from South America that are included in the E. coli MLST database. Nine (31%) out of 29 strains are part of the CC10 clonal complex, the major clonal complex in the database, which comprises 174 strains and 86 different STs, suggesting that these strains might be the most important intestinal pathogenic E. coli worldwide. Genetic relationships between typical and atypical EPEC, enterohemorrhagic E. coli, and enteroaggregative E. coli strains were not established by MLST.
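The MLST bookkeeping described above can be illustrated with a minimal sketch. In MLST, each distinct allele at a housekeeping locus receives a number, and the combination of allele numbers (the allelic profile) defines the sequence type. The function names, the seven-locus profile length, and every profile and ST number below are hypothetical placeholders, not data from the study or from the E. coli MLST database.

```python
from typing import Dict, Optional, Tuple

Profile = Tuple[int, ...]  # one allele number per housekeeping locus


def assign_st(profile: Profile, known: Dict[Profile, int]) -> Optional[int]:
    """Return the ST for an allelic profile, or None if the profile is new."""
    return known.get(profile)


def loci_differing(a: Profile, b: Profile) -> int:
    """Count loci at which two profiles carry different alleles."""
    if len(a) != len(b):
        raise ValueError("profiles must cover the same loci")
    return sum(1 for x, y in zip(a, b) if x != y)


# Hypothetical profiles and ST numbers, for illustration only.
KNOWN_STS: Dict[Profile, int] = {
    (1, 1, 1, 1, 1, 1, 1): 101,
    (1, 1, 1, 1, 1, 1, 2): 102,
}
```

A profile absent from the lookup table would be reported as a new ST, and counting the loci at which two profiles differ is the kind of comparison that clustering tools such as eBURST build on when grouping STs into clonal complexes.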
Abstract:
MHCPEP (http://wehih.wehi.edu.au/mhcpep/) is a curated database comprising over 13 000 peptide sequences known to bind MHC molecules. Entries are compiled from published reports as well as from direct submissions of experimental data. Each entry contains the peptide sequence, its MHC specificity and, where available, experimental method, observed activity, binding affinity, source protein and anchor positions, as well as publication references. The present format of the database allows text string matching searches but can easily be converted for use in conjunction with sequence analysis packages. The database can be accessed via Internet using WWW or FTP.
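As a rough illustration of the entry layout and text string matching described above, the sketch below models one record and a case-insensitive substring search. The class name, field names and example records are assumptions made for illustration; they do not reproduce the actual MHCPEP file format or any real entry.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class PeptideEntry:
    """One hypothetical record with the fields the abstract lists."""
    sequence: str                          # peptide sequence
    mhc_specificity: str                   # MHC molecule the peptide binds
    method: Optional[str] = None           # experimental method, if reported
    activity: Optional[str] = None         # observed activity, if reported
    binding_affinity: Optional[str] = None
    source_protein: Optional[str] = None
    anchor_positions: Optional[str] = None
    reference: Optional[str] = None        # publication reference


def search(entries: List[PeptideEntry], text: str) -> List[PeptideEntry]:
    """Return entries in which any populated field contains `text`."""
    needle = text.lower()
    hits = []
    for entry in entries:
        values = vars(entry).values()
        if any(needle in str(v).lower() for v in values if v is not None):
            hits.append(entry)
    return hits


# Made-up placeholder records, not real MHCPEP data.
EXAMPLE_ENTRIES = [
    PeptideEntry("AAAAAAAAA", "EXAMPLE-ALLELE-1", source_protein="placeholder protein A"),
    PeptideEntry("CCCCCCCCC", "EXAMPLE-ALLELE-2", binding_affinity="high"),
]
```

Optional fields default to `None` so that records lacking, say, a binding affinity can still be stored and searched.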
Abstract:
The nifH gene sequence of the nitrogen-fixing bacterium Acetobacter diazotrophicus was determined with the use of the polymerase chain reaction and universal degenerate oligonucleotide primers. The gene shows highest pair-wise similarity to the nifH gene of Azospirillum brasilense. The phylogenetic relationships of the nifH gene sequences were compared with those inferred from 16S rRNA gene sequences. Knowledge of the sequence of the nifH gene contributes to the growing database of nifH gene sequences, and will allow the detection of Acet. diazotrophicus from environmental samples with nifH gene-based primers.
Abstract:
A clone encoding ovine preprogastrin was isolated from a sheep genomic library. The deduced 104 amino acid sequence of ovine preprogastrin was 92% and 68% identical to the sequences of bovine and human preprogastrin, respectively. While the similarity was greatest in the gastrin-17 sequence, an unexpected similarity was also observed in the N-terminus of mature progastrin.
Abstract:
Conventionally, protein structure prediction via threading relies on some nonoptimal method to align a protein sequence to each member of a library of known structures. We show how a score function (force field) can be modified so as to allow the direct application of a dynamic programming algorithm to the problem. This involves an approximation whose damage can be minimized by an optimization process during score function parameter determination. The method is compared to sequence-to-structure alignments using a more conventional pair-wise score function and the frozen approximation. The new method produces results comparable to the frozen approximation, but is faster and has fewer adjustable parameters. It is also free of memory of the template's original amino acid sequence, and does not suffer from a problem of nonconvergence, which can be shown to occur with the frozen approximation. Alignments generated by the simplified score function can then be ranked using a second score function with the approximations removed. (C) 1999 John Wiley & Sons, Inc.
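The point about dynamic programming can be made concrete with a minimal sketch. Once the score for placing a residue at a template position no longer depends on where other residues are placed, a standard global alignment recurrence finds the optimum directly. The code below is a generic Needleman-Wunsch-style recurrence, not the authors' score function; the `site_score` matrix and the linear `gap` penalty are assumptions for illustration.

```python
from typing import List


def align_score(site_score: List[List[float]], gap: float = -1.0) -> float:
    """Best global alignment score of a sequence against a template.

    site_score[i][j] is the assumed score for placing sequence residue i
    at template position j, independent of all other placements.
    `gap` is a hypothetical linear penalty per skipped residue or position.
    """
    n = len(site_score)
    m = len(site_score[0]) if n else 0
    # best[i][j]: best score aligning the first i residues to the first j positions
    best = [[0.0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        best[i][0] = i * gap
    for j in range(1, m + 1):
        best[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            best[i][j] = max(
                best[i - 1][j - 1] + site_score[i - 1][j - 1],  # residue i on position j
                best[i - 1][j] + gap,                           # residue i left unaligned
                best[i][j - 1] + gap,                           # position j left empty
            )
    return best[n][m]
```

A pair-wise score function breaks this recurrence because the value of one placement depends on placements not yet decided, which is why the abstract contrasts the modified score function with approaches such as the frozen approximation.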
Abstract:
Allergies are a major cause of chronic ill health in industrialised countries with the incidence of reported cases steadily increasing. This Research Focus details how bioinformatics is transforming the field of allergy through providing databases for management of allergen data, algorithms for characterisation of allergic crossreactivity, structural motifs and B- and T-cell epitopes, tools for prediction of allergenicity and techniques for genomic and proteomic analysis of allergens.
Abstract:
Background: A major goal in the post-genomic era is to identify and characterise disease susceptibility genes and to apply this knowledge to disease prevention and treatment. Rodents and humans have remarkably similar genomes and share closely related biochemical, physiological and pathological pathways. In this work we utilised the latest information on the mouse transcriptome as revealed by the RIKEN FANTOM2 project to identify novel human disease-related candidate genes. We define a new term, patholog, to mean a homolog of a human disease-related gene encoding a product (transcript, anti-sense or protein) potentially relevant to disease. Rather than just focus on Mendelian inheritance, we applied the analysis to all potential pathologs regardless of their inheritance pattern. Results: Bioinformatic analysis and human curation of 60,770 RIKEN full-length mouse cDNA clones produced 2,578 sequences that showed similarity (70-85% identity) to known human-disease genes. Using a newly developed biological information extraction and annotation tool (FACTS) in parallel with human expert analysis of 17,051 MEDLINE scientific abstracts, we identified 182 novel potential pathologs. Of these, 36 were identified by computational tools only, 49 by human expert analysis only and 97 by both methods. These pathologs were related to neoplastic (53%), hereditary (24%), immunological (5%), cardiovascular (4%) or other (14%) disorders. Conclusions: Large scale genome projects continue to produce a vast amount of data with potential application to the study of human disease. For this potential to be realised we need intelligent strategies for data categorisation and the ability to link sequence data with relevant literature. This paper demonstrates the power of combining human expert annotation with FACTS, a newly developed bioinformatics tool, to identify novel pathologs from within large-scale mouse transcript datasets.
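The counts reported in the Results can be cross-checked with a few lines of arithmetic. The sketch below uses only figures stated in the abstract; the variable and function names are illustrative.

```python
# Counts reported in the abstract above.
by_method = {"computational only": 36, "expert only": 49, "both": 97}
total_pathologs = 182

# Disorder-class percentages reported in the abstract above.
by_disorder_pct = {
    "neoplastic": 53,
    "hereditary": 24,
    "immunological": 5,
    "cardiovascular": 4,
    "other": 14,
}


def shares(counts):
    """Percentage share of each category, rounded to one decimal place."""
    total = sum(counts.values())
    return {name: round(100 * n / total, 1) for name, n in counts.items()}


method_shares = shares(by_method)
```

The three method categories sum to the 182 pathologs reported, the disorder percentages sum to 100, and slightly more than half of the pathologs were found by both methods.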
Abstract:
MHCPEP is a curated database comprising over 9000 peptide sequences known to bind MHC molecules. Entries are compiled from published reports as well as from direct submissions of experimental data. Each entry contains the peptide sequence, its MHC specificity and, when available, experimental method, observed activity, binding affinity, source protein, anchor positions and publication references. The present format of the database allows text string matching searches but can easily be converted for use in conjunction with sequence analysis packages. The database can be accessed via Internet using WWW, FTP or Gopher.
Abstract:
Atypical enteropathogenic Escherichia coli (aEPEC) has been associated with infantile diarrhea in many countries. The clonal structure of aEPEC is the object of active investigation, but few works have dealt with its genetic relationship with other diarrheagenic E. coli (DEC). This study aimed to evaluate the genetic relationship of aEPEC with other DEC pathotypes. The phylogenetic relationships of DEC strains were evaluated by multilocus sequence typing. Genetic diversity was assessed by pulsed-field gel electrophoresis (PFGE). The phylogram showed that aEPEC strains were distributed in four major phylogenetic groups (A, B1, B2 and D). Cluster I (group B1) contains the majority of the strains and other pathotypes [enteroaggregative, enterotoxigenic and enterohemorrhagic E. coli (EHEC)]; cluster II (group A) also contains enteroaggregative and diffusely adherent E. coli; cluster III (group B2) has atypical and typical EPEC possessing H6 or H34 antigen; and cluster IV (group D) contains aEPEC O55:H7 strains and EHEC O157:H7 strains. PFGE analysis confirmed that these strains encompass a great genetic diversity. These results indicate that aEPEC clonal groups, especially the strains of phylogenetic group B1, have a particular genomic background that probably made possible the acquisition and expression of virulence factors derived from non-EPEC pathotypes.
Abstract:
We have developed a two-step PCR assay that amplifies a region of the ceja-1 sequence that is specific for virulent strains of Paracoccidioides brasiliensis. An internal region of the ceja-1 sequence was chosen for designing primers that were utilised in a single-tube heminested PCR protocol to amplify DNA from six virulent strains. PCR specificity was determined by the absence of amplified products with genomic DNA from four non-virulent strains of P. brasiliensis and from eight fungal pathogens, one bacterium, two protozoa, one worm, and mouse and human genomic DNA (leucocytes). The fact that the PCR product was only obtained with the genetic material from virulent isolates of P. brasiliensis suggested that this partial amplified sequence might be a marker of virulence for this fungus. The diagnostic potential of this PCR was confirmed by the successful amplification of this fragment with genomic DNA obtained from a lymph node aspirate from a patient with paracoccidioidomycosis.
Abstract:
We describe the genomic organization of a recently identified CC chemokine, MIP-3 alpha/CCL20 (HGMW-approved symbol SCYA20). The MIP-3 alpha/CCL20 gene was cloned and sequenced, revealing a four exon, three intron structure, and was localized by FISH analysis to 2q35-q36. Two distinct cDNAs were identified, encoding two forms of MIP-3 alpha/CCL20, Ala MIP-3 alpha/CCL20 and Ser MIP-3 alpha/CCL20, that differ by one amino acid at the predicted signal peptide cleavage site. Examination of the sequence around the boundary of intron 1 and exon 2 showed that use of alternative splice acceptor sites could give rise to Ala MIP-3 alpha/CCL20 or Ser MIP-3 alpha/CCL20. Both forms of MIP-3 alpha/CCL20 were chemically synthesized and tested for biological activity. Both flu antigen plus IL-2-activated CD4(+) and CD8(+) T lymphoblasts and cord blood-derived dendritic cells responded to Ser and Ala MIP-3 alpha/CCL20. T lymphocytes exposed only to IL-2 responded inconsistently, while no response was detected in naive T lymphocytes, monocytes, or neutrophils. The biological activity of Ser MIP-3 alpha/CCL20 and Ala MIP-3 alpha/CCL20 and the tissue-specific preference of different splice acceptor sites are not yet known. (C) 2001 Academic Press.
Abstract:
We report a further characterization of the genomic region containing the soybean supernodulation gene NTS-1. We performed a search for new markers linked to NTS-1 by combining DNA amplification fingerprinting (DAF) and bulked segregant analysis (BSA). The search resulted in one cloned polymorphism (B44-456) linked in trans, 8.5 cM from the locus. Southern hybridization showed duplication of the B44-456 sequence in the soybean genome. Additionally, a DNA database search revealed one Arabidopsis thaliana genomic clone from chromosome I possessing 62% homology to the B44-456 marker. A relatively low number of polymorphisms were identified by several PCR marker technologies for this soybean genomic region, providing additional support for its highly conserved and/or duplicated organization.
Abstract:
The complete nucleotide sequence of the mitochondrial (mt) DNA molecule of the liver fluke, Fasciola hepatica (phylum Platyhelminthes, class Trematoda, family Fasciolidae), was determined. It comprises 14462 bp, contains 12 protein-encoding, 2 ribosomal and 22 transfer RNA genes, and is the second complete flatworm (and the first trematode) mitochondrial sequence to be described in detail. All of the genes are transcribed from the same strand. Of the genes typically found in mitochondrial genomes of eumetazoans, only atp8 is absent. The nad4L and nad4 genes overlap by 40 nt. Most intergenic sequences are very short. Two larger non-coding regions are present. The longer one (817 nt) is located between trnG and cox3 and consists of 8 identical tandem repeats of 85 nt, rich in G and C, followed by 1 imperfect repeat. The shorter non-coding region (187 nt) exhibits no special features and is separated from the longer region by trnG. The gene arrangement resembles that of some other trematodes including the eastern Asian Schistosoma species (and cyclophyllidean cestode species) but it is strikingly different from that of the African schistosomes, represented by Schistosoma mansoni. The genetic code is as inferred previously for flatworms. Transfer RNA genes range in length from 58 to 70 nt, their products producing characteristic 'clover leaf' structures, except for tRNA(S-UCN) and tRNA(S-AGN), which lack the DHU arm.
Abstract:
A proportion of melanoma-prone individuals in both familial and non-familial contexts has been shown to carry inactivating mutations in either CDKN2A or, rarely, CDK4. CDKN2A is a complex locus that encodes two unrelated proteins from alternately spliced transcripts that are read in different frames. The alpha transcript (exons 1alpha, 2, and 3) produces the p16INK4A cyclin-dependent kinase inhibitor, while the beta transcript (exons 1beta and 2) is translated as p14ARF, a stabilizing factor of p53 levels through binding to MDM2. Mutations in exon 2 can impair both polypeptides, and insertions and deletions in exons 1alpha, 1beta, and 2 can theoretically generate p16INK4A-p14ARF fusion proteins. No online database currently takes into account all the consequences of these genotypes, a situation compounded by some problematic previous annotations of CDKN2A-related sequences and descriptions of their mutations. As an initiative of the international Melanoma Genetics Consortium, we have therefore established a database of germline variants observed in all loci implicated in familial melanoma susceptibility. Such a comprehensive, publicly accessible database is an essential foundation for research on melanoma susceptibility and its clinical application. Our database serves two types of data as defined by HUGO. The core dataset includes the nucleotide variants on the genomic and transcript levels, amino acid variants, and citation. The ancillary dataset includes keyword description of events at the transcription and translation levels and epidemiological data. The application that handles users' queries was designed in the model-view-controller architecture and was implemented in Java. The object-relational database schema was deduced using functional dependency analysis. We hereby present our first functional prototype of eMelanoBase. The service is accessible via the URL www.wmi.usyd.edu.au:8080/melanoma.html.
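The split between core and ancillary data can be pictured with a small data-model sketch. The abstract states that the real application was written in Java with an object-relational schema; the Python below is only an illustration, and every class name, field name and function is a hypothetical stand-in rather than the eMelanoBase schema.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class CoreRecord:
    """Core dataset fields named in the abstract; field names are hypothetical."""
    locus: str                 # e.g. "CDKN2A" or "CDK4"
    genomic_variant: str       # nucleotide variant at the genomic level
    transcript_variant: str    # nucleotide variant at the transcript level
    amino_acid_variant: str
    citation: str


@dataclass
class AncillaryRecord:
    """Ancillary dataset fields named in the abstract; field names are hypothetical."""
    keywords: List[str] = field(default_factory=list)  # transcription/translation events
    epidemiology: Optional[str] = None


@dataclass
class VariantEntry:
    core: CoreRecord
    ancillary: AncillaryRecord = field(default_factory=AncillaryRecord)


def by_locus(entries: List[VariantEntry], locus: str) -> List[VariantEntry]:
    """Answer a simple user query: all entries recorded for one locus."""
    return [e for e in entries if e.core.locus == locus]
```

Keeping the ancillary record separate and optional mirrors the abstract's distinction between data every entry must carry and descriptive data that may be absent.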