397 results for Gene annotation
Abstract:
The striped catfish (Pangasianodon hypophthalmus) culture industry in the Mekong Delta in Vietnam has developed rapidly over the past decade. However, the industry now faces significant challenges, especially from climate change impacts, notably predicted extensive saltwater intrusion into many low-lying coastal provinces across the Mekong Delta. This problem highlights a need to develop culture stocks that can tolerate more saline culture environments as a response to the expansion of saline water-intruded land. While a traditional artificial selection program can potentially address this need, understanding the genomic basis of salinity tolerance can assist development of more productive culture lines. The current study applied a transcriptomic approach using Ion PGM technology to generate expressed sequence tag (EST) resources from the intestine and swim bladder of striped catfish reared at a salinity of 9 ppt, the level that showed the best growth performance. A total of 467.8 Mbp of sequence data was generated, consisting of 4,116,424 reads with an average length of 112 bp. De novo assembly generated 51,188 contigs and allowed identification of 16,116 putative genes based on the GenBank non-redundant database. GO annotation, KEGG pathway mapping, and functional annotation of the EST sequences recovered a wide diversity of biological functions and processes. In addition, more than 11,600 simple sequence repeats were detected. This is the first comprehensive analysis of a striped catfish transcriptome, and it provides a valuable genomic resource for future selective breeding programs and for functional or evolutionary studies of genes that influence salinity tolerance in this important culture species.
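Simple sequence repeat detection of the kind mentioned in this abstract can be sketched with a regular expression. This is a minimal illustration and not the pipeline used in the study: the function name `find_ssrs`, the motif lengths (2 to 6 bp), and the minimum of five tandem copies are all assumptions chosen for the example.

```python
import re

def find_ssrs(seq, min_motif=2, max_motif=6, min_repeats=5):
    """Return (start, motif, copies) for each perfect tandem repeat in seq.

    The thresholds are illustrative assumptions, not values from the study.
    """
    seq = seq.upper()
    # The group captures the shortest motif that works (lazy quantifier);
    # the backreference then demands at least min_repeats - 1 further copies.
    pattern = re.compile(
        r"([ACGT]{%d,%d}?)\1{%d,}" % (min_motif, max_motif, min_repeats - 1)
    )
    hits = []
    for m in pattern.finditer(seq):
        motif = m.group(1)
        copies = len(m.group(0)) // len(motif)
        hits.append((m.start(), motif, copies))
    return hits

# Hypothetical contig: six copies of "AC" flanked by non-repetitive bases.
print(find_ssrs("TTG" + "AC" * 6 + "GGT"))
```

Only perfect repeats are reported, and a long homopolymer run would be reported as a dinucleotide repeat (for example motif "AA"), so a real analysis would need extra filtering.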
Abstract:
In the Yersinia pseudotuberculosis serotyping scheme, 21 serotypes are present originating from about 30 different O-factors distributed within the species. With regard to the chemical structures of lipopolysaccharides (LPSs) and the genetic basis of their biosynthesis, a number, but not all, of Y. pseudotuberculosis strains representing different serotypes have been investigated. In order to present an overall picture of the relationship between genetics and structures, we have been working on the genetics and structures of various Y. pseudotuberculosis O-specific polysaccharides (OPSs). Here, we present a structural and genetic analysis of the Y. pseudotuberculosis serotype O:11 OPS. Our results showed that this OPS structure has the same backbone as that of Y. pseudotuberculosis O:1b, but with a 6d-l-Altf side-branch instead of Parf. The 3′ end of the gene cluster is the same as that for O:1b and has the genes for synthesis of the backbone and for processing the completed repeat unit. The 5′ end has genes for synthesis of 6d-l-Altf and its transfer to the repeating unit backbone. The pathway for the synthesis of the 6d-l-Altf appears to be different from that for 6d-l-Altp in Y. enterocolitica O:3. The chemical structure of the O:11 repeating unit is [Figure]
Abstract:
A major virulence factor for Yersinia pseudotuberculosis is lipopolysaccharide, including O-polysaccharide (OPS). Currently, the OPS-based serotyping scheme for Y. pseudotuberculosis includes 21 known O-serotypes, with genetic and structural data available for 17 of them. The completion of the OPS structures and genetics of this species will enable the visualization of relationships between O-serotypes and allow for analysis of the evolutionary processes within the species that give rise to new serotypes. Here we present the OPS structure and gene cluster of serotype O:12, thus adding one more to the set of completed serotypes, and show that this serotype is present in both Y. pseudotuberculosis and the newly identified Y. similis species. The O:12 structure is shown to include two rare sugars: 4-C[(R)-1-hydroxyethyl]-3,6-dideoxy-d-xylo-hexose (d-yersiniose) and 6-deoxy-l-glucopyranose (l-quinovose). We have identified a novel putative guanosine diphosphate (GDP)-l-fucose 4-epimerase gene and propose a pathway for the synthesis of GDP-l-quinovose, which extends the known GDP-l-fucose pathway.
Abstract:
Extracellular polysaccharides are major immunogenic components of the bacterial cell envelope. However, little is known about their biosynthesis in the genus Acinetobacter, which includes A. baumannii, an important nosocomial pathogen. Whether Acinetobacter spp. produce a capsule, a lipopolysaccharide carrying an O antigen, or both is not resolved. To explore these issues, genes involved in the synthesis of complex polysaccharides were located in 10 complete A. baumannii genome sequences, and the function of each of their products was predicted via comparison to enzymes with a known function. The absence of a gene encoding a WaaL ligase, required to link the carbohydrate polymer to the lipid A-core oligosaccharide (lipooligosaccharide) forming lipopolysaccharide, suggests that only a capsule is produced. Nine distinct arrangements of a large capsule biosynthesis locus, designated KL1 to KL9, were found in the genomes. Three forms of a second, smaller variable locus, likely to be required for synthesis of the outer core of the lipid A-core moiety, were designated OCL1 to OCL3 and also annotated. Each K locus includes genes for capsule export as well as genes for synthesis of activated sugar precursors, and for glycosyltransfer, glycan modification and oligosaccharide repeat-unit processing. The K loci all include the export genes at one end and genes for synthesis of common sugar precursors at the other, with a highly variable region that includes the remaining genes in between. Five different capsule loci, KL2, KL6, KL7, KL8 and KL9, were detected in multiply antibiotic-resistant isolates belonging to global clone 2, and two other loci, KL1 and KL4, in global clone 1. This indicates that this region is being substituted repeatedly in multiply antibiotic-resistant isolates from these clones.
Abstract:
The O-specific polysaccharide (OPS) is a variable constituent of the lipopolysaccharide of Gram-negative bacteria. The polymorphic nature of OPSs within a species is usually first defined serologically, and the current serotyping scheme for Yersinia pseudotuberculosis consists of 21 O serotypes, of which 15 have been characterized genetically and structurally. Here, we present the structure and DNA sequence of Y. pseudotuberculosis O:10 OPS. The O unit consists of one residue each of d-galactopyranose, N-acetyl-d-galactosamine (2-amino-2-deoxy-d-galactopyranose) and d-glucopyranose in the backbone, with two colitose (3,6-dideoxy-l-xylo-hexopyranose) side-branch residues. This structure is very similar to that shared by Escherichia coli O111 and Salmonella enterica O35. The gene cluster sequences of these serotypes, however, have only low levels of similarity to that of Y. pseudotuberculosis O:10, although there is significant conservation of gene order. Within Y. pseudotuberculosis, the O:10 structure is most closely related to the O:6 and O:7 structures.
Abstract:
Many, but not all, of the current 21 serotypes of Yersinia pseudotuberculosis have been investigated with regard to the chemical structures of their O-specific polysaccharide (OPS) and the genetic basis of their biosynthesis. Completion of the genetics and structures of the remaining serotypes will enhance our understanding of the emerging relationship between genetics and structures within this species. Here, we present a structural and genetic analysis of the Y. pseudotuberculosis serotype O:1c OPS. Our results showed that this OPS has the same backbone as Y. pseudotuberculosis O:2b, but with a 3,6-dideoxy-D-ribo-hexofuranose (paratofuranose, Parf) side-branch instead of a 3,6-dideoxy-D-xylo-hexopyranose (abequopyranose, Abep). The 3'-end of the gene cluster is the same as for O:2b and has the genes for synthesis of the backbone and for processing the completed repeat unit. The 5'-end of the cluster consists of the same genes as O:1b for synthesis of Parf and a related gene for its transfer to the repeating unit backbone.
Abstract:
Lipooligosaccharide (LOS) is a complex surface structure that is linked to many pathogenic properties of Acinetobacter baumannii. In A. baumannii, the genes responsible for the synthesis of the outer core (OC) component of the LOS are located between ilvE and aspS. The content of the OC locus is usually variable within a species, and examination of 6 complete and 227 draft A. baumannii genome sequences available in GenBank non-redundant and Whole Genome Shotgun databases revealed nine distinct new types, OCL4-OCL12, in addition to the three known ones. The twelve gene clusters fell into two distinct groups, designated Group A and Group B, based on similarities in the genes present. OCL6 (Group B) was unique in that it included genes for the synthesis of L-Rhamnosep. Genetic exchange of the different configurations between strains has occurred as some OC forms were found in several different sequence types (STs). OCL1 (Group A) was the most widely distributed being present in 18 STs, and OCL6 was found in 16 STs. Variation within clones was also observed, with more than one OC locus type found in the two globally disseminated clones, GC1 and GC2, that include the majority of multiply antibiotic resistant isolates. OCL1 was the most abundant gene cluster in both GC1 and GC2 genomes but GC1 isolates also carried OCL2, OCL3 or OCL5, and OCL3 was also present in GC2. As replacement of the OC locus in the major global clones indicates the presence of sub-lineages, a PCR typing scheme was developed to rapidly distinguish Group A and Group B types, and to distinguish the specific forms found in GC1 and GC2 isolates.
Abstract:
Common diseases such as endometriosis (ED), Alzheimer's disease (AD) and multiple sclerosis (MS) account for a significant proportion of the health care burden in many countries. Genome-wide association studies (GWASs) for these diseases have identified a number of individual genetic variants contributing to disease risk. However, the effect size for most variants is small, and collectively the known variants explain only a small proportion of the estimated heritability. We used a linear mixed model to fit all single nucleotide polymorphisms (SNPs) simultaneously, and estimated genetic variances on the liability scale using SNPs from GWASs in unrelated individuals for these three diseases. For each of the three diseases, case and control samples were not all genotyped in the same laboratory. We demonstrate that a careful analysis can obtain robust estimates, but also that insufficient quality control (QC) of SNPs can lead to spurious results and that overly stringent QC is likely to remove real genetic signals. Our estimates show that common SNPs on commercially available genotyping chips capture significant variation contributing to liability for all three diseases. The estimated proportion of total variation tagged by all SNPs was 0.26 (SE 0.04) for ED, 0.24 (SE 0.03) for AD and 0.30 (SE 0.03) for MS. Further, we partitioned the genetic variance explained by minor allele frequency (MAF) into five categories, by chromosome and by gene annotation. We provide strong evidence that a substantial proportion of variation in liability is explained by common SNPs, and thereby give insights into the genetic architecture of the diseases.
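To make "on the liability scale" concrete, the sketch below applies one widely used transformation for ascertained case-control samples, h2_liability = h2_observed × K²(1 − K)² / (z² P(1 − P)), where K is the population prevalence, P the fraction of cases in the sample, and z the standard normal density at the liability threshold. The abstract does not state which transformation was used, so the formula choice, the function name `liability_scale_h2`, and all input values are assumptions for illustration only.

```python
from statistics import NormalDist

def liability_scale_h2(h2_observed, prevalence, case_fraction):
    """Convert an observed-scale (0/1) variance proportion to the liability scale.

    prevalence    -- assumed population prevalence K
    case_fraction -- proportion of cases P in the analysed sample
    """
    nd = NormalDist()
    # Threshold on the standard normal liability above which individuals are affected.
    threshold = nd.inv_cdf(1.0 - prevalence)
    # Height of the normal density at that threshold.
    z = nd.pdf(threshold)
    k, p = prevalence, case_fraction
    return h2_observed * (k * (1 - k)) ** 2 / (z ** 2 * p * (1 - p))

# Hypothetical inputs: 1% prevalence, equal numbers of cases and controls.
print(liability_scale_h2(0.40, prevalence=0.01, case_fraction=0.5))
```

The result is sensitive to the assumed prevalence, which is one reason liability-scale estimates are usually reported together with the prevalence used.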
Abstract:
Staphylococcus aureus (S. aureus) is a prominent human and livestock pathogen investigated widely using omic technologies. Critically, due to availability, low visibility or scattered resources, robust network and statistical contextualisation of the resulting data is generally under-represented. Here, we present novel meta-analyses of freely accessible molecular network and gene ontology annotation information resources for S. aureus omics data interpretation. Furthermore, by applying the gene ontology annotation resources to publicly available data, we demonstrate their value and ability (or lack thereof) to summarise and statistically interpret the emergent properties of gene expression and protein abundance changes. This analysis provides simple metrics for network selection and demonstrates the availability of gene ontology annotation and the impact that its selection can have on the contextualisation of bacterial omics data.
Abstract:
Background: There is evidence that certain mutations in the double-strand break repair pathway ataxia-telangiectasia mutated gene act in a dominant-negative manner to increase the risk of breast cancer. There are also some reports to suggest that the amino acid substitution variants T2119C Ser707Pro and C3161G Pro1054Arg may be associated with breast cancer risk. We investigate the breast cancer risk associated with these two nonconservative amino acid substitution variants using a large Australian population-based case–control study. Methods: The polymorphisms were genotyped in more than 1300 cases and 600 controls using 5' exonuclease assays. Case–control analyses and genotype distributions were compared by logistic regression. Results: The 2119C variant was rare, occurring at frequencies of 1.4 and 1.3% in cases and controls, respectively (P = 0.8). There was no difference in genotype distribution between cases and controls (P = 0.8), and the TC genotype was not associated with increased risk of breast cancer (adjusted odds ratio = 1.08, 95% confidence interval = 0.59–1.97, P = 0.8). Similarly, the 3161G variant was no more common in cases than in controls (2.9% versus 2.2%, P = 0.2), there was no difference in genotype distribution between cases and controls (P = 0.1), and the CG genotype was not associated with an increased risk of breast cancer (adjusted odds ratio = 1.30, 95% confidence interval = 0.85–1.98, P = 0.2). This lack of evidence for an association persisted within groups defined by the family history of breast cancer or by age. Conclusion: The 2119C and 3161G amino acid substitution variants are not associated with moderate or high risks of breast cancer in Australian women.
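The odds ratios in this abstract are adjusted estimates from logistic regression, which cannot be reproduced from the text alone. As a simpler illustration of how an odds ratio and its 95% confidence interval relate, the sketch below computes an unadjusted odds ratio from a 2×2 table using the standard log-scale (Woolf) interval. The function name `odds_ratio_ci` and the counts are hypothetical and are not taken from the study.

```python
import math

def odds_ratio_ci(a, b, c, d, z=1.96):
    """Unadjusted odds ratio with a log-scale (Woolf) confidence interval.

    a -- cases carrying the variant      b -- cases without the variant
    c -- controls carrying the variant   d -- controls without the variant
    z -- normal quantile (1.96 gives an approximate 95% interval)
    """
    odds_ratio = (a * d) / (b * c)
    # Standard error of log(OR) from the four cell counts.
    se_log_or = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = math.exp(math.log(odds_ratio) - z * se_log_or)
    upper = math.exp(math.log(odds_ratio) + z * se_log_or)
    return odds_ratio, lower, upper

# Hypothetical counts, for illustration only.
print(odds_ratio_ci(36, 1264, 16, 584))
```

An interval that spans 1, as in both results reported above, is consistent with no association at the chosen confidence level.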
Abstract:
The tissue kallikreins are serine proteases encoded by highly conserved multigene families. The rodent kallikrein (KLK) families are particularly large, consisting of 13–26 genes clustered in one chromosomal locus. It has been recently recognised that the human KLK gene family is of a similar size (15 genes) with the identification of another 12 related genes (KLK4-KLK15) within and adjacent to the original human KLK locus (KLK1-3) on chromosome 19q13.4. The structural organisation and size of these new genes are similar to those of other KLK genes except for additional exons encoding 5′ or 3′ untranslated regions. Moreover, many of these genes have multiple mRNA transcripts, a trait not observed with rodent genes. Unlike all other kallikreins, the KLK4-KLK15 encoded proteases are less related (25–44%) and do not contain a conventional kallikrein loop. Clusters of genes exhibit high prostatic (KLK2-4, KLK15) or pancreatic (KLK6-13) expression, suggesting evolutionary conservation of elements conferring tissue specificity. These genes are also expressed, to varying degrees, in a wider range of tissues, suggesting a functional involvement of these newer human kallikrein proteases in a diverse range of physiological processes.