956 resultados para Genomic sequence database
Resumo:
The use of molecular data to reconstruct the history of divergence and gene flow between populations of closely related taxa represents a challenging problem. It has been proposed that the long-standing debate about the geography of speciation can be resolved by comparing the likelihoods of a model of isolation with migration and a model of secondary contact. However, data are commonly only fit to a model of isolation with migration and rarely tested against the secondary contact alternative. Furthermore, most demographic inference methods have neglected variation in introgression rates and assume that the gene flow parameter (Nm) is similar among loci. Here, we show that neglecting this source of variation can give misleading results. We analysed DNA sequences sampled from populations of the marine mussels, Mytilus edulis and M. galloprovincialis, across a well-studied mosaic hybrid zone in Europe and evaluated various scenarios of speciation, with or without variation in introgression rates, using an Approximate Bayesian Computation (ABC) approach. Models with heterogeneous gene flow across loci always outperformed models assuming equal migration rates irrespective of the history of gene flow being considered. By incorporating this heterogeneity, the best-supported scenario was a long period of allopatric isolation during the first three-quarters of the time since divergence followed by secondary contact and introgression during the last quarter. By contrast, constraining migration to be homogeneous failed to discriminate among any of the different models of gene flow tested. Our simulations thus provide statistical support for the secondary contact scenario in the European Mytilus hybrid zone that the standard coalescent approach failed to confirm. Our results demonstrate that genomic variation in introgression rates can have profound impacts on the biological conclusions drawn from inference methods and needs to be incorporated in future studies.
Resumo:
Three-dimensional sequence stratigraphy is a potent exploration and development tool for the discovery of subtle stratigraphic traps. Reservoir morphology, heterogeneity and subtle stratigraphic trapping mechanisms can be better understood through systematic horizontal identification of sedimentary facies of systems tracts provided by three-dimensional attribute maps used as an important complement to the sequential analysis on the two-dimensional seismic lines and the well log data. On new prospects as well as on already-producing fields, the additional input of sequential analysis on three-dimensional data enables the identification, location and precise delimitation of new potentially productive zones. The first part of this paper presents four typical horizontal seismic facies assigned to the successive systems tracts of a third- or fourth-order sequence deposited in inner to outer neritic conditions on a elastic shelf. The construction of this synthetic representative sequence is based on the observed reproducibility of the horizontal seismic facies response to cyclic eustatic events on more than 35 sequences registered in the Gulf coast Plio-Pleistocene and Late Miocene, offshore Louisiana in the West Cameron region of the Gulf of Mexico. The second part shows how three-dimensional sequence stratigraphy can contribute in localizing and understanding sedimentary facies associated with productive zones. A case study in the early Middle Miocene Cibicides opima sands shows multiple stacked gas accumulations in the top slope fan, prograding wedge and basal transgressive systems tract of the third-order sequence between SB15.5 and SB 13.8 Ma.
Resumo:
Purpose: Previously we reported on a premature termination mutation in SLC16A12 that leads to dominant juvenile cataract and renal glucosuria. To assess the mutation rate and genotype-phenotype correlations of SLC16A12 in juvenile or age-related forms of cataract, we performed a mutation screen in cataract patients. Methods: Clinical data of approximately 660 patients were collected, genomic DNA was isolated and analyzed. Exons 3 to 8 including flanking intron sequences of SLC16A12 were PCR amplified and DNA sequence was determined. Selected mutations were tested by cell culture assays, in silico analysis and RT-PCR. Results: We found sequence alterations at a rate of approximately 1/75 patients. None of them was found in 360 control alleles. Alterations affect splice site and regulatory region but most mutations caused an amino acid substitution. The majority of the coding region mutations maps to trans-membrane domains. One mutation located to the 5'UTR. It affects translational efficiency of SLC16A12. In addition, we identified a cataract-predisposing SNP in the non-coding region that causes allele-specific splicing of the 5'UTR region. Conclusions: Altered translational efficiency of the solute carrier SLC16A12 and its allele-specific splicing strongly support a model of challenged homeostasis to cause various forms of cataract. In addition, the pathogenic property of the here reported sequence alterations is supported by the lack of known sequence variations within the coding region of SLC16A12. Due to the relatively high mutation rate, we suggest to include SLC16A12 in diagnostic cataract screening. Generally, our data recommend the assessment of regulatory sequences for diagnostic purposes.
Resumo:
BACKGROUND & AIMS: Regulation of gene expression in the follicle-associated epithelium (FAE) over Peyer's patches is largely unknown. CCL20, a chemokine that recruits immature dendritic cells, is one of the few FAE-specific markers described so far. Lymphotoxin beta (LTalpha1beta2) expressed on the membrane of immune cells triggers CCL20 expression in enterocytes. In this study, we measured expression profiles of LTalpha1beta2-treated intestinal epithelial cells and selected CCL20 -coregulated genes to identify new FAE markers. METHODS: Genomic profiles of T84 and Caco-2 cell lines treated with either LTalpha1beta2, flagellin, or tumor necrosis factor alpha were measured using the Affymetrix GeneChip U133A. Clustering analysis was used to select CCL20 -coregulated genes, and laser dissection microscopy and real-time polymerase chain reaction on human biopsy specimens was used to assess the expression of the selected markers. RESULTS: Applying a 2-way analysis of variance, we identified regulated genes upon the different treatments. A subset of genes involved in inflammation and related to the nuclear factor kappaB pathway was coregulated with CCL20 . Among these genes, the antiapoptotic factor TNFAIP3 was highly expressed in the FAE. CCL23 , which was not coregulated in vitro with CCL20 , was also specifically expressed in the FAE. CONCLUSIONS: We have identified 2 novel human FAE specifically expressed genes. Most of the CCL20 -coregulated genes did not show FAE-specific expression, suggesting that other signaling pathways are critical to modulate FAE-specific gene expression.
Resumo:
BACKGROUND: Membrane-bound organelles are a defining feature of eukaryotic cells, and play a central role in most of their fundamental processes. The Rab G proteins are the single largest family of proteins that participate in the traffic between organelles, with 66 Rabs encoded in the human genome. Rabs direct the organelle-specific recruitment of vesicle tethering factors, motor proteins, and regulators of membrane traffic. Each organelle or vesicle class is typically associated with one or more Rab, with the Rabs present in a particular cell reflecting that cell's complement of organelles and trafficking routes. RESULTS: Through iterative use of hidden Markov models and tree building, we classified Rabs across the eukaryotic kingdom to provide the most comprehensive view of Rab evolution obtained to date. A strikingly large repertoire of at least 20 Rabs appears to have been present in the last eukaryotic common ancestor (LECA), consistent with the 'complexity early' view of eukaryotic evolution. We were able to place these Rabs into six supergroups, giving a deep view into eukaryotic prehistory. CONCLUSIONS: Tracing the fate of the LECA Rabs revealed extensive losses with many extant eukaryotes having fewer Rabs, and none having the full complement. We found that other Rabs have expanded and diversified, including a large expansion at the dawn of metazoans, which could be followed to provide an account of the evolutionary history of all human Rabs. Some Rab changes could be correlated with differences in cellular organization, and the relative lack of variation in other families of membrane-traffic proteins suggests that it is the changes in Rabs that primarily underlies the variation in organelles between species and cell types.
Resumo:
Background: The variety of DNA microarray formats and datasets presently available offers an unprecedented opportunity to perform insightful comparisons of heterogeneous data. Cross-species studies, in particular, have the power of identifying conserved, functionally important molecular processes. Validation of discoveries can now often be performed in readily available public data which frequently requires cross-platform studies.Cross-platform and cross-species analyses require matching probes on different microarray formats. This can be achieved using the information in microarray annotations and additional molecular biology databases, such as orthology databases. Although annotations and other biological information are stored using modern database models ( e. g. relational), they are very often distributed and shared as tables in text files, i.e. flat file databases. This common flat database format thus provides a simple and robust solution to flexibly integrate various sources of information and a basis for the combined analysis of heterogeneous gene expression profiles.Results: We provide annotationTools, a Bioconductor-compliant R package to annotate microarray experiments and integrate heterogeneous gene expression profiles using annotation and other molecular biology information available as flat file databases. First, annotationTools contains a specialized set of functions for mining this widely used database format in a systematic manner. It thus offers a straightforward solution for annotating microarray experiments. Second, building on these basic functions and relying on the combination of information from several databases, it provides tools to easily perform cross-species analyses of gene expression data.Here, we present two example applications of annotationTools that are of direct relevance for the analysis of heterogeneous gene expression profiles, namely a cross-platform mapping of probes and a cross-species mapping of orthologous probes using different orthology databases. We also show how to perform an explorative comparison of disease-related transcriptional changes in human patients and in a genetic mouse model.Conclusion: The R package annotationTools provides a simple solution to handle microarray annotation and orthology tables, as well as other flat molecular biology databases. Thereby, it allows easy integration and analysis of heterogeneous microarray experiments across different technological platforms or species.
Resumo:
Gene duplication and neofunctionalization are known to be important processes in the evolution of phenotypic complexity. They account for important evolutionary novelties that confer ecological adaptation, such as the major histocompatibility complex (MHC), a multigene family crucial to the vertebrate immune system. In birds, two MHC class II β (MHCIIβ) exon 3 lineages have been recently characterized, and two hypotheses for the evolutionary history of MHCIIβ lineages were proposed. These lineages could have arisen either by 1) an ancient duplication and subsequent divergence of one paralog or by 2) recent parallel duplications followed by functional convergence. Here, we compiled a data set consisting of 63 MHCIIβ exon 3 sequences from six avian orders to distinguish between these hypotheses and to understand the role of selection in the divergent evolution of the two avian MHCIIβ lineages. Based on phylogenetic reconstructions and simulations, we show that a unique duplication event preceding the major avian radiations gave rise to two ancestral MHCIIβ lineages that were each likely lost once later during avian evolution. Maximum likelihood estimation shows that following the ancestral duplication, positive selection drove a radical shift from basic to acidic amino acid composition of a protein domain facing the α-chain in the MHCII α β-heterodimer. Structural analyses of the MHCII α β-heterodimer highlight that three of these residues are potentially involved in direct interactions with the α-chain, suggesting that the shift following duplication may have been accompanied by coevolution of the interacting α- and β-chains. These results provide new insights into the long-term evolutionary relationships among avian MHC genes and open interesting perspectives for comparative and population genomic studies of avian MHC evolution.
Resumo:
Aims: The adaptive immune response against hepatitis C virus (HCV) is significantly shaped by the host's composition of HLA alleles. Thus, the HLA phenotype is a critical determinant of viral evolution during adaptive immune pressure. Potential associations of HLA class I alleles with polymorphisms of HCV immune escape variants are largely unknown. Methods: Direct sequence analysis of the genes encoding the HCV proteins E2, NS3 and NS5B in a cohort of 159 patients with chronic HCV genotype 1 infection who were treated with pegylated interferon-alfa 2b and ribavirin in a prospective controlled trial for 48 weeks was exhibited. HLA class I genotyping was performed by strand-specific reverse hybridization with the INNO-LiPA line probe assays for HLA-A and HLA-B and by strand-specific PCR-SSP. We analyzed each amino acid position of HCV proteins using an extension of Fisher's exact test for associations with HLA alleles. In addition, associations of specific HLA alleles with inflammatory activity, liver fibrosis, HCV RNA viral load and virologic treatment outcome were investigated. Results: Separate analyses of HCV subtype 1a and 1b isolates revealed substantially different patterns of HLA-restricted polymorphisms between subtypes. Only one polymorphism within NS5B (V2758x) was significantly associated with HLA B*15 in HCV genotype 1b infected patients (adjusted p=0,048). However, a number of HLA class I-restricted polymorphisms within novel putative HCV CD8+ T cell epitopes (genotype 1a: HLA-A*11 GTRTIASPK1086-1094 [NS3], HLA-B*07 WPAPQGARSL1111-1120 [NS3]; genotype 1b: HLA-A*24 HYAPRPCGI488-496 [E2], HLA-B*44 GENETDVLL530-538 [E2], HLA-B*15 RVFTEAMTRY2757-2766 [NS5B]) were observed with high predicted epitope binding scores assessed by the web-based software SYFPEITHI (>21). Most of the identified putative epitopes were overlapping with already otherwise published epitopes, indicating a high immunogenicity of the accordant HCV protein region. In addition, certain HLA class I alleles were associated with inflammatory activity, stage of liver fibrosis, and sustained virologic response to antiviral therapy. Conclusions: HLA class I restricted HCV sequence polymorphisms are rare. HCV polymorphisms identified within putative HCV CD8+ T cell epitopes in the present study differ in their genomic distribution between genotype 1a and 1b isolates, implying divergent adaptation to the host's immune pressure on the HCV subtype level.
Resumo:
PURPOSE: To provide a mechanistic link between mutations in PRPF31, and essential and ubiquitously expressed gene, and retinitis pigmentosa, a disorder restricted to the eye. METHODS: We investigated the existence of retina-specific PRPF31 isoforms and the expression of this gene in human retina and other tissues, as well as in cultured human cell lines. PRPF31 transcripts were examined by RT-PCR, quantitative PCR, cloning and sequencing. RESULTS: Database searching revealed the presence of a retina-specific PRPF31 isoform in mouse. However, this isoform could not be experimentally identified in transcripts from human retina or from a human whole eye. Nevertheless, four different PRPF31 isoforms, that were common to all analyzed tissues and cell lines, were isolated. Three of these harbored the full-length PRPF31 coding sequence, whereas the fourth was very short and probably non-coding. The amount of PRPF31 mRNA was previously found to be lower in patients with mutations in this gene than in healthy individuals, making it likely that retinal cells are more sensitive to variation in PRPF31 expression. However, quantitative PCR experiments revealed that PRPF31 mRNA levels in human retina were comparable to those detected in other tissues. CONCLUSIONS: Our results show that the retina-restricted phenotype caused by PRPF31 mutations cannot be explained by the presence of tissue-specific isoforms, or by differential expression of PRPF31 in the retina. As a consequence, the etiology of PRPF31-associated retinitis pigmentosa likely relies on other, probably more subtle molecular mechanisms.
Resumo:
Report produced by Iowa Departmment of Agriculture and Land Stewardship
Resumo:
Ants (Hymenoptera, Formicidae) represent one of the most successful eusocial taxa in terms of both their geographic distribution and species number. The publication of seven ant genomes within the past year was a quantum leap for socio- and ant genomics. The diversity of social organization in ants makes them excellent model organisms to study the evolution of social systems. Comparing the ant genomes with those of the honeybee, a lineage that evolved eusociality independently from ants, and solitary insects suggests that there are significant differences in key aspects of genome organization between social and solitary insects, as well as among ant species. Altogether, these seven ant genomes open exciting new research avenues and opportunities for understanding the genetic basis and regulation of social species, and adaptive complex systems in general.
Resumo:
We propose and validate a multivariate classification algorithm for characterizing changes in human intracranial electroencephalographic data (iEEG) after learning motor sequences. The algorithm is based on a Hidden Markov Model (HMM) that captures spatio-temporal properties of the iEEG at the level of single trials. Continuous intracranial iEEG was acquired during two sessions (one before and one after a night of sleep) in two patients with depth electrodes implanted in several brain areas. They performed a visuomotor sequence (serial reaction time task, SRTT) using the fingers of their non-dominant hand. Our results show that the decoding algorithm correctly classified single iEEG trials from the trained sequence as belonging to either the initial training phase (day 1, before sleep) or a later consolidated phase (day 2, after sleep), whereas it failed to do so for trials belonging to a control condition (pseudo-random sequence). Accurate single-trial classification was achieved by taking advantage of the distributed pattern of neural activity. However, across all the contacts the hippocampus contributed most significantly to the classification accuracy for both patients, and one fronto-striatal contact for one patient. Together, these human intracranial findings demonstrate that a multivariate decoding approach can detect learning-related changes at the level of single-trial iEEG. Because it allows an unbiased identification of brain sites contributing to a behavioral effect (or experimental condition) at the level of single subject, this approach could be usefully applied to assess the neural correlates of other complex cognitive functions in patients implanted with multiple electrodes.
Resumo:
The Pseudomonas aeruginosa gene anr, which encodes a structural and functional analog of the anaerobic regulator Fnr in Escherichia coli, was mapped to the SpeI fragment R, which is at about 59 min on the genomic map of P. aeruginosa PAO1. Wild-type P. aeruginosa PAO1 grew under anaerobic conditions with nitrate, nitrite, and nitrous oxide as alternative electron acceptors. An anr deletion mutant, PAO6261, was constructed. It was unable to grow with these alternative electron acceptors; however, its ability to denitrify was restored upon the introduction of the wild-type anr gene. In addition, the activities of two enzymes in the denitrification pathway, nitrite reductase and nitric oxide reductase, were not detectable under oxygen-limiting conditions in strain PAO6261 but were restored when complemented with the anr+ gene. These results indicate that the anr gene product plays a key role in anaerobically activating the entire denitrification pathway.
Identification of Leishmania major cysteine proteinases as targets of the immune response in humans.
Resumo:
In this study, we report the identification of two parasite polypeptides recognized by human sera of patients infected with Leishmania major. Isolation and sequencing of the two genes encoding these polypeptides revealed that one of the genes is similar to the L. major cathepsin L-like gene family CPB, whereas the other gene codes for the L. major homologue of the cysteine proteinase a (CPA) of L. mexicana. By restriction enzyme digestion of genomic DNA, we show that the CPB gene is present in multiple copies in contrast to the cysteine proteinase CPA gene which could be unique. Specific antibodies directed against the mature regions of both types expressed in Escherichia coli were used to analyze the expression of these polypeptides in different stages of the parasite's life cycle. Polypeptides of 27 and 40 kDa in size, corresponding to CPA and CPB respectively, were detected at higher level in amastigotes than in stationary phase promastigotes. Purified recombinant CPs were also used to examine the presence of specific antibodies in sera from either recovered or active cases of cutaneous leishmaniasis patients. Unlike sera from healthy uninfected controls, all the sera reacted with recombinant CPA and CPB. This finding indicates that individuals having recovered from cutaneous leishmaniasis or with clinically apparent disease have humoral responses to cysteine proteinases demonstrating the importance of these proteinases as targets of the immune response and also their potential use for serodiagnosis.
Resumo:
HAMAP (High-quality Automated and Manual Annotation of Proteins-available at http://hamap.expasy.org/) is a system for the automatic classification and annotation of protein sequences. HAMAP provides annotation of the same quality and detail as UniProtKB/Swiss-Prot, using manually curated profiles for protein sequence family classification and expert curated rules for functional annotation of family members. HAMAP data and tools are made available through our website and as part of the UniRule pipeline of UniProt, providing annotation for millions of unreviewed sequences of UniProtKB/TrEMBL. Here we report on the growth of HAMAP and updates to the HAMAP system since our last report in the NAR Database Issue of 2013. We continue to augment HAMAP with new family profiles and annotation rules as new protein families are characterized and annotated in UniProtKB/Swiss-Prot; the latest version of HAMAP (as of 3 September 2014) contains 1983 family classification profiles and 1998 annotation rules (up from 1780 and 1720). We demonstrate how the complex logic of HAMAP rules allows for precise annotation of individual functional variants within large homologous protein families. We also describe improvements to our web-based tool HAMAP-Scan which simplify the classification and annotation of sequences, and the incorporation of an improved sequence-profile search algorithm.