992 resultados para Comparison of nucleotide sequences


Relevância:

100.00% 100.00%

Publicador:

Resumo:

We examine the occurrence of the ≈300 known protein folds in different groups of organisms. To do this, we characterize a large fraction of the currently known protein sequences (≈140,000) in structural terms, by matching them to known structures via sequence comparison (or by secondary-structure class prediction for those without structural homologues). Overall, we find that an appreciable fraction of the known folds are present in each of the major groups of organisms (e.g., bacteria and eukaryotes share 156 of 275 folds), and most of the common folds are associated with many families of nonhomologous sequences (i.e., >10 sequence families for each common fold). However, different groups of organisms have characteristically distinct distributions of folds. So, for instance, some of the most common folds in vertebrates, such as globins or zinc fingers, are rare or absent in bacteria. Many of these differences in fold usage are biologically reasonable, such as the folds of metabolic enzymes being common in bacteria and those associated with extracellular transport and communication being common in animals. They also have important implications for database-based methods for fold recognition, suggesting that an unknown sequence from a plant is more likely to have a certain fold (e.g., a TIM barrel) than an unknown sequence from an animal.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Many eubacterial DNA polymerases are bifunctional molecules having both polymerization (P) and 5′ nuclease (N) activities, which are contained in separable domains. We previously showed that the DNA polymerase I of Thermus aquaticus (TaqNP) endonucleolytically cleaves DNA substrates, releasing unpaired 5′ arms of bifurcated duplexes. Here, we compare the substrate specificities of TaqNP and the isolated 5′ nuclease domain of this enzyme, TaqN. Both enzymes are significantly activated by primer oligonucleotides that are hybridized to the 3′ arm of the bifurcation; optimal stimulation requires overlap of the 3′ terminal nucleotide of the primer with the terminal base pair of the duplex, but the terminal nucleotide need not hybridize to the complementary strand in the substrate. In the presence of Mn2+ ions, TaqN can cleave both RNA and circular DNA at structural bifurcations. Certain anti-TaqNP mAbs block cleavage by one or both enzymes, whereas others can stimulate cleavage of nonoptimal substrates.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

A large family of membrane channel proteins selective for transport of water (aquaporins) or water plus glycerol (aquaglyceroporins) has been found in diverse life forms. Escherichia coli has two members of this family—a water channel, AqpZ, and a glycerol facilitator, GlpF. Despite having similar primary amino acid sequences and predicted structures, the oligomeric state and solute selectivity of AqpZ and GlpF are disputed. Here we report biochemical and functional characterizations of affinity-purified GlpF and compare it to AqpZ. Histidine-tagged (His-GlpF) and hemagglutinin-tagged (HA-GlpF) polypeptides encoded by a bicistronic construct were expressed in bacteria. HA-GlpF and His-GlpF appear to form oligomers during Ni-nitrilotriacetate affinity purification. Sucrose gradient sedimentation analyses showed that the oligomeric state of octyl glucoside-solubilized GlpF varies: low ionic strength favors subunit dissociation, whereas Mg2+ stabilizes tetrameric assembly. Reconstitution of affinity-purified GlpF into proteoliposomes increases glycerol permeability more than 100-fold and water permeability up to 10-fold compared with control liposomes. Glycerol and water permeability of GlpF both occur with low Arrhenius activation energies and are reversibly inhibited by HgCl2. Our studies demonstrate that, unlike AqpZ, a water-selective stable tetramer, purified GlpF exists in multiple oligomeric forms under nondenaturing conditions and is highly permeable to glycerol but less well permeated by water.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Of the rules used by the splicing machinery to precisely determine intron–exon boundaries only a fraction is known. Recent evidence suggests that specific short sequences within exons help in defining these boundaries. Such sequences are known as exonic splicing enhancers (ESE). A possible bioinformatical approach to studying ESE sequences is to compare genes that harbor introns with genes that do not. For this purpose two non-redundant samples of 719 intron-containing and 63 intron-lacking human genes were created. We performed a statistical analysis on these datasets of intron-containing and intron-lacking human coding sequences and found a statistically significant difference (P = 0.01) between these samples in terms of 5–6mer oligonucleotide distributions. The difference is not created by a few strong signals present in the majority of exons, but rather by the accumulation of multiple weak signals through small variations in codon frequencies, codon biases and context-dependent codon biases between the samples. A list of putative novel human splicing regulation sequences has been elucidated by our analysis.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Estimation of evolutionary distances has always been a major issue in the study of molecular evolution because evolutionary distances are required for estimating the rate of evolution in a gene, the divergence dates between genes or organisms, and the relationships among genes or organisms. Other closely related issues are the estimation of the pattern of nucleotide substitution, the estimation of the degree of rate variation among sites in a DNA sequence, and statistical testing of the molecular clock hypothesis. Mathematical treatments of these problems are considerably simplified by the assumption of a stationary process in which the nucleotide compositions of the sequences under study have remained approximately constant over time, and there now exist fairly extensive studies of stationary models of nucleotide substitution, although some problems remain to be solved. Nonstationary models are much more complex, but significant progress has been recently made by the development of the paralinear and LogDet distances. This paper reviews recent studies on the above issues and reports results on correcting the estimation bias of evolutionary distances, the estimation of the pattern of nucleotide substitution, and the estimation of rate variation among the sites in a sequence.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The circumsporozoite (CS) protein of malaria parasites (Plasmodium) covers the surface of sporozoites that invade hepatocytes in mammalian hosts and macrophages in avian hosts. CS genes have been characterized from many Plasmodium that infect mammals; two domains of the corresponding proteins, identified initially by their conservation (region I and region II), have been implicated in binding to hepatocytes. The CS gene from the avian parasite Plasmodium gallinaceum was characterized to compare these functional domains to those of mammalian Plasmodium and for the study of Plasmodium evolution. The P. gallinaceum protein has the characteristics of CS proteins, including a secretory signal sequence, central repeat region, regions of charged amino acids, and an anchor sequence. Comparison with CS signal sequences reveals four distinct groupings, with P. gallinaceum most closely related to the human malaria Plasmodium falciparum. The 5-amino acid sequence designated region I, which is identical in all mammalian CS and implicated in hepatocyte invasion, is different in the avian protein. The P. gallinaceum repeat region consists of 9-amino acid repeats with the consensus sequence QP(A/V)GGNGG(A/V). The conserved motif designated region II-plus, which is associated with targeting the invasion of liver cells, is also conserved in the avian protein. Phylogenetic analysis of the aligned Plasmodium CS sequences yields a tree with a topology similar to the one obtained using sequence data from the small subunit rRNA gene. The phylogeny using the CS gene supports the proposal that the human malaria P. falciparum is significantly more related to avian parasites than to other parasites infecting mammals, although the biology of sporozoite invasion is different between the avian and mammalian species.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The recently sequenced genome of the parasitic bacterium Mycoplasma genitalium contains only 468 identified protein-coding genes that have been dubbed a minimal gene complement [Fraser, C.M., Gocayne, J.D., White, O., Adams, M.D., Clayton, R.A., et al. (1995) Science 270, 397-403]. Although the M. genitalium gene complement is indeed the smallest among known cellular life forms, there is no evidence that it is the minimal self-sufficient gene set. To derive such a set, we compared the 468 predicted M. genitalium protein sequences with the 1703 protein sequences encoded by the other completely sequenced small bacterial genome, that of Haemophilus influenzae. M. genitalium and H. influenzae belong to two ancient bacterial lineages, i.e., Gram-positive and Gram-negative bacteria, respectively. Therefore, the genes that are conserved in these two bacteria are almost certainly essential for cellular function. It is this category of genes that is most likely to approximate the minimal gene set. We found that 240 M. genitalium genes have orthologs among the genes of H. influenzae. This collection of genes falls short of comprising the minimal set as some enzymes responsible for intermediate steps in essential pathways are missing. The apparent reason for this is the phenomenon that we call nonorthologous gene displacement when the same function is fulfilled by nonorthologous proteins in two organisms. We identified 22 nonorthologous displacements and supplemented the set of orthologs with the respective M. genitalium genes. After examining the resulting list of 262 genes for possible functional redundancy and for the presence of apparently parasite-specific genes, 6 genes were removed. We suggest that the remaining 256 genes are close to the minimal gene set that is necessary and sufficient to sustain the existence of a modern-type cell. Most of the proteins encoded by the genes from the minimal set have eukaryotic or archaeal homologs but seven key proteins of DNA replication do not. We speculate that the last common ancestor of the three primary kingdoms had an RNA genome. Possibilities are explored to further reduce the minimal set to model a primitive cell that might have existed at a very early stage of life evolution.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Mutations at position C1054 of 16S rRNA have previously been shown to cause translational suppression in Escherichia coli. To examine the effects of similar mutations in a eukaryote, all three possible base substitutions and a base deletion were generated at the position of Saccharomyces cerevisiae 18S rRNA corresponding to E. coli C1054. In yeast, as in E. coli, both C1054A (rdn-1A) and C1054G (rdn-1G) caused dominant nonsense suppression. Yeast C1054U (rdn-1T) was a recessive antisuppressor, while yeast C1054-delta (rdn-1delta) led to recessive lethality. Both C1054U and two previously described yeast 18S rRNA antisuppressor mutations, G517A (rdn-2) and U912C (rdn-4), inhibited codon-nonspecific suppression caused by mutations in eukaryotic release factors, sup45 and sup35. However, among these only C1054U inhibited UAA-specific suppressions caused by a UAA-decoding mutant tRNA-Gln (SLT3). Our data implicate eukaryotic C1054 in translational termination, thus suggesting that its function is conserved throughout evolution despite the divergence of nearby nucleotide sequences.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Background: Determination of the subcellular location of a protein is essential to understanding its biochemical function. This information can provide insight into the function of hypothetical or novel proteins. These data are difficult to obtain experimentally but have become especially important since many whole genome sequencing projects have been finished and many resulting protein sequences are still lacking detailed functional information. In order to address this paucity of data, many computational prediction methods have been developed. However, these methods have varying levels of accuracy and perform differently based on the sequences that are presented to the underlying algorithm. It is therefore useful to compare these methods and monitor their performance. Results: In order to perform a comprehensive survey of prediction methods, we selected only methods that accepted large batches of protein sequences, were publicly available, and were able to predict localization to at least nine of the major subcellular locations (nucleus, cytosol, mitochondrion, extracellular region, plasma membrane, Golgi apparatus, endoplasmic reticulum (ER), peroxisome, and lysosome). The selected methods were CELLO, MultiLoc, Proteome Analyst, pTarget and WoLF PSORT. These methods were evaluated using 3763 mouse proteins from SwissProt that represent the source of the training sets used in development of the individual methods. In addition, an independent evaluation set of 2145 mouse proteins from LOCATE with a bias towards the subcellular localization underrepresented in SwissProt was used. The sensitivity and specificity were calculated for each method and compared to a theoretical value based on what might be observed by random chance. Conclusion: No individual method had a sufficient level of sensitivity across both evaluation sets that would enable reliable application to hypothetical proteins. All methods showed lower performance on the LOCATE dataset and variable performance on individual subcellular localizations was observed. Proteins localized to the secretory pathway were the most difficult to predict, while nuclear and extracellular proteins were predicted with the highest sensitivity.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The phosphodiesterase 4 (PDE4) family are cAMP specific phosphodiesterases that play an important role in the inflammatory response and is the major PDE type found in inflammatory cells. A significant number of PDE4 specific inhibitors have been developed and are currently being investigated for use as therapeutic agents. Apremilast, a small molecule inhibitor of PDE 4 is in development for chronic inflammatory disorders and has shown promise for the treatment of psoriasis, psoriatic arthritis as well as other inflammatory diseases. It has been found to be safe and well tolerated in humans and in March 2014 it was approved by the US food and drug administration for the treatment of adult patients with active psoriatic arthritis. The only other PDE4 inhibitor on the market is Roflumilast and it is used for treatment of respiratory disease. Roflumilast is approved in the EU for the treatment of COPD and was recently approved in the US for treatment to reduce the risk of COPD exacerbations. Roflumilast is also a selective PDE4 inhibitor, administered as an oral tablet once daily, and is thought to act by increasing cAMP within lung cells. As both (Apremilast and Roflumilast) compounds selectively inhibit PDE4 but are targeted at different diseases, there is a need for a clear understanding of their mechanism of action (MOA). Differences and similarity of MOA should be defined for the purposes of labelling, for communication to the scientific community, physicians, and patients, and for an extension of utility to other diseases and therapeutic areas. In order to obtain a complete comparative picture of the MOA of both inhibitors, additional molecular and cellular biology studies are required to more fully elucidate the signalling mediators downstream of PDE4 inhibition which result in alterations in pro- and anti-inflammatory gene expression. My studies were conducted to directly compare Apremilast with Roflumilast, in order to substantiate the differences observed in the molecular and cellular effects of these compounds, and to search for other possible differentiating effects. Therefore the main aim of this thesis was to utilise cutting-edge biochemical techniques to discover whether Apremilast and Roflumilast work with different modes of action. In the first part of my thesis I used novel genetically encoded FRET based cAMP sensors targeted to different intracellular compartments, in order to monitor cAMP levels within specific microdomains of cells as a consequence of challenge with Apremilast and Roflumilast, which revealed that Apremilast and Roflumilast do regulate different pools of cAMP in cells. In the second part of my thesis I focussed on assessing whether Apremilast and Roflumilast cause differential effects on the PKA phosphorylation state of proteins in cells. I used various biochemical techniques (Western blotting, Substrate kinase arrays and Reverse Phase Protein array and found that Apremilast and Roflumilast do lead to differential PKA substrate phosphorylation. For example I found that Apremilast increases the phosphorylation of Ribosomal Protein S6 at Ser240/244 and Fyn Y530 in the S6 Ribosomal pathway of Rheumatoid Arthritis Synovial fibroblast and HEK293 cells, whereas Roflumilast does not. This data suggests that Apremilast has distinct biological effects from that of Roflumilast and could represent a new therapeutic role for Apremilast in other diseases. In the final part of my thesis, Phage display technology was employed in order to identify any novel binding motifs that associate with PDE4 and to identify sequences that were differentially regulated by the inhibitors in an attempt to find binding motifs that may exist in previously characterised signalling proteins. Petide array technology was then used to confirm binding of specific peptide sequences or motifs. Results showed that Apremilast and Roflumilast can either enhance or decrease the binding of PDE4A4 to specific peptide sequences or motifs that are found in a variety of proteins in the human proteome, most interestingly Ubiquitin-related proteins. The data from this chapter is preliminary but may be used in the discovery of novel binding partners for PDE4 or to provide a new role for PDE inhibition in disease. Therefore the work in this thesis provides a unique snapshot of the complexity of the cAMP signalling system and is the first to directly compare action of the two approved PDE4 inhibitors in a detailed way.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Sequences of timestamped events are currently being generated across nearly every domain of data analytics, from e-commerce web logging to electronic health records used by doctors and medical researchers. Every day, this data type is reviewed by humans who apply statistical tests, hoping to learn everything they can about how these processes work, why they break, and how they can be improved upon. To further uncover how these processes work the way they do, researchers often compare two groups, or cohorts, of event sequences to find the differences and similarities between outcomes and processes. With temporal event sequence data, this task is complex because of the variety of ways single events and sequences of events can differ between the two cohorts of records: the structure of the event sequences (e.g., event order, co-occurring events, or frequencies of events), the attributes about the events and records (e.g., gender of a patient), or metrics about the timestamps themselves (e.g., duration of an event). Running statistical tests to cover all these cases and determining which results are significant becomes cumbersome. Current visual analytics tools for comparing groups of event sequences emphasize a purely statistical or purely visual approach for comparison. Visual analytics tools leverage humans' ability to easily see patterns and anomalies that they were not expecting, but is limited by uncertainty in findings. Statistical tools emphasize finding significant differences in the data, but often requires researchers have a concrete question and doesn't facilitate more general exploration of the data. Combining visual analytics tools with statistical methods leverages the benefits of both approaches for quicker and easier insight discovery. Integrating statistics into a visualization tool presents many challenges on the frontend (e.g., displaying the results of many different metrics concisely) and in the backend (e.g., scalability challenges with running various metrics on multi-dimensional data at once). I begin by exploring the problem of comparing cohorts of event sequences and understanding the questions that analysts commonly ask in this task. From there, I demonstrate that combining automated statistics with an interactive user interface amplifies the benefits of both types of tools, thereby enabling analysts to conduct quicker and easier data exploration, hypothesis generation, and insight discovery. The direct contributions of this dissertation are: (1) a taxonomy of metrics for comparing cohorts of temporal event sequences, (2) a statistical framework for exploratory data analysis with a method I refer to as high-volume hypothesis testing (HVHT), (3) a family of visualizations and guidelines for interaction techniques that are useful for understanding and parsing the results, and (4) a user study, five long-term case studies, and five short-term case studies which demonstrate the utility and impact of these methods in various domains: four in the medical domain, one in web log analysis, two in education, and one each in social networks, sports analytics, and security. My dissertation contributes an understanding of how cohorts of temporal event sequences are commonly compared and the difficulties associated with applying and parsing the results of these metrics. It also contributes a set of visualizations, algorithms, and design guidelines for balancing automated statistics with user-driven analysis to guide users to significant, distinguishing features between cohorts. This work opens avenues for future research in comparing two or more groups of temporal event sequences, opening traditional machine learning and data mining techniques to user interaction, and extending the principles found in this dissertation to data types beyond temporal event sequences.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Different selection objectives within the Quarter Horse breed led to the formation of groups with distinct skills, including the racing and cutting lines. With a smaller population size in Brazil, but of great economic representativeness, the racing line is characterized by animals that can reach high speeds over short distances and within a short period of time. The cutting line is destined for functional tests, exploring skills such as agility and obedience. Although the athletic performance of horses is likely to be influenced by a large number of genes, few genetic variants have so far been related to this trait and this was done exclusively in Thoroughbreds, including the g.38973231G>A singlenucleotide polymorphism in the PDK4 gene and the g.22684390C>T single-nucleotide polymorphism in the COX4I2 gene. The results of the present study demonstrate the presence of polymorphic PDK4 and COX4I2 genes in Quarter Horses. The analysis of 296 racing animals and 68 cutting animals revealed significant differences in allele and genotype frequencies between the two lines. The same was not observed when these frequencies were compared between extreme racing performance phenotypes. There were also no significant associations between alleles of the two polymorphisms and the speed index. These results suggest that the alleles of the PDK4 and COX4I2 genes, which are related to better racecourse performance in Thoroughbreds, are probably associated with beneficial adaptations in aerobic metabolism and therefore play secondary roles in sprint racing performance in Quarter Horses, which is mainly anaerobic.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The aim of this investigation was to compare the skeletal stability of three different rigid fixation methods after mandibular advancement. Fifty-five class II malocclusion patients treated with the use of bilateral sagittal split ramus osteotomy and mandibular advancement were selected for this retrospective study. Group 1 (n = 17) had miniplates with monocortical screws, Group 2 (n = 16) had bicortical screws and Group 3 (n = 22) had the osteotomy fixed by means of the hybrid technique. Cephalograms were taken preoperatively, 1 week within the postoperative care period, and 6 months after the orthognathic surgery. Linear and angular changes of the cephalometric landmarks of the chin region were measured at each period, and the changes at each cephalometric landmark were determined for the time gaps. Postoperative changes in the mandibular shape were analyzed to determine the stability of fixation methods. There was minimum difference in the relapse of the mandibular advancement among the three groups. Statistical analysis showed no significant difference in postoperative stability. However, a positive correlation between the amount of advancement and the amount of postoperative relapse was demonstrated by the linear multiple regression test (p < 0.05). It can be concluded that all techniques can be used to obtain stable postoperative results in mandibular advancement after 6 months.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Cardiac arrest during heart surgery is a common procedure and allows the surgeon to perform surgical procedures in an environment free of blood and movement. Using a model of isolated rat heart, the authors compare a new cardioplegic solution containing histidine-tryptophan-glutamate (group 2) with the histidine-tryptophan-alphacetoglutarate (group 1) routinely used by some cardiac surgeons. To assess caspase, IL-8 and KI-67 in isolated rat hearts using immunohistochemistry. 20 Wistar male rats were anesthetized and heparinized. The chest was opened, cardioctomy was performed and 40 ml/kg of the appropriate cardioplegic solution was infused. The hearts were kept for 2 hours at 4ºC in the same solution, and thereafter, placed in the Langendorff apparatus for 30 minutes with Ringer-Locke solution. Immunohistochemistry analysis of caspase, IL-8, and KI-67 were performed. The concentration of caspase was lower in group 2 and Ki-67 was higher in group 2, both P<0.05. There was no statistical difference between the values of IL-8 between the groups. Histidine-tryptophan-glutamate solution was better than histidine-tryptophan-alphacetoglutarate solution because it reduced caspase (apoptosis), increased KI-67 (cell proliferation), and showed no difference in IL-8 levels compared to group 1. This suggests that the histidine-tryptophan-glutamate solution was more efficient than the histidine-tryptophan-alphacetoglutarate for the preservation of hearts of rat cardiomyocytes.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The objective of this study was to analyze the prevalence of diabetes in older people and the adopted control measures. Data regarding older diabetic individuals who participated in the Health Surveys conducted in the Municipality of Sao Paulo, SP, ISA-Capital, in 2003 and 2008, which were cross-sectional studies, were analyzed. Prevalences and confidence intervals were compared between 2003 and 2008, according to sociodemographic variables. The combination of the databases was performed when the confidence intervals overlapped. The Chi-square (level of significance of 5%) and the Pearson's Chi-square (Rao-Scott) tests were performed. The variables without overlap between the confidence intervals were not tested. The age of the older adults was 60-69 years. The majority were women, Caucasian, with an income of between > 0.5 and 2.5 times the minimum salary and low levels of schooling. The prevalence of diabetes was 17.6% (95%CI 14.9;20.6) in 2003 and 20.1% (95%CI 17.3;23.1) in 2008, which indicates a growth over this period (p at the limit of significance). The most prevalent measure adopted by the older adults to control diabetes was hypoglycemic agents, followed by diet. Physical activity was not frequent, despite the significant differences observed between 2003 and 2008 results. The use of public health services to control diabetes was significantly higher in older individuals with lower income and lower levels of education. Diabetes is a complex and challenging disease for patients and the health systems. Measures that encourage health promotion practices are necessary because they presented a smaller proportion than the use of hypoglycemic agents. Public health policies should be implemented, and aimed mainly at older individuals with low income and schooling levels. These changes are essential to improve the health condition of older diabetic patients.