935 results for RNA analysis


Relevance:

30.00% 30.00%

Publisher:

Abstract:

ABSTRACT: Currently, the only chance of cure for patients with pancreatic adenocarcinoma (PDAC) is surgical resection. At the beginning of this thesis, we asked whether the classical clinicopathologic predictors of outcome could be validated in a large cohort of patients with early-stage pancreatic cancer, and whether other clinical predictors could have a role in deciding which patients would benefit from surgery. In chapter 2, we found that up to 30% of patients die within the first year after curative-intent surgery for pancreatic adenocarcinoma. We aimed to determine pre-operative factors that correlate with early mortality following resection for pancreatic cancer using a statistically validated tool, the Charlson-Age Comorbidity Index (CACI). We found that a CACI score greater than 4 was predictive of increased length of stay (p<0.001), post-operative complications (p=0.042), and mortality within 1 year of pancreatic resection (p<0.001). A CACI score of 6 or greater increased the odds of death within the first year 3-fold. Patients with a high CACI score have less than a 50% likelihood of being alive 1 year after surgery. In chapter 3, we aimed to identify a surface protein that correlates with patient outcome and distinguishes sub-groups of patients according to their molecular differences, and asked whether this protein could be a cancer stem cell marker.
The most abundant class of circulating tumor cells identified in our previous work was found to have biphenotypic features of epithelial-to-mesenchymal transition, enrichment for stem-cell-associated genes (ALDH1A1/ALDH1A2 and KLF4), and overexpression of extracellular matrix genes (collagens, SPARC, and DCN) normally found in the stromal microenvironment of PDAC primary tumors. Upon evaluation of matched primary tumors with RNA-ISH, many of the genes identified were found to co-localize in a sub-population of cells at the basal region of malignant pancreatic ducts. In addition, these cells expressed the neuroendocrine marker SV2A and the stem cell marker ALDH1A1/2. Compared with patients with SV2-negative tumors, patients with SV2-positive tumors were more likely to present with lower CA 19-9 (69% vs. 52%, p = 0.012), bigger tumors (size > 4 cm, 23% vs. 10%, p = 0.0430), less nodal involvement (69% vs. 86%, p = 0.005) and lower histologic grade (69% vs. 57%, p = 0.047). The presence of SV2A-expressing cells was associated with improved disease-free survival (HR: 0.49, p = 0.009) and overall survival (HR: 0.54, p = 0.018), and correlated linearly with ALDH1A2. Together, this information points to two different sub-types of pancreatic adenocarcinoma; these sub-types correlated with patients' outcome and were defined by the presence of an SV2A/ALDH1A1/2-expressing clone with neuroendocrine features. In chapter 4, SV2A expression in pancreatic cancer was validated in primary cell lines. We were able to demonstrate pancreatic adenocarcinoma heterogeneity according to neuroendocrine clonal features. When comparing SV2-expressing cell lines with SV2-negative cell lines, we found that SV2+ cell lines were more differentiated and differed from SV2-negative cell lines in KRAS mutation, proliferation and response to chemotherapy. In chapter 5, we aimed to determine whether this SV2-positive clone could explain the chemoresistance observed in patients.
We found an absolute increase in SV2A-expressing cells, with multiple lines of evidence, in patients, primary cell lines and xenografts. Although we have been able to show evidence that pancreatic adenocarcinoma is a heterogeneous disease, our findings warrant further investigation. First, to further characterize SV2A-expressing clones after treatment with neoadjuvant chemotherapy in our cohort, we have performed laser capture microdissection of the paraffin-embedded tissue in this study and will analyze the tissue for known genetic mutations in pancreatic adenocarcinoma. Secondly, we want to know what happens after knocking down SV2A expression in our cell lines followed by treatment with gemcitabine, to determine whether SV2A is functionally important. Finally, since our previous efforts with a promoter-reporter and SmartFlare™ have failed, we will use a novel PrimeFlow™ RNA-ISH assay followed by FACS and RNA sequencing to further characterize this cellular clone. Overall, our data demonstrate, with multiple lines of evidence, that pancreatic adenocarcinoma is a heterogeneous disease, defined by a clone of SV2A-expressing cells with neuroendocrine features. The presence of this clone in patients' tissue correlates with disease-free survival and overall survival. Together with patterns of proliferation and ALDH1A1/2 co-expression, this clone seems to present stem-cell-like behavior and is associated with chemoresistance, since it increases after chemotherapy, both in patients and in primary cell lines.

Relevance:

30.00% 30.00%

Publisher:

Abstract:

Molecular monitoring of BCR/ABL transcripts by real-time quantitative reverse transcription PCR (qRT-PCR) is an essential technique for clinical management of patients with BCR/ABL-positive CML and ALL. Though quantitative BCR/ABL assays are performed in hundreds of laboratories worldwide, results among these laboratories cannot be reliably compared due to heterogeneity in test methods, data analysis, reporting, and lack of quantitative standards. Recent efforts towards standardization have been limited in scope. Aliquots of RNA were sent to clinical test centers worldwide in order to evaluate methods and reporting for e1a2, b2a2, and b3a2 transcript levels using their own qRT-PCR assays. Total RNA was isolated from tissue culture cells that expressed each of the different BCR/ABL transcripts. Serial log dilutions were prepared, ranging from 10⁰ to 10⁻⁵, in RNA isolated from HL60 cells. Laboratories performed 5 independent qRT-PCR reactions for each sample type at each dilution. In addition, 15 qRT-PCR reactions of the 10⁻³ b3a2 RNA dilution were run to assess reproducibility within and between laboratories. Participants were asked to run the samples following their standard protocols and to report cycle threshold (Ct), quantitative values for BCR/ABL and housekeeping genes, and ratios of BCR/ABL to housekeeping genes for each sample RNA. Thirty-seven (n=37) participants have submitted qRT-PCR results for analysis (36, 37, and 34 labs generated data for b2a2, b3a2, and e1a2, respectively). The limit of detection for this study was defined as the lowest dilution at which a Ct value could be detected for all 5 replicates. For b2a2, 15, 16, 4, and 1 lab(s) showed a limit of detection at the 10⁻⁵, 10⁻⁴, 10⁻³, and 10⁻² dilutions, respectively. For b3a2, 20, 13, and 4 labs showed a limit of detection at the 10⁻⁵, 10⁻⁴, and 10⁻³ dilutions, respectively. For e1a2, 10, 21, 2, and 1 lab(s) showed a limit of detection at the 10⁻⁵, 10⁻⁴, 10⁻³, and 10⁻² dilutions, respectively.
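The limit-of-detection rule described above (the most dilute sample in which all replicates still yield a Ct) can be sketched as follows. This is an illustration only: the function name, the data layout (log10 dilution exponent mapped to replicate Ct values, with `None` for "no Ct detected") and the example numbers are assumptions, not material from the study.

```python
from typing import Dict, List, Optional


def limit_of_detection(
    ct_by_dilution: Dict[int, List[Optional[float]]]
) -> Optional[int]:
    """Return the log10 exponent of the most dilute sample with a Ct in every replicate.

    Keys are log10 dilution exponents (0 for undiluted, -5 for 10^-5).
    Values are replicate Ct values; None marks a replicate with no Ct.
    Returns None when no dilution has a Ct in all replicates.
    """
    fully_detected = [
        exponent
        for exponent, cts in ct_by_dilution.items()
        if cts and all(ct is not None for ct in cts)
    ]
    return min(fully_detected) if fully_detected else None


# Hypothetical Ct values for one laboratory and one transcript (5 replicates each).
example_cts = {
    0: [18.1, 18.0, 18.2, 18.1, 18.0],
    -1: [21.5, 21.4, 21.6, 21.5, 21.3],
    -2: [24.9, 25.0, 24.8, 25.1, 24.9],
    -3: [28.3, 28.5, 28.2, 28.4, 28.6],
    -4: [31.9, 32.2, 32.0, 31.8, 32.1],
    -5: [35.4, None, 35.9, None, 36.1],  # two replicates failed
}

print(limit_of_detection(example_cts))  # -4, i.e. the 10^-4 dilution
```

In this hypothetical case the 10⁻⁵ dilution is excluded because only 3 of 5 replicates produced a Ct, so the laboratory's limit of detection is reported at 10⁻⁴.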
Log %BCR/ABL ratio values provided a method for comparing results between the different laboratories for each BCR/ABL dilution series. Linear regression analysis revealed concordance among the majority of participant data over the 10⁻¹ to 10⁻⁴ dilutions. The overall slope values showed comparable results among the majority of b2a2 (mean=0.939; median=0.9627; range 0.399 to 1.1872), b3a2 (mean=0.925; median=0.922; range 0.625 to 1.140), and e1a2 (mean=0.897; median=0.909; range 0.5174 to 1.138) laboratory results (Fig. 1-3). Thirty-four (n=34) out of the 37 laboratories reported Ct values for all 15 replicates, and only those with a complete data set were included in the inter-lab calculations. Eleven laboratories either did not report their copy number data or used other reporting units such as nanograms or cell numbers; therefore, only 26 laboratories were included in the overall analysis of copy numbers. The median copy number was 348.4, with a range from 15.6 to 547,000 copies (approximately a 4.5 log difference); the median intra-lab %CV was 19.2%, with a range from 4.2% to 82.6%. While our international performance evaluation using serially diluted RNA samples has reinforced the fact that heterogeneity exists among clinical laboratories, it has also demonstrated that performance within a laboratory is overall very consistent. Accordingly, the availability of defined BCR/ABL RNAs may facilitate the validation of all phases of quantitative BCR/ABL analysis and may be extremely useful as a tool for monitoring assay performance. Ongoing analyses of these materials, along with the development of additional control materials, may solidify consensus around their application in routine laboratory testing and possible integration in worldwide efforts to standardize quantitative BCR/ABL testing.
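The two summary statistics used above, the regression slope of log %BCR/ABL ratio against log dilution and the intra-laboratory %CV, can be sketched as below. The function names and the example values are hypothetical, and the abstract does not say which replicate quantity the %CV was computed on, so the input here is simply "replicate values".

```python
import math
import statistics
from typing import Sequence


def regression_slope(x: Sequence[float], y: Sequence[float]) -> float:
    """Ordinary least-squares slope of y on x."""
    mean_x = statistics.fmean(x)
    mean_y = statistics.fmean(y)
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    return sxy / sxx


def percent_cv(values: Sequence[float]) -> float:
    """Coefficient of variation in percent (sample standard deviation / mean)."""
    return 100.0 * statistics.stdev(values) / statistics.fmean(values)


# Hypothetical laboratory whose reported %BCR/ABL ratio tracks the dilution exactly.
log_dilution = [-1.0, -2.0, -3.0, -4.0]
percent_ratio = [10.0, 1.0, 0.1, 0.01]
log_ratio = [math.log10(r) for r in percent_ratio]

slope = regression_slope(log_dilution, log_ratio)
print(round(slope, 3))  # a slope near 1 means the assay responds linearly to dilution

# Hypothetical replicate measurements from one laboratory.
replicates = [90.0, 100.0, 110.0]
print(round(percent_cv(replicates), 1))
```

A slope close to 1 on the log-log scale indicates that each tenfold dilution lowers the reported ratio tenfold, which is why the slope values quoted above serve as a cross-laboratory comparison.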

Relevance:

30.00% 30.00%

Publisher:

Abstract:

The ability of Mycobacterium tuberculosis to establish a latent infection (LTBI) in humans confounds the treatment of tuberculosis. Consequently, there is a need to discover new therapeutic agents that can kill M. tuberculosis both during active disease and LTBI. The streptomycin-dependent strain of M. tuberculosis, 18b, provides a useful tool for this purpose since upon removal of streptomycin (STR) it enters a non-replicating state that mimics latency both in vitro and in animal models. The 4.41 Mb genome sequence of M. tuberculosis 18b was determined and this revealed the strain to belong to clade 3 of the ancient ancestral lineage of the Beijing family. STR-dependence was attributable to insertion of a single cytosine in the 530 loop of the 16S rRNA and to a single amino acid insertion in the N-terminal domain of initiation factor 3. RNA-seq was used to understand the genetic programme activated upon STR-withdrawal and hence to gain insight into LTBI. This revealed reconfiguration of gene expression and metabolic pathways showing strong similarities between non-replicating 18b and M. tuberculosis residing within macrophages, and with the core stationary phase and microaerophilic responses. The findings of this investigation confirm the validity of 18b as a model for LTBI, and provide insight into both the evolution of tubercle bacilli and the functioning of the ribosome.

Relevance:

30.00% 30.00%

Publisher:

Abstract:

Proteolytic processing of the CUX1 transcription factor generates an isoform, p110, that accelerates entry into S phase. To identify targets of p110 CUX1 that are involved in cell cycle progression, we performed genome-wide location analysis using a promoter microarray. Since there are no antibodies that specifically recognize p110 but not the full-length protein, we expressed physiological levels of a p110 isoform with two tags and purified chromatin by tandem affinity purification (ChAP). Conventional ChIP performed on synchronized populations of cells confirmed that p110 CUX1 is recruited to the promoter of cell cycle-related targets preferentially during S phase. Multiple approaches, including small interfering RNA (siRNA), transient infection with retroviral vectors, constitutive expression and reporter assays, demonstrated that most cell cycle targets are activated, whereas a few are repressed or not affected by p110 CUX1. Functional classes that were over-represented among targets included DNA replication initiation. Consistent with this finding, constitutive expression of p110 CUX1 led to a premature and more robust induction of replication genes during cell cycle progression, and stimulated the long-term replication of a plasmid bearing the oriP replicator of Epstein-Barr virus (EBV).

Relevance:

30.00% 30.00%

Publisher:

Abstract:

Determining the tertiary structure of the ribosome was an important step toward understanding the mechanism of protein synthesis. However, elucidating the structure of the ribosome does not in itself provide an understanding of its function. To better understand the relationship between the structure and the function of the ribosome, its structure must be studied systematically. In recent years, we have undertaken a systematic effort to identify and characterize new structural motifs that exist in the ribosome and in other RNA-containing molecules. Analysis of several examples of the packing of two RNA helices in the ribosome structure allowed us to identify a new structural motif, named "G-ribo". In this motif, the interaction of a guanosine in one helix with the ribose of a nucleotide in another helix gives rise to a complex network of interactions between neighboring nucleotides. The G-ribo motif is found at 8 locations in the ribosome structure. The G-ribo structure has particular features that allow it to promote the formation of a certain type of pseudoknot in the ribosome. Systematic analysis of the structures of the ribosome and of RNase P identified another structural motif, named "DTJ" or "Double-Twist Joint motif". This motif is formed by three short helices that stack on one another. In the contact zone between each pair of helices, two consecutive base pairs are overtwisted relative to two consecutive base pairs found in A-form RNA. One nucleotide of a base pair is always directly connected to a nucleotide of the overtwisted base pair, whereas the opposite nucleotides are connected by one or more unpaired nucleotides.
Introducing overtwisting between two consecutive base pairs breaks the stacking between the nucleotides and destabilizes the RNA helix. In the DTJ motif, the unpaired nucleotides that link the two overtwisted base pairs interact with one of the three helices forming the motif, which provides an elegant strategy for stabilizing the arrangement. To determine the sequence constraints imposed on the tertiary structure of a recurrent motif in the ribosome, we developed a new experimental approach. We introduced combinatorial libraries at selected nucleotides found in particular motifs of the ribosome. By analyzing the alternative sequences selected in vivo for different instances of a motif, we were able to identify the constraints responsible for the integrity of a motif and those responsible for interactions with the elements that form the structural context of the motif. The results presented in this thesis considerably broaden our understanding of the principles of RNA structure formation and provide a new way of identifying and characterizing new RNA structural motifs.

Relevance:

30.00% 30.00%

Publisher:

Abstract:

Several G protein-coupled receptors (GPCRs) have recently been characterized at intracellular membranes, including the nuclear membrane. Our objective was to determine whether β-adrenergic receptor (βAR) subtypes and their signaling machinery are functional and localized at the nuclear membrane of cardiomyocytes. We demonstrated the presence of β1AR and β3AR, but not β2AR, at the nuclear membrane of adult ventricular myocytes by immunoblotting, confocal microscopy, and functional assays. In addition, certain signaling partners, such as the GαS and Gαi proteins and adenylyl cyclase II and V/VI, were also localized there. The nuclear βAR subtypes were functional, since they could bind their ligands and activate their effectors. Using isolated nuclei, we observed that the non-selective agonist isoproterenol (ISO) and BRL37344, a β3AR-selective ligand, stimulated the initiation of RNA synthesis, unlike the β1AR-selective agonist xamoterol. This synthesis was abolished by pertussis toxin (PTX). In contrast, stimulation of nuclear type B endothelin receptors (ETB) reduced the initiation of RNA synthesis. The signaling pathways involved in the regulation of RNA synthesis by GPCRs were then studied using isolated nuclei stimulated with agonists in the presence or absence of various inhibitors of the MAP kinase (mitogen-activated protein kinase) pathways and of the PI3K/PKB pathway. Proteins involved in the p38, JNK, ERK MAP kinase and PKB signaling pathways were present in the isolated nuclei. Inhibition of PKB by triciribine inhibited RNA synthesis. We then showed by qPCR that ISO stimulation increased the level of 18S rRNA and decreased the expression of NFκB mRNA.
In contrast, ET-1 had no effect on the expression level of 18S rRNA. We then showed that ISO stimulation reduced the expression of several genes involved in NFκB activation, whereas inhibition of ERK1/2 and PKB reversed this effect. A global microarray then allowed us to demonstrate that nuclear βARs and ETRs regulate a large number of distinct genes. Nuclear βARs and ETRs also increased NO production in isolated nuclei, which could be inhibited by L-NAME. These results were confirmed in intact cardiomyocytes using caged, cell-permeable analogues of ISO and ET-1: the increase in nuclear NO detected by DAF2-DA, caused by ET-1 and ISO, could be prevented by L-NAME. Finally, the ISO-induced increase in transcription initiation was also blocked by L-NAME or by a PKG inhibitor, KT5823, suggesting that the NO-GC-PKG pathway is involved in the regulation of transcription by βARs. In conclusion, nuclear βARs and ETRs use different signaling pathways and thereby exert distinct effects on cardiac gene expression. They therefore represent a promising avenue for drug development.

Relevance:

30.00% 30.00%

Publisher:

Abstract:

The role of the two universal reverse-Hoogsteen U:A base pairs (RHUAs) present in standard tRNAs, one in the T-loop and the other in the core of the L-shape, was studied. For each RHUA, a specialized in vivo genetic screen in bacteria, the amber suppressor system (for the RHUA in the T-loop) and the selenocysteine tRNA (tRNASec) system (for the RHUA in the core), was used to generate functional variants from multiple combinatorial libraries. These variants were then sequenced and subjected to a systematic analysis that included computer modeling and a type of phylogenetic analysis. The results from the amber suppressor system revealed a set of functional variants that do not require the RHUA motif in the T-loop and that replaced the standard mode of interaction between the D- and T-loops with an inter-loop double helix (ILDH). Further studies led to an in silico model of this alternative to the standard T-loop, named type III. The results from the tRNASec system revealed that, for this exceptional tRNA, the absence of the RHUA (in the core) provides increased flexibility that is specifically required for tRNASec function. Thus standard tRNAs, unlike tRNASec, with the universal presence of the RHUA in the core, have been naturally selected to be rigid. Taken together, the RHUA plays an essential role in stabilizing tertiary interactions.

Relevance:

30.00% 30.00%

Publisher:

Abstract:

Computational biology is the research area that contributes to the analysis of biological data through the development of algorithms that address significant research problems. Data from molecular biology include DNA, RNA, protein and gene expression data. Gene expression data provide the expression level of genes under different conditions. Gene expression is the process of transcribing the DNA sequence of a gene into mRNA sequences, which are in turn translated into proteins. The number of copies of mRNA produced is called the expression level of a gene. Gene expression data are organized in the form of a matrix: rows represent genes and columns represent experimental conditions. Experimental conditions can be different tissue types or time points. Entries in the gene expression matrix are real values. Through the analysis of gene expression data it is possible to determine the behavioral patterns of genes, such as the similarity of their behavior, the nature of their interactions, their respective contributions to the same pathways, and so on. Genes participating in the same biological process exhibit similar expression patterns. These patterns have immense relevance and application in bioinformatics and clinical research. In the medical domain they are used to aid more accurate diagnosis, prognosis, treatment planning, drug discovery and protein network analysis. To identify various patterns from gene expression data, data mining techniques are essential. Clustering is an important data mining technique for the analysis of gene expression data. To overcome the problems associated with clustering, biclustering was introduced. Biclustering refers to the simultaneous clustering of both rows and columns of a data matrix.
Clustering is a global model, whereas biclustering is a local model. Discovering local expression patterns is essential for identifying many genetic pathways that are not apparent otherwise. It is therefore necessary to move beyond the clustering paradigm towards approaches that are capable of discovering local patterns in gene expression data. A bicluster is a submatrix of the gene expression data matrix. The rows and columns in the submatrix need not be contiguous in the gene expression data matrix. Biclusters need not be disjoint. Computing biclusters is costly because one has to consider all combinations of columns and rows in order to find all the biclusters. The search space for the biclustering problem is 2^(m+n), where m and n are the number of genes and conditions respectively. Usually m+n is more than 3000. The biclustering problem is NP-hard. Biclustering is a powerful analytical tool for the biologist. The research reported in this thesis addresses the problem of biclustering. Ten algorithms are developed for the identification of coherent biclusters from gene expression data. All these algorithms use a measure called mean squared residue (MSR) to search for biclusters. The objective is to identify biclusters of maximum size with a mean squared residue lower than a given threshold.
All these algorithms begin the search from tightly co-regulated submatrices called seeds. These seeds are generated by the K-Means clustering algorithm. The algorithms developed can be classified as constraint-based, greedy and metaheuristic. Constraint-based algorithms use one or more constraints, namely the MSR threshold and the MSR difference threshold. The greedy approach makes a locally optimal choice at each stage with the objective of finding the global optimum. In the metaheuristic approaches, Particle Swarm Optimization (PSO) and variants of the Greedy Randomized Adaptive Search Procedure (GRASP) are used for the identification of biclusters. These algorithms are run on the Yeast and Lymphoma datasets. All these algorithms identify biologically relevant and statistically significant biclusters, which are validated using the Gene Ontology database. All these algorithms are compared with some other biclustering algorithms. The algorithms developed in this work overcome some of the problems associated with existing algorithms. With the help of some of the algorithms developed in this work, biclusters with very high row variance, higher than the row variance obtained by any other algorithm using mean squared residue, are identified from both the Yeast and Lymphoma datasets. Such biclusters, which show significant changes in expression level, are highly relevant biologically.
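The mean squared residue that these algorithms minimize can be illustrated with a short sketch. It uses the standard Cheng and Church definition (each entry minus its row mean, minus its column mean, plus the submatrix mean, squared and averaged) and assumes that this is the variant used in the thesis. The function name and the toy matrix are hypothetical.

```python
from typing import Sequence


def mean_squared_residue(
    matrix: Sequence[Sequence[float]],
    rows: Sequence[int],
    cols: Sequence[int],
) -> float:
    """Mean squared residue of the bicluster selected by `rows` and `cols`.

    A score of 0 means the submatrix is perfectly coherent: every row
    equals every other row up to an additive shift.
    """
    sub = [[matrix[i][j] for j in cols] for i in rows]
    n_rows, n_cols = len(sub), len(sub[0])

    row_means = [sum(row) / n_cols for row in sub]
    col_means = [sum(sub[i][j] for i in range(n_rows)) / n_rows for j in range(n_cols)]
    overall_mean = sum(row_means) / n_rows

    total = 0.0
    for i in range(n_rows):
        for j in range(n_cols):
            residue = sub[i][j] - row_means[i] - col_means[j] + overall_mean
            total += residue * residue
    return total / (n_rows * n_cols)


# Toy expression matrix: genes in rows, conditions in columns.
expression = [
    [1.0, 2.0, 4.0],
    [2.0, 3.0, 5.0],  # gene 0 shifted by +1: coherent with gene 0
    [9.0, 0.0, 7.0],  # unrelated pattern
]

coherent = mean_squared_residue(expression, rows=[0, 1], cols=[0, 1, 2])
whole = mean_squared_residue(expression, rows=[0, 1, 2], cols=[0, 1, 2])
print(coherent < whole)  # True: dropping gene 2 gives a more coherent bicluster
```

A search procedure of the kind described above would start from a seed and add or remove rows and columns while keeping this score below the chosen MSR threshold.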

Relevance:

30.00% 30.00%

Publisher:

Abstract:

Microarray data analysis is a data mining tool used to extract meaningful information hidden in biological data. One of the major focuses of microarray data analysis is the reconstruction of gene regulatory networks, which may provide a broader understanding of the functioning of complex cellular systems. Since cancer is a genetic disease arising from abnormal gene function, the identification of cancerous genes and the regulatory pathways they control will provide a better platform for understanding tumor formation and development. The major focus of this thesis is to understand the regulation of genes responsible for the development of cancer, particularly colorectal cancer, by analyzing microarray expression data. In this thesis, four computational algorithms, namely a fuzzy logic algorithm, a modified genetic algorithm, a dynamic neural fuzzy network and a Takagi-Sugeno-Kang-type recurrent neural fuzzy network, are used to extract cancer-specific gene regulatory networks from a plasma RNA dataset of colorectal cancer patients. Plasma RNA is highly attractive for cancer analysis since it requires collection of only a small amount of blood and can be obtained repeatedly at any time, allowing the analysis of disease progression and treatment response.

Relevance:

30.00% 30.00%

Publisher:

Abstract:

BACKGROUND: Serial Analysis of Gene Expression (SAGE) is a powerful tool for genome-wide transcription studies. Unlike microarrays, it has the ability to detect novel forms of RNA such as alternatively spliced and antisense transcripts, without the need for prior knowledge of their existence. One limitation of using SAGE on an organism with a complex genome and lacking detailed sequence information, such as the hexaploid bread wheat Triticum aestivum, is accurate annotation of the tags generated. Without accurate annotation it is impossible to fully understand the dynamic processes involved in such complex polyploid organisms. Hence we have developed and utilised novel procedures to characterise, in detail, SAGE tags generated from the whole grain transcriptome of hexaploid wheat. RESULTS: Examination of 71,930 Long SAGE tags generated from six libraries derived from two wheat genotypes grown under two different conditions suggested that SAGE is a reliable and reproducible technique for use in studying the hexaploid wheat transcriptome. However, our results also showed that in poorly annotated and/or poorly sequenced genomes, such as hexaploid wheat, considerably more information can be extracted from SAGE data by carrying out a systematic analysis of both perfect and "fuzzy" (partially matched) tags. This detailed analysis of the SAGE data shows first that while there is evidence of alternative polyadenylation this appears to occur exclusively within the 3' untranslated regions. Secondly, we found no strong evidence for widespread alternative splicing in the developing wheat grain transcriptome. However, analysis of our SAGE data shows that antisense transcripts are probably widespread within the transcriptome and appear to be derived from numerous locations within the genome. 
Examination of antisense transcripts showing sequence similarity to the Puroindoline a and Puroindoline b genes suggests that such antisense transcripts might have a role in the regulation of gene expression. CONCLUSION: Our results indicate that the detailed analysis of transcriptome data, such as SAGE tags, is essential to understand fully the factors that regulate gene expression and that such analysis of the wheat grain transcriptome reveals that antisense transcripts may be widespread and hence probably play a significant role in the regulation of gene expression during grain development.
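The distinction between perfect and "fuzzy" (partially matched) tags can be illustrated with a minimal Python sketch. It is not the authors' procedure: it assumes that a fuzzy match means at most one mismatched base (Hamming distance of 1) against a reference tag of the same length, and the tag sequences, gene names and the `classify_tag` helper are all hypothetical.

```python
def hamming(a: str, b: str) -> int:
    """Count mismatched positions between two equal-length tags."""
    if len(a) != len(b):
        raise ValueError("tags must have equal length")
    return sum(x != y for x, y in zip(a, b))


def classify_tag(tag, reference, max_mismatches=1):
    """Classify a tag as 'perfect', 'fuzzy' or 'unmatched'.

    `reference` maps known tag sequences to annotations. The mismatch
    threshold is an assumption made for this illustration only.
    """
    if tag in reference:
        return "perfect", [reference[tag]]
    hits = sorted(
        gene
        for ref_tag, gene in reference.items()
        if len(ref_tag) == len(tag) and hamming(tag, ref_tag) <= max_mismatches
    )
    return ("fuzzy", hits) if hits else ("unmatched", [])


# Hypothetical 21-base reference tags (not real wheat sequences).
REFERENCE = {
    "CATGAAACCCGGGTTTAAACC": "gene_A",
    "CATGTTTGGGCCCAAATTTGG": "gene_B",
}

if __name__ == "__main__":
    for tag in (
        "CATGAAACCCGGGTTTAAACC",  # identical to the gene_A tag
        "CATGAAACCCGGGTTTAAACG",  # one base differs from the gene_A tag
        "CATGACGTACGTACGTACGTA",  # no close reference tag
    ):
        print(tag, classify_tag(tag, REFERENCE))
```

Keeping fuzzy hits as a separate class, rather than discarding them, is what lets a poorly annotated genome still contribute annotation candidates for tags that carry a sequencing error or allelic difference.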

Relevance:

30.00% 30.00%

Publisher:

Abstract:

Severe acute respiratory syndrome (SARS) coronavirus infection and growth are dependent on initiating signaling and enzyme actions upon viral entry into the host cell. Proteins packaged during virus assembly may subsequently form the first line of attack and host manipulation upon infection. A complete characterization of virion components is therefore important to understanding the dynamics of early stages of infection. Mass spectrometry and kinase profiling techniques identified nearly 200 incorporated host and viral proteins. We used published interaction data to identify hubs of connectivity with potential significance for virion formation. Surprisingly, the hub with the most potential connections was not the viral M protein but the nonstructural protein 3 (nsp3), which is one of the novel virion components identified by mass spectrometry. Based on new experimental data and a bioinformatics analysis across the Coronaviridae, we propose a higher-resolution functional domain architecture for nsp3 that determines the interaction capacity of this protein. Using recombinant protein domains expressed in Escherichia coli, we identified two additional RNA-binding domains of nsp3. One of these domains is located within the previously described SARS-unique domain, and there is a nucleic acid chaperone-like domain located immediately downstream of the papain-like proteinase domain. We also identified a novel cysteine-coordinated metal ion-binding domain. Analyses of interdomain interactions and provisional functional annotation of the remaining, so-far-uncharacterized domains are presented. Overall, the ensemble of data surveyed here paints a more complete picture of nsp3 as a conserved component of the viral protein processing machinery, which is intimately associated with viral RNA in its role as a virion component.
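Identifying "hubs of connectivity" from interaction data can be sketched, in its simplest form, as ranking proteins by the number of distinct partners they have. The Python sketch below assumes that simple degree count; the abstract does not state which measure was used, and the protein names, the edge list and the `rank_hubs` helper are hypothetical placeholders rather than published interactions.

```python
from collections import Counter


def rank_hubs(interactions):
    """Rank proteins by number of distinct interaction partners.

    `interactions` is an iterable of (protein, protein) pairs. Pairs are
    treated as undirected and duplicates are counted once; self-pairs
    are ignored.
    """
    unique_pairs = {frozenset(pair) for pair in interactions}
    degree = Counter()
    for pair in unique_pairs:
        if len(pair) == 2:  # skip self-interactions such as (A, A)
            for protein in pair:
                degree[protein] += 1
    # Highest degree first; ties are broken alphabetically.
    return sorted(degree.items(), key=lambda item: (-item[1], item[0]))


# Hypothetical interaction pairs for illustration only.
EDGES = [
    ("protA", "protB"),
    ("protA", "protC"),
    ("protA", "protD"),
    ("protB", "protC"),
    ("protC", "protB"),  # duplicate of the previous pair, reversed
]

if __name__ == "__main__":
    for protein, partners in rank_hubs(EDGES):
        print(protein, partners)
```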

Relevance:

30.00% 30.00%

Publisher:

Abstract:

Caliciviruses are a major cause of gastroenteritis in humans and cause a wide variety of other diseases in animals. Here, the characterization of protein-protein interactions between the individual proteins of Feline calicivirus (FCV), a model system for other members of the family Caliciviridae, is reported. Using the yeast two-hybrid system combined with a number of other approaches, it is demonstrated that the p32 protein (the picornavirus 2B analogue) of FCV interacts with p39 (2C), p30 (3A) and p76 (3CD). The FCV protease/RNA polymerase (ProPol) p76 was found to form homo-oligomers, as well as to interact with VPg and ORF2, the region encoding the major capsid protein VP1. A weak interaction was also observed between p76 and the minor capsid protein encoded by ORF3 (VP2). ORF2 protein was found to interact with VPg, p76 and VP2. The potential roles of the interactions in calicivirus replication are discussed.

Relevance:

30.00% 30.00%

Publisher:

Abstract:

Phosphorylation of the coronavirus nucleoprotein (N protein) has been predicted to play a role in RNA binding. To investigate this hypothesis, we examined the kinetics of RNA binding between nonphosphorylated and phosphorylated infectious bronchitis virus N protein with nonviral and viral RNA by surface plasmon resonance (Biacore). Mass spectroscopic analysis of N protein identified phosphorylation sites that were proximal to RNA binding domains. Kinetic analysis, by surface plasmon resonance, indicated that nonphosphorylated N protein bound with the same affinity to viral RNA as phosphorylated N protein. However, phosphorylated N protein bound to viral RNA with a higher binding affinity than nonviral RNA, suggesting that phosphorylation of N protein determined the recognition of virus RNA. The data also indicated that a known N protein binding site (involved in transcriptional regulation) consisting of a conserved core sequence present near the 5' end of the genome (in the leader sequence) functioned by promoting high association rates of N protein binding. Further analysis of the leader sequence indicated that the core element was not the only binding site for N protein and that other regions functioned to promote high-affinity binding.
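The link between association rates and affinity can be made concrete with a short Python sketch. It assumes a simple 1:1 binding model, in which the equilibrium dissociation constant is K_D = k_d / k_a; the abstract does not state which fitting model was used, and the rate constants below are hypothetical values chosen only for illustration.

```python
def dissociation_constant(k_a: float, k_d: float) -> float:
    """Equilibrium dissociation constant K_D (M) for 1:1 binding.

    k_a: association rate constant in 1/(M*s)
    k_d: dissociation rate constant in 1/s
    A lower K_D corresponds to a higher binding affinity.
    """
    if k_a <= 0 or k_d < 0:
        raise ValueError("k_a must be positive and k_d non-negative")
    return k_d / k_a


if __name__ == "__main__":
    # Hypothetical rate constants, not measurements from the study.
    slow_on = dissociation_constant(k_a=1e4, k_d=1e-3)
    fast_on = dissociation_constant(k_a=1e5, k_d=1e-3)
    # With the same off-rate, a tenfold faster on-rate gives a tenfold
    # lower K_D, that is, tighter binding.
    print(f"slow association: K_D = {slow_on:.1e} M")
    print(f"fast association: K_D = {fast_on:.1e} M")
```

Under this model, a sequence element that raises the association rate without changing the dissociation rate increases affinity in direct proportion.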

Relevance:

30.00% 30.00%

Publisher:

Abstract:

We have developed a new, simple method for transport, storage, and analysis of genetic material from the corals Agaricia agaricites, Dendrogyra cylindrica, Eusmilia ancora, Meandrina meandrites, Montastrea annularis, Porites astreoides, Porites furcata, Porites porites, and Siderastrea siderea at room temperature. All species yielded sufficient DNA from a single FTA® card (19 μg-43 ng) for subsequent PCR amplification of both coral and zooxanthellar DNA. The D1 and D2 variable region of the large subunit rRNA gene (LSUrDNA) was amplified from the DNA of P. furcata and S. siderea by PCR. Electrophoresis yielded two major DNA bands: an 800-base pair (bp) DNA, which represented the coral ribosomal RNA (rRNA) gene, and a 600-bp DNA, which represented the zooxanthellar srRNA gene. Extraction of DNA from the bands yielded between 290 μg total DNA (S. siderea coral DNA) and 9 μg total DNA (P. furcata zooxanthellar DNA). The ability to transport and store genetic material from scleractinian corals without resort to laboratory facilities in the field allows for the molecular study of a far wider range and variety of coral sites than have been studied to date. © 2003 Elsevier Science B.V. All rights reserved.