949 resultados para Reverse-transcriptase
Resumo:
PURPOSE Deep molecular response (MR(4.5)) defines a subgroup of patients with chronic myeloid leukemia (CML) who may stay in unmaintained remission after treatment discontinuation. It is unclear how many patients achieve MR(4.5) under different treatment modalities and whether MR(4.5) predicts survival. PATIENTS AND METHODS Patients from the randomized CML-Study IV were analyzed for confirmed MR(4.5) which was defined as ≥ 4.5 log reduction of BCR-ABL on the international scale (IS) and determined by reverse transcriptase polymerase chain reaction in two consecutive analyses. Landmark analyses were performed to assess the impact of MR(4.5) on survival. RESULTS Of 1,551 randomly assigned patients, 1,524 were assessable. After a median observation time of 67.5 months, 5-year overall survival (OS) was 90%, 5-year progression-free-survival was 87.5%, and 8-year OS was 86%. The cumulative incidence of MR(4.5) after 9 years was 70% (median, 4.9 years); confirmed MR(4.5) was 54%. MR(4.5) was reached more quickly with optimized high-dose imatinib than with imatinib 400 mg/day (P = .016). Independent of treatment approach, confirmed MR(4.5) at 4 years predicted significantly higher survival probabilities than 0.1% to 1% IS, which corresponds to complete cytogenetic remission (8-year OS, 92% v 83%; P = .047). High-dose imatinib and early major molecular remission predicted MR(4.5). No patient with confirmed MR(4.5) has experienced progression. CONCLUSION MR(4.5) is a new molecular predictor of long-term outcome, is reached by a majority of patients treated with imatinib, and is achieved more quickly with optimized high-dose imatinib, which may provide an improved therapeutic basis for treatment discontinuation in CML.
Resumo:
BACKGROUND Small ruminant lentiviruses escaping efficient serological detection are still circulating in Swiss goats in spite of a long eradication campaign that essentially eliminated clinical cases of caprine arthritis encephalitis in the country. This strongly suggests that the circulating viruses are avirulent for goats.To test this hypothesis, we isolated circulating viruses from naturally infected animals and tested the in vitro and in vivo characteristics of these field isolates. METHODS Viruses were isolated from primary macrophage cultures. The presence of lentiviruses in the culture supernatants was monitored by reverse transcriptase assay. Isolates were passaged in different cells and their cytopathogenic effects monitored by microscopy. Proviral load was quantified by real-time PCR using customized primer and probes. Statistical analysis comprised Analysis of Variance and Bonferroni Multiple Comparison Test. RESULTS The isolated viruses belonged to the small ruminant lentiviruses A4 subtype that appears to be prominent in Switzerland. The 4 isolates replicated very efficiently in macrophages, displaying heterogeneous phenotypes, with two isolates showing a pronounced cytopathogenicity for these cells. By contrast, all 4 isolates had a poor replication capacity in goat and sheep fibroblasts. The proviral loads in the peripheral blood and, in particular, in the mammary gland were surprisingly high compared to previous observations. Nevertheless, these viruses appear to be of low virulence for goats except for the mammary gland were histopathological changes were observed. CONCLUSIONS Small ruminant lentiviruses continue to circulate in Switzerland despite a long and expensive caprine arthritis encephalitis virus eradication campaign. We isolated 4 of these lentiviruses and confirmed their phylogenetic association with the prominent A4 subtype. The pathological and histopathological analysis of the infected animals supported the hypothesis that these A4 viruses are of low pathogenicity for goats, with, however, a caveat about the potentially detrimental effects on the mammary gland. Moreover, the high proviral load detected indicates that the immune system of the animals cannot control the infection and this, combined with the phenotypic plasticity observed in vitro, strongly argues in favour of a continuous and precise monitoring of these SRLV to avoid the risk of jeopardizing a long eradication campaign.
Resumo:
Positive-stranded viruses synthesize their RNA in membrane-bound organelles, but it is not clear how this benefits the virus or the host. For coronaviruses, these organelles take the form of double-membrane vesicles (DMVs) interconnected by a convoluted membrane network. We used electron microscopy to identify murine coronaviruses with mutations in nsp3 and nsp14 that replicated normally while producing only half the normal amount of DMVs under low-temperature growth conditions. Viruses with mutations in nsp5 and nsp16 produced small DMVs but also replicated normally. Quantitative reverse transcriptase PCR (RT-PCR) confirmed that the most strongly affected of these, the nsp3 mutant, produced more viral RNA than wild-type virus. Competitive growth assays were carried out in both continuous and primary cells to better understand the contribution of DMVs to viral fitness. Surprisingly, several viruses that produced fewer or smaller DMVs showed a higher fitness than wild-type virus at the reduced temperature, suggesting that larger and more numerous DMVs do not necessarily confer a competitive advantage in primary or continuous cell culture. For the first time, this directly demonstrates that replication and organelle formation may be, at least in part, studied separately during infection with positive-stranded RNA virus. IMPORTANCE The viruses that cause severe acute respiratory syndrome (SARS), poliomyelitis, and hepatitis C all replicate in double-membrane vesicles (DMVs). The big question about DMVs is why they exist in the first place. In this study, we looked at thousands of infected cells and identified two coronavirus mutants that made half as many organelles as normal and two others that made typical numbers but smaller organelles. Despite differences in DMV size and number, all four mutants replicated as efficiently as wild-type virus. To better understand the relative importance of replicative organelles, we carried out competitive fitness experiments. None of these viruses was found to be significantly less fit than wild-type, and two were actually fitter in tests in two kinds of cells. This suggests that viruses have evolved to have tremendous plasticity in the ability to form membrane-associated replication complexes and that large and numerous DMVs are not exclusively associated with efficient coronavirus replication.
Resumo:
BACKGROUND Even among HIV-infected patients who fully suppress plasma HIV RNA replication on antiretroviral therapy, genetic (e.g. CCL3L1 copy number), viral (e.g. tropism) and environmental (e.g. chronic exposure to microbial antigens) factors influence CD4 recovery. These factors differ markedly around the world and therefore the expected CD4 recovery during HIV RNA suppression may differ globally. METHODS We evaluated HIV-infected adults from North America, West Africa, East Africa, Southern Africa and Asia starting non-nucleoside reverse transcriptase inhibitorbased regimens containing efavirenz or nevirapine, who achieved at least one HIV RNA level <500/ml in the first year of therapy and observed CD4 changes during HIV RNA suppression. We used a piecewise linear regression to estimate the influence of region of residence on CD4 recovery, adjusting for socio-demographic and clinical characteristics. We observed 28 217 patients from 105 cohorts over 37 825 person-years. RESULTS After adjustment, patients from East Africa showed diminished CD4 recovery as compared with other regions. Three years after antiretroviral therapy initiation, the mean CD4 count for a prototypical patient with a pre-therapy CD4 count of 150/ml was 529/ml [95% confidence interval (CI): 517–541] in North America, 494/ml (95% CI: 429–559) in West Africa, 515/ml (95% CI: 508–522) in Southern Africa, 503/ml (95% CI: 478–528) in Asia and 437/ml (95% CI: 425–449) in East Africa. CONCLUSIONS CD4 recovery during HIV RNA suppression is diminished in East Africa as compared with other regions of the world, and observed differences are large enough to potentially influence clinical outcomes. Epidemiological analyses on a global scale can identify macroscopic effects unobservable at the clinical, national or individual regional level.
Resumo:
The tall cell (TC) variant of papillary thyroid carcinoma (PTC) has an unfavorable prognosis. The diagnostic criteria remain inconsistent, and the role of a minor TC component is unclear. Molecular diagnostic markers are not available; however, there are two potential candidates: BRAF V600E and telomerase reverse transcriptase (TERT) promoter mutations. Using a novel approach, we enriched a collective with PTCs that harbored an adverse outcome, which overcame the limited statistical power of most studies. This enabled us to review 125 PTC patients, 57 of which had an adverse outcome. The proportion of TCs that constituted a poor prognosis was assessed. All of the tumors underwent sequencing for TERT promoter and BRAF V600E mutational status and were stained with an antibody to detect the BRAF V600E mutation. A 10% cutoff for TCs was significantly associated with advanced tumor stage and lymph node metastasis. Multivariate analysis showed that TCs above 10% were the only significant factor for overall, tumor-specific, and relapse-free survival. Seven percent of the cases had a TERT promoter mutation, whereas 61% demonstrated a BRAF mutation. The presence of TC was significantly associated with TERT promoter and BRAF mutations. TERT predicted highly significant tumor relapse (P<0.001). PTCs comprised of at least 10% TCs are associated with an adverse clinical outcome and should be reported accordingly. BRAF did not influence patient outcome. Nevertheless, a positive status should encourage the search for TCs. TERT promoter mutations are a strong predictor of tumor relapse, but their role as a surrogate marker for TCs is limited.
Resumo:
BACKGROUND HIV-1 viral load (VL) testing is recommended to monitor antiretroviral therapy (ART) but not universally available. We examined monitoring of first-line and switching to second-line ART in sub-Saharan Africa, 2004-2013. METHODS Adult HIV-1 infected patients starting combination ART in 16 countries were included. Switching was defined as a change from a non-nucleoside reverse-transcriptase inhibitor (NNRTI)-based regimen to a protease inhibitor (PI)-based regimen, with a change of ≥1 NRTI. Virological and immunological failures were defined per World Health Organization criteria. We calculated cumulative probabilities of switching and hazard ratios with 95% confidence intervals (CI) comparing routine VL monitoring, targeted VL monitoring, CD4 cell monitoring and clinical monitoring, adjusted for programme and individual characteristics. FINDINGS Of 297,825 eligible patients, 10,352 patients (3·5%) switched during 782,412 person-years of follow-up. Compared to CD4 monitoring hazard ratios for switching were 3·15 (95% CI 2·92-3·40) for routine VL, 1·21 (1·13-1·30) for targeted VL and 0·49 (0·43-0·56) for clinical monitoring. Overall 58.0% of patients with confirmed virological and 19·3% of patients with confirmed immunological failure switched within 2 years. Among patients who switched the percentage with evidence of treatment failure based on a single CD4 or VL measurement ranged from 32·1% with clinical to 84.3% with targeted VL monitoring. Median CD4 counts at switching were 215 cells/µl under routine VL monitoring but lower with other monitoring (114-133 cells/µl). INTERPRETATION Overall few patients switched to second-line ART and switching occurred late in the absence of routine viral load monitoring. Switching was more common and occurred earlier with targeted or routine viral load testing.
Resumo:
In yeasts, the ABC-type transporters are involved in vacuolar sequestration of cadmium. In plants, transport experiments with isolated vacuoles indicate that this is also true. In order to know more about the response of AtMRPs, a subclass of Arabidopsis ABC transporters, to cadmium, their expression pattern was analysed using the microchip technology and semi-quantitative reverse transcriptase-polymerase chain reaction. From 15 putative sequences coding for AtMRPs, transcript levels were detected for 14. All were expressed in the roots as well as in the shoots, although at a different level. In 4-week-old Arabidopsis, transcript levels of four AtMRPs were up-regulated after cadmium treatment. In all cases up-regulation was exclusively observed in the roots. The increase of transcript levels was most pronounced for AtMRP3. A more detailed analysis revealed that induction of AtMRP3 could also be observed in the shoot when leaves were cut and cadmium allowed to be taken up in the shoot. In young plantlets, a far higher portion of Cd2+ was translocated in the aerial part compared with adult plants. Consequently, AtMRP3 transcript levels increased in both root and shoot of young plants. This suggests that 7-day-old seedlings do not exhibit such a strict root–shoot barrier as 4-week-old plants. Expression analysis with mutant plants for glutathione and phytochelatin synthesis as well as with compounds producing oxidative stress indicate that induction of AtMRP3 is likely due to the heavy metal itself.
Resumo:
BACKGROUND: To date, an estimated 10% of children eligible for antiretroviral treatment (ART) receive it, and the frequency of retention in programs is unknown. We evaluated the 2-year risks of death and loss to follow-up (LTFU) of children after ART initiation in a multicenter study in sub-Saharan Africa. METHODS: Pooled analysis of routine individual data from 16 participating clinics produced overall Kaplan-Meier estimates of the probabilities of death or LTFU after ART initiation. Risk factors analysis used Weibull regression, accounting for between-cohort heterogeneity. RESULTS: The median age of 2405 children at ART initiation was 4.9 years (12%, younger than 12 months), 52% were male, 70% had severe immunodeficiency, and 59% started ART with a nonnucleoside reverse transcriptase inhibitor. The 2-year risk of death after ART initiation was 6.9% (95% confidence interval [CI]: 5.9 to 8.1), independently associated with baseline severe anemia (adjusted hazard ratio [aHR]: 4.10 [CI: 2.36 to 7.13]), immunodeficiency (adjusted aHR: 2.95 [CI: 1.49 to 5.82]), and severe clinical status (adjusted aHR: 3.64 [CI: 1.95 to 6.81]); the 2-year risk of LTFU was 10.3% (CI: 8.9 to 11.9), higher in children with severe clinical status. CONCLUSIONS: Once on treatment, the 2-year risk of death is low but the LTFU risk is substantial. ART is still mainly initiated at advanced disease stage in African children, reinforcing the need for early HIV diagnosis, early initiation of ART, and procedures to increase program retention.
Resumo:
OBJECTIVE To describe all patients admitted to children's hospitals in Switzerland with a diagnosis of influenza A/H1N1/09 virus infection during the 2009 influenza pandemic, and to analyse their characteristics, predictors of complications, and outcome. METHODS All patients ≤18-years-old hospitalised in eleven children's hospitals in Switzerland between June 2009 and January 2010 with a positive influenza A/H1N1/09 reverse transcriptase polymerase chain reaction (RT-PCR) from a nasopharyngeal specimen were included. RESULTS There were 326 PCR-confirmed patients of whom 189 (58%) were younger than 5 years of age, and 126 (38.7%) had one or more pre-existing medical condition. Fever (median 39.1 °C) was the most common sign (85.6% of all patients), while feeding problems (p = 0.003) and febrile seizures (p = 0.016) were significantly more frequent in children under 5 years. In 142 (43.6%) patients there was clinical suspicion of a concomitant bacterial infection, which was confirmed in 36 patients (11%). However, severe bacterial infection was observed in 4% of patients. One third (n = 108, 33.1%) of the patients were treated with oseltamivir, 64 (59.3%, or 20% overall) within 48 hours of onset of symptoms. Almost half of the patients (45.1%) received antibiotics for a median of 7 days. Twenty patients (6.1%) required intensive care, mostly for complicated pneumonia (50%) without an underlying medical condition. The median duration of hospitalisation was 2 days (range 0-39) for 304 patients. Two children (<15 months of age with underlying disease) died. CONCLUSIONS Although pandemic influenza A/H1N1/09 virus infection in children is mostly mild, it can be severe, regardless of past history or underlying disease.
Resumo:
miRNAs function to regulate gene expression through post-transcriptional mechanisms to potentially regulate multiple aspects of physiology and development. Whole transcriptome analysis has been conducted on the citron kinase mutant rat, a mutant that shows decreases in brain growth and development. The resulting differences in RNA between mutant and wild-type controls can be used to identify genetic pathways that may be regulated differentially in normal compared to abnormal neurogenesis. The goal of this thesis was to verify, with quantitative reverse transcriptase polymerase chain reaction (qRT-PCR), changes in miRNA expression in Cit-k mutants and wild types. In addition to confirming miRNA expression changes, bio-informatics software TargetScan 5.1 was used to identify potential mRNA targets of the differentially expressed miRNAs. The miRNAs that were confirmed to change include: rno-miR-466c, mmu-miR-493, mmu-miR-297a, hsa-miR-765, and hsa-miR-1270. The TargetScan analysis revealed 347 potential targets which have known roles in development. A subset of these potential targets include genes involved in the Wnt signaling pathway which is known to be an important regulator of stem cell development.
Resumo:
Retroviruses are RNA viruses that replicate through a double-stranded DNA intermediate. The viral enzyme reverse transcriptase copies the retroviral genomic RNA into this DNA intermediate through the process of reverse transcription. Many variables can affect the fidelity of reverse transcriptase during reverse transcription, including specific sequences within the retroviral genome. ^ Previous studies have observed that multiple cloning sites (MCS) and sequences predicted to form stable hairpin structures are hotspots for deletion during retroviral replication. The studies described in this dissertation were performed to elucidate the variables that affect the stability of MCS and hairpin structures in retroviral vectors. Two series of retroviral vectors were constructed and characterized in these studies. ^ Spleen necrosis virus-based vectors were constructed containing separate MCS insertions of varying length, orientation, and symmetry. The only MCS that was a hotspot for deletion formed a stable hairpin structure. Upon more detailed study, the MCS previously reported as a hotspot for deletion was found to contain a tandem linker insertion that formed a hairpin structure. Murine leukemia virus-based vectors were constructed containing separate sequence insertions of either inverted repeat symmetry (122IR) that could form a hairpin structure, or little symmetry (122c) that would form a less stable structure. These insertions were made into either the neomycin resistance marker ( neo) or the hygromycin resistance marker (hyg) of the vector. 122c was stable in both neo and hyg, while 122IR was preferentially deleted in neo and was remarkably unstable in hyg. ^ These results suggest that MCS are hotspots for deletion in retroviral vectors if they can form hairpin structures, and that hairpin structures can be highly unstable at certain locations in retroviral vectors. This information may contribute to improved design of retroviral vectors for such uses as human gene therapy, and will contribute to a greater understanding of the basic science of retroviral reverse transcription. ^
Resumo:
Retroviruses uniquely co-package two copies of their genomic RNA within each virion. The two copies are used as templates for synthesis of the proviral DNA during the process of reverse transcription. Two template switches are required to complete retroviral DNA synthesis by the retroviral enzyme, reverse transcriptase. With two RNA genomes present in the virion, reverse transcriptase can make template switches utilizing only one of the RNA templates (intramolecular) or utilizing both RNA templates (intermolecular) during the process of reverse transcription. The results presented in this study show that during a single cycle of Moloney murine leukemia virus replication, both nonrecombinant and recombinant proviruses predominantly underwent intramolecular minus- and plus-strand transfers during the process of reverse transcription. This is the first study to examine the nature of the required template switches occurring during MLV replication and these results support the previous findings for SNV, and the hypothesis that the required template switches are ordered events. This study also determined rates for deletion and a rate of recombination for a single cycle of MLV replication. The rates reported here are comparable to the rates previously reported for both SNV and MLV. ^
Resumo:
Many lines of clinical and experimental evidence indicate a viral role in carcinogenesis (1-6). Our access to patient plasma, serum, and tissue samples from invasive breast cancer (N=19), ductal carcinoma in situ (N=13), malignant ovarian cancer (N=12), and benign ovarian tumors (N=9), via IRB-approved and informed consent protocols through M.D. Anderson Cancer Center, as well as normal donor plasmas purchased from Gulf Coast Regional Blood Center (N=6), has allowed us to survey primary patient blood and tissue samples, healthy donor blood from the general population, as well as commercially available human cell lines for the presence of human endogenous retrovirus K (HERV-K) Env viral RNA (vRNA), protein, and viral particles. We hypothesize that HERV-K proteins are tumor-associated antigens and as such can be profiled and targeted in patients for diagnostic and therapeutic purposes. To test this hypothesis, we employed isopycnic ultracentrifugation, a microplate-based reverse transcriptase enzyme activity assay, reverse transcription – polymerase chain reaction (RT-PCR), cDNA sequencing, SDS-PAGE and western blotting, immunofluorescent staining, confocal microscopy, and transmission electron microscopy to evaluate v HERV-K activation in cancer. Data from large numbers of patients tested by reverse transcriptase activity assay were analyzed statistically by t-test to determine the potential use of this assay as a diagnostic tool for cancer. Significant reverse transcriptase enzyme activity was detected in 75% of ovarian cancer patients, 53.8% of ductal carcinoma in situ patient, and 42.1% of invasive breast cancer patient samples. Only 11.1% of benign ovarian patient and 16.7% of normal donor samples tested positive. HERV-K Env vRNA, or Env SU were detected in the majority of cancer types screened, as demonstrated by the results shown herein, and were largely absent in normal controls. These findings support our hypothesis that the presence of HERV-K in patient blood circulation is an indicator of cancer or pre-malignancy in vivo, that the presence of HERV-K Env on tumor cell surfaces is indicative of malignant phenotype, and that HERV-K Env is a tumor-associated antigen useful not only as a diagnostic screening tool to predict patient disease status, but also as an exploitable therapeutic target for various novel antibody-based immunotherapies.
Resumo:
Hoy en día, con la evolución continua y rápida de las tecnologías de la información y los dispositivos de computación, se recogen y almacenan continuamente grandes volúmenes de datos en distintos dominios y a través de diversas aplicaciones del mundo real. La extracción de conocimiento útil de una cantidad tan enorme de datos no se puede realizar habitualmente de forma manual, y requiere el uso de técnicas adecuadas de aprendizaje automático y de minería de datos. La clasificación es una de las técnicas más importantes que ha sido aplicada con éxito a varias áreas. En general, la clasificación se compone de dos pasos principales: en primer lugar, aprender un modelo de clasificación o clasificador a partir de un conjunto de datos de entrenamiento, y en segundo lugar, clasificar las nuevas instancias de datos utilizando el clasificador aprendido. La clasificación es supervisada cuando todas las etiquetas están presentes en los datos de entrenamiento (es decir, datos completamente etiquetados), semi-supervisada cuando sólo algunas etiquetas son conocidas (es decir, datos parcialmente etiquetados), y no supervisada cuando todas las etiquetas están ausentes en los datos de entrenamiento (es decir, datos no etiquetados). Además, aparte de esta taxonomía, el problema de clasificación se puede categorizar en unidimensional o multidimensional en función del número de variables clase, una o más, respectivamente; o también puede ser categorizado en estacionario o cambiante con el tiempo en función de las características de los datos y de la tasa de cambio subyacente. A lo largo de esta tesis, tratamos el problema de clasificación desde tres perspectivas diferentes, a saber, clasificación supervisada multidimensional estacionaria, clasificación semisupervisada unidimensional cambiante con el tiempo, y clasificación supervisada multidimensional cambiante con el tiempo. Para llevar a cabo esta tarea, hemos usado básicamente los clasificadores Bayesianos como modelos. La primera contribución, dirigiéndose al problema de clasificación supervisada multidimensional estacionaria, se compone de dos nuevos métodos de aprendizaje de clasificadores Bayesianos multidimensionales a partir de datos estacionarios. Los métodos se proponen desde dos puntos de vista diferentes. El primer método, denominado CB-MBC, se basa en una estrategia de envoltura de selección de variables que es voraz y hacia delante, mientras que el segundo, denominado MB-MBC, es una estrategia de filtrado de variables con una aproximación basada en restricciones y en el manto de Markov. Ambos métodos han sido aplicados a dos problemas reales importantes, a saber, la predicción de los inhibidores de la transcriptasa inversa y de la proteasa para el problema de infección por el virus de la inmunodeficiencia humana tipo 1 (HIV-1), y la predicción del European Quality of Life-5 Dimensions (EQ-5D) a partir de los cuestionarios de la enfermedad de Parkinson con 39 ítems (PDQ-39). El estudio experimental incluye comparaciones de CB-MBC y MB-MBC con los métodos del estado del arte de la clasificación multidimensional, así como con métodos comúnmente utilizados para resolver el problema de predicción de la enfermedad de Parkinson, a saber, la regresión logística multinomial, mínimos cuadrados ordinarios, y mínimas desviaciones absolutas censuradas. En ambas aplicaciones, los resultados han sido prometedores con respecto a la precisión de la clasificación, así como en relación al análisis de las estructuras gráficas que identifican interacciones conocidas y novedosas entre las variables. La segunda contribución, referida al problema de clasificación semi-supervisada unidimensional cambiante con el tiempo, consiste en un método nuevo (CPL-DS) para clasificar flujos de datos parcialmente etiquetados. Los flujos de datos difieren de los conjuntos de datos estacionarios en su proceso de generación muy rápido y en su aspecto de cambio de concepto. Es decir, los conceptos aprendidos y/o la distribución subyacente están probablemente cambiando y evolucionando en el tiempo, lo que hace que el modelo de clasificación actual sea obsoleto y deba ser actualizado. CPL-DS utiliza la divergencia de Kullback-Leibler y el método de bootstrapping para cuantificar y detectar tres tipos posibles de cambio: en las predictoras, en la a posteriori de la clase o en ambas. Después, si se detecta cualquier cambio, un nuevo modelo de clasificación se aprende usando el algoritmo EM; si no, el modelo de clasificación actual se mantiene sin modificaciones. CPL-DS es general, ya que puede ser aplicado a varios modelos de clasificación. Usando dos modelos diferentes, el clasificador naive Bayes y la regresión logística, CPL-DS se ha probado con flujos de datos sintéticos y también se ha aplicado al problema real de la detección de código malware, en el cual los nuevos ficheros recibidos deben ser continuamente clasificados en malware o goodware. Los resultados experimentales muestran que nuestro método es efectivo para la detección de diferentes tipos de cambio a partir de los flujos de datos parcialmente etiquetados y también tiene una buena precisión de la clasificación. Finalmente, la tercera contribución, sobre el problema de clasificación supervisada multidimensional cambiante con el tiempo, consiste en dos métodos adaptativos, a saber, Locally Adpative-MB-MBC (LA-MB-MBC) y Globally Adpative-MB-MBC (GA-MB-MBC). Ambos métodos monitorizan el cambio de concepto a lo largo del tiempo utilizando la log-verosimilitud media como métrica y el test de Page-Hinkley. Luego, si se detecta un cambio de concepto, LA-MB-MBC adapta el actual clasificador Bayesiano multidimensional localmente alrededor de cada nodo cambiado, mientras que GA-MB-MBC aprende un nuevo clasificador Bayesiano multidimensional. El estudio experimental realizado usando flujos de datos sintéticos multidimensionales indica los méritos de los métodos adaptativos propuestos. ABSTRACT Nowadays, with the ongoing and rapid evolution of information technology and computing devices, large volumes of data are continuously collected and stored in different domains and through various real-world applications. Extracting useful knowledge from such a huge amount of data usually cannot be performed manually, and requires the use of adequate machine learning and data mining techniques. Classification is one of the most important techniques that has been successfully applied to several areas. Roughly speaking, classification consists of two main steps: first, learn a classification model or classifier from an available training data, and secondly, classify the new incoming unseen data instances using the learned classifier. Classification is supervised when the whole class values are present in the training data (i.e., fully labeled data), semi-supervised when only some class values are known (i.e., partially labeled data), and unsupervised when the whole class values are missing in the training data (i.e., unlabeled data). In addition, besides this taxonomy, the classification problem can be categorized into uni-dimensional or multi-dimensional depending on the number of class variables, one or more, respectively; or can be also categorized into stationary or streaming depending on the characteristics of the data and the rate of change underlying it. Through this thesis, we deal with the classification problem under three different settings, namely, supervised multi-dimensional stationary classification, semi-supervised unidimensional streaming classification, and supervised multi-dimensional streaming classification. To accomplish this task, we basically used Bayesian network classifiers as models. The first contribution, addressing the supervised multi-dimensional stationary classification problem, consists of two new methods for learning multi-dimensional Bayesian network classifiers from stationary data. They are proposed from two different points of view. The first method, named CB-MBC, is based on a wrapper greedy forward selection approach, while the second one, named MB-MBC, is a filter constraint-based approach based on Markov blankets. Both methods are applied to two important real-world problems, namely, the prediction of the human immunodeficiency virus type 1 (HIV-1) reverse transcriptase and protease inhibitors, and the prediction of the European Quality of Life-5 Dimensions (EQ-5D) from 39-item Parkinson’s Disease Questionnaire (PDQ-39). The experimental study includes comparisons of CB-MBC and MB-MBC against state-of-the-art multi-dimensional classification methods, as well as against commonly used methods for solving the Parkinson’s disease prediction problem, namely, multinomial logistic regression, ordinary least squares, and censored least absolute deviations. For both considered case studies, results are promising in terms of classification accuracy as well as regarding the analysis of the learned MBC graphical structures identifying known and novel interactions among variables. The second contribution, addressing the semi-supervised uni-dimensional streaming classification problem, consists of a novel method (CPL-DS) for classifying partially labeled data streams. Data streams differ from the stationary data sets by their highly rapid generation process and their concept-drifting aspect. That is, the learned concepts and/or the underlying distribution are likely changing and evolving over time, which makes the current classification model out-of-date requiring to be updated. CPL-DS uses the Kullback-Leibler divergence and bootstrapping method to quantify and detect three possible kinds of drift: feature, conditional or dual. Then, if any occurs, a new classification model is learned using the expectation-maximization algorithm; otherwise, the current classification model is kept unchanged. CPL-DS is general as it can be applied to several classification models. Using two different models, namely, naive Bayes classifier and logistic regression, CPL-DS is tested with synthetic data streams and applied to the real-world problem of malware detection, where the new received files should be continuously classified into malware or goodware. Experimental results show that our approach is effective for detecting different kinds of drift from partially labeled data streams, as well as having a good classification performance. Finally, the third contribution, addressing the supervised multi-dimensional streaming classification problem, consists of two adaptive methods, namely, Locally Adaptive-MB-MBC (LA-MB-MBC) and Globally Adaptive-MB-MBC (GA-MB-MBC). Both methods monitor the concept drift over time using the average log-likelihood score and the Page-Hinkley test. Then, if a drift is detected, LA-MB-MBC adapts the current multi-dimensional Bayesian network classifier locally around each changed node, whereas GA-MB-MBC learns a new multi-dimensional Bayesian network classifier from scratch. Experimental study carried out using synthetic multi-dimensional data streams shows the merits of both proposed adaptive methods.
Resumo:
Human fibroblasts whose lifespan in culture has been extended by expression of a viral oncogene eventually undergo a growth crisis marked by failure to proliferate. It has been proposed that telomere shortening in these cells is the property that limits their proliferation. Here we report that ectopic expression of the wild-type reverse transcriptase protein (hTERT) of human telomerase averts crisis, at the same time reducing the frequency of dicentric and abnormal chromosomes. Surprisingly, as the resulting immortalized cells containing active telomerase continue to proliferate, their telomeres continue to shorten to mean lengths below those in control cells that enter crisis. These results provide evidence for a protective function of human telomerase that allows cell proliferation without requiring net lengthening of telomeres.