4 resultados para AND replication
em DRUM (Digital Repository at the University of Maryland)
Resumo:
In today’s big data world, data is being produced in massive volumes, at great velocity and from a variety of different sources such as mobile devices, sensors, a plethora of small devices hooked to the internet (Internet of Things), social networks, communication networks and many others. Interactive querying and large-scale analytics are being increasingly used to derive value out of this big data. A large portion of this data is being stored and processed in the Cloud due the several advantages provided by the Cloud such as scalability, elasticity, availability, low cost of ownership and the overall economies of scale. There is thus, a growing need for large-scale cloud-based data management systems that can support real-time ingest, storage and processing of large volumes of heterogeneous data. However, in the pay-as-you-go Cloud environment, the cost of analytics can grow linearly with the time and resources required. Reducing the cost of data analytics in the Cloud thus remains a primary challenge. In my dissertation research, I have focused on building efficient and cost-effective cloud-based data management systems for different application domains that are predominant in cloud computing environments. In the first part of my dissertation, I address the problem of reducing the cost of transactional workloads on relational databases to support database-as-a-service in the Cloud. The primary challenges in supporting such workloads include choosing how to partition the data across a large number of machines, minimizing the number of distributed transactions, providing high data availability, and tolerating failures gracefully. I have designed, built and evaluated SWORD, an end-to-end scalable online transaction processing system, that utilizes workload-aware data placement and replication to minimize the number of distributed transactions that incorporates a suite of novel techniques to significantly reduce the overheads incurred both during the initial placement of data, and during query execution at runtime. In the second part of my dissertation, I focus on sampling-based progressive analytics as a means to reduce the cost of data analytics in the relational domain. Sampling has been traditionally used by data scientists to get progressive answers to complex analytical tasks over large volumes of data. Typically, this involves manually extracting samples of increasing data size (progressive samples) for exploratory querying. This provides the data scientists with user control, repeatable semantics, and result provenance. However, such solutions result in tedious workflows that preclude the reuse of work across samples. On the other hand, existing approximate query processing systems report early results, but do not offer the above benefits for complex ad-hoc queries. I propose a new progressive data-parallel computation framework, NOW!, that provides support for progressive analytics over big data. In particular, NOW! enables progressive relational (SQL) query support in the Cloud using unique progress semantics that allow efficient and deterministic query processing over samples providing meaningful early results and provenance to data scientists. NOW! enables the provision of early results using significantly fewer resources thereby enabling a substantial reduction in the cost incurred during such analytics. Finally, I propose NSCALE, a system for efficient and cost-effective complex analytics on large-scale graph-structured data in the Cloud. The system is based on the key observation that a wide range of complex analysis tasks over graph data require processing and reasoning about a large number of multi-hop neighborhoods or subgraphs in the graph; examples include ego network analysis, motif counting in biological networks, finding social circles in social networks, personalized recommendations, link prediction, etc. These tasks are not well served by existing vertex-centric graph processing frameworks whose computation and execution models limit the user program to directly access the state of a single vertex, resulting in high execution overheads. Further, the lack of support for extracting the relevant portions of the graph that are of interest to an analysis task and loading it onto distributed memory leads to poor scalability. NSCALE allows users to write programs at the level of neighborhoods or subgraphs rather than at the level of vertices, and to declaratively specify the subgraphs of interest. It enables the efficient distributed execution of these neighborhood-centric complex analysis tasks over largescale graphs, while minimizing resource consumption and communication cost, thereby substantially reducing the overall cost of graph data analytics in the Cloud. The results of our extensive experimental evaluation of these prototypes with several real-world data sets and applications validate the effectiveness of our techniques which provide orders-of-magnitude reductions in the overheads of distributed data querying and analysis in the Cloud.
Resumo:
Turnip crinkle virus (TCV) and Pea enation mosaic virus (PEMV) are two positive (+)-strand RNA viruses that are used to investigate the regulation of translation and replication due to their small size and simple genomes. Both viruses contain cap-independent translation elements (CITEs) within their 3´ untranslated regions (UTRs) that fold into tRNA-shaped structures (TSS) according to nuclear magnetic resonance and small angle x-ray scattering analysis (TCV) and computational prediction (PEMV). Specifically, the TCV TSS can directly associate with ribosomes and participates in RNA-dependent RNA polymerase (RdRp) binding. The PEMV kissing-loop TSS (kl-TSS) can simultaneously bind to ribosomes and associate with the 5´ UTR of the viral genome. Mutational analysis and chemical structure probing methods provide great insight into the function and secondary structure of the two 3´ CITEs. However, lack of 3-D structural information has limited our understanding of their functional dynamics. Here, I report the folding dynamics for the TCV TSS using optical tweezers (OT), a single molecule technique. My study of the unfolding/folding pathways for the TCV TSS has provided an unexpected unfolding pathway, confirmed the presence of Ψ3 and hairpin elements, and suggested an interconnection between the hairpins and pseudoknots. In addition, this study has demonstrated the importance of the adjacent upstream adenylate-rich sequence for the formation of H4a/Ψ3 along with the contribution of magnesium to the stability of the TCV TSS. In my second project, I report on the structural analysis of the PEMV kl-TSS using NMR and SAXS. This study has re-confirmed the base-pair pattern for the PEMV kl-TSS and the proposed interaction of the PEMV kl-TSS with its interacting partner, hairpin 5H2. The molecular envelope of the kl-TSS built from SAXS analysis suggests the kl-TSS has two functional conformations, one of which has a different shape from the previously predicted tRNA-shaped form. Along with applying biophysical methods to study the structural folding dynamics of RNAs, I have also developed a technique that improves the production of large quantities of recombinant RNAs in vivo for NMR study. In this project, I report using the wild-type and mutant E.coli strains to produce cost-effective, site-specific labeled, recombinant RNAs. This technique was validated with four representative RNAs of different sizes and complexity to produce milligram amounts of RNAs. The benefit of using site-specific labeled RNAs made from E.coli was demonstrated with several NMR techniques.
Resumo:
The Picornaviridae family consists of positive-strand RNA viruses that are the causative agents of a variety of diseases in humans and animals. Few drugs targeting picornaviruses are available, making the discovery of new antivirals a high priority. Here, we identified and characterized three compounds from a library of kinase inhibitors that block replication of poliovirus, coxsackievirus B3, and encephalomyocarditis virus. The antiviral effect of these compounds is not likely related to their known cellular targets because other inhibitors targeting the same pathways did not inhibit viral replication. Using an in vitro translation-replication system, we showed that these drugs inhibit different stages of the poliovirus life cycle. A4(1) inhibited the formation of a functional replication complex, while E5(1) and E7(2) affected replication after the replication complex had formed. A4(1) demonstrated partial protection from paralysis in a murine model of poliomyelitis. Poliovirus resistant to E7(2) had a single mutation in the 3A protein. This mutation was previously found to confer resistance to enviroxime-like compounds, which target either PI4KIIIβ (major enviroxime-like compounds) or OSBP (minor enviroxime-like compounds), cellular factors involved in lipid metabolism and shown to be important for replication of diverse positive-strand RNA viruses. We classified E7(2) as a minor enviroxime-like compound, because the localization of OSBP changed in the presence of this inhibitor. Interestingly, both E7(2) and major enviroxime-like compound GW5074 interfered with the viral polyprotein processing. Multiple attempts to isolate resistant mutants in the presence of A4(1) or E5(1) were unsuccessful, showing that effective broad-spectrum antivirals could be developed on the basis of these compounds. Studies with these compounds shed light on pathways shared by diverse picornaviruses that could be potential targets for the development of broad-spectrum antiviral drugs.
Resumo:
Picornaviruses are a group of human and animal pathogens capable of inflicting serious public health diseases and economic burdens. Treatments options through vaccines for prevention or antivirals to cure infection are not available for the vast majority of these viruses. These shortcomings, in the development of vaccines or antivirals therapeutic, are linked to the genetic diversity and to an incomplete understanding of the biology of these viruses. Despite the diverse host range, this group of positive-strand RNA viruses shares the same replication mechanisms, including the development of membranous structures (replication organelles) in the cytoplasm of infected cells. The development of these membranous structures, which serve as sites for the replication of the viral RNA genome, has been linked to the hijacking of elements of the cellular membrane metabolism pathways. Here we show that upon picornavirus infection, there is a specific activation of acyl-CoA synthetase enzymes resulting in strong import and accumulation of long chain fatty acids in the cytoplasm of infected cells. We show that the newly imported fatty acids serve as a substrate for the upregulation of phosphatidylcholine synthesis required for the structural development of replication organelles. In this work, we identified that acyl-CoA synthetase long chain 3 (ACSL3) is required for the upregulation of lipids syntheses and the replication of poliovirus. We have shown that the poliovirus protein 2A was required but not sufficient for the activation of import of long chain fatty acids in infected cells. We demonstrated that the fatty acid import is upregulated upon infection by diverse picornaviruses and that such upregulation is not dependent on activation of ER stress response or the autophagy pathways. In this work, we have demonstrated that phosphatidylcholine was required for the structural development of replication organelles. Phosphatidylcholine synthesis was dispensable for the production of infectious particles at high MOI but required at a low MOI for the protection of the replication complexes from the cellular innate immunity mechanisms.