7 resultados para Vector coding
em AMS Tesi di Dottorato - Alm@DL - Università di Bologna
Resumo:
Motivation An actual issue of great interest, both under a theoretical and an applicative perspective, is the analysis of biological sequences for disclosing the information that they encode. The development of new technologies for genome sequencing in the last years, opened new fundamental problems since huge amounts of biological data still deserve an interpretation. Indeed, the sequencing is only the first step of the genome annotation process that consists in the assignment of biological information to each sequence. Hence given the large amount of available data, in silico methods became useful and necessary in order to extract relevant information from sequences. The availability of data from Genome Projects gave rise to new strategies for tackling the basic problems of computational biology such as the determination of the tridimensional structures of proteins, their biological function and their reciprocal interactions. Results The aim of this work has been the implementation of predictive methods that allow the extraction of information on the properties of genomes and proteins starting from the nucleotide and aminoacidic sequences, by taking advantage of the information provided by the comparison of the genome sequences from different species. In the first part of the work a comprehensive large scale genome comparison of 599 organisms is described. 2,6 million of sequences coming from 551 prokaryotic and 48 eukaryotic genomes were aligned and clustered on the basis of their sequence identity. This procedure led to the identification of classes of proteins that are peculiar to the different groups of organisms. Moreover the adopted similarity threshold produced clusters that are homogeneous on the structural point of view and that can be used for structural annotation of uncharacterized sequences. The second part of the work focuses on the characterization of thermostable proteins and on the development of tools able to predict the thermostability of a protein starting from its sequence. By means of Principal Component Analysis the codon composition of a non redundant database comprising 116 prokaryotic genomes has been analyzed and it has been showed that a cross genomic approach can allow the extraction of common determinants of thermostability at the genome level, leading to an overall accuracy in discriminating thermophilic coding sequences equal to 95%. This result outperform those obtained in previous studies. Moreover, we investigated the effect of multiple mutations on protein thermostability. This issue is of great importance in the field of protein engineering, since thermostable proteins are generally more suitable than their mesostable counterparts in technological applications. A Support Vector Machine based method has been trained to predict if a set of mutations can enhance the thermostability of a given protein sequence. The developed predictor achieves 88% accuracy.
Resumo:
The thesis deals with channel coding theory applied to upper layers in the protocol stack of a communication link and it is the outcome of four year research activity. A specific aspect of this activity has been the continuous interaction between the natural curiosity related to the academic blue-sky research and the system oriented design deriving from the collaboration with European industry in the framework of European funded research projects. In this dissertation, the classical channel coding techniques, that are traditionally applied at physical layer, find their application at upper layers where the encoding units (symbols) are packets of bits and not just single bits, thus explaining why such upper layer coding techniques are usually referred to as packet layer coding. The rationale behind the adoption of packet layer techniques is in that physical layer channel coding is a suitable countermeasure to cope with small-scale fading, while it is less efficient against large-scale fading. This is mainly due to the limitation of the time diversity inherent in the necessity of adopting a physical layer interleaver of a reasonable size so as to avoid increasing the modem complexity and the latency of all services. Packet layer techniques, thanks to the longer codeword duration (each codeword is composed of several packets of bits), have an intrinsic longer protection against long fading events. Furthermore, being they are implemented at upper layer, Packet layer techniques have the indisputable advantages of simpler implementations (very close to software implementation) and of a selective applicability to different services, thus enabling a better matching with the service requirements (e.g. latency constraints). Packet coding technique improvement has been largely recognized in the recent communication standards as a viable and efficient coding solution: Digital Video Broadcasting standards, like DVB-H, DVB-SH, and DVB-RCS mobile, and 3GPP standards (MBMS) employ packet coding techniques working at layers higher than the physical one. In this framework, the aim of the research work has been the study of the state-of-the-art coding techniques working at upper layer, the performance evaluation of these techniques in realistic propagation scenario, and the design of new coding schemes for upper layer applications. After a review of the most important packet layer codes, i.e. Reed Solomon, LDPC and Fountain codes, in the thesis focus our attention on the performance evaluation of ideal codes (i.e. Maximum Distance Separable codes) working at UL. In particular, we analyze the performance of UL-FEC techniques in Land Mobile Satellite channels. We derive an analytical framework which is a useful tool for system design allowing to foresee the performance of the upper layer decoder. We also analyze a system in which upper layer and physical layer codes work together, and we derive the optimal splitting of redundancy when a frequency non-selective slowly varying fading channel is taken into account. The whole analysis is supported and validated through computer simulation. In the last part of the dissertation, we propose LDPC Convolutional Codes (LDPCCC) as possible coding scheme for future UL-FEC application. Since one of the main drawbacks related to the adoption of packet layer codes is the large decoding latency, we introduce a latency-constrained decoder for LDPCCC (called windowed erasure decoder). We analyze the performance of the state-of-the-art LDPCCC when our decoder is adopted. Finally, we propose a design rule which allows to trade-off performance and latency.
Resumo:
Bioremediation implies the use of living organisms, primarily microorganisms, to convert environmental contaminants into less toxic forms. The impact of the consequences of hydrocarbon release in the environment maintain a high research interest in the study of microbial metabolisms associated with the biodegradation of aromatic and aliphatic hydrocarbons but also in the analysis of microbial enzymes that can convert petroleum substrates to value-added products. The studies described in this Thesis fall within the research field that directs the efforts into identifying gene/proteins involved in the catabolism of n-alkanes and into studying the regulatory mechanisms leading to their oxidation. In particular the studies were aimed at investigating the molecular aspects of the ability of Rhodococcus sp. BCP1 to grow on aliphatic hydrocarbons as sole carbon and energy sources. We studied the ability of Rhodococcus sp. BCP1 to grow on gaseous (C2-C4), liquid (C5-C16) and solid (C17-C28) n-alkanes that resulted to be biochemically correlated with the activity of one or more monooxygenases. In order to identify the alkane monooxygenase that is involved in the n-alkanes degradation pathway in Rhodococcus sp. BCP1, PCR-based methodology was applied by using degenerate primers targeting AlkB monooxygenase family members. As result, a chromosomal region, including the alkB gene cluster, was cloned from Rhodococcus sp. BCP1 genome. We characterized the products of this alkB gene cluster and the products of the orfs included in the flanking regions by comparative analysis with the homologues in the database. alkB gene expression studies were carried out by RT-PCR and by the construction of a promoter probe vector containing the lacZ gene downstream of the alkB promoter. B-galactosidase assays revealed the alkB promoter activity induced by n-alkanes and by n-alkanes metabolic products. Furthermore, the transcriptional start of alkB gene was determined by primer extension procedure. A proteomic approach was subsequently applied to compare the protein patterns expressed by BCP1 growing on n-butane, n-hexane, n-hexadecane or n-eicosane with the protein pattern expressed by BCP1 growing on succinate. The accumulation of enzymes specifically induced on n-alkanes was determined. These enzymes were identified by tandem mass spectrometry (LC/MS/MS). Finally, a prm gene, homologue to the gene family coding for soluble di-iron monooxygenases (SDIMOs), has been isolated from Rhodococcus sp. BCP1 genome. This gene product could be involved in the degradation of gaseous n-alkanes in this Rhodococcus strain. The versatility in utilizing hydrocarbons and the discovery of new remarkable metabolic activities outline the potential applications of this microorganism in environmental and industrial biotechnologies.
Resumo:
Many psychophysical studies suggest that target depth and direction during reaches are processed independently, but the neurophysiological support to this view is so far limited. Here, we investigated the representation of reach depth and direction by single neurons in an area of the medial posterior parietal cortex (V6A). Single-unit activity was recorded from V6A in two Macaca fascicularis monkeys performing a fixation-to-reach task to targets at different depths and directions. We found that in a substantial percentage of V6A neurons depth and direction signals jointly influenced fixation, planning and arm movement-related activity in 3D space. While target depth and direction were equally encoded during fixation, depth tuning became stronger during arm movement planning, execution and target holding. The spatial tuning of fixation activity was often maintained across epochs, and this occurred more frequently in depth. These findings support for the first time the existence of a common neural substrate for the encoding of target depth and direction during reaching movements in the posterior parietal cortex. Present results also highlight the presence in V6A of several types of cells that process independently or jointly eye position and arm movement planning and execution signals in order to control reaches in 3D space. It is possible that depth and direction influence also the metrics of the reach action and that this effect on the reach kinematic variables can account for the spatial tuning we found in V6A neural activity. For this reason, we recorded and analyzed behavioral data when one monkey performed reaching movements in 3-D space. We evaluated how the target spatial position, in particular target depth and target direction, affected the kinematic parameters and trajectories describing the motor action properties.
Resumo:
Ultra-relativistic heavy ions generate strong electromagnetic fields which offer the possibility to study γ-γ and γ-nucleus processes at the LHC in the so called ultra-peripheral collisions (UPC). The photoproduction of J/ψ vector mesons in UPC is sensitive to the gluon distribution of the interacting nuclei. In this thesis the study of coherent and incoherent J/ψ production in Pb-Pb collisions at √sNN = 2.76 TeV is described. The J/ψ has been measured via its leptonic decay in the rapidity range -0.9 < y < 0.9. The cross section for coherent and incoherent J/ψ are given. The results are compared to theoretical models for J/ψ production and the coherent cross section is found to be in good agreement with those models which include nuclear gluon shadowing consistent with EPS09 parametrization. In addition the cross section for the process γ γ→ e+e− has been measured and found to be in agreement with the STARLIGHT Monte Carlo predictions. The analysis has been published by the ALICE Collaboration in the European Physical Journal C, with one of its main plot depicted on the cover-front of the November 2013 issue.
Resumo:
The transcribed ultraconserved regions (T-UCRs) are a group of long non-coding RNAs involved in human carcinogenesis. The factors regulating the expression of T-UCRs and their mechanism of action in human cancers are unknown. In this work it was shown that high expression of uc.339 associates with lower survival in 204 non-small cell lung cancer (NSCLC) patients. Moreover, it was shown that uc.339 found up-regulated in archival NSCLC samples, acts as a decoy RNA for miR-339-3p, -663-3p and -95-5p. So, Cyclin E2, a direct target of three microRNAs is up-regulated, inducing cancer growth and migration. Evidence of this mechanism was provided from cell lines and primary samples confirming that TP53 directly regulates uc.339. These results support a key role for uc.339 in lung cancer.