50 resultados para Corpora as translation resources
em Indian Institute of Science - Bangalore - Índia
Resumo:
Identifying translations from comparable corpora is a well-known problem with several applications, e.g. dictionary creation in resource-scarce languages. Scarcity of high quality corpora, especially in Indian languages, makes this problem hard, e.g. state-of-the-art techniques achieve a mean reciprocal rank (MRR) of 0.66 for English-Italian, and a mere 0.187 for Telugu-Kannada. There exist comparable corpora in many Indian languages with other ``auxiliary'' languages. We observe that translations have many topically related words in common in the auxiliary language. To model this, we define the notion of a translingual theme, a set of topically related words from auxiliary language corpora, and present a probabilistic framework for translation induction. Extensive experiments on 35 comparable corpora using English and French as auxiliary languages show that this approach can yield dramatic improvements in performance (e.g. MRR improves by 124% to 0.419 for Telugu-Kannada). A user study on WikiTSu, a system for cross-lingual Wikipedia title suggestion that uses our approach, shows a 20% improvement in the quality of titles suggested.
Resumo:
Scatter/Gather systems are increasingly becoming useful in browsing document corpora. Usability of the present-day systems are restricted to monolingual corpora, and their methods for clustering and labeling do not easily extend to the multilingual setting, especially in the absence of dictionaries/machine translation. In this paper, we study the cluster labeling problem for multilingual corpora in the absence of machine translation, but using comparable corpora. Using a variational approach, we show that multilingual topic models can effectively handle the cluster labeling problem, which in turn allows us to design a novel Scatter/Gather system ShoBha. Experimental results on three datasets, namely the Canadian Hansards corpus, the entire overlapping Wikipedia of English, Hindi and Bengali articles, and a trilingual news corpus containing 41,000 articles, confirm the utility of the proposed system.
Resumo:
Earlier workers have observed that in the leydig cell desensitization brings results in addition to down regulation of receptors, in leisons in the steroidogenlc pathway. In the present study immature rats having heavily leutinized ovaries were given 50 iu hCG and the desenasitized CL removed 48h later were used. At that time no change in the 5 3MSD activity and CAMP binding activity(a measure of CAMP dependent protein kinase) was observed.Followlng desensitization however,l)a significent increase in phosphodiestrase activity,ii)a 50% reduction in total mitochondrial cholesterol level, iii)a significant reduction in its ability to utilize cholesterol or hydrolyse its ester and iv)a significant lowering(by 66%)in cholesterol side chain clean age activity(by measuring pregnanalone formed) was observed. Pregnanalone production was restored to normalcy if exogenous cholesterol was added to the mitohondrial preparation. The results suggest that luteal desensitization is due in addition to down regulation of LH receptors, to a marked reduction in available cholesterol pool in the mitochondrial compartment. The increase in phosphodiestrase activity, though probably a secondary effect,might effectively contribute to the overall reduction in the steroid out-put by increasing the catabolism of CAMP.(Aided by grants from ICMR,New Delhi and WHO, Geneva).
Resumo:
A new clustering technique, based on the concept of immediato neighbourhood, with a novel capability to self-learn the number of clusters expected in the unsupervized environment, has been developed. The method compares favourably with other clustering schemes based on distance measures, both in terms of conceptual innovations and computational economy. Test implementation of the scheme using C-l flight line training sample data in a simulated unsupervized mode has brought out the efficacy of the technique. The technique can easily be implemented as a front end to established pattern classification systems with supervized learning capabilities to derive unified learning systems capable of operating in both supervized and unsupervized environments. This makes the technique an attractive proposition in the context of remotely sensed earth resources data analysis wherein it is essential to have such a unified learning system capability.
Resumo:
Background: The Mycobacterium leprae genome has less than 50% coding capacity and 1,133 pseudogenes. Preliminary evidence suggests that some pseudogenes are expressed. Therefore, defining pseudogene transcriptional and translational potentials of this genome should increase our understanding of their impact on M. leprae physiology. Results: Gene expression analysis identified transcripts from 49% of all M. leprae genes including 57% of all ORFs and 43% of all pseudogenes in the genome. Transcribed pseudogenes were randomly distributed throughout the chromosome. Factors resulting in pseudogene transcription included: 1) co-orientation of transcribed pseudogenes with transcribed ORFs within or exclusive of operon-like structures; 2) the paucity of intrinsic stem-loop transcriptional terminators between transcribed ORFs and downstream pseudogenes; and 3) predicted pseudogene promoters. Mechanisms for translational ``silencing'' of pseudogene transcripts included the lack of both translational start codons and strong Shine-Dalgarno (SD) sequences. Transcribed pseudogenes also contained multiple ``in-frame'' stop codons and high Ka/Ks ratios, compared to that of homologs in M. tuberculosis and ORFs in M. leprae. A pseudogene transcript containing an active promoter, strong SD site, a start codon, but containing two in frame stop codons yielded a protein product when expressed in E. coli. Conclusion: Approximately half of M. leprae's transcriptome consists of inactive gene products consuming energy and resources without potential benefit to M. leprae. Presently it is unclear what additional detrimental affect(s) this large number of inactive mRNAs has on the functional capability of this organism. Translation of these pseudogenes may play an important role in overall energy consumption and resultant pathophysiological characteristics of M. leprae. However, this study also demonstrated that multiple translational ``silencing'' mechanisms are present, reducing additional energy and resource expenditure required for protein production from the vast majority of these transcripts.
Resumo:
Medicinal and aromatic plants (MAPs) are an integral part of our biodiversity. In majority of MAP rich countries, wild collection practices are the livelihood options for a large number of rural peoples and MAPs play a significant role in socio-economic development of their communities. Recent concern over the alarming situation of the status of wild MAP resources, raw material quality, as well as social exploitation of rural communities, leads to the idea of certification for MAP resource conservation and management. On one hand, while MAP certification addresses environmental, social and economic perspectives of MAP resources, on the other hand, it ensures multi-stakeholder participation in improvement of the MAP sector. This paper presents an overview of MAP certification encompassing its different parameters, current scenario (Indian background), implementation strategies as well as stakeholders’ role in MAP conservation. It also highlights Indian initiatives in this direction.
Resumo:
The mechanism of translation in eubacteria and organelles is thought to be similar. In eubacteria, the three initiation factors IF1, IF2, and IF3 are vital. Although the homologs of IF2 and IF3 are found in mammalian mitochondria, an IF1 homolog has never been detected. Here, we show that bovine mitochondrial IF2 (IF2mt) complements E. coli containing a deletion of the IF2 gene (E. coli ΔinfB). We find that IF1 is no longer essential in an IF2mt-supported E. coli ΔinfB strain. Furthermore, biochemical and molecular modeling data show that a conserved insertion of 37 amino acids in the IF2mt substitutes for the function of IF1. Deletion of this insertion from IF2mt supports E. coli for the essential function of IF2. However, in this background, IF1 remains essential. These observations provide strong evidence that a single factor (IF2mt) in mammalian mitochondria performs the functions of two eubacterial factors, IF1 and IF2.
Resumo:
Optimal allocation of water resources for various stakeholders often involves considerable complexity with several conflicting goals, which often leads to multi-objective optimization. In aid of effective decision-making to the water managers, apart from developing effective multi-objective mathematical models, there is a greater necessity of providing efficient Pareto optimal solutions to the real world problems. This study proposes a swarm-intelligence-based multi-objective technique, namely the elitist-mutated multi-objective particle swarm optimization technique (EM-MOPSO), for arriving at efficient Pareto optimal solutions to the multi-objective water resource management problems. The EM-MOPSO technique is applied to a case study of the multi-objective reservoir operation problem. The model performance is evaluated by comparing with results of a non-dominated sorting genetic algorithm (NSGA-II) model, and it is found that the EM-MOPSO method results in better performance. The developed method can be used as an effective aid for multi-objective decision-making in integrated water resource management.
Resumo:
Translation initiation from the ribosomal P-site is the specialty of the initiator tRNAs (tRNA(fMet)). Presence of the three consecutive G-C base pairs (G29-C41, G30-C40 and G31-C39) in their anticodon stems, a highly conserved feature of the initiator tRNAs across the three kingdoms of life, has been implicated in their preferential binding to the P-site. How this feature is exploited by ribosomes has remained unclear. Using a genetic screen, we have isolated an Escherichia coli strain, carrying a G122D mutation in folD, which allows initiation with the tRNA(fMet) containing mutations in one, two or all the three G-C base pairs. The strain shows a severe deficiency of methionine and S-adenosylmethionine, and lacks nucleoside methylations in rRNA. Targeted mutations in the methyltransferase genes have revealed a connection between the rRNA modifications and the fundamental process of the initiator tRNA selection by the ribosome.
Resumo:
We have investigated the possible role of trans-acting factors interacting with the untranslated regions (UTRs) of coxsackievirus B3 (CVB3) RNA. We show here that polypyrimidine tract-binding protein (PTB) binds specifically to both 5' and 3' UTRs, but with different affinity. We have demonstrated that PTB is a bona fide internal ribosome entry site (IRES) trans-acting factor (ITAF) for CVB3 RNA by characterizing the effect of partial silencing of FIB ex vivo in He La cells. Furthermore, IRES activity in BSC-1 cells, which are reported to have a very low level of endogenous FIB, was found to be significantly lower than that in He La cells. Additionally, we have mapped the putative contact points of PTB on the 5' and 3' UTRs by an RNA toe-printing assay. We have shown that the 3' UTR is able to stimulate CVB3 IRES-mediated translation. Interestingly, a deletion of 15 nt at the 5' end or 14 rut at the 3' end of the CVB3 3' UTR reduced the 3' UTR-mediated enhancement of IRES activity ex vivo significantly, and a reduced interaction was shown with PTB. It appears that the FIB protein might help in circularization of the CVB3 RNA by bridging the ends necessary for efficient translation of the viral RNA.
Resumo:
In vitro translation of belladonna mottle virus BDMV(I) genomic RNA in a rabbit reticulocyte lysate system produced proteins of Mr 210,000, 150,000 and 78,000 which form the non-structural proteins. The coat protein, on the other hand, was expressed from a subgenomic RNA which was found to be encapsidated in the empty capsids forming the top component viral particles. The implications of subgenomic RNA encapsidation in viral replication and assembly are discussed.