139 results for genetic transcription

at Indian Institute of Science - Bangalore - India


Relevance:

30.00%

Publisher:

Abstract:

Background: A genetic network can be represented as a directed graph in which a node corresponds to a gene and a directed edge specifies the direction of influence of one gene on another. The reconstruction of such networks from transcript profiling data remains an important yet challenging endeavor. A transcript profile specifies the abundances of many genes in a biological sample of interest. Prevailing strategies for learning the structure of a genetic network from high-dimensional transcript profiling data assume sparsity and linearity. Many methods consider relatively small directed graphs, inferring graphs with up to a few hundred nodes. This work examines large undirected graph representations of genetic networks, graphs with many thousands of nodes where an undirected edge between two nodes does not indicate the direction of influence, and the problem of estimating the structure of such a sparse linear genetic network (SLGN) from transcript profiling data. Results: The structure learning task is cast as a sparse linear regression problem which is then posed as a LASSO (l1-constrained fitting) problem and solved finally by formulating a Linear Program (LP). A bound on the Generalization Error of this approach is given in terms of the Leave-One-Out Error. The accuracy and utility of LP-SLGNs are assessed quantitatively and qualitatively using simulated and real data. The Dialogue for Reverse Engineering Assessments and Methods (DREAM) initiative provides gold standard data sets and evaluation metrics that enable and facilitate the comparison of algorithms for deducing the structure of networks. The structures of LP-SLGNs estimated from the INSILICO1, INSILICO2 and INSILICO3 simulated DREAM2 data sets are comparable to those proposed by the first and/or second ranked teams in the DREAM2 competition. The structures of LP-SLGNs estimated from two published Saccharomyces cerevisiae cell cycle transcript profiling data sets capture known regulatory associations.
In each S. cerevisiae LP-SLGN, the number of nodes with a particular degree follows an approximate power law, suggesting that its degree distribution is similar to that observed in real-world networks. Inspection of these LP-SLGNs suggests biological hypotheses amenable to experimental verification. Conclusion: A statistically robust and computationally efficient LP-based method for estimating the topology of a large sparse undirected graph from high-dimensional data yields representations of genetic networks that are biologically plausible and useful abstractions of the structures of real genetic networks. Analysis of the statistical and topological properties of learned LP-SLGNs may have practical value; for example, genes with high random walk betweenness, a measure of the centrality of a node in a graph, are good candidates for intervention studies and hence for integrated computational-experimental investigations designed to infer more realistic and sophisticated probabilistic directed graphical model representations of genetic networks. The LP-based solutions of the sparse linear regression problem described here may provide a method for learning the structure of transcription factor networks from transcript profiling and transcription factor binding motif data.
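One standard way to turn an l1-constrained fit into a linear program is sketched below. It is an illustration, not necessarily the formulation used in the paper: the absolute-error loss, the bound t, and the symbols y, X, w, u, v and ξ are all assumptions introduced here, since the abstract does not state them. Here y holds the abundances of one target gene across n samples and X holds the abundances of the remaining genes.

```latex
% Assumed l1-constrained fit with absolute-error loss (not stated in the abstract)
\min_{w} \; \lVert y - Xw \rVert_1
\quad \text{subject to} \quad \lVert w \rVert_1 \le t

% Equivalent linear program: split w = u - v with u, v >= 0,
% and bound each residual by a slack variable \xi_k
\min_{u,\,v,\,\xi} \; \sum_{k=1}^{n} \xi_k
\quad \text{subject to} \quad
\begin{aligned}
  -\xi \;\le\; y - X(u - v) \;&\le\; \xi, \\
  \sum_{j} (u_j + v_j) \;&\le\; t, \\
  u,\; v,\; \xi \;&\ge\; 0 .
\end{aligned}
```

Under this sketch, a nonzero coefficient w_j = u_j - v_j would be read as an undirected edge between the target gene and gene j, and repeating the fit with each gene as the target would yield the whole graph.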

Relevance:

30.00%

Publisher:

Abstract:

Purpose: Waardenburg syndrome (WS) is characterized by sensorineural hearing loss and pigmentation defects of the eye, skin, and hair. It is caused by mutations in one of the following genes: PAX3 (paired box 3), MITF (microphthalmia-associated transcription factor), EDNRB (endothelin receptor type B), EDN3 (endothelin 3), SNAI2 (snail homolog 2, Drosophila) and SOX10 (SRY-box containing gene 10). Duchenne muscular dystrophy (DMD) is an X-linked recessive disorder caused by mutations in the DMD gene. The purpose of this study was to identify the genetic causes of WS and DMD in an Indian family with two patients: one affected with WS and DMD, and another one affected with only WS. Methods: Blood samples were collected from individuals for genomic DNA isolation. To determine the linkage of this family to the eight known WS loci, microsatellite markers were selected from the candidate regions and used to genotype the family. Exon-specific intronic primers for EDN3 were used to amplify and sequence DNA samples from affected individuals to detect mutations. A mutation in DMD was identified by multiplex PCR and multiplex ligation-dependent probe amplification method using exon-specific probes. Results: Pedigree analysis suggested segregation of WS as an autosomal recessive trait in the family. Haplotype analysis suggested linkage of the family to the WS4B (EDN3) locus. DNA sequencing identified a novel missense mutation p.T98M in EDN3. A deletion mutation was identified in DMD. Conclusions: This study reports a novel missense mutation in EDN3 and a deletion mutation in DMD in the same Indian family. The present study will be helpful in genetic diagnosis of this family and increases the mutation spectrum of EDN3.

Relevance:

30.00%

Publisher:

Abstract:

Yeast Rpb4, a subunit of RNA pol II, is not essential for viability but is involved in multiple cellular phenotypes such as temperature sensitivity, enhanced pseudohyphal morphology, and decreased sporulation. Both in vivo and in vitro studies strongly support involvement of Rpb4 in transcription initiation, while the evidence for its role in transcription elongation is not entirely consistent. Here we show that Rpb4 is not required for recruitment of RNA pol II on the coding region of YLR454w, a representative long gene. Yet we find strong genetic interaction of rpb4Δ with mutants in many transcription elongation factors such as Paf1, Spt4, Dst1, Elp3 and Rpb9. We demonstrate that Rpb4 interacts functionally with Paf1 to affect the transcription elongation of the FKS1 gene. Our results suggest that while Rpb4 is not required for general transcription elongation, it could support transcription elongation for a specific class of genes by interaction with other elongation factors. (C) 2014 Elsevier B.V. All rights reserved.

Relevance:

20.00%

Publisher:

Abstract:

Several late gene expression factors (Lefs) have been implicated in fostering high levels of transcription from the very late gene promoters of polyhedrin and p10 from baculoviruses. We cloned and characterized from Bombyx mori nuclear polyhedrosis virus a late gene expression factor (Bmlef2) that encodes a 209-amino-acid protein harboring a Cys-rich C-terminal domain. The temporal transcription profiles of lef2 revealed a 1.2-kb transcript in both delayed early and late periods after virus infection. Transcription start site mapping identified the presence of an aphidicolin-sensitive late transcript arising from a TAAG motif located at -352 nucleotides and an aphidicolin-insensitive early transcript originating from a TTGT motif located 35 nucleotides downstream of a TATA box at -312 nucleotides, with respect to the +1 ATG of lef2. BmLef2 trans-activated very late gene expression from both polyhedrin and p10 promoters in transient expression assays. Internal deletion of the Cys-rich domain from the C-terminal region abolished the transcriptional activation. Inactivation of Lef2 synthesis by antisense lef2 transcripts drastically reduced very late gene transcription but had little effect on expression from the immediate early promoter. A decrease in viral DNA synthesis and a reduction in virus titer were observed only when antisense lef2 was expressed under the immediate early (ie-1) promoter. Furthermore, the antisense experiments suggested that lef2 plays a direct role in very late gene transcription.

Relevance:

20.00%

Publisher:

Abstract:

Our current understanding of the evolution of the histone gene family suffers from a lack of information on plant histone genes1. With a view to gathering some much needed information on these genes, we studied a rice genomic clone in pBR322 carrying H2A, H2B and H4 histone genes on a DNA fragment2 of 6.64 kilobases (kb). A restriction map of the insert was constructed and the organization of the three genes on this insert was determined. H2A and H2B histone genes were located at one end of the insert and H4 gene at the other with a 3.1 kb spacer in between. This cluster of three histone genes was found to be transcribed in a bidirectional fashion with H2A and H2B genes being encoded by one strand and the H4 gene by the other. These results indicate that plant histone gene organization differs from that of the sea urchin, but shows many similarities to the systems in other animals.

Relevance:

20.00%

Publisher:

Abstract:

New antiretroviral drugs that offer large genetic barriers to resistance, such as the recently approved inhibitors of HIV-1 protease, tipranavir and darunavir, present promising weapons to avert the failure of current therapies for HIV infection. Optimal treatment strategies with the new drugs, however, are yet to be established. A key limitation is the poor understanding of the process by which HIV surmounts large genetic barriers to resistance. Extant models of HIV dynamics are predicated on the predominance of deterministic forces underlying the emergence of resistant genomes. In contrast, stochastic forces may dominate, especially when the genetic barrier is large, and delay the emergence of resistant genomes. We develop a mathematical model of HIV dynamics under the influence of an antiretroviral drug to predict the waiting time for the emergence of genomes that carry the requisite mutations to overcome the genetic barrier of the drug. We apply our model to describe the development of resistance to tipranavir in in vitro serial passage experiments. Model predictions of the times of emergence of different mutant genomes with increasing resistance to tipranavir are in quantitative agreement with experiments, indicating that our model captures the dynamics of the development of resistance to antiretroviral drugs accurately. Further, model predictions provide insights into the influence of underlying evolutionary processes such as recombination on the development of resistance, and suggest guidelines for drug design: drugs that offer large genetic barriers to resistance with resistance sites tightly localized on the viral genome and exhibiting positive epistatic interactions maximally inhibit the emergence of resistant genomes.

Relevance:

20.00%

Publisher:

Abstract:

Using computer modeling of three-dimensional structures and structural information available on the crystal structures of HIV-1 protease, we investigated the structural effects of mutations in treatment-naive and treatment-exposed individuals from India and postulated mechanisms of resistance in clade C variants. A large number of models (14) have been generated by computational mutation of the available crystal structures of drug-bound proteases. Localized energy minimization was carried out in and around the sites of mutation in order to optimize the geometry of interactions present. Most of the mutations result in structural differences at the flap that favor the semiopen state of the enzyme. Some of the mutations were also found to confer resistance by affecting the geometry of the active site. The E35D mutation affects the flap structure in clade B strains, and the E35N and E35K mutations, seen in our modeled strains, have a more profound effect. Common polymorphisms at positions 36 and 63 in clade C also affected flap structure. Apart from a few other residues, Gln-58, Asn-83, Asn-88, and Gln-92 and their interactions are important for the transition from the closed to the open state. Development of protease inhibitors by structure-based design requires investigation of mechanisms operative for clade C to improve the efficacy of therapy.

Relevance:

20.00%

Publisher:

Abstract:

The leader protease (L-pro) and capsid-coding sequences (P1) constitute approximately 3 kb of the foot-and-mouth disease virus (FMDV). We studied the phylogenetic relationship of 46 FMDV serotype A isolates of Indian origin collected during the period 1968-2005 and also eight vaccine strains using the neighbour-joining tree and Bayesian tree methods. The viruses were categorized under three major groups - Asian, Euro-South American and European. The Indian isolates formed a distinct genetic group among the Asian isolates. The Indian isolates were further classified into different genetic subgroups (<5% divergence). Post-1995 isolates were divided into two subgroups while a few isolates which originated in the year 2005 from Andhra Pradesh formed a separate group. These isolates were closely related to the isolates of the 1970s. The FMDV isolates seem to undergo reverse mutation or convergent evolution wherein sequences identical to the ancestors are present in the isolates in circulation. The eight vaccine strains included in the study were not related to each other and belonged to different genetic groups. Recombination was detected in the L-pro region in one isolate (A IND 20/82) and in the VP1 coding 1D region in another isolate (A RAJ 21/96). Positive selection was identified at aa position 23 in the L-pro (P<0.05; 0.046*) and at aa 171 in the capsid protein VP1 (P<0.01; 0.003**).

Relevance:

20.00%

Publisher:

Abstract:

Consanguineous marriages are strongly favoured among the populations of South India. In a study conducted on 407 infants and children, a total of 35 genetic diseases was diagnosed in 63 persons: 44 with single gene defects, 12 with polygenic disorders, and seven with Down's syndrome. The coefficient of inbreeding of the total study group, F = 0.0414, was significantly higher than that previously calculated for the general population, F = 0.0271, and autosomal recessive disorders formed the largest single disease category diagnosed. The results suggest that long term inbreeding may not have resulted in appreciable elimination of recessive lethals and sub-lethals from the gene pool.
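A population-level coefficient of inbreeding such as the F values quoted above is commonly obtained as a weighted mean of the F values of individual parental relationships. The sketch below assumes that approach; the function name, the category labels and the counts in the usage line are hypothetical and are not taken from the study. The per-relationship F values (1/8 for uncle-niece, 1/16 for first cousins, and so on) are the standard values for the offspring of such unions.

```python
from fractions import Fraction

# Standard inbreeding coefficients for the offspring of each parental relationship.
F_BY_RELATIONSHIP = {
    "uncle_niece": Fraction(1, 8),
    "first_cousin": Fraction(1, 16),
    "first_cousin_once_removed": Fraction(1, 32),
    "second_cousin": Fraction(1, 64),
    "unrelated": Fraction(0),
}


def mean_inbreeding(counts):
    """Return the mean F over all individuals, weighted by category counts.

    counts maps a relationship label (a key of F_BY_RELATIONSHIP) to the
    number of individuals whose parents have that relationship.
    """
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no individuals supplied")
    weighted = sum(F_BY_RELATIONSHIP[kind] * n for kind, n in counts.items())
    return float(weighted / total)


# Hypothetical counts, for illustration only (not data from the abstract).
example = {"uncle_niece": 10, "first_cousin": 20, "unrelated": 70}
print(mean_inbreeding(example))
```

With these made-up counts, the uncle-niece and first-cousin groups each contribute 1.25 to the weighted sum, so the mean over 100 individuals is 0.025.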

Relevance:

20.00%

Publisher:

Abstract:

A stretch of 71 nucleotides in a 1.2 kilobase pair Pst I fragment of rice DNA was identified as a tRNA(Gly) gene by hybridization and nucleotide sequence analyses. The hybridization of genomic DNA with the tRNA gene showed that there are about 10 glycine tRNA genes per diploid rice genome. The 3' and 5' internal control regions, where RNA polymerase III and transcription factors bind, were found to be present in the coding sequence. The gene was transcribed into a 4S product in a yeast cell-free extract. The substitution of the 5' internal control region with analogous sequences from either M13mp19 or M13mp18 DNA did not affect the transcription of the gene in vitro. Changes in three highly conserved nucleotides in the consensus 5' internal control region (RGYNNARYGG; R = purine, Y = pyrimidine, N = any nucleotide) did not affect transcription, showing that these nucleotides are not essential for promotion of transcription. There were two 16 base pair repeats, 'TGTTTGTTTCAGCTTA', at positions -130 and -375 upstream from the start of the gene. Deletion of 5' flanking sequences including the 16 base pair repeat at -375 increased transcription, indicating that these sequences negatively modulate the expression of the gene.

Relevance:

20.00%

Publisher:

Abstract:

Numerically discretized dynamic optimization problems having active inequality and equality path constraints that, along with the dynamics, induce locally high index differential algebraic equations often cause the optimizer to fail to converge or to produce degraded control solutions. In many applications, regularization of the numerically discretized problem in direct transcription schemes by perturbing the high index path constraints helps the optimizer to converge to useful control solutions. For complex engineering problems with many constraints it is often difficult to find effective nondegenerate perturbations that produce useful solutions in some neighborhood of the correct solution. In this paper we describe a numerical discretization that regularizes the numerically consistent discretized dynamics and does not perturb the path constraints. For all values of the regularization parameter the discretization remains numerically consistent with the dynamics and the path constraints specified in the original problem. The regularization is quantifiable in terms of time step size in the mesh and the regularization parameter. For fully regularized systems the scheme converges linearly in time step size. The method is illustrated with examples.

Relevance:

20.00%

Publisher:

Abstract:

A model is suggested for mammalian male determination based on interactions postulated to occur among an autosomal repressor gene, an X-linked male-determining gene termed Tdx, and multiple copies of certain DNA sequences on the Y chromosome that do not code for any protein. The repressor, synthesized in limited amounts, has higher affinity for the Y-linked sequences than for Tdx, and its affinity for Tdx is greater than that of RNA polymerase. In XY cells the Y effectively binds all available repressor, permitting transcription of Tdx to occur. In XX cells, since competition from the Y-linked high-affinity sequences is absent, the repressor binds to Tdx and prevents transcription. As a result of this competition between Tdx and the Y-linked high-affinity sites for limiting concentrations of the autosomal repressor, the product of the Tdx gene (TDX) is synthesized in the male but not in the female. It is suggested that in determination of the male sex, the role of the Y chromosome is to serve as a sink for the Tdx repressor. The proposed interactions provide a plausible explanation for the genetic properties of several anomalies of sexual development in mouse, man, and other mammals. The model suggests that the postulated multiple, high-affinity sequences on the Y chromosome of the mouse are included among the DNA sequences referred to as the Sxr-Bkm sequences.

Relevance:

20.00%

Publisher:

Abstract:

Simple formalized rules are proposed for automatic phonetic transcription of Tamil words into Roman script. These rules are syntax-directed and require only a one-symbol look-ahead facility, and are hence easily automated on a digital computer. Some suggestions are also put forth for the linearization of Tamil script so that it can be handled by modern machinery.
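To illustrate what a one-symbol look-ahead buys, the sketch below romanizes a handful of Tamil letters by peeking at the next code point to decide whether a consonant keeps its inherent "a", takes a vowel sign, or is muted by the pulli. It is not the paper's rule set: the function name, the tiny letter tables and the romanization choices (for example "zh" and "aa") are assumptions made for this example, and it works on Unicode text rather than on the script encoding the authors had available.

```python
PULLI = "\u0BCD"  # Tamil sign that suppresses a consonant's inherent vowel

# Small illustrative subsets only; a real table would cover the full script.
CONSONANTS = {
    "\u0B95": "k",   # ka
    "\u0BA4": "t",   # ta
    "\u0BAE": "m",   # ma
    "\u0BB2": "l",   # la
    "\u0BB4": "zh",  # zha
}
VOWEL_SIGNS = {
    "\u0BBE": "aa",  # sign aa
    "\u0BBF": "i",   # sign i
    "\u0BC1": "u",   # sign u
}
INDEPENDENT_VOWELS = {
    "\u0B85": "a",
    "\u0B87": "i",
    "\u0B89": "u",
}


def transliterate(word):
    """Romanize a Tamil word using a single symbol of look-ahead."""
    out = []
    i = 0
    while i < len(word):
        ch = word[i]
        nxt = word[i + 1] if i + 1 < len(word) else ""
        if ch in CONSONANTS:
            out.append(CONSONANTS[ch])
            if nxt == PULLI:
                i += 2          # bare consonant: consume the pulli, no vowel
            elif nxt in VOWEL_SIGNS:
                out.append(VOWEL_SIGNS[nxt])
                i += 2          # consonant plus explicit vowel sign
            else:
                out.append("a")
                i += 1          # inherent vowel
        elif ch in INDEPENDENT_VOWELS:
            out.append(INDEPENDENT_VOWELS[ch])
            i += 1
        else:
            out.append(ch)      # pass unknown symbols through unchanged
            i += 1
    return "".join(out)


# The word "Tamil" itself: ta, ma + sign i, zha + pulli.
print(transliterate("\u0BA4\u0BAE\u0BBF\u0BB4\u0BCD"))
```

Because each decision depends only on the current symbol and the one after it, the loop runs in a single left-to-right pass with no backtracking, which is the property that makes such rules easy to automate.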

Relevance:

20.00%

Publisher:

Abstract:

Background: The Mycobacterium leprae genome has less than 50% coding capacity and 1,133 pseudogenes. Preliminary evidence suggests that some pseudogenes are expressed. Therefore, defining pseudogene transcriptional and translational potentials of this genome should increase our understanding of their impact on M. leprae physiology. Results: Gene expression analysis identified transcripts from 49% of all M. leprae genes including 57% of all ORFs and 43% of all pseudogenes in the genome. Transcribed pseudogenes were randomly distributed throughout the chromosome. Factors resulting in pseudogene transcription included: 1) co-orientation of transcribed pseudogenes with transcribed ORFs within or exclusive of operon-like structures; 2) the paucity of intrinsic stem-loop transcriptional terminators between transcribed ORFs and downstream pseudogenes; and 3) predicted pseudogene promoters. Mechanisms for translational "silencing" of pseudogene transcripts included the lack of both translational start codons and strong Shine-Dalgarno (SD) sequences. Transcribed pseudogenes also contained multiple "in-frame" stop codons and high Ka/Ks ratios, compared to those of homologs in M. tuberculosis and ORFs in M. leprae. A pseudogene transcript containing an active promoter, a strong SD site, and a start codon, but also two in-frame stop codons, yielded a protein product when expressed in E. coli. Conclusion: Approximately half of M. leprae's transcriptome consists of inactive gene products consuming energy and resources without potential benefit to M. leprae. Presently it is unclear what additional detrimental effect(s) this large number of inactive mRNAs has on the functional capability of this organism. Translation of these pseudogenes may play an important role in overall energy consumption and resultant pathophysiological characteristics of M. leprae.
However, this study also demonstrated that multiple translational "silencing" mechanisms are present, reducing additional energy and resource expenditure required for protein production from the vast majority of these transcripts.