926 resultados para Human genome, CpG islands, Markov models, DNA walk
Resumo:
Integration of human immunodeficiency virus (HIV) DNA into the human genome requires the virus-encoded integrase (IN) protein, and therefore the IN protein is a suitable target for antiviral strategies. To find a potent HIV IN inhibitor, we screened a "synthetic peptide combinatorial library." We identified a hexapeptide with the sequence HCKFWW that inhibits IN-mediated 3'-processing and integration with an IC50 of 2 microM. The peptide is active on IN proteins from other retroviruses such as HIV-2, feline immunodeficiency virus, and Moloney murine leukemia virus, supporting the notion that a conserved region of IN is targeted. The hexapeptide was also tested in the disintegration reaction. This phosphoryl-transfer reaction can be carried out by the catalytic core of IN alone, and the peptide HCKFWW was found to inhibit this reaction, suggesting that the hexapeptide acts at or near the catalytic site of IN. Identification of an IN hexapeptide inhibitor provides proof of concept for the approach, and, moreover, this peptide may be useful for structure-function analysis of IN.
Resumo:
Small molecules that specifically bind with high affinity to any designated DNA sequence in the human genome would be useful tools in molecular biology and potentially in human medicine. Simple rules have been developed to rationally alter the sequence specificity of minor groove-binding polyamides containing N-methylimidazole and N-methylpyrrole amino acids. Crescent-shaped polyamides bind as antiparallel dimers with each polyamide making specific contacts with each strand on the floor of the minor groove. Cyclic polyamides have now been synthesized that bind designated DNA sequences at subnanomolar concentrations.
Resumo:
The challenge of the Human Genome Project is to increase the rate of DNA sequence acquisition by two orders of magnitude to complete sequencing of the human genome by the year 2000. The present work describes a rapid detection method using a two-dimensional optical wave guide that allows measurement of real-time binding or melting of a light-scattering label on a DNA array. A particulate label on the target DNA acts as a light-scattering source when illuminated by the evanescent wave of the wave guide and only the label bound to the surface generates a signal. Imaging/visual examination of the scattered light permits interrogation of the entire array simultaneously. Hybridization specificity is equivalent to that obtained with a conventional system using autoradiography. Wave guide melting curves are consistent with those obtained in the liquid phase and single-base discrimination is facile. Dilution experiments showed an apparent lower limit of detection at 0.4 nM oligonucleotide. This performance is comparable to the best currently known fluorescence-based systems. In addition, wave guide detection allows manipulation of hybridization stringency during detection and thereby reduces DNA chip complexity. It is anticipated that this methodology will provide a powerful tool for diagnostic applications that require rapid cost-effective detection of variations from known sequences.
Resumo:
Many human malignant cells lack methylthioadenosine phosphorylase (MTAP) enzyme activity. The gene (MTAP) encoding this enzyme was previously mapped to the short arm of chromosome 9, band p21-22, a region that is frequently deleted in multiple tumor types. To clone candidate tumor suppressor genes from the deleted region on 9p21-22, we have constructed a long-range physical map of 2.8 megabases for 9p21 by using overlapping yeast artificial chromosome and cosmid clones. This map includes the type IIFN gene cluster, the recently identified candidate tumor suppressor genes CDKN2 (p16INK4A) and CDKN2B (p15INK4B), and several CpG islands. In addition, we have identified other transcription units within the yeast artificial chromosome contig. Sequence analysis of a 2.5-kb cDNA clone isolated from a CpG island that maps between the IFN genes and CDKN2 reveals a predicted open reading frame of 283 amino acids followed by 1302 nucleotides of 3' untranslated sequence. This gene is evolutionarily conserved and shows significant amino acid homologies to mouse and human purine nucleoside phosphorylases and to a hypothetical 25.8-kDa protein in the pet gene (coding for cytochrome bc1 complex) region of Rhodospirillum rubrum. The location, expression pattern, and nucleotide sequence of this gene suggest that it codes for the MTAP enzyme.
Resumo:
Benzene is a ubitiquous human environment mental carcinogen. One of the major metabolites is hydroquinone, which is oxidized in vivo to give p-benzoquinone (p-BQ). Both metabolites are toxic to human cells. p-BQ reacts with DNA to form benzetheno adducts with deoxycytidine, deoxyadenosine, and deoxyguanosine. In this study we have synthesized the exocyclic compounds 3-hydroxy-3-N4-benzetheno-2'-deoxycytidine (p-BQ-dCyd) and 9-hydroxy-1,N6-benzetheno-2'-deoxyadenosine (p-BQ-dAdo), respectively, by reacting deoxycytidine and deoxyadenosine with p-BQ. These were converted to the phosphoamidites, which were then used to prepare site-specific oligonucleotides with either the p-BQ-dCyd or p-BQ-dAdo adduct (pbqC or pbqA in sequences) at two different defined positions. These oligonucleotides were efficiently nicked 5' to the adduct by partially purified HeLa cell extracts--the pbqC-containing oligomer more rapidly than the pbqA-containing oligomer. In contrast to the enzyme binding to derivatives produced by the vinyl chloride metabolite chloroacetaldehyde, the oligonucleotides up to 60-mer containing p-BQ adducts did not bind measurably to the same enzyme preparation in a gel retardation assay. Furthermore, there was no competition for the binding observed between oligonucleotides containing 1,N6-etheno A deoxyadenosine (1,N6-etheno-dAdo; epsilon A in sequences) and these oligomers containing either of the p-BQ adducts, even at 120-fold excess. When highly purified fast protein liquid chromatography (FPLC) enzyme fractions were obtained, there appeared to be two closely eluting nicking activities. One of these enzymes bound and cleaved the epsilon A-containing deoxyoligonucleotide. The other enzyme cleaved the pbqA- and pbqC-containing deoxyoligonucleotides. One additional unexpected fact was that bulk p-BQ-treated salmon sperm DNA did compete effectively with the epsilon A-containing oligonucleotide for protein binding. This raises the possibility that such DNA contains other, as-yet-uncharacterized adducts that are recognized by the same enzyme that recognizes the etheno adducts. In summary, we describe a previously undescribed human DNA repair activity, possibly a glycosylase, that excises from DNA pbqC and pbqA, exocyclic adducts resulting from reaction of deoxycytidine and deoxyadenosine with the benzene metabolite, p-BQ. This glycosylase activity is not identical to the one previously reported from this laboratory as excising the four etheno bases from DNA.
Resumo:
Falls are one of the greatest threats to elderly health in their daily living routines and activities. Therefore, it is very important to detect falls of an elderly in a timely and accurate manner, so that immediate response and proper care can be provided, by sending fall alarms to caregivers. Radar is an effective non-intrusive sensing modality which is well suited for this purpose, which can detect human motions in all types of environments, penetrate walls and fabrics, preserve privacy, and is insensitive to lighting conditions. Micro-Doppler features are utilized in radar signal corresponding to human body motions and gait to detect falls using a narrowband pulse-Doppler radar. Human motions cause time-varying Doppler signatures, which are analyzed using time-frequency representations and matching pursuit decomposition (MPD) for feature extraction and fall detection. The extracted features include MPD features and the principal components of the time-frequency signal representations. To analyze the sequential characteristics of typical falls, the extracted features are used for training and testing hidden Markov models (HMM) in different falling scenarios. Experimental results demonstrate that the proposed algorithm and method achieve fast and accurate fall detections. The risk of falls increases sharply when the elderly or patients try to exit beds. Thus, if a bed exit can be detected at an early stage of this motion, the related injuries can be prevented with a high probability. To detect bed exit for fall prevention, the trajectory of head movements is used for recognize such human motion. A head detector is trained using the histogram of oriented gradient (HOG) features of the head and shoulder areas from recorded bed exit images. A data association algorithm is applied on the head detection results to eliminate head detection false alarms. Then the three dimensional (3D) head trajectories are constructed by matching scale-invariant feature transform (SIFT) keypoints in the detected head areas from both the left and right stereo images. The extracted 3D head trajectories are used for training and testing an HMM based classifier for recognizing bed exit activities. The results of the classifier are presented and discussed in the thesis, which demonstrates the effectiveness of the proposed stereo vision based bed exit detection approach.
Resumo:
Inducible epigenetic changes in eukaryotes are believed to enable rapid adaptation to environmental fluctuations. We have found distinct regions of the Arabidopsis genome that are susceptible to DNA (de)methylation in response to hyperosmotic stress. The stress-induced epigenetic changes are associated with conditionally heritable adaptive phenotypic stress responses. However, these stress responses are primarily transmitted to the next generation through the female lineage due to widespread DNA glycosylase activity in the male germline, and extensively reset in the absence of stress. Using the CNI1/ATL31 locus as an example, we demonstrate that epigenetically targeted sequences function as distantly-acting control elements of antisense long non-coding RNAs, which in turn regulate targeted gene expression in response to stress. Collectively, our findings reveal that plants use a highly dynamic maternal 'short-term stress memory' with which to respond to adverse external conditions. This transient memory relies on the DNA methylation machinery and associated transcriptional changes to extend the phenotypic plasticity accessible to the immediate offspring.
Resumo:
With the sequencing and annotation of genomes and transcriptomes of several eukaryotes, the importance of noncoding RNA (ncRNA)-RNA molecules that are not translated to protein products-has become more evident. A subclass of ncRNA transcripts are encoded by highly regulated, multi-exon, transcriptional units, are processed like typical protein-coding mRNAs and are increasingly implicated in regulation of many cellular functions in eukaryotes. This study describes the identification of candidate functional ncRNAs from among the RIKEN mouse full-length cDNA collection, which contains 60,770 sequences, by using a systematic computational filtering approach. We initially searched for previously reported ncRNAs and found nine murine ncRNAs and homologs of several previously described nonmouse ncRNAs. Through our computational approach to filter artifact-free clones that lack protein coding potential, we extracted 4280 transcripts as the largest-candidate set. Many clones in the set had EST hits, potential CpG islands surrounding the transcription start sites, and homologies with the human genome. This implies that many candidates are indeed transcribed in a regulated manner. Our results demonstrate that ncRNAs are a major functional subclass of processed transcripts in mammals.
Resumo:
Cross-species comparative genomics is a powerful strategy for identifying functional regulatory elements within noncoding DNA. In this paper, comparative analysis of human and mouse intronic sequences in the breast cancer susceptibility gene (BRCA1) revealed two evolutionarily conserved noncoding sequences (CNS) in intron 2, 5 kb downstream of the core BRCA1 promoter. The functionality of these elements was examined using homologous-recombination-based mutagenesis of reporter gene-tagged cosmids incorporating these regions and flanking sequences from the BRCA1 locus. This showed that CNS-1 and CNS-2 have differential transcriptional regulatory activity in epithelial cell lines. Mutation of CNS-1 significantly reduced reporter gene expression to 30% of control levels. Conversely mutation of CNS-2 increased expression to 200% of control levels. Regulation is at the level of transcription and shows promoter specificity. Both elements also specifically bind nuclear proteins in vitro. These studies demonstrate that the combination of comparative genomics and functional analysis is a successful strategy to identify novel regulatory elements and provide the first direct evidence that conserved noncoding sequences in BRCA1 regulate gene expression. (c) 2005 Elsevier Inc. All rights reserved.
Resumo:
Different DNA motifs are required for optimal stimulation Of mouse and human immune cells by CpG oligode-oxynucleotides (ODN). These species differences presumably reflect sequence differences in TLR9, the CPG DNA receptor. In this study, we show that this sequence specificity is restricted to phosphorothioate (PS)-modified ODN and is not observed when a natural phosphodiester backbone is used. Thus, human and mouse cells have not evolved to recognize different CpG motifs in natural DNA. Nonoptimal PS-ODN (i.e., mouse CpG motif on human cells and vice versa) gave delayed and less sustained phosphorylation of p38 AWK than optimal motifs. When the CpG dinucleotide was inverted to GC In each ODN some residual activity of the PS-ODN was retained in a species-specific, TLR-9-dependent manner. Thus, TLR9 may he responsible for mediating many published CpG-independent responses to PS-ODN.
Resumo:
Background: Current methods to find significantly under- and over-represented gene ontology (GO) terms in a set of genes consider the genes as equally probable balls in a bag, as may be appropriate for transcripts in micro-array data. However, due to the varying length of genes and intergenic regions, that approach is inappropriate for deciding if any GO terms are correlated with a set of genomic positions. Results: We present an algorithm - GONOME - that can determine which GO terms are significantly associated with a set of genomic positions given a genome annotated with (at least) the starts and ends of genes. We show that certain GO terms may appear to be significantly associated with a set of randomly chosen positions in the human genome if gene lengths are not considered, and that these same terms have been reported as significantly over-represented in a number of recent papers. This apparent over-representation disappears when gene lengths are considered, as GONOME does. For example, we show that, when gene length is taken into account, the term development is not significantly enriched in genes associated with human CpG islands, in contradiction to a previous report. We further demonstrate the efficacy of GONOME by showing that occurrences of the proteosome-associated control element (PACE) upstream activating sequence in the S. cerevisiae genome associate significantly to appropriate GO terms. An extension of this approach yields a whole-genome motif discovery algorithm that allows identification of many other promoter sequences linked to different types of genes, including a large group of previously unknown motifs significantly associated with the terms 'translation' and 'translational elongation'. Conclusion: GONOME is an algorithm that correctly extracts over-represented GO terms from a set of genomic positions. By explicitly considering gene size, GONOME avoids a systematic bias toward GO terms linked to large genes. Inappropriate use of existing algorithms that do not take gene size into account has led to erroneous or suspect conclusions. Reciprocally GONOME may be used to identify new features in genomes that are significantly associated with particular categories of genes.
Resumo:
The AXIN1 gene has been implicated in caudal duplication anomalies. Its coding region was sequenced in both members of a monozygotic ( MZ) twin pair discordant for a caudal duplication anomaly, but no mutation was found. Using bisulfite sequencing, we examined methylation at the promoter region of the AXIN1 gene in these twins and in twin and age-matched singleton controls. Methylation of the promoter region in peripheral blood mononucleated cells was variable among individuals, including MZ pairs. In the MZ pair discordant for the caudal duplication, this region of the affected twin was significantly more methylated than that of the unaffected twin (), which was significantly more P < .0001 methylated than those of the controls (). We have confirmed that this CpG island does function as a promoter P = .02 in vitro and that its activity is inversely proportional to the extent of methylation. This finding raises the possibility that hypermethylation of the AXIN1 promoter, by mechanisms as yet undetermined, is associated with the malformation. This case may be paradigmatic for some cases of MZ discordance.
Resumo:
Despite our detailed characterization of the human genome at the level of the primary DNA sequence, we are still far from understanding the molecular events underlying phenotypic variation. Epigenetic modifications to the DNA sequence and associated chromatin are known to regulate gene expression and, as such, are a significant contributor to phenotype. Studies of inbred mice and monozygotic twins show that variation in the epigenotype can be seen even between genetically identical individuals and that this, in some cases at least, is associated with phenotypic differences. Moreover, recent evidence suggests that the epigenome can be influenced by the environment and these changes can last a lifetime. However, we also know that epigenetic states in real-time are in continual flux and, as a result, the epigenome exhibits instability both within and across generations. We still do not understand the rules governing the establishment and maintenance of the epigenotype at any particular locus. The underlying DNA sequence itself and the sequence at unlinked loci (modifier loci) are certainly involved. Recent support for the existence of transgenerational epigenetic inheritance in mammals suggests that the epigenetic state of the locus in the previous generation may also play a role. Over the next decade, many of these processes will be better understood, heralding a greater capacity for us to correlate measurable molecular marks with phenotype and providing the opportunity for improved diagnosis and presymptomatic healthcare.
Resumo:
There is a growing societal need to address the increasing prevalence of behavioral health issues, such as obesity, alcohol or drug use, and general lack of treatment adherence for a variety of health problems. The statistics, worldwide and in the USA, are daunting. Excessive alcohol use is the third leading preventable cause of death in the United States (with 79,000 deaths annually), and is responsible for a wide range of health and social problems. On the positive side though, these behavioral health issues (and associated possible diseases) can often be prevented with relatively simple lifestyle changes, such as losing weight with a diet and/or physical exercise, or learning how to reduce alcohol consumption. Medicine has therefore started to move toward finding ways of preventively promoting wellness, rather than solely treating already established illness. Evidence-based patient-centered Brief Motivational Interviewing (BMI) interven- tions have been found particularly effective in helping people find intrinsic motivation to change problem behaviors after short counseling sessions, and to maintain healthy lifestyles over the long-term. Lack of locally available personnel well-trained in BMI, however, often limits access to successful interventions for people in need. To fill this accessibility gap, Computer-Based Interventions (CBIs) have started to emerge. Success of the CBIs, however, critically relies on insuring engagement and retention of CBI users so that they remain motivated to use these systems and come back to use them over the long term as necessary. Because of their text-only interfaces, current CBIs can therefore only express limited empathy and rapport, which are the most important factors of health interventions. Fortunately, in the last decade, computer science research has progressed in the design of simulated human characters with anthropomorphic communicative abilities. Virtual characters interact using humans’ innate communication modalities, such as facial expressions, body language, speech, and natural language understanding. By advancing research in Artificial Intelligence (AI), we can improve the ability of artificial agents to help us solve CBI problems. To facilitate successful communication and social interaction between artificial agents and human partners, it is essential that aspects of human social behavior, especially empathy and rapport, be considered when designing human-computer interfaces. Hence, the goal of the present dissertation is to provide a computational model of rapport to enhance an artificial agent’s social behavior, and to provide an experimental tool for the psychological theories shaping the model. Parts of this thesis were already published in [LYL+12, AYL12, AL13, ALYR13, LAYR13, YALR13, ALY14].
Resumo:
Interethnic differences exist in disease prevalence, especially with regard to cancer and cardiovascular diseases, which involve altered expression or activity of matrix metalloproteinases (MMPs). The hypothesis being tested in this study is that interethnic differences exist between blacks and whites with regard to the distribution of genetic variants of MMP polymorphisms and haplotypes. We examined the distribution of polymorphisms of MMP-2 and MMP-9 genes in 177 black and 140 white subjects. We studied the following polymorphisms: the C(-1306)T in the promoter of the MMP-2 gene, the C(-1562)T and a microsatellite -90(CA)(14-24) in the promoter, and the Q279R in exon 6 of the MMP-9 gene. We have also compared our results with those from Hapmap or Seattle SNPs Projects and estimated the haplotype frequency in these two ethnic groups. The ""C'' allele for the C(-1306)T polymorphism was more common in blacks (91.5%) than in whites (80.4%; p<0.0001). The ""T'' allele for the C(-1562)T polymorphism was more common in blacks (15.0%) than in whites (8.9%; p=0.0279), as well as the alleles with >21 repeats for the -90(CA)(14-24) were more common in blacks than in whites (61.9% in blacks and 49.3% in whites; p=0.0017). We found no interethnic differences for the Q279R polymorphism. Moreover, two haplotypes that combine ""detrimental'' alleles were found at higher frequencies in blacks than in whites (31% vs. 16.4%, respectively; p<0.05). The interethnic differences being reported here replicate those previously found with smaller number of subjects in the Hapmap or Seattle SNPs data and may help explain the higher prevalence of cancer and cardiovascular diseases in blacks compared with whites. Our findings suggest a proportional significance of these polymorphisms in each ethnic group.