931 results for wide genome sequencing
Abstract:
The complete genome sequence of Caulobacter crescentus was determined to be 4,016,942 base pairs in a single circular chromosome encoding 3,767 genes. This organism, which grows in a dilute aquatic environment, coordinates the cell division cycle and multiple cell differentiation events. With the annotated genome sequence, a full description of the genetic network that controls bacterial differentiation, cell growth, and cell cycle progression is within reach. Two-component signal transduction proteins are known to play a significant role in cell cycle progression. Genome analysis revealed that the C. crescentus genome encodes a significantly higher number of these signaling proteins (105) than any bacterial genome sequenced thus far. Another regulatory mechanism involved in cell cycle progression is DNA methylation. The occurrence of the recognition sequence for an essential DNA methylating enzyme that is required for cell cycle regulation is severely limited and shows a bias to intergenic regions. The genome contains multiple clusters of genes encoding proteins essential for survival in a nutrient poor habitat. Included are those involved in chemotaxis, outer membrane channel function, degradation of aromatic ring compounds, and the breakdown of plant-derived carbon sources, in addition to many extracytoplasmic function sigma factors, providing the organism with the ability to respond to a wide range of environmental fluctuations. C. crescentus is, to our knowledge, the first free-living α-class proteobacterium to be sequenced and will serve as a foundation for exploring the biology of this group of bacteria, which includes the obligate endosymbiont and human pathogen Rickettsia prowazekii, the plant pathogen Agrobacterium tumefaciens, and the bovine and human pathogen Brucella abortus.
Abstract:
We have shown that the DNA demethylation complex isolated from chicken embryos has a G⋅T mismatch DNA glycosylase that also possesses 5-methylcytosine DNA glycosylase (5-MCDG) activity. Herein we show that human embryonic kidney cells stably transfected with 5-MCDG cDNA linked to a cytomegalovirus promoter overexpress 5-MCDG. A 15- to 20-fold overexpression of 5-MCDG results in the specific demethylation of a stably integrated ecdysone-retinoic acid responsive enhancer-promoter linked to a β-galactosidase reporter gene. Demethylation occurs in the absence of the ligand ponasterone A (an analogue of ecdysone). The state of methylation of the transgene was investigated by Southern blot analysis and by the bisulfite genomic sequencing reaction. Demethylation occurs downstream of the hormone response elements. No genome-wide demethylation was observed. The expression of an inactive mutant of 5-MCDG or the empty vector does not elicit any demethylation of the promoter-enhancer of the reporter gene. An increase in 5-MCDG activity does not influence the activity of DNA methyltransferase(s) when tested in vitro with a hemimethylated substrate. There is no change in the transgene copy number during selection of the clones with antibiotics. Immunoprecipitation combined with Western blot analysis showed that an antibody directed against 5-MCDG precipitates a complex containing the retinoid X receptor α. The association between retinoid receptor and 5-MCDG is not ligand dependent. These results suggest that a complex of the hormone receptor with 5-MCDG may target demethylation of the transgene in this system.
Abstract:
Progress in agricultural and environmental technologies is hampered by a slower rate of gene discovery in plants than in animals. The vast pool of genes in plants, however, will be an important resource for insertion of genes, via biotechnological procedures, into an array of plants, generating unique germ plasms not achievable by conventional breeding. It has recently become clear that genomes of grasses have evolved in a manner analogous to Lego blocks. Large chromosome segments have been reshuffled and stuffer pieces added between genes. Although some genomes have become very large, the genome with the fewest stuffer pieces, the rice genome, is the Rosetta Stone of all the bigger grass genomes. This means that sequencing the rice genome as anchor genome of the grasses will provide instantaneous access to the same genes in the same relative physical position in other grasses (e.g., corn and wheat), without the need to sequence each of these genomes independently. (i) The sequencing of the entire genome of rice as anchor genome for the grasses will accelerate plant gene discovery in many important crops (e.g., corn, wheat, and rice) by several orders of magnitude and reduce research and development costs for government and industry at a faster pace. (ii) Costs for sequencing entire genomes have come down significantly. The rice genome is only 12% of the size of the human or the corn genome, and technology improvements from the human genome project are completely transferable, translating into another 50% reduction in costs. (iii) The physical mapping of the rice genome by a group of Japanese researchers provides a jump start for sequencing the genome and forming an international consortium. Otherwise, other countries would do it alone and own proprietary positions.
Abstract:
The genetic basis for virulence in influenza virus is largely unknown. To explore the mutational basis for increased virulence in the lung, the H3N2 prototype clinical isolate, A/HK/1/68, was adapted to the mouse. Genomic sequencing provided the first demonstration, to our knowledge, that a group of 11 mutations can convert an avirulent virus to a virulent variant that can kill at a minimal dose. Thirteen of the 14 amino acid substitutions (93%) detected among clonal isolates were likely instrumental in adaptation because of their positive selection, location in functional regions, and/or independent occurrence in other virulent influenza viruses. Mutations in virulent variants repeatedly involved nuclear localization signals and sites of protein and RNA interaction, implicating them as novel modulators of virulence. Mouse-adapted variants with the same hemagglutinin mutations possessed different pH optima of fusion, indicating that fusion activity of hemagglutinin can be modulated by other viral genes. Experimental adaptation resulted in the selection of three mutations that were in common with the virulent human H5N1 isolate A/HK/156/97 and that may be instrumental in its extreme virulence. Analysis of viral adaptation by serial passage appears to provide the identification of biologically relevant mutations.
Abstract:
We present a method for discovering conserved sequence motifs from families of aligned protein sequences. The method has been implemented as a computer program called emotif (http://motif.stanford.edu/emotif). Given an aligned set of protein sequences, emotif generates a set of motifs with a wide range of specificities and sensitivities. emotif also can generate motifs that describe possible subfamilies of a protein superfamily. A disjunction of such motifs often can represent the entire superfamily with high specificity and sensitivity. We have used emotif to generate sets of motifs from all 7,000 protein alignments in the blocks and prints databases. The resulting database, called identify (http://motif.stanford.edu/identify), contains more than 50,000 motifs. For each alignment, the database contains several motifs whose probability of matching a false positive ranges from 10⁻¹⁰ to 10⁻⁵. Highly specific motifs are well suited for searching entire proteomes, while generating very few false predictions. identify assigns biological functions to 25–30% of all proteins encoded by the Saccharomyces cerevisiae genome and by several bacterial genomes. In particular, identify assigned functions to 172 of the proteins of unknown function in the yeast genome.
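The idea of deriving a motif from an aligned protein family can be illustrated with a minimal Python sketch. It is not the emotif algorithm: it simply collects the residues seen in each alignment column and replaces highly variable or gapped columns with a wildcard. The function name `column_motif`, the `max_residues` cutoff, and the toy alignment are all assumptions made for illustration.

```python
import re


def column_motif(alignment, max_residues=3):
    """Build a regex-style motif from equal-length aligned sequences.

    Hypothetical helper for illustration only; emotif itself enumerates
    motifs over allowed residue groups and scores their specificity.
    """
    if not alignment:
        raise ValueError("alignment is empty")
    length = len(alignment[0])
    if any(len(seq) != length for seq in alignment):
        raise ValueError("aligned sequences must have equal length")

    parts = []
    for column in zip(*alignment):
        residues = sorted(set(column))
        if "-" in residues or len(residues) > max_residues:
            parts.append(".")            # gapped or too variable: wildcard
        elif len(residues) == 1:
            parts.append(residues[0])    # fully conserved position
        else:
            parts.append("[" + "".join(residues) + "]")  # small residue group
    return "".join(parts)


# Toy alignment (made up, not taken from the blocks or prints databases).
toy_alignment = ["GKSTL", "GKTTL", "GRSAL", "GKSGL"]
motif = column_motif(toy_alignment, max_residues=2)

# The motif can then be used to scan other sequences.
matches_new_member = re.fullmatch(motif, "GRTWL") is not None
matches_unrelated = re.fullmatch(motif, "GASTL") is not None
```

A lower `max_residues` yields a more specific motif that matches fewer sequences, while a higher value trades specificity for sensitivity, which mirrors the range of motifs described in the abstract.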
Abstract:
Microarrays containing 1046 human cDNAs of unknown sequence were printed on glass with high-speed robotics. These 1.0-cm² DNA "chips" were used to quantitatively monitor differential expression of the cognate human genes using a highly sensitive two-color hybridization assay. Array elements that displayed differential expression patterns under given experimental conditions were characterized by sequencing. The identification of known and novel heat shock and phorbol ester-regulated genes in human T cells demonstrates the sensitivity of the assay. Parallel gene analysis with microarrays provides a rapid and efficient method for large-scale human gene discovery.
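A common way to summarize a two-color hybridization readout is the log2 ratio of the two channel intensities for each array element. The sketch below assumes that approach. The abstract does not state how the signals were compared, so the function names, the pseudocount, the twofold threshold, and the intensity values are all hypothetical.

```python
import math


def log2_ratio(test_signal, reference_signal, pseudocount=1.0):
    """Log2 ratio of two channel intensities; pseudocount avoids log(0)."""
    return math.log2((test_signal + pseudocount) / (reference_signal + pseudocount))


def differential_elements(signals, threshold=1.0):
    """Return elements whose absolute log2 ratio meets the threshold.

    `signals` maps an element name to (test_signal, reference_signal).
    A threshold of 1.0 corresponds to a twofold change (an assumption).
    """
    selected = {}
    for name, (test_signal, reference_signal) in signals.items():
        ratio = log2_ratio(test_signal, reference_signal)
        if abs(ratio) >= threshold:
            selected[name] = round(ratio, 2)
    return selected


# Made-up intensities for three array elements.
toy_signals = {
    "element_A": (799, 99),   # induced in the test sample
    "element_B": (199, 199),  # unchanged
    "element_C": (49, 399),   # repressed in the test sample
}
candidates = differential_elements(toy_signals)
```

Elements selected this way would be the ones carried forward for characterization by sequencing, as described in the abstract.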
Abstract:
The genome of the pufferfish (Fugu rubripes) (400 Mb) is approximately 7.5 times smaller than the human genome, but it has a similar gene repertoire to that of man. If regions of the two genomes exhibited conservation of gene order (i.e., were syntenic), it should be possible to reduce dramatically the effort required for identification of candidate genes in human disease loci by sequencing syntenic regions of the compact Fugu genome. We have demonstrated that three genes (dihydrolipoamide succinyltransferase, S31iii125, and S20i15), which are linked to FOS in the familial Alzheimer disease locus (AD3) on human chromosome 14, have homologues in the Fugu genome adjacent to Fugu cFOS. The relative gene order of cFOS, S31iii125, and S20i15 was the same in both genomes, but in Fugu these three genes lay within a 12.4-kb region, compared to >600 kb in the human AD3 locus. These results demonstrate the conservation of synteny between the genomes of Fugu and man and highlight the utility of this approach for sequence-based identification of genes in human disease loci.
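The size figures quoted in this abstract can be checked with a few lines of arithmetic. The short Python sketch below uses only the numbers stated above; the variable names are illustrative. Because the human region is given as ">600 kb", the local compaction it computes is a lower bound.

```python
# Figures taken from the abstract.
FUGU_GENOME_MB = 400          # Fugu genome size
HUMAN_TO_FUGU_RATIO = 7.5     # human genome is about 7.5 times larger
FUGU_REGION_KB = 12.4         # region containing the three genes in Fugu
HUMAN_REGION_MIN_KB = 600     # same genes span more than 600 kb in human AD3

# Human genome size implied by the genome-wide ratio.
implied_human_genome_mb = FUGU_GENOME_MB * HUMAN_TO_FUGU_RATIO

# Lower bound on how much more compact this particular region is in Fugu.
local_compaction = HUMAN_REGION_MIN_KB / FUGU_REGION_KB

print(f"Implied human genome size: {implied_human_genome_mb:.0f} Mb")
print(f"Local compaction at this locus: at least {local_compaction:.1f}-fold")
```

The local compaction at this locus (at least about 48-fold) is well above the genome-wide average of 7.5-fold, which is why sequencing the Fugu region is so much less effort than sequencing the human one.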
Abstract:
The mouse is the best model system for the study of mammalian genetics and physiology. Because of the feasibility and importance of studying genetic crosses, the mouse genetic map has received tremendous attention in recent years. It currently contains over 14,000 genetically mapped markers, including 700 mutant loci, 3500 genes, and 6500 simple sequence length polymorphisms (SSLPs). The mutant loci and genes allow insights and correlations concerning physiology and development. The SSLPs provide highly polymorphic anchor points that allow inheritance to be traced in any cross and provide a scaffold for assembling physical maps. Adequate physical mapping resources--notably large-insert yeast artificial chromosome (YAC) libraries--are available to support positional cloning projects based on the genetic map, but a comprehensive physical map is still a few years away. Large-scale sequencing efforts have not yet begun in mouse, but comparative sequence analysis between mouse and human is likely to provide tremendous information about gene structure and regulation.
Abstract:
Whole genome linkage analysis of type 1 diabetes using affected sib pair families and semi-automated genotyping and data capture procedures has shown how type 1 diabetes is inherited. A major proportion of clustering of the disease in families can be accounted for by sharing of alleles at susceptibility loci in the major histocompatibility complex on chromosome 6 (IDDM1) and at a minimum of 11 other loci on nine chromosomes. Primary etiological components of IDDM1, the HLA-DQB1 and -DRB1 class II immune response genes, and of IDDM2, the minisatellite repeat sequence in the 5' regulatory region of the insulin gene on chromosome 11p15, have been identified. Identification of the other loci will involve linkage disequilibrium mapping and sequencing of candidate genes in regions of linkage.
Abstract:
The human squamous cell carcinoma cell line SCC83-01-82 (SCC) contains mutations in both the H-ras and p53 genes, but it exhibits a nontumorigenic phenotype in nude mice. This cell line can be converted into a cell line with a tumorigenic phenotype, SCC83-01-82CA (CA), by treatment with the mutagen methyl methanesulfonate (MMS). This indicates that additional genetic events leading to expression of a cooperating tumor susceptibility gene(s) may be required for tumorigenicity. To identify the cooperating gene(s), an expression cDNA library was made from tumorigenic CA cells. The library DNA was transfected into nontumorigenic SCC cells and the transfected SCC cells were then injected into nude mice for the selection of a tumorigenic phenotype. Tumors developed in 3 of the 18 mice after injection. Several new cell lines were established from these transfected cell-induced tumors and designated as CATR cells. Tumor histology and karyotype analysis of these cells indicated that they were of human epithelial cell origin. All the CATR cells have the library vector sequence integrated in their genome. Cell line CATR1 expressed a single message from the integrated library representing a 1.3-kb cDNA insert that was absent from untransfected SCC cells or MMS-converted CA cells. This 1.3-kb cDNA insert was cloned by PCR amplification of reverse-transcribed CATR1 total RNA and was designated CATR1.3. The nucleotide sequence of CATR1.3 encodes a peptide of 79 amino acids, has a long 3' untranslated region, and represents an unknown gene product that was associated with the tumorigenic conversion due to the transfected expression library.
Abstract:
We report a general mass spectrometric approach for the rapid identification and characterization of proteins isolated by preparative two-dimensional polyacrylamide gel electrophoresis. This method possesses the inherent power to detect and structurally characterize covalent modifications. Absolute sensitivities of matrix-assisted laser desorption ionization and high-energy collision-induced dissociation tandem mass spectrometry are exploited to determine the mass and sequence of subpicomole sample quantities of tryptic peptides. These data permit mass matching and sequence homology searching of computerized peptide mass and protein sequence data bases for known proteins and design of oligonucleotide probes for cloning unknown proteins. We have identified 11 proteins in lysates of human A375 melanoma cells, including: alpha-enolase, cytokeratin, stathmin, protein disulfide isomerase, tropomyosin, Cu/Zn superoxide dismutase, nucleoside diphosphate kinase A, galaptin, and triosephosphate isomerase. We have characterized several posttranslational modifications and chemical modifications that may result from electrophoresis or subsequent sample processing steps. Detection of comigrating and covalently modified proteins illustrates the necessity of peptide sequencing and the advantages of tandem mass spectrometry to reliably and unambiguously establish the identity of each protein. This technology paves the way for studies of cell-type dependent gene expression and studies of large suites of cellular proteins with unprecedented speed and rigor to provide information complementary to the ongoing Human Genome Project.
Abstract:
Most cases of central precocious puberty (CPP) in girls remain idiopathic. The hypothesis of a genetic cause has gained strength following the discovery of several genes associated with this phenotype, particularly those involved in the kisspeptin system (KISS1 and KISS1R). However, only isolated cases of CPP have been linked to mutations in kisspeptin or its receptor. Until recently, most genetic studies of CPP searched for candidate genes selected on the basis of animal models, genetic analysis of patients with hypogonadotropic hypogonadism, or genome-wide association studies. In this work, whole-exome sequencing, a more modern sequencing methodology, was used to identify variants associated with the CPP phenotype. Thirty-six individuals with the familial form of CPP (19 families) and 213 apparently sporadic cases were initially selected. The familial form was defined by the presence of more than one affected member in the family. Genomic DNA was extracted from the peripheral blood leukocytes of all patients. Whole-exome sequencing performed with Illumina technology in 40 members of 15 families with CPP identified inactivating mutations in a single gene, MKRN3, in five of these families. Mutation screening of MKRN3 by direct sequencing in two additional families (four patients) identified two new variants in this gene. MKRN3 is a single-exon gene located on chromosome 15 in a critical region for Prader-Willi syndrome. The MKRN3 gene is maternally imprinted and is expressed only from the paternal allele. The discovery of mutations in patients with familial CPP prompted a search for mutations in this gene in 213 patients with apparently sporadic CPP, using polymerase chain reaction followed by enzymatic purification and direct automated (Sanger) sequencing.
Three novel mutations and two previously identified ones, including four frameshifts and one missense variant, were found in the heterozygous state in six unrelated girls. All the novel variants identified were absent from the databases (1000 Genomes and Exome Variant Server). Familial segregation analysis in three of these girls with apparently sporadic CPP and an MKRN3 mutation confirmed an autosomal dominant pattern of inheritance with complete penetrance and transmission exclusively through the paternal allele, demonstrating that these cases were in fact also familial. Most of the mutations found in MKRN3 were frameshift or nonsense mutations leading to premature stop codons and truncated proteins, thereby confirming the association with the phenotype. The two missense mutations identified (p.Arg365Ser and p.Phe417Ile) were located in zinc finger or zinc ring regions, which are important for protein function. In addition, in silico studies of these two variants indicated pathogenicity. All patients with MKRN3 mutations showed clinical and hormonal features typical of premature activation of the reproductive axis. The median age at onset of puberty was 6 years in girls (range 3 to 6.5) and 8 years in boys (range 5.9 to 8.5). In view of the imprinting phenomenon, methylation analysis was also performed in a subgroup of 52 patients with CPP using the MS-MLPA technique, but no changes in the methylation pattern were found. In conclusion, this work identified a new gene associated with the CPP phenotype. Currently, inactivating mutations in MKRN3 represent the most common genetic cause of familial CPP (33%). MKRN3 is the first imprinted gene associated with pubertal disorders in humans. The precise mechanism of action of this gene in the regulation of GnRH secretion requires further study.
Abstract:
Objective: In Southern European countries up to one-third of the patients with hereditary hemochromatosis (HH) do not present the common HFE risk genotype. In order to investigate the molecular basis of these cases we have designed a gene panel for rapid and simultaneous analysis of 6 HH-related genes (HFE, TFR2, HJV, HAMP, SLC40A1 and FTL) by next-generation sequencing (NGS). Materials and Methods: Eighty-eight Portuguese patients with iron overload, negative for the common HFE mutations, were analysed. A TruSeq Custom Amplicon kit (TSCA, by Illumina) was designed in order to generate 97 amplicons covering exons, intron/exon junctions and UTRs of the mentioned genes with a cumulative target sequence of 12,115 bp. Amplicons were sequenced on the MiSeq instrument (Illumina) using 250 bp paired-end reads. Sequences were aligned against human genome reference hg19 using the alignment and variant caller algorithms in the MiSeq Reporter software. Novel variants were validated by Sanger sequencing and their pathogenic significance was assessed by in silico studies. Results: We found a total of 55 different genetic variants. These include novel pathogenic missense and splicing variants (in HFE and TFR2), a very rare variant in the IRE of FTL, and a variant that creates a novel translation initiation codon in the HAMP gene, among others. Conclusion: The merging of TSCA methodology and NGS technology appears to be an appropriate tool for simultaneous and fast analysis of HH-related genes in a large number of samples. However, establishing the clinical relevance of NGS-detected variants for HH development remains a laborious task, requiring further functional studies.
Abstract:
One to two percent of all children are born with a developmental disorder requiring pediatric hospital admissions. For many such syndromes, the molecular pathogenesis remains poorly characterized. Parallel developmental disorders in other species could provide complementary models for human rare diseases by uncovering new candidate genes, improving the understanding of the molecular mechanisms and opening possibilities for therapeutic trials. We performed various experiments, e.g. combined genome-wide association and next generation sequencing, to investigate the clinico-pathological features and genetic causes of three developmental syndromes in dogs, including craniomandibular osteopathy (CMO), a previously undescribed skeletal syndrome, and dental hypomineralization, for which we identified pathogenic variants in the canine SLC37A2 (truncating splicing enhancer variant), SCARF2 (truncating 2-bp deletion) and FAM20C (missense variant) genes, respectively. CMO is a clinical equivalent of infantile cortical hyperostosis (Caffey disease), for which SLC37A2 is a new candidate gene. SLC37A2 is a poorly characterized member of a glucose-phosphate transporter family without previous disease associations. It is expressed in many tissues, including cells of the macrophage lineage, e.g. osteoclasts, which suggests a disease mechanism in which impaired glucose homeostasis in osteoclasts compromises their function in the developing bone, leading to hyperostosis. Mutations in SCARF2 and FAM20C have been associated with the human van den Ende-Gupta and Raine syndromes, which include numerous features similar to those of the affected dogs. Given the growing interest in the molecular characterization and treatment of human rare diseases, our study presents three novel physiologically relevant models for further research and therapy approaches, while providing the molecular identity for the canine conditions.
Abstract:
Thesis (Ph.D.)--University of Washington, 2016-06