974 results for Expressed sequence tag analysis


Relevance:

100.00% 100.00%

Abstract:

Background. The emergence of multi- and extensively-drug resistant Mycobacterium tuberculosis strains has created an urgent need for new agents to treat tuberculosis (TB). The enzymes of the shikimate pathway are attractive targets for the development of antitubercular agents because the pathway is essential for M. tuberculosis and is absent from humans. Chorismate synthase (CS) is the seventh enzyme of this route and catalyzes the NADH- and FMN-dependent synthesis of chorismate, a precursor of aromatic amino acids, naphthoquinones, menaquinones, and mycobactins. Although the M. tuberculosis Rv2540c (aroF) sequence has been annotated to encode a chorismate synthase, there has been no report on its correct assignment or on the functional characterization of its protein product. Results. In the present work, we describe DNA amplification of aroF-encoded CS from M. tuberculosis (MtCS), molecular cloning, protein expression, and purification to homogeneity. N-terminal amino acid sequencing, mass spectrometry and gel filtration chromatography were employed to determine the identity, subunit molecular weight and oligomeric state in solution of homogeneous recombinant MtCS. The bifunctionality of MtCS was determined by measurements of both chorismate synthase and NADH:FMN oxidoreductase activities. The flavin reductase activity was characterized, showing the existence of a complex between FMNox and MtCS. FMNox and NADH equilibrium binding was measured. Primary deuterium, solvent and multiple kinetic isotope effects are described and suggest distinct steps for hydride and proton transfers, with the former being more rate-limiting. Conclusion. This is the first report showing that a bacterial CS is bifunctional. Primary deuterium kinetic isotope effects show that C4-proS hydrogen is being transferred during the reduction of FMNox by NADH and that hydride transfer contributes significantly to the rate-limiting step of the FMN reduction reaction.
Solvent kinetic isotope effects and proton inventory results indicate that proton transfer from solvent partially limits the rate of FMN reduction and that a single proton transfer gives rise to the observed solvent isotope effect. Multiple isotope effects suggest a stepwise mechanism for the reduction of FMNox. The results on enzyme kinetics described here provide evidence for the mode of action of MtCS and should thus pave the way for the rational design of antitubercular agents. © 2008 Ely et al; licensee BioMed Central Ltd.

Relevance:

100.00% 100.00%

Abstract:

A protocol to produce large amounts of bioactive homogeneous human interferon β1 expressed in Escherichia coli was developed. Human interferon β1 ser17 gene was constructed, cloned and subcloned, and the recombinant protein expressed in E. coli cells. Solubilization of recombinant human interferon β1 ser17 (rhIFN-β1 ser17) was accomplished by employing a brief shift to high alkaline pH in the presence of non-ionic detergent. The recombinant protein was purified by three chromatographic steps. N-terminal amino acid sequencing and mass spectrometry analysis provided experimental evidence for the identity of the recombinant protein. Reverse phase liquid chromatography demonstrated that the content of deamidates and sulphoxides was similar to a commercial standard. Size exclusion chromatography demonstrated the absence of high molecular mass aggregates and dimers. The protocol represents an efficient and high-yield method to obtain bioactive homogeneous monomeric rhIFN-β1 ser17 protein. It may thus represent an important step towards scaling up for rhIFN-β1 ser17 large-scale production. © 2010 Villela AD, et al.

Relevance:

100.00% 100.00%

Abstract:

Background: Uterine leiomyomas (ULs) are the most common benign tumours affecting women of reproductive age. ULs represent a major problem in public health, as they are the main indication for hysterectomy. Approximately 40-50% of ULs have non-random cytogenetic abnormalities, and half of ULs may have copy number alterations (CNAs). Gene expression microarray studies have demonstrated that cell proliferation genes act in response to growth factors and steroids. However, only a few genes mapping to CNA regions were found to be associated with ULs. Methodology: We applied an integrative analysis using genomic and transcriptomic data to identify the pathways and molecular markers associated with ULs. Fifty-one fresh frozen specimens were evaluated by array CGH (JISTIC) and gene expression microarrays (SAM). The CONEXIC algorithm was applied to integrate the data. Principal Findings: The integrated analysis identified the top 30 significant genes (P<0.01), which comprised genes associated with cancer, whereas the protein-protein interaction analysis indicated a strong association between FANCA and BRCA1. Functional in silico analysis revealed target molecules for drugs involved in cell proliferation, including FGFR1 and IGFBP5. Transcriptional and protein analyses showed that FGFR1 (P = 0.006 and P<0.01, respectively) and IGFBP5 (P = 0.0002 and P = 0.006, respectively) were up-regulated in the tumours when compared with the adjacent normal myometrium. Conclusions: The integrative genomic and transcriptomic approach indicated that FGFR1 and IGFBP5 amplification, as well as the consequent up-regulation of the protein products, plays an important role in the aetiology of ULs and thus provides data for the development of potential drug therapies that target genes associated with cellular proliferation in ULs. © 2013 Cirilo et al.

Relevance:

100.00% 100.00%

Abstract:

Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES)

Relevance:

100.00% 100.00%

Abstract:

This study aims to understand the role of classroom interactions in the construction of the concept of competition. It sets out to characterize and compare each student's conceptions of competition before, during, and after the lessons on Ecological Interactions; to analyze the construction of this concept in student-student and teacher-student interactions for some of the students; and to compare the students' conceptions at different moments and assess the contributions of the interactions to the conceptual elaboration of four students who took part in one of the groups, considering the contributions of their interactions both with their classmates and with me, during a didactic sequence. Analysis of the students' answers to pre-test 01 allowed the elaboration of a second data-collection instrument, pre-test 02. The students' answers to pre-test 02 were organized into categories, which were later compared with those from post-test 02. The study was carried out in the Science classes of a 3rd Stage (EJA) class at a State Elementary School, with sixteen (16) students who took part in all stages of the research, nine female and seven male. The lessons were recorded on videocassette and on ordinary audio cassette tape and, after transcription, were analyzed. The criterion for selecting episodes was the way in which four students who took part in recombined group 1 constructed, at distinct moments (initial individual, spontaneous group, recombined group, and final individual), individually and in interaction with the teacher, a consensual written answer to the question: Comparing all the episodes of the video you watched, do you think there is any similarity among these relationships? Why?
The results showed that, of the sixteen (16) students who took part in all stages of the process, nine showed an improved conceptual profile and seven gave final answers classified in the same category as their initial answers; among the latter, three had their answers classified in the most advanced category (D), two in the intermediate categories (one in B and one in C), and two in the category furthest (A) from the scientific concept of competition. The four students selected for analysis ultimately arrived at a generalization for the proposed question, starting from explanations that were sometimes grounded in generalizations or in explanations that incorporated theoretical terms, with or without conceptual mastery, showing that they did not appropriate in the same way the elements presented in the answers of the groups in which they had taken part.

Relevance:

100.00% 100.00%

Abstract:

Selection of reference genes is an essential consideration to increase the precision and quality of relative expression analysis by the quantitative RT-PCR method. The stability of eight expressed sequence tags was evaluated to define potential reference genes to study the differential expression of common bean target genes under biotic (incompatible interaction between common bean and fungus Colletotrichum lindemuthianum) and abiotic (drought; salinity; cold temperature) stresses. The efficiency of amplification curves and quantification cycle (Cq) were determined using LinRegPCR software. The stability of the candidate reference genes was obtained using geNorm and NormFinder software, whereas the normalization of differential expression of target genes [beta-1,3-glucanase 1 (BG1) gene for biotic stress and dehydration responsive element binding (DREB) gene for abiotic stress] was defined by REST software. High stability was obtained for insulin degrading enzyme (IDE), actin-11 (Act11), unknown 1 (Ukn1) and unknown 2 (Ukn2) genes during biotic stress, and for SKP1/ASK-interacting protein 16 (Skip16), Act11, Tubulin beta-8 (beta-Tub8) and Ukn1 genes under abiotic stresses. However, IDE and Act11 were indicated as the best combination of reference genes for biotic stress analysis, whereas the Skip16 and Act11 genes were the best combination to study abiotic stress. These genes should be useful in the normalization of gene expression by RT-PCR analysis in common bean, the most important edible legume.
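
The normalization step described in this abstract can be illustrated with a minimal sketch. It assumes an efficiency-corrected relative expression ratio in which the target gene is normalized against the geometric mean of two or more reference genes. The function names and the numbers are hypothetical and are not taken from the study, and the sketch does not reproduce what LinRegPCR, geNorm, NormFinder or REST actually compute.

```python
from math import prod


def geometric_mean(values):
    """Geometric mean of a non-empty sequence of positive numbers."""
    values = list(values)
    return prod(values) ** (1.0 / len(values))


def relative_expression(e_target, dcq_target, references):
    """Efficiency-corrected expression ratio (treated vs. control).

    e_target   -- amplification factor of the target gene (2.0 = 100% efficiency)
    dcq_target -- Cq(control) - Cq(treated) for the target gene
    references -- list of (amplification factor, delta Cq) pairs, one per
                  reference gene (hypothetical input layout)
    """
    target_term = e_target ** dcq_target
    # Normalize against the geometric mean of the reference-gene terms.
    reference_term = geometric_mean(e ** dcq for e, dcq in references)
    return target_term / reference_term


# Hypothetical values: the target shifts by 3 cycles, both references by 1 cycle.
ratio = relative_expression(2.0, 3.0, [(2.0, 1.0), (2.0, 1.0)])
print(ratio)
```

With perfect efficiency for all genes, the ratio reduces to 2 raised to the difference between the target shift and the reference shift. This is why an unstable reference gene distorts the result directly.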

Relevance:

100.00% 100.00%

Abstract:

Background: The sequencing of the D. melanogaster genome revealed an unexpectedly small number of genes (~14,000), indicating that mechanisms acting on the generation of transcript diversity must have played a major role in the evolution of complex metazoans. Among the most extensively used mechanisms that account for this diversity is alternative splicing. It is estimated that over 40% of Drosophila protein-coding genes contain one or more alternative exons. A recent transcription map of Drosophila embryogenesis indicates that 30% of the transcribed regions are unannotated, and that one third of these are estimated to be missed or alternative exons of previously characterized protein-coding genes. Therefore, the identification of the variety of expressed transcripts depends on experimental data for its final validation and is continuously being performed using different approaches. We applied the Open Reading Frame Expressed Sequence Tags (ORESTES) methodology, which is capable of generating cDNA data from the central portion of rare transcripts, in order to investigate the presence of hitherto unannotated regions of the Drosophila transcriptome. Results: Bioinformatic analysis of 1,303 Drosophila ORESTES clusters identified 68 sequences derived from unannotated regions in the current Drosophila genome version (4.3). Of these, a set of 38 was analysed by polyA+ northern blot hybridization, validating 17 (50%) new exons of low abundance transcripts. For one of these ESTs, we obtained the cDNA encompassing the complete coding sequence of a new serine protease, named SP212. The SP212 gene is part of a serine protease gene cluster located in the chromosome region 88A12-B1. This cluster includes the predicted genes CG9631, CG9649 and CG31326, which were previously identified as up-regulated after immune challenges in genomic-scale microarray analysis.
In agreement with the proposal that this locus is co-regulated in response to infection by microorganisms, we show here that SP212 is also up-regulated upon injury. Conclusion: Using the ORESTES methodology we identified 17 novel exons from low abundance Drosophila transcripts, and through a PCR approach the complete CDS of one of these transcripts was defined. Our results show that computational identification and manual inspection are not sufficient to annotate a genome in the absence of experimentally derived data.

Relevance:

100.00% 100.00%

Abstract:

Background: Some organisms can survive extreme desiccation by entering a state of suspended animation known as anhydrobiosis. The free-living mycophagous nematode Aphelenchus avenae can be induced to enter anhydrobiosis by pre-exposure to moderate reductions in relative humidity (RH) prior to extreme desiccation. This preconditioning phase is thought to allow modification of the transcriptome by activation of genes required for desiccation tolerance. Results: To identify such genes, a panel of expressed sequence tags (ESTs) enriched for sequences upregulated in A. avenae during preconditioning was created. A subset of 30 genes with significant matches in databases, together with a number of apparently novel sequences, were chosen for further study. Several of the recognisable genes are associated with water stress, encoding, for example, two new hydrophilic proteins related to the late embryogenesis abundant (LEA) protein family. Expression studies confirmed EST panel members to be upregulated by evaporative water loss, and the majority of genes were also induced by osmotic stress and cold, but rather fewer by heat. We attempted to use RNA interference (RNAi) to demonstrate the importance of this gene set for anhydrobiosis, but found A. avenae to be recalcitrant with the techniques used. Instead, therefore, we developed a cross-species RNAi procedure using A. avenae sequences in another anhydrobiotic nematode, Panagrolaimus superbus, which is amenable to gene silencing. Of 20 A. avenae ESTs screened, a significant reduction in survival of desiccation in treated P. superbus populations was observed with two sequences, one of which was novel, while the other encoded a glutathione peroxidase. To confirm a role for glutathione peroxidases in anhydrobiosis, RNAi with cognate sequences from P. superbus was performed and was also shown to reduce desiccation tolerance in this species.
Conclusions: This study has identified and characterised the expression profiles of members of the anhydrobiotic gene set in A. avenae. It also demonstrates the potential of RNAi for the analysis of anhydrobiosis and provides the first genetic data to underline the importance of effective antioxidant systems in metazoan desiccation tolerance.

Relevance:

100.00% 100.00%

Abstract:

Introduction: Enterococcus faecalis is a member of the mammalian gastrointestinal microbiota but has been considered a leading cause of hospital-acquired infections. In the oral cavity, it is commonly detected in root canals of teeth with failed endodontic treatment. However, little is known about the virulence and genetic relatedness among E. faecalis isolates from different clinical sources. This study compared the presence of enterococcal virulence factors among root canal strains and clinical isolates from hospitalized patients to identify virulent clusters of E. faecalis. Methods: Multilocus sequence typing analysis was used to determine genetic lineages of 40 E. faecalis clinical isolates from different sources. Virulence clusters were determined by evaluating capsule (cps) locus polymorphisms, pathogenicity island gene content, and antibiotic resistance genes by polymerase chain reaction. Results: The clinical isolates from hospitalized patients formed a phylogenetically separate group and were mostly grouped in the clonal complex 2, which is a known virulent cluster of E. faecalis that has caused infection outbreaks globally. The clonal complex 2 group comprised capsule-producing strains harboring multiple antibiotic resistance and pathogenicity island genes. On the other hand, the endodontic isolates were more diverse and harbored few virulence and antibiotic resistance genes. In particular, although more closely related to isolates from hospitalized patients, capsule-producing E. faecalis strains from root canals did not carry more virulence/antibiotic genes than other endodontic isolates. Conclusions: E. faecalis isolates from endodontic infections have a genetic and virulence profile different from pathogenic clusters of hospitalized patients’ isolates, which is most likely due to niche specialization conferred mainly by variable regions in the genome.

Relevance:

100.00% 100.00%

Abstract:

Sequence-specific biomolecular analysis methods are proving extremely useful, particularly in view of the Human Genome Project, for detecting single nucleotide polymorphisms (SNPs) and for identifying genes. Because of the large number of base pairs to be analyzed, sensitive and efficient screening methods are needed that can process DNA samples in a suitable manner. Most detection schemes involve the interaction of an immobilized probe with the corresponding target at a surface. Analysis of the kinetic behavior of oligonucleotides on the sensor surface is therefore of the utmost importance for improving existing detection schemes. Recently, surface plasmon field-enhanced fluorescence spectroscopy (SPFS) was developed. It is a kinetic analysis and detection method that operates with dual recording, i.e. of the change in reflectivity and of the fluorescence signal, for interfacial phenomena. Using SPFS, kinetic measurements can be performed on the sensor surface for hybridization between DNA and peptide nucleic acid (PNA), a synthetic nucleic acid that mimics DNA and forms a more stable double helix. By means of single, global, and titration experiments with both a fully complementary sequence and a mismatch sequence, the rate constants for the binding reaction of the oligomeric DNA target or the PCR target to the PNA can be determined on the basis of the Langmuir model. In addition, the influences of ionic strength and temperature on PNA/DNA hybridization were shown in a kinetic analysis.
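
The Langmuir model named in this abstract can be sketched as follows. This is a minimal illustration of the standard 1:1 Langmuir binding kinetics, in which surface coverage rises toward an equilibrium value during association and decays exponentially during rinsing. The function names and rate constants are hypothetical and are not values from the thesis.

```python
import math


def langmuir_association(t, conc, k_on, k_off):
    """Fractional surface coverage at time t during target injection.

    Solves d(theta)/dt = k_on * conc * (1 - theta) - k_off * theta
    with theta(0) = 0 (no target bound at the start).
    """
    k_obs = k_on * conc + k_off        # observed rate constant
    theta_eq = k_on * conc / k_obs     # equilibrium coverage
    return theta_eq * (1.0 - math.exp(-k_obs * t))


def langmuir_dissociation(t, theta0, k_off):
    """Coverage at time t after switching to pure buffer (conc = 0)."""
    return theta0 * math.exp(-k_off * t)


# Hypothetical constants: k_on in 1/(M s), k_off in 1/s, concentration in M.
k_on, k_off = 1.0e4, 1.0e-3
k_d = k_off / k_on                     # dissociation constant, here 1e-7 M
coverage = langmuir_association(1.0e5, k_d, k_on, k_off)
print(coverage)
```

When the target concentration equals the dissociation constant, the long-time coverage approaches one half. This offers a quick sanity check on fitted rate constants.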

Relevance:

100.00% 100.00%

Abstract:

The research presented in my PhD thesis is part of a wider European project, FishPopTrace, focused on traceability of fish populations and products. My work was aimed at developing and analyzing novel genetic tools for a widely distributed marine fish species, the European hake (Merluccius merluccius), in order to investigate population genetic structure and explore potential applications to traceability scenarios. A total of 395 SNPs (Single Nucleotide Polymorphisms) were discovered from a massive collection of Expressed Sequence Tags, obtained by high-throughput sequencing, and validated on 19 geographic samples from the Atlantic and Mediterranean. Genome-scan approaches were applied to identify polymorphisms in genes potentially under divergent selection (outlier SNPs), which show higher genetic differentiation among populations than the average observed across loci. Comparative analyses of population structure were carried out on putative neutral and outlier loci at wide (Atlantic and Mediterranean samples) and regional (samples within each basin) spatial scales, to disentangle the effects of demographic and adaptive evolutionary forces on the genetic structure of European hake populations. Results demonstrated the potential of outlier loci to unveil fine-scale genetic structure, possibly identifying locally adapted populations, despite the weak signal shown by putative neutral SNPs. The application of outlier SNPs within the framework of fishery resources management was also explored. A minimum panel of SNP markers showing maximum discriminatory power was selected and applied to a traceability scenario aiming at identifying the basin (and hence the stock) of origin, Atlantic or Mediterranean, of individual fish. This case study illustrates how molecular analytical technologies have operational potential in real-world contexts and, more specifically, potential to support fisheries control and enforcement and fish and fish product traceability.
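
The idea of an outlier SNP, a locus whose differentiation among populations is well above the average across loci, can be illustrated with a toy calculation. The sketch assumes biallelic SNPs, equally weighted populations, and a heterozygosity-based estimate of differentiation. The cut-off of twice the mean is an arbitrary illustration. It is not the genome-scan method used in the thesis, which would normally compare loci against a neutral expectation. All names and allele frequencies are hypothetical.

```python
def locus_fst(freqs):
    """Differentiation at one biallelic locus from per-population allele frequencies.

    Computes (H_T - H_S) / H_T, where H_T is the expected heterozygosity of the
    pooled populations and H_S the mean within-population heterozygosity.
    """
    p_bar = sum(freqs) / len(freqs)
    h_total = 2.0 * p_bar * (1.0 - p_bar)
    if h_total == 0.0:
        return 0.0  # locus is monomorphic everywhere, so no differentiation
    h_within = sum(2.0 * p * (1.0 - p) for p in freqs) / len(freqs)
    return (h_total - h_within) / h_total


def flag_outliers(freqs_by_locus, factor=2.0):
    """Names of loci whose value exceeds `factor` times the mean (arbitrary rule)."""
    fst = {name: locus_fst(freqs) for name, freqs in freqs_by_locus.items()}
    mean_fst = sum(fst.values()) / len(fst)
    return sorted(name for name, value in fst.items() if value > factor * mean_fst)


# Hypothetical allele frequencies in two basins (e.g. Atlantic, Mediterranean).
snps = {
    "snp_a": [0.50, 0.52],
    "snp_b": [0.30, 0.31],
    "snp_c": [0.10, 0.90],
}
flagged = flag_outliers(snps)
print(flagged)
```

Loci flagged in this way are the natural candidates for a small assignment panel, since they separate the two basins far better than the near-neutral loci.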

Relevance:

100.00% 100.00%

Abstract:

Nitrogen and water are essential for plant growth and development. In this study, we designed experiments to produce gene expression data of poplar roots under nitrogen starvation and water deprivation conditions. We found that a low concentration of nitrogen led first to increased root elongation, followed by lateral root proliferation and eventually increased root biomass. To identify genes regulating root growth and development under nitrogen starvation and water deprivation, we designed a series of data analysis procedures, through which we successfully identified biologically important genes. Differentially Expressed Genes (DEGs) analysis identified the genes that are differentially expressed under nitrogen starvation or drought. Protein domain enrichment analysis identified enriched themes (in the same domains) that are highly interactive during the treatment. Gene Ontology (GO) enrichment analysis allowed us to identify biological processes that changed during nitrogen starvation. Based on the above analyses, we examined the local Gene Regulatory Network (GRN) and identified a number of transcription factors. Testing showed that one of them is a transcription factor ranked high in the hierarchy that affects root growth under nitrogen starvation. Analyzing gene expression data manually is tedious and time-consuming. To avoid manual analysis, we automated a computational pipeline that can now be used for identification of DEGs and protein domain analysis in a single run. It is implemented in Perl and R scripts.

Relevance:

100.00% 100.00%

Abstract:

Frequent loss of heterozygosity (LOH) at specific chromosomal regions is highly associated with the inactivation of tumor suppressor genes (TSGs) (Weinberg, 1991; Bishop, 1989). Chromosome 8p is the most frequently reported site of LOH (∼60%) in prostate cancer (PC), suggesting that there may be inactivated TSG(s) involved in PC on chromosome 8p (Bergerheim et al., 1991; Kagan et al., 1995). In order to identify the smallest common regions of frequent LOH (SCLs) on chromosome 8, we screened 52 PC patient/tumor samples with 39 polymorphic markers in successive screenings. In the course of refining the SCLs, we identified 3 tumors with >6 Mb homozygous deletions (HZDs) at 8p22 and 8p21, suggesting the presence of candidate TSGs at both loci. These HZDs spanned the two SCLs at 8p22 (46%) and 8p21 (45%). The SCLs were narrowed to 3.2 cM at 8p22 and less than 3 cM at 8p21. In order to identify candidate TSGs within the SCLs on 8p, two approaches were used. In the candidate gene approach, thirty genes that mapped to the SCLs were evaluated for expression in normal prostate and in PC cell lines. One of the candidate genes, Clusterin, showed decreased expression in 4/7 (57%) prostate cancer cell lines by Northern blot analysis. Clusterin will be further examined as a candidate TSG. The second approach involved utilizing subtractive hybridization and hybrid affinity capture to generate pools of expressed sequence tags (ESTs) enriched for genes that are downregulated or deleted in PC and that map to specific regions of interest. We took advantage of a prostate cancer cell line (PC3) with a known HZD of a candidate TSG, CTNNA1 on 5q31, to develop and validate a model system. We then developed subtracted libraries enriched for 8p22 and 8p21 ESTs by this method, using two cell lines, MDAPCa-2b and PC3. The ESTs were cloned, and 40 were sequenced and evaluated for expression in normal prostate and PC cell lines.
Three ESTs from the subtracted libraries, C2, C17 and F12, showed decreased expression in 29–57% of the prostate tumor cell lines studied, and will be further examined as candidate TSGs.

Relevance:

100.00% 100.00%

Abstract:

Retinitis pigmentosa (RP) is a name given to a group of inherited retinal dystrophies that lead to progressive photoreceptor degeneration, and thus, visual impairment. It is evident at both the clinical and the molecular level that these are heterogeneous disorders, with wide variation in severity, mode of inheritance, and phenotype. The genetics of RP are not simple; the disease can be inherited in dominant, recessive, X-linked, and digenic modes. Autosomal dominant RP (adRP) results from mutations in at least ten mapped loci, but there may be dozens of genetic loci where mutations can cause RP. To date, there are over a hundred genes known to cause retinal degenerative diseases, and fewer than half of these have been cloned (RetNet). Among the dozens of retinitis pigmentosa loci known to exist, only a few have been identified and the remainder are inferred from linkage studies. Today, the genes for seven of the twelve adRP loci have been identified, and these are rhodopsin, peripherin/RDS, NRL, ROM1, CRX, RP13 and RP1. My research projects involved a combination of the continued search for genes involved in retinal dystrophies, as well as investigation of the role of peripherin/RDS and RP1 in the disease etiology of autosomal dominant RP. Most of the mutations leading to inherited retinal disorders have been identified in predominantly retina-expressed genes like rhodopsin, peripherin/RDS, and RP1. Expressed sequence tags (ESTs) that were retina-specific were culled from sequence databases and, together with laboratory analysis, were analyzed as potential candidate genes for retinal dystrophies. Thirteen of the fifty-five identified retina-specific ESTs mapped to within candidate regions for inherited retinopathies. One of these is RP1L1, a homologue of RP1 and a potential cause of adRP.
Once a disease-associated gene has been identified, elucidating the role of that gene in the visual process is essential for understanding what happens when the process is defective, as it is in adRP. My next projects involved investigating the role of a novel 5′ donor +3 splice site mutation on the mRNA of peripherin/RDS in adRP-affected individuals, and comparative sequencing in RP1 to define conserved regions of the protein. Comparative sequencing is a powerful way to delineate critical regions of a sequence because different regions of a gene have different functions, and each region is subject to different levels of functional or structural constraints. Establishing a framework of conserved domains is beneficial not only for structural or functional studies, but can also aid in determining the potential effects of mutations. With the completion of sequencing of the human genome and those of other organisms such as Saccharomyces cerevisiae, Caenorhabditis elegans, and Drosophila, the facility of comparative sequencing will only increase in the future. Comparative sequencing has already become an established procedure for pinpointing conserved regions of a protein, and is an efficient way to target regions of a protein for experimental and/or evolutionary analysis.

Relevance:

100.00% 100.00%

Abstract:

Unique, small sequences (sequence tag sites) have been identified at the 3′ ends of most human genes that serve as landmarks in genome mapping. We investigated whether a single copy gene could be isolated directly from total human DNA by transformation-associated recombination (TAR) cloning in yeast using a short, 3′ unique target. A TAR cloning vector was constructed that, when linearized, contained a small amount (381 bp) of 3′ hypoxanthine phosphoribosyltransferase (HPRT) sequence at one end and a 189-bp Alu repeat at the other end. Transformation with this vector along with human DNA led to selective isolation of the entire HPRT gene as yeast artificial chromosomes (YACs) that extended from the 3′ end sequence to various Alu positions as much as 600 kb upstream. These YACs were retrofitted with a NeoR and a bacterial artificial chromosome (BAC) sequence to transfer the YACs to bacteria and subsequently the BACs to mouse cells by using a Neo selection. Most of the HPRT isolates were functional, demonstrating that TAR cloning retains the functional integrity of the isolated material. Thus, this modified version of TAR cloning, which we refer to as radial TAR cloning, can be used to isolate large segments of the human genome accurately and directly with only a small amount of sequence information.