931 resultados para wide genome sequencing
Resumo:
Taking advantage of the ongoing Dictyostelium genome sequencing project, we have assembled >73 kb of genomic DNA in 15 contigs harbouring 15 genes and one pseudogene of Rho-related proteins. Comparison with EST sequences revealed that every gene is interrupted by at least one and up to four introns. For racC extensive alternative splicing was identified. Northern blot analysis showed that mRNAs for racA, racE, racG, racH and racI were present at all stages of development, whereas racJ and racL were expressed only at late stages. Amino acid sequences have been analysed in the context of Rho-related proteins of other organisms. Rac1a/1b/1c, RacF1/F2 and to a lesser extent RacB and the GTPase domain of RacA can be grouped in the Rac subfamily. None of the additional Dictyostelium Rho-related proteins belongs to any of the well-defined subfamilies, like Rac, Cdc42 or Rho. RacD and RacA are unique in that they lack the prenylation motif characteristic of Rho proteins. RacD possesses a 50 residue C-terminal extension and RacA a 400 residue C-terminal extension that contains a proline-rich region, two BTB domains and a novel C-terminal domain. We have also identified homologues for RacA in Drosophila and mammals, thus defining a new subfamily of Rho proteins, RhoBTB.
Resumo:
FULL-malaria is a database for a full-length-enriched cDNA library from the human malaria parasite Plasmodium falciparum (http://133.11.149.55/). Because of its medical importance, this organism is the first target for genome sequencing of a eukaryotic pathogen; the sequences of two of its 14 chromosomes have already been determined. However, for the full exploitation of this rapidly accumulating information, correct identification of the genes and study of their expression are essential. Using the oligo-capping method, we have produced a full-length-enriched cDNA library from erythrocytic stage parasites and performed one-pass reading. The database consists of nucleotide sequences of 2490 random clones that include 390 (16%) known malaria genes according to BLASTN analysis of the nr-nt database in GenBank; these represent 98 genes, and the clones for 48 of these genes contain the complete protein-coding sequence (49%). On the other hand, comparisons with the complete chromosome 2 sequence revealed that 35 of 210 predicted genes are expressed, and in addition led to detection of three new gene candidates that were not previously known. In total, 19 of these 38 clones (50%) were full-length. From these observations, it is expected that the database contains ∼1000 genes, including 500 full-length clones. It should be an invaluable resource for the development of vaccines and novel drugs.
Resumo:
In addition to maintaining the GenBank® nucleic acid sequence database, the National Center for Biotechnology Information (NCBI) provides data analysis and retrieval resources that operate on the data in GenBank and a variety of other biological data made available through NCBI’s Web site. NCBI data retrieval resources include Entrez, PubMed, LocusLink and the Taxonomy Browser. Data analysis resources include BLAST, Electronic PCR, OrfFinder, RefSeq, UniGene, HomoloGene, Database of Single Nucleotide Polymorphisms (dbSNP), Human Genome Sequencing, Human MapViewer, GeneMap’99, Human–Mouse Homology Map, Cancer Chromosome Aberration Project (CCAP), Entrez Genomes, Clusters of Orthologous Groups (COGs) database, Retroviral Genotyping Tools, Cancer Genome Anatomy Project (CGAP), SAGEmap, Gene Expression Omnibus (GEO), Online Mendelian Inheritance in Man (OMIM), the Molecular Modeling Database (MMDB) and the Conserved Domain Database (CDD). Augmenting many of the Web applications are custom implementations of the BLAST program optimized to search specialized data sets. All of the resources can be accessed through the NCBI home page at: http://www.ncbi.nlm.nih.gov.
Resumo:
A database (SpliceDB) of known mammalian splice site sequences has been developed. We extracted 43 337 splice pairs from mammalian divisions of the gene-centered Infogene database, including sites from incomplete or alternatively spliced genes. Known EST sequences supported 22 815 of them. After discarding sequences with putative errors and ambiguous location of splice junctions the verified dataset includes 22 489 entries. Of these, 98.71% contain canonical GT–AG junctions (22 199 entries) and 0.56% have non-canonical GC–AG splice site pairs. The remainder (0.73%) occurs in a lot of small groups (with a maximum size of 0.05%). We especially studied non-canonical splice sites, which comprise 3.73% of GenBank annotated splice pairs. EST alignments allowed us to verify only the exonic part of splice sites. To check the conservative dinucleotides we compared sequences of human non-canonical splice sites with sequences from the high throughput genome sequencing project (HTG). Out of 171 human non-canonical and EST-supported splice pairs, 156 (91.23%) had a clear match in the human HTG. They can be classified after sequence analysis as: 79 GC–AG pairs (of which one was an error that corrected to GC–AG), 61 errors corrected to GT–AG canonical pairs, six AT–AC pairs (of which two were errors corrected to AT–AC), one case was produced from a non-existent intron, seven cases were found in HTG that were deposited to GenBank and finally there were only two other cases left of supported non-canonical splice pairs. The information about verified splice site sequences for canonical and non-canonical sites is presented in SpliceDB with the supporting evidence. We also built weight matrices for the major splice groups, which can be incorporated into gene prediction programs. SpliceDB is available at the computational genomic Web server of the Sanger Centre: http://genomic.sanger.ac.uk/spldb/SpliceDB.html and at http://www.softberry.com/spldb/SpliceDB.html.
Resumo:
TIGRFAMs is a collection of protein families featuring curated multiple sequence alignments, hidden Markov models and associated information designed to support the automated functional identification of proteins by sequence homology. We introduce the term ‘equivalog’ to describe members of a set of homologous proteins that are conserved with respect to function since their last common ancestor. Related proteins are grouped into equivalog families where possible, and otherwise into protein families with other hierarchically defined homology types. TIGRFAMs currently contains over 800 protein families, available for searching or downloading at www.tigr.org/TIGRFAMs. Classification by equivalog family, where achievable, complements classification by orthology, superfamily, domain or motif. It provides the information best suited for automatic assignment of specific functions to proteins from large-scale genome sequencing projects.
Resumo:
The opportunistic pathogenic bacterium Pseudomonas aeruginosa uses quorum-sensing signaling systems as global regulators of virulence genes. There are two quorum-sensing signal receptor and signal generator pairs, LasR–LasI and RhlR–RhlI. The recently completed P. aeruginosa genome-sequencing project revealed a gene coding for a homolog of the signal receptors, LasR and RhlR. Here we describe a role for this gene, which we call qscR. The qscR gene product governs the timing of quorum-sensing-controlled gene expression and it dampens virulence in an insect model. We present evidence that suggests the primary role of QscR is repression of lasI. A qscR mutant produces the LasI-generated signal prematurely, and this results in premature transcription of a number of quorum-sensing-regulated genes. When fed to Drosophila melanogaster, the qscR mutant kills the animals more rapidly than the parental P. aeruginosa. The repression of lasI by QscR could serve to ensure that quorum-sensing-controlled genes are not activated in environments where they are not useful.
Resumo:
The adrenoleukodystrophy protein (ALDP) and the 70-kDa peroxisomal membrane protein (PMP70) are half ATP-binding cassette (ABC) transporters in the human peroxisome membrane. ALDP and PMP70 share sequence homology and both are implicated in genetic diseases. PXA1 and YKL741 are Saccharomyces cerevisiae genes that encode homologs of ALDP and PMP70. Pxa1p, a putative ortholog of ALDP, is involved in peroxisomal beta-oxidation of fatty acids while YKL741 is an open reading frame found by the yeast genome sequencing project. Here we designate YKL741 as PXA2 and show that its protein product, Pxa2p, like Pxa1p, is associated with peroxisomes but not required for their assembly. Yeast strains carrying gene disruption of PXA1, PXA2, or both have similar and, in the case of the latter, nonadditive phenotypes. We also find that the stability of Pxa1p, but not Pxa2p, is markedly reduced in the absence of the other. Finally, we find that Pxa1p and Pxa2p coimmuno-precipitate. These genetic and physical data suggest that Pxa1p and Pxa2p heterodimerize to form a complete peroxisomal ABC transporter involved in fatty acid beta-oxidation. This result predicts the presence of similar heterodimeric ABC transporters in the mammalian peroxisome membrane.
Resumo:
Os rotavírus do grupo A (RVA) são importantes causadores de diarreias virais em crianças e animais jovens de diferentes espécies, com impactos na saúde pública e animal. Visando contribuir para o entendimento e prevenção das rotaviroses assim como suas possíveis relações zoonóticas, caracterizou-se os 11 segmentos de dsRNA de rotavírus codificadores das proteínas estruturais e não estruturais presentes em amostras fecais positivas de suínos coletadas nos anos de 2012-2013, em 2 estados brasileiros. Mediante o emprego de RT-PCR, sequenciamento nucleotídico e análises filogenéticas, todos os segmentos genéticos oriundos de 12 amostras de RVA detectados em suínos foram analisados e comparados com os de outras amostras descritas previamente. As sequências obtidas para os genes codificadores das proteínas NSP2, NSP3 e VP6 contemplaram a open reading frame (ORF) completa do gene, enquanto que a ORF parcial foi determinada para os genes codificadores das proteínas VP1, VP2, VP3, VP4, VP7, NSP1, NSP4, NSP5 e NSP6. Os genotipos de rotavírus suíno provenientes das regiões amostradas concordam com os mais frequentemente descritos nesta espécie animal, apresentando, assim, uma matriz genética suína com a maioria dos segmentos pertencentes à constelação genotípica 1, com exceção dos genes codificadores das proteínas VP6 e NSP1, os quais foram os genotipos I5 e A8, respectivamente. Apesar de predominar o genotipo 1 (Wa-like) nas sequências deste estudo, a análise genômica sugere a existência de uma variação intragenotípica no genoma do rotavírus do grupo A atualmente circulante nas populações suína amostradas dos estados de São Paulo e Mato Grosso. Adicionalmente, buscou-se identificar os aminoácidos relacionados com a adaptação dos rotavírus no hospedeiro e assinaturas genéticas que distinguissem RVA suíno e humano. Para isso, as sequências obtidas neste estudo foram comparadas com outras cepas de RVA detectadas nestas duas espécies e pertencentes ao genotipo 1 (Wa-like) disponíveis no Genbank. Como resultados foram encontrados mais de 75 sítios de mudanças deaminoácidos que diferenciam RVA suíno e humano além de sítios de substituiçãopresentes em algumas proteínas virais que frequentemente covariaram entre elas. Estes resultados proporcionam um maior entendimento da diversidade viral circulante em unidades de produção suína e uma melhor compreensão dos animaiscomo reservatórios genéticos de cepas de rotavírus emergentes em humanos.
Resumo:
São inegáveis o caráter universal e a importância dos avanços tecnológicos e científicos originados das pesquisas genéticas. O sequenciamento do genoma humano, a identificação das principais sequências de DNA contidas nos seus genes e suas respectivas funções biológicas, bem como suas possíveis aplicações biomédicas, são de incalculável importância. Os genes, muito embora possam ser biologicamente caracterizados como compostos químicos, possuem um conteúdo informacional que se revela indispensável ao desenvolvimento da engenharia genética, figurando como elemento básico e central de suporte às inovações biotecnológicas. Desta forma, importante analisar a relevância da aplicação de mecanismos jurídicos como forma de fomento à contínua evolução biotecnológica sob a ótica do desenvolvimento econômico e social do país, princípios constitucionais justificadores da proteção de referidos desenvolvimentos técnicos por meio do intelecto e intervenção humanos na natureza. Para tanto, deve-se levar em consideração que a inexistência de tutela jurídica específica pode gerar desincentivo aos investimentos capazes de possibilitar o desenvolvimento de tais tecnologias, ao passo que uma tutela jurídica muito ampla poderá ocasionar indevida restrição ao acesso a tais insumos biológicos, de modo a gerar um efeito adverso àquele buscado. Assim, deve-se compatibilizar a proteção dos resultados obtidos através do desenvolvimento biotecnológico em relação à potencial dificuldade originada de uma eventual restrição ao acesso a tais elementos fundamentais à pesquisa e desenvolvimento genéticos. É neste contexto que se procura um balizamento entre os diferentes interesses e posicionamentos a respeito da patenteabilidade dos genes humanos, visando solução jurídica que permita um ambiente seguro e propício ao desenvolvimento da engenharia genética, e dos inúmeros benefícios que poderão daí se originar. O presente estudo se voltará, portanto, à análise da necessidade, condições, suficiência e extensão da tutela jurídica a ser conferida pela outorga de direitos patentários aos genes humanos.
Resumo:
Aim: High gamma diversity in tropical montane forests may be ascribed to high geographical turnover of community composition, resulting from population isolation that leads to speciation. We studied the evolutionary processes responsible for diversity and turnover in assemblages of tropical scarab beetles (Scarabaeidae) by assessing DNA sequence variation at multiple hierarchical levels. Location: A 300-km transect across six montane forests (900–1100 m) in Costa Rica. Methods: Assemblages of Scarabaeidae (subfamilies Dynastinae, Rutelinae, Melolonthinae) including 118 morphospecies and > 500 individuals were sequenced for the cox1 gene to establish species limits with a mixed Yule–coalescent method. A species-level phylogenetic tree was constructed from cox1 and rrnL genes. Total diversity and turnover among assemblages were then assessed at three hierarchical levels: haplotypes, species and higher clades. Results: DNA-based analyses showed high turnover among communities at all hierarchical levels. Turnover was highest at the haplotype level (community similarity 0.02–0.12) and decreased with each step of the hierarchy (species: 0.21–0.46; clades: 0.41–0.43). Both compositional and phylogenetic similarities of communities were geographically structured, but turnover was not correlated with distance among forests. When three major clades were investigated separately, communities of Dynastinae showed consistently higher alpha diversity, larger species ranges and lower turnover than Rutelinae and Melolonthinae. Main conclusions: Scarab communities of montane forests show evidence of evolutionary persistence of communities in relative isolation, presumably tracking suitable habitats elevationally to accommodate climatic changes. Patterns of diversity on all hierarchical levels seem to be determined by restricted dispersal, and differences in Dynastinae could be explained by their greater dispersal ability. Community-wide DNA sequencing across multiple lineages and hierarchical levels reveals the evolutionary processes that led to high beta diversity in tropical montane forests through time.
Resumo:
BACKGROUND Stiff skin syndrome and systemic or localized scleroderma are cutaneous disorders characterized by dermal fibrosis and present clinically with induration of the skin, with or without joint, internal organ or vascular involvement. OBJECTIVES To provide clinical, histological and preliminary genetic analysis of two West Highland white terrier siblings presenting with indurated skin resembling stiff skin syndrome in humans. ANIMALS Two client owned full sibling West Highland white terriers from two different litters. METHODS Clinical examination, histopathological examination and whole genome sequencing analysis of affected and unaffected West Highland white terriers. RESULTS Affected dogs exhibited markedly indurated skin that was attached firmly to the underlying tissue and incomplete closure of the mouth and eyes. No abnormalities were found by neurological or orthopaedic examination, radiographs of the head or whole body computed tomography. Histologically, the dermis and pannicular septa were thickened by a marked increase in coarse collagen fibres and a mild to moderate increase in collagen fibre diameter. The syndrome most likely follows an autosomal recessive mode of inheritance. The sequence analysis did not reveal any obvious causative variant in the investigated candidate genes ADAMTSL2 and FBN1. CONCLUSION AND CLINICAL IMPORTANCE The clinical phenotype and histopathological features of two West Highland white terrier siblings resembled stiff skin syndrome in humans. Unlike in humans, or previously described beagles with stiff skin, there was no restriction of joint mobility. Genetic analysis did not detect a candidate causative variant and warrants further research.
Resumo:
Petunia hybrida is a popular bedding plant that has a long history as a genetic model system. We report the whole-genome sequencing and assembly of inbred derivatives of its two wild parents, P. axillaris N and P. inflata S6. The assemblies include 91.3% and 90.2% coverage of their diploid genomes (1.4 Gb; 2n = 14) containing 32,928 and 36,697 protein-coding genes, respectively. The genomes reveal that the Petunia lineage has experienced at least two rounds of hexaploidization: the older gamma event, which is shared with most Eudicots, and a more recent Solanaceae event that is shared with tomato and other solanaceous species. Transcription factors involved in the shift from bee to moth pollination reside in particularly dynamic regions of the genome, which may have been key to the remarkable diversity of floral colour patterns and pollination systems. The high-quality genome sequences will enhance the value of Petunia as a model system for research on unique biological phenomena such as small RNAs, symbiosis, self-incompatibility and circadian rhythms.
Resumo:
Background Lethal chondrodysplasia (bulldog syndrome) is a well-known congenital syndrome in cattle and occurs sporadically in many breeds. In 2015, it was noticed that about 12 % of the offspring of the phenotypically normal Danish Holstein sire VH Cadiz Captivo showed chondrodysplasia resembling previously reported bulldog calves. Pedigree analysis of affected calves did not display obvious inbreeding to a common ancestor, suggesting the causative allele was not a rare recessive. The normal phenotype of the sire suggested a dominant inheritance with incomplete penetrance or a mosaic mutation. Results Three malformed calves were examined by necropsy, histopathology, radiology, and computed tomography scanning. These calves were morphologically similar and displayed severe disproportionate dwarfism and reduced body weight. The syndrome was characterized by shortening and compression of the body due to reduced length of the spine and the long bones of the limbs. The vicerocranium had severe dysplasia and palatoschisis. The bones had small irregular diaphyses and enlarged epiphyses consisting only of chondroid tissue. The sire and a total of four affected half-sib offspring and their dams were genotyped with the BovineHD SNP array to map the defect in the genome. Significant genetic linkage was obtained for several regions of the bovine genome including chromosome 5 where whole genome sequencing of an affected calf revealed a COL2A1 point mutation (g.32473300 G > A). This private sequence variant was predicted to affect splicing as it altered the conserved splice donor sequence GT at the 5’-end of COL2A1 intron 36, which was changed to AT. All five available cases carried the mutant allele in heterozygous state and all five dams were homozygous wild type. The sire VH Cadiz Captivo was shown to be a gonadal and somatic mosaic as assessed by the presence of the mutant allele at levels of about 5 % in peripheral blood and 15 % in semen. Conclusions The phenotypic and genetic findings are comparable to a previously reported COL2A1 missense mutation underlying lethal chondrodysplasia in the offspring of a mosaic French Holstein sire (Igale Masc). The identified independent spontaneous splice site variant in COL2A1 most likely caused chondrodysplasia and must have occurred during the early foetal development of the sire. This study provides a first example of a dominant COL2A1 splice site variant as candidate causal mutation of a severe lethal chondrodysplasia phenotype. Germline mosaicism is a relatively frequent mechanism in the origin of genetic disorders and explains the prevalence of a certain fraction of affected offspring. Paternal dominant de novo mutations are a risk in cattle breeding, especially because the ratio of defective offspring may be very high and be associated with significant animal welfare problems.
Resumo:
The number of mammalian transcripts identified by full-length cDNA projects and genome sequencing projects is increasing remarkably. Clustering them into a strictly nonredundant and comprehensive set provides a platform for functional analysis of the transcriptome and proteome, but the quality of the clustering and predictive usefulness have previously required manual curation to identify truncated transcripts and inappropriate clustering of closely related sequences. A Representative Transcript and Protein Sets (RTPS) pipeline was previously designed to identify the nonredundant and comprehensive set of mouse transcripts based on clustering of a large mouse full-length cDNA set (FANTOM2). Here we propose an alternative method that is more robust, requires less manual curation, and is applicable to other organisms in addition to mouse. RTPSs of human, mouse, and rat have been produced by this method and used for validation. Their comprehensiveness and quality are discussed by comparison with other clustering approaches. The RTPSs are available at ftp://fantom2.gsc.riken.go.jp/RTPS/. (C). 2004 Elsevier Inc. All rights reserved.
Resumo:
Chlamydia pneumoniae is an obligate intracellular respiratory pathogen that causes 10% of community-acquired pneumonia and has been associated with cardiovascular disease. Both whole-genome sequencing and specific gene typing suggest that there is relatively little genetic variation in human isolates of C. pneumoniae. To date, there has been little genomic analysis of strains from human cardiovascular sites. The genotypes of C. pneumoniae present in human atherosclerotic carotid plaque were analysed and several polymorphisms in the variable domain 4 (VD4) region of the outer-membrane protein-A (ompA) gene and the intergenic region between the ygeD and uridine kinase (ygeD-urk) genes were found. While one genotype was identified that was the same as one reported previously in humans (respiratory and cardiovascular), another genotype was found that was identical to a genotype from non-human sources (frog/koala).