934 resultados para gene number


Relevância:

100.00% 100.00%

Publicador:

Resumo:

Gene number can be considered a pragmatic measure of biological complexity, but reliable data is scarce. Estimates for vertebrates are 50-100,000 genes per haploid genome, whereas invertebrate estimates fall below 25,000. We wished to test the hypothesis that the origin of vertebrates coincided with extensive gene creation. A prediction is that gene number will differ sharply between invertebrate and vertebrate members of the chordate phylum. A gene number estimation method requiring limited sequence sampling of genomic DNA was developed and validated by using data for Caenorhabditis elegans. Using the method, we estimated that the invertebrate chordate Ciona intestinalis has 15,500 protein-coding genes (±3,700). This number is significantly lower than gene numbers of vertebrate chordates, but similar to those of invertebrates in distantly related phyla. The data indicate that evolution of vertebrates was accompanied by a dramatic increase in protein-coding capacity of the genome.

Relevância:

70.00% 70.00%

Publicador:

Resumo:

Teleost fish underwent whole-genome duplication around 450 Ma followed by diploidization and loss of 80-85% of the duplicated genes. To identify a deep signature of this teleost-specific whole-genome duplication (TSGD), we searched for duplicated genes that were systematically and uniquely retained in one or other of the superorders Ostariophysi and Acanthopterygii. TSGD paralogs comprised 17-21% of total gene content. Some 2.6% (510) of TSGD paralogs were present as pairs in the Ostariophysi genomes of Danio rerio (Cypriniformes) and Astyanax mexicanus (Characiformes) but not in species from four orders of Acanthopterygii (Gasterosteiformes, Gasterosteus aculeatus; Tetraodontiformes, Tetraodon nigroviridis; Perciformes, Oreochromis niloticus; and Beloniformes, Oryzias latipes) where a single copy was identified. Similarly, 1.3% (418) of total gene number represented cases where TSGD paralogs pairs were systematically retained in the Acanthopterygian but conserved as a single copy in Ostariophysi genomes. We confirmed the generality of these results by phylogenetic and synteny analysis of 40 randomly selected linage-specific paralogs (LSPs) from each superorder and completed with the transcriptomes of three additional Ostariophysi species (Ictalurus punctatus [Siluriformes], Sinocyclocheilus species [Cypriniformes], and Piaractus mesopotamicus [Characiformes]). No chromosome bias was detected in TSGD paralog retention. Gene ontology (GO) analysis revealed significant enrichment of GO terms relative to the human GO SLIM database for growth, Cell differentiation, and Embryo development in Ostariophysi and for Transport, Signal Transduction, and Vesicle mediated transport in Acanthopterygii. The observed patterns of paralog retention are consistent with different diploidization outcomes having contributed to the evolution/diversification of each superorder.

Relevância:

70.00% 70.00%

Publicador:

Resumo:

Networks exhibiting accelerating growth have total link numbers growing faster than linearly with network size and either reach a limit or exhibit graduated transitions from nonstationary-to-stationary statistics and from random to scale-free to regular statistics as the network size grows. However, if for any reason the network cannot tolerate such gross structural changes then accelerating networks are constrained to have sizes below some critical value. This is of interest as the regulatory gene networks of single-celled prokaryotes are characterized by an accelerating quadratic growth and are size constrained to be less than about 10,000 genes encoded in DNA sequence of less than about 10 megabases. This paper presents a probabilistic accelerating network model for prokaryotic gene regulation which closely matches observed statistics by employing two classes of network nodes (regulatory and non-regulatory) and directed links whose inbound heads are exponentially distributed over all nodes and whose outbound tails are preferentially attached to regulatory nodes and described by a scale-free distribution. This model explains the observed quadratic growth in regulator number with gene number and predicts an upper prokaryote size limit closely approximating the observed value. (c) 2005 Elsevier GmbH. All rights reserved.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Osteoporosis and disorders of bone fragility are highly heritable, but despite much effort the identities of few of the genes involved has been established. Recent developments in genetics such as genome-wide association studies are revolutionizing research in this field, and it is likely that further contributions will be made through application of next-generation sequencing technologies, analysis of copy number variation polymorphisms, and high-throughput mouse mutagenesis programs. This article outlines what we know about osteoporosis genetics to date and the probable future directions of research in this field.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Gene number difference among organisms demonstrates that new gene origination is a fundamental biological process in evolution. Exon shuffling has been universally observed in the formation of new genes. Yet to be learned are the ways new exons originate and evolve, and how often new exons appear. To address these questions, we identified 2695 newly evolved exons in the mouse and rat by comparing the expressed sequences of 12,419 orthologous genes between human and mouse, using 743,856 pig ESTs as the outgroup. The new exon origination rate is about 2.71 x 10(-3) per gene per million years. These new exons have markedly accelerated rates both of nonsynonymous substitutions and of insertions/ deletions (indels). A much higher proportion of new exons have Kappa(a)/Kappa(s) ratios > 1 (where K-a is the nonsynonymous substitution rate and K-s is the synonymous substitution rate) than K do the old exons shared by human and mouse, implying a role of positive selection in the rapid evolution. The majority of these new exons have sequences unique in the genome, suggesting that most new exons might originate through "exonization" of intronic sequences. Most of the new exons appear to be alternative exons that are expressed at low levels.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Germin and germin-like proteins (GLPs) are encoded by a family of genes found in all plants. They are part of the cupin superfamily of biochemically diverse proteins, a superfamily that has a conserved tertiary structure, though with limited similarity in primary sequence. The subgroups of GLPs have different enzyme functions that include the two hydrogen peroxide-generating enzymes, oxalate oxidase (OxO) and superoxide dismutase. This review summarizes the sequence and structural details of GLPs and also discusses their evolutionary progression, particularly their amplification in gene number during the evolution of the land plants. In terms of function, the GLPs are known to be differentially expressed during specific periods of plant growth and development, a pattern of evolutionary subfunctionalization. They are also implicated in the response of plants to biotic (viruses, bacteria, mycorrhizae, fungi, insects, nematodes, and parasitic plants) and abiotic (salt, heat/cold, drought, nutrient, and metal) stress. Most detailed data come from studies of fungal pathogenesis in cereals. This involvement with the protection of plants from environmental stress of various types has led to numerous plant breeding studies that have found links between GLPs and QTLs for disease and stress resistance. In addition the OxO enzyme has considerable commercial significance, based principally on its use in the medical diagnosis of oxalate concentration in plasma and urine. Finally, this review provides information on the nutritional importance of these proteins in the human diet, as several members are known to be allergenic, a feature related to their thermal stability and evolutionary connection to the seed storage proteins, also members of the cupin superfamily.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Die phylogenetische Position der Mollusken innerhalb der Trochozoa sowie die interne Evolution der Klassen der Mollusca sind weitgehend unbekannt und wurden in meiner Arbeit anhand molekularer Merkmale untersucht. Phylogenomische Analysen zeigten in der Vergangenheit eine gute Auflösung für ursprüngliche Speziationsereignisse. Daher wurden hier drei neue EST Datensätze generiert: für Sipunculus nudus (Sipuncula), Barentsia elongata (Kamptozoa) und Lepidochitona cinerea, (Polyplacophora, Mollusca). Zusätzlich wurden gezielt Gene verschiedener Mollusken mittels RT-PCR amplifiziert. rnSowohl Kamptozoen als auch Sipunculiden wurden aufgrund morphologischer Kriterien bisher als mögliche Schwestergruppe der Mollusken gehandelt, aber die hier erzielten Ergebnisse zur Evolution der Hämerythrine, Gen-Anordnungen der mitochondrialen Genome und phylogenetische Analysen der ribosomalen und der mitochondriellen Proteine stützen diese Hypothese nicht. Die Position der Kamptozoa erwies sich hier generell als unbeständig; phylogenomische Analysen deuten eine Nähe zu den Bryozoen an, aber diese Position wird stark durch die Auswahl der Taxa beeinflusst. Dagegen weisen meine Analysen klar auf eine nähere Beziehung zwischen Annelida und Sipuncula hin. Die ribosomalen Proteine zeigen Sipuncula (und Echiura) sogar als Subtaxa der Anneliden. Wie den Mollusken fehlt den Sipunculiden jegliche Segmentierung und meine Ergebnisse legen hier die Möglichkeit des Verlusts dieses Merkmals innerhalb der Anneliden bei den Sipunculiden nahe. Innerhalb der Mollusken wurden die Solenogastren bereits als Schwestergruppe aller rezenten Mollusken vorgeschlagen. Im Rahmen meiner Arbeit wurden von drei verschiedenen Solenogastren-Arten die ersten zuverlässigen 18S rRNA-Sequenzen ermittelt, und es zeigte sich, dass alle bisher veröffentlichten 18S-Sequenzen dieser Molluskenklasse höchst unvollständig oder fehlerhaft sind. rnRibosomale Proteine sind gute phylogenetische Marker und hier wurden die Auswahl und Anzahl dieser Gene für phylogenetische Analysen optimiert. Über Sonden-basierte Detektion wurde eine sampling-Strategie getestet, die im Vergleich mit standard-phylogenomischen Ansätzen zukünftige molekulare Stammbaumrekonstruktionen mit größerem Taxonsampling ermöglicht.rn

Relevância:

60.00% 60.00%

Publicador:

Resumo:

We report the construction of the mouse full-length cDNA encyclopedia, the most extensive view of a complex transcriptome, on the basis of preparing and sequencing 246 libraries. Before cloning, cDNAs were enriched in full-length by Cap-Trapper, and in most cases, aggressively subtracted/normalized. We have produced 1,442,236 successful 3'-end sequences clustered into 171,144 groups, from which 60,770 clones were fully sequenced cDNAs annotated in the FANTOM-2 annotation. We have also produced 547,149 5' end reads, which clustered into 124,258 groups. Altogether, these cDNAs were further grouped in 70,000 transcriptional units (TU), which represent the best coverage of a transcriptome so far. By monitoring the extent of normalization/subtraction, we define the tentative equivalent coverage (TEC), which was estimated to be equivalent to >12,000,000 ESTs derived from standard libraries. High coverage explains discrepancies between the very large. numbers of clusters (and TUs) of this project, which also include non-protein-coding RNAs, and the lower gene number estimation of genome annotations. Altogether, S'-end clusters identify regions that are potential promoters for 8637 known genes and S'-end clusters suggest the presence of almost 63,000 transcriptional starting points. An estimate of the frequency of polyadenylation signals suggests that at least half of the singletons in the EST set represent real mRNAs. Clones accounting for about half of the predicted TUs await further sequencing. The continued high-discovery rate suggests that the task of transcriptome discovery is not yet complete.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

AIMS The aims of the study are to characterize changes in JK-1 (FAM134B) at the DNA level in colorectal adenocarcinoma and adenoma and exploring the possible correlations with clinical and pathological features. METHOD JK-1 gene DNA copy number changes were studied in 211 colorectal carcinomas, 32 colorectal adenoma and 20 colorectal non-cancer colorectal tissue samples by real-time quantitative polymerase chain reaction. The results were correlated with clinical and pathological parameters. RESULTS Colorectal adenomas were more likely to be amplified than deleted with regard to JK-1 (FAM134B) DNA copy number change. The copy number level of JK-1 (FAM134B) DNA in colorectal adenocarcinomas was significantly lower in comparison to colorectal adenomas. Changes in JK-1 (FAM134B) DNA copy number were associated with histological subtypes, and cancer stage. Lower copy numbers were associated with higher tumor stage, lymph node stage and overall pathological stage of cancer. Conversely, higher DNA copy numbers were detected more often in the mucinous adenocarcinoma. CONCLUSIONS This is the first study showing significant correlations of the JK-1 (FAM134B) gene copy number alterations with clinical and pathological features in a large cohort of pre-invasive and invasive colorectal malignancies. The changes in DNA copy number associated with progression of colorectal malignancies reflect that JK-1 (FAM134B) gene could play a role in controlling some steps in development of the invasive phenotypes.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

BACKGROUND: Nonparametric Bayesian techniques have been developed recently to extend the sophistication of factor models, allowing one to infer the number of appropriate factors from the observed data. We consider such techniques for sparse factor analysis, with application to gene-expression data from three virus challenge studies. Particular attention is placed on employing the Beta Process (BP), the Indian Buffet Process (IBP), and related sparseness-promoting techniques to infer a proper number of factors. The posterior density function on the model parameters is computed using Gibbs sampling and variational Bayesian (VB) analysis. RESULTS: Time-evolving gene-expression data are considered for respiratory syncytial virus (RSV), Rhino virus, and influenza, using blood samples from healthy human subjects. These data were acquired in three challenge studies, each executed after receiving institutional review board (IRB) approval from Duke University. Comparisons are made between several alternative means of per-forming nonparametric factor analysis on these data, with comparisons as well to sparse-PCA and Penalized Matrix Decomposition (PMD), closely related non-Bayesian approaches. CONCLUSIONS: Applying the Beta Process to the factor scores, or to the singular values of a pseudo-SVD construction, the proposed algorithms infer the number of factors in gene-expression data. For real data the "true" number of factors is unknown; in our simulations we consider a range of noise variances, and the proposed Bayesian models inferred the number of factors accurately relative to other methods in the literature, such as sparse-PCA and PMD. We have also identified a "pan-viral" factor of importance for each of the three viruses considered in this study. We have identified a set of genes associated with this pan-viral factor, of interest for early detection of such viruses based upon the host response, as quantified via gene-expression data.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

Background: The DUB/USP17 subfamily of deubiquitinating enzymes were originally identified as immediate early genes induced in response to cytokine stimulation in mice (DUB-1, DUB-1A, DUB-2, DUB-2A). Subsequently we have identified a number of human family members and shown that one of these (DUB-3) is also cytokine inducible. We originally showed that constitutive expression of DUB-3 can block cell proliferation and more recently we have demonstrated that this is due to its regulation of the ubiquitination and activity of the 'CAAX' box protease RCE1.

Results: Here we demonstrate that the human DUB/USP17 family members are found on both chromosome 4p16.1, within a block of tandem repeats, and on chromosome 8p23.1, embedded within the copy number variable betadefensin cluster. In addition, we show that the multiple genes observed in humans and other distantly related mammals have arisen due to the independent expansion of an ancestral sequence within each species. However, it is also apparent when sequences from humans and the more closely related chimpanzee are compared, that duplication events have taken place prior to these species separating.

Conclusions: The observation that the DUB/USP17 genes, which can influence cell growth and survival, have evolved from an unstable ancestral sequence which has undergone multiple and varied duplications in the species examined marks this as a unique family. In addition, their presence within the beta-defensin repeat raises the question whether they may contribute to the influence of this repeat on immune related conditions.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

A polimicrogiria (PMG) é uma malformação do córtex cerebral causada por falhas no seu desenvolvimento, caracterizando-se por um número excessivo de pequenos giros e laminação anormal, dando à superfície cortical uma aparência irregular e grosseira. A gravidade de suas manifestações clínicas se relaciona diretamente com a extensão da malformação e das regiões cerebrais afetadas, sendo que a presença de lesões bilaterais ou unilaterais extensas indica um pior prognóstico. Uma das síndromes de polimicrogiria mais freqüentes e, conseqüentemente, mais bem descritas clinicamente, é a polimicrogiria perisylviana bilateral (PPB). Essa forma de PMG atinge a região que tange a fenda Sylviana, podendo apresentar-se tanto unilateralmente quanto em ambos os hemisférios. Vários genes têm sido relacionados a diferentes formas de polimicrogiria, são eles AFF2,TUBA1A, TUBB2B e TUBA8, SRPX2 e WDR62. Estes genes já foram estudados pelo nosso grupo de pesquisa em um grupo de pacientes compostos de casos familiares e esporádicos, acometidos em sua maioria pela forma perisylviana de PMG. Nenhuma variante deletéria foi identificada nestes genes. Recentemente um novo gene foi implicado na etiologia molecular das PMG, o TUBB3. O gene em questão pertence à mesma família de TUBA1A, TUBB2B e TUBA8 e codifica uma proteína de ligação aos microtúbulos, tendo importante papel na formação do fuso. Além deste gene, também tem sido descritas alterações genômicas, denominadas de Copy Number Variations (CNV), estas variações estruturais tem sido associadas com diversos distúrbios neurológicos, que vão desde transtornos psiquiátricos até malformações do córtex cerebral como a PMG. Desta forma, o objetivo deste trabalho foi analisar a existência de alterações de ponto deletérias no gene TUBB3 em pacientes com PMG e também, o envolvimento de CNVsna etiologia deste tipo de malformação ...

Relevância:

40.00% 40.00%

Publicador:

Resumo:

Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)