943 resultados para Sequence motif analysis


Relevância:

100.00% 100.00%

Publicador:

Resumo:

A computer analysis of 2328 protein sequences comprising about 60% of the Escherichia coli gene products was performed using methods for database screening with individual sequences and alignment blocks. A high fraction of E. coli proteins--86%--shows significant sequence similarity to other proteins in current databases; about 70% show conservation at least at the level of distantly related bacteria, and about 40% contain ancient conserved regions (ACRs) shared with eukaryotic or Archaeal proteins. For > 90% of the E. coli proteins, either functional information or sequence similarity, or both, are available. Forty-six percent of the E. coli proteins belong to 299 clusters of paralogs (intraspecies homologs) defined on the basis of pairwise similarity. Another 10% could be included in 70 superclusters using motif detection methods. The majority of the clusters contain only two to four members. In contrast, nearly 25% of all E. coli proteins belong to the four largest superclusters--namely, permeases, ATPases and GTPases with the conserved "Walker-type" motif, helix-turn-helix regulatory proteins, and NAD(FAD)-binding proteins. We conclude that bacterial protein sequences generally are highly conserved in evolution, with about 50% of all ACR-containing protein families represented among the E. coli gene products. With the current sequence databases and methods of their screening, computer analysis yields useful information on the functions and evolutionary relationships of the vast majority of genes in a bacterial genome. Sequence similarity with E. coli proteins allows the prediction of functions for a number of important eukaryotic genes, including several whose products are implicated in human diseases.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The structure-based sequence motif of the distant proteins in evolution, protein tyrosine phosphatases (PTP) I and II superfamilies, as an example, has been defined by the structural comparison, structure-based sequence alignment and analyses on substitut

Relevância:

100.00% 100.00%

Publicador:

Resumo:

We analyzed n-mers (n=3-8) in the local environment of 8,249,446 human SNPs and compared their distribution with that in the genome reference sequences. The results revealed that the short sequences, which contained at least one CpG dinucleotide, occurred

Relevância:

100.00% 100.00%

Publicador:

Resumo:

A genome-wide view of sequence mutability in mice is still limited, although biologists usually assume the same scenario for mice as for humans. In this study, we examined the sequence context in the local environment of 482,528 mouse single nucleotide po

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Inter-simple sequence repeat (ISSR) analysis was used to assess genetic diversity among 10 pairs of male and female Laminaria gametophytes. A total of 58 amplification loci was obtained from 10 selected ISSR primers, of which 34 revealed polymorphism among the gametophytes. Genetic distances were calculated with the Dice coefficient ranging from 0.006 to 0.223. A dendrogram based on the unweighted pair-group method arithmetic (UPGMA) average showed that most male and female gametophytes of the same species were clustered together and that 10 pairs of gametophytes were divided into four groups. This was generally consistent with the taxonomic categories. The main group consisted of six pairs of gametophytes, which were selected from Laminaria japonica Aresch. by intensive inbreeding through artificial hybridization. One specific marker was cloned, but was not converted successfully into a sequence characterized amplified region (SCAR) marker. Our results demonstrate the feasibility of applying ISSR markers to evaluate Laminaria germplasm diversities.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The chromosomal genotype, as judged by multi locus sequence typing, and the episomal genotype, as judged by plasmid profile and cry gene content, were analyzed for a collection of strains of Bacillus thuringiensis. These had been recovered in vegetative form over a period of several months from the leaves of a small plot of clover (Trifolium hybridum). A clonal population structure was indicated, although greater variation in sequence types (STs) was discovered than in previous collections of B. cereus/B. thuringiensis. Isolates taken at the same time had quite different genotypes, whereas those of identical genotypes were recovered at different times. The profiles of plasmid content and cry genes generally bore no relation to each other nor to the STs. Evidently, although relatively little recombination was occurring in the seven chromosomal genes analyzed, a great deal of conjugal transfer, and perhaps recombination, was occurring involving plasmids. A clinical diarrheal isolate of B. cereus and the commercial biopesticide strain HD-1 of B. thuringiensis, both included as out-groups, were found to have very similar STs. This further emphasizes the role of episomal elements in the characteristics and differentiation of these two species.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Coccolithoviruses are giant dsDNA viruses that infect Emiliania huxleyi, the most ubiquitous marine microalga. Here, we present the genome of the latest coccolithovirus strain to be sequenced, EhV-99B1, and compare it with two other coccolithovirus genomes (EhV-86 and EhV-163). EhV-99B1 shares a pairwise nucleotide identity of 98% with EhV-163 (the two strains were isolated from the same Norwegian fjord but in different years), and just 96.5% with EhV-86 (isolated in the same spring as EhV-99B1 but in the English Channel). We confirmed and extended the list of relevant genomic differences between these EhVs from the Norwegian fjord and EhVs from the English Channel, namely the removal/insertions of: a phosphate permease, an endonuclease, a transposase, and two specific tRNAs. As a whole, this study provided new clues and insights into the diversity and mechanisms driving the evolution of these large oceanic viruses, in particular those processes involving selfish genetic elements.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

A detailed study was performed for a sample of low-mass pre-main-sequence (PMS) stars, previously identified as weak-line T Tauri stars, which are compared to members of the Tucanae and Horologium Associations. Aiming to verify if there is any pattern of abundances when comparing the young stars at different phases, we selected objects in the range from 1 to 100 Myr, which covers most of PMS evolution. High-resolution optical spectra were acquired at European Southern Observatory and Observatorio do Pico dos Dias. The stellar fundamental parameters effective temperature and gravity were calculated by excitation and ionization equilibria of iron absorption lines. Chemical abundances were obtained via equivalent width calculations and spectral synthesis for 44 per cent of the sample, which shows metallicities within 0.5 dex solar. A classification was developed based on equivalent width of Li I 6708 angstrom and Ha lines and spectral types of the studied stars. This classification allowed a separation of the sample into categories that correspond to different evolutive stages in the PMS. The position of these stars in the Hertzsprung-Russell diagram was also inspected in order to estimate their ages and masses. Among the studied objects, it was verified that our sample actually contains seven weak-line T Tauri stars, three are Classical T Tauri, 12 are Fe/Ge PMS stars and 21 are post-T Tauri or young main-sequence stars. An estimation of circumstellar luminosity was obtained using a disc model to reproduce the observed spectral energy distribution. Most of the stars show low levels of circumstellar emission, corresponding to less than 30 per cent of the total emission.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Background Parasitic wasps constitute one of the largest group of venomous animals. Although some physiological effects of their venoms are well documented, relatively little is known at the molecular level on the protein composition of these secretions. To identify the majority of the venom proteins of the endoparasitoid wasp Chelonus inanitus (Hymenoptera: Braconidae), we have randomly sequenced 2111 expressed sequence tags (ESTs) from a cDNA library of venom gland. In parallel, proteins from pure venom were separated by gel electrophoresis and individually submitted to a nano-LC-MS/MS analysis allowing comparison of peptides and ESTs sequences. Results About 60% of sequenced ESTs encoded proteins whose presence in venom was attested by mass spectrometry. Most of the remaining ESTs corresponded to gene products likely involved in the transcriptional and translational machinery of venom gland cells. In addition, a small number of transcripts were found to encode proteins that share sequence similarity with well-known venom constituents of social hymenopteran species, such as hyaluronidase-like proteins and an Allergen-5 protein. An overall number of 29 venom proteins could be identified through the combination of ESTs sequencing and proteomic analyses. The most highly redundant set of ESTs encoded a protein that shared sequence similarity with a venom protein of unknown function potentially specific of the Chelonus lineage. Venom components specific to C. inanitus included a C-type lectin domain containing protein, a chemosensory protein-like protein, a protein related to yellow-e3 and ten new proteins which shared no significant sequence similarity with known sequences. In addition, several venom proteins potentially able to interact with chitin were also identified including a chitinase, an imaginal disc growth factor-like protein and two putative mucin-like peritrophins. Conclusions The use of the combined approaches has allowed to discriminate between cellular and truly venom proteins. The venom of C. inanitus appears as a mixture of conserved venom components and of potentially lineage-specific proteins. These new molecular data enrich our knowledge on parasitoid venoms and more generally, might contribute to a better understanding of the evolution and functional diversity of venom proteins within Hymenoptera.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

We report a high-quality draft sequence of the genome of the horse (Equus caballus). The genome is relatively repetitive but has little segmental duplication. Chromosomes appear to have undergone few historical rearrangements: 53% of equine chromosomes show conserved synteny to a single human chromosome. Equine chromosome 11 is shown to have an evolutionary new centromere devoid of centromeric satellite DNA, suggesting that centromeric function may arise before satellite repeat accumulation. Linkage disequilibrium, showing the influences of early domestication of large herds of female horses, is intermediate in length between dog and human, and there is long-range haplotype sharing among breeds.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

We have searched for a minimal interaction motif in τ protein that supports the aggregation into Alzheimer-like paired helical filaments. Digestion of the repeat domain with different proteases yields a GluC-induced fragment comprising 43 residues (termed PHF43), which represents the third repeat of τ plus some flanking residues. This fragment self assembles readily into thin filaments without a paired helical appearance, but these filaments are highly competent to nucleate bona fide PHFs from full-length τ. Probing the interactions of PHF43 with overlapping peptides derived from the full τ sequence yields a minimal hexapeptide interaction motif of 306VQIVYK311 at the beginning of the third internal repeat. This motif coincides with the highest predicted β-structure potential in τ. CD and Fourier transform infrared spectroscopy shows that PHF43 acquires pronounced β structure in conditions of self assembly. Point mutations in the hexapeptide region by proline-scanning mutagenesis prevent the aggregation. The data indicate that PHF assembly is initiated by a short fragment containing the minimal interaction motif forming a local β structure embedded in a largely random-coil protein.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

We have identified an amino acid sequence in the Drosophila Transformer (Tra) protein that is capable of directing a heterologous protein to nuclear speckles, regions of the nucleus previously shown to contain high concentrations of spliceosomal small nuclear RNAs and splicing factors. This sequence contains a nucleoplasmin-like bipartite nuclear localization signal (NLS) and a repeating arginine/serine (RS) dipeptide sequence adjacent to a short stretch of basic amino acids. Sequence comparisons from a number of other splicing factors that colocalize to nuclear speckles reveal the presence of one or more copies of this motif. We propose a two-step subnuclear localization mechanism for splicing factors. The first step is transport across the nuclear envelope via the nucleoplasmin-like NLS, while the second step is association with components in the speckled domain via the RS dipeptide sequence.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Nerve growth factor-induced differentiation of adrenal chromaffin PC-12 cells to a neuronal phenotype involves alterations in gene expression and represents a model system to study neuronal differentiation. We have used the expressed-sequence-tag approach to identify approximately 600 differentially expressed mRNAs in untreated and nerve growth factor-treated PC-12 cells that encode proteins with diverse structural and biochemical functions. Many of these mRNAs encode proteins belonging to cellular pathways not previously known to be regulated by nerve growth factor. Comparative expressed-sequence-tag analysis provides a basis for surveying global changes in gene-expression patterns in response to biological signals at an unprecedented scale, is a powerful tool for identifying potential interactions between different cellular pathways, and allows the gene-expression profiles of individual genes belonging to a particular pathway to be followed.