963 results for Genome-specific Sequence


Relevance:

30.00%

Publisher:

Abstract:

Background
The use of small interfering RNA (siRNA) molecules in animals to achieve double-stranded RNA-mediated interference (RNAi) has recently emerged as a powerful method of sequence-specific gene knockdown. As DNA-based expression of short hairpin RNA (shRNA) for RNAi may offer some advantages over chemical and in vitro synthesised siRNA, a number of vectors for expression of shRNA have been developed. These often feature polymerase III (pol. III) promoters of either mouse or human origin.
Results
To develop a shRNA expression vector specifically for bovine RNAi applications, we identified and characterised a novel bovine U6 small nuclear RNA (snRNA) promoter from bovine sequence data. This promoter is the putative bovine homologue of the human U6-8 snRNA promoter, and features a number of functional sequence elements that are characteristic of these types of pol. III promoters. A PCR based cloning strategy was used to incorporate this promoter sequence into plasmid vectors along with shRNA sequences for RNAi. The promoter was then used to express shRNAs, which resulted in the efficient knockdown of an exogenous reporter gene and an endogenous bovine gene.
Conclusion
We have mined data from the bovine genome sequencing project to identify a functional bovine U6 promoter and used the promoter sequence to construct a shRNA expression vector. The use of this native bovine promoter in shRNA expression is an important component of our future development of RNAi therapeutic and transgenic applications in bovine species.

Relevance:

30.00%

Publisher:

Abstract:

Biological sequence assembly is an essential step in sequencing the genomes of organisms. Sequence assembly is computationally intensive, especially at large scale. Parallel computing is an effective way to reduce computing time and to support the assembly of large numbers of biological fragments. The Euler sequence assembly algorithm is an innovative, recently proposed algorithm. Its advantages are that its computational complexity is polynomial and that it provides a better solution to the notorious “repeat” problem. This paper introduces the parallelization of the Euler sequence assembly algorithm. All the genome fragments generated by whole genome shotgun (WGS) sequencing are assembled as a whole rather than being divided into groups, which may incur errors due to inaccurate group partitioning. The implemented system can be run on supercomputers, networks of workstations or even networks of PCs. The experimental results demonstrate the performance of our system.
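The Eulerian-path idea behind this family of assemblers can be illustrated with a small serial sketch. It is not the parallel system described in the abstract: it assumes error-free reads, a non-empty input, and a graph that actually contains an Eulerian path, and the function names (`de_bruijn_edges`, `eulerian_path`, `assemble`) are hypothetical.

```python
from collections import defaultdict


def de_bruijn_edges(reads, k):
    """Map each (k-1)-mer to the (k-1)-mers that follow it in some read."""
    graph = defaultdict(list)
    seen = set()
    for read in reads:
        for i in range(len(read) - k + 1):
            kmer = read[i:i + k]
            if kmer in seen:  # keep each distinct k-mer as a single edge
                continue
            seen.add(kmer)
            graph[kmer[:-1]].append(kmer[1:])
    return graph


def eulerian_path(graph):
    """Hierholzer's algorithm: walk every edge exactly once."""
    in_deg = defaultdict(int)
    for successors in graph.values():
        for node in successors:
            in_deg[node] += 1
    # Prefer a start node with one more outgoing than incoming edge.
    start = next(
        (n for n in graph if len(graph[n]) - in_deg[n] == 1),
        next(iter(graph)),
    )
    remaining = {node: list(successors) for node, successors in graph.items()}
    stack, path = [start], []
    while stack:
        node = stack[-1]
        if remaining.get(node):
            stack.append(remaining[node].pop())
        else:
            path.append(stack.pop())
    return path[::-1]


def assemble(reads, k):
    """Spell the sequence along the Eulerian path of the k-mer graph."""
    path = eulerian_path(de_bruijn_edges(reads, k))
    return path[0] + "".join(node[-1] for node in path[1:])
```

Because every k-mer becomes an edge that is traversed once, a repeat shows up as a node visited several times rather than as an ambiguous overlap, which is the property the abstract alludes to.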

Relevance:

30.00%

Publisher:

Abstract:

Bovine viral diarrhea virus (BVDV) is a ubiquitous viral pathogen that affects cattle herds worldwide, causing significant economic loss. The current strategies to control BVDV infection include vaccination (modified-live or killed) and control of virus spread by enhanced biosecurity management; however, the disease remains prevalent. With the discovery of the sequence-specific method of gene silencing known as RNA interference (RNAi), a new era in antiviral therapies has begun. Here we report the efficient inhibition of BVDV replication by small interfering RNA (siRNA)- and short hairpin RNA (shRNA)-mediated gene silencing. siRNAs were generated to target the 5′ non-translated region (NTR) and the regions encoding the C, NS4B and NS5A proteins of the BVDV genome. The siRNAs were first validated using an EGFP/BVDV reporter system and were then shown to suppress BVDV-induced cytopathic effects and viral titers in cell culture with surprisingly different activities compared to the reporter system. Efficient viral suppression was then achieved by bovine 7SK-expressed BVDV-specific shRNAs. Overall, our results demonstrated the use of siRNA and shRNA-mediated gene silencing to achieve efficient inhibition of the replication of this virus in cell culture.

Relevance:

30.00%

Publisher:

Abstract:

Polymerase chain reaction (PCR) sequencing of specific viral gene segments was used to investigate the phylogenetic relationships among the orbiviruses. Sequence comparisons of the bluetongue virus (BTV) RNA3 from different regions of the world (North America, South Africa, India, Indonesia, Malaysia, Australia and the Caribbean region) showed that geographic separation had resulted in significant divergence, consistent with the evolution of distinct viral populations. There were at least 3 topotypes (Gould, 1987): the Australasian, the African-American and another topotype represented by BTV 15 isolated in Australia in 1986. The topotypes of BTV had RNA3 nucleotide sequences that differed by approximately 20 per cent. Analysis of BTV-specific gene segments from animal and insect specimens showed that bluetongue viruses had entered northern Australia from South East Asia, possibly by wind-borne vectors. Nucleotide sequence comparisons were used to show the close genetic relationship between BTV 2 (Ona-A strain) from Florida and BTV 12 from Jamaica, and to investigate the reassortment of BTV genome segments in nature. The mutation rates of the BTV RNA2 and RNA3 segments were estimated to be of the order of 10⁻⁴ nucleotide changes/site/year, similar in magnitude to that reported for other RNA viruses.

Relevance:

30.00%

Publisher:

Abstract:

Using a milk-cell cDNA sequencing approach we characterised milk-protein sequences from two monotreme species, platypus (Ornithorhynchus anatinus) and echidna (Tachyglossus aculeatus), and found a full set of caseins and casein variants. The genomic organisation of the platypus casein locus is compared with other mammalian genomes, including the marsupial opossum and several eutherians. Physical linkage of casein genes has been seen in the casein loci of all mammalian genomes examined and we confirm that this is also observed in platypus. However, we show that a recent duplication of β-casein occurred in the monotreme lineage, as opposed to more ancient duplications of α-casein in the eutherian lineage, while marsupials possess only single copies of α- and β-caseins. Despite this variability, the close proximity of the main α- and β-casein genes in an inverted tail-tail orientation and the relative orientation of the more distant kappa-casein genes are similar in all mammalian genome sequences so far available. Overall, the conservation of the genomic organisation of the caseins indicates the early, pre-monotreme development of the fundamental role of caseins during lactation. In contrast, the lineage-specific gene duplications that have occurred within the casein locus of monotremes and eutherians but not marsupials, which may have lost part of the ancestral casein locus, emphasise the independent selection on milk provision strategies to the young, most likely linked to different developmental strategies. The monotremes therefore provide insight into the ancestral drivers for lactation and how these have adapted in different lineages.

Relevance:

30.00%

Publisher:

Abstract:

Background: Much evidence has accumulated to indicate memory deficits in children with specific language impairment. However, most research has focused on working memory impairments in these children. Less is known about the functioning of other memory systems in this population.

Aims: This study examined procedural and declarative memory in young children with and without specific language impairment.

Methods & Procedures: A total of 15 children with specific language impairment and 15 non-impaired children of comparable age, gender and handedness were presented with measures of procedural and declarative memory. Procedural memory was assessed using a Serial Reaction Time (SRT) Task in which children implicitly learnt a ten-item sequence pattern. Declarative memory for verbal and visual information was assessed using paired associative learning tasks.

Outcomes & Results:
The results from the SRT Task showed the children with specific language impairment did not learn the sequence at levels comparable with the non-impaired children. On the measures of declarative memory, differences between the groups were observed on the verbal but not the visual task. The differences on the verbal declarative memory task were found after statistically controlling for differences in vocabulary and phonological short-term memory.

Conclusions & Implications:
The results were interpreted to suggest an uneven profile of memory functioning in specific language impairment. On measures of declarative memory, specific language impairment appears to be associated with difficulties learning verbal information. At the same time, procedural memory also appears to be impaired. Collectively, this study indicates multiple memory impairments in specific language impairment.

Relevance:

30.00%

Publisher:

Abstract:

This study presents a new computational method for guanine (G) and cytosine (C), or GC, content profiling based on the idea of multiple resolution sampling (MRS). The benefit of our new approach over existing techniques follows from its ability to locate significant regions without prior knowledge of the sequence or of the features being sought. The use of MRS has provided novel insights into bacterial genome composition. Key findings include those that are related to the core composition of bacterial genomes, to the identification of large genomic islands (in Enterobacterial genomes), and to the identification of surface protein determinants in human pathogenic organisms (e.g., Staphylococcus genomes). We observed that bacterial surface binding proteins maintain abnormal GC content, potentially pointing to a viral origin. This study has demonstrated that GC content holds a high informational worth and hints at many underlying evolutionary processes. For online Supplementary Material, see www.liebertonline.com.
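The abstract does not spell out how MRS works, so the following is only a rough sketch of GC-content profiling at several window sizes. The use of non-overlapping windows and the helper names (`gc_fraction`, `gc_profile`, `multi_resolution_profile`) are assumptions made for illustration, not the published method.

```python
def gc_fraction(seq):
    """Fraction of G and C bases in seq (0.0 for an empty string)."""
    seq = seq.upper()
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def gc_profile(seq, window):
    """GC fraction of consecutive, non-overlapping windows."""
    return [gc_fraction(seq[i:i + window]) for i in range(0, len(seq), window)]


def multi_resolution_profile(seq, windows):
    """Hypothetical helper: one GC profile per window size."""
    return {w: gc_profile(seq, w) for w in windows}
```

Comparing the profiles for a small and a large window shows where a region's composition departs from the genome-wide background, which is the kind of signal a genomic island would produce.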

Relevance:

30.00%

Publisher:

Abstract:

Linkage analysis is a successful procedure to associate diseases with specific genomic regions. These regions are often large, containing hundreds of genes, which makes experimental methods employed to identify the disease gene arduous and expensive. We present two methods to prioritize candidates for further experimental study: Common Pathway Scanning (CPS) and Common Module Profiling (CMP). CPS is based on the assumption that common phenotypes are associated with dysfunction in proteins that participate in the same complex or pathway. CPS applies network data derived from protein–protein interaction (PPI) and pathway databases to identify relationships between genes. CMP identifies likely candidates using a domain-dependent sequence similarity approach, based on the hypothesis that disruption of genes of similar function will lead to the same phenotype. Both algorithms use two forms of input data: known disease genes or multiple disease loci. When using known disease genes as input, our combined methods have a sensitivity of 0.52 and a specificity of 0.97 and reduce the candidate list by 13-fold. Using multiple loci, our methods successfully identify disease genes for all benchmark diseases with a sensitivity of 0.84 and a specificity of 0.63. Our combined approach prioritizes good candidates and will accelerate the disease gene discovery process.
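For readers unfamiliar with the two benchmark measures quoted above, they can be computed from a confusion matrix as in the short sketch below. The function names are hypothetical and the counts in the usage note are invented for illustration; they are not taken from the study.

```python
def sensitivity(true_pos, false_neg):
    """Share of real disease genes that the method recovers."""
    return true_pos / (true_pos + false_neg)


def specificity(true_neg, false_pos):
    """Share of non-disease genes that the method correctly rejects."""
    return true_neg / (true_neg + false_pos)
```

With made-up counts, `sensitivity(8, 2)` gives 0.8 and `specificity(90, 10)` gives 0.9. A high specificity is what shrinks the candidate list, while sensitivity measures how often the true disease gene survives the filtering.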

Relevance:

30.00%

Publisher:

Abstract:

EGF domains are extracellular protein modules cross-linked by three intradomain disulfides. Past studies suggest the existence of two types of EGF domain with three disulfides, human EGF-like (hEGF) domains and complement C1r-like (cEGF) domains, but to date no functional information has been related to the two different types, and they are not differentiated in sequence or structure databases. We have developed new sequence patterns based on the different C-termini to search specifically for the two types of EGF domains in sequence databases. The exhibited sensitivity and specificity of the new pattern-based method represent a significant advancement over the currently available sequence detection techniques. We re-annotated EGF sequences in the latest release of Swiss-Prot looking for functional relationships that might correlate with EGF type. We show that important post-translational modifications of three-disulfide EGFs, including unusual forms of glycosylation and post-translational proteolytic processing, are dependent on EGF subtype. For example, EGF domains that are shed from the cell surface and mediate intercellular signaling are all hEGFs, as are all human EGF receptor family ligands. Additional experimental data suggest that functional specialization has accompanied subtype divergence. Based on our structural analysis of EGF domains with three-disulfide bonds and comparison to laminin and integrin-like EGF domains with an additional interdomain disulfide, we propose that these hEGF and cEGF domains may have arisen from a four-disulfide ancestor by selective loss of different cysteine residues.

Relevance:

30.00%

Publisher:

Abstract:

The challenge of comparing two or more genomes that have undergone recombination and substantial amounts of segmental loss and gain has recently been addressed for small numbers of genomes. However, datasets of hundreds of genomes are now common and their sizes will only increase in the future. Multiple sequence alignment of hundreds of genomes remains an intractable problem due to quadratic increases in compute time and memory footprint. To date, most alignment algorithms are designed for commodity clusters without parallelism. Hence, we propose the design of a multiple sequence alignment algorithm on massively parallel, distributed memory supercomputers to enable research into comparative genomics on large data sets. Following the methodology of the sequential progressiveMauve algorithm, we design data structures including sequences and sorted k-mer lists on the IBM Blue Gene/P supercomputer (BG/P). Preliminary results show that we can reduce the memory footprint so that we can potentially align over 250 bacterial genomes on a single BG/P compute node. We verify our results on a dataset of E. coli, Shigella and S. pneumoniae genomes. Our implementation returns results matching those of the original algorithm but in half the time and with a quarter of the memory footprint for scaffold building. In this study, we have laid the basis for multiple sequence alignment of large-scale datasets on a massively parallel, distributed memory supercomputer, thus enabling comparison of hundreds instead of a few genome sequences within reasonable time.
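A sorted k-mer list, as mentioned above, lets two genomes be compared with a single merge pass instead of an all-against-all search. The sketch below is a simplified serial illustration, not the BG/P implementation: it assumes contiguous k-mers, keeps only k-mers that occur once per sequence, and uses hypothetical function names.

```python
from collections import Counter


def unique_sorted_kmers(seq, k):
    """Sorted (k-mer, position) pairs for k-mers occurring exactly once in seq."""
    kmers = [seq[i:i + k] for i in range(len(seq) - k + 1)]
    counts = Counter(kmers)
    return sorted((kmer, i) for i, kmer in enumerate(kmers) if counts[kmer] == 1)


def shared_unique_kmers(seq_a, seq_b, k):
    """Merge two sorted k-mer lists; return (k-mer, pos_in_a, pos_in_b) matches."""
    a = unique_sorted_kmers(seq_a, k)
    b = unique_sorted_kmers(seq_b, k)
    i = j = 0
    matches = []
    while i < len(a) and j < len(b):
        if a[i][0] == b[j][0]:
            matches.append((a[i][0], a[i][1], b[j][1]))
            i += 1
            j += 1
        elif a[i][0] < b[j][0]:
            i += 1
        else:
            j += 1
    return matches
```

The matches act as candidate anchors between the two sequences. Sorting dominates the cost, and each list can be built independently per genome, which is one reason the structure suits distributed memory.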

Relevance:

30.00%

Publisher:

Abstract:

We have identified the tRNAs which are incorporated into both wild-type human immunodeficiency virus type 1 strain IIIB (HIV-1IIIB) produced in COS-7 cells transfected with HIV-1 proviral DNA and mutant, noninfectious HIV-1Lai particles produced in a genetically engineered Vero cell line. The mutant proviral DNA contains nucleotides 678 to 8944; i.e., both long terminal repeats and the primer binding site are absent. As analyzed by two-dimensional polyacrylamide gel electrophoresis, both mutant and wild-type HIV-1 contain four major-abundance tRNA species, which include tRNA(1,2Lys), tRNA(3Lys) (the putative primer for HIV-1 reverse transcriptase) and tRNA(Ile). Identification was accomplished by comparing the electrophoretic mobilities and RNase T1 digests with those of tRNA(3Lys) and tRNA(1,2Lys) purified from human placenta and comparing the partial nucleotide sequence at the 3' end of each viral tRNA species with published tRNA sequences. Thus, the absence of the primer binding site in the mutant virus does not affect tRNA(Lys) incorporation into HIV-1. However, only the wild-type virus contains tRNA(3Lys) tightly associated with the viral RNA genome. The identification of the tightly associated tRNA as tRNA(3Lys) is based upon an electrophoretic mobility identical to that of tRNA(3Lys) and the ability of this RNA to hybridize with a tRNA(3Lys)-specific DNA probe. In addition to the four wild-type tRNA species, the mutant HIV-1-like particle contains two tRNA(His) species and three tRNA-sized species that we have been unable to identify. Their absence in wild-type virus makes it unlikely that they are required for viral infectivity.

Relevance:

30.00%

Publisher:

Abstract:

Background: Rhabdoid tumors are rare cancers of early childhood arising in the kidney, central nervous system and other organs. The majority are caused by somatic inactivating mutations or deletions affecting the tumor suppressor locus SMARCB1 [OMIM 601607]. Germ-line SMARCB1 inactivation has been reported in association with rhabdoid tumor, epithelioid sarcoma and familial schwannomatosis, underscoring the importance of accurate mutation screening to ascertain recurrence and transmission risks. We describe a rapid and sensitive diagnostic screening method, using high resolution melting (HRM), for detecting sequence variations in SMARCB1.

Methods: Amplicons encompassing the nine coding exons of SMARCB1, flanking splice site sequences and the 5' and 3' UTR were screened by both HRM and direct DNA sequencing to establish the reliability of HRM as a primary mutation screening tool. Reaction conditions were optimized with commercially available HRM mixes.

Results: The false negative rate for detecting sequence variants by HRM in our sample series was zero. Nine amplicons out of a total of 140 (6.4%) showed variant melt profiles that were subsequently shown to be false positives. Overall, nine distinct pathogenic SMARCB1 mutations were identified in a total of 19 possible rhabdoid tumors. Two tumors had two distinct mutations and two harbored a SMARCB1 deletion. Other mutations were nonsense or frame-shift mutations. The detection sensitivity of the HRM screening method was influenced by both sequence context and specific nucleotide change and varied from 1:4 to 1:1000 (variant to wild-type DNA). A novel method involving digital HRM, followed by re-sequencing, was used to confirm mutations in tumor specimens containing associated normal tissue.

Conclusions: This is the first report describing SMARCB1 mutation screening using HRM. HRM is a rapid, sensitive and inexpensive screening technology that is likely to be widely adopted in diagnostic laboratories to facilitate whole gene mutation screening.

Relevance:

30.00%

Publisher:

Abstract:

BACKGROUND: The pigeon crop is specially adapted to produce milk that is fed to newly hatched young. The process of pigeon milk production begins when the germinal cell layer of the crop rapidly proliferates in response to prolactin, which results in a mass of epithelial cells that are sloughed from the crop and regurgitated to the young. We proposed that the evolution of pigeon milk built upon the ability of avian keratinocytes to accumulate intracellular neutral lipids during the cornification of the epidermis. However, this cornification process in the pigeon crop has not been characterised.

RESULTS: We identified the epidermal differentiation complex in the draft pigeon genome scaffold and found that, like the chicken, it contained beta-keratin genes. These beta-keratin genes can be classified, based on sequence similarity, into several clusters including feather, scale and claw keratins. The cornified cells of the pigeon crop express several cornification-associated genes including cornulin, S100-A9 and A16-like, transglutaminase 6-like and the pigeon 'lactating' crop-specific annexin cp35. Beta-keratins play an important role in 'lactating' crop, with several claw and scale keratins up-regulated. Additionally, transglutaminase 5 and differential splice variants of transglutaminase 4 are up-regulated along with S100-A10.

CONCLUSIONS: This study of global gene expression in the crop has expanded our knowledge of pigeon milk production, in particular, the mechanism of cornification and lipid production. It is a highly specialised process that utilises the normal keratinocyte cellular processes to produce a targeted nutrient solution for the young at a very high turnover.

Relevance:

30.00%

Publisher:

Abstract:

Current single-locus-based analyses and candidate disease gene prediction methodologies used in genome-wide association studies (GWAS) do not capitalize on the wealth of the underlying genetic data or on the functional data available from molecular biology. Here, we analyzed GWAS data from the Wellcome Trust Case Control Consortium (WTCCC) on coronary artery disease (CAD). Gentrepid uses a multiple-locus-based approach, drawing on protein pathway- or domain-based data to make predictions. Known disease genes may be used as additional information (seeded method) or predictions can be based entirely on GWAS single nucleotide polymorphisms (SNPs) (ab initio method). We looked in detail at specific predictions made by Gentrepid for CAD and compared these with known genetic data and the scientific literature. Gentrepid was able to extract known disease genes from the candidate search space and predict plausible novel disease genes from both known and novel WTCCC-implicated loci. The disease gene candidates are consistent with known biological information. The results demonstrate that this computational approach is feasible and a valuable discovery tool for geneticists.