960 resultados para sequences analysis technology
Resumo:
Raman spectroscopy of formamide-intercalated kaolinites treated using controlled-rate thermal analysis technology (CRTA), allowing the separation of adsorbed formamide from intercalated formamide in formamide-intercalated kaolinites, is reported. The Raman spectra of the CRTA-treated formamide-intercalated kaolinites are significantly different from those of the intercalated kaolinites, which display a combination of both intercalated and adsorbed formamide. An intense band is observed at 3629 cm-1, attributed to the inner surface hydroxyls hydrogen bonded to the formamide. Broad bands are observed at 3600 and 3639 cm-1, assigned to the inner surface hydroxyls, which are hydrogen bonded to the adsorbed water molecules. The hydroxyl-stretching band of the inner hydroxyl is observed at 3621 cm-1 in the Raman spectra of the CRTA-treated formamide-intercalated kaolinites. The results of thermal analysis show that the amount of intercalated formamide between the kaolinite layers is independent of the presence of water. Significant differences are observed in the CO stretching region between the adsorbed and intercalated formamide.
Resumo:
The present study has been carried out with the following objectives: i) To investigate the attributes of source parameters of local and regional earthquakes; ii) To estimate, as accurately as possible, M0, fc, Δσ and their standard errors to infer their relationship with source size; iii) To quantify high-frequency earthquake ground motion and to study the source scaling. This work is based on observational data of micro, small and moderate -earthquakes for three selected seismic sequences, namely Parkfield (CA, USA), Maule (Chile) and Ferrara (Italy). For the Parkfield seismic sequence (CA), a data set of 757 (42 clusters) repeating micro-earthquakes (0 ≤ MW ≤ 2), collected using borehole High Resolution Seismic Network (HRSN), have been analyzed and interpreted. We used the coda methodology to compute spectral ratios to obtain accurate values of fc , Δσ, and M0 for three target clusters (San Francisco, Los Angeles, and Hawaii) of our data. We also performed a general regression on peak ground velocities to obtain reliable seismic spectra of all earthquakes. For the Maule seismic sequence, a data set of 172 aftershocks of the 2010 MW 8.8 earthquake (3.7 ≤ MW ≤ 6.2), recorded by more than 100 temporary broadband stations, have been analyzed and interpreted to quantify high-frequency earthquake ground motion in this subduction zone. We completely calibrated the excitation and attenuation of the ground motion in Central Chile. For the Ferrara sequence, we calculated moment tensor solutions for 20 events from MW 5.63 (the largest main event occurred on May 20 2012), down to MW 3.2 by a 1-D velocity model for the crust beneath the Pianura Padana, using all the geophysical and geological information available for the area. The PADANIA model allowed a numerical study on the characteristics of the ground motion in the thick sediments of the flood plain.
Resumo:
At issue is whether or not isolated DNA is patent eligible under the U.S. Patent Law and the implications of that determination on public health. The U.S. Patent and Trademark Office has issued patents on DNA since the 1980s, and scientists and researchers have proceeded under that milieu since that time. Today, genetic research and testing related to the human breast cancer genes BRCA1 and BRCA2 is conducted within the framework of seven patents that were issued to Myriad Genetics and the University of Utah Research Foundation between 1997 and 2000. In 2009, suit was filed on behalf of multiple researchers, professional associations and others to invalidate fifteen of the claims underlying those patents. The Court of Appeals for the Federal Circuit, which hears patent cases, has invalidated claims for analyzing and comparing isolated DNA but has upheld claims to isolated DNA. The specific issue of whether isolated DNA is patent eligible is now before the Supreme Court, which is expected to decide the case by year's end. In this work, a systematic review was performed to determine the effects of DNA patents on various stakeholders and, ultimately, on public health; and to provide a legal analysis of the patent eligibility of isolated DNA and the likely outcome of the Supreme Court's decision. ^ A literature review was conducted to: first, identify principle stakeholders with an interest in patent eligibility of the isolated DNA sequences BRCA1 and BRCA2; and second, determine the effect of the case on those stakeholders. Published reports that addressed gene patents, the Myriad litigation, and implications of gene patents on stakeholders were included. Next, an in-depth legal analysis of the patent eligibility of isolated DNA and methods for analyzing it was performed pursuant to accepted methods of legal research and analysis based on legal briefs, federal law and jurisprudence, scholarly works and standard practice legal analysis. ^ Biotechnology, biomedical and clinical research, access to health care, and personalized medicine were identified as the principle stakeholders and interests herein. Many experts believe that the patent eligibility of isolated DNA will not greatly affect the biotechnology industry insofar as genetic testing is concerned; unlike for therapeutics, genetic testing does not require tremendous resources or lead time. The actual impact on biomedical researchers is uncertain, with greater impact expected for researchers whose work is intended for commercial purposes (versus basic science). The impact on access to health care has been surprisingly difficult to assess; while invalidating gene patents might be expected to decrease the cost of genetic testing and improve access to more laboratories and physicians' offices that provide the test, a 2010 study on the actual impact was inconclusive. As for personalized medicine, many experts believe that the availability of personalized medicine is ultimately a public policy issue for Congress, not the courts. ^ Based on the legal analysis performed in this work, this writer believes the Supreme Court is likely to invalidate patents on isolated DNA whose sequences are found in nature, because these gene sequences are a basic tool of scientific and technologic work and patents on isolated DNA would unduly inhibit their future use. Patents on complementary DNA (cDNA) are expected to stand, however, based on the human intervention required to craft cDNA and the product's distinction from the DNA found in nature. ^ In the end, the solution as to how to address gene patents may lie not in jurisprudence but in a fundamental change in business practices to provide expanded licenses to better address the interests of the several stakeholders. ^
Resumo:
分子系统学建立在实验和计算的基础之上。DNA快速测序技术的普及为分子系统学家提供了大量数据,而序列分析技术则是探索数据发现知识的重要工具。在基因组时代,随着大量模式生物完整基因组序列的获得,分子系统学正面临着前所未有的机遇和挑战。一方面,生命之树计划有助于确定新的模式生物和开展相应的基因组计划;另一方面,模式生物的基因组计划有助于阐明它们之间的进化关系和基因组的进化模式。更为重要的是,分子系统学序列分析技术已经发展成为探索与整合基因组数据的强有力工具,从而在生命科学中发挥重要作用。事实上,分子系统学和基因组学的相互渗透正在形成一门崭新的交叉学科——系统发育基因组学。 为了奠定分子系统学研究中信息管理和数据分析工作的坚实基础,我们建立了分子系统发育分析平台。该平台为研究人员提供专业数据库服务和数据分析技术支持,以及相关的网络资源。 分子系统发育分析平台包括了3个专业数据库。第一个是DNA凭证标本数据库。该数据库中的记录包括了7项字段:英文科名、中文科名、物种拉丁名、采集人、采集号、采集地和采集时间。用户可以通过设定单个或多个字段的取值进行检索。截止2004年6月1日,该数据库共包括3491条标本记录。第二个是引物数据库。PCR引物是分子系统学实验的重要条件之一。该数据库中的记录包括3项字段:引物名称、序列内容和退火温度。用户可以通过设定单个或多个字段的取值进行检索。截止2004年6月1日,该数据库共包括170条用于扩增植物细胞核、叶绿体和线粒体基因组DNA序列的引物记录。第三个是生物计算数据库。该数据库为研究人员提供传输和保存序列分析数据和结果文件的服务。 为了确保数据库的安全性和使用性,我们开发了数据库的接口和检索工具,以及系统管理员和用户资格认证程序。通过前者,使用者可以进行数据的上传、下载、管理和检索等操作。而后者则是对不同使用者身份和权限进行设定。管理员的权限高于用户,主要负责本系统的日常维护和管理工作,以及对新增管理员和用户进行资格认证。 分析技术支持旨在帮助用户快速掌握常用的系统发育分析方法,进行有效的数据分析,从复杂的统计学算法和计算机程序中解放出来,将精力集中于计算结果的生物学解释。在该部分中,我们首先简要介绍了常用的分析方法,并且针对分子系统学中的不同问题提供了相应的解决方案。这些问题包括:系统发育重建、替代速率和分歧时间的估计、祖先分布区的重建、性状进化假说的检验、以及密码子水平适应性进化的检测。我们特别强调了似然比检验和贝叶斯推测作为方法论上的重要进展在分子系统学中所发挥的关键作用。本部分还包括大量常用的分子系统学程序或软件包及其快速使用说明和命令模块。下载安装之后,用户即可按照说明使用命令模块进行数据分析。 此外,该平台还提供了一些常用的网络资源地址,如生物信息中心、分子进化和系统发育实验室、专业期刊和相关数据库等。 最后还给出了4个应用实例,即针对特定分子系统学问题的解决方案和初步的分析结果。 第一个例子说明系统发育重建方法的应用。为了确定杨梅科的系统学位置,对6种DNA序列和叶绿体trnL-F区内的间隔性状进行了分析。单个分析表明这6种序列之间在系统学信息上存在显著差异。叶绿体基因组序列的合并分析强烈支持杨梅科和(木麻黄科,(桦木科,核果桦科))的姐妹群关系,而间隔性状的存在能够充分提高其分辨率和支持率。 第二个例子说明如何推测历史生物地理学过程。我们对壳斗目8科25属植物叶绿体基因组的trnL-F、matK、rbcL和atpB的合并序列进行了最大简约分析,得到唯一的最大简约树。基于该系统树和25属植物的地理分布数据,采用扩散-替代分析方法重建了系统树每个节点上的祖先分布区,推测了壳斗目的分布历史。结果表明,壳斗目的历史生物地理学过程由3次替代事件和20次扩散事件组成。其中最重要的替代事件是由于冈瓦纳大陆和劳亚大陆分离所导致的南青冈科及其姐妹群之间的分化。另外,在壳斗科和核心高等金缕梅类中多次发生从欧亚大陆到北美洲、甚至南美洲的平行扩散事件。 第三个例子说明如何估计分歧时间。我们仍然使用扩散-替代分析中所用的最大简约树作为分析的依据,并根据等级制似然比检验确定的最优替代模型对该系统树的支长进行了最大似然优化。似然比检验表明,该系统树不服从分子钟假说。我们以冈瓦纳大陆和劳亚大陆分离的地质事件和5个属的最早化石记录作为标定点,采用罚分似然法在没有分子钟的条件下估计了壳斗目的科间分歧时间。结果表明,绝大多数科间分歧事件都发生在白垩纪。 第四个例子说明如何检测密码子水平的适应性进化。分支间可变选择压力模型的似然比检验表明SARS冠状病毒的S基因在跨种传播过程中发生了正选择。
Resumo:
The mechanism for the decomposition of hydrotalcite remains unsolved. Controlled rate thermal analysis enables this decomposition pathway to be explored. The thermal decomposition of hydrotalcites with hexacyanoferrite(II) and hexacyanoferrate(III) in the interlayer has been studied using controlled rate thermal analysis technology. X-ray diffraction shows the hydrotalcites studied have a d(003) spacing of 11.1 and 10.9 Å which compares with a d-spacing of 7.9 and 7.98 Å for the hydrotalcite with carbonate or sulphate in the interlayer. Calculations based upon CRTA measurements show that 7 moles of water is lost, proving the formula of hexacyanoferrite(II) intercalated hydrotalcite is Mg6Al2(OH)16[Fe(CN)6]0.5 .7 H2O and for the hexacyanoferrate(III) intercalated hydrotalcite is Mg6Al2(OH)16[Fe(CN)6]0.66 * 9 H2O. Dehydroxylation combined with CN unit loss occurs in three steps between a) 310 and 367°C b) 367 and 390°C and c) between 390 and 428°C for both the hexacyanoferrite(II) and hexacyanoferrate(III) intercalated hydrotalcite.
Resumo:
Acipenseriformes is an endangered primitive fish group, which occupies a special place in the history of ideas concerning fish evolution, even in vertebrate evolution. However, the classification and evolution of the fishes have been debated. The mitochondrial DNA (mtDNA) ND4L and partial ND4 genes were first sequenced in twelve species of the order Acipenseriformes, including endemic Chinese species. The following points were drawn from DNA sequences analysis: (i) the two species of Huso can be ascribed to Acipenser; (ii) A. dabryanus is the mostly closely related to A. sinensis, and most likely the landlocked form of A. sinensis; (iii) genus Acipenser in trans-Pacific region might have a common origin; (iv) mtDNA ND4L and ND4 genes are the ideal genetic markers for phylogenetic analysis of the order Acipenseriformes.
Resumo:
Acipenseriformes is an endangered primitive fish group, which occupies a special place in the history of ideas concerning fish evolution, even in vertebrate evolution. However, the classification and evolution of the fishes have been debated. The mitochondrial DNA (mtDNA) ND4L and partial ND4 genes were first sequenced in twelve species of the order Acipenseriformes, including endemic Chinese species. The following points were drawn from DNA sequences analysis: (i) the two species of Huso can be ascribed to Acipenser; (ii) A. dabryanus is the mostly closely related to A. sinensis, and most likely the landlocked form of A. sinensis; (iii) genus Acipenser in trans-Pacific region might have a common origin; (iv) mtDNA ND4L and ND4 genes are the ideal genetic markers for phylogenetic analysis of the order Acipenseriformes.
Resumo:
The discovery and clinical application of molecular biomarkers in solid tumors, increasingly relies on nucleic acid extraction from FFPE tissue sections and subsequent molecular profiling. This in turn requires the pathological review of haematoxylin & eosin (H&E) stained slides, to ensure sample quality, tumor DNA sufficiency by visually estimating the percentage tumor nuclei and tumor annotation for manual macrodissection. In this study on NSCLC, we demonstrate considerable variation in tumor nuclei percentage between pathologists, potentially undermining the precision of NSCLC molecular evaluation and emphasising the need for quantitative tumor evaluation. We subsequently describe the development and validation of a system called TissueMark for automated tumor annotation and percentage tumor nuclei measurement in NSCLC using computerized image analysis. Evaluation of 245 NSCLC slides showed precise automated tumor annotation of cases using Tissuemark, strong concordance with manually drawn boundaries and identical EGFR mutational status, following manual macrodissection from the image analysis generated tumor boundaries. Automated analysis of cell counts for % tumor measurements by Tissuemark showed reduced variability and significant correlation (p < 0.001) with benchmark tumor cell counts. This study demonstrates a robust image analysis technology that can facilitate the automated quantitative analysis of tissue samples for molecular profiling in discovery and diagnostics.
Resumo:
Trypanosoma (Megatrypanum) theileri from cattle and trypanosomes of other artiodactyls form a clade of closely related species in analyses using ribosomal sequences. Analysis of polymorphic sequences of a larger number of trypanosomes from broader geographical origins is required to evaluate the Clustering of isolates as suggested by previous studies. Here, we determined the sequences of the spliced leader (SL) genes of 21 isolates from cattle and 2 from water buffalo from distant regions of Brazil. Analysis of SL gene repeats revealed that the 5S rRNA gene is inserted within the intergenic region. Phylogeographical patterns inferred using SL sequences showed at least 5 major genotypes of T. theileri distributed in 2 strongly divergent lineages. Lineage TthI comprises genotypes IA and IB from buffalo and cattle, respectively, from the Southeast and Central regions, whereas genotype IC is restricted to cattle from the Southern region. Lineage Tth II includes cattle genotypes IIA, which is restricted to the North and Northeast, and IIB, found in the Centre, West, North and Northeast. PCR-RFLP of SL genes revealed valuable markers for genotyping T. theileri. The results of this study emphasize the genetic complexity and corroborate the geographical structuring of T. theileri genotypes found in cattle.
Resumo:
Global data-flow analysis of (constraint) logic programs, which is generally based on abstract interpretation [7], is reaching a comparatively high level of maturity. A natural question is whether it is time for its routine incorporation in standard compilers, something which, beyond a few experimental systems, has not happened to date. Such incorporation arguably makes good sense only if: • the range of applications of global analysis is large enough to justify the additional complication in the compiler, and • global analysis technology can deal with all the features of "practical" languages (e.g., the ISO-Prolog built-ins) and "scales up" for large programs. We present a tutorial overview of a number of concepts and techniques directly related to the issues above, with special emphasis on the first one. In particular, we concéntrate on novel uses of global analysis during program development and debugging, rather than on the more traditional application área of program optimization. The idea of using abstract interpretation for validation and diagnosis has been studied in the context of imperative programming [2] and also of logic programming. The latter work includes issues such as using approximations to reduce the burden posed on programmers by declarative debuggers [6, 3] and automatically generating and checking assertions [4, 5] (which includes the more traditional type checking of strongly typed languages, such as Gódel or Mercury [1, 8, 9]) We also review some solutions for scalability including modular analysis, incremental analysis, and widening. Finally, we discuss solutions for dealing with meta-predicates, side-effects, delay declarations, constraints, dynamic predicates, and other such features which may appear in practical languages. In the discussion we will draw both from the literature and from our experience and that of others in the development and use of the CIAO system analyzer. In order to emphasize the practical aspects of the solutions discussed, the presentation of several concepts will be illustrated by examples run on the CIAO system, which makes extensive use of global analysis and assertions.
Resumo:
The analysis of concurrent constraint programs is a challenge due to the inherently concurrent behaviour of its computational model. However, most implementations of the concurrent paradigm can be viewed as a computation with a fixed scheduling rule which suspends some goals so that their execution is postponed until some condition awakens them. For a certain kind of properties, an analysis defined in these terms is correct. Furthermore, it is much more tractable, and in addition can make use of existing analysis technology for the underlying fixed computation rule. We show how this can be done when the starting point is a framework for the analysis of sequential programs. The resulting analysis, which incorporates suspensions, is adequate for concurrent models where concurrency is localized, e.g. the Andorra model. We refine the analysis for this particular case. Another model in which concurrency is preferably encapsulated, and thus suspensions are local to parts of the computation, is that of CIAO. Nonetheless, the analysis scheme can be generalized to models with global concurrency. We also sketch how this could be done, and we show how the resulting analysis framework could be used for analyzing typical properties, such as suspensión freeness.
Resumo:
Constant technology advances have caused data explosion in recent years. Accord- ingly modern statistical and machine learning methods must be adapted to deal with complex and heterogeneous data types. This phenomenon is particularly true for an- alyzing biological data. For example DNA sequence data can be viewed as categorical variables with each nucleotide taking four different categories. The gene expression data, depending on the quantitative technology, could be continuous numbers or counts. With the advancement of high-throughput technology, the abundance of such data becomes unprecedentedly rich. Therefore efficient statistical approaches are crucial in this big data era.
Previous statistical methods for big data often aim to find low dimensional struc- tures in the observed data. For example in a factor analysis model a latent Gaussian distributed multivariate vector is assumed. With this assumption a factor model produces a low rank estimation of the covariance of the observed variables. Another example is the latent Dirichlet allocation model for documents. The mixture pro- portions of topics, represented by a Dirichlet distributed variable, is assumed. This dissertation proposes several novel extensions to the previous statistical methods that are developed to address challenges in big data. Those novel methods are applied in multiple real world applications including construction of condition specific gene co-expression networks, estimating shared topics among newsgroups, analysis of pro- moter sequences, analysis of political-economics risk data and estimating population structure from genotype data.
Resumo:
Thesis (Ph.D.)--University of Washington, 2016-07
Resumo:
The complex interaction of the bones of the foot has been explored in detail in recent years, which has led to the acknowledgement in the biomechanics community that the foot can no longer be considered as a single rigid segment. With the advance of motion analysis technology it has become possible to quantify the biomechanics of simplified units or segments that make up the foot. Advances in technology coupled with reducing hardware prices has resulted in the uptake of more advanced tools available for clinical gait analysis. The increased use of these techniques in clinical practice requires defined standards for modelling and reporting of foot and ankle kinematics. This systematic review aims to provide a critical appraisal of commonly used foot and ankle marker sets designed to assess kinematics and thus provide a theoretical background for the development of modelling standards.