998 resultados para clone detection
Resumo:
In all applications of clone detection it is important to have precise and efficient clone identification algorithms. This paper proposes and outlines a new algorithm, KClone for clone detection that incorporates a novel combination of lexical and local dependence analysis to achieve precision, while retaining speed. The paper also reports on the initial results of a case study using an implementation of KClone with which we have been experimenting. The results indi- cate the ability of KClone to find types-1,2, and 3 clones compared to token-based and PDG-based techniques. The paper also reports results of an initial empirical study of the performance of KClone compared to CCFinderX.
Resumo:
Code clone detection helps connect developers across projects, if we do it on a large scale. The cornerstones that allow clone detection to work on a large scale are: (1) bad hashing (2) lightweight parsing using regular expressions and (3) MapReduce pipelines. Bad hashing means to determine whether or not two artifacts are similar by checking whether their hashes are identical. We show a bad hashing scheme that works well on source code. Lightweight parsing using regular expressions is our technique of obtaining entire parse trees from regular expressions, robustly and efficiently. We detail the algorithm and implementation of one such regular expression engine. MapReduce pipelines are a way of expressing a computation such that it can automatically and simply be parallelized. We detail the design and implementation of one such MapReduce pipeline that is efficient and debuggable. We show a clone detector that combines these cornerstones to detect code clones across all projects, across all versions of each project.
Resumo:
Clone detection is well established for imperative programs. It works mostly on the statement level and therefore is ill-suited for func- tional programs, whose main constituents are expressions and types. In this paper we introduce clone detection for functional programs using a new intermediate program representation, dubbed Functional Control Tree. We extend clone detection to the identi cation of non-trivial func- tional program clones based on the recursion patterns from the so-called Bird-Meertens formalism
Resumo:
This report summarises our idea of code clone detection in Haskell code and refactorings based on identified clones as it evolved in our group-of-three discussion
Resumo:
Code clones are portions of source code which are similar to the original program code. The presence of code clones is considered as a bad feature of software as the maintenance of software becomes difficult due to the presence of code clones. Methods for code clone detection have gained immense significance in the last few years as they play a significant role in engineering applications such as analysis of program code, program understanding, plagiarism detection, error detection, code compaction and many more similar tasks. Despite of all these facts, several features of code clones if properly utilized can make software development process easier. In this work, we have pointed out such a feature of code clones which highlight the relevance of code clones in test sequence identification. Here program slicing is used in code clone detection. In addition, a classification of code clones is presented and the benefit of using program slicing in code clone detection is also mentioned in this work.
Resumo:
In recent years, it has been observed that software clones and plagiarism are becoming an increased threat for one?s creativity. Clones are the results of copying and using other?s work. According to the Merriam – Webster dictionary, “A clone is one that appears to be a copy of an original form”. It is synonym to duplicate. Clones lead to redundancy of codes, but not all redundant code is a clone.On basis of this background knowledge ,in order to safeguard one?s idea and to avoid intentional code duplication for pretending other?s work as if their owns, software clone detection should be emphasized more. The objective of this paper is to review the methods for clone detection and to apply those methods for finding the extent of plagiarism occurrence among the Swedish Universities in Master level computer science department and to analyze the results.The rest part of the paper, discuss about software plagiarism detection which employs data analysis technique and then statistical analysis of the results.Plagiarism is an act of stealing and passing off the idea?s and words of another person?s as one?s own. Using data analysis technique, samples(Master level computer Science thesis report) were taken from various Swedish universities and processed in Ephorus anti plagiarism software detection. Ephorus gives the percentage of plagiarism for each thesis document, from this results statistical analysis were carried out using Minitab Software.The results gives a very low percentage of Plagiarism extent among the Swedish universities, which concludes that Plagiarism is not a threat to Sweden?s standard of education in computer science.This paper is based on data analysis, intelligence techniques, EPHORUS software plagiarism detection tool and MINITAB statistical software analysis.
Resumo:
The lack of effective tools have hampered our ability to assess the size, growth and ages of clonal plants. With Serenoa repens (saw palmetto) as a model, we introduce a novel analytical framework that integrates DNA fingerprinting and mathematical modelling to simulate growth and estimate ages of clonal plants. We also demonstrate the application of such life-history information of clonal plants to provide insight into management plans. Serenoa is an ecologically important foundation species in many Southeastern United States ecosystems; yet, many land managers consider Serenoa a troublesome invasive plant. Accordingly, management plans have been developed to reduce or eliminate Serenoa with little understanding of its life history. Using Amplified Fragment Length Polymorphisms, we genotyped 263 Serenoa and 134 Sabal etonia (a sympatric non-clonal palmetto) samples collected from a 20 X 20 m study plot in Florida scrub. Sabal samples were used to assign small field-unidentifiable palmettos to Serenoa or Sabal and also as a negative control for clone detection. We then mathematically modelled clonal networks to estimate genet ages. Our results suggest that Serenoa predominantly propagate via vegetative sprouts and 10000-year-old genets may be common, while showing no evidence of clone formation by Sabal. The results of this and our previous studies suggest that: (i) Serenoa has been part of scrub associations for thousands of years, (ii) Serenoa invasion are unlikely and (ii) once Serenoa is eliminated from local communities, its restoration will be difficult. Reevaluation of the current management tools and plans is an urgent task.
Resumo:
The lack of effective tools has hampered our ability to assess the size, growth and ages of clonal plants. With Serenoa repens (saw palmetto) as a model, we introduce a novel analytical frame work that integrates DNA fingerprinting and mathematical modelling to simulate growth and estimate ages of clonal plants. We also demonstrate the application of such life-history information of clonal plants to provide insight into management plans. Serenoa is an ecologically important foundation species in many Southeastern United States ecosystems; yet, many land managers consider Serenoa a troublesome invasive plant. Accordingly, management plans have been developed to reduce or eliminate Serenoa with little understanding of its life history. Using Amplified Fragment Length Polymorphisms, we genotyped 263 Serenoa and 134 Sabal etonia (a sympatric non-clonal palmetto) samples collected from a 20 x 20 m study plot in Florida scrub. Sabal samples were used to assign small field-unidentifiable palmettos to Serenoa or Sabal and also as a negative control for clone detection. We then mathematically modelled clonal networks to estimate genet ages. Our results suggest that Serenoa predominantly propagate via vegetative sprouts and 10000-year-old genets maybe common, while showing no evidence of clone formation by Sabal. The results of this and our previous studies suggest that: (i) Serenoa has been part of scrub associations for thousands of years, (ii) Serenoa invasions are unlikely and (ii) once Serenoa is eliminated from local communities, its restoration will be difficult. Reevaluation of the current management tools and plans is an urgent task.
Resumo:
Code patterns, including programming patterns and design patterns, are good references for programming language feature improvement and software re-engineering. However, to our knowledge, no existing research has attempted to detect code patterns based on code clone detection technology. In this study, we build upon the previous work and propose to detect and analyze code patterns from a collection of open source projects using NiPAT technology. Because design patterns are most closely associated with object-oriented languages, we choose Java and Python projects to conduct our study. The tool we use for detecting patterns is NiPAT, a pattern detecting tool originally developed for the TXL programming language based on the NiCad clone detector. We extend NiPAT for the Java and Python programming languages. Then, we try to identify all the patterns from the pattern report and classify them into several different categories. In the end of the study, we analyze all the patterns and compare the differences between Java and Python patterns.
Resumo:
The naturally occurring clonal diversity among field isolates of the major human malaria parasite Plasmodium vivax remained unexplored until the early 1990s, when improved molecular methods allowed the use of blood samples obtained directly from patients, without prior in vitro culture, for genotyping purposes. Here we briefly review the molecular strategies currently used to detect genetically distinct clones in patient-derived P. vivax samples, present evidence that multiple-clone P. vivax infections are commonly detected in areas with different levels of malaria transmission and discuss possible evolutionary and epidemiological consequences of the competition between genetically distinct clones in natural human infections. We suggest that, when two or more genetically distinct clones are present in the same host, intra-host competition for limited resources may select for P. vivax traits that represent major public health challenges, such as increased virulence, increased transmissibility and antimalarial drug resistance.
Resumo:
The naturally occurring clonal diversity among field isolates of the major human malaria parasite Plasmodium vivax remained unexplored until the early 1990s, when improved molecular methods allowed the use of blood samples obtained directly from patients, without prior in vitro culture, for genotyping purposes. Here we briefly review the molecular strategies currently used to detect genetically distinct clones in patient-derived P. vivax samples, present evidence that multiple-clone P. vivax infections are commonly detected in areas with different levels of malaria transmission and discuss possible evolutionary and epidemiological consequences of the competition between genetically distinct clones in natural human infections. We suggest that, when two or more genetically distinct clones are present in the same host, intra-host competition for limited resources may select for P. vivax traits that represent major public health challenges, such as increased virulence, increased transmissibility and antimalarial drug resistance.
Resumo:
Myelodysplastic syndrome (MDS) patients with a normal karyotype constitute a heterogeneous group from a biological standpoint and their outcome is often unpredictable. Interphase fluorescence in situ hybridization (I-FISH) studies could increase the rate of detection of abnormalities, but previous reports in the literature have been contradictory. We performed I-FISH and conventional karyotyping (G-banding) on 50 MDS patients at diagnosis, after 6 and 12 months or at any time if a transformation to acute myeloid leukemia (AML) was detected. Applying a probe-panel targeting the centromere of chromosomes 7 and 8, 5q31, 5p15.2 and 7q31, we observed one case with 5q deletion not identified by G-banding. I-FISH at 6 and 12 months confirmed the karyotype results. Eight cases transformed to AML during follow-up, but no hidden clone was detected by I-FISH in any of them. The inclusion of I-FISH during follow-up of MDS resulted in a small improvement in abnormality detection when compared with conventional G-banding.
Resumo:
Internal bacterial communities of synanthropic mites Acarus siro, Dermatophagoides farinae, Lepidoglyphus destructor, and Tyrophagus putrescentiae (Acari: Astigmata) were analyzed by culturing and culture-independent approaches from specimens obtained from laboratory colonies. Homogenates of surface-sterilized mites were used for cultivation on non-selective agar and DNA extraction. Isolated bacteria were identified by sequencing of the 16S rRNA gene. PCR amplified 16S rRNA genes were analyzed by terminal restriction fragment length polymorphism analysis (T-RFLP) and cloning sequencing. Fluorescence in situ hybridization using universal bacterial probes was used for direct bacterial localization. T-RFLP analysis of 16S rRNA gene revealed distinct species-specific bacterial communities. The results were further confirmed by cloning and sequencing (284 clones). L. destructor and D. farinae showed more diverse communities then A. siro and T. putrescentiae. In the cultivated part of the community, the mean CFUs from four mite species ranged from 5.2 × 102 to 1.4 × 103 per mite. D. farinae had significantly higher CFUs than the other species. Bacteria were located in the digestive and reproductive tract, parenchymatical tissue, and in bacteriocytes. Among the clones, Bartonella-like bacteria occurring in A. siro and T. putresecentiae represented a distinct group related to Bartonellaceae and to Bartonella-like symbionts of ants. The clones of high similarity to Xenorhabdus cabanillasii were found in L. destructor and D. farinae, and one clone related to Photorhabdus temperata in A. siro. Members of Sphingobacteriales cloned from D. farinae and A. siro clustered with the sequences of “Candidatus Cardinium hertigii” and as a separate novel cluster.
Resumo:
We used mixtures of genomic DNA from two genetically distinct isolates from Brazil, 42M and 312M, to investigate how accurately 12-locus microsatellite typing describes the overall genetic diversity and characterizes multilocus haplotypes in multiple-clone Plasmodium vivax infections. We found varying PCR amplification efficiencies of microsatellite alleles; for example, from the same 1:1 mixture of 42M and 312M DNA we amplified predominantly 312M-type alleles at 10 loci and 42M-type alleles at 2 loci. All microsatellite alleles were accurately scored in 1:0.5 and 1:0.25 312M:42M DNA mixtures, even when minor peak heights did not meet previously suggested criteria for minor allele detection in multiple-clone infections. Relative proportions of major and minor alleles were unaffected by multiple displacement amplification of template DNA prior to PCR-based microsatellite typing. Although microsatellite typing may detect minor alleles in clone mixtures, amplification biases may lead to inaccurate assignment of predominant haplotypes in multiple-clone P. vivax infections. (C) 2008 Elsevier Inc. All rights reserved.
Resumo:
Antigenic variation of the intestinal protozoan parasite Giardia lamblia is caused by an exchange of the parasite's variant surface protein (VSP) coat. Many investigations on antigenic variation were performed with G. lamblia clone GS/M-83-H7 which produces surface antigen VSP H7. To generate novel information on giardial vsp gene transcription, vsp RNA levels were assessed by quantitative reverse transcription-(RT)-PCR in both axenic VSP H7-type trophozoites and subvariants obtained after negative selection of GS/M-83-H7 trophozoites by treatment with a cytotoxic, VSP H7-specific monoclonal antibody. Our investigation was not restricted to the assessment of the sense vsp transcript levels but also included an approach aimed at the detection of complementary antisense vsp transcripts within the two trophozoite populations. We found that sense vsp H7 RNA predominated in VSP H7-type trophozoites while sense RNA from only one (vsp IVg) of 8 subvariant vsp genes totally analysed predominated in subvariant-type trophozoites. Interestingly, the two trophozoite populations exhibited a similar relative distribution regarding the vsp H7 and vsp IVg antisense RNA molecules. An analogous sense versus antisense RNA pattern was also observed when the transcripts of gene cwp 1 (encoding cyst wall protein 1) were investigated. Here, both types of RNA molecules only appeared after cwp 1 had been induced through in vitro encystation of the parasite. These findings for the first time demonstrated that giardial antisense RNA production did not occur in a constitutive manner but was directly linked to complementary sense RNA production after activation of the respective gene systems.