974 resultados para construction techniques
Resumo:
Identifying the correct sense of a word in context is crucial for many tasks in natural language processing (machine translation is an example). State-of-the art methods for Word Sense Disambiguation (WSD) build models using hand-crafted features that usually capturing shallow linguistic information. Complex background knowledge, such as semantic relationships, are typically either not used, or used in specialised manner, due to the limitations of the feature-based modelling techniques used. On the other hand, empirical results from the use of Inductive Logic Programming (ILP) systems have repeatedly shown that they can use diverse sources of background knowledge when constructing models. In this paper, we investigate whether this ability of ILP systems could be used to improve the predictive accuracy of models for WSD. Specifically, we examine the use of a general-purpose ILP system as a method to construct a set of features using semantic, syntactic and lexical information. This feature-set is then used by a common modelling technique in the field (a support vector machine) to construct a classifier for predicting the sense of a word. In our investigation we examine one-shot and incremental approaches to feature-set construction applied to monolingual and bilingual WSD tasks. The monolingual tasks use 32 verbs and 85 verbs and nouns (in English) from the SENSEVAL-3 and SemEval-2007 benchmarks; while the bilingual WSD task consists of 7 highly ambiguous verbs in translating from English to Portuguese. The results are encouraging: the ILP-assisted models show substantial improvements over those that simply use shallow features. In addition, incremental feature-set construction appears to identify smaller and better sets of features. Taken together, the results suggest that the use of ILP with diverse sources of background knowledge provide a way for making substantial progress in the field of WSD.
Resumo:
Generalized hyper competitiveness in the world markets has determined the need to offer better products to potential and actual clients in order to mark an advantagefrom other competitors. To ensure the production of an adequate product, enterprises need to work on the efficiency and efficacy of their business processes (BPs) by means of the construction of Interactive Information Systems (IISs, including Interactive Multimedia Documents) so that they are processed more fluidly and correctly.The construction of the correct IIS is a major task that can only be successful if the needs from every intervenient are taken into account. Their requirements must bedefined with precision, extensively analyzed and consequently the system must be accurately designed in order to minimize implementation problems so that the IIS isproduced on schedule and with the fewer mistakes as possible. The main contribution of this thesis is the proposal of Goals, a software (engineering) construction process which aims at defining the tasks to be carried out in order to develop software. This process defines the stakeholders, the artifacts, and the techniques that should be applied to achieve correctness of the IIS. Complementarily, this process suggests two methodologies to be applied in the initial phases of the lifecycle of the Software Engineering process: Process Use Cases for the phase of requirements, and; MultiGoals for the phases of analysis and design. Process Use Cases is a UML-based (Unified Modeling Language), goal-driven and use case oriented methodology for the definition of functional requirements. It uses an information oriented strategy in order to identify BPs while constructing the enterprise’s information structure, and finalizes with the identification of use cases within the design of these BPs. This approach provides a useful tool for both activities of Business Process Management and Software Engineering. MultiGoals is a UML-based, use case-driven and architectural centric methodology for the analysis and design of IISs with support for Multimedia. It proposes the analysis of user tasks as the basis of the design of the: (i) user interface; (ii) the system behaviour that is modeled by means of patterns which can combine Multimedia and standard information, and; (iii) the database and media contents. This thesis makes the theoretic presentation of these approaches accompanied with examples from a real project which provide the necessary support for the understanding of the used techniques.
Resumo:
The way of organization of the constitutional jurisdiction implies the possibility to extend the democratization of the same one in function of the popular participation in the active legitimacy to constitutional process (procedimentalist model) e, at the same time, to assure technical viable decisions fast and to the complex problems of the constitucional law (substancialist model). The comparison with the constitutional jurisdiction of U.S.A. becomes interesting from the knowledge of the wide power to decide experience of Supreme the Court that for a methodology of construction of rights and not simply of interpretation of the Constitution, brought up to date and reconstructed throughout its historical evolution the direction of the norms of basic rights and the North American principles constitutional. Construction while constitutional hermeneutic method of substancialist matrix works with techniques as the measurement of principles, the protection of interests of minorities and the entailing of the basic rights with values politicians, what it can be brought to evidence of the Brazilian constitutional jurisdiction in order to improve the construction of basic rights that comes being carried through for the judicial ativism in control of the diffuse and abstract constitutionality. To define the limits of construction is to search, on the other hand, a dialogue with the procedimentalists thesis, aiming at the widening of the participation of the citizen in the construction of the basic rights for the constitutional process and to argue forms of the society to evaluate the pronounced decisions activist in the controls diffuse and abstract of constitutionality
Resumo:
We present the exact construction of Riemannian (or stringy) instantons, which are classical solutions of 2D Yang-Mills theories that interpolate between initial and final string configurations. They satisfy the Hitchin equations with special boundary conditions. For the case of U(2) gauge group those equations can be written as the sinh-Gordon equation with a delta-function source. Using the techniques of integrable theories based on the zero curvature conditions, we show that the solution is a condensate of an infinite number of one-solitons with the same topological charge and with all possible rapidities.
Resumo:
We suggest a method for constructing trial eigenfunctions for excited states to be used in the variational method. This method is a generalization of the one that uses a superpotential to obtain the trial functions for the ground state. The construction of an effective hierarchy of Hamiltonians is used to determine excited variational energies. The first four eigenvalues for a quartic double-well potential are calculated for several values of the potential parameter. The results are in very good agreement with the eigenvalues obtained by numerical integration.
Resumo:
A major challenge in cancer radiotherapy is to deliver a lethal dose of radiation to the target volume while minimizing damage to the surrounding normal tissue. We have proposed a model on how treatment efficacy might be improved by interfering with biological responses to DNA damage using exogenous electric fields as a strategy to drastically reduce radiation doses in cancer therapy. This approach is demonstrated at this Laboratory through case studies with prokaryotes (bacteria) and eukaryotes (yeast) cells, in which cellkilling rates induced by both gamma radiation and exogenous electric fields were measured. It was found that when cells exposed to gamma radiation are immediately submitted to a weak electric field, cell death increases more than an order of magnitude compared to the effect of radiation alone. This finding suggests, although does not prove, that DNA damage sites are reached and recognized by means of long-range electric DNA-protein interaction, and that exogenous electric fields could destructively interfere with this process. As a consequence, DNA repair is avoided leading to massive cell death. Here we are proposing the use this new technique for the design and construction of novel radiotherapy facilities associated with linac generated gamma beams under controlled conditions of dose and beam intensity.
Resumo:
Our main purpose in this study was to quantify biological tissue in computed tomography (CT) examinations with the aim of developing a skull and a chest patient equivalent phantom (PEP), both specific to infants, aged between 1 and 5 years old. This type of phantom is widely used in the development of optimization procedures for radiographic techniques, especially in computed radiography (CR) systems. In order to classify and quantify the biological tissue, we used a computational algorithm developed in Matlab (R). The algorithm performed a histogram of each CT slice followed by a Gaussian fitting of each tissue type. The algorithm determined the mean thickness for the biological tissues (bone, soft, fat, and lung) and also converted them into the corresponding thicknesses of the simulator material (aluminum, PMMA, and air). We retrospectively analyzed 148 CT examinations of infant patients, 56 for skull exams and 92 were for chest. The results provided sufficient data to construct a phantom to simulate the infant chest and skull in the posterior anterior or anterior posterior (PA/AP) view. Both patient equivalent phantoms developed in this study can be used to assess physical variables such as noise power spectrum (NPS) and signal to noise ratio (SNR) or perform dosimetric control specific to pediatric protocols.
Resumo:
Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES)
Resumo:
Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)
Resumo:
The first part of the research project of the Co-Advisorship Ph.D Thesis was aimed to select the best Bifidobacterium longum strains suitable to set the basis of our study. We were looking for strains with the abilities to colonize the intestinal mucosa and with good adhesion capacities, so that we can test these strains to investigate their ability to induce apoptosis in “damaged” intestinal cells. Adhesion and apoptosis are the two process that we want to study to better understand the role of an adhesion protein that we have previously identified and that have top scores homologies with the recent serpin encoding gene identified in B. longum by Nestlè researchers. Bifidobacterium longum is a probiotic, known for its beneficial effects to the human gut and even for its immunomodulatory and antitumor activities. Recently, many studies have stressed out the intimate relation between probiotic bacteria and the GIT mucosa and their influence on human cellular homeostasis. We focused on the apoptotic deletion of cancer cells induced by B. longum. This has been valued in vitro, performing the incubation of three B.longum strains with enterocyte-like Caco- 2 cells, to evidence DNA fragmentation, a cornerstone of apoptosis. The three strains tested were defined for their adhesion properties using adhesion and autoaggregation assays. These features are considered necessary to select a probiotic strain. The three strains named B12, B18 and B2990 resulted respectively: “strong adherent”, “adherent” and “non adherent”. Then, bacteria were incubated with Caco-2 cells to investigate apoptotic deletion. Cocultures of Caco-2 cells with B. longum resulted positive in DNA fragmentation test, only when adherent strains were used (B12 and B18). These results indicate that the interaction with adherent B. longum can induce apoptotic deletion of Caco-2 cells, suggesting a role in cellular homeostasis of the gastrointestinal tract and in restoring the ecology of damaged colon tissues. These results were used to keep on researching and the strains tested were used as recipient of recombinant techniques aimed to originate new B.longum strains with enhanced capacity of apoptotic induction in “damaged” intestinal cells. To achieve this new goal it was decided to clone the serpin encoding gene of B. longum, so that we can understand its role in adhesion and apoptosis induction. Bifidobacterium longum has immunostimulant activity that in vitro can lead to apoptotic response of Caco-2 cell line. It secretes a hypothetical eukaryotic type serpin protein, which could be involved in this kind of deletion of damaged cells. We had previously characterised a protein that has homologies with the hypothetical serpin of B. longum (DD087853). In order to create Bifidobacterium serpin transformants, a B. longum cosmid library was screened with a PCR protocol using specific primers for serpin gene. After fragment extraction, the insert named S1 was sub-cloned into pRM2, an Escherichia coli - Bifidobacterium shuttle vector, to construct pRM3. Several protocols for B. longum transformation were performed and the best efficiency was obtained using MRS medium and raffinose. Finally bacterial cell supernatants were tested in a dotblot assay to detect antigens presence against anti-antitrypsin polyclonal antibody. The best signal was produced by one starin that has been renamed B. longum BLKS 7. Our research study was aimed to generate transformants able to over express serpin encoding gene, so that we can have the tools for a further study on bacterial apoptotic induction of Caco-2 cell line. After that we have originated new trasformants the next step to do was to test transformants abilities when exposed to an intestinal cell model. In fact, this part of the project was achieved in the Department of Biochemistry of the Medical Faculty of the University of Maribor, guest of the abroad supervisor of the Co-Advisorship Doctoral Thesis: Prof. Avrelija Cencic. In this study we examined the probiotic ability of some bacterial strains using intestinal cells from a 6 years old pig. The use of intestinal mammalian cells is essential to study this symbiosis and a functional cell model mimics a polarised epithelium in which enterocytes are separated by tight junctions. In this list of strains we have included the Bifidobacterium longum BKS7 transformant strain that we have previously originated; in order to compare its abilities. B. longum B12 wild type and B. longum BKS7 transformant and eight Lactobacillus strains of different sources were co-cultured with porcine small intestine epithelial cells (PSI C1) and porcine blood monocytes (PoM2) in Transwell filter inserts. The strains, including Lb. gasseri, Lb. fermentum, Lb. reuterii, Lb. plantarum and unidentified Lactobacillus from kenyan maasai milk and tanzanian coffee, were assayed for activation of cell lines, measuring nitric oxide by Griess reaction, H202 by tetramethylbenzidine reaction and O2 - by cytochrome C reduction. Cytotoxic effect by crystal violet staining and induction on metabolic activity by MTT cell proliferation assay were tested too. Transepithelial electrical resistance (TER) of polarised PSI C1 was measured during 48 hours co-culture. TER, used to observe epithelium permeability, decrease during pathogenesis and tissue becomes permeable to ion passive flow lowering epithelial barrier function. Probiotics can prevent or restore increased permeability. Lastly, dot-blot was achieved against Interleukin-6 of treated cells supernatants. The metabolic activity of PoM2 and PSI C1 increased slightly after co-culture not affecting mitochondrial functions. No strain was cytotoxic over PSI C1 and PoM2 and no cell activation was observed, as measured by the release of NO2, H202 and O2 - by PoM2 and PSI C1. During coculture TER of polarised PSI C1 was two-fold higher comparing with constant TER (~3000 ) of untreated cells. TER raise generated by bacteria maintains a low permeability of the epithelium. During treatment Interleukin-6 was detected in cell supernatants at several time points, confirming immunostimulant activity. All results were obtained using Lactobacillus paracasei Shirota e Carnobacterium divergens as controls. In conclusion we can state that both the list of putative probiotic bacteria and our new transformant strain of B. longum are not harmful when exposed to intestinal cells and could be selected as probiotics, because can strengthen epithelial barrier function and stimulate nonspecific immunity of intestinal cells on a pig cell model. Indeed, we have found out that none of the strains tested that have good adhesion abilities presents citotoxicity to the intestinal cells and that non of the strains tested can induce cell lines to produce high level of ROS, neither NO2. Moreover we have assayed even the capacity of producing certain citokynes that are correlated with immune response. The detection of Interleukin-6 was assayed in all our samples, including B.longum transformant BKS 7 strain, this result indicates that these bacteria can induce a non specific immune response in the intestinal cells. In fact, when we assayed the presence of Interferon-gamma in cells supernatant after bacterial exposure, we have no positive signals, that means that there is no activation of a specific immune response, thus confirming that these bacteria are not recognize as pathogen by the intestinal cells and are certainly not harmful for intestinal cells. The most important result is the measure of Trans Epithelial Electric Resistance that have shown how the intestinal barrier function get strengthen when cells are exposed to bacteria, due to a reduction of the epithelium permeability. We have now a new strain of B. longum that will be used for further studies above the mechanism of apoptotic induction to “damaged cells” and above the process of “restoring ecology”. This strain will be the basis to originate new transformant strains for Serpin encoding gene that must have better performance and shall be used one day even in clinical cases as in “gene therapy” for cancer treatment and prevention.
Resumo:
The ever-increasing spread of automation in industry puts the electrical engineer in a central role as a promoter of technological development in a sector such as the use of electricity, which is the basis of all the machinery and productive processes. Moreover the spread of drives for motor control and static converters with structures ever more complex, places the electrical engineer to face new challenges whose solution has as critical elements in the implementation of digital control techniques with the requirements of inexpensiveness and efficiency of the final product. The successfully application of solutions using non-conventional static converters awake an increasing interest in science and industry due to the promising opportunities. However, in the same time, new problems emerge whose solution is still under study and debate in the scientific community During the Ph.D. course several themes have been developed that, while obtaining the recent and growing interest of scientific community, have much space for the development of research activity and for industrial applications. The first area of research is related to the control of three phase induction motors with high dynamic performance and the sensorless control in the high speed range. The management of the operation of induction machine without position or speed sensors awakes interest in the industrial world due to the increased reliability and robustness of this solution combined with a lower cost of production and purchase of this technology compared to the others available in the market. During this dissertation control techniques will be proposed which are able to exploit the total dc link voltage and at the same time capable to exploit the maximum torque capability in whole speed range with good dynamic performance. The proposed solution preserves the simplicity of tuning of the regulators. Furthermore, in order to validate the effectiveness of presented solution, it is assessed in terms of performance and complexity and compared to two other algorithm presented in literature. The feasibility of the proposed algorithm is also tested on induction motor drive fed by a matrix converter. Another important research area is connected to the development of technology for vehicular applications. In this field the dynamic performances and the low power consumption is one of most important goals for an effective algorithm. Towards this direction, a control scheme for induction motor that integrates within a coherent solution some of the features that are commonly required to an electric vehicle drive is presented. The main features of the proposed control scheme are the capability to exploit the maximum torque in the whole speed range, a weak dependence on the motor parameters, a good robustness against the variations of the dc-link voltage and, whenever possible, the maximum efficiency. The second part of this dissertation is dedicated to the multi-phase systems. This technology, in fact, is characterized by a number of issues worthy of investigation that make it competitive with other technologies already on the market. Multiphase systems, allow to redistribute power at a higher number of phases, thus making possible the construction of electronic converters which otherwise would be very difficult to achieve due to the limits of present power electronics. Multiphase drives have an intrinsic reliability given by the possibility that a fault of a phase, caused by the possible failure of a component of the converter, can be solved without inefficiency of the machine or application of a pulsating torque. The control of the magnetic field spatial harmonics in the air-gap with order higher than one allows to reduce torque noise and to obtain high torque density motor and multi-motor applications. In one of the next chapters a control scheme able to increase the motor torque by adding a third harmonic component to the air-gap magnetic field will be presented. Above the base speed the control system reduces the motor flux in such a way to ensure the maximum torque capability. The presented analysis considers the drive constrains and shows how these limits modify the motor performance. The multi-motor applications are described by a well-defined number of multiphase machines, having series connected stator windings, with an opportune permutation of the phases these machines can be independently controlled with a single multi-phase inverter. In this dissertation this solution will be presented and an electric drive consisting of two five-phase PM tubular actuators fed by a single five-phase inverter will be presented. Finally the modulation strategies for a multi-phase inverter will be illustrated. The problem of the space vector modulation of multiphase inverters with an odd number of phases is solved in different way. An algorithmic approach and a look-up table solution will be proposed. The inverter output voltage capability will be investigated, showing that the proposed modulation strategy is able to fully exploit the dc input voltage either in sinusoidal or non-sinusoidal operating conditions. All this aspects are considered in the next chapters. In particular, Chapter 1 summarizes the mathematical model of induction motor. The Chapter 2 is a brief state of art on three-phase inverter. Chapter 3 proposes a stator flux vector control for a three- phase induction machine and compares this solution with two other algorithms presented in literature. Furthermore, in the same chapter, a complete electric drive based on matrix converter is presented. In Chapter 4 a control strategy suitable for electric vehicles is illustrated. Chapter 5 describes the mathematical model of multi-phase induction machines whereas chapter 6 analyzes the multi-phase inverter and its modulation strategies. Chapter 7 discusses the minimization of the power losses in IGBT multi-phase inverters with carrier-based pulse width modulation. In Chapter 8 an extended stator flux vector control for a seven-phase induction motor is presented. Chapter 9 concerns the high torque density applications and in Chapter 10 different fault tolerant control strategies are analyzed. Finally, the last chapter presents a positioning multi-motor drive consisting of two PM tubular five-phase actuators fed by a single five-phase inverter.
Resumo:
Water held in the unsaturated zone is important for agriculture and construction and is replenished by infiltrating rainwater. Monitoring the soil water content of clay soils using ground-penetrating radar (GPR) has not been researched, as clay soils cause attenuation of GPR signal. In this study, GPR common-midpoint soundings (CMPs) are used in the clayey soils of the Miller Run floodplain to monitor changes in the soil water content (SWC) before and after rainfall events. GPR accomplishes this task because increases in water content will increase the dielectric constant of the subsurface material, and decrease the velocity of the GPR wave. Using an empirical relationship between dielectric constant and SWC, the Topp relation, we are able to calculate a SWC from these velocity measurements. Non-invasive electromagnetics, resistivity, and seismic were performed, and from these surveys, the layering at the field site was delineated. EM characterized the horizontal variation of the soil, allowing us to target the most clay rich area. At the CMP location, resistivity indicates the vertical structure of the subsurface consists of a 40 cm thick layer with a resistivity of 100 ohm*m. Between 40 cm and 1.5 m is a layer with a resistivity of 40 ohm*m. The thickness estimates were confirmed with invasive auger and trenching methods away from the CMP location. GPR CMPs were collected relative to a July 2013 and September 2013 storm. The velocity observations from the CMPs had a precision of +/- 0.001 m/ns as assessed by repeat analysis. In the case of both storms, the GPR data showed the expected relationship between the rainstorms and calculated SWC, with the SWC increasing sharply after the rainstorm and decreasing as time passed. We compared these data to auger core samples collected at the same time as the CMPs were taken, and the volumetric analysis of the cores confirmed the trend seen in the GPR, with SWC values between 3 and 5 percent lower than the GPR estimates. Our data shows that we can, with good precision, monitor changes in the SWC of conductive soils in response to rainfall events, despite the attenuation induced by the clay.
Resumo:
Highway infrastructure plays a significant role in society. The building and upkeep of America’s highways provide society the necessary means of transportation for goods and services needed to develop as a nation. However, as a result of economic and social development, vast amounts of greenhouse gas emissions (GHG) are emitted into the atmosphere contributing to global climate change. In recognizing this, future policies may mandate the monitoring of GHG emissions from public agencies and private industries in order to reduce the effects of global climate change. To effectively reduce these emissions, there must be methods that agencies can use to quantify the GHG emissions associated with constructing and maintaining the nation’s highway infrastructure. Current methods for assessing the impacts of highway infrastructure include methodologies that look at the economic impacts (costs) of constructing and maintaining highway infrastructure over its life cycle. This is known as Life Cycle Cost Analysis (LCCA). With the recognition of global climate change, transportation agencies and contractors are also investigating the environmental impacts that are associated with highway infrastructure construction and rehabilitation. A common tool in doing so is the use of Life Cycle Assessment (LCA). Traditionally, LCA is used to assess the environmental impacts of products or processes. LCA is an emerging concept in highway infrastructure assessment and is now being implemented and applied to transportation systems. This research focuses on life cycle GHG emissions associated with the construction and rehabilitation of highway infrastructure using a LCA approach. Life cycle phases of the highway section include; the material acquisition and extraction, construction and rehabilitation, and service phases. Departing from traditional approaches that tend to use LCA as a way to compare alternative pavement materials or designs based on estimated inventories, this research proposes a shift to a context sensitive process-based approach that uses actual observed construction and performance data to calculate greenhouse gas emissions associated with highway construction and rehabilitation. The goal is to support strategies that reduce long-term environmental impacts. Ultimately, this thesis outlines techniques that can be used to assess GHG emissions associated with construction and rehabilitation operations to support the overall pavement LCA.
Resumo:
Inactivation by allelic exchange in clinical isolates of the emerging nosocomial pathogen Enterococcus faecium has been hindered by lack of efficient tools, and, in this study, transformation of clinical isolates was found to be particularly problematic. For this reason, a vector for allelic replacement (pTEX5500ts) was constructed that includes (i) the pWV01-based gram-positive repAts replication region, which is known to confer a high degree of temperature intolerance, (ii) Escherichia coli oriR from pUC18, (iii) two extended multiple-cloning sites located upstream and downstream of one of the marker genes for efficient cloning of flanking regions for double-crossover mutagenesis, (iv) transcriptional terminator sites to terminate undesired readthrough, and (v) a synthetic extended promoter region containing the cat gene for allelic exchange and a high-level gentamicin resistance gene, aph(2'')-Id, to distinguish double-crossover recombination, both of which are functional in gram-positive and gram-negative backgrounds. To demonstrate the functionality of this vector, the vector was used to construct an acm (encoding an adhesin to collagen from E. faecium) deletion mutant of a poorly transformable multidrug-resistant E. faecium endocarditis isolate, TX0082. The acm-deleted strain, TX6051 (TX0082Deltaacm), was shown to lack Acm on its surface, which resulted in the abolishment of the collagen adherence phenotype observed in TX0082. A mobilizable derivative (pTEX5501ts) that contains oriT of Tn916 to facilitate conjugative transfer from the transformable E. faecalis strain JH2Sm::Tn916 to E. faecium was also constructed. Using this vector, the acm gene of a nonelectroporable E. faecium wound isolate was successfully interrupted. Thus, pTEX5500ts and its mobilizable derivative demonstrated their roles as important tools by helping to create the first reported allelic replacement in E. faecium; the constructed this acm deletion mutant will be useful for assessing the role of acm in E. faecium pathogenesis using animal models.
Resumo:
Complete NotI, SfiI, XbaI and BlnI cleavage maps of Escherichia coli K-12 strain MG1655 were constructed. Techniques used included: CHEF pulsed field gel electrophoresis; transposon mutagenesis; fragment hybridization to the ordered $\lambda$ library of Kohara et al.; fragment and cosmid hybridization to Southern blots; correlation of fragments and cleavage sites with EcoMap, a sequence-modified version of the genomic restriction map of Kohara et al.; and correlation of cleavage sites with DNA sequence databases. In all, 105 restriction sites were mapped and correlated with the EcoMap coordinate system.^ NotI, SfiI, XbaI and BlnI restriction patterns of five commonly used E. coli K-12 strains were compared to those of MG1655. The variability between strains, some of which are separated by numerous steps of mutagenic treatment, is readily detectable by pulsed-field gel electrophoresis. A model is presented to account for the difference between the strains on the basis of simple insertions, deletions, and in one case an inversion. Insertions and deletions ranged in size from 1 kb to 86 kb. Several of the larger features have previously been characterized and some of the smaller rearrangements can potentially account for previously reported genetic features of these strains.^ Some aspects of the frequency and distribution of NotI, SfiI, XbaI and BlnI cleavage sites were analyzed using a method based on Markov chain theory. Overlaps of Dam and Dcm methylase sites with XbaI and SfiI cleavage sites were examined. The one XbaI-Dam overlap in the database is in accord with the expected frequency of this overlap. The occurrence of certain types of SfiI-Dcm overlaps are overrepresented. Of the four subtypes of SfiI-Dcm overlap, only one has a partial inhibitory effect on the activity of SfiI. Recognition sites for all four enzymes are rarer than expected based on oligonucleotide frequency data, with this effect being much stronger for XbaI and BlnI than for NotI and SfiI. The latter two enzyme sites are rare mainly due to apparent negative selection against GGCC (both) and CGGCCG (NotI). The former two enzyme sites are rare mainly due to effects of the VSP repair system on certain di-tri- and tetranucleotides, most notably CTAG. Models are proposed to explain several of the anomalies of oligonucleotide distribution in E. coli, and the biological significance of the systems that produce these anomalies is discussed. ^