962 resultados para Alignment-free method, dissimilarity, distance, genome, phylogenetic analysis.
Resumo:
Epidemiological processes leave a fingerprint in the pattern of genetic structure of virus populations. Here, we provide a new method to infer epidemiological parameters directly from viral sequence data. The method is based on phylogenetic analysis using a birth-death model (BDM) rather than the commonly used coalescent as the model for the epidemiological transmission of the pathogen. Using the BDM has the advantage that transmission and death rates are estimated independently and therefore enables for the first time the estimation of the basic reproductive number of the pathogen using only sequence data, without further assumptions like the average duration of infection. We apply the method to genetic data of the HIV-1 epidemic in Switzerland.
Resumo:
We examined the sequence variation of mitochondrial DNA control region and cytochrome b gene of the house mouse (Mus musculus sensu lato) drawn from ca. 200 localities, with 286 new samples drawn primarily from previously unsampled portions of their Eurasian distribution and with the objective of further clarifying evolutionary episodes of this species before and after the onset of human-mediated long-distance dispersals. Phylogenetic analysis of the expanded data detected five equally distinct clades, with geographic ranges of northern Eurasia (musculus, MUS), India and Southeast Asia (castaneus, CAS), Nepal (unspecified, NEP), western Europe (domesticus, DOM) and Yemen (gentilulus). Our results confirm previous suggestions of Southwestern Asia as the likely place of origin of M. musculus and the region of Iran, Afghanistan, Pakistan, and northern India, specifically as the ancestral homeland of CAS. The divergence of the subspecies lineages and of internal sublineage differentiation within CAS were estimated to be 0.37-0.47 and 0.14-0.23 million years ago (mya), respectively, assuming a split of M. musculus and Mus spretus at 1.7 mya. Of the four CAS sublineages detected, only one extends to eastern parts of India, Southeast Asia, Indonesia, Philippines, South China, Northeast China, Primorye, Sakhalin and Japan, implying a dramatic range expansion of CAS out of its homeland during an evolutionary short time, perhaps associated with the spread of agricultural practices. Multiple and non-coincident eastward dispersal events of MUS sublineages to distant geographic areas, such as northern China, Russia and Korea, are inferred, with the possibility of several different routes.
Resumo:
There have been no reports of DNA sequences of hepatitis B virus (HBV) strains from Australian Aborigines, although the hepatitis B surface antigen (HBsAg) was discovered among them. To investigate the characteristics of DNA sequences of HBV strains from Australian Aborigines, the complete nucleotide sequences of HBV strains were determined and subjected to molecular evolutionary analysis. Serum samples positive for HBsAg were collected from five Australian Aborigines. Phylogenetic analysis of the five complete nucleotide sequences compared with DNA sequences of 54 global HBV isolates from international databases revealed that three of the five were classified into genotype D and were most closely related in terms of evolutionary distance to a strain isolated from a healthy blood donor in Papua New Guinea. Two of the five were classified into a novel variant genotype C, which has not been reported previously, and were closely related to a strain isolated from Polynesians, particularly in the X and Core genes. These two strains of variant genotype C differed from known genotype C strains by 5.9-7.4% over the complete nucleotide sequence and 4.0-5.6 % in the small-S gene, and had residues Arg(122), Thr(127) and Lys(160) characteristic of serotype ayw3, which have not been reported previously in genotype C. In conclusion, this is the first report of the characteristics of complete nucleotide sequences of HBV from Australian Aborigines. These results contribute to the investigation of the worldwide spread of HBV, the relationship between serotype and genotype and the ancient common origin of Australian Aborigines.
Resumo:
Intergenic spacers of chloroplast DNA (cpDNA) are very useful in phylogenetic and population genetic studies of plant species, to study their potential integration in phylogenetic analysis. The non-coding trnE-trnT intergenic spacer of cpDNA was analyzed to assess the nucleotide sequence polymorphism of 16 Solanaceae species and to estimate its ability to contribute to the resolution of phylogenetic studies of this group. Multiple alignments of DNA sequences of trnE-trnT intergenic spacer made the identification of nucleotide variability in this region possible and the phylogeny was estimated by maximum parsimony and rooted with Convolvulaceae Ipomoea batalas, the most closely related family. Besides, this intergenic spacer was tested for the phylogenetic ability to differentiate taxonomic levels. For this purpose, species from four other families were analyzed and compared with Solanaceae species. Results confirmed polymorphism in the trnE-trnT region at different taxonomic levels.
Resumo:
The complete genome sequences of two Brazilian wild-type rabies viruses (RABV), a BR-DR1 isolate from a haematophagous bat (Desmodus rotundus) and a BR-AL1 isolate from a frugivorous bat (Artibeus lituratus), were determined. The genomes of the BR-DR1 and RR-AL1 had 11,923 and 11,922 nt, respectively, and both encoded the five standard genes of rhabdoviruses. The complete nucleotide sequence identity between the BR-DR1 and BR-AL1 isolates was 97%. The BR-DR1 and BR-AL1 isolates had some conserved functional sites revealed by the fixed isolates, whereas both isolates had unique amino acid substitutions in the antigenic region IV of the nucleocapsid gene. Therefore, it is speculated that both isolates were nearly identical in virologic character. According to our phylogenetic analysis based on the complete genomes, both isolates belonged to genotype 1, and to the previously defined ""vampire bat-related RABV lineage"" which consisted of mainly D. rotundus- and A. lituratus- isolates; however, a branch pattern with high bootstrap values suggested that BR-DR1 was more closely related to the 9001FRA isolate, which was collected from a dog bitten by a bat in French Guiana, than to BR-AL1. This result suggests that the vampire bat-related RABV lineage includes Brazilian vampire bat and Brazilian frugivorous bat RABV and is further divided into Brazilian vampire bat and Brazilian frugivorous bat RABV sub-lineages. The phylogenetic analysis based on the complete genomes was valuable in discriminating among very closely related isolates.
Resumo:
The applicability of a meshfree approximation method, namely the EFG method, on fully geometrically exact analysis of plates is investigated. Based on a unified nonlinear theory of plates, which allows for arbitrarily large rotations and displacements, a Galerkin approximation via MLS functions is settled. A hybrid method of analysis is proposed, where the solution is obtained by the independent approximation of the generalized internal displacement fields and the generalized boundary tractions. A consistent linearization procedure is performed, resulting in a semi-definite generalized tangent stiffness matrix which, for hyperelastic materials and conservative loadings, is always symmetric (even for configurations far from the generalized equilibrium trajectory). Besides the total Lagrangian formulation, an updated version is also presented, which enables the treatment of rotations beyond the parameterization limit. An extension of the arc-length method that includes the generalized domain displacement fields, the generalized boundary tractions and the load parameter in the constraint equation of the hyper-ellipsis is proposed to solve the resulting nonlinear problem. Extending the hybrid-displacement formulation, a multi-region decomposition is proposed to handle complex geometries. A criterium for the classification of the equilibrium`s stability, based on the Bordered-Hessian matrix analysis, is suggested. Several numerical examples are presented, illustrating the effectiveness of the method. Differently from the standard finite element methods (FEM), the resulting solutions are (arbitrary) smooth generalized displacement and stress fields. (c) 2007 Elsevier Ltd. All rights reserved.
Resumo:
The complete nucleotide sequence of the genomic RNA from the insect picorna-like virus Drosophila C virus (DCV) was determined. The DCV sequence predicts a genome organization different to that of other RNA virus families whose sequences are known. The single-stranded positive-sense genomic RNA is 9264 nucleotides in length and contains two large open reading frames (ORFs) which are separated by 191 nucleotides. The 5' ORF contains regions of similarities with the RNA-dependent RNA polymerase, helicase and protease domains of viruses from the picornavirus, comovirus and sequivirus families. The 3' ORF encodes the capsid proteins as confirmed by N-terminal sequence analysis of these proteins. The capsid protein coding region is unusual in two ways: firstly the cistron appears to lack an initiating methionine and secondly no subgenomic RNA is produced, suggesting that the proteins may be translated through internal initiation of translation from the genomic length RNA. The finding of this novel genome organization for DCV shows that this virus is not a member of the Picornaviridae as previously thought, but belongs to a distinct and hitherto unrecognized virus family.
Resumo:
Porcine circovirus 2 (PCV-2) is associated with a broad range of syndromes. In this study, eight pig tissue samples from two Brazilian states were analyzed using six PCR primer pairs amplifying a 1705-bp fragment of the PCV-2 genome. The NJ distance-based method was used for the phylogenetic analysis with the eight field strains herein, 15 GenBank sequences and using PCV-1 as an out-group. This yielded two major clusters (A and B) for this viral species, with the Brazilian strains segregating with European and Asian sequences. Nucleotide identity was 99.7 to 100% among the sequences. This information can be used in further studies of pathogenesis related to PCV-2 in Brazil. (C) 2007 Elsevier Ltd. All rights reserved.
Resumo:
Although the main transmitters of rabies in Brazil are dogs and vampire bats, the role of other species such as insectivorous and frugivorous bats deserves special attention, as the rabies virus has been isolated from 36 bat species. This study describes the first isolation of the rabies virus from the insectivorous bat Eumops perotis. The infected animal was found in the city of Ribeirão Preto, São Paulo. The virus was identified by immunofluorescence antibody test (FAT) in central nervous system (CNS) samples, and the isolation was carried out in N2A cell culture and adult mice. The sample was submitted to antigenic typing using a panel of monoclonal antibodies (CDC/Atlanta/USA). The DNA sequence of the nucleoprotein gene located between nucleotides 102 and 1385 was aligned with homologous sequences from GenBank using the CLUSTAL/W method, and the alignment was used to build a neighbor-joining distance-based phylogenetic tree with the K-2-P model. CNS was negative by FAT, and only one mouse died after inoculation with a suspension from the bat's CNS. Antigenic typing gave a result that was not compatible with the patterns defined by the panel. Phylogenetic analysis showed that the virus isolated segregated into the same cluster related to other viruses isolated from insectivorous bats belonging to genus Nyctinomops ssp. (98.8% nucleotide identity with each other).
Resumo:
Phosphate homeostasis was studied in a monocotyledonous model plant through the characterization of the PHO1 gene family in rice (Oryza sativa). Bioinformatics and phylogenetic analysis showed that the rice genome has three PHO1 homologs, which cluster with the Arabidopsis (Arabidopsis thaliana) AtPHO1 and AtPHO1;H1, the only two genes known to be involved in root-to-shoot transfer of phosphate. In contrast to the Arabidopsis PHO1 gene family, all three rice PHO1 genes have a cis-natural antisense transcript located at the 5 ' end of the genes. Strand-specific quantitative reverse transcription-PCR analyses revealed distinct patterns of expression for sense and antisense transcripts for all three genes, both at the level of tissue expression and in response to nutrient stress. The most abundantly expressed gene was OsPHO1;2 in the roots, for both sense and antisense transcripts. However, while the OsPHO1;2 sense transcript was relatively stable under various nutrient deficiencies, the antisense transcript was highly induced by inorganic phosphate (Pi) deficiency. Characterization of Ospho1;1 and Ospho1;2 insertion mutants revealed that only Ospho1;2 mutants had defects in Pi homeostasis, namely strong reduction in Pi transfer from root to shoot, which was accompanied by low-shoot and high-root Pi. Our data identify OsPHO1;2 as playing a key role in the transfer of Pi from roots to shoots in rice, and indicate that this gene could be regulated by its cis-natural antisense transcripts. Furthermore, phylogenetic analysis of PHO1 homologs in monocotyledons and dicotyledons revealed the emergence of a distinct clade of PHO1 genes in dicotyledons, which include members having roles other than long-distance Pi transport.
Resumo:
The Brazilian Amazon Region is a highly endemic area for hepatitis B virus (HBV). However, little is known regarding the genetic variability of the strains circulating in this geographical region. Here, we describe the first full-length genomes of HBV isolated in the Brazilian Amazon Region; these genomes are also the first complete HBV subgenotype D3 genomes reported for Brazil. The genomes of the five Brazilian isolates were all 3,182 base pairs in length and the isolates were classified as belonging to subgenotype D3, subtypes ayw2 (n = 3) and ayw3 (n = 2). Phylogenetic analysis suggested that the Brazilian sequences are not likely to be closely related to European D3 sequences. Such results will contribute to further epidemiological and evolutionary studies of HBV.
Resumo:
Human papillomavirus type 6 (HPV6) is the major etiological agent of anogenital warts and laryngeal papillomas and has been included in both the quadrivalent and nonavalent prophylactic HPV vaccines. This study investigated the global genomic diversity of HPV6, using 724 isolates and 190 complete genomes from six continents, and the association of HPV6 genomic variants with geographical location, anatomical site of infection/disease, and gender. Initially, a 2,800-bp E5a-E5b-L1-LCR fragment was sequenced from 492/530 (92.8%) HPV6-positive samples collected for this study. Among them, 130 exhibited at least one single nucleotide polymorphism (SNP), indel, or amino acid change in the E5a-E5b-L1-LCR fragment and were sequenced in full. A global alignment and maximum likelihood tree of 190 complete HPV6 genomes (130 fully sequenced in this study and 60 obtained from sequence repositories) revealed two variant lineages, A and B, and five B sublineages: B1, B2, B3, B4, and B5. HPV6 (sub)lineage-specific SNPs and a 960-bp representative region for whole-genome-based phylogenetic clustering within the L2 open reading frame were identified. Multivariate logistic regression analysis revealed that lineage B predominated globally. Sublineage B3 was more common in Africa and North and South America, and lineage A was more common in Asia. Sublineages B1 and B3 were associated with anogenital infections, indicating a potential lesion-specific predilection of some HPV6 sublineages. Females had higher odds for infection with sublineage B3 than males. In conclusion, a global HPV6 phylogenetic analysis revealed the existence of two variant lineages and five sublineages, showing some degree of ethnogeographic, gender, and/or disease predilection in their distribution. IMPORTANCE: This study established the largest database of globally circulating HPV6 genomic variants and contributed a total of 130 new, complete HPV6 genome sequences to available sequence repositories. Two HPV6 variant lineages and five sublineages were identified and showed some degree of association with geographical location, anatomical site of infection/disease, and/or gender. We additionally identified several HPV6 lineage- and sublineage-specific SNPs to facilitate the identification of HPV6 variants and determined a representative region within the L2 gene that is suitable for HPV6 whole-genome-based phylogenetic analysis. This study complements and significantly expands the current knowledge of HPV6 genetic diversity and forms a comprehensive basis for future epidemiological, evolutionary, functional, pathogenicity, vaccination, and molecular assay development studies.
Resumo:
This PhD Thesis is the result of my research activity in the last three years. My main research interest was centered on the evolution of mitochondrial genome (mtDNA), and on its usefulness as a phylogeographic and phylogenetic marker at different taxonomic levels in different taxa of Metazoa. From a methodological standpoint, my main effort was dedicated to the sequencing of complete mitochondrial genomes, and the approach to whole-genome sequencing was based on the application of Long-PCR and shotgun sequences. Moreover, this research project is a part of a bigger sequencing project of mtDNAs in many different Metazoans’ taxa, and I mostly dedicated myself to sequence and analyze mtDNAs in selected taxa of bivalves and hexapods (Insecta). Sequences of bivalve mtDNAs are particularly limited, and my study contributed to extend the sampling. Moreover, I used the bivalve Musculista senhousia as model taxon to investigate the molecular mechanisms and the evolutionary significance of their aberrant mode of mitochondrial inheritance (Doubly Uniparental Inheritance, see below). In Insects, I focused my attention on the Genus Bacillus (Insecta Phasmida). A detailed phylogenetic analysis was performed in order to assess phylogenetic relationships within the genus, and to investigate the placement of Phasmida in the phylogenetic tree of Insecta. The main goal of this part of my study was to add to the taxonomic coverage of sequenced mtDNAs in basal insects, which were only partially analyzed.
Resumo:
Procedures for quantitative walking analysis include the assessment of body segment movements within defined gait cycles. Recently, methods to track human body motion using inertial measurement units have been suggested. It is not known if these techniques can be readily transferred to clinical measurement situations. This work investigates the aspects necessary for one inertial measurement unit mounted on the lower back to track orientation, and determine spatio-temporal features of gait outside the confines of a conventional gait laboratory. Apparent limitations of different inertial sensors can be overcome by fusing data using methods such as a Kalman filter. The benefits of optimizing such a filter for the type of motion are unknown. 3D accelerations and 3D angular velocities were collected for 18 healthy subjects while treadmill walking. Optimization of Kalman filter parameters improved pitch and roll angle estimates when compared to angles derived using stereophotogrammetry. A Weighted Fourier Linear Combiner method for estimating 3D orientation angles by constructing an analytical representation of angular velocities and allowing drift free integration is also presented. When tested this method provided accurate estimates of 3D orientation when compared to stereophotogrammetry. Methods to determine spatio-temporal features from lower trunk accelerations generally require knowledge of sensor alignment. A method was developed to estimate the instants of initial and final ground contact from accelerations measured by a waist mounted inertial device without rigorous alignment. A continuous wavelet transform method was used to filter and differentiate the signal and derive estimates of initial and final contact times. The technique was tested with data recorded for both healthy and pathologic (hemiplegia and Parkinson’s disease) subjects and validated using an instrumented mat. The results show that a single inertial measurement unit can assist whole body gait assessment however further investigation is required to understand altered gait timing in some pathological subjects.
Resumo:
Digital signal processing (DSP) techniques for biological sequence analysis continue to grow in popularity due to the inherent digital nature of these sequences. DSP methods have demonstrated early success for detection of coding regions in a gene. Recently, these methods are being used to establish DNA gene similarity. We present the inter-coefficient difference (ICD) transformation, a novel extension of the discrete Fourier transformation, which can be applied to any DNA sequence. The ICD method is a mathematical, alignment-free DNA comparison method that generates a genetic signature for any DNA sequence that is used to generate relative measures of similarity among DNA sequences. We demonstrate our method on a set of insulin genes obtained from an evolutionarily wide range of species, and on a set of avian influenza viral sequences, which represents a set of highly similar sequences. We compare phylogenetic trees generated using our technique against trees generated using traditional alignment techniques for similarity and demonstrate that the ICD method produces a highly accurate tree without requiring an alignment prior to establishing sequence similarity.