95 resultados para Kullback-Leibler Divergence
Resumo:
Protein functional annotation relies on the identification of accurate relationships, sequence divergence being a key factor. This is especially evident when distant protein relationships are demonstrated only with three-dimensional structures. To address this challenge, we describe a computational approach to purposefully bridge gaps between related protein families through directed design of protein-like ``linker'' sequences. For this, we represented SCOP domain families, integrated with sequence homologues, as multiple profiles and performed HMM-HMM alignments between related domain families. Where convincing alignments were achieved, we applied a roulette wheel-based method to design 3,611,010 protein-like sequences corresponding to 374 SCOP folds. To analyze their ability to link proteins in homology searches, we used 3024 queries to search two databases, one containing only natural sequences and another one additionally containing designed sequences. Our results showed that augmented database searches showed up to 30% improvement in fold coverage for over 74% of the folds, with 52 folds achieving all theoretically possible connections. Although sequences could not be designed between some families, the availability of designed sequences between other families within the fold established the sequence continuum to demonstrate 373 difficult relationships. Ultimately, as a practical and realistic extension, we demonstrate that such protein-like sequences can be ``plugged-into'' routine and generic sequence database searches to empower not only remote homology detection but also fold recognition. Our richly statistically supported findings show that complementary searches in both databases will increase the effectiveness of sequence-based searches in recognizing all homologues sharing a common fold. (C) 2013 Elsevier Ltd. All rights reserved.
Resumo:
Maximum entropy approach to classification is very well studied in applied statistics and machine learning and almost all the methods that exists in literature are discriminative in nature. In this paper, we introduce a maximum entropy classification method with feature selection for large dimensional data such as text datasets that is generative in nature. To tackle the curse of dimensionality of large data sets, we employ conditional independence assumption (Naive Bayes) and we perform feature selection simultaneously, by enforcing a `maximum discrimination' between estimated class conditional densities. For two class problems, in the proposed method, we use Jeffreys (J) divergence to discriminate the class conditional densities. To extend our method to the multi-class case, we propose a completely new approach by considering a multi-distribution divergence: we replace Jeffreys divergence by Jensen-Shannon (JS) divergence to discriminate conditional densities of multiple classes. In order to reduce computational complexity, we employ a modified Jensen-Shannon divergence (JS(GM)), based on AM-GM inequality. We show that the resulting divergence is a natural generalization of Jeffreys divergence to a multiple distributions case. As far as the theoretical justifications are concerned we show that when one intends to select the best features in a generative maximum entropy approach, maximum discrimination using J-divergence emerges naturally in binary classification. Performance and comparative study of the proposed algorithms have been demonstrated on large dimensional text and gene expression datasets that show our methods scale up very well with large dimensional datasets.
Resumo:
The interaction between the Fermi sea of conduction electrons and a nonadiabatic attractive impurity potential can lead to a power-law divergence in the tunneling probability of charge through the impurity. The resulting effect, known as the Fermi edge singularity (FES), constitutes one of the most fundamental many-body phenomena in quantum solid state physics. Here we report the first observation of FES for Dirac fermions in graphene driven by isolated Coulomb impurities in the conduction channel. In high-mobility graphene devices on hexagonal boron nitride substrates, the FES manifests in abrupt changes in conductance with a large magnitude approximate to e(2)/h at resonance, indicating total many-body screening of a local Coulomb impurity with fluctuating charge occupancy. Furthermore, we exploit the extreme sensitivity of graphene to individual Coulomb impurities and demonstrate a new defect-spectroscopy tool to investigate strongly correlated phases in graphene in the quantum Hall regime.
Resumo:
Although the East African Rift System (EARS) is an archetype continental rift, the forces driving its evolution remain debated. Some contend buoyancy forces arising from gravitational potential energy (GPE) gradients within the lithosphere drive rifting. Others argue for a major role of the diverging mantle flow associated with the African Superplume. Here we quantify the forces driving present-day continental rifting in East Africa by (1) solving the depth averaged 3-D force balance equations for 3-D deviatoric stress associated with GPE, (2) inverting for a stress field boundary condition that we interpret as originating from large-scale mantle tractions, (3) calculating dynamic velocities due to lithospheric buoyancy forces, lateral viscosity variations, and velocity boundary conditions, and (4) calculating dynamic velocities that result from the stress response of horizontal mantle tractions acting on a viscous lithosphere in Africa and surroundings. We find deviatoric stress associated with lithospheric GPE gradients are similar to 8-20 MPa in EARS, and the minimum deviatoric stress resulting from basal shear is similar to 1.6 MPa along the EARS. Our dynamic velocity calculations confirm that a force contribution from GPE gradients alone is sufficient to drive Nubia-Somalia divergence and that additional forcing from horizontal mantle tractions overestimates surface kinematics. Stresses from GPE gradients appear sufficient to sustain present-day rifting in East Africa; however, they are lower than the vertically integrated strength of the lithosphere along most of the EARS. This indicates additional processes are required to initiate rupture of continental lithosphere, but once it is initiated, lithospheric buoyancy forces are enough to maintain rifting.
Resumo:
A detailed understanding of structure and stability of nanowires is critical for applications. Atomic resolution imaging of ultrathin single crystalline Au nanowires using aberration-corrected microscopy reveals an intriguing relaxation whereby the atoms in the close-packed atomic planes normal to the growth direction are displaced in the axial direction leading to wrinkling of the (111) atomic plane normal to the wire axis. First-principles calculations of the structure of such nanowires confirm this wrinkling phenomenon, whereby the close-packed planes relax to form saddle-like surfaces. Molecular dynamics studies of wires with varying diameters and different bounding surfaces point to the key role of surface stress on the relaxation process. Using continuum mechanics arguments, we show that the wrinkling arises due to anisotropy in the surface stresses and in the elastic response, along with the divergence of surface-induced bulk stress near the edges of a faceted structure. The observations provide new understanding on the equilibrium structure of nanoscale systems and could have important implications for applications in sensing and actuation.
Resumo:
Simplified equations are derived for a granular flow in the `dense' limit where the volume fraction is close to that for dynamical arrest, and the `shallow' limit where the stream-wise length for flow development (L) is large compared with the cross-stream height (h). The mass and diameter of the particles are set equal to 1 in the analysis without loss of generality. In the dense limit, the equations are simplified by taking advantage of the power-law divergence of the pair distribution function chi proportional to (phi(ad) - phi)(-alpha), and a faster divergence of the derivativ rho(d chi/d rho) similar to (d chi/d phi), where rho and phi are the density and volume fraction, and phi(ad) is the volume fraction for arrested dynamics. When the height h is much larger than the conduction length, the energy equation reduces to an algebraic balance between the rates of production and dissipation of energy, and the stress is proportional to the square of the strain rate (Bagnold law). In the shallow limit, the stress reduces to a simplified Bagnold stress, where all components of the stress are proportional to (partial derivative u(x)/partial derivative y)(2), which is the cross-stream (y) derivative of the stream-wise (x) velocity. In the simplified equations for dense shallow flows, the inertial terms are neglected in the y momentum equation in the shallow limit because the are O(h/L) smaller than the divergence of the stress. The resulting model contains two equations, a mass conservation equations which reduces to a solenoidal condition on the velocity in the incompressible limit, and a stream-wise momentum equation which contains just one parameter B which is a combination of the Bagnold coefficients and their derivatives with respect to volume fraction. The leading-order dense shallow flow equations, as well as the first correction due to density variations, are analysed for two representative flows. The first is the development from a plug flow to a fully developed Bagnold profile for the flow down an inclined plane. The analysis shows that the flow development length is ((rho) over barh(3)/B) , where (rho) over bar is the mean density, and this length is numerically estimated from previous simulation results. The second example is the development of the boundary layer at the base of the flow when a plug flow (with a slip condition at the base) encounters a rough base, in the limit where the momentum boundary layer thickness is small compared with the flow height. Analytical solutions can be found only when the stream-wise velocity far from the surface varies as x(F), where x is the stream-wise distance from the start of the rough base and F is an exponent. The boundary layer thickness increases as (l(2)x)(1/3) for all values of F, where the length scale l = root 2B/(rho) over bar. The analysis reveals important differences between granular flows and the flows of Newtonian fluids. The Reynolds number (ratio of inertial and viscous terms) turns out to depend only on the layer height and Bagnold coefficients, and is independent of the flow velocity, because both the inertial terms in the conservation equations and the divergence of the stress depend on the square of the velocity/velocity gradients. The compressibility number (ratio of the variation in volume fraction and mean volume fraction) is independent of the flow velocity and layer height, and depends only on the volume fraction and Bagnold coefficients.
Resumo:
In Incompressible Smooth Particle Hydrodynamics (ISPH), a pressure Poisson equation (PPE) is solved to obtain a divergence free velocity field. When free surfaces are simulated using this method a Dirichlet boundary condition for pressure at the free surface has to be applied. In existing ISPH methods this is achieved by identifying free surface particles using heuristically chosen threshold of a parameter such as kernel sum, density or divergence of the position, and explicitly setting their pressure values. This often leads to clumping of particles near the free surface and spraying off of surface particles during splashes. Moreover, surface pressure gradients in flows where surface tension is important are not captured well using this approach. We propose a more accurate semi-analytical approach to impose Dirichlet boundary conditions on the free surface. We show the efficacy of the proposed algorithm by using test cases of elongation of a droplet and dam break. We perform two dimensional simulations of water entry and validate the proposed algorithm with experimental results. Further, a three dimensional simulation of droplet splash is shown to compare well with the Volume-of-Fluid simulations. (C) 2014 Elsevier Ltd. All rights reserved.
Resumo:
Motivated by the recent proposal for the S-matrix in AdS(3) x S-3 with mixed three form fluxes, we study classical folded string spinning in AdS(3) with both Ramond and Neveu-Schwarz three form fluxes. We solve the equations of motion of these strings and obtain their dispersion relation to the leading order in the Neveu-Schwarz flux b. We show that dispersion relation for the spinning strings with large spin S acquires a term given by -root lambda/2 pi b(2) log(2) S in addition to the usual root lambda/pi log S term where root lambda is proportional to the square of the radius of AdS(3). Using SO(2, 2) transformations and re-parmetrizations we show that these spinning strings can be related to light like Wilson loops in AdS(3) with Neveu-Schwarz flux b. We observe that the logarithmic divergence in the area of the light like Wilson loop is also deformed by precisely the same coefficient of the b(2) log(2) S term in the dispersion relation of the spinning string. This result indicates that the coefficient of b(2) log(2) S has a property similar to the coefficient of the log S term, known as cusp-anomalous dimension, and can possibly be determined to all orders in the coupling lambda using the recent proposal for the S-matrix.
Resumo:
The India-Asia collision profoundly influenced the climate, topography and biodiversity of Asia, causing the formation of the biodiverse Himalayas. The species-rich gekkonid genus Cyrtodactylus is an ideal clade for exploring the biological impacts of the India-Asia collision, as previous phylogenetic hypotheses suggest basal divergences occurred within the Himalayas and Indo-Burma during the Eocene. To this end, we sampled for Cyrtodactylus across Indian areas of the Himalayas and Indo-Burma Hotspots and used three genes to reconstruct relationships and estimate divergence times. Basal divergences in Cyrtodactylus, Hemidactylus and the Palaearctic naked-toed geckos were simultaneous with or just preceded the start of the India-Asia collision. Diversification within Cyrtodactylus tracks the India-Asia collision and subsequent geological events. A number of geographically concordant clades are resolved within Indo-Burmese Cyrtodactylus. Our study reveals 17 divergent lineages that may represent undescribed species, underscoring the previously undocumented diversity of the region. The importance of rocky habitats for Cyrtodactylus indicates the Indo-Gangetic flood plains and the Garo-Rajmahal Gap are likely to have been important historical barriers for this group. (C) 2014 Elsevier Inc. All rights reserved.
Resumo:
Bush frogs of the genus Raorchestes are distributed mainly in the Western Ghats Escarpment of Peninsular India. The inventory of species in this genus is incomplete and there is ambiguity in the systematic status of species recognized by morphological criteria. To address the dual problem of taxon sampling and systematic uncertainty in bush frogs, we used a large-scale spatial sampling design, explicitly incorporating the geographic and ecological heterogeneity of the Western Ghats. We then used a hierarchical multi-criteria approach by combining mitochondrial phylogeny, genetic distance, geographic range, morphology and advertisement call to delimit bush frog lineages. Our analyses revealed the existence of a large number of new lineages with varying levels of genetic divergence. Here, we provide diagnoses and descriptions for nine lineages that exhibit divergence across multiple axes. The discovery of new lineages that exhibit high divergence across wide ranges of elevation and across the major massifs highlights the large gaps in historical sampling. These discoveries underscore the significance of addressing inadequate knowledge of species distribution, namely the ``Wallacean shortfall'', in addressing the problem of taxon sampling and unknown diversity in tropical hotspots. A biogeographically informed sampling and analytical approach was critical in detecting and delineating lineages in a consistent manner across the genus. Through increased taxon sampling, we were also able to discern a number of well-supported sub-clades that were either unresolved or absent in earlier phylogenetic reconstructions and identify a number of shallow divergent lineages which require further examination for assessment of their taxonomic status.
Resumo:
Aim Widespread, transcontinental vertebrate groups represent ideal systems for biogeographical studies, because they can shed light on a wide range of questions relating to species diversification across the geographical template. We combined extensive geographical and genetic sampling from across multiple biogeographical realms to examine the timing and location of diversification in Asian sun skinks, a clade characterized by problematic species boundaries and a particularly enigmatic evolutionary history. Location Indian subcontinent, the Philippines, Southeast Asia and Sundaland. Methods We sequenced one mitochondrial and nine nuclear genes for most species in the genus Eutropis, and estimated phylogenetic relationships and divergence times using coalescent methods. To investigate the location of diversification events, we also estimated ancestral geographical ranges using several methods. Finally, we explored patterns of genetic diversity within several poorly understood, but widely distributed species. Results Divergence-time estimates indicate that Eutropis began to diversify during the Eocene. Biogeographical reconstructions show that species diversification was associated with dispersal into three biogeographical realms: India, Sundaland and the Philippines. Main conclusions The results of this study clarify several questions related to the evolutionary history of Eutropis, and place them in the context of classic Southeast Asian biogeography. Our study represents one of the first to compile a heavily sampled multilocus dataset ranging across international boundaries in southern Asia that have historically prevented a unified understanding of biogeographical and evolutionary processes involving the Indian subcontinent, mainland southern Asia and the island archipelagos of Southeast Asia.
Resumo:
A divergence-free velocity field is usually sought in numerical simulations of incompressible fluids. We show that the particle methods that compute a divergence-free velocity field to achieve incompressibility suffer from a volume conservation issue when a finite time-step position update scheme is used. Further, we propose a deformation gradient based approach to arrive at a velocity field that reduces the volume conservation issues in free surface flows and maintains density uniformity in internal flows while retaining the simplicity of first order time updates. (C) 2015 Elsevier Inc. All rights reserved.
Resumo:
1. Host-parasite interactions have the potential to influence broadscale ecological and evolutionary processes, levels of endemism, divergence patterns and distributions in host populations. Understanding the mechanisms involved requires identification of the factors that shape parasite distribution and prevalence. 2. A lack of comparative information on community-level host-parasite associations limits our understanding of the role of parasites in host population divergence processes. Avian malaria (haemosporidian) parasites in bird communities offer a tractable model system to examine the potential for pathogens to influence evolutionary processes in natural host populations. 3. Using cytochrome b variation, we characterized phylogenetic diversity and prevalence of two genera of avian haemosporidian parasites, Plasmodium and Haemoproteus, and analysed biogeographic patterns of lineages across islands and avian hosts, in southern Melanesian bird communities to identify factors that explain patterns of infection. 4. Plasmodium spp. displayed isolation-by-distance effects, a significant amount of genetic variation distributed among islands but insignificant amounts among host species and families, and strong local island effects with respect to prevalence. Haemoproteus spp. did not display isolation-by-distance patterns, showed marked structuring of genetic variation among avian host species and families, and significant host species prevalence patterns. 5. These differences suggest that Plasmodium spp. infection patterns were shaped by geography and the abiotic environment, whereas Haemoproteus spp. infection patterns were shaped predominantly by host associations. Heterogeneity in the complement and prevalence of parasite lineages infecting local bird communities likely exposes host species to a mosaic of spatially divergent disease selection pressures across their naturally fragmented distributions in southern Melanesia. Host associations for Haemoproteus spp. indicate a capacity for the formation of locally co-adapted host-parasite relationships, a feature that may limit intraspecific gene flow or range expansions of closely related host species.
Resumo:
In recent times, zebrafish has garnered lot of popularity as model organism to study human cancers. Despite high evolutionary divergence from humans, zebrafish develops almost all types of human tumors when induced. However, mechanistic details of tumor formation have remained largely unknown. Present study is aimed at analysis of repertoire of kinases in zebrafish proteome to provide insights into various cellular components. Annotation using highly sensitive remote homology detection methods revealed ``substantial expansion'' of Ser/Thr/Tyr kinase family in zebrafish compared to humans, constituting over 3% of proteome. Subsequent classification of kinases into subfamilies revealed presence of large number of CAMK group of kinases, with massive representation of PIM kinases, important for cell cycle regulation and growth. Extensive sequence comparison between human and zebrafish PIM kinases revealed high conservation of functionally important residues with a few organism specific variations. There are about 300 PIM kinases in zebrafish kinome, while human genome codes for only about 500 kinases altogether. PIM kinases have been implicated in various human cancers and are currently being targeted to explore their therapeutic potentials. Hence, in depth analysis of PIM kinases in zebrafish has opened up new avenues of research to verify the model organism status of zebrafish.
Resumo:
We carried out a large-scale phylogenetic analysis of fejervaryan (dicroglossid frogs with `Fejervaryan lines' on the ventral side of the body) frogs, distributed in South and SE Asia, using published and newly generated sequences of unidentified individuals from the northern Western Ghats. The results corroborate the presence of a larger fejervaryan clade with a sister relationship to a clade composed of Sphaerotheca. Two sister clades could be discerned within the lager fejervaryan clade. The unidentified individuals formed a monophyletic group and showed a strong support for a sister relationship with Minervarya sahyadris. The species was found to be highly divergent (16S rRNA-4% and tyr-1%) from its sister lineage Minervarya sahyadris, and the clade composed of these two lineages were found to be deeply nested within the larger clade of Fejervarya. Based on this, the genus Minervarya Dubois, Ohler and Biju, 2001 is synonymized under the genus Fejervarya Bolkay, 1915. The unidentified lineage is recognized, based on phylogenetic position, genetic divergence and morphological divergence, as a distinct species and named here as Fejervarya gomantaki sp. nov. The presence of rictal glands was observed to be a synapomorphic character shared by the nested clade members, Fejervarya sahyadris and Fejervarya gomantaki sp. nov. Based on the presence of rictal gland and small size, Minervarya chilapata, a species from a lowland region in the Eastern Himalayas, is synonymized under Fejervarya and evidence for morphological separation from the new species, Fejervarya gomantaki sp. nov. is provided. For the fejervaryan frogs, currently three generic names (Frost, 2015) are available for the two phylogenetic subclades; the genus Fejervarya Bolkay, 1915 for the species of fejervaryan frogs having distribution in the South East Asia; the genus Zakerana Howlader, 2011 for the species of fejervaryan frogs having distribution in the South Asia and the genus Minervarya Dubois, Ohler and Biju, 2001 nested within the `Zakerana clade'. In the phylogenetic analysis Minervarya sahyadris, the new species described herein as Fejervarya gomantaki sp. nov. are nested within the `Zakerana clade', if the `Zakerana clade' for the fejervaryan frogs having distribution in the South Asia is provided a generic status the nomen `Minervarya' should be considered as per the principle of priority of the ICZN Code. Taking into consideration the overlapping distribution ranges of members of the sister clades within the larger fejervaryan clade and the absence of distinct morphological characteristics, we also synonymize the genus Zakerana Howlader, 2011, a name assigned to one of the sister clades with members predominantly distributed in South Asia, under the genus Fejervarya Bolkay, 1915. We discuss the need for additional sampling to identify additional taxa and determine the geographical ranges of the members of the sister clades within Fejervarya to resolve taxonomy within this group.