30 resultados para complex data


Relevância:

30.00% 30.00%

Publicador:

Resumo:

Semi-supervised learning techniques have gained increasing attention in the machine learning community, as a result of two main factors: (1) the available data is exponentially increasing; (2) the task of data labeling is cumbersome and expensive, involving human experts in the process. In this paper, we propose a network-based semi-supervised learning method inspired by the modularity greedy algorithm, which was originally applied for unsupervised learning. Changes have been made in the process of modularity maximization in a way to adapt the model to propagate labels throughout the network. Furthermore, a network reduction technique is introduced, as well as an extensive analysis of its impact on the network. Computer simulations are performed for artificial and real-world databases, providing a numerical quantitative basis for the performance of the proposed method.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

In this paper we have quantified the consistency of word usage in written texts represented by complex networks, where words were taken as nodes, by measuring the degree of preservation of the node neighborhood. Words were considered highly consistent if the authors used them with the same neighborhood. When ranked according to the consistency of use, the words obeyed a log-normal distribution, in contrast to Zipf's law that applies to the frequency of use. Consistency correlated positively with the familiarity and frequency of use, and negatively with ambiguity and age of acquisition. An inspection of some highly consistent words confirmed that they are used in very limited semantic contexts. A comparison of consistency indices for eight authors indicated that these indices may be employed for author recognition. Indeed, as expected, authors of novels could be distinguished from those who wrote scientific texts. Our analysis demonstrated the suitability of the consistency indices, which can now be applied in other tasks, such as emotion recognition.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Male squid produce intricate spermatophores that, when transferred to the female, undergo the spermatophoric reaction, a complex process of evagination that leads to the attachment of the spermatangium, that is, the everted spermatophore containing the sperm mass. While this process is still not completely understood, the medical literature includes several reports of "oral stinging" (i.e., punctured wounds in the human oral cavity) following consumption of raw male squid, which contains undischarged spermatophores able to inflict such wounds. Here, we revisit a recent medical report of oral stinging by Shiraki et al. (Pathol Int 61:749-751, 2011), providing an in-depth reanalysis of their histological biopsies and revealing vital information on the functioning of squid spermatophores. The morphology of the spermatangia attached within the oral cavity is similar to the condition found in spermatangia naturally attached to female squids. The spermatangia were able to superficially puncture the superficial layers of the oral stratified squamous epithelium, and numerous, minute stellate particles from the squid spermatophore were found adhered to the oral epithelium. These findings corroborate previous hypotheses on the functioning of squid spermatophores, namely that spermatophore attachment generally involves tissue scarification, and that stellate particles play a vital role in the attachment process. Moreover, spermatophore attachment is confirmed to be autonomous (i.e., performed by the spermatophore itself) in another squid species (possibly a loliginid), and the results strongly indicate that the attachment mechanism is not dependent upon a specialized epithelium, nor a mate's specific chemical stimulus. From the pathological point of view, the best prophylactic measure at present is the removal of the internal organs of the raw squid prior to its consumption.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

This work combines structural and geochronological data to improve our understanding of the mechanical behaviour of continental crust involving large amount of magma or partially melted material in an abnormally hot collisional belt. We performed a magnetic and geochronological (U/Pb) study on a huge tonalitic batholith from the Neoproterozoic Aracual belt of East Brazil to determine the strain distribution through space and time. Anisotropy of magnetic susceptibility, combined with rock magnetism investigations, supports that the magnetic fabric is a good proxy of the structural fabric. Field measurements together with the magnetic fabrics highlight the presence in the batholith of four domains characterized by contrasted magmatic flow patterns. The western part is characterized by a gently dipping, orogen-parallel (similar to NS) magmatic foliation that bears down-dip lineations, in agreement with westward thrusting onto the Sao Francisco craton. Eastward, the magmatic foliation progressively turns sub-vertical with a lineation that flips from sub-horizontal to sub-vertical over short distances. This latter domain involves an elongated corridor in which the magmatic foliation is sub-horizontal and bears an orogen-parallel lineation. Finally the fourth, narrow domain displays sub-horizontal lineations on a sub-vertical magmatic foliation oblique (similar to N150 degrees E) to the trend of the belt. U/Pb dating of zircons from the various domains revealed homogeneity in age for all samples. This, together with the lack of solid-state deformation suggests that: 1) the whole batholith emplaced during a magmatic event at similar to 580 Ma, 2) the deformation occurred before complete solidification. and 3) the various fabrics are roughly contemporaneous. The complex structural pattern mapped in the studied tonalitic batholith suggests a 3D deformation of a slowly cooling, large magmatic body and its country rock. We suggest that the development of the observed 3D flow field was promoted by the low viscosity of the middle crust that turned gravitational force as an active tectonic force combining with the East-West convergence between the Sao Francisco and Congo cratons. (C) 2012 Elsevier Ltd. All rights reserved.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Traditional supervised data classification considers only physical features (e. g., distance or similarity) of the input data. Here, this type of learning is called low level classification. On the other hand, the human (animal) brain performs both low and high orders of learning and it has facility in identifying patterns according to the semantic meaning of the input data. Data classification that considers not only physical attributes but also the pattern formation is, here, referred to as high level classification. In this paper, we propose a hybrid classification technique that combines both types of learning. The low level term can be implemented by any classification technique, while the high level term is realized by the extraction of features of the underlying network constructed from the input data. Thus, the former classifies the test instances by their physical features or class topologies, while the latter measures the compliance of the test instances to the pattern formation of the data. Our study shows that the proposed technique not only can realize classification according to the pattern formation, but also is able to improve the performance of traditional classification techniques. Furthermore, as the class configuration's complexity increases, such as the mixture among different classes, a larger portion of the high level term is required to get correct classification. This feature confirms that the high level classification has a special importance in complex situations of classification. Finally, we show how the proposed technique can be employed in a real-world application, where it is capable of identifying variations and distortions of handwritten digit images. As a result, it supplies an improvement in the overall pattern recognition rate.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Competitive learning is an important machine learning approach which is widely employed in artificial neural networks. In this paper, we present a rigorous definition of a new type of competitive learning scheme realized on large-scale networks. The model consists of several particles walking within the network and competing with each other to occupy as many nodes as possible, while attempting to reject intruder particles. The particle's walking rule is composed of a stochastic combination of random and preferential movements. The model has been applied to solve community detection and data clustering problems. Computer simulations reveal that the proposed technique presents high precision of community and cluster detections, as well as low computational complexity. Moreover, we have developed an efficient method for estimating the most likely number of clusters by using an evaluator index that monitors the information generated by the competition process itself. We hope this paper will provide an alternative way to the study of competitive learning.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The continental margin of southeast Brazil is elevated. Onshore Tertiary basins and Late Cretaceous/Paleogene intrusions are good evidence for post breakup tectono-magmatic activity. To constrain the impact of post-rift reactivation on the geological history of the area, we carried out a new thermochronological study. Apatite fission track ages range from 60.7 +/- 1.9 Ma to 129.3 +/- 4.3 Ma, mean track lengths from 11.41 +/- 0.23 mu m to 14.31 +/- 0.24 mu m and a subset of the (U-Th)/He ages range from 45.1 +/- 1.5 to 122.4 +/- 2.5 Ma. Results of inverse thermal history modeling generally support the conclusions from an earlier study for a Late Cretaceous phase of cooling. Around the onshore Taubate Basin, for a limited number of samples, the first detectable period of cooling occurred during the Early Tertiary. The inferred thermal histories for many samples also imply subsequent reheating followed by Neogene cooling. Given the uncertainty of the inversion results, we did deterministic forward modeling to assess the range of possibilities of this Tertiary part of the thermal history. The evidence for reheating seems to be robust around the Taubate Basin, but elsewhere the data cannot discriminate between this and a less complex thermal history. However, forward modeling results and geological information support the conclusion that the whole area underwent cooling during the Neogene. The synchronicity of the cooling phases with Andean tectonics and those in NE Brazil leads us to assume a plate-wide compressional stress that reactivated inherited structures. The present-day topographic relief of the margin reflects a contribution from post-breakup reactivation and uplift.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Although a large amount of data have been published in past years on the taxonomic status of the Anastrepha fraterculus (Wiedemann) species complex, there is still a need to know how many species this complex comprises, the distribution of each one, and their distinguishing features. In this study, we assessed the morphometric variability of 32 populations from the A. fraterculus complex, located in major biogeographical areas from the Neotropics. Multivariate techniques for analysis were applied to the measurements of 21 variables referring to the mesonotum, aculeus, and wing. For the first time, our results identified the presence of seven distinct morphotypes within this species complex. According to the biogeographical areas, populations occurring in the Mesoamerican dominion (Mexico, Guatemala, and Panama) were clustered within a single natural entity labeled as the "Mexican" morphotype; whereas in the northwestern South American dominion, samples fell into three distinct groups: the "Venezuelan" morphotype with a single population from the Caribbean lowlands of Venezuela, the "Andean" morphotype from the highlands of Venezuela and Colombia, and the third group or "Peruvian" morphotype comprised the samples from the Pacific coastal lowlands of Ecuador and Peru. Three additional groups were identified from the Chacoan and Paranaense sub-regions: the morphotype "Brazilian-1" was recognized as including the Argentinean samples with most pertaining to Brazil, and widely distributed in these biogeographical areas; the morphotype "Brazilian-2" was recognized as including two samples from the state of Sao Paulo (Ilha-Bela and Sao Sebastiao); whereas the morphotype "Brazilian-3" included a single population from Botucatu (state of Sao Paulo). Based on data published by previous authors showing genetic and karyotypic differentiation, as well as reproductive isolation, we have concluded that such morphotypes indeed represent natural groups and distinct taxonomic entities.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Here, we describe a female patient with autism spectrum disorder and dysmorphic features that harbors a complex genetic alteration, involving a de novo balanced translocation t(2;X)(q11;q24), a 5q11 segmental trisomy and a maternally inherited isodisomy on chromosome 5. All the possibly damaging genetic effects of such alterations are discussed. In light of recent findings on ASD genetic causes, the hypothesis that all these alterations might be acting in orchestration and contributing to the phenotype is also considered. (C) 2012 Wiley Periodicals, Inc.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The Cretaceous Banhado alkaline complex in southeastern Brazil presents two potassic SiO2-undersaturated series. The high-Ca magmatic series consist of initially fractionated olivine (Fo(92-91)) + diopside (Wo(48-43)En(49-35)Ae(0-7)), as evidenced by the presence of xenocrysts and xenoliths. In that sequence, diopside (Wo(47-38)En(46-37)Ae(0-8)) + phlogopite + apatite + perovskite (Prv(> 92)) crystallized to form the phlogopite melteigite and led to the Ca enrichment of the magma. Diopside (Wo(47-41)En(32-24) Ae(3-14)) continued to crystallize as an early mafic mineral, followed by nepheline (Ne(74.8-70.1)Ks(26.3-21.2)Qz(7.6-0.9)) and leucite (Lc(65-56)) and subsequently by melanite and potassic feldspar (Or(85-99)Ab(1-7)) to form melanite ijolites, wollastonite-melanite urtites and melanite-nepheline syenites. Melanite-pseudoleucite-nepheline syenites are interpreted to be a leucite accumulation. Melanite nephelinite dykes are believed to represent some of the magmatic differentiation steps. The low-Ca magmatic series is representative of a typical fractionation of aegirine-augite (Wo(36-29)En(25-4)Ae(39-18)) + alkali feldspar (Or(57-96)Ab(3-43)) + nepheline (Ne(76.5-69.0)Ks(19.9-14.4)Qz(15.1-7.7)) + titanite from phonolite magma. The evolution of this series from potassic nepheline syenites to sodic sodalite syenites and sodalitolites is attributed to an extensive fractionation of potassic feldspar, which led to an increase of the NaCl activity in the melt during the final stages forming sodalite-rich rocks. Phonolite dykes followed a similar evolutionary process and also registered some crustal assimilation. The mesocratic nepheline syenites showed interactions with phlogopite melteigites, such as compatible trace element enrichments and the presence of diopside xenocrysts, which were interpreted to be due to a mixing/mingling process of phonolite and nephelinite magmas. The geochemical data show higher TiO2 and P2O5 contents and lower SiO2 contents for the high-Ca series and different LILE evolution trends and REE chondrite-normalized patterns as compared to the low-Ca series. The Sr-87/Sr-86, Nd-143/Nd-144, Pb-206/Pb-204 and Pb-208/Pb-204 initial ratios for the high-Ca series (0.70407-0.70526, 0.51242-0.51251, 17.782-19.266 and 38.051-39.521, respectively) were slightly different from those of the low-Ca series (0.70542-0.70583, 0.51232-0.51240, 17.758-17.772 and 38.021-38.061, respectively). For both series, a CO2-rich potassic metasomatized lithospheric mantle enriched the source with rutile-bearing phlogopite clinopyroxenite veins. Kamafugite-like parental magma is attributed to the high-Ca series with major contributions from the melting of the veins. Potassic nephelinite-like parental magma is assigned to the low-Ca series, where the metasomatized wall-rock played a more significant role in the melting process.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Abstract Background Ferredoxin-NADP(H) reductases (FNRs) are flavoenzymes that catalyze the electron transfer between NADP(H) and the proteins ferredoxin or flavodoxin. A number of structural features distinguish plant and bacterial FNRs, one of which is the mode of the cofactor FAD binding. Leptospira interrogans is a spirochaete parasitic bacterium capable of infecting humans and mammals in general. Leptospira interrogans FNR (LepFNR) displays low sequence identity with plant (34% with Zea mays) and bacterial (31% with Escherichia coli) FNRs. However, LepFNR contains all consensus sequences that define the plastidic class FNRs. Results The crystal structures of the FAD-containing LepFNR and the complex of the enzyme with NADP+, were solved and compared to known FNRs. The comparison reveals significant structural similarities of the enzyme with the plastidic type FNRs and differences with the bacterial enzymes. Our small angle X-ray scattering experiments show that LepFNR is a monomeric enzyme. Moreover, our biochemical data demonstrate that the LepFNR has an enzymatic activity similar to those reported for the plastidic enzymes and that is significantly different from bacterial flavoenzymes, which display lower turnover rates. Conclusion LepFNR is the first plastidic type FNR found in bacteria and, despite of its low sequence similarity with plastidic FNRs still displays high catalytic turnover rates. The typical structural and biochemical characteristics of plant FNRs unveiled for LepFNR support a notion of a putative lateral gene transfer which presumably offers Leptospira interrogans evolutionary advantages. The wealth of structural information about LepFNR provides a molecular basis for advanced drugs developments against leptospirosis.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Abstract Background The molecular phylogenetic relationships and population structure of the species of the Anopheles triannulatus complex: Anopheles triannulatus s.s., Anopheles halophylus and the putative species Anopheles triannulatus C were investigated. Methods The mitochondrial COI gene, the nuclear white gene and rDNA ITS2 of samples that include the known geographic distribution of these taxa were analyzed. Phylogenetic analyses were performed using Bayesian inference, Maximum parsimony and Maximum likelihood approaches. Results Each data set analyzed septely yielded a different topology but none provided evidence for the seption of An. halophylus and An. triannulatus C, consistent with the hypothesis that the two are undergoing incipient speciation. The phylogenetic analyses of the white gene found three main clades, whereas the statistical parsimony network detected only a single metapopulation of Anopheles triannulatus s.l. Seven COI lineages were detected by phylogenetic and network analysis. In contrast, the network, but not the phylogenetic analyses, strongly supported three ITS2 groups. Combined data analyses provided the best resolution of the trees, with two major clades, Amazonian (clade I) and trans-Andean + Amazon Delta (clade II). Clade I consists of multiple subclades: An. halophylus + An. triannulatus C; trans-Andean Venezuela; central Amazonia + central Bolivia; Atlantic coastal lowland; and Amazon delta. Clade II includes three subclades: Panama; cis-Andean Colombia; and cis-Venezuela. The Amazon delta specimens are in both clades, likely indicating local sympatry. Spatial and molecular variance analyses detected nine groups, corroborating some of subclades obtained in the combined data analysis. Conclusion Combination of the three molecular markers provided the best resolution for differentiation within An. triannulatus s.s. and An. halophylus and C. The latest two species seem to be very closely related and the analyses performed were not conclusive regarding species differentiation. Further studies including new molecular markers would be desirable to solve this species status question. Besides, results of the study indicate a trans-Andean origin for An. triannulatus s.l. The potential implications for malaria epidemiology remain to be investigated.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Abstract Background Once multi-relational approach has emerged as an alternative for analyzing structured data such as relational databases, since they allow applying data mining in multiple tables directly, thus avoiding expensive joining operations and semantic losses, this work proposes an algorithm with multi-relational approach. Methods Aiming to compare traditional approach performance and multi-relational for mining association rules, this paper discusses an empirical study between PatriciaMine - an traditional algorithm - and its corresponding multi-relational proposed, MR-Radix. Results This work showed advantages of the multi-relational approach in performance over several tables, which avoids the high cost for joining operations from multiple tables and semantic losses. The performance provided by the algorithm MR-Radix shows faster than PatriciaMine, despite handling complex multi-relational patterns. The utilized memory indicates a more conservative growth curve for MR-Radix than PatriciaMine, which shows the increase in demand of frequent items in MR-Radix does not result in a significant growth of utilized memory like in PatriciaMine. Conclusion The comparative study between PatriciaMine and MR-Radix confirmed efficacy of the multi-relational approach in data mining process both in terms of execution time and in relation to memory usage. Besides that, the multi-relational proposed algorithm, unlike other algorithms of this approach, is efficient for use in large relational databases.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Previous analyses of the mitochondrial gene cytochrome c oxidase subunit 1 (COI) and γ-proteobacterial endosymbiont diversity have suggested that the marine bryozoan Bugula neritina is a complex of three cryptic species, namely Types S, D and N. Types D and N were previously reported to have restricted distributions along California (western USA) and Delaware and Connecticut (eastern USA), respectively, whereas Type S is considered widespread in tropical, subtropical and temperate regions due to anthropogenic transport. Here, Bayesian species delimitation analysis of a data set composed of two mitochondrial (COI and large ribosomal RNA subunit [16S]) and two nuclear genes (dynein light chain roadblock type-2 protein [DYN] and voltage-dependent anion-selective channel protein [VDAC]) demonstrated that Types S, D and N correspond to three biological species. This finding was significantly supported, in spite of the combinations of priors applied for ancestral population size and root age. Furthermore, COI sequences were used to assess the introduction patterns of the cosmopolitan Type S species. Two COI haplotypes of Type S (S1a and S1d) were found occurring at a global scale. Mantel tests showed correlation between these haplotypes and local sea surface temperature tolerance. Accordingly, the distributions of Type S haplotypes may reflect intraspecific temperature tolerance variation, in addition to the role of introduction vectors. Finally, we show that the Type N may also have been introduced widely, as this species was found for the first time in Central California and north-eastern Australia.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Abstract Background Effective malaria control relies on accurate identification of those Anopheles mosquitoes responsible for the transmission of Plasmodium parasites. Anopheles oswaldoi s.l. has been incriminated as a malaria vector in Colombia and some localities in Brazil, but not ubiquitously throughout its Neotropical range. This evidence together with variable morphological characters and genetic differences supports that An. oswaldoi s.l. compromises a species complex. The recent fully integrated redescription of An. oswaldoi s.s. provides a solid taxonomic foundation from which to molecularly determine other members of the complex. Methods DNA sequences of the Second Internal Transcribed Spacer (ITS2 - rDNA) (n = 192) and the barcoding region of the Cytochrome Oxidase I gene (COI - mtDNA) (n = 110) were generated from 255 specimens of An. oswaldoi s.l. from 33 localities: Brazil (8 localities, including the lectotype series of An. oswaldoi), Ecuador (4), Colombia (17), Trinidad and Tobago (1), and Peru (3). COI sequences were analyzed employing the Kimura-two-parameter model (K2P), Bayesian analysis (MrBayes), Mixed Yule-Coalescent model (MYC, for delimitation of clusters) and TCS genealogies. Results Separate and combined analysis of the COI and ITS2 data sets unequivocally supported four separate species: two previously determined (An. oswaldoi s.s. and An. oswaldoi B) and two newly designated species in the Oswaldoi Complex (An. oswaldoi A and An. sp. nr. konderi). The COI intra- and inter-specific genetic distances for the four taxa were non-overlapping, averaging 0.012 (0.007 to 0.020) and 0.052 (0.038 to 0.064), respectively. The concurring four clusters delineated by MrBayes and MYC, and four independent TCS networks, strongly confirmed their separate species status. In addition, An. konderi of Sallum should be regarded as unique with respect to the above. Despite initially being included as an outgroup taxon, this species falls well within the examined taxa, suggesting a combined analysis of these taxa would be most appropriate. Conclusions: Through novel data and retrospective comparison of available COI and ITS2 DNA sequences, evidence is shown to support the separate species status of An. oswaldoi s.s., An. oswaldoi A and An. oswaldoi B, and at least two species in the closely related An. konderi complex (An. sp. nr. konderi, An. konderi of Sallum). Although An. oswaldoi s.s. has never been implicated in malaria transmission, An. oswaldoi B is a confirmed vector and the new species An. oswaldoi A and An. sp. nr. konderi are circumstantially implicated, most likely acting as secondary vectors.