909 results for classification and regression trees


Relevance:

100.00%

Abstract:

We studied the Paraíba do Sul river watershed in São Paulo state (PSWSP), Southeastern Brazil, to assess land use and land cover (LULC) and their implications for the amount of carbon (C) stored in the forest cover between 1985 and 2015. The region covers an area of 1,395,975 ha. We used images from the Operational Land Imager (OLI) sensor (OLI/Landsat-8) to produce maps, and image segmentation techniques to produce vectors with homogeneous characteristics. The training samples and the samples used for classification and validation were collected from the segmented image. To quantify the C stored in aboveground live biomass (AGLB), we used an indirect method and applied literature-based reference values. The recovery of 205,690 ha of secondary Native Forest (NF) after 1985 sequestered 9.7 Tg (teragrams) of C. Considering the whole NF area (455,232 ha), the amount of C accumulated along the whole watershed was 35.5 Tg, and the whole Eucalyptus crop (EU) area (113,600 ha) sequestered 4.4 Tg of C. Thus, the total amount of C sequestered in the whole watershed (NF + EU) was 39.9 Tg of C, or 145.6 Tg of CO2, and the NF areas were responsible for the largest C stock in the watershed (89%). Therefore, the increase in NF cover contributes positively to reducing the CO2 concentration in the atmosphere, and Reducing Emissions from Deforestation and Forest Degradation (REDD+) may become one of the most promising compensation mechanisms for farmers who increased forest cover on their farms.
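The totals quoted in this abstract can be checked with a few lines of arithmetic. The sketch below sums the two carbon stocks, computes the NF share, and converts carbon to CO2 using the ratio of molar masses (44/12). That conversion factor is an assumption here, since the abstract does not say which factor the authors applied. With it, the result comes out near 146.3 Tg, slightly above the 145.6 Tg reported, so the authors presumably used a slightly different factor or unrounded inputs.

```python
# Molar masses in g/mol; 44/12 is the usual C -> CO2 conversion ratio.
# Assumption: the abstract does not state the factor the authors used.
MOLAR_MASS_C = 12.0
MOLAR_MASS_CO2 = 44.0


def carbon_to_co2(tg_carbon: float) -> float:
    """Convert a mass of carbon (Tg) to the equivalent mass of CO2 (Tg)."""
    return tg_carbon * MOLAR_MASS_CO2 / MOLAR_MASS_C


# Carbon stocks reported in the abstract (Tg of C).
nf_carbon = 35.5  # Native Forest
eu_carbon = 4.4   # Eucalyptus crop

total_carbon = nf_carbon + eu_carbon
nf_share = nf_carbon / total_carbon
total_co2 = carbon_to_co2(total_carbon)

print(f"Total C: {total_carbon:.1f} Tg")
print(f"NF share of C stock: {nf_share:.0%}")
print(f"CO2 equivalent (44/12 factor): {total_co2:.1f} Tg")
```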

Relevance:

100.00%

Abstract:

The fast development of Information and Communication Technologies (ICT) offers new opportunities to realize future smart cities. To understand, manage, and forecast a city's behavior, it is necessary to analyze different kinds of data from a wide variety of data acquisition systems. The aim of this research activity, within the framework of Data Science and Complex Systems Physics, is to provide stakeholders with new knowledge tools to improve the sustainability of mobility demand in future cities. From this perspective, the governance of mobility demand generated by large tourist flows is becoming a vital issue for the quality of life in the historical centers of Italian cities, an issue that will worsen in the near future due to the continuing globalization process. Another critical theme is sustainable mobility, which aims to reduce the use of private transportation in cities and improve multimodal mobility. We analyze the statistical properties of urban mobility in Venice, Rimini, and Bologna using different datasets provided by companies and local authorities. We develop algorithms and tools for cartography extraction, trip reconstruction, multimodality classification, and mobility simulation. We show the existence of characteristic mobility paths and statistical properties that depend on the means of transport and the type of user. Finally, we use our results to model and simulate the overall behavior of cars moving in the Emilia-Romagna Region and of pedestrians moving in Venice, with software able to replicate in silico the demand for mobility and its dynamics.

Relevance:

100.00%

Abstract:

The advent of omic data production has opened many new perspectives in the quest for modelling complexity in biophysical systems. With the capability of characterizing a complex organism through the patterns of its molecular states, observed at different levels through various omics, a new paradigm of investigation is arising. In this thesis, we investigate the links between perturbations of the human organism, described as the ensemble of crosstalk of its molecular states, and health. Machine learning plays a key role within this picture, both in omic data analysis and in model building. We propose and discuss different frameworks developed by the author using machine learning for data reduction, integration, projection on latent features, pattern analysis, classification, and clustering of omic data, with a focus on 1H NMR metabolomic spectral data. The aim is to link different levels of omic observations of molecular states, from nanoscale to macroscale, to study perturbations such as diseases and diet, interpreted as changes in molecular patterns. The first part of this work focuses on the fingerprinting of diseases, linking cellular and systemic metabolomics with genomics to assess and predict the downstream effects of perturbations all the way down to the enzymatic network. The second part is a set of frameworks and models, developed with 1H NMR metabolomics at its core, to study the exposure of the human organism to diet and food intake in its full complexity, from epidemiological data analysis to molecular characterization of food structure.

Relevance:

100.00%

Abstract:

Natural Language Processing (NLP) has seen tremendous improvements over the last few years. Transformer architectures have achieved impressive results in almost every NLP task, such as Text Classification, Machine Translation, and Language Generation. Over time, transformers have continued to improve thanks to larger corpora and bigger networks, reaching hundreds of billions of parameters. Training and deploying such large models has become prohibitively expensive, so that only large high-tech companies can afford to train them. Therefore, a lot of research has been dedicated to reducing model size. In this thesis, we investigate the effects of Vocabulary Transfer and Knowledge Distillation for compressing large Language Models. The goal is to combine these two methodologies to further compress models without significant loss of performance. In particular, we designed different combination strategies and conducted a series of experiments on different vertical domains (medical, legal, news) and downstream tasks (Text Classification and Named Entity Recognition). Four different methods involving Vocabulary Transfer (VIPI), with and without a Masked Language Modelling (MLM) step and with and without Knowledge Distillation, are compared against a baseline that assigns random vectors to new elements of the vocabulary. Results indicate that VIPI effectively transfers information from the original vocabulary and that MLM is beneficial. Vocabulary transfer and knowledge distillation are also found to be orthogonal to one another, so they may be applied jointly. Applying knowledge distillation first and vocabulary transfer afterwards is recommended. Finally, model performance under vocabulary transfer does not always show a consistent trend as the vocabulary size is reduced. Hence, the vocabulary size should be selected empirically by evaluation on the downstream task, much like hyperparameter tuning.
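For readers unfamiliar with knowledge distillation, a minimal sketch of the commonly used soft-target loss follows. It is the generic temperature-scaled KL formulation, not necessarily the objective used in this thesis, which the abstract does not specify. The function names, the default temperature, and the example logits are all illustrative assumptions.

```python
import math


def softmax(logits, temperature=1.0):
    """Temperature-scaled softmax over a list of logits."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def distillation_loss(student_logits, teacher_logits, temperature=2.0):
    """KL(teacher || student) on softened distributions, scaled by T^2.

    Generic soft-target loss; an assumption, not the thesis's exact objective.
    """
    p_teacher = softmax(teacher_logits, temperature)
    p_student = softmax(student_logits, temperature)
    kl = sum(
        pt * math.log(pt / ps)
        for pt, ps in zip(p_teacher, p_student)
        if pt > 0.0
    )
    return kl * temperature ** 2


# Hypothetical logits for a three-class text classification example.
teacher = [4.0, 1.0, 0.5]
student = [2.5, 1.5, 0.2]
print(f"distillation loss: {distillation_loss(student, teacher):.4f}")
```

The loss is zero when the student reproduces the teacher's logits exactly and grows as the two softened distributions diverge. In practice it is usually combined with the ordinary task loss on the ground-truth labels.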

Relevance:

100.00%

Abstract:

To evaluate p16(INK4a) immunoexpression in CIN1 lesions, looking for differences between cases that progress to CIN2/3, maintain the CIN1 diagnosis, or spontaneously regress. Seventy-four CIN1 biopsies were studied. During follow-up, a second biopsy was performed: 28.7% showed no lesion (regression), 37.9% maintained CIN1, and 33.4% progressed to CIN2/3. Immunostaining for p16(INK4a) was performed on the first biopsy and was considered positive when there was strong and diffuse staining of the basal and parabasal layers. Pearson's chi-square test was used to compare the groups (p ≤ 0.05). The age of the patients was similar. There was no significant difference in p16(INK4a) immunoexpression across the three groups; however, statistical analysis showed a significant association when only the progression and regression groups were compared (p = 0.042). Considering p16(INK4a) positivity and progression to CIN2/3, the sensitivity, specificity, positive predictive value, and negative predictive value in our cohort were 45%, 75%, 47%, and 94%, respectively. We emphasize that CIN1 with p16(INK4a) staining was associated with lesion progression, but the sensitivity was not high. However, the negative predictive value was more reliable (94%), and p16(INK4a) may represent a useful biomarker that can identify CIN1 lesions that need particular attention, complementing morphology.
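The four accuracy measures named in this abstract are all derived from a 2x2 table of test result against outcome. The sketch below shows the standard definitions. The counts are invented for illustration and are not the study's data, which the abstract does not report as raw numbers.

```python
def diagnostic_metrics(tp: int, fp: int, fn: int, tn: int) -> dict:
    """Standard accuracy measures from a 2x2 table.

    tp: marker positive, lesion progressed
    fp: marker positive, lesion did not progress
    fn: marker negative, lesion progressed
    tn: marker negative, lesion did not progress
    """
    return {
        "sensitivity": tp / (tp + fn),  # progressors correctly flagged
        "specificity": tn / (tn + fp),  # non-progressors correctly cleared
        "ppv": tp / (tp + fp),          # positive predictive value
        "npv": tn / (tn + fn),          # negative predictive value
    }


# Hypothetical counts, chosen only to illustrate the formulas.
metrics = diagnostic_metrics(tp=40, fp=10, fn=20, tn=130)
for name, value in metrics.items():
    print(f"{name}: {value:.1%}")
```

Sensitivity and specificity describe the test itself, whereas the predictive values also depend on how common progression is in the cohort.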

Relevance:

100.00%

Abstract:

We report four cases of surgically treated intracranial arachnoid cysts, one with a cyst-peritoneal shunt and three with craniotomy and arachnoid membrane resection. Their classification and etiopathogenesis are discussed, with particular attention to the different methods of treatment, comparing the severe complications (adversities) with the favorable outcomes in severe clinical cases (plasticity) treated at our institution.

Relevance:

100.00%

Abstract:

The creation of the Brazilian Program for the Modernization of Horticulture by the Secretariat of Agriculture and Supply of the State of São Paulo at CEAGESP established the standardization of fruit and vegetables in the following aspects: degree of coloration, shape, size (caliber), defects, and packing. The main goal of this research is therefore to correlate the classification given by the Brazilian Program with the one used by wholesalers at CEAGESP, verifying whether the established norms are being met for the cultivars Carmen and Debora (SAKATA SEED). The results showed that, for cultivar Carmen, the averages of the observed values do not depart from the norms created by the Program for the small and medium sizes. For cultivar Debora, however, the results showed differences between the adopted classifications. The tomatoes were devalued because they had been sold below the standard indicated by the Brazilian Program.

Relevance:

100.00%

Abstract:

OBJECTIVE: To describe the relationship between adiposity in adolescence and maternal obesity. METHODS: A cross-sectional study was conducted with 660 individuals aged 8 to 18 years, of both sexes, enrolled in one public and one private school in the city of São Paulo. Data were collected through interviews, anthropometric measurements, and a dietary survey. Adiposity in adolescence was measured using the body mass index, and regression analysis was used to assess its relationship with maternal obesity, adjusted for sex, age, stage of sexual maturation, total energy value of the diet, physical activity, sedentary behavior, birth weight, and maternal education. RESULTS: Of the adolescents studied, 64.7% were female. The mean (standard deviation) age was 12.4 (1.80) years, ranging from 8 to 17 years. A higher prevalence of overweight and obesity was found among males, although no significant association was observed between nutritional status and sex. After adjustment for the covariates, children of obese mothers were found to have a four times higher risk of being obese than adolescents whose mothers were not obese. CONCLUSION: Maternal obesity is an important risk factor for the development of obesity in adolescence.

Relevance:

100.00%

Abstract:

In 1992, Brazil modified its criteria for the toxicological classification of pesticides, aligning them with the hazard classification recommended by the World Health Organization (WHO). In 2002, the Globally Harmonized System of Classification and Labelling of Chemicals (GHS) was adopted by the United Nations. As a result, the WHO is aligning its recommended classification of pesticides with the GHS, which Brazil will also have to do. It was considered timely to estimate the impact of the 1992 change in criteria on the toxicological reclassification of the commercial products registered at the time. It was found that 58.6% of all pesticides then registered (74.9% of liquid formulations and 31.0% of solid ones) may have been reclassified into toxicological classes considered less hazardous, with changes in the risk communication expressed on their labels. This may have had negative consequences due to confusion in interpretation by farmers. In countries that already have pesticide hazard classification systems, such as Brazil, it is recommended that the impacts of the changes that may result from adopting the GHS be estimated before implementation.

Relevance:

100.00%

Abstract:

The weevil subfamily Scolytinae includes beetles that may feed on the bark, trunk, or roots of both live and dead trees and are sometimes considered forest and silvicultural pests. Less frequently, some species feed on seeds and may cause economic losses when associated with plant cultivars. Spermophthorus apuleiae Costa-Lima is a Neotropical scolytine formerly recorded as "associated" with seeds of Caesalpinia ferrea var. leiostachya Benth, a Brazilian tree popularly known in Portuguese as "pau-ferro". Hitherto, it was not clear whether these beetles actually feed on the seeds of that plant. To investigate the ability of S. apuleiae to feed on seeds of "pau-ferro", observations were made and colonies of these beetles were established. Neither in the field nor in captivity were the beetles observed feeding on the seeds. Even when beetles were exposed to seeds as the only source of food, they were incapable of boring into or eating the seeds and died. Our data therefore suggest that S. apuleiae is a frugivorous species which, peculiarly, does not eat seeds of "pau-ferro".

Relevance:

100.00%

Abstract:

In 1992, Brazil modified its criteria for the toxicological classification of pesticides, aligning them with the hazard classification recommended by the World Health Organization (WHO). In 2002, the Globally Harmonized System of Classification and Labelling of Chemicals (GHS) was adopted by the United Nations. As a result, the WHO is aligning its recommended classification of pesticides with the GHS, which Brazil will also have to do. It was considered timely to estimate the impact of the 1992 change in criteria on the toxicological reclassification of the commercial products registered at the time. It was found that 58.6% of all pesticides then registered (74.9% of liquid formulations and 31.0% of solid ones) may have been reclassified into Toxicological Classes considered less hazardous, with changes in the risk communication expressed on their labels. This may have had negative consequences due to confusion in interpretation by farmers. In countries that already have pesticide hazard classification systems, such as Brazil, it is recommended that the impacts of the changes that may result from adopting the GHS be estimated before implementation.

Relevance:

100.00%

Abstract:

Introduction: Work disability is a major consequence of rheumatoid arthritis (RA), associated not only with traditional disease activity variables, but also more significantly with demographic, functional, occupational, and societal variables. Recent reports suggest that the use of biologic agents offers potential for reduced work disability rates, but the conclusions are based on surrogate disease activity measures derived from studies primarily from Western countries. Methods: The Quantitative Standard Monitoring of Patients with RA (QUEST-RA) multinational database of 8,039 patients in 86 sites in 32 countries, 16 with high gross domestic product (GDP) (>24K US dollars (USD) per capita) and 16 low-GDP countries (<11K USD), was analyzed for work and disability status at onset and over the course of RA and clinical status of patients who continued working or had stopped working in high-GDP versus low-GDP countries according to all RA Core Data Set measures. Associations of work disability status with RA Core Data Set variables and indices were analyzed using descriptive statistics and regression analyses. Results: At the time of first symptoms, 86% of men (range 57%-100% among countries) and 64% (19%-87%) of women <65 years were working. More than one third (37%) of these patients reported subsequent work disability because of RA. Among 1,756 patients whose symptoms had begun during the 2000s, the probabilities of continuing to work were 80% (95% confidence interval (CI) 78%-82%) at 2 years and 68% (95% CI 65%-71%) at 5 years, with similar patterns in high-GDP and low-GDP countries. Patients who continued working versus stopped working had significantly better clinical status for all clinical status measures and patient self-report scores, with similar patterns in high-GDP and low-GDP countries. However, patients who had stopped working in high-GDP countries had better clinical status than patients who continued working in low-GDP countries. 
The most significant identifier of work disability in all subgroups was Health Assessment Questionnaire (HAQ) functional disability score. Conclusions: Work disability rates remain high among people with RA during this millennium. In low-GDP countries, people remain working with high levels of disability and disease activity. Cultural and economic differences between societies affect work disability as an outcome measure for RA.

Relevance:

100.00%

Abstract:

Background: Knowledge of the genetic diversity of the human immunodeficiency virus type 1 (HIV-1) is critical to lay the groundwork for the design of successful drugs or vaccines. In this study we aimed to characterize and define the molecular prevalence of HIV-1 subclade F1 currently circulating in Sao Paulo, Brazil. Methods: A total of 36 samples were selected from 888 adult patients residing in Sao Paulo who had previously been diagnosed, in two independent studies in our laboratory, as being infected with subclade F1 based on pol subgenomic fragment sequencing. Proviral DNA was amplified from the purified genomic DNA of all 36 blood samples by PCR of five overlapping fragments, followed by direct sequencing. Sequence data were obtained from the five fragments of pure subclade F1, and phylogenetic trees were constructed and compared with previously published sequences. Subclade F1 sequences that exhibited a mosaic structure with other subtypes were omitted from any further analysis. Results: Our methods of fragment amplification and sequencing confirmed that, for only 5 sequences, the subclade F1 assignment inferred from the pol region also held true for the genome as a whole, which put the estimated true prevalence at 0.56%. The results also showed a single phylogenetic cluster of the Brazilian subclade F1 along with non-Brazilian South American isolates in both the subgenomic and the full-length genome analyses, with an overall intrasubtype nucleotide divergence of 6.9%. The nucleotide differences within the South American and Central African F1 strains, in the C2-C3 env region, were 8.5% and 12.3%, respectively. Conclusion: Altogether, our findings showed a surprisingly low prevalence of subclade F1 in Brazil and suggest that these isolates originated in Central Africa and were subsequently introduced to South America.

Relevance:

100.00%

Abstract:

In symbolic Natural Language Processing (NLP) systems, several linguistic phenomena, for instance the thematic role relationships between sentence constituents, such as AGENT, PATIENT, and LOCATION, can be accounted for by employing a rule-based grammar. Another approach to NLP uses the connectionist model, which offers the benefits of learning, generalization, and fault tolerance, among others. A third option merges the two previous approaches into a hybrid one: a symbolic thematic theory is used to supply the connectionist network with initial knowledge. Inspired by neuroscience, a symbolic-connectionist hybrid system called BIO theta PRED (BIOlogically plausible thematic (theta) symbolic-connectionist PREDictor) is proposed, designed to reveal the thematic grid assigned to a sentence. Its connectionist architecture takes as input a featural representation of the words (based on the verb/noun WordNet classification and on the classical semantic microfeature representation) and produces as output the thematic grid assigned to the sentence. BIO theta PRED is designed to "predict" the thematic (semantic) roles assigned to words in a sentence context, employing a biologically inspired training algorithm and architecture and adopting a psycholinguistic view of thematic theory.

Relevance:

100.00%

Abstract:

Secondary forests are an increasingly common feature in tropical landscapes worldwide, and understanding their regeneration is necessary to design effective restoration strategies. It has previously been shown that the woody species community in secondary forests can follow different successional pathways according to the nature of past human activities in the area, yet little is known about patterns of herbaceous species diversity in secondary forests with different histories of land use. We compared the diversity and abundance of herbaceous plant communities in two types of Central Amazonian secondary forests: those regenerating on pastures created by felling and burning trees and those where trees were felled only. We also tested whether plant density and species richness in secondary forests are related to proximity to primary forest. In comparison with primary forest sites, forests regenerating on non-burned habitats had lower herbaceous plant density and species richness than those on burned ones. However, species composition and abundance in non-burned stands were more similar to those of primary forest, whereas several secondary forest specialist species were found in burned stands. In both non-burned and burned forests, distance from the forest edge was not related to herbaceous density and species richness. Overall, our results suggest that the natural regeneration of herbaceous species in secondary tropical forests depends on a site's post-clearing treatment. We recommend evaluating the land history of a site prior to developing and implementing a restoration strategy, as this will influence the biological template on which restoration efforts are overlaid.