85 resultados para Genetic clustering analysis

em Biblioteca Digital da Produção Intelectual da Universidade de São Paulo (BDPI/USP)


Relevância:

100.00% 100.00%

Publicador:

Resumo:

In the southern region of Mato Grosso do Sul state, Brazil, a foot-and-mouth disease (FMD) epidemic started in September 2005. A total of 33 outbreaks were detected and 33,741 FMD-susceptible animals were slaughtered and destroyed. There were no reports of FMD cases in other species than bovines. Based on the data of this epidemic, it was carried out an analysis using the K-function and it was observed spatial clustering of outbreaks within a range of 25km. This observation may be related to the dynamics of foot-and-mouth disease spread and to the measures undertaken to control the disease dissemination. The control measures were effective once the disease did not spread to farms more than 47 km apart from the initial outbreaks.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The genetic linkage map for the common bean (Phaseolus vulgaris L.) is a valuable tool for breeding programs. Breeders provide new cultivars that meet the requirements of farmers and consumers, such as seed color, seed size, maturity, and growth habit. A genetic study was conducted to examine the genetics behind certain qualitative traits. Growth habit is usually described as a recessive trait inherited by a single gene, and there is no consensus about the position of the locus. The aim of this study was to develop a new genetic linkage map using genic and genomic microsatellite markers and three morphological traits: growth habit, flower color, and pod tip shape. A mapping population consisting of 380 recombinant F10 lines was generated from IAC-UNA x CAL143. A total of 871 microsatellites were screened for polymorphisms among the parents, and a linkage map was obtained with 198 mapped microsatellites. The total map length was 1865.9 cM, and the average distance between markers was 9.4 cM. Flower color and pod tip shape were mapped and segregated at Mendelian ratios, as expected. The segregation ratio and linkage data analyses indicated that the determinacy growth habit was inherited as two independent and dominant genes, and a genetic model is proposed for this trait.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

A conceptual problem that appears in different contexts of clustering analysis is that of measuring the degree of compatibility between two sequences of numbers. This problem is usually addressed by means of numerical indexes referred to as sequence correlation indexes. This paper elaborates on why some specific sequence correlation indexes may not be good choices depending on the application scenario in hand. A variant of the Product-Moment correlation coefficient and a weighted formulation for the Goodman-Kruskal and Kendall`s indexes are derived that may be more appropriate for some particular application scenarios. The proposed and existing indexes are analyzed from different perspectives, such as their sensitivity to the ranks and magnitudes of the sequences under evaluation, among other relevant aspects of the problem. The results help suggesting scenarios within the context of clustering analysis that are possibly more appropriate for the application of each index. (C) 2008 Elsevier Inc. All rights reserved.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Macro- and microarrays are well-established technologies to determine gene functions through repeated measurements of transcript abundance. We constructed a chicken skeletal muscle-associated array based on a muscle-specific EST database, which was used to generate a tissue expression dataset of similar to 4500 chicken genes across 5 adult tissues (skeletal muscle, heart, liver, brain, and skin). Only a small number of ESTs were sufficiently well characterized by BLAST searches to determine their probable cellular functions. Evidence of a particular tissue-characteristic expression can be considered an indication that the transcript is likely to be functionally significant. The skeletal muscle macroarray platform was first used to search for evidence of tissue-specific expression, focusing on the biological function of genes/transcripts, since gene expression profiles generated across tissues were found to be reliable and consistent. Hierarchical clustering analysis revealed consistent clustering among genes assigned to 'developmental growth', such as the ontology genes and germ layers. Accuracy of the expression data was supported by comparing information from known transcripts and tissue from which the transcript was derived with macroarray data. Hybridization assays resulted in consistent tissue expression profile, which will be useful to dissect tissue-regulatory networks and to predict functions of novel genes identified after extensive sequencing of the genomes of model organisms. Screening our skeletal-muscle platform using 5 chicken adult tissues allowed us identifying 43 'tissue-specific' transcripts, and 112 co-expressed uncharacterized transcripts with 62 putative motifs. This platform also represents an important tool for functional investigation of novel genes; to determine expression pattern according to developmental stages; to evaluate differences in muscular growth potential between chicken lines, and to identify tissue-specific genes.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

The present study determined the distribution pattern of the hermit crab Loxopagurus loxochelis by a comparison of catch, depth and environmental factors at two separate bays (Caraguatatuba and Ubatuba) of Sao Paulo State, Brazil. The influence of these parameters on the distribution of males, non- ovigerous females and ovigerous females was also evaluated. Crabs were collected monthly, over a period of one year (from July/2002 to June/2003), in seven depths, from 5 to 35 m. Abiotic factors were monitored as follows: superficial and bottom salinity (psu), superficial and bottom temperature (C), organic matter content (%) and sediment composition (%). In total, 366 hermit crabs were sampled in Caraguatatuba and 126 in Ubatuba. The highest frequency of occurrence was verified at 20 m during winter (July) in Caraguatatuba and 25 m during summer (January) in Ubatuba. The highest occurrences were recorded in the regions with bottom salinities ranging from 34 to 36 psu, bottom temperatures from 18 to 24 C and, low percentages of organic matter, gravel and mud; and large proportion of sand in the substrate. There was no significant correlation between the total frequency of organisms and the environmental factors analyzed in both regions. This evidence suggests that other variables as biotic interactions can influence the pattern of distribution of L. loxochelis in the analyzed region, which is considered the limit of the northern distribution of this species.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Background: Since establishing universal free access to antiretroviral therapy in 1996, the Brazilian Health System has increased the number of centers providing HIV/AIDS outpatient care from 33 to 540. There had been no formal monitoring of the quality of these services until a survey of 336 AIDS health centers across 7 Brazilian states was undertaken in 2002. Managers of the services were asked to assess their clinics according to parameters of service inputs and service delivery processes. This report analyzes the survey results and identifies predictors of the overall quality of service delivery. Methods: The survey involved completion of a multiple-choice questionnaire comprising 107 parameters of service inputs and processes of delivering care, with responses assessed according to their likely impact on service quality using a 3-point scale. K-means clustering was used to group these services according to their scored responses. Logistic regression analysis was performed to identify predictors of high service quality. Results: The questionnaire was completed by 95.8% (322) of the managers of the sites surveyed. Most sites scored about 50% of the benchmark expectation. K-means clustering analysis identified four quality levels within which services could be grouped: 76 services (24%) were classed as level 1 (best), 53 (16%) as level 2 (medium), 113 (35%) as level 3 (poor), and 80 (25%) as level 4 (very poor). Parameters of service delivery processes were more important than those relating to service inputs for determining the quality classification. Predictors of quality services included larger care sites, specialization for HIV/AIDS, and location within large municipalities. Conclusion: The survey demonstrated highly variable levels of HIV/AIDS service quality across the sites. Many sites were found to have deficiencies in the processes of service delivery processes that could benefit from quality improvement initiatives. These findings could have implications for how HIV/AIDS services are planned in Brazil to achieve quality standards, such as for where service sites should be located, their size and staffing requirements. A set of service delivery indicators has been identified that could be used for routine monitoring of HIV/AIDS service delivery for HIV/AIDS in Brazil (and potentially in other similar settings).

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Background: High-throughput molecular approaches for gene expression profiling, such as Serial Analysis of Gene Expression (SAGE), Massively Parallel Signature Sequencing (MPSS) or Sequencing-by-Synthesis (SBS) represent powerful techniques that provide global transcription profiles of different cell types through sequencing of short fragments of transcripts, denominated sequence tags. These techniques have improved our understanding about the relationships between these expression profiles and cellular phenotypes. Despite this, more reliable datasets are still necessary. In this work, we present a web-based tool named S3T: Score System for Sequence Tags, to index sequenced tags in accordance with their reliability. This is made through a series of evaluations based on a defined rule set. S3T allows the identification/selection of tags, considered more reliable for further gene expression analysis. Results: This methodology was applied to a public SAGE dataset. In order to compare data before and after filtering, a hierarchical clustering analysis was performed in samples from the same type of tissue, in distinct biological conditions, using these two datasets. Our results provide evidences suggesting that it is possible to find more congruous clusters after using S3T scoring system. Conclusion: These results substantiate the proposed application to generate more reliable data. This is a significant contribution for determination of global gene expression profiles. The library analysis with S3T is freely available at http://gdm.fmrp.usp.br/s3t/.S3T source code and datasets can also be downloaded from the aforementioned website.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

The aim of this study was to describe the population structure, inbreeding and to quantify their effect for different weights, of Santa Ines sheep. For this reason, 6490 data of production and 17,097 animals in the pedigree data set were utilized to evaluate birth weight (BW), weight at 60 days (W60) and weight at 180 days (W180). The genetic structure analysis of the population was realized by the software ENDOG (v.4.6.), resulting in some level of inbreeding for 21.72% of the animals in the pedigree data, being 41.02% the maximum value, and average of 10.74% for the inbred individuals. The population average inbreeding was 2.33% and the average relatedness was 0.73%. The effective number of ancestors was 156 animals and the effective number of founders was 211 individuals. A significant depressive effect of the inbreeding can be verified for all traits. The monitored parameters related with the genetic variability on this population must be constant in order to prevent the decrease in the genetic progress. The utilization of a program for directed mating in the present flock is an appropriate alternative to keep the level of inbreeding under control. (C) 2010 Elsevier B.V. All rights reserved.

Relevância:

50.00% 50.00%

Publicador:

Resumo:

Trypanosoma (Megatrypanum) theileri from cattle and trypanosomes of other artiodactyls form a clade of closely related species in analyses using ribosomal sequences. Analysis of polymorphic sequences of a larger number of trypanosomes from broader geographical origins is required to evaluate the Clustering of isolates as suggested by previous studies. Here, we determined the sequences of the spliced leader (SL) genes of 21 isolates from cattle and 2 from water buffalo from distant regions of Brazil. Analysis of SL gene repeats revealed that the 5S rRNA gene is inserted within the intergenic region. Phylogeographical patterns inferred using SL sequences showed at least 5 major genotypes of T. theileri distributed in 2 strongly divergent lineages. Lineage TthI comprises genotypes IA and IB from buffalo and cattle, respectively, from the Southeast and Central regions, whereas genotype IC is restricted to cattle from the Southern region. Lineage Tth II includes cattle genotypes IIA, which is restricted to the North and Northeast, and IIB, found in the Centre, West, North and Northeast. PCR-RFLP of SL genes revealed valuable markers for genotyping T. theileri. The results of this study emphasize the genetic complexity and corroborate the geographical structuring of T. theileri genotypes found in cattle.

Relevância:

50.00% 50.00%

Publicador:

Resumo:

In this paper, we present an algorithm for cluster analysis that integrates aspects from cluster ensemble and multi-objective clustering. The algorithm is based on a Pareto-based multi-objective genetic algorithm, with a special crossover operator, which uses clustering validation measures as objective functions. The algorithm proposed can deal with data sets presenting different types of clusters, without the need of expertise in cluster analysis. its result is a concise set of partitions representing alternative trade-offs among the objective functions. We compare the results obtained with our algorithm, in the context of gene expression data sets, to those achieved with multi-objective Clustering with automatic K-determination (MOCK). the algorithm most closely related to ours. (C) 2009 Elsevier B.V. All rights reserved.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

We evaluated the genetic and physiological variability of Moniliophthora perniciosa obtained from healthy and diseased branches of cacao (Theobroma cacao) plants. The diversity of the isolates was evaluated by RAPD technique and by studies of virulence and exoenzyme production. The genetic variability of endophytic and pathogenic M. perniciosa was evaluated in association with pathogenicity assays. RAPD analysis showed eight genetic groups, which were not related to plant disease status (healthy versus diseased branches). Isolates from cacao were included in three groups, excluding isolates from other host plants. Pathogenicity and enzyme analysis showed that the virulence of the isolates is not related to exoenzyme production. This is the first evidence that M. perniciosa colonizes healthy parenchymatic tissues, showing that endophytic behavior may occur in this species.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

With the aim of estimating the coefficient of heritability of average annual productivity of Nellore cows (COWPROD), a data set from 24,855 animals with known pedigree was analyzed. COWPROD is defined as the amount (in kilograms) of weaned calves produced yearly by one cow during her remaining time in herd ignoring a fixed period of 365 days. COWPROD was calculated regarding three standards: a) based on the post-weaning weight from the calves ignoring any kind of adjustment (COWPROD_NAJ), b) adjusted weight for the fixed effects (COWPROD_AJFIX) and c) adjusted weight for the fixed effects and for the genetic merit of the sire (COWPROD_AJFIN). The obtained heritabilities were 0.15, 0.15 and 0.16 for COWPROD_NAJ, COWPROD_AJFIX and COWPROD_AJFIN, respectively. A complete set composed of 105,158 COWPROD records on 130,740 animals in pedigree was also analyzed for predicting the genetic merit of all animals in the data set and for the calculation of the genetic, phenotypic and residual trends. Ranking correlation was high for the adjusted and non-adjusted data, yet, for some of the animals, the difference among the genetic values was large. This would be an indication that it would be better to work always with the adjusted weaning weights. The genetic trend was positive, but was of small magnitude (0.26% of the trait average) and the residual trend was negative as a consequence of the large intensification of the production system, which has been occurring in the last years in the farms studied. The phenotypic trend was also negative and intermediate between the genetic and the residual ones.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

A simultaneous optimization strategy based on a neuro-genetic approach is proposed for selection of laser induced breakdown spectroscopy operational conditions for the simultaneous determination of macronutrients (Ca, Mg and P), micro-nutrients (B, Cu, Fe, Mn and Zn), Al and Si in plant samples. A laser induced breakdown spectroscopy system equipped with a 10 Hz Q-switched Nd:YAG laser (12 ns, 532 nm, 140 mJ) and an Echelle spectrometer with intensified coupled-charge device was used. Integration time gate, delay time, amplification gain and number of pulses were optimized. Pellets of spinach leaves (NIST 1570a) were employed as laboratory samples. In order to find a model that could correlate laser induced breakdown spectroscopy operational conditions with compromised high peak areas of all elements simultaneously, a Bayesian Regularized Artificial Neural Network approach was employed. Subsequently, a genetic algorithm was applied to find optimal conditions for the neural network model, in an approach called neuro-genetic, A single laser induced breakdown spectroscopy working condition that maximizes peak areas of all elements simultaneously, was obtained with the following optimized parameters: 9.0 mu s integration time gate, 1.1 mu s delay time, 225 (a.u.) amplification gain and 30 accumulated laser pulses. The proposed approach is a useful and a suitable tool for the optimization process of such a complex analytical problem. (C) 2009 Elsevier B.V. All rights reserved.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

Target region amplification polymorphism (TRAP) markers were used to estimate the genetic similarity (GS) among 53 sugarcane varieties and five species of the Saccharum complex. Seven fixed primers designed from candidate genes involved in sucrose metabolism and three from those involved in drought response metabolism were used in combination with three arbitrary primers. The clustering of the genotypes for sucrose metabolism and drought response were similar, but the GS based on Jaccard`s coefficient changed. The GS based on polymorphism in sucrose genes estimated in a set of 46 Brazilian varieties, all of which belong to the three Brazilian breeding programs, ranged from 0.52 to 0.9, and that based on drought data ranged from 0.44 to 0.95. The results suggest that genetic variability in the evaluated genes was lower in the sucrose metabolism genes than in the drought response metabolism ones.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

A comparative study between microsatellite and allozyme markers was conducted on the genetic structure and mating system in natural populations of Euterpe edulis Mart. Three cohorts, including seedlings, saplings, and adults, were examined in 4 populations using 10 allozyme loci and 10 microsatellite loci. As expected, microsatellite markers had a much higher degree of polymorphism than allozymes, but estimates of multilocus outcrossing rate ((t) over cap (m) = 1.00), as well as estimates of genetic structure (F(IS), G(ST)), were similar for the 2 sets of markers. Estimates of R(ST), for microsatellites, were higher than those of GST, but results of both statistics revealed a close agreement for the genetic structure of the species. This study provides support for the important conclusion that allozymes are still useful and reliable markers to estimate population genetic parameters. Effects of sample size on estimates from hypervariable loci are also discussed in this paper.