4 resultados para Data detection
Resumo:
Background: Copy number variations (CNVs) have been shown to account for substantial portions of observed genomic variation and have been associated with qualitative and quantitative traits and the onset of disease in a number of species. Information from high-resolution studies to detect, characterize and estimate population-specific variant frequencies will facilitate the incorporation of CNVs in genomic studies to identify genes affecting traits of importance. Results: Genome-wide CNVs were detected in high-density single nucleotide polymorphism (SNP) genotyping data from 1,717 Nelore (Bos indicus) cattle, and in NGS data from eight key ancestral bulls. A total of 68,007 and 12,786 distinct CNVs were observed, respectively. Cross-comparisons of results obtained for the eight resequenced animals revealed that 92 % of the CNVs were observed in both datasets, while 62 % of all detected CNVs were observed to overlap with previously validated cattle copy number variant regions (CNVRs). Observed CNVs were used for obtaining breed-specific CNV frequencies and identification of CNVRs, which were subsequently used for gene annotation. A total of 688 of the detected CNVRs were observed to overlap with 286 non-redundant QTLs associated with important production traits in cattle. All of 34 CNVs previously reported to be associated with milk production traits in Holsteins were also observed in Nelore cattle. Comparisons of estimated frequencies of these CNVs in the two breeds revealed 14, 13, 6 and 14 regions in high (>20 %), low (<20 %) and divergent (NEL > HOL, NEL < HOL) frequencies, respectively. Conclusions: Obtained results significantly enriched the bovine CNV map and enabled the identification of variants that are potentially associated with traits under selection in Nelore cattle, particularly in genome regions harboring QTLs affecting production traits.
Resumo:
Collecting ground truth data is an important step to be accomplished before performing a supervised classification. However, its quality depends on human, financial and time ressources. It is then important to apply a validation process to assess the reliability of the acquired data. In this study, agricultural infomation was collected in the Brazilian Amazonian State of Mato Grosso in order to map crop expansion based on MODIS EVI temporal profiles. The field work was carried out through interviews for the years 2005-2006 and 2006-2007. This work presents a methodology to validate the training data quality and determine the optimal sample to be used according to the classifier employed. The technique is based on the detection of outlier pixels for each class and is carried out by computing Mahalanobis distances for each pixel. The higher the distance, the further the pixel is from the class centre. Preliminary observations through variation coefficent validate the efficiency of the technique to detect outliers. Then, various subsamples are defined by applying different thresholds to exclude outlier pixels from the classification process. The classification results prove the robustness of the Maximum Likelihood and Spectral Angle Mapper classifiers. Indeed, those classifiers were insensitive to outlier exclusion. On the contrary, the decision tree classifier showed better results when deleting 7.5% of pixels in the training data. The technique managed to detect outliers for all classes. In this study, few outliers were present in the training data, so that the classification quality was not deeply affected by the outliers.
Resumo:
Wine aroma is an important characteristic and may be related to certain specific parameters, such as raw material and production process. The complexity of Merlot wine aroma was considered suitable for comprehensive two-dimensional gas chromatography (GCGC), as this technique offers superior performance when compared to one-dimensional gas chromatography (1D-GC). The profile of volatile compounds of Merlot wine was, for the first time, qualitatively analyzed by HS-SPME-GCxGC with a time-of-flight mass spectrometric detector (TOFMS), resulting in 179 compounds tentatively identified by comparison of experimental GCxGC retention indices and mass spectra with literature 1D-GC data and 155 compounds tentatively identified only by mass spectra comparison. A set of GCGC experimental retention indices was also, for the first time, presented for a specific inverse set of columns. Esters were present in higher number (94), followed by alcohols (80), ketones (29), acids (29), aldehydes (23), terpenes (23), lactones (16), furans (14), sulfur compounds (9), phenols (7), pyrroles (5), C13-norisoprenoids (3), and pyrans (2). GCxGC/TOFMS parameters were improved and optimal conditions were: a polar (polyethylene glycol)/medium polar (50% phenyl 50% dimethyl arylene siloxane) column set, oven temperature offset of 10ºC, 7 s as modulation period and 1.4 s of hot pulse duration. Co-elutions came up to 138 compounds in 1D and some of them were resolved in 2D. Among the coeluted compounds, thirty-three volatiles co-eluted in both 1D and 2D and their tentative identification was possible only due to spectral deconvolution. Some compounds that might have important contribution to aroma notes were included in these superimposed peaks. Structurally organized distribution of compounds in the 2D space was observed for esters, aldehydes and ketones, alcohols, thiols, lactones, acids and also inside subgroups, as occurred with esters and alcohols. The Fischer Ratio was useful for establishing the analytes responsible for the main differences between Merlot and non-Merlot wines. Differentiation among Merlot wines and wines of other grape varieties were mainly perceived through the following components: ethyl dodecanoate, 1-hexanol, ethyl nonanoate, ethyl hexanoate, ethyl decanoate, dehydro-2-methyl-3(2H)thiophenone, 3-methyl butanoic acid, ethyl tetradecanoate, methyl octanoate, 1,4 butanediol, and 6-methyloctan-1-ol.
Resumo:
Brazil is one of the largest beef producers and exporters in the world with the Nelore breed representing the vast majority of Brazilian cattle (Bos taurus indicus). Despite the great adaptability of the Nelore breed to tropical climate, meat tenderness (MT) remains to be improved. Several factors including genetic composition can influence MT. In this article, we report a genome-wide analysis of copy number variation (CNV) inferred from Illumina1 High Density SNP-chip data for a Nelore population of 723 males. We detected >2,600 CNV regions (CNVRs) representing 6.5% of the genome. Comparing our results with previous studies revealed an overlap in 1400 CNVRs (>50%). A total of 1,155 CNVRs (43.6%) overlapped 2,750 genes. They were enriched for processes involving guanosine triphosphate (GTP), previously reported to influence skeletal muscle physiology and morphology. Nelore CNVRs also overlapped QTLs for MT reported in other breeds (8.9%, 236 CNVRs) and from a previous study with this population (4.1%, 109 CNVRs). Two CNVRs were also proximal to glutathione metabolism genes that were previously associated with MT. Genome-wide association study of CN state with estimated breeding values derived from meat shear force identified 6 regions, including a region on BTA3 that contains genes of the cAMP and cGMP pathway. Ten CNVRs that overlapped regions associated with MT were successfully validated by qPCR. Our results represent the first comprehensive CNV study in Bos taurus indicus cattle and identify regions in which copy number changes are potentially of importance for the MT phenotype.