939 resultados para sparse coding
Resumo:
Diabetic Retinopathy (DR) is a complication of diabetes that can lead to blindness if not readily discovered. Automated screening algorithms have the potential to improve identification of patients who need further medical attention. However, the identification of lesions must be accurate to be useful for clinical application. The bag-of-visual-words (BoVW) algorithm employs a maximum-margin classifier in a flexible framework that is able to detect the most common DR-related lesions such as microaneurysms, cotton-wool spots and hard exudates. BoVW allows to bypass the need for pre- and post-processing of the retinographic images, as well as the need of specific ad hoc techniques for identification of each type of lesion. An extensive evaluation of the BoVW model, using three large retinograph datasets (DR1, DR2 and Messidor) with different resolution and collected by different healthcare personnel, was performed. The results demonstrate that the BoVW classification approach can identify different lesions within an image without having to utilize different algorithms for each lesion reducing processing time and providing a more flexible diagnostic system. Our BoVW scheme is based on sparse low-level feature detection with a Speeded-Up Robust Features (SURF) local descriptor, and mid-level features based on semi-soft coding with max pooling. The best BoVW representation for retinal image classification was an area under the receiver operating characteristic curve (AUC-ROC) of 97.8% (exudates) and 93.5% (red lesions), applying a cross-dataset validation protocol. To assess the accuracy for detecting cases that require referral within one year, the sparse extraction technique associated with semi-soft coding and max pooling obtained an AUC of 94.2 ± 2.0%, outperforming current methods. Those results indicate that, for retinal image classification tasks in clinical practice, BoVW is equal and, in some instances, surpasses results obtained using dense detection (widely believed to be the best choice in many vision problems) for the low-level descriptors.
Resumo:
Non-coding RNAs (ncRNAs) were recently given much higher attention due to technical advances in sequencing which expanded the characterization of transcriptomes in different organisms. ncRNAs have different lengths (22 nt to >1, 000 nt) and mechanisms of action that essentially comprise a sophisticated gene expression regulation network. Recent publication of schistosome genomes and transcriptomes has increased the description and characterization of a large number of parasite genes. Here we review the number of predicted genes and the coverage of genomic bases in face of the public ESTs dataset available, including a critical appraisal of the evidence and characterization of ncRNAs in schistosomes. We show expression data for ncRNAs in Schistosoma mansoni. We analyze three different microarray experiment datasets: (1) adult worms' large-scale expression measurements; (2) differentially expressed S. mansoni genes regulated by a human cytokine (TNF-α) in a parasite culture; and (3) a stage-specific expression of ncRNAs. All these data point to ncRNAs involved in different biological processes and physiological responses that suggest functionality of these new players in the parasite's biology. Exploring this world is a challenge for the scientists under a new molecular perspective of host-parasite interactions and parasite development.
Resumo:
Background: Ticks secrete a cement cone composed of many salivary proteins, some of which are rich in the amino acid glycine in order to attach to their hosts' skin. Glycine-rich proteins (GRPs) are a large family of heterogeneous proteins that have different functions and features; noteworthy are their adhesive and tensile characteristics. These properties may be essential for successful attachment of the metastriate ticks to the host and the prolonged feeding necessary for engorgement. In this work, we analyzed Expressed Sequence Tags (ESTs) similar to GRPs from cDNA libraries constructed from salivary glands of adult female ticks representing three hard, metastriate species in order to verify if their expression correlated with biological differences such as the numbers of hosts ticks feed on during their parasitic life cycle, whether one (monoxenous parasite) or two or more (heteroxenous parasite), and the anatomy of their mouthparts, whether short (Brevirostrata) or long (Longirostrata). These ticks were the monoxenous Brevirostrata tick, Rhipicephalus (Boophilus) microplus, a heteroxenous Brevirostrata tick, Rhipicephalus sanguineus, and a heteroxenous Longirostrata tick, Amblyomma cajennense. To further investigate this relationship, we conducted phylogenetic analyses using sequences of GRPs from these ticks as well as from other species of Brevirostrata and Longirostrata ticks. Results: cDNA libraries from salivary glands of the monoxenous tick, R. microplus, contained more contigs of glycine-rich proteins than the two representatives of heteroxenous ticks, R. sanguineus and A. cajennense (33 versus, respectively, 16 and 11). Transcripts of ESTs encoding GRPs were significantly more numerous in the salivary glands of the two Brevirostrata species when compared to the number of transcripts in the Longirostrata tick. The salivary gland libraries from Brevirostrata ticks contained numerous contigs significantly similar to silks of true spiders (17 and 8 in, respectively, R. microplus and R. sanguineus), whereas the Longirostrata tick contained only 4 contigs. The phylogenetic analyses of GRPs from various species of ticks showed that distinct clades encoding proteins with different biochemical properties are represented among species according to their biology. Conclusions: We found that different species of ticks rely on different types and amounts of GRPs in order to attach and feed on their hosts. Metastriate ticks with short mouthparts express more transcripts of GRPs than a tick with long mouthparts and the tick that feeds on a single host during its life cycle contain a greater variety of these proteins than ticks that feed on several hosts.
Resumo:
Despite the wide distribution of transposable elements (TEs) in mammalian genomes, part of their evolutionary significance remains to be discovered. Today there is a substantial amount of evidence showing that TEs are involved in the generation of new exons in different species. In the present study, we searched 22,805 genes and reported the occurrence of TE-cassettes in coding sequences of 542 cow genes using the RepeatMasker program. Despite the significant number (542) of genes with TE insertions in exons only 14 (2.6%) of them were translated into protein, which we characterized as chimeric genes. From these chimeric genes, only the FAST kinase domains 3 (FASTKD3) gene, present on chromosome BTA 20, is a functional gene and showed evidence of the exaptation event. The genome sequence analysis showed that the last exon coding sequence of bovine FASTKD3 is similar to 85% similar to the ART2A retrotransposon sequence. In addition, comparison among FASTKD3 proteins shows that the last exon is very divergent from those of Homo sapiens, Pan troglodytes and Canis familiares. We suggest that the gene structure of bovine FASTKD3 gene could have originated by several ectopic recombinations between TE copies. Additionally, the absence of TE sequences in all other species analyzed suggests that the TE insertion is clade-specific, mainly in the ruminant lineage.
Resumo:
Background: Myelodysplastic syndromes (MDS) are a group of clonal hematological disorders characterized by ineffective hematopoiesis with morphological evidence of marrow cell dysplasia resulting in peripheral blood cytopenia. Microarray technology has permitted a refined high-throughput mapping of the transcriptional activity in the human genome. Non-coding RNAs (ncRNAs) transcribed from intronic regions of genes are involved in a number of processes related to post-transcriptional control of gene expression, and in the regulation of exon-skipping and intron retention. Characterization of ncRNAs in progenitor cells and stromal cells of MDS patients could be strategic for understanding gene expression regulation in this disease. Methods: In this study, gene expression profiles of CD34(+) cells of 4 patients with MDS of refractory anemia with ringed sideroblasts (RARS) subgroup and stromal cells of 3 patients with MDS-RARS were compared with healthy individuals using 44 k combined intron-exon oligoarrays, which included probes for exons of protein-coding genes, and for non-coding RNAs transcribed from intronic regions in either the sense or antisense strands. Real-time RT-PCR was performed to confirm the expression levels of selected transcripts. Results: In CD34(+) cells of MDS-RARS patients, 216 genes were significantly differentially expressed (q-value <= 0.01) in comparison to healthy individuals, of which 65 (30%) were non-coding transcripts. In stromal cells of MDS-RARS, 12 genes were significantly differentially expressed (q-value <= 0.05) in comparison to healthy individuals, of which 3 (25%) were non-coding transcripts. Conclusions: These results demonstrated, for the first time, the differential ncRNA expression profile between MDS-RARS and healthy individuals, in CD34(+) cells and stromal cells, suggesting that ncRNAs may play an important role during the development of myelodysplastic syndromes.
Resumo:
An important topic in genomic sequence analysis is the identification of protein coding regions. In this context, several coding DNA model-independent methods based on the occurrence of specific patterns of nucleotides at coding regions have been proposed. Nonetheless, these methods have not been completely suitable due to their dependence on an empirically predefined window length required for a local analysis of a DNA region. We introduce a method based on a modified Gabor-wavelet transform (MGWT) for the identification of protein coding regions. This novel transform is tuned to analyze periodic signal components and presents the advantage of being independent of the window length. We compared the performance of the MGWT with other methods by using eukaryote data sets. The results show that MGWT outperforms all assessed model-independent methods with respect to identification accuracy. These results indicate that the source of at least part of the identification errors produced by the previous methods is the fixed working scale. The new method not only avoids this source of errors but also makes a tool available for detailed exploration of the nucleotide occurrence.
Resumo:
The purpose of this paper is to provide a cross-linguistic survey of the variation of coding strategies that are available for the grammatical distinction between direct and indirect speech representation with a particular focus on the expression of indirect reported speech. Cross-linguistic data from a sample of 42 languages will be provided to illustrate the range of available grammatical coding strategies.
Resumo:
Messenger RNAs coding for growth factors and receptor tyrosine kinases were measured by quantitative competitive and by semi-quantitative reverse-transcription polymerase chain reaction in whole and dissected chick inner ears. The fibroblast growth factor (FGF) receptor 1 chick embryonic kinase (CEK) 1 was expressed in all structures examined (otocyst, hatchling whole cochlea, cochlear nerve ganglion, and cochlear and vestibular sensory epithelia), although slightly more heavily in the otocyst. The related fibroblast growth factor receptors CEK 2 and 3 were preferentially expressed in the nerve ganglion and in the vestibular sensory epithelium, respectively. FGF 1 mRNA was low in early development, increasing to mature levels at around embryonic age 11 days, while FGF2, mRNA was expressed at constant levels at all ages. In response to ototoxic damage, FGF1 mRNA levels were increased in the early damaged cochlear sensory epithelium. Immunohistochemistry for CEK1 showed that normal hair cells expressed the receptor heavily on the hair cell stereocilia, while with early damage, CEK1 came to be expressed heavily on the apical surfaces of the supporting cells. In normal chicks, the CEK4 and CEK8 eph-class receptor tyrosine kinases were expressed relatively heavily by the cochlear nerve ganglion, and CEK10 was expressed relatively heavily by the cochlear hair cell sensory epithelium. The results suggest that the FGF system may be involved in the response of the cochlear epithelium to ototoxic damage. The eph-class receptor tyrosine kinase CEK10 may be involved in cell interactions in the cochlear sensory epithelium, while CEK4 and CEK8 may play a role in the cochlear innervation.
Resumo:
The nifH gene sequence of the nitrogen-fixing bacterium Acetobacter diazotrophicus was determined with the use of the polymerase chain reaction and universal degenerate oligonucleotide primers. The gene shows highest pair-wise similarity to the nifH gene of Azospirillum brasilense. The phylogenetic relationships of the nifH gene sequences were compared with those inferred from 16S rRNA gene sequences. Knowledge of the sequence of the nifH gene contributes to the growing database of nifH gene sequences, and will allow the detection of Acet. diazotrophicus from environmental samples with nifH gene-based primers.
Resumo:
Krylov subspace techniques have been shown to yield robust methods for the numerical computation of large sparse matrix exponentials and especially the transient solutions of Markov Chains. The attractiveness of these methods results from the fact that they allow us to compute the action of a matrix exponential operator on an operand vector without having to compute, explicitly, the matrix exponential in isolation. In this paper we compare a Krylov-based method with some of the current approaches used for computing transient solutions of Markov chains. After a brief synthesis of the features of the methods used, wide-ranging numerical comparisons are performed on a power challenge array supercomputer on three different models. (C) 1999 Elsevier Science B.V. All rights reserved.AMS Classification: 65F99; 65L05; 65U05.
Resumo:
This study describes a coding system developed to operationalize the sociolinguistic strategies proposed by communication accommodation theory (CAT) in an academic context. Fifty interactions between two students (of Australian or Chinese ethnic background) or a student and faculty member were videotaped. A turn- and episode-based coding system was developed, focusing on verbal and nonverbal behavior. The development of this system is described in detail, before results are presented. Results indicated that status was the main influence on choice of strategies, particularly the extent and type of discourse management and interpersonal control. Participants' sew and ethnicity also played a role: Male participants made more use of interpretability (largely questions), whereas female participants used discourse management to develop a shared perspective. The results make clear that there is no automatic correspondence between behaviors and the strategies they constitute, and they point to the appropriateness of conceptualizing behavior and strategies separately in CAT.
Resumo:
We sequenced across all of the gene boundaries in the mitochondrial genome of the cattle tick, Boophilus microplus, to determine the arrangement of its genes. The mtDNA of B. microplus has a coding region, composed of tRNA(Glu) and 60 bp of the 3' end of ND1, that is repeated five times. Boophilus microplus is the first coelomate animal known to have more than two copies of a coding sequence. The mitochondrial genome of B, microplus has other unusual features, including (1) reduced T arms in tRNAs, (2) an AT bias in codon use, (3) two control regions that have evolved in concert, (4) three gene rearrangements, and (5) a stem-loop between tRNA(Gln) and tRNA(Phe). The short T arms and small control regions (CRs) of B. microplus and other ticks suggest strong selection for small genomes. Imprecise termination of replication beyond its origin, which can account for the evolution of tandem repeats of coding regions in other mitochondrial genomes, cannot explain the evolution of the fivefold repeated sequence in the mitochondrial genome of B. microplus. Instead, slipped-strand mispairing or recombination are the most plausible explanations for the evolution of these tandem repeats.
Resumo:
High performance video codec is mandatory for multimedia applications such as video-on-demand and video conferencing. Recent research has proposed numerous video coding techniques to meet the requirement in bandwidth, delay, loss and Quality-of-Service (QoS). In this paper, we present our investigations on inter-subband self-similarity within the wavelet-decomposed video frames using neural networks, and study the performance of applying the spatial network model to all video frames over time. The goal of our proposed method is to restore the highest perceptual quality for video transmitted over a highly congested network. Our contributions in this paper are: (1) A new coding model with neural network based, inter-subband redundancy (ISR) prediction for video coding using wavelet (2) The performance of 1D and 2D ISR prediction, including multiple levels of wavelet decompositions. Our result shows a short-term quality enhancement may be obtained using both 1D and 2D ISR prediction.