77 resultados para Big Data Hadoop Spark GPSJ


Relevância:

30.00% 30.00%

Publicador:

Resumo:

Accurate and timely traffic flow prediction is crucial to proactive traffic management and control in data-driven intelligent transportation systems (D2ITS), which has attracted great research interest in the last few years. In this paper, we propose a Spatial-Temporal Weighted K-Nearest Neighbor model, named STW-KNN, in a general MapReduce framework of distributed modeling on a Hadoop platform, to enhance the accuracy and efficiency of short-term traffic flow forecasting. More specifically, STW-KNN considers the spatial-temporal correlation and weight of traffic flow with trend adjustment features, to optimize the search mechanisms containing state vector, proximity measure, prediction function, and K selection. urthermore, STW-KNN is implemented on a widely adopted Hadoop distributed computing platform with the MapReduce parallel processing paradigm, for parallel prediction of traffic flow in real time. inally, with extensive experiments on real-world big taxi trajectory data, STW-KNN is compared with the state-of-the-art prediction models including conventional K-Nearest Neighbor (KNN), Artificial Neural Networks (ANNs), Naïve Bayes (NB), Random orest (R), and C4.. The results demonstrate that the proposed model is superior to existing models on accuracy by decreasing the mean absolute percentage error (MAPE) value more than 11.9% only in time domain and even achieves 89.71% accuracy improvement with the MAPEs of between 4% and 6.% in both space and time domains, and also significantly improves the efficiency and scalability of short-term traffic flow forecasting over existing approaches.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Recent years have witnessed a growing interest in context-aware recommender system (CARS), which explores the impact of context factors on personalized Web services recommendation. Basically, the general idea of CARS methods is to mine historical service invocation records through the process of context-aware similarity computation. It is observed that traditional similarity mining process would very likely generate relatively big deviations of QoS values, due to the dynamic change of contexts. As a consequence, including a considerable amount of deviated QoS values in the similarity calculation would probably result in a poor accuracy for predicting unknown QoS values. In allusion to this problem, this paper first distinguishes two definitions of Abnormal Data and True Abnormal Data, the latter of which should be eliminated. Second, we propose a novel CASR-TADE method by incorporating the effectiveness of True Abnormal Data Elimination into context-aware Web services recommendation. Finally, the experimental evaluations on a real-world Web services dataset show that the proposed CASR-TADE method significantly outperforms other existing approaches.