9 resultados para Traffic clustering
em Helda - Digital Repository of University of Helsinki
Resumo:
Road traffic accidents are a large problem everywhere in the world. However, regional differences in traffic safety between countries are considerable. For example, traffic safety records are much worse in Southern Europe and the Middle East than in Northern and Western Europe. Despite the large regional differences in traffic safety, factors contributing to different accident risk figures in different countries and regions have remained largely unstudied. The general aim of this study was to investigate regional differences in traffic safety between Southern European/Middle Eastern (i.e., Greece, Iran, Turkey) and Northern/Western European (i.e., Finland, Great Britain, The Netherlands) countries and to identify factors related to these differences. We conducted seven sub-studies in which I applied a traffic culture framework, including a multi-level approach, to traffic safety. We used aggregated level data (national statistics), surveys among drivers, and data on traffic accidents and fatalities in the analyses. In the first study, we investigated the influence of macro level factors (i.e., economic, societal, and cultural) on traffic safety across countries. The results showed that a high GNP per capita and conservatism correlated with a low number of traffic fatalities, whereas a high degree of uncertainty avoidance, neuroticism, and egalitarianism correlated with a high number of traffic fatalities. In the second, third, and fourth studies, we examined whether the conceptualisation of road user characteristics (i.e., driver behaviour and performance) varied across traffic cultures and how these factors determined overall safety, and the differences between countries in traffic safety. The results showed that the factorial agreement for driver behaviour (i.e., aggressive driving) and performance (i.e., safety skills) was unsatisfactory in Greece, Iran, and Turkey, where the lack of social tolerance and interpersonal aggressive violations seem to be important characteristics of driving. In addition, we found that driver behaviour (i.e., aggressive violations and errors) mediated the relationship between culture/country and accidents. Besides, drivers from "dangerous" Southern European countries and Iran scored higher on aggressive violations and errors than did drivers from "safe" Northern European countries. However, "speeding" appeared to be a "pan-cultural" problem in traffic. Similarly, aggressive driving seems largely depend on road users' interactions and drivers' interpretation (i.e., cognitive biases) of the behaviour of others in every country involved in the study. Moreover, in all countries, a risky general driving style was mostly related to being young and male. The results of the fifth and sixth studies showed that among young Turkish drivers, gender stereotypes (i.e., masculinity and femininity) greatly influence driver behaviour and performance. Feminine drivers were safety-oriented whereas masculine drivers were skill-oriented and risky drivers. Since everyday driving tasks involve not only erroneous (i.e., risky or dangerous driving) or correct performance (i.e., normal habitual driving), but also "positive" driver behaviours, we developed a reliable scale for measuring "positive" driver behaviours among Turkish drivers in the seventh study. Consequently, I revised Reason's model [Reason, J. T., 1990. Human error. Cambridge University Press: New York] of aberrant driver behaviour to represent a general driving style, including all possible intentional behaviours in traffic while evaluating the differences between countries in traffic safety. The results emphasise the importance of economic, societal and cultural factors, general driving style and skills, which are related to exposure, cognitive biases as well as age, sex, and gender, in differences between countries in traffic safety.
Resumo:
The Minimum Description Length (MDL) principle is a general, well-founded theoretical formalization of statistical modeling. The most important notion of MDL is the stochastic complexity, which can be interpreted as the shortest description length of a given sample of data relative to a model class. The exact definition of the stochastic complexity has gone through several evolutionary steps. The latest instantation is based on the so-called Normalized Maximum Likelihood (NML) distribution which has been shown to possess several important theoretical properties. However, the applications of this modern version of the MDL have been quite rare because of computational complexity problems, i.e., for discrete data, the definition of NML involves an exponential sum, and in the case of continuous data, a multi-dimensional integral usually infeasible to evaluate or even approximate accurately. In this doctoral dissertation, we present mathematical techniques for computing NML efficiently for some model families involving discrete data. We also show how these techniques can be used to apply MDL in two practical applications: histogram density estimation and clustering of multi-dimensional data.
Resumo:
Wireless access is expected to play a crucial role in the future of the Internet. The demands of the wireless environment are not always compatible with the assumptions that were made on the era of the wired links. At the same time, new services that take advantage of the advances in many areas of technology are invented. These services include delivery of mass media like television and radio, Internet phone calls, and video conferencing. The network must be able to deliver these services with acceptable performance and quality to the end user. This thesis presents an experimental study to measure the performance of bulk data TCP transfers, streaming audio flows, and HTTP transfers which compete the limited bandwidth of the GPRS/UMTS-like wireless link. The wireless link characteristics are modeled with a wireless network emulator. We analyze how different competing workload types behave with regular TPC and how the active queue management, the Differentiated services (DiffServ), and a combination of TCP enhancements affect the performance and the quality of service. We test on four link types including an error-free link and the links with different Automatic Repeat reQuest (ARQ) persistency. The analysis consists of comparing the resulting performance in different configurations based on defined metrics. We observed that DiffServ and Random Early Detection (RED) with Explicit Congestion Notification (ECN) are useful, and in some conditions necessary, for quality of service and fairness because a long queuing delay and congestion related packet losses cause problems without DiffServ and RED. However, we observed situations, where there is still room for significant improvements if the link-level is aware of the quality of service. Only very error-prone link diminishes the benefits to nil. The combination of TCP enhancements improves performance. These include initial window of four, Control Block Interdependence (CBI) and Forward RTO recovery (F-RTO). The initial window of four helps a later starting TCP flow to start faster but generates congestion under some conditions. CBI prevents slow-start overshoot and balances slow start in the presence of error drops, and F-RTO reduces unnecessary retransmissions successfully.
Resumo:
Online content services can greatly benefit from personalisation features that enable delivery of content that is suited to each user's specific interests. This thesis presents a system that applies text analysis and user modeling techniques in an online news service for the purpose of personalisation and user interest analysis. The system creates a detailed thematic profile for each content item and observes user's actions towards content items to learn user's preferences. A handcrafted taxonomy of concepts, or ontology, is used in profile formation to extract relevant concepts from the text. User preference learning is automatic and there is no need for explicit preference settings or ratings from the user. Learned user profiles are segmented into interest groups using clustering techniques with the objective of providing a source of information for the service provider. Some theoretical background for chosen techniques is presented while the main focus is in finding practical solutions to some of the current information needs, which are not optimally served with traditional techniques.
Resumo:
This paper investigates the clustering pattern in the Finnish stock market. Using trading volume and time as factors capturing the clustering pattern in the market, the Keim and Madhavan (1996) and the Engle and Russell (1998) model provide the framework for the analysis. The descriptive and the parametric analysis provide evidences that an important determinant of the famous U-shape pattern in the market is the rate of information arrivals as measured by large trading volumes and durations at the market open and close. Precisely, 1) the larger the trading volume, the greater the impact on prices both in the short and the long run, thus prices will differ across quantities. 2) Large trading volume is a non-linear function of price changes in the long run. 3) Arrival times are positively autocorrelated, indicating a clustering pattern and 4) Information arrivals as approximated by durations are negatively related to trading flow.