Biblioteca Digital

411 resultados para Artificial intelligence (AI)

Infinite-horizon policy-gradient estimation

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Gradient-based approaches to direct policy search in reinforcement learning have received much recent attention as a means to solve problems of partial observability and to avoid some of the problems associated with policy degradation in value-function methods. In this paper we introduce GPOMDP, a simulation-based algorithm for generating a biased estimate of the gradient of the average reward in Partially Observable Markov Decision Processes (POMDPs) controlled by parameterized stochastic policies. A similar algorithm was proposed by Kimura, Yamamura, and Kobayashi (1995). The algorithm's chief advantages are that it requires storage of only twice the number of policy parameters, uses one free parameter β ∈ [0,1) (which has a natural interpretation in terms of bias-variance trade-off), and requires no knowledge of the underlying state. We prove convergence of GPOMDP, and show how the correct choice of the parameter β is related to the mixing time of the controlled POMDP. We briefly describe extensions of GPOMDP to controlled Markov chains, continuous state, observation and control spaces, multiple-agents, higher-order derivatives, and a version for training stochastic policies with internal states. In a companion paper (Baxter, Bartlett, & Weaver, 2001) we show how the gradient estimates generated by GPOMDP can be used in both a traditional stochastic gradient algorithm and a conjugate-gradient procedure to find local optima of the average reward. ©2001 AI Access Foundation and Morgan Kaufmann Publishers. All rights reserved.

Modeling player-like behaviour for game Al design

Relevância:

90.00% 90.00%

Publicador:

Resumo:

The design of artificial intelligence in computer games is an important component of a player's game play experience. As games are becoming more life-like and interactive, the need for more realistic game AI will increase. This is particularly the case with respect to AI that simulates how human players act, behave and make decisions. The purpose of this research is to establish a model of player-like behavior that may be effectively used to inform the design of artificial intelligence to more accurately mimic a player's decision making process. The research uses a qualitative analysis of player opinions and reactions while playing a first person shooter video game, with recordings of their in game actions, speech and facial characteristics. The initial studies provide player data that has been used to design a model of how a player behaves.

Characterizing multi-agent team behavior from partial team tracings : evidence from the English Premier League

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Real-world AI systems have been recently deployed which can automatically analyze the plan and tactics of tennis players. As the game-state is updated regularly at short intervals (i.e. point-level), a library of successful and unsuccessful plans of a player can be learnt over time. Given the relative strengths and weaknesses of a player’s plans, a set of proven plans or tactics from the library that characterize a player can be identified. For low-scoring, continuous team sports like soccer, such analysis for multi-agent teams does not exist as the game is not segmented into “discretized” plays (i.e. plans), making it difficult to obtain a library that characterizes a team’s behavior. Additionally, as player tracking data is costly and difficult to obtain, we only have partial team tracings in the form of ball actions which makes this problem even more difficult. In this paper, we propose a method to overcome these issues by representing team behavior via play-segments, which are spatio-temporal descriptions of ball movement over fixed windows of time. Using these representations we can characterize team behavior from entropy maps, which give a measure of predictability of team behaviors across the field. We show the efficacy and applicability of our method on the 2010-2011 English Premier League soccer data.

There is less to this argument than meets the eye

Relevância:

90.00% 90.00%

Publicador:

Resumo:

In this article the authors continue the Artificial Intelligence and the law debate begun with Moles' 1991 article. In it the authors answer the latest criticisms made by Moles and others as they explain and argue the case for the practical benefits to be gained by AI systems involving the law.

Arc Voltage Behaviour of One Drop Per Pulse Mode in GMAW-P

Relevância:

80.00% 80.00%

Publicador:

The State of the Art in Adaptive Information Agents

Relevância:

80.00% 80.00%

Publicador:

Texture for Script identification

Relevância:

80.00% 80.00%

Publicador:

Resumo:

The problem of determining the script and language of a document image has a number of important applications in the field of document analysis, such as indexing and sorting of large collections of such images, or as a precursor to optical character recognition (OCR). In this paper, we investigate the use of texture as a tool for determining the script of a document image, based on the observation that text has a distinct visual texture. An experimental evaluation of a number of commonly used texture features is conducted on a newly created script database, providing a qualitative measure of which features are most appropriate for this task. Strategies for improving classification results in situations with limited training data and multiple font types are also proposed.

Bearing Parameter Identification of Rotor-Bearing System Using Clustering-Based Evolutionary Algorithm

Relevância:

80.00% 80.00%

Publicador:

Apprenticeship Learning for Initial Value Functions in Reinforcement Learning

Relevância:

80.00% 80.00%

Publicador:

Derivation of L-system Models from Measurements Biological Branching Structures Using Genetic Algorithms

Relevância:

80.00% 80.00%

Publicador:

An Improvement to Collaborative Filtering for Recommender Systems

Relevância:

80.00% 80.00%

Publicador:

Balancing Board Machines

Relevância:

80.00% 80.00%

Publicador:

A Fair Peer Selection Algorithm for an Ecommerce-Oriented Distributed Recommender System

Relevância:

80.00% 80.00%

Publicador:

A Framework for E-commerce Oriented Recommendation Systems

Relevância:

80.00% 80.00%

Publicador:

Knowledge Discovery from Digital Text Documents

Relevância:

80.00% 80.00%

Publicador:

«
1
2
3
4
5
6
7
8
...
27
28
»