FenixEdu™

Dissertação

{en_GB=Cooperation through Reinforcement Learning in a Collaborative Game} {} EVALUATED

Detalhes: {pt=Este trabalho tem como objectivo criar agentes que consigam trabalhar em conjunto com outros companheiros de equipa anteriorment desconhecidos e sem qualquer coordenação \textit{a priori} para o jogo colaborativo Geometry Friends. Começando com um agente para o personagem círculo do jogo, o trabalho passa por continuar o desenvolvimento do agente existente, que usa um processo de aprendizagem para ir melhorando o seu comportamento. Este processo passa por uma análise dos seus vários componentes, fazendo ajustes sempre que necessário e possível para melhorar o desempenho. Estes mechanismos são depois extrapolados para o personagem do jogo, o retângulo, adaptando sempre que necessário para os problemas específicos que o outro personagem encontra. Depois de os dois agentes estarem desenvolvidos, o trabalho focou-se nos problemas de coordenação entre os agentes e nas dificuldades que a implementação apresenta na extensão dos agentes para os problemas cooperativos. Durante as várias fases de desenvolvimento os agentes vão sendo testados para averiguar o impacto que as alterações têm no desempenho dos agentes. Os testes levantam algumas preocupações sobre o funcionamento interno dos agentes, que não parecem seguir o processo de aprendizagem como seria esperado. Os testes sugerem que o funcionamento interno dos agentes traga algumas incompatibilidades com o comportamento pretendido, vsito que limitam o comportamento que pode ser acrescentado aos agentes. Apesar de haver uma melhoria do desempenho do círculo, o desempenho do retângulo e em situações cooperativas fica aquém das expectativas., en=This work has the objective of creating agents that are able to work together with other previously unknown teammates and without any \textit{a priori} coordination for the collaborative game Geometry Friends. Starting with an agent for the circle character that uses a reinforcement learning approach, this work continues its development to further improve its behavior and performance. This process goes through various stages, analyzing the agent's various components and adjusting their behavior when necessary and possible to improve the agent's performance. These mechanisms are then extrapolated for the other character of the game, the square, adapting the components whenever necessary for the specific problems the square character faces. After both agents are completed, the work focuses on problems of coordination between the agents and the difficulties that the implementation brings to the extension of the agents for the cooperative problems. During the various phases of development, the agents are tested determine the impact that each change has on their performance. The tests suggest that the internal functionalities result in some incompatibilities with the intended behavior, since it limits the behaviors that can be added to the agents. While there is an improvement of the circle agent, the square and cooperative performances are below expectation.}
Keywords: {pt=Aprendizagem, Agentes, Cooperação, Geometry Friends, Equipas Ad Hoc, en=Reinforcement Learning, Agents, Cooperation, Geometry Friends, Ad Hoc Teams}

Discussão: novembro 4, 2015, 14:30