Skip to content

A game playing agent trained using proximal policy optimization

Notifications You must be signed in to change notification settings

abdulrahim2002/SpaceShooterGamePlayingAgent

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

15 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Game Playing Agent

Performance

Trial1.mp4

t2.mp4

TODO

  • Implement soft-actor-critic (SAC).
  • Emperically optimize, neural network parameters: depth, input layer
  • Hyperparameter-tuning.

Reward Function

The reward funciton is:

timeSurvived + ( 3 * rocksDestroyed ) + ( 5 * enemyShipsDestroyed )

Rational: the agent would be incentivised to shoot down enemy bullets and rocks and miximize its survival time

Please see this script for more details

Input observations are, positions of objects currently in the scene, feeded sequentially with a label after each position to differentiate between objects

Parameters Used:

  • Input layer neurons: 30
  • Depth: 2

Shortcomings

  • Limited Computational Power

About

A game playing agent trained using proximal policy optimization

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages