Open Research Newcastle
Browse

Motivated reinforcement learning for improved head actuation of humanoid robots

Download (327.08 kB)
conference contribution
posted on 2025-05-10, 10:02 authored by Jake Fountain, Josiah Walker, David Budden, Alexandre MendesAlexandre Mendes, Stephan ChalupStephan Chalup
The ability of an autonomous agent to self-localise within its local environment is critically dependent on its ability to make accurate observations of static, salient features. This notion has driven considerable research into the development and improvement of feature extraction and object recognition algorithms, both within RoboCup and the robotics community at large. Instead, this paper focuses on a rarely-considered issue; determining an optimal policy for actuating a robot’s head, to ensure it observes regions of the environment that will maximise the positional nformation provided. The complexity of this task is magnified by a number of common computational issues; specifically high dimensionality state spaces and noisy environmental observations. This paper details the application of motivated reinforcement learning to partially overcome these issues, leading to an 11% improvement (relative to the null case of uniformly distributed actuation policies) in self-localisation for an agent trained online for less than 1 hour. The method is demonstrated as a viable method for improving self-localisation in robotics, without the need for further optimisation of object recognition or tuning of probabilistic filters.

History

Source title

RoboCup 2013: Robot World Cup XVII (Lecture Notes in Computer Science. Volume 8371)

Name of conference

Robot World Cup 2013: 17th Annual RoboCup International Symposium

Location

Eindhoven, Netherlands

Start date

2013-06-26

End date

2013-06-30

Pagination

268-279

Publisher

Springer

Place published

Berlin, Germany

Language

  • en, English

College/Research Centre

Faculty of Engineering and Built Environment

School

School of Electrical Engineering and Computer Science

Rights statement

The final publication is available at Springer via http://dx.doi.org/10.1007/978-3-662-44468-9_24

Usage metrics

    Publications

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC