Reinforcement Learning Engineer - Locomanipulation

Humanoid

London, United Kingdom

Job Type: Permanent
Work Pattern: Full-time
Work Location: On-site
Seniority: Senior
Education: Masters
Posted: 17 Apr 2026 (4 months ago)

Benefits

23 days of annual leave
15 days of paid sick leave
Paid company holidays
Fully funded private healthcare
Equity
Pension scheme with 8% contribution
Free daily breakfast, catered lunch, and snacks in-office

Here at Humanoid, we believe in a future where robots amplify human potential. That’s why we’ve set out on a mission to build the world’s most capable, commercially scalable, and safe humanoid robots. We’re bringing that mission to life with HMND‑01 Alpha, our rapidly developed humanoid platform now running in real industrial pilots, and we’re growing the team to take it even further.

About The Role

We are looking for a Senior or Staff Reinforcement Learning Engineer to develop learning-based control policies for humanoid robots.

You will design and train reinforcement learning policies that enable dynamic locomotion and loco-manipulation behaviors on real robots. Your work will focus on building scalable training pipelines, designing reward functions and environments, and improving sim-to-real transfer for reliable deployment on hardware.

You will work closely with controls and robotics engineers to integrate learned policies into the robot control stack, ensuring stable and robust behavior in real-world conditions.

Development will involve continuous iteration between large-scale simulation and hardware experiments.

The problems you will work on include dynamic locomotion, balance recovery, contact-rich manipulation, and multi-behavior policy learning.

What You’ll Do

Design and train reinforcement learning policies for humanoid robot control.
Build scalable simulation and training pipelines (e.g., Isaac Lab, MuJoCo).
Design reward functions, observation spaces, and curricula for complex behaviors.
Improve robustness and sim-to-real transfer of learned policies.
Deploy and evaluate policies on real robotic systems.
Integrate policies into the control stack.

What We're Looking For

MS or PhD in Robotics, Machine Learning, Computer Science, or a related field.
Strong experience with reinforcement learning (e.g., PPO, SAC, offline RL).
Experience applying RL to robotics or physical systems.
Experience deploying learned policies on real robotic systems.
Experience with physics-based simulation environments (e.g., Isaac Lab, MuJoCo).
Strong programming skills in Python and/or C++.

Nice to have:

Experience with RL for locomotion or legged robots.
Experience with sim-to-real transfer.
Familiarity with robot dynamics, control, or whole-body control.

What We Offer

Meaningful time off to rest and recharge: 23 days of annual leave (accrued), 15 days of paid sick leave, and paid company holidays.
Fully funded private healthcare for UK employees, with broad provider access, virtual and in‑person care, and strong mental health and serious illness support.
Equity included: we believe builders should share in what they build.
Pension scheme with a total 8% contribution (5% employee, 3% employer) on full earnings.
Free daily breakfast, catered lunch, and snacks in‑office.
Collaboration with top‑tier engineers, researchers, and product experts in AI and robotics.
Freedom to influence the product and own key initiatives.
