Reinforcement Learning Engineer - Manipulation

Humanoid
London, United Kingdom
Job Type: Permanent
Work Pattern: Full-time
Work Location: On-site
Seniority: Mid
Education: Degree
Posted: 15 Apr 2026

Benefits

  • 23 days of annual leave

  • 15 days of paid sick leave

  • Paid company holidays

  • Fully funded private healthcare

  • Equity

  • Pension scheme with 8% contribution

  • Free daily breakfast, catered lunch, and snacks in-office

Here at Humanoid, we believe in a future where robots amplify human potential. That’s why we’ve set out on a mission to build the world’s most capable, commercially scalable, and safe humanoid robots. We’re bringing that mission to life with HMND‑01 Alpha - our rapidly developed humanoid platform now running in real industrial pilots - and we’re growing the team to take it even further.

About the Role

We're hiring a Reinforcement Learning Engineer to join our Autonomy team in London. In this role, you will use reinforcement learning in both simulation and the real world to build high-performing, robust manipulation policies.

What You'll Do

  • Train language- and vision-conditioned manipulation policies via reinforcement learning (RL) in simulation and in the real world.

  • Construct challenging and diverse suites of manipulation tasks in simulation.

  • Partner with teleoperations to collect trajectories in simulation for behavior cloning.

  • Partner with testing and operations to establish real-world RL training pipelines.

  • Experiment with various ways of bringing policies trained in simulation to the real world.

What We're Looking For

  • 3+ years building deep‑learning systems (industry or research) with shipped models or published artifacts to show for it.

  • Hands‑on with at least one of: LLMs, VLMs, or image/video generative models — architecture, training, and inference.

  • Experience solving real problems using reinforcement learning with deep neural networks in any domain.

  • Strong Python + PyTorch/JAX; you can profile, debug numerics, and write maintainable research code.

  • You are self-driven and proactive, document experiments clearly, and communicate trade‑offs crisply and efficiently.

Nice to have

  • Experience with simulators for robotics (Isaac Sim, MuJoCo, etc.).

  • Experience in RL for robotics.

  • Experience building infrastructure for large-scale RL (e.g. using Ray).

  • Publications at ICLR/ICML/NeurIPS or equivalent open‑source contributions.

  • Familiarity with OpenVLA, Physical Intelligence (π) models, or similar open VLA frameworks.

What We Offer

  • Meaningful time off to rest and recharge: 23 days of annual leave (accrued), 15 days of paid sick leave, and paid company holidays.

  • Fully funded private healthcare for UK employees, with broad provider access, virtual and in‑person care, and strong mental health and serious illness support.

  • Equity included: we believe builders should share in what they build.

  • Pension scheme with a total 8% contribution (5% employee, 3% employer) on full earnings.

  • Free daily breakfast, catered lunch, and snacks in‑office.

  • Collaboration with top‑tier engineers, researchers, and product experts in AI and robotics.

  • Freedom to influence the product and own key initiatives.
