Reinforcement Learning Algorithm Engineer (Agent Reinforcement Learning Engineer)

艾氪集團有限公司

Internship
HK work permit required
No experience limit
Bachelor
More than 8 hours/day, 5 days per week, Fixed
Job Highlights
Apply reinforcement learning in real industrial systems
Design agent-environment interaction systems
Background in reinforcement learning, agents, or decision systems
Job benefits
Year-end bonus
Marriage leave
Job Description

About the Role

We are looking for Agent Reinforcement Learning Engineers to join our Agent Core team.

You will help build learning-capable AI agents that:

•Interact continuously with real-world business environments

•Learn decision policies for pricing, inventory, and operations

•Perform long-horizon reasoning and planning

•Optimize behavior through feedback and preference alignment

•Improve themselves over time in production

This role focuses on applying reinforcement learning together with large language models and agent architectures in real industrial systems — not simulated toy environments.


Focus

•Design agent-environment interaction systems (observations, actions, rewards)

•Apply reinforcement learning to real scenarios such as pricing optimization, inventory allocation, and fulfillment scheduling

•Build long-horizon planning and multi-step reasoning pipelines for agents

•Implement preference learning and feedback optimization (RLHF / RLAIF / online learning)

•Construct simulation environments and offline evaluation pipelines from real business data

•Build closed learning loops: Sense → Decide → Act → Feedback → Improve

•Develop automated training, evaluation, and deployment workflows

•Improve observability and stability of large-scale RL jobs

•Refactor agent, data, and training frameworks for production readiness


Ideal Experience

•Background in reinforcement learning, agents, or decision systems

•Strong Python + PyTorch

•Ability to abstract real-world problems into states, actions, and rewards

•Systems thinking mindset

Nice to have:

•Multi-agent experience

•Operations research / game theory

•Supply chain, pricing, or resource optimization exposure

•LLM agent frameworks (LangGraph, AutoGen, CrewAI)

Typical Problems You’ll Work On

•A pricing strategy behaves differently across regions — how should the agent adapt via reinforcement learning?

•Inventory and fulfillment objectives conflict — how can agents learn trade-offs between profit, cost, and service level?

•Business data is noisy and delayed — how do we design robust reward functions?

•Enterprise preferences shift — how do we quickly realign agent behavior?

Tech Stack

Python / PyTorch

Distributed RL

Agent frameworks

TypeScript / React (internal tools)

View more
Computer Science
Software Engineering
Cantonese
Mandarin
董小姐
Iclick Interactive Asia Group Ltd·HR
Similar jobs
漢陽科技
Design and develop software modules for robots
Familiar with ROS technical system
Proficient in Linux program design, C++ or Python
$25K-50K/M
艾氪集團
Apply reinforcement learning in real industrial systems
Design agent-environment interaction systems
Background in reinforcement learning, agents, or decision systems
$4K-6K/M
HongKong Cloudsway Limited
参与AI搜索引擎算法工作
大模型应用与数据挖掘经验
加分项:搜索/推荐/广告经验
ICHEERZI (HONG KONG) LIMITED
1+ years professional experience in Data Science and Machine Learning
Strong experience with cloud platforms for AI and ML workloads
Experience with vector databases or knowledge graphs for RAG is a plus
Be careful
Don’t provide your bank or credit card details when applying for jobs.
Save