Reinforcement Learning Algorithm Engineer (Agent Reinforcement Learning Engineer)

艾氪集團·IT / E-Business

Internship / Full Time
HK work permit required
No experience limit
Bachelor
9.0 hrs/day, 5 days/wk, Office work
Salary negotiable
HK $4K-6K/Month
Job Highlights
Apply reinforcement learning in real industrial systems
Design agent-environment interaction systems
Background in reinforcement learning, agents, or decision systems
Job benefits
Gratuity
Year-end bonus
Marriage leave
Job Description

About the Role

We are looking for Agent Reinforcement Learning Engineers to join our Agent Core team.

You will help build learning-capable AI agents that:

•Interact continuously with real-world business environments

•Learn decision policies for pricing, inventory, and operations

•Perform long-horizon reasoning and planning

•Optimize behavior through feedback and preference alignment

•Improve themselves over time in production

This role focuses on applying reinforcement learning together with large language models and agent architectures in real industrial systems — not simulated toy environments.


Focus

•Design agent-environment interaction systems (observations, actions, rewards)

•Apply reinforcement learning to real scenarios such as pricing optimization, inventory allocation, and fulfillment scheduling

•Build long-horizon planning and multi-step reasoning pipelines for agents

•Implement preference learning and feedback optimization (RLHF / RLAIF / online learning)

•Construct simulation environments and offline evaluation pipelines from real business data

•Build closed learning loops: Sense → Decide → Act → Feedback → Improve

•Develop automated training, evaluation, and deployment workflows

•Improve observability and stability of large-scale RL jobs

•Refactor agent, data, and training frameworks for production readiness
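For illustration only, the closed loop named above (Sense → Decide → Act → Feedback → Improve) can be sketched with a toy pricing problem. Everything in this sketch is an assumption made for the example and not a description of the team's actual systems: the ToyPricingEnv class, its linear demand curve, the three candidate prices, and the epsilon-greedy agent are all hypothetical, and a production setup would use real business data and a far richer policy.

```python
import random
from dataclasses import dataclass, field

# Hypothetical candidate price points; each index is one discrete action.
PRICES = [8.0, 10.0, 12.0]


@dataclass
class ToyPricingEnv:
    """Hypothetical stand-in for a business environment (not a real system)."""
    rng: random.Random
    unit_cost: float = 6.0

    def observe(self) -> dict:
        # Sense: a real system would expose inventory, region, season, etc.
        return {"unit_cost": self.unit_cost}

    def step(self, action: int) -> float:
        # Act + Feedback: assumed linear demand with Gaussian noise;
        # the reward is the profit earned in this period.
        price = PRICES[action]
        demand = max(0.0, 100.0 - 7.0 * price + self.rng.gauss(0.0, 3.0))
        return (price - self.unit_cost) * demand


@dataclass
class EpsilonGreedyAgent:
    """Minimal bandit-style agent that tracks the mean reward per action."""
    rng: random.Random
    epsilon: float = 0.1
    counts: list = field(default_factory=lambda: [0] * len(PRICES))
    values: list = field(default_factory=lambda: [0.0] * len(PRICES))

    def decide(self, observation: dict) -> int:
        # Decide: explore with probability epsilon, otherwise exploit.
        # The observation is unused here because the toy problem has one state.
        if self.rng.random() < self.epsilon:
            return self.rng.randrange(len(PRICES))
        return max(range(len(PRICES)), key=lambda a: self.values[a])

    def improve(self, action: int, reward: float) -> None:
        # Improve: incremental update of the running mean for this action.
        self.counts[action] += 1
        self.values[action] += (reward - self.values[action]) / self.counts[action]


def run_loop(steps: int = 2000, seed: int = 0) -> EpsilonGreedyAgent:
    rng = random.Random(seed)
    env = ToyPricingEnv(rng)
    agent = EpsilonGreedyAgent(rng)
    for _ in range(steps):
        observation = env.observe()          # Sense
        action = agent.decide(observation)   # Decide
        reward = env.step(action)            # Act + Feedback
        agent.improve(action, reward)        # Improve
    return agent


if __name__ == "__main__":
    trained = run_loop()
    for price, value, count in zip(PRICES, trained.values, trained.counts):
        print(f"price={price:.2f} mean_profit={value:.1f} pulls={count}")
```

Running the script prints the estimated mean profit and the number of trials per price. Under the assumed demand curve the middle price has the highest expected profit, so the agent should come to favor it over time.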


Ideal Experience

•Background in reinforcement learning, agents, or decision systems

•Strong Python + PyTorch

•Ability to abstract real-world problems into states, actions, and rewards

•Systems thinking mindset

Nice to have:

•Multi-agent experience

•Operations research / game theory

•Supply chain, pricing, or resource optimization exposure

•LLM agent frameworks (LangGraph, AutoGen, CrewAI)

Typical Problems You’ll Work On

•A pricing strategy behaves differently across regions — how should the agent adapt via reinforcement learning?

•Inventory and fulfillment objectives conflict — how can agents learn trade-offs between profit, cost, and service level?

•Business data is noisy and delayed — how do we design robust reward functions?

•Enterprise preferences shift — how do we quickly realign agent behavior?

Tech Stack

Python / PyTorch

Distributed RL

Agent frameworks

TypeScript / React (internal tools)

Computer Science
Software Engineering
Cantonese
Mandarin
唐玲
iKea Group·Recruitment Manager
Company Overview
Echronos AI: A world-leading industrial-grade Agentic OS AI company

Company Profile Keywords

Industrial-grade Agentic OS, AIOS, Decentralized Multi-Agent Collaboration, Agentic Studio, Enterprise AI-native Infrastructure

Company Overview

The Echronos ecosystem is powered by a proprietary triad of core technological innovations:

•echOS - The world’s first industrial Agentic Operating System, serving as the AI-native infrastructure for decentralized multi-agent orchestration within high-stakes, industrial-scale commercial and trading ecosystems.

•JovaAI - A cross-industry Agentic Studio designed for the seamless orchestration and deployment of multi-agent collaboration at scale.

•WtreeAI - A next-generation silicon-talent marketplace, empowering enterprises to hire diverse AI employees and specialized, collaborative agent teams on demand.

Leveraging ICB, the world’s first real-time cross-industry transaction technology, Echronos AI offers a modular, "Lego-style" library of over 6,000 proprietary AI tools and skills. This empowers organizations to rapidly configure and deploy AI systems precisely tailored to their evolving business scenarios, drastically enhancing efficiency while catalyzing industrial circulation and cross-organizational synergy.

By accelerating the development of large-scale agentic ecosystems, Echronos AI is pioneering AI-native industrial clusters and setting the global standard for enterprise interconnectivity and AI infrastructure in the intelligent era.

Echronos AI Group maintains a strategic R&D network across key innovation hubs, including Hong Kong, Shenzhen, Beijing, Shanghai, and Chongqing. As a trailblazer in industrial-grade Agentic AI, the company is consistently honored with prestigious accolades such as the "Qianfeng Award" and recognized as a "Leading Enterprise in China’s AI Industry".