This job is no longer available.

Posted 2026-05-22

Research Internship – Reinforcement Learning for Large Foundation Models

Description

Tencent AI Lab is dedicated to advancing cutting-edge AI technologies, with a particular focus on innovative breakthroughs in large foundation models. The lab's long-term ambition is to drive the development of Artificial General Intelligence (AGI) and, ultimately, Artificial Superintelligence (ASI). We are currently seeking research interns for 2026 in the area of reinforcement learning (RL) for large foundation models, with an emphasis on developing stable and efficient RL algorithms. The goal is to empower large foundation models in complex reasoning and agent tasks and to enhance their capabilities in autonomous exploration and continuous learning. Our Seattle-area office is located in Bellevue, WA.

Every research intern will work with researchers on a project that tackles one of the core problems in the design and optimization of RL algorithms for large foundation models. Research areas include, but are not limited to, reinforcement learning algorithms, reward modeling, and world models. We will conduct large-scale experiments with RL algorithms in scenarios such as complex reasoning and autonomous agents, deliver impactful algorithms for real-world applications, and publish influential research papers.

Responsibilities
  • Conduct large-scale experiments of RL algorithms in scenarios such as complex reasoning and autonomous agents.
  • Deliver impactful algorithms for real-world applications.
  • Publish influential research papers.
Requirements
  • Ph.D. in Computer Science, Machine Learning, Artificial Intelligence, or related fields from a top university.
  • Self-motivated and excited about developing novel techniques.
  • Research experience in natural language processing or machine learning.
  • Proficient in Python programming and experienced in developing with deep learning frameworks such as PyTorch.
  • A strong publication track record and a history of creativity and intellectual flexibility.
  • Excellent communication and teamwork skills, capable of collaborating with cross-functional teams to drive project success and innovation.
Benefits
  • 3 months duration (with the possibility of extension).
  • Eligible for 1 hour of paid sick leave for every 30 hours worked.
  • Up to 13 paid holidays throughout the calendar year.
  • Eligible to enroll in the Company-sponsored medical plan.