Skip to content
@LAMDA-RL

LAMDA-RL

We are a fork of reinforcement learning researchers from LAMDA Group @ Nanjing University.

LAMDA-RL Lab

LAMDA-RL Lab is at the forefront of advancing the field of reinforcement learning and its application to creating general decision-making intelligence, by pushing the boundaries of what's possible with RL techniques.

We focus on developing novel algorithms and architectures that enable RL systems to learn and make decisions in increasingly general and adaptable ways. Some key areas we are exploring include:

  • Imitation learning;
  • Offline reinforcement learning;
  • Model-based RL and world model learning;
  • Multi-agent and collaborative RL;
  • Planning and learning with large models.

Through both fundamental and application research, our aim is to create RL-based systems that exhibit truly intelligent and general decision-making capabilities. For more information about our lab and research, please refer to our website https://lamda-rl.nju.edu.cn/.

Pinned Loading

  1. OfflineRL-Lib OfflineRL-Lib Public

    Benchmarked implementations of Offline RL Algorithms.

    Python 71 7

  2. ODIS ODIS Public

    The implementation of ICLR-2023 paper "Discovering Generalizable Multi-agent Coordination Skills from Multi-task Offline Data".

    Python 39 6

  3. PRDC PRDC Public

    Forked from kimoyami/PRDC

    Author's PyTorch implementation of ICML'23 paper "Policy Regularization with Dataset Constraint for Offline Reinforcement Learning" for D4RL gym and AntMaze tasks.

    Python 18 3

  4. ACT ACT Public

    Official code for ACT: Empowering Decision Transformer with Dynamic Programming via Advantage Conditioning (AAAI'24)

    Python 13 3

  5. Pretrained_BWArea_2.7B_30G Pretrained_BWArea_2.7B_30G Public

    Pre-trained Models of BWArea Model

    Python 9

  6. CPR CPR Public

    Forked from LyndonKong/CPR

    Python 2

Repositories

Showing 10 of 32 repositories
  • Q-Adapter Public Forked from mansicer/Q-Adapter

    Author's implementation of ICLR'25 paper "Q-Adapter: Customizing Pre-trained LLMs to New Preferences with Forgetting Mitigation"

    LAMDA-RL/Q-Adapter’s past year of commit activity
    Python 0 Apache-2.0 1 0 0 Updated Feb 28, 2025
  • OfflineRL-Lib Public

    Benchmarked implementations of Offline RL Algorithms.

    LAMDA-RL/OfflineRL-Lib’s past year of commit activity
    Python 71 MIT 7 1 2 Updated Feb 4, 2025
  • ADMPO Public Forked from HxLyn3/ADMPO

    Any-step Dynamics Model for Policy Optimization

    LAMDA-RL/ADMPO’s past year of commit activity
    Python 3 MIT 4 0 0 Updated Feb 1, 2025
  • WiseRL Public Forked from typoverflow/WiseRL

    PyTorch implementations for Offline Preference-Based RL (PbRL) algorithms

    LAMDA-RL/WiseRL’s past year of commit activity
    Python 1 MIT 2 0 0 Updated Dec 6, 2024
  • PRDC Public Forked from kimoyami/PRDC

    Author's PyTorch implementation of ICML'23 paper "Policy Regularization with Dataset Constraint for Offline Reinforcement Learning" for D4RL gym and AntMaze tasks.

    LAMDA-RL/PRDC’s past year of commit activity
    Python 18 6 0 0 Updated Nov 8, 2024
  • ODIS Public

    The implementation of ICLR-2023 paper "Discovering Generalizable Multi-agent Coordination Skills from Multi-task Offline Data".

    LAMDA-RL/ODIS’s past year of commit activity
    Python 39 Apache-2.0 6 2 0 Updated Oct 31, 2024
  • OPT-AIL Public
    LAMDA-RL/OPT-AIL’s past year of commit activity
    Python 0 0 0 0 Updated Oct 19, 2024
  • Madoc Public Forked from qs1bb/Madoc
    LAMDA-RL/Madoc’s past year of commit activity
    Python 0 1 0 0 Updated Oct 9, 2024
  • Pretrained_BWArea_2.7B_30G Public

    Pre-trained Models of BWArea Model

    LAMDA-RL/Pretrained_BWArea_2.7B_30G’s past year of commit activity
    Python 9 0 0 0 Updated Sep 10, 2024
  • .github Public
    LAMDA-RL/.github’s past year of commit activity
    0 0 0 0 Updated Sep 4, 2024

Top languages

Loading…

Most used topics

Loading…