Add reinforcement learning support by Papaercold · Pull Request #137 · LightwheelAI/leisaac

Papaercold · 2026-03-10T20:32:52Z

Summary

Added a new Reinforcement Learning (RL) module.
Detailed documentation and usage instructions are available in
docs/feature/rl_traning.md.

The overall structure and formatting of the RL module follow the same style as the existing state machine data generation module for consistency.

Since end-to-end RL training is inherently challenging and requires careful reward design and tuning, the current implementation focuses on the lift_cube task as a baseline example.

Additional Changes

Refactored the labels in action_process.py to unify the formatting across:
- RL module
- State machine module
- Teleoperation module

This improves structural consistency and overall code readability.

Auto data generation

LiftCubeStateMachine was deleted; remove stale import and TASK_REGISTRY entry from generate.py, and remove from __all__ in state_machine/__init__.py. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

Papaercold and others added 30 commits January 27, 2026 11:30

Add data auto-generate module.

f870f10

Add data auto-generate module.

e52df2e

Add data auto-generate module.

ae15ddc

Merge branch 'LightwheelAI:main' into auto-data-generation

4fea004

Add data auto-generate module.

a245712

Add data auto-generate module.

38df5fc

Add auto_terminate.

83b9e0b

Add auto_terminate.

3b78d34

Add auto_terminate.

47d8504

Add Description.

2e448cf

State Machinecode refactoring.

0e5f008

State Machinecode refactoring.

9750b3d

State Machinecode refactoring.

34be002

State Machinecode refactoring.

d590d27

State Machinecode refactoring.

b8c97a9

State Machinecode refactoring.

4567af7

Add State Machine code.

6ad6606

Apply pre-commit fixes (black/isort/pyupgrade) for several files

e694585

Apply pre-commit fixes (black/isort/pyupgrade) for pick_orange.py

5074699

Merge branch 'LightwheelAI:main' into auto-data-generation

18da4fa

Change structure..

9e73433

Create StateMacchine Class.

310aee6

Refactor code.

fb8304d

Fix bugs.

9328529

Delete redundant files

a5338d6

Delete redundant files.

e1c8729

Change PickOrangeStateMachine

c4f8111

Change PickOrangeStateMachine

e42fc85

Change PickOrangeStateMachine

002133f

Change PickOrangeStateMachine

2460e14

Papaercold and others added 29 commits February 21, 2026 21:46

Change bi_arm_cfg

1342c18

Add RL module - 1st version.

c7d8cca

Change documents.

5dc5b8d

Change bash.

8d1c810

Delete RL part.

f34cb6e

Refactor

db99a6c

Change format

f26debf

Refactor

f4d40b7

Change Isaaclab version==2.3.2

88d5c41

Change Isaaclab version==2.3.0

87ced23

Change documents.

65484c5

Fix bugs.

5155f6a

Change format.

1d3a912

Change format.

4faee0c

Merge pull request #1 from Papaercold/auto-data-generation

360484d

Auto data generation

Add basic RL module.

fc2f686

Add basic RL module.

7791864

Add RL module.

b8f0b40

Delete old module.

eefee70

Refactor.

7bc6508

fix bugs

4762aad

Adjusting the RL reward

d5d0286

Adjusting the RL reward

0680f07

Change document

b168188

Remove lift_cube state machine references from datagen scripts

102afa8

LiftCubeStateMachine was deleted; remove stale import and TASK_REGISTRY entry from generate.py, and remove from __all__ in state_machine/__init__.py. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

Update hyperparameters and clean up environment dependencies

3b08b71

Fix bugs

c7dfcb2

Remove redundant files

ea41361

Remove redundant files

7ad3126

Papaercold closed this Mar 10, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add reinforcement learning support#137

Add reinforcement learning support#137
Papaercold wants to merge 71 commits into
LightwheelAI:mainfrom
Papaercold:add-rl-module

Papaercold commented Mar 10, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

Papaercold commented Mar 10, 2026

Summary

Additional Changes

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant