Intention Learning with Decision Transformer

Overview

This repository builds upon Decision Transformer (NeurIPS 2021) and extends prior work on ARC tasks using object-centric decision transformers.

In this study, we extract popular states from user trajectories and introduce intention information for each action to assess its impact on learning performance.

Installation

This repository has been tested on Ubuntu 22.04 with Python 3.10. To set up the environment, follow these steps:

Download and extract the repository.
Create and activate a new Conda environment.
Install dependencies from requirements.txt.

unzip IntentionLearning-DT.zip  
cd IntentionLearning-DT  
conda create -n intention python=3.10  
conda activate intention  
pip install -r requirements.txt

Running the Code

Data Preprocessing

We suggest intention as the ideal edge of the state space graph among popular nodes. Below images represent 6 popular states of the diagonal flip task and 8 popular states of the stretch task. We add 2 states for start action and end action.

Example 1: Popular States for the Diagonal Flip Task

Example 2: Popular States for the Stretch Task

Although the dataset in this repository already contains annotated intention information, you can preprocess the data manually.

./0_preprocess.sh TASK_NAME TRAIN_OR_TEST

TASK_NAME: Currently, dflip and stretch are supported, representing a 5x5 diagonal flip task.
TRAIN_OR_TEST: Specifies the dataset directory and can be one of the following: train, test_1, test_2, test_3, test_4.

Training

To train the model:

./1_train.sh TASK_NAME MODEL_NAME GPU_ID

TASK_NAME: Currently, only dflip and stretch are supported. (default value: 'dflip')
MODEL_NAME: Specifies the model variant to use. The available options are: (default value: 'default')
- default: Standard Decision Transformer (DT) model.
- pnp: DT model augmented with object information from the previous study.
- intention: DT model augmented with intention information.
- pnp_intention: DT model augmented with both object and intention information.
GPU_ID: Specifies the GPU index to use during training. (default value: 0)

The training script runs for 400 epochs, saving a checkpoint every 20 epochs. Each time the model is saved, its performance is evaluated using 2,000 test samples from the test_1 dataset.

Evaluation

To manually evaluate a trained model:

./2_test.sh TASK_NAME MODEL_NAME TEST_DATASET_NUM GPU_ID

TASK_NAME: The name of the task (currently supports dflip and stretch). (default value: 'dflip')
MODEL_NAME: Specifies the trained model variant (default, pnp, intention, pnp_intention). (default value: 'default')
TEST_DATASET_NUM: Specifies which test dataset to use for evaluation. (default value: 0)
- If 1 to 4: Runs evaluation on test_1 to test_4 using the current training checkpoint model (test with the checkpoint model from ./model/TASK_NAME/).
- If 0: Runs evaluation on all test datasets (test_1 to test_4) using the fully trained model (test the final model from ./model/).
GPU_ID: Specifies the GPU index to use. (default value: 1)

Name		Name	Last commit message	Last commit date
Latest commit History 45 Commits
dataset		dataset
figure		figure
model		model
src		src
0_preprocess.sh		0_preprocess.sh
1_train.sh		1_train.sh
2_test.sh		2_test.sh
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt
run.sh		run.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Intention Learning with Decision Transformer

Overview

Installation

Running the Code

Data Preprocessing

Example 1: Popular States for the Diagonal Flip Task

Example 2: Popular States for the Stretch Task

Training

Evaluation

About

Uh oh!

Releases 1

Packages

Uh oh!

Languages

License

GIST-DSLab/IntentionLearning

Folders and files

Latest commit

History

Repository files navigation

Intention Learning with Decision Transformer

Overview

Installation

Running the Code

Data Preprocessing

Example 1: Popular States for the Diagonal Flip Task

Example 2: Popular States for the Stretch Task

Training

Evaluation

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Languages

Packages