
Commit 80f8efe

release the training and evaluation codes of Seer-Large, which achieves Avg.Len. of 4.3 on CALVIN ABC-D

1 parent 4828149 · commit 80f8efe

27 files changed: +377 -114 lines changed

.gitignore

Lines changed: 6 additions & 0 deletions

@@ -1,3 +1,9 @@
+# workspace
+calvin
+checkpoints
+eval_logs
+evaluate
+
 # Byte-compiled / optimized / DLL files
 __pycache__/
 *.py[cod]

README.md

Lines changed: 2 additions & 3 deletions

@@ -5,7 +5,6 @@
 
 <h3 align="center">
 <a href="https://arxiv.org/pdf/2412.15109">Arxiv</a> |
-<a>Video</a> |
 <a href="https://nimolty.github.io/Seer/">Webpage</a>
 </h3>
 
@@ -59,13 +58,13 @@ This section details the pre-training process of Seer in real-world experiments,
 Relevant checkpoints are available on the [website](https://drive.google.com/drive/folders/1F3IE95z2THAQ_lt3DKUFdRGc86Thsnc7?usp=sharing).
 |Model|Checkpoint|
 |:------:|:------:|
-|CALVIN ABC-D|[Seer](https://drive.google.com/drive/folders/17Gv9snGCkViuhHmzN3eTWlI0tMfGSGT3?usp=sharing) / [Seer Large](https://drive.google.com/drive/folders/1AFabqfDEi69oMo0FTGhEiH2QSRLYBR9r?usp=drive_link)|
+|CALVIN ABC-D|[Seer](https://drive.google.com/drive/folders/17Gv9snGCkViuhHmzN3eTWlI0tMfGSGT3?usp=sharing) (Avg. Len.: 3.98) / [Seer Large](https://drive.google.com/drive/folders/1AFabqfDEi69oMo0FTGhEiH2QSRLYBR9r?usp=drive_link) (Avg. Len.: 4.30)|
 |Real-World|[Seer (Droid Pre-trained)](https://drive.google.com/drive/folders/1rT8JKLhJGIo97jfYUm2JiFUrogOq-dgJ?usp=drive_link)|
 
 ## 📆 TODO <a name="todos"></a>
 - [x] Release real-world experiment code.
 - [x] Release CALVIN ABC-D experiment code (Seer).
-- [ ] Release CALVIN ABC-D experiment code (Seer-Large).
+- [x] Release CALVIN ABC-D experiment code (Seer-Large).
 - [ ] Release LIBERO-LONG experiment code.
 
 ## License <a name="license"></a>

docs/CALVIN_ABC-D_INSTALL.md

Lines changed: 7 additions & 1 deletion

@@ -1,6 +1,6 @@
 # Installation
 
-**(1) Env**
+**(1) Conda Env**
 ```python
 conda create -n seer python=3.10
 conda activate seer
@@ -28,3 +28,9 @@ cd ${YOUR_PATH_TO_SEER}
 pip install -r requirements.txt
 pip install torch==2.2.0 torchvision==0.17.0 torchaudio==2.2.0 --index-url https://download.pytorch.org/whl/cu121
 ```
+
+**(5) Create a soft link to CALVIN**
+```python
+cd ${YOUR_PATH_TO_SEER}
+ln -s $CALVIN_ROOT calvin
+```
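A quick sanity check after step (5), assuming `$CALVIN_ROOT` points at a CALVIN checkout that contains `dataset/task_ABC_D` (the path the run scripts in this commit expect):

```bash
# verify the soft link resolves and the ABC-D split is reachable through it
ls -l calvin                                     # expect: calvin -> $CALVIN_ROOT
test -d calvin/dataset/task_ABC_D && echo "ABC-D dataset found"
```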

docs/CALVIN_ABC-D_RUN.md

Lines changed: 22 additions & 6 deletions

@@ -5,28 +5,44 @@ For convenience, some checkpoints, such as the MAE-pretrained ViT-B model, are provided
 * :exclamation: **pretrain.sh, finetune.sh, scratch.sh, eval.sh:**
 Please update the following:
 * **calvin_dataset_path** to the directory where you have stored the CALVIN ABC-D data.
-* **checkpoint_path** to the parent directory where your experiment checkpoints are saved.
+* **save_checkpoint_path** to the parent directory where your experiment checkpoints are saved. We recommend creating a ```checkpoints``` folder in the project root directory.
 * **finetune_from_pretrained_ckpt** to the location of your pre-trained checkpoint.
 * **resume_from_checkpoint** to the location of your fine-tuned checkpoint.
-* **vit_ckpt_path** to the location of your ViT checkpoint (downloaded from the [website](https://drive.google.com/file/d/1bSsvRI4mDM3Gg51C6xO0l9CbojYw3OEt/view?usp=sharing)).
+* **vit_checkpoint_path** to the location of your ViT checkpoint (downloaded from the [website](https://drive.google.com/file/d/1bSsvRI4mDM3Gg51C6xO0l9CbojYw3OEt/view?usp=sharing)). We recommend storing it at ```checkpoints/vit_mae/mae_pretrain_vit_base.pth```.
 
 * :exclamation: **networkx:**
 Due to compatibility issues between the networkx library in CALVIN and Python 3.10, we provide a compatible version of networkx.zip on the [website](https://drive.google.com/file/d/1z-d1SaI0rXfBtBicw1zPSsP-wE-26oLq/view?usp=sharing). Download and unzip it, then replace the existing networkx library in the following path:
 
 ## Seer
 ### Pre-train
 ```bash
+# Pre-train Seer on Calvin ABC-D dataset
 bash scripts/CALVIN_ABC_D/Seer/pretrain.sh
+# Pre-train Seer-Large on Calvin ABC-D dataset
+bash scripts/CALVIN_ABC_D/Seer-Large/pretrain.sh
 ```
+
 ### Fine-tune
 ```bash
+# Fine-tune Seer on Calvin ABC-D dataset
 bash scripts/CALVIN_ABC_D/Seer/finetune.sh
+# Fine-tune Seer-Large on Calvin ABC-D dataset
+bash scripts/CALVIN_ABC_D/Seer-Large/finetune.sh
 ```
-### Eval
+
+### Train from Scratch
 ```bash
-bash scripts/CALVIN_ABC_D/Seer/eval.sh
+# Train Seer on Calvin ABC-D dataset from scratch
+bash scripts/CALVIN_ABC_D/Seer/scratch.sh
+# Train Seer-Large on Calvin ABC-D dataset from scratch
+bash scripts/CALVIN_ABC_D/Seer-Large/scratch.sh
 ```
-### Scratch
+
+### Eval
 ```bash
-bash scripts/CALVIN_ABC_D/Seer/scratch.sh
+# Evaluate Seer on Calvin ABC-D benchmark
+bash scripts/CALVIN_ABC_D/Seer/eval.sh
+# Evaluate Seer-Large on Calvin ABC-D benchmark
+bash scripts/CALVIN_ABC_D/Seer-Large/eval.sh
 ```
+
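The networkx note above leaves the destination path implicit. A minimal sketch of the swap, assuming the networkx installed in the active conda env is the one to replace and that networkx.zip unpacks to a top-level `networkx/` directory (both assumptions; adjust to your setup):

```bash
# locate the env's site-packages via the installed package, then swap in the patched copy
SITE_PACKAGES="$(python -c 'import networkx, os; print(os.path.dirname(os.path.dirname(networkx.__file__)))')"
unzip networkx.zip -d /tmp/networkx_patch
rm -rf "${SITE_PACKAGES}/networkx"
cp -r /tmp/networkx_patch/networkx "${SITE_PACKAGES}/networkx"
```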

docs/REAL-WORLD_PRETRAIN.md

Lines changed: 1 addition & 1 deletion

@@ -1,6 +1,6 @@
 # Pre-train
 ## Notice
-We provide code for pre-training on both the DROID and OXE datasets. Users should update the checkpoint_path to the directory where you want to save the training checkpoints, and modify the root_dir to the location where the preprocessed real data is stored. Additionally, users should configure the SLURM information in the provided scripts.
+We provide code for pre-training on both the DROID and OXE datasets. Users should update the save_checkpoint_path to the directory where you want to save the training checkpoints, and modify the root_dir to the location where the preprocessed real data is stored. Additionally, users should configure the SLURM information in the provided scripts.
 
 Preparation
 ```python
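The "SLURM information" mentioned above usually amounts to a few header directives; a hedged sketch with placeholder values (partition, node counts, and resource lines here are hypothetical, not taken from the repo's scripts):

```bash
#!/bin/bash
#SBATCH --job-name=seer_pretrain     # placeholder values; fill in per cluster
#SBATCH --partition=your_partition
#SBATCH --nodes=8
#SBATCH --ntasks-per-node=1
#SBATCH --gres=gpu:8
#SBATCH --cpus-per-task=64
```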

eval_calvin.py

Lines changed: 2 additions & 1 deletion

@@ -35,7 +35,7 @@ def main():
     model = SeerAgent(
         finetune_type=args.finetune_type,
         clip_device=device_id,
-        checkpoint_path=args.vit_ckpt_path,
+        vit_checkpoint_path=args.vit_checkpoint_path,
         sequence_length=args.sequence_length,
         num_resampler_query=args.num_resampler_query,
         num_obs_token_per_image=args.num_obs_token_per_image,
@@ -44,6 +44,7 @@ def main():
         action_pred_steps=args.action_pred_steps,
         obs_pred=args.obs_pred,
         atten_only_obs=args.atten_only_obs,
+        attn_robot_proprio_state=args.attn_robot_proprio_state,
         atten_goal=args.atten_goal,
         atten_goal_state=args.atten_goal_state,
         mask_l_obs_ratio=args.mask_l_obs_ratio,
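If your own launch code constructs SeerAgent, the rename flows from CLI flag to constructor keyword as sketched below; the argparse wiring is an assumption for illustration, and only the flag and keyword names come from this commit:

```python
import argparse

parser = argparse.ArgumentParser()
# renamed in this commit: the old --vit_ckpt_path flag and checkpoint_path= keyword are gone
parser.add_argument("--vit_checkpoint_path", type=str,
                    default="checkpoints/vit_mae/mae_pretrain_vit_base.pth")
args = parser.parse_args([])

# model = SeerAgent(..., vit_checkpoint_path=args.vit_checkpoint_path, ...)
print(args.vit_checkpoint_path)
```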

models/seer_model.py

Lines changed: 26 additions & 12 deletions

@@ -1,3 +1,5 @@
+
+import os
 import random
 from functools import partial
 from copy import deepcopy
@@ -18,6 +20,7 @@
 
 def generate_attention_mask(K, num_A, num_B, atten_goal, atten_goal_state,
                             atten_only_obs,
+                            attn_robot_proprio_state,
                             mask_l_obs_ratio,
                             num_obs_token, action_pred_steps):
     # num_A: 1+1+self.NUM_RESAMPLER_QUERY*2+1*2
@@ -43,6 +46,8 @@ def generate_attention_mask(K, num_A, num_B, atten_goal, atten_goal_state,
         attention_mask[start_index+num_A+num_obs_token:start_index+num_A+num_obs_token+action_pred_steps] = -float('inf')
         attention_mask[start_index+num_A+num_obs_token:start_index+num_A+num_obs_token+action_pred_steps, start_index+2:start_index+num_A] = 0.0
         attention_mask[start_index+num_A+num_obs_token:start_index+num_A+num_obs_token+action_pred_steps, start_index+num_A:start_index+num_A+num_obs_token] = 0.0
+        if attn_robot_proprio_state:
+            attention_mask[start_index+num_A+num_obs_token:start_index+num_A+num_obs_token+action_pred_steps, start_index+1:start_index+2] = 0.0
         if mask_l_obs_ratio > 0:
             count = int(mask_l_obs_ratio * (num_obs_token))
             selected_numbers = np.random.choice(range(num_obs_token), size=count, replace=False)
@@ -112,12 +117,13 @@
         self,
         finetune_type,
         clip_device,
-        checkpoint_path,
+        vit_checkpoint_path,
         sequence_length=10,
         num_resampler_query=9,
         num_obs_token_per_image=10,
         obs_pred=False,
         atten_only_obs=False,
+        attn_robot_proprio_state=False,
         atten_goal=False,
         atten_goal_state=False,
         mask_l_obs_ratio=0.0,
@@ -142,19 +148,20 @@ def __init__(
         self.atten_goal = atten_goal
         self.atten_goal_state = atten_goal_state
         self.atten_only_obs = atten_only_obs
+        self.attn_robot_proprio_state = attn_robot_proprio_state
         self.mask_l_obs_ratio = mask_l_obs_ratio
         self.hidden_dim = hidden_dim
         self.phase = phase
         assert self.phase in ["pretrain", "finetune", "evaluate"]
         self.gripper_width = gripper_width
-        self.checkpoint_path = checkpoint_path
+        self.vit_checkpoint_path = vit_checkpoint_path
 
         # text projector
         self.text_projector = nn.Linear(512, self.hidden_dim)
 
         # state encoder
-        ARM_STATE_FEATURE_DIM = 384
-        GRIPPER_STATE_FEATURE_DIM = 384
+        ARM_STATE_FEATURE_DIM = self.hidden_dim
+        GRIPPER_STATE_FEATURE_DIM = self.hidden_dim
         self.arm_state_encoder = nn.Linear(6, ARM_STATE_FEATURE_DIM)
         self.gripper_state_encoder = nn.Linear(2, GRIPPER_STATE_FEATURE_DIM)
         self.state_projector = nn.Linear(ARM_STATE_FEATURE_DIM+GRIPPER_STATE_FEATURE_DIM, self.hidden_dim)
@@ -204,6 +211,7 @@ def __init__(
             atten_goal=self.atten_goal,
             atten_goal_state=self.atten_goal_state,
             atten_only_obs=self.atten_only_obs,
+            attn_robot_proprio_state=self.attn_robot_proprio_state,
             mask_l_obs_ratio=self.mask_l_obs_ratio,
             num_obs_token=this_num_obs_token,
             action_pred_steps=self.action_pred_steps),
@@ -218,21 +226,22 @@ def __init__(
         self.transformer_backbone = GPT2Model(config)
 
         # action decoder
+        MLP_hidden_dim = self.hidden_dim // 2
         self.action_decoder = nn.Sequential(
-            nn.Linear(self.hidden_dim, 192),
+            nn.Linear(self.hidden_dim, MLP_hidden_dim),
             nn.ReLU(),
-            nn.Linear(192, 192),
+            nn.Linear(MLP_hidden_dim, MLP_hidden_dim),
             nn.ReLU(),
         )
         self.arm_action_decoder = nn.Sequential(
-            nn.Linear(192, 6),
+            nn.Linear(MLP_hidden_dim, 6),
             torch.nn.Tanh(),
         )
         self.gripper_action_decoder = nn.Sequential(
-            nn.Linear(192, 1),
+            nn.Linear(MLP_hidden_dim, 1),
             torch.nn.Sigmoid(),
         )
-        self.IMAGE_DECODER_hidden_dim = 384
+        self.IMAGE_DECODER_hidden_dim = self.hidden_dim
         self.NUM_MASK_TOKEN = int(calvin_input_image_size**2/patch_size/patch_size)  # i.e. num_patch
         self.PATCH_SIZE = patch_size
         self.mask_token = nn.Parameter(torch.zeros(1, 1, self.IMAGE_DECODER_hidden_dim))
@@ -249,11 +258,15 @@ def __init__(
         self.initialize_weights()
 
         # freeze vision encoder
-        checkpoint = torch.load(checkpoint_path, map_location='cpu')
-        msg = self.vision_encoder.load_state_dict(checkpoint['model'], strict=False)
+        print(self.vit_checkpoint_path)
+        vit_checkpoint = torch.load(self.vit_checkpoint_path, map_location='cpu')
+        self.vision_encoder.load_state_dict(vit_checkpoint['model'], strict=False)
 
         # # freeze text encoder
-        self.clip_model, self.image_processor = clip.load("ViT-B/32", device=clip_device)
+        if os.path.exists("checkpoints/clip/ViT-B-32.pt"):
+            self.clip_model, self.image_processor = clip.load("checkpoints/clip/ViT-B-32.pt", device=clip_device)
+        else:
+            self.clip_model, self.image_processor = clip.load("ViT-B/32", device=clip_device)
 
     def initialize_weights(self):
         # initialization
@@ -298,6 +311,7 @@ def forward(self, image_primary, image_wrist, state, text_token, action=None):
             atten_goal=self.atten_goal,
             atten_goal_state=self.atten_goal_state,
             atten_only_obs=self.atten_only_obs,
+            attn_robot_proprio_state=self.attn_robot_proprio_state,
             mask_l_obs_ratio=self.mask_l_obs_ratio,
             num_obs_token=this_num_obs_token,
             action_pred_steps=self.action_pred_steps).to(self.device),
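The attn_robot_proprio_state flag threaded through the diff above opens one extra column of the additive attention mask, letting the action-prediction tokens read the proprioceptive state token. A minimal, self-contained sketch of that masking convention, where 0.0 means "may attend" and -inf means "blocked" (the token layout and indices here are illustrative assumptions, not the model's actual layout):

```python
import torch

T = 6                                    # toy sequence: [lang, state, q1, q2, obs, act]
mask = torch.full((T, T), float('-inf'))
mask.fill_diagonal_(0.0)                 # every token sees itself
mask[5, 2:5] = 0.0                       # action token attends to queries + obs token

attn_robot_proprio_state = True
if attn_robot_proprio_state:             # the new flag additionally opens the state slot
    mask[5, 1] = 0.0                     # action token may now read the proprio state

scores = torch.randn(T, T)               # stand-in attention logits
weights = torch.softmax(scores + mask, dim=-1)
print(weights[5])                        # zero weight wherever the mask is -inf
```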

real_controller/controller.py

Lines changed: 1 addition & 1 deletion

@@ -92,7 +92,7 @@ def setup_model(self):
         self.model = SeerAgent(
             finetune_type=self.args.finetune_type,
             clip_device=self.device_id,
-            checkpoint_path=self.args.vit_ckpt_path,
+            save_checkpoint_path=self.args.vit_checkpoint_path,
             sequence_length=self.args.sequence_length,
             num_resampler_query=self.args.num_resampler_query,
             num_obs_token_per_image=self.args.num_obs_token_per_image,
scripts/CALVIN_ABC_D/Seer-Large/eval.sh

Lines changed: 47 additions & 0 deletions

@@ -0,0 +1,47 @@
+#!/bin/bash
+export GIT_PYTHON_REFRESH=quiet
+calvin_dataset_path="calvin/dataset/task_ABC_D"
+calvin_conf_path="calvin/calvin_models/conf"
+vit_checkpoint_path="checkpoints/vit_mae/mae_pretrain_vit_base.pth" # downloaded from https://drive.google.com/file/d/1bSsvRI4mDM3Gg51C6xO0l9CbojYw3OEt/view?usp=sharing
+### NEED TO CHANGE the checkpoint path ###
+resume_from_checkpoint="checkpoints/CALVIN_ABC_D/Seer_Large/12.pth" # checkpoint path to be evaluated
+IFS='/' read -ra path_parts <<< "$resume_from_checkpoint"
+run_name="${path_parts[-2]}"
+log_name="${path_parts[-1]}"
+log_folder="eval_logs/$run_name"
+mkdir -p "$log_folder"
+log_file="eval_logs/$run_name/evaluate_$log_name.log"
+node=1
+node_num=8
+
+torchrun --nnodes=${node} --nproc_per_node=${node_num} --master_port=10211 eval_calvin.py \
+    --traj_cons \
+    --rgb_pad 10 \
+    --gripper_pad 4 \
+    --gradient_accumulation_steps 1 \
+    --bf16_module "vision_encoder" \
+    --vit_checkpoint_path ${vit_checkpoint_path} \
+    --calvin_dataset ${calvin_dataset_path} \
+    --calvin_conf_path ${calvin_conf_path} \
+    --workers 16 \
+    --lr_scheduler cosine \
+    --save_every_iter 50000 \
+    --num_epochs 20 \
+    --seed 42 \
+    --batch_size 64 \
+    --precision fp32 \
+    --weight_decay 1e-4 \
+    --num_resampler_query 16 \
+    --num_obs_token_per_image 16 \
+    --run_name ${run_name} \
+    --transformer_layers 24 \
+    --hidden_dim 1024 \
+    --transformer_heads 16 \
+    --phase "evaluate" \
+    --finetune_type "calvin" \
+    --action_pred_steps 3 \
+    --sequence_length 10 \
+    --future_steps 3 \
+    --window_size 13 \
+    --obs_pred \
+    --resume_from_checkpoint ${resume_from_checkpoint} | tee ${log_file} \
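The run/log names in this script come from splitting the checkpoint path on `/`. As a standalone sketch of that idiom (negative array subscripts such as `${path_parts[-2]}` require bash 4.3 or newer):

```bash
#!/bin/bash
resume_from_checkpoint="checkpoints/CALVIN_ABC_D/Seer_Large/12.pth"
IFS='/' read -ra path_parts <<< "$resume_from_checkpoint"   # split on '/'
echo "${path_parts[-2]}"   # Seer_Large -> run_name
echo "${path_parts[-1]}"   # 12.pth     -> log_name
```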
scripts/CALVIN_ABC_D/Seer-Large/finetune.sh

Lines changed: 49 additions & 0 deletions

@@ -0,0 +1,49 @@
+#!/bin/bash
+### need to change to your path ###
+calvin_dataset_path="calvin/dataset/task_ABC_D"
+save_checkpoint_path="checkpoints/"
+finetune_from_pretrained_ckpt="checkpoints/pretrain_Seer_ptbs512_24layers_16heads_hd1024-Large_calvin_abc_d/9.pth"
+vit_checkpoint_path="checkpoints/vit_mae/mae_pretrain_vit_base.pth" # downloaded from https://drive.google.com/file/d/1bSsvRI4mDM3Gg51C6xO0l9CbojYw3OEt/view?usp=sharing
+node=8
+node_num=8
+torchrun --nnodes=${node} --nproc_per_node=${node_num} --master_port=10211 train.py \
+    --traj_cons \
+    --rgb_pad 10 \
+    --gripper_pad 4 \
+    --gradient_accumulation_steps 1 \
+    --bf16_module "vision_encoder" \
+    --vit_checkpoint_path ${vit_checkpoint_path} \
+    --calvin_dataset ${calvin_dataset_path} \
+    --workers 8 \
+    --lr_scheduler cosine \
+    --save_every_iter 100000 \
+    --num_epochs 20 \
+    --seed 42 \
+    --batch_size 8 \
+    --precision fp32 \
+    --learning_rate 1e-3 \
+    --warmup_epochs 3 \
+    --finetune_type "calvin" \
+    --wandb_project seer \
+    --weight_decay 1e-4 \
+    --num_resampler_query 16 \
+    --num_obs_token_per_image 16 \
+    --run_name finetune_Seer-Large_calvin_abc_d \
+    --save_checkpoint_path ${save_checkpoint_path} \
+    --transformer_layers 24 \
+    --hidden_dim 1024 \
+    --transformer_heads 16 \
+    --phase "finetune" \
+    --action_pred_steps 3 \
+    --sequence_length 10 \
+    --future_steps 3 \
+    --window_size 13 \
+    --obs_pred \
+    --loss_image \
+    --loss_action \
+    --save_checkpoint \
+    --report_to_wandb \
+    --offline \
+    --finetune_from_pretrained_ckpt ${finetune_from_pretrained_ckpt} \
+
+
49+
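As a scale check, assuming --batch_size is the per-GPU batch (the usual torchrun convention), this fine-tuning launch processes 8 nodes × 8 GPUs × 8 samples = 512 samples per optimizer step, since --gradient_accumulation_steps is 1; read this way it is consistent with the ptbs512 tag in the pre-trained checkpoint's directory name, if that tag denotes a total batch size of 512.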
