All objects are labeled with one of seven material types: ceramic, glass, wood, plastic, iron, polycarbonate, and steel. The task is formulated as a single-label classification problem: given an RGB image, an impact sound, a tactile image, or their combination, the model must predict the correct material label for the target object.
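As a minimal illustration of the single-label setup, the seven material names can be mapped to integer class indices. The names come from the text above; the specific index order here is an assumption, not necessarily the dataset's actual encoding.

```python
# Hypothetical label encoding for the seven material classes.
# The index order is an assumption for illustration only.
MATERIALS = ["ceramic", "glass", "wood", "plastic", "iron", "polycarbonate", "steel"]
MATERIAL_TO_IDX = {name: i for i, name in enumerate(MATERIALS)}

def encode_label(material: str) -> int:
    """Map a material name to its integer class index."""
    return MATERIAL_TO_IDX[material]
```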
The dataset used to train the baseline models can be downloaded from here.
Start the training process, and test the best model on the test set after training:

```bash
# Train FENet as an example
python main.py --model FENet --config_location ./configs/FENet.yml \
    --modality_list vision touch audio --batch_size 256 \
    --lr 1e-3 --weight_decay 1e-2 --exp FENet_vision_touch_audio
```

Evaluate the best model in FENet_vision_touch_audio:

```bash
# Evaluate FENet as an example
python main.py --model FENet --config_location ./configs/FENet.yml \
    --modality_list vision touch audio --batch_size 256 \
    --lr 1e-3 --weight_decay 1e-2 --exp FENet_vision_touch_audio \
    --eval
```

To train and test your new model on the ObjectFolder Material Classification Benchmark, you only need to modify several files under models. You may follow these simple steps:
- Create new model directory

  ```bash
  mkdir models/my_model
  ```

- Design new model

  ```bash
  cd models/my_model
  touch my_model.py
  ```

- Build the new model and its optimizer

  Add the following code into models/build.py:

  ```python
  elif args.model == 'my_model':
      from my_model import my_model
      model = my_model.my_model(args)
      optimizer = optim.AdamW(model.parameters(), lr=args.lr,
                              weight_decay=args.weight_decay)
  ```

- Add the new model into the pipeline

  Once the new model is built, it can be trained and evaluated similarly:

  ```bash
  python main.py --model my_model --config_location ./configs/my_model.yml \
      --modality_list vision touch audio --batch_size 256 \
      --lr 1e-3 --weight_decay 1e-2 --exp my_model_vision_touch_audio
  ```
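To make the steps above concrete, here is a hedged sketch of what models/my_model/my_model.py might look like, assuming the interface implied by the build.py snippet: a PyTorch module whose constructor takes the parsed args and whose forward fuses one feature per requested modality. The per-modality linear heads and the late-fusion-by-averaging scheme are illustrative assumptions, not the benchmark's actual baseline design.

```python
# Hypothetical sketch of models/my_model/my_model.py.
# Assumes args has a modality_list attribute, as used in the training command.
import torch
import torch.nn as nn

NUM_CLASSES = 7  # ceramic, glass, wood, plastic, iron, polycarbonate, steel

class my_model(nn.Module):
    def __init__(self, args, feat_dim=128):
        super().__init__()
        # One head per modality; real encoders (ResNet, FENet, ...) would go here.
        self.heads = nn.ModuleDict({
            m: nn.Linear(feat_dim, NUM_CLASSES) for m in args.modality_list
        })

    def forward(self, inputs):
        # inputs: dict mapping modality name -> feature tensor of shape (B, feat_dim).
        # Late fusion: average the per-modality class logits.
        logits = [self.heads[m](x) for m, x in inputs.items()]
        return torch.stack(logits, dim=0).mean(dim=0)
```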
The 1,000 objects are randomly split into train/validation/test = 800/100/100, and the model must generalize to new objects at test time. Furthermore, we also conduct a cross-object experiment on ObjectFolder Real to test the Sim2Real transfer ability of the models, in which the 100 real objects are randomly split into train/validation/test = 60/20/20.
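The cross-object split above can be sketched as follows. The seed and the use of integer object IDs are assumptions for illustration; only the 800/100/100 proportions come from the text.

```python
# Sketch of the random cross-object split (800/100/100 over 1,000 object IDs).
# The seed and ID scheme are illustrative assumptions.
import random

def split_objects(n_objects=1000, n_train=800, n_val=100, seed=0):
    ids = list(range(n_objects))
    random.Random(seed).shuffle(ids)
    return (ids[:n_train],
            ids[n_train:n_train + n_val],
            ids[n_train + n_val:])

train_ids, val_ids, test_ids = split_objects()
```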
| Method | Vision | Touch | Audio | Fusion |
| --- | --- | --- | --- | --- |
| ResNet | 91.89 | 74.36 | 94.91 | 96.28 |
| FENet | 92.25 | 75.89 | 95.80 | 96.60 |
| Method | Accuracy |
| --- | --- |
| ResNet w/o pretrain | 45.25 |
| ResNet | 51.02 |