NEWS! We have released a new version of the InViG dataset, containing 500K automatically generated human-robot dialogues together with a comprehensive suite of benchmark results: InViG 500K, Paper.
InViG is a dataset that takes a step towards end-to-end interactive disambiguation.
If you find this dataset useful, please cite:
```
@misc{invigdataset,
    title={InViG: Interactive Visual-Language Disambiguation with 21K Human-to-Human Dialogues},
    author={Zhang, Hanbo and Mo, Yuchen and Xu, Jie and Si, Qingyi and Kong, Tao},
    howpublished={\url{https://github.com/ZhangHanbo/invig-dataset}},
    year={2023}
}
```
Interaction based on natural language is notoriously ambiguous, which makes goal-oriented interactive tasks hard for robots to solve. Therefore, we collected 20K human-to-human disambiguation dialogues based on images filtered from OpenImages.
- The InViG dataset can be accessed from 🤗jxu124/invig:

  ```python
  import datasets

  ds = datasets.load_dataset("jxu124/invig")
  ```
- Images (from the OpenImages dataset) can be accessed from here.
- A list of image IDs (a filename list) used in the InViG dataset can be obtained with the following Python script:

  ```python
  import os

  import datasets

  ds = datasets.load_dataset("jxu124/invig")
  file_list = [
      os.path.basename(i)
      for split in ["train", "test", "validation"]
      for i in ds[split]["image_path"]
  ]
  ```
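Since the same image can appear in dialogues across splits, you may want the unique filenames rather than the raw list. A minimal sketch of that post-processing step, using hypothetical placeholder paths in place of the `ds[split]['image_path']` values returned by the script above:

```python
import os

# Placeholder paths standing in for ds[split]["image_path"] values;
# the real entries come from the loaded dataset.
image_paths = [
    "images/0a1b2c3d.jpg",
    "images/0a1b2c3d.jpg",  # the same image may occur in several dialogues
    "images/4e5f6a7b.jpg",
]

# Deduplicate via a set of basenames, then sort for a stable file list.
file_list = sorted({os.path.basename(p) for p in image_paths})
print(file_list)  # ['0a1b2c3d.jpg', '4e5f6a7b.jpg']
```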
| Oracle | Guesser | Questioner | Success Rate |
|---|---|---|---|
| XVLM-Oracle | Vilbert-Guesser | Vilbert-Questioner | 35.3% |
| XVLM-Oracle | XVLM-Guesser | XVLM-Questioner | 40.1% |
([email protected]) Guesser accuracy on ground-truth dialogues:
| Guesser Methods | Accuracy |
|---|---|
| Vilbert-Guesser | 55.1% |
| XVLM-Guesser | 59.7% |