Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Discrepancy in Image ID Alignment Between M3IT and VideoChat2IT #199

Open
patrick-tssn opened this issue Jun 26, 2024 · 4 comments
Open

Comments

@patrick-tssn
Copy link

Could you please provide a script or JSON file of the ID map from M3IT to VideoChat2IT? Matching different files can be quite challenging. For example, coco llava minigpt4 paragraph_captioning textcaps (VideoChat2IT/caption) v.s. coco coco-cn flickr8k-cn image_paragraph_captioning msrvtt textcap (M3IT/captioning). In addition, the image IDs do not completely match; for instance, COCO images in VideoChat2IT have an additional directory compared to those in M3IT. I believe it would be beneficial to fully opensource this.

@Andy1621
Copy link
Collaborator

Andy1621 commented Jun 26, 2024

Hi! You can change these datasets by yourself from M3IT, since we use the original annotations but change the file_name for our data.

@patrick-tssn
Copy link
Author

You mean manually check the file for each split? That's fine, but solely changing file names is confusing and adds unnecessary workload without any benefits.

@patrick-tssn
Copy link
Author

Hi, I didn't find image/caption/minigpt4 from M3IT, how can I obtain these images?

@patrick-tssn patrick-tssn reopened this Jul 16, 2024
@yinanhe
Copy link
Member

yinanhe commented Jul 16, 2024

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants