Hello,
I have successfully generated all features (both text and visual) for the COCO dataset. However, MLE training fails with the following error as soon as validation starts, at 96% of the first epoch:
  File "/home/soaresbu/clip-captioning/captioning/utils/clipscore.py", line 177, in forward
    refclip_s = self.calc_refclip_s(
  File "/home/soaresbu/clip-captioning/captioning/utils/clipscore.py", line 124, in calc_refclip_s
    ref_text_feat = ref_text_feat.view(B, -1, dim)
RuntimeError: shape '[4, -1, 512]' is invalid for input of size 64000
Any idea what could be wrong here? Am I missing something when generating the CLIP-S features with python scripts/clipscore_prepro_feats.py?
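For what it's worth, the numbers in the error message can be checked with a small sketch. It assumes that each reference caption is stored as one 512-dimensional vector and that the view call expects the same number of reference vectors for every image in the batch. The helper rows_per_batch is a hypothetical name used only for illustration; it is not part of clipscore.py.

```python
def rows_per_batch(numel, batch_size, dim):
    """Return the size that view(batch_size, -1, dim) would infer for -1,
    or None when the tensor cannot be reshaped that way.

    Hypothetical helper for illustration; not part of clipscore.py.
    """
    rows, rem = divmod(numel, dim)
    if rem != 0:
        return None  # not a whole number of dim-sized vectors
    per_batch, rem = divmod(rows, batch_size)
    if rem != 0:
        return None  # vectors cannot be split evenly across the batch
    return per_batch


# Values taken from the error message above.
numel, B, dim = 64000, 4, 512

total_vectors = numel // dim       # 125 vectors of size 512
leftover = total_vectors % B       # 1 vector left over for a batch of 4

print(total_vectors, leftover, rows_per_batch(numel, B, dim))
```

So the tensor holds 125 vectors of size 512, which cannot be divided evenly across a batch of 4. Under the assumptions above, this would suggest that the images in this batch do not all have the same number of reference features.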