Thanks for the great work in ViViT model 2. Is the value of the num_frame fixed or not? Or does the model process each frame one by one?
Thanks for the great work in ViViT model 2.
Is the value of the num_frame fixed or not? Or does the model process each frame one by one?