The paper states: "For very high-resolution images, we limit the maximum number of grids to 49". For LongVA, each frame of a video is treated as an image (336 * 336 * N). Does this mean the maximum number of frames is also 49?