TY  - JOUR
AU  -
TI  - Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
JF  - Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing
DO  - 10.18653/v1/2024.emnlp-main.342
DA  - 2024-01-01
UR  - https://www.deepdyve.com/lp/unpaywall/video-llava-learning-united-visual-representation-by-alignment-before-AKzTePvSit
DP  - DeepDyve
ER  -