text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
Labels help you categorize and filter issues.