Apple trained a large language model to efficiently understand long-form video

Apple researchers have developed an adapted version of the SlowFast-LLaVA model that beats larger models at long-form video analysis and understanding. Here’s what that means.