🏢 Meta GenAI

Step-KTO: Optimizing Mathematical Reasoning through Stepwise Binary Feedback

18 January 2025·704 words·4 mins

AI Generated 🤗 Daily Papers Natural Language Processing Large Language Models 🏢 Meta GenAI

STEP-KTO: A novel training framework boosts LLMs’ mathematical reasoning by providing binary feedback on both intermediate steps and final answers. This ensures logical reasoning trajectories and impr…

Flowing from Words to Pixels: A Framework for Cross-Modality Evolution

19 December 2024·3592 words·17 mins

AI Generated 🤗 Daily Papers Multimodal Learning Vision-Language Models 🏢 Meta GenAI

CrossFlow: Directly evolves one modality into another using flow matching, achieving state-of-the-art results across various tasks.

Apollo: An Exploration of Video Understanding in Large Multimodal Models

13 December 2024·1887 words·9 mins

AI Generated 🤗 Daily Papers Multimodal Learning Vision-Language Models 🏢 Meta GenAI

Apollo LMMs achieve SOTA on video understanding tasks by exploring and optimizing the design and training of video-LMMs.