🏢 Kyoto University
SBS Figures: Pre-training Figure QA from Stage-by-Stage Synthesized Images
·2647 words·13 mins·
loading
·
loading
AI Generated
🤗 Daily Papers
Computer Vision
Visual Question Answering
🏢 Kyoto University
SBS Figures creates a massive, high-quality figure QA dataset via a novel stage-by-stage synthesis pipeline, enabling efficient pre-training of visual language models.