Skip to main content

🏢 University of Sydney

RealSyn: An Effective and Scalable Multimodal Interleaved Document Transformation Paradigm
·5226 words·25 mins· loading · loading
AI Generated 🤗 Daily Papers Multimodal Learning Vision-Language Models 🏢 University of Sydney
RealSyn: A new, scalable multimodal dataset revolutionizes vision-language learning by effectively using interleaved image-text documents.
ORID: Organ-Regional Information Driven Framework for Radiology Report Generation
·3437 words·17 mins· loading · loading
AI Generated 🤗 Daily Papers Natural Language Processing Text Generation 🏢 University of Sydney
ORID framework leverages organ-regional information to boost radiology report generation, achieving state-of-the-art accuracy by integrating multi-modal data and reducing noise from unrelated organs.