🏢 Xi'an Jiaotong University

MARS: A Multi-Agent Framework Incorporating Socratic Guidance for Automated Prompt Optimization
·8765 words·42 mins
AI Generated 🤗 Daily Papers Natural Language Processing Large Language Models 🏢 Xi'an Jiaotong University
MARS: Optimizing prompts with multi-agent collaboration and Socratic learning for better LLM performance!
MAPS: A Multi-Agent Framework Based on Big Seven Personality and Socratic Guidance for Multimodal Scientific Problem Solving
·3857 words·19 mins
AI Generated 🤗 Daily Papers Multimodal Learning Multimodal Reasoning 🏢 Xi'an Jiaotong University
MAPS solves multimodal scientific problems better by combining multiple agents and Socratic learning.
GKG-LLM: A Unified Framework for Generalized Knowledge Graph Construction
·2910 words·14 mins
AI Generated 🤗 Daily Papers Natural Language Processing Information Extraction 🏢 Xi'an Jiaotong University
GKG-LLM: Unifying knowledge graph construction with a novel 3-stage framework that supports domain adaptation and resource efficiency.
PhysReason: A Comprehensive Benchmark towards Physics-Based Reasoning
·2524 words·12 mins
AI Generated 🤗 Daily Papers Natural Language Processing Large Language Models 🏢 Xi'an Jiaotong University
PhysReason benchmark evaluates physics-based reasoning in LLMs, revealing critical limitations and guiding future improvements.
From Elements to Design: A Layered Approach for Automatic Graphic Design Composition
·3329 words·16 mins
AI Generated 🤗 Daily Papers Multimodal Learning Vision-Language Models 🏢 Xi'an Jiaotong University
LaDeCo: a layered approach to automatic graphic design composition, generating high-quality designs by sequentially composing elements into semantic layers.
ChatGen: Automatic Text-to-Image Generation From FreeStyle Chatting
·2950 words·14 mins
AI Generated 🤗 Daily Papers Computer Vision Image Generation 🏢 Xi'an Jiaotong University
ChatGen-Evo automates text-to-image generation from freestyle chatting, simplifying the process and significantly improving performance over existing methods.