9 days agoHugging Face Daily PapersSounding that Object: Interactive Object-Aware Image to Audio Generation
9 days agoHugging Face Daily PapersAdvancing Multimodal Reasoning: From Optimized Cold Start to Staged Reinforcement Learning
9 days agoHugging Face Daily PapersSuperWriter: Reflection-Driven Long-Form Generation with Large Language Models
9 days agoHugging Face Daily PapersEstablishing Trustworthy LLM Evaluation via Shortcut Neuron Analysis
9 days agoHugging Face Daily PapersMMR-V: What's Left Unsaid? A Benchmark for Multimodal Deep Reasoning in Videos
9 days agoHugging Face Daily PapersTRiSM for Agentic AI: A Review of Trust, Risk, and Security Management in LLM-based Agentic Multi-Agent Systems
9 days agoHugging Face Daily PapersSplatting Physical Scenes: End-to-End Real-to-Sim from Imperfect Robot Data
9 days agoHugging Face Daily PapersRex-Thinker: Grounded Object Referring via Chain-of-Thought Reasoning
9 days agoHugging Face Daily PapersQQSUM: A Novel Task and Model of Quantitative Query-Focused Summarization for Review-based Product Question Answering