1 day ago | Hugging Face Daily Papers | SparseMM: Head Sparsity Emerges from Visual Concept Responses in MLLMs
1 day ago | Hugging Face Daily Papers | MINT-CoT: Enabling Interleaved Visual Tokens in Mathematical Chain-of-Thought Reasoning
1 day ago | Hugging Face Daily Papers | AV-Reasoner: Improving and Benchmarking Clue-Grounded Audio-Visual Counting for MLLMs
1 day ago | Hugging Face Daily Papers | Revisiting Depth Representations for Feed-Forward 3D Gaussian Splatting
1 day ago | Hugging Face Daily Papers | SeedVR2: One-Step Video Restoration via Diffusion Adversarial Post-Training
1 day ago | Hugging Face Daily Papers | EOC-Bench: Can MLLMs Identify, Recall, and Forecast Objects in an Egocentric World?
1 day ago | AI News CN (Telegram) - English Translation | Reddit sues Anthropic for breach of contract and unfair competition
1 day ago | Hugging Face Daily Papers | Micro-Act: Mitigate Knowledge Conflict in Question Answering via Actionable Self-Reasoning
1 day ago | AI News CN (Telegram) - English Translation | Google has postponed the launch of its "Ask Your Photos" AI search feature.
1 day ago | AI News CN (Telegram) - English Translation | The French AI company Mistral has released an ambient programming assistant.
1 day ago | Hugging Face Daily Papers | Diagonal Batching Unlocks Parallelism in Recurrent Memory Transformers for Long Contexts
1 day ago | Hugging Face Daily Papers | The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text
1 day ago | Hugging Face Daily Papers | Qwen3 Embedding: Advancing Text Embedding and Reranking Through Foundation Models