15 days agoHugging Face Daily PapersTime Blindness: Why Video-Language Models Can't See What Humans Can?
15 days agoHugging Face Daily PapersProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models
15 days agoHugging Face Daily PapersViStoryBench: Comprehensive Benchmark Suite for Story Visualization
15 days agoHugging Face Daily PapersMetaFaith: Faithful Natural Language Uncertainty Expression in LLMs
15 days agoHugging Face Daily PapersHarnessing Negative Signals: Reinforcement Distillation from Teacher Data for LLM Reasoning
15 days agoHugging Face Daily PapersContext is Gold to find the Gold Passage: Evaluating and Training Contextual Document Embeddings
15 days agoHugging Face Daily PapersReflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning
15 days agoHugging Face Daily PapersFinMME: Benchmark Dataset for Financial Multi-Modal Reasoning Evaluation
15 days agoAI News CN (Telegram) - English TranslationPerplexity's new tools can generate spreadsheets, etc.
15 days agoAI News CN (Telegram) - English TranslationResearchers believe that large models can neither think nor reason.
15 days agoHugging Face Daily PapersLearning from Videos for 3D World: Enhancing MLLMs with 3D Vision Geometry Priors
15 days agoAI News CN (Telegram) - English TranslationDeepSeek Becomes the Second Largest AI Laboratory in the World, and Chinese AI Catches Up with Its US Counterparts
15 days agoHugging Face Daily PapersHarnessing Large Language Models for Scientific Novelty Detection