3 months ago | Hugging Face Daily Papers | CoMP: Continual Multimodal Pre-training for Vision Foundation Models
3 months ago | Hugging Face Daily Papers | Trajectory Balance with Asynchrony: Decoupling Exploration and Learning for Fast, Scalable LLM Post-Training
3 months ago | Hugging Face Daily Papers | Video SimpleQA: Towards Factuality Evaluation in Large Video Language Models
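3 months ago | AI News CN (Telegram) | US media: "Chinese-style open source" may bring an "Android moment" to the AI industry. Although the R1 model caused a stir in the field with its claims of strong performance and lower cost, some analysts say DeepSeek's most significant impact lies in driving....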
3 months ago | Hugging Face Daily Papers | FFN Fusion: Rethinking Sequential Computation in Large Language Models
3 months ago | Hugging Face Daily Papers | SimpleRL-Zoo: Investigating and Taming Zero Reinforcement Learning for Open Base Models in the Wild
3 months ago | Hugging Face Daily Papers | CFG-Zero*: Improved Classifier-Free Guidance for Flow Matching Models
3 months ago | Hugging Face Daily Papers | I Have Covered All the Bases Here: Interpreting Reasoning Features in Large Language Models via Sparse Autoencoders
3 months ago | Hugging Face Daily Papers | Enhanced OoD Detection through Cross-Modal Alignment of Multi-Modal Representations
3 months ago | Hugging Face Daily Papers | Classical Planning with LLM-Generated Heuristics: Challenging the State of the Art with Python Code