- Adobe Research Unlocking Long-Term Memory in Video World Models with State-Space Models
By combining State-Space Models (SSMs) for efficient long-range dependency modeling with dense local attention for coherence, and using training strategies like diffusion forcing and frame local attention, researchers from Adobe Research successfully overcome the long-standing challenge of long-term memory in video generation. The post Adobe Research Unlocking Long-Term Memory in Video World Models with State-Space Models first appeared on Synced.
- DeepSeek-V3 New Paper is coming! Unveiling the Secrets of Low-Cost Large Model Training through Hardware-Aware Co-design
A newly released 14-page technical paper from the team behind DeepSeek-V3, with DeepSeek CEO Wenfeng Liang as a co-author, sheds light on the “Scaling Challenges and Reflections on Hardware for AI Architectures.” The post DeepSeek-V3 New Paper is coming! Unveiling the Secrets of Low-Cost Large Model Training through Hardware-Aware Co-design first appeared on Synced.
- DeepSeek Unveils DeepSeek-Prover-V2: Advancing Neural Theorem Proving with Recursive Proof Search and a New Benchmark
DeepSeek AI releases DeepSeek-Prover-V2, an open-source LLM for Lean 4 theorem proving. It uses recursive proof search with DeepSeek-V3 for training data and reinforcement learning, achieving top results on MiniF2F. The post DeepSeek Unveils DeepSeek-Prover-V2: Advancing Neural Theorem Proving with Recursive Proof Search and a New Benchmark first appeared on Synced.
- Can GRPO be 10x Efficient? Kwai AI’s SRPO Suggests Yes with SRPO
Kwai AI's SRPO framework slashes LLM RL post-training steps by 90% while matching DeepSeek-R1 performance in math and code. This two-stage RL approach with history resampling overcomes GRPO limitations. The post Can GRPO be 10x Efficient? Kwai AI’s SRPO Suggests Yes with SRPO first appeared on Synced.
- Zhipu.AI’s Open-Source Power Play: Blazing-Fast GLM Models & Global Expansion Ahead of Potential IPO
Zhipu.AI open-sources faster GLM models (8x speedup), launches Z.ai, aiming for global expansion, potentially ahead of IPO. The post Zhipu.AI’s Open-Source Power Play: Blazing-Fast GLM Models & Global Expansion Ahead of Potential IPO first appeared on Synced.
- DeepSeek Signals Next-Gen R2 Model, Unveils Novel Approach to Scaling Inference with SPCT
DeepSeek AI, a prominent player in the large language model arena, has recently published a research paper detailing a new technique aimed at enhancing the scalability of general reward models (GRMs) during the inference phase. The post DeepSeek Signals Next-Gen R2 Model, Unveils Novel Approach to Scaling Inference with SPCT first appeared on Synced.
- AI Video Generation Race Shifts from Capability to Profitability, Challenging Sora’s Dominance
The AI video generation landscape is shifting from capability to profitability, challenging OpenAI Sora's dominance. Competitors are surpassing Sora in quality and efficiency, with users preferring alternatives. The focus is now on improvements like precise control and style customization for practical applications. The post AI Video Generation Race Shifts from Capability to Profitability, Challenging Sora’s Dominance first appeared on Synced.
- Beyond Next-Token Prediction? Meta’s Novel Architectures Spark Debate on the Future of Large Language Models
Meta AI's recent research introduces the BLT architecture, eliminating tokenizers for improved multimodal processing, and the Large Concept Model (LCM), which operates on semantic "concepts" instead of tokens for more human-like reasoning and better cross-lingual generalization. These innovations challenge the traditional "next-token prediction" paradigm in LLMs. The post Beyond Next-Token Prediction? Meta’s Novel Architectures Spark Debate on the Future of Large Language Models first appeared on Synced.
- Nvidia Intensifies Robot Push with New Humanoid Platform as Industry Giants Eye Lucrative Future
Nvidia will launch Jetson Thor for humanoid robots in H1 2025, entering a growing market where Google is also active. The robotics sector is projected for substantial growth. Nvidia offers integrated hardware and software solutions. Simultaneously, China's rapidly developing domestic humanoid robot market presents emerging competition. The post Nvidia Intensifies Robot Push with New Humanoid Platform as Industry Giants Eye Lucrative Future first appeared on Synced.
- Automating Artificial Life Discovery: The Power of Foundation Models
A research team introduces Automated Search for Artificial Life (ASAL). This novel framework leverages vision-language FMs to automate and enhance the discovery process in ALife research. The post Automating Artificial Life Discovery: The Power of Foundation Models first appeared on Synced.