
ABOUT THIS FEED
AIModels.fyi is a curated resource for tracking new AI models, tools, and datasets. Its Substack RSS feed delivers digest-style newsletters that summarize recent machine learning research and open-source projects. The platform helps practitioners and enthusiasts discover and follow emerging models across natural language processing, computer vision, generative AI, and reinforcement learning. Each post aggregates relevant updates, links, and context, saving readers the time of browsing multiple sources. The writing is concise yet informative, and suits developers, researchers, and students who want a quick overview of current developments. With a few posts per week, the feed is a practical way to keep up with the rapidly expanding AI ecosystem without information overload.
- Can unified multimodal models align understanding and generation, without *any* captions?
Reconstruction alignment improves unified multimodal models
- What if LMs could collectively train, slashing RL post-training costs?
Sharing is Caring: Efficient LM Post-Training with Collective RL Experience Sharing
- Are we training LLMs to confidently guess instead of admitting uncertainty?
Why Language Models Hallucinate
- Can you pick the perfect LLM without breaking the bank?
Adaptive LLM Routing under Budget Constraints
- Can AI learn to prove theorems by thinking step-by-step like a human mathematician, even without perfect instructions?
Seed-Prover: Deep and Broad Reasoning for Automated Theorem Proving
- Can reinforcement learning fix the glaring visual flaws in AI-generated images?
X-Omni: Reinforcement Learning Makes Discrete Autoregressive Image Generative Models Great Again
- Can doctors trust AI diagnostic tools enough to delegate tasks?
Towards physician-centered oversight of conversational diagnostic AI
- Can seeing the document like a human dramatically boost a RAG system's IQ?
Vision-Guided Chunking Is All You Need: Enhancing RAG with Multimodal Document Understanding
- Can AI reconstruct super-slow-motion 4D models from regular speed multi-camera video?
4DSloMo: 4D Reconstruction for High Speed Scene with Asynchronous Capture
- An embarrassingly simple defense against LLM abliteration attacks
Defending AI systems against a new form of attack