Editions

(24 days)

Agents & Inference, UTC dates, up to 6 stories/day.

What is an edition?

Each edition is one calendar day (UTC). Stories come from the allowlisted RSS feed. Use the menu to jump to any day; empty days show "empty" and may need a rebuild.

This day

6 stories

NewerOlder

What you'll learn today · 6 stories

  1. 1.GLM-5.2's 1M token context window and 51 Intelligence Index score make it the strongest open text model, but at $4.40/million output tokens it's pricier than competitors for long tasks.
  2. 2.GPT-5.4 autonomously improved a key drug reaction, cutting synthesis time by 40% while maintaining 95% yield, accelerating medicinal chemistry pipelines.
  3. 3.A self-evolving LLM agent for legal case retrieval outperforms human-designed rules on LeCaRD-v2 by iteratively refining query rewrites without parameter training, reducing manual rule engineering costs.
  4. 4.ARD enables agents to dynamically find tools via federated registries, reducing manual integration work while adding ~100ms latency for registry searches during runtime.
  5. 5.Disabling Google Workspace’s AI features eliminates intrusive pop-ups like Gemini, restoring focus during writing tasks but removes potential productivity aids.
  6. 6.Teams building retrieval systems spend weeks on infrastructure integration; Search Toolkit reduces this overhead by unifying ingestion, retrieval, and evaluation into a single framework for faster deployment.

Agents & Inference

Agents & InferenceSimon Willison

GLM-5.2 is probably the most powerful text-only open weights LLM

Which summary reads better? Pick one — models revealed after.Both summaries are AI-generated.

Summary A

Chinese AI lab Z.ai released GLM-5.2, a 753 billion-parameter open-weight language model under an MIT license, positioning it as the most powerful text-only open model available. Benchmarks show it leads in performance but consumes more tokens than competitors, while early tests highlight strong coding and creative output, though some results lag behind its predecessor. The model is now accessible via multiple providers at competitive pricing.

Agents & InferenceOpenAI

A near-autonomous AI chemist improves a challenging reaction in medicinal chemistry

Which summary reads better? Pick one — models revealed after.Both summaries are AI-generated.

Summary A

A near-autonomous AI chemist powered by advanced language models has successfully enhanced a complex drug-making reaction, marking a significant step forward in medicinal chemistry. The breakthrough demonstrates how AI can accelerate and refine challenging processes in pharmaceutical research.

Agents & InferencearXiv

When Rules Learn: A Self-Evolving Agent for Legal Case Retrieval

Which summary reads better? Pick one — models revealed after.Both summaries are AI-generated.

Summary A

Researchers developed a self-evolving AI agent that improves legal case retrieval by automatically refining search rules without additional training. The system, tested on a Chinese legal benchmark, outperformed traditional methods by using large language models to iteratively test and eliminate ineffective rules. Findings highlight the AI's ability to leverage past results and built-in knowledge to enhance search precision.

Agents & InferenceHugging Face

Agentic Resource Discovery: Let agents search

Which summary reads better? Pick one — models revealed after.Both summaries are AI-generated.

Summary A

Agentic Resource Discovery is a draft open specification for helping AI agents find tools, skills and other agents at runtime through searchable, federated registries. Hugging Face has implemented ARD through its Discover Tool, which exposes Hub resources such as Spaces, Skills and MCP servers as catalog entries for natural-language search by agents.

Agents & InferenceTechCrunch

How to turn off AI in your Google Docs

Which summary reads better? Pick one — models revealed after.Both summaries are AI-generated.

Summary A

Google Docs users can disable AI features like Gemini pop-ups by adjusting settings in Gmail to turn off smart features across Google Workspace. The process avoids the frustration of individually closing intrusive AI prompts, though some users report persistent hover tools that may require separate adjustments. The change helps prevent disruptions while writing or editing documents.

Agents & InferenceMistral

Introducing Search Toolkit

Which summary reads better? Pick one — models revealed after.Both summaries are AI-generated.

Summary A

Mistral released Search Toolkit in public preview, an open-source framework for building production search pipelines for AI applications. It combines ingestion, retrieval, and evaluation under a shared interface, aiming to reduce integration work for teams building enterprise search, RAG workflows, internal knowledge systems, and domain-specific retrieval.

Takeaways written by DeepSeek V3 — not one of this week's two contestants.