<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"><channel><title>Oh My Tokens</title><link>https://oh-my-tokens.ai/</link><description>AI token intelligence for builders, agents, and small teams. Multilingual home pages: /zh/, /pt/, /fr/, /ja/, /ko/.</description><item><title>Fable's judgement</title><link>https://simonwillison.net/2026/Jul/3/judgement/</link><guid>https://simonwillison.net/2026/Jul/3/judgement/</guid><pubDate>Fri, 03 Jul 2026 00:00:00 GMT</pubDate><description>Agent savings can come from fewer bad loops, not only cheaper tokens. Saving money on agents is not just about fewer tokens; it is about fewer interventions, less rework, and fewer failed loops.</description></item><item><title>Release: llm-coding-agent 0.1a0</title><link>https://simonwillison.net/2026/Jul/2/llm-coding-agent/</link><guid>https://simonwillison.net/2026/Jul/2/llm-coding-agent/</guid><pubDate>Thu, 02 Jul 2026 00:00:00 GMT</pubDate><description>A minimal coding agent maps where token spend happens. Which token-consuming steps actually make up a coding agent?</description></item><item><title>Using DSPy to evaluate and improve Datasette Agent's SQL system prompts</title><link>https://simonwillison.net/2026/Jul/2/dspy-datasette-agent-prompts/</link><guid>https://simonwillison.net/2026/Jul/2/dspy-datasette-agent-prompts/</guid><pubDate>Thu, 02 Jul 2026 00:00:00 GMT</pubDate><description>Prompt optimization can be evaluated with harnesses instead of vibes. The first step to spending fewer tokens: know whether your system prompt actually works.</description></item><item><title>Vercel's Andrew Qu on why agents are a new kind of software</title><link>https://www.latent.space/p/vercel-agents-new-software</link><guid>https://www.latent.space/p/vercel-agents-new-software</guid><pubDate>Fri, 03 Jul 2026 00:00:00 GMT</pubDate><description>Agent-readable websites are becoming part of the product surface. 
Future websites will not only be viewed by people; they will also be read by agents.</description></item><item><title>How Cursor deploys AI inside the enterprise</title><link>https://www.latent.space/p/cursor-forward-deployed-engineers</link><guid>https://www.latent.space/p/cursor-forward-deployed-engineers</guid><pubDate>Wed, 01 Jul 2026 00:00:00 GMT</pubDate><description>Vibe coding becomes a team budget problem when workflows scale. The real cost of Cursor is not the subscription fee but the token burn of the entire software factory.</description></item><item><title>What’s new in Claude Sonnet 5</title><link>https://simonwillison.net/2026/Jun/30/claude-sonnet-5/</link><guid>https://simonwillison.net/2026/Jun/30/claude-sonnet-5/</guid><pubDate>Tue, 30 Jun 2026 00:00:00 GMT</pubDate><description>New model releases affect defaults, agent costs, and failure rates. After a model upgrade, should you switch your default model? Read the docs, not the marketing.</description></item><item><title>ScarfBench: Benchmarking AI Agents for Enterprise Java Framework Migration</title><link>https://huggingface.co/blog/ibm-research/scarfbench</link><guid>https://huggingface.co/blog/ibm-research/scarfbench</guid><pubDate>Tue, 30 Jun 2026 00:00:00 GMT</pubDate><description>Agent benchmarks are moving toward real enterprise migration tasks. A truly useful agent benchmark should measure the cost of completing a task.</description></item><item><title>Ornith-1.0: Self-Scaffolding LLMs for Agentic Coding</title><link>https://simonwillison.net/2026/Jun/29/ornith/</link><guid>https://simonwillison.net/2026/Jun/29/ornith/</guid><pubDate>Mon, 29 Jun 2026 00:00:00 GMT</pubDate><description>Open-weight coding models can change the API cost equation. If open models get good enough at writing code, the cost structure of personal agents will change.</description></item><item><title>OpenAI reports median internal Codex output tokens grew dramatically</title><link>https://www.latent.space/p/ainews-openai-reports-median-internal</link><guid>https://www.latent.space/p/ainews-openai-reports-median-internal</guid><pubDate>Fri, 26 Jun 2026 00:00:00 GMT</pubDate><description>Output token growth is a major hidden cost in agent workflows. 
In the agent era, the expensive part is not necessarily input; it may be output and loops.</description></item><item><title>GLM-5.2 is the step change for open agents</title><link>https://www.interconnects.ai/p/glm-52-is-the-step-change-for-open</link><guid>https://www.interconnects.ai/p/glm-52-is-the-step-change-for-open</guid><pubDate>Mon, 22 Jun 2026 00:00:00 GMT</pubDate><description>Chinese and open models are part of global agent cost/performance comparisons. Chinese models are not a side note; they are part of the global agent cost/performance landscape.</description></item><item><title>Is it agentic enough? Benchmarking open models on your own tooling</title><link>https://huggingface.co/blog/is-it-agentic-enough</link><guid>https://huggingface.co/blog/is-it-agentic-enough</guid><pubDate>Thu, 18 Jun 2026 00:00:00 GMT</pubDate><description>Your own tooling may matter more than public leaderboard rank. Do not ask which model is best; ask which model succeeds most cheaply on your agent stack.</description></item><item><title>Qwen3 benchmark results</title><link>https://aider.chat/2025/05/08/qwen3.html</link><guid>https://aider.chat/2025/05/08/qwen3.html</guid><pubDate>Thu, 08 May 2025 00:00:00 GMT</pubDate><description>A durable bridge between Chinese models and coding-agent evaluation. The place of Chinese-language and Chinese models in coding agents should be discussed using real coding benchmarks.</description></item><item><title>How Claude Code uses prompt caching</title><link>https://code.claude.com/docs/en/prompt-caching</link><guid>https://code.claude.com/docs/en/prompt-caching</guid><pubDate>Sat, 04 Jul 2026 00:00:00 GMT</pubDate><description>Prompt caching directly changes speed and token cost. You may think Claude Code is slow or expensive, but the real cause may be a cache miss.</description></item><item><title>Ditching Claude for OpenCode and OpenRouter</title><link>https://www.ianwootten.co.uk/2026/07/01/ditching-claude-for-opencode-and-openrouter/</link><guid>https://www.ianwootten.co.uk/2026/07/01/ditching-claude-for-opencode-and-openrouter/</guid><pubDate>Wed, 01 Jul 2026 00:00:00 GMT</pubDate><description>A real case of switching from default tools to open router and open-model workflows. 
When is it worth leaving the default tools for an open router and open-source models?</description></item><item><title>Contextify - Searchable History for Claude Code and Codex</title><link>https://contextify.sh/</link><guid>https://contextify.sh/</guid><pubDate>Sat, 04 Jul 2026 00:00:00 GMT</pubDate><description>Agent history and reusable context can reduce repeated token spend. If your agent forgets its context every time, you are burning tokens on the same work again.</description></item></channel></rss>
