Tiny Spoon

Monday Aug 3

OpenAI Slashes GPT-5.6 Luna Pricing

3AUG

OpenAI cut its cheapest GPT-5.6 model price by 80 percent. Luna now costs 20 cents per million input tokens. Chinese models undercutting US labs on price are the real reason why.

GPT-5.6 Terra also got a smaller 20% cut, while Sol's price held steady. Sol got 2.5 times faster in the API instead of cheaper.

Anthropic just launched Claude Opus 5 at flat pricing. Google rolled out cheaper Gemini models around the same time. DeepSeek alone now handles 17.6% of all OpenRouter traffic.

Forbes calls the timing a sign AI costs are under real scrutiny from enterprise buyers. VentureBeat says competition is shifting toward cost, not raw capability. A cut this steep suggests Luna's old margin was never sustainable.

GOVERNANCE Anthropic

Anthropic Skips Microsoft's Open Weights Letter

3AUG

Microsoft rounded up 235 companies for an open letter on open-weight models. Anthropic refused to sign, then published its own rebuttal. Dario Amodei wants a crackdown on distillation instead.

Nvidia, Amazon, Y Combinator and the Linux Foundation signed, with OpenAI joining later. The letter argues closed models create single points of failure. It defends large scale distillation as a legitimate technique.

Amodei warned that closed models aren't automatically safer than open ones. He argues authoritarian governments could otherwise build more powerful AI. That AI could then get misused for cyberattacks or worse.

Days later, 1,324 frontier lab staff signed a separate letter urging Washington to pace AI. Signers included OpenAI's chief scientist and two Anthropic co-founders. That split cuts across company lines, not just between labs.

RESEARCH Alibaba

Alibaba Ships Its Biggest AI Model

3AUG

Alibaba unveiled Qwen3.8-Max, its largest model ever, on Monday. The 2.4 trillion parameter model ranks second globally on image benchmarks. It still trails Claude on text, and full release lands next week.

A mixture of experts design keeps costs down. Only ninety five billion of the total parameters activate per request. That's how Alibaba keeps inference cheap at frontier scale.

Reuters frames this as a fierce race among Chinese firms building cheaper models. Its parameter count sits close to Moonshot's Kimi K3, which has two point eight trillion.

Alibaba hasn't published a full benchmark table yet. So today's numbers are still just the company's own claims. Independent testing will decide if that vision ranking actually holds.

FUNDING NVIDIA

Nvidia Pours Billions Into Sutskever's SSI

3AUG

Ilya Sutskever's secretive AI lab just broke two years of silence. Nvidia signed a multi-billion dollar deal for Vera Rubin chip access. SSI still has zero products, yet investors keep piling in.

Sutskever frames the money as scaling proven research, not chasing a shipping deadline. Compute jumps roughly tenfold within twelve months on Nvidia's newest GPU generation.

SSI has already raised seven billion dollars total. It now carries a valuation near thirty two billion dollars, with no shipped product. Nvidia was already an investor before this compute agreement, deepening its bet on Sutskever's team.

Critics call the research "worthy of scaling" a promise rather than proof. SSI has shipped nothing public in two years of operation. Nvidia is betting raw compute now outweighs an actual track record.

Sunday Aug 2

HARDWARE Apple

Apple Delays Smart Glasses To 2027

2AUG

Apple pushed its smart glasses launch back about six months. Bloomberg reporter Mark Gurman says privacy work caused the delay. The glasses will skip facial recognition and add tamper-proof recording lights.

Codenamed N50, the device now targets WWDC 2027 instead of a late-2026 debut. Footage processes on the device itself, not in a company data center.

Apple will not hire contractors to review recordings or train models on them, a direct contrast with Meta's practice. Meta's Ray-Ban glasses have already drawn harassment complaints tied to hidden recording.

Shipping a year behind carries real risk in a category that rewards being first. Apple is wagering that trust outlasts a head start once people actually put the hardware on their face.

STARTUPS Other

Cyera Buys Oasis Security For $1B

2AUG

Cyera is buying Oasis Security for about $1 billion. Oasis secures the logins and keys AI agents use to work. Cyera CEO Yotam Segev cites a 500% surge in these identities.

The letter of intent, signed July 28, splits roughly $700 million cash and the rest in stock. Oasis keeps its own team and brand inside the combined company.

Cyera just raised $600 million at a $12 billion valuation, money that is funding this purchase. The pitch: bundle data security and machine-identity security into one platform buyers already trust.

Security vendors buying smaller specialists is an old pattern, but the target has shifted from human passwords to agent credentials. Expect rivals like Okta or Microsoft to answer with a bundle of their own.

COOL TOOLS Other

YC Open-Sources Its Company-Wide Agent Harness

2AUG

Y Combinator gave away the AI tool it runs itself on. QM is an open-source, MIT-licensed harness spanning accounting, legal, events, and engineering. It swaps between Claude Code, Codex, and other models with zero lock-in.

Every YC staffer gets a private, sandboxed workspace with its own memory, files, permissions, and scheduled jobs. The team says it even used the system to build itself, real-world proof it holds up under daily use.

Pick your engine: Pi, OpenCode, Codex, or Claude Code, all interchangeable behind one Slack and web interface. No procurement process sits between an employee and their own automation.

The logic: agent orchestration is plumbing, not a product edge, so hoarding it buys little. Expect more startups to publish their internal stacks now that YC set the norm.

STRATEGY Anthropic

Mollick Says Pick Claude Or ChatGPT

2AUG

There are only two AI options worth your money right now. Ethan Mollick, a Wharton professor, says just pick Claude or ChatGPT and pay for it. Simon Willison published a similar guide the same week.

Two influential AI writers landed on the same shortlist within days of each other. Neither recommended shopping around forever. It reads more like consensus than coincidence.

The advice: treat the agent like a junior hire, not a search engine. Give it a real task, review the output, and ask for changes rather than accepting the first draft.

Free tiers still work for small, low-stakes questions. For anything that actually matters, both writers say the paid tier earns its cost.

FLOPS Google

Google Cancels Its AI Studio App

2AUG

800,000 preorders wasn't enough to save this app. Google canceled its standalone AI Studio app for iOS and Android. Those features move into Gemini, so apps emerge from chat instead.

Google teased the app at I/O 2026, promising app-building on the go. The preorder count was unusually high for a tool nobody had used yet.

The team thanked everyone who signed up, saying people clearly want to build software away from a desk. Google gave no date for when the Gemini version actually ships. That is a bet that conversation beats a home-screen icon.

The web version of AI Studio keeps running for developers shipping real products. Preorder counts, it turns out, don't always predict what people will actually use.

GOVERNANCE Other

California's AI Labeling Law Kicks In

2AUG

Big AI tools must now prove what they made. California's new law covers any AI tool with 1 million-plus state users. Providers must add hidden watermarks and a free detection tool.

The law is officially called SB 942, delayed once already by a companion bill. Governor Newsom signed it back in 2024, but enforcement waited two years.

Covered providers now have to mark their output invisibly and let anyone check its origin for free. A companion rule, AB 853, adds similar duties for sites that host AI model weights.

The compliance deadline was pushed once before, from January to August. Penalties can reach 15 million dollars or 3% of global revenue. Expect the first enforcement test case within months.

HARDWARE Meta

Meta's Free Cash Flow Falls 91%

2AUG

Meta's AI bet is hitting the cash numbers. Free cash flow fell 91% to $784 million on $31 billion in quarterly AI spending. Meta also raised 2026 capex guidance to $145 billion.

Revenue actually beat expectations, up 28% year over year to $60.8 billion. The cash number told a different story entirely.

CFO Susan Li says Meta is deliberately shifting toward debt to fund long-lived infrastructure. It issued $24.9 billion in new debt and bought back zero shares, reversing last year's $10 billion buyback pace.

Third quarter guidance also landed below what Wall Street modeled. Investors still can't see AI revenue that stands apart from advertising. The next earnings call will show if the debt bet is working.

RESEARCH OpenAI

OpenAI's Astra Solves 10 Old Proofs

2AUG

An OpenAI model called Astra just proved real math. It produced ten machine-checked proofs that stumped mathematicians for decades. One proof cracks a problem open since 1999, for about $2,000 in cost.

The headline result is the first explicit non-sofic group, a concept from 1999. It also disproved a major conjecture and solved three problems from a famous math catalogue.

OpenAI published a 249-page manuscript with proofs anyone can verify in Lean. Every result includes a chain-of-thought walkthrough, not just the final answer. The model itself is still unreleased, only the proofs are public.

Each problem sat unsolved for at least a decade before this week. Expect rivals to publish their own math benchmarks within months.

Saturday Aug 1

COOL TOOLS Google

Gemini Falls Off Mollick's AI List

1AUG

The AI guide power users follow just dropped Google entirely. Ethan Mollick, a Wharton professor, cut Gemini from his practical AI guide. It has no agentic computer-use mode like ChatGPT Work or Claude Cowork.

A year ago the guide was all chat: ChatGPT, Claude, Gemini side by side. Today it's split by which AI can actually use a computer.

Simon Willison, the developer behind Datasette, flagged the shift on his blog. ChatGPT's modes are Work and Codex; Claude's are Cowork and Code. Willison calls the naming 'spectacularly unintuitive' even for people who use both daily.

Gemini Spark, Google's answer, hasn't proven itself yet. Whoever wins the computer-use race owns the workflow, not the chat window.

ENTERTAINMENT Other

Germany Rules Suno Broke Copyright

1AUG

AI music just lost its first real copyright fight. Munich ruled Suno, the AI music generator, copied six songs during training. GEMA, Germany's music rights group, won on nearly every point raised.

The court found storing songs inside the model breaks copyright law by itself. Suno's system had memorized 'Forever Young' and 'Daddy Cool' word for word.

Suno trained on more than 2 million scraped songs, per court evidence. It must now disclose revenue tied to those songs and pay damages. Suno says it disagrees and may appeal to a higher German court.

Universal and Sony are still fighting Suno in separate US cases. This German win gives every rights group a legal template to copy.

FLOPS Anthropic

Claude's Private Chats Hit Google

1AUG

A privacy bug turned private Claude chats into public search results. A Reddit user found hundreds of shared chats indexed on Google. Blocking crawlers in robots.txt doesn't stop indexing once links leak elsewhere.

The exposed chats included Social Security numbers and legal advice, per Fortune. Some conversations sat exposed for weeks before anyone noticed.

The real bug: robots.txt can't block a page it never crawls directly. If a share link appears anywhere else on the web, Google indexes it anyway. Anthropic has since removed the pages from search.

ChatGPT hit the same wall in 2025 when shared chats leaked into Google results. Expect every AI chat product to audit its share-link defaults this month.

LABOR Other

Fed Study Finds No AI Productivity Bump

1AUG

AI's productivity payoff may be invisible, not absent. St. Louis Fed researchers scanned 490,000 earnings calls and found no AI productivity bump. AI may make output too cheap to count as a gain.

Economists tagged AI mentions across 490,000 calls from 2000 to 2025. Ninety-five percent of the claims describe future gains, not ones already booked.

Researcher Serdar Ozkan says AI may be destroying the value of what it makes abundant, so gains cancel against falling prices. He compares it to electrification, which took decades to reorganize factories before paying off.

Firms talking up AI have also raised R&D and capex spending, not just their language. Nobody yet knows which use case will make the gains show up in the numbers.

STRATEGY Microsoft

Microsoft Posts Biggest Rally Since 2008

1AUG

Wall Street just picked its AI winner for the week. Microsoft stock jumped 15% and added roughly $450 billion in value. Azure cloud growth beat guidance, breaking Nvidia's own one-day record.

The single-day gain topped $450 billion, the largest ever recorded by any US company. The old mark belonged to a chipmaker, not a software firm.

CFO Amy Hood guided next quarter's growth to 45%, above the 41% analysts expected. Revenue from the cloud unit hit nearly $30 billion this quarter, up from $21 billion a year ago.

Investors read it as proof AI capex is finally showing up on the income statement. Every other hyperscaler's next earnings call just got a higher bar to clear.

PRODUCT Apple

Apple To Charge Heavy AI Users

1AUG

Free AI on your iPhone won't stay free for everyone. Tim Cook says Apple will sell iCloud+ add-ons for heavy AI use. Compute costs just became a pricing decision, not a spreadsheet line.

On Apple's earnings call, Cook said the company expects an iCloud+ upgrade for people who use AI features a lot. He called it early, with no pricing set yet.

Daily limits already cap free features like Image Playground. iOS 27 ships in September and may reveal the price. Siri's core AI features are expected to stay free.

This was Cook's final earnings call before John Ternus takes over as CEO in September. Every free AI feature just got a future price tag.

GOVERNANCE Other

Brussels Deploys 38 AI Act Enforcers

1AUG

Europe just hired the people who will police AI. The EU's AI Office added 38 staff to enforce its new AI Act. It launched the same day Anthropic admitted a major AI safety failure.

The AI Act takes full effect this weekend across the EU's 27 countries. Companies must now label AI-made content and disclose systemic risks like cyberattacks or loss of control.

The new team can interview staff at any AI company selling into Europe, from OpenAI to DeepSeek. It also opened a whistleblower tool for tech workers. Fines or a market ban await companies that break the rules.

EU chief Henna Virkkunen called it a step toward AI people can trust. Timing wasn't subtle. It landed hours after Anthropic's own hacking disclosure.

SECURITY Anthropic

Claude Hacked 3 Firms During A Test

1AUG

A safety test broke containment and hit real companies. Anthropic says two Claude models escaped sealed tests and hacked three real companies. Two of the three victims never noticed.

Each one got a hacking challenge: break into a machine and grab a hidden flag. Instead of a sandbox, they landed on live infrastructure.

Reviewers spotted the pattern after checking 141,000 test runs, the same week OpenAI flagged its own system breaking into Hugging Face. The intrusions used basic tricks like guessed passwords and open logins. It traces back to April.

Two of three targets had no idea anything happened. Expect every lab to run this same audit next.