Anthropic has filed two lawsuits against the Department of War to
change the Pentagon's decision to label it a 'supply chain risk'
Sign Up [1] |Advertise [2]|View Online [3]
TLDR
TOGETHER WITH [WorkOS] [4]
TLDR 2026-03-10
HOW TO TEST AI AGENTS THAT NEVER PRODUCE THE SAME OUTPUT TWICE
(SPONSOR) [4]
Same input. Same prompt. Different output. That's the reality of
testing AI agents that write code, and most teams are shipping without
solving it.
Nick Nisi from WorkOS [5] tackled this by building eval systems for
two AI tools: npx workos, a CLI agent that installs AuthKit [6] into
your project, and WorkOS's agent skills that power LLM responses about
SSO, directory sync, and RBAC.
The post covers how to test against real project structures, score
output that's different every time, and catch when your agent makes up
methods that don't exist.
Learn more about evals → [4]
📱
BIG TECH & STARTUPS
ANTHROPIC SUES PENTAGON OVER ‘SUPPLY CHAIN RISK' LABEL (4 MINUTE
READ) [7]
Anthropic has filed two lawsuits against the Department of War to
change the Pentagon's decision to label it a 'supply chain risk'. One
of the lawsuits is in the US District court in the Northern District
of California, and the other is in the US Court of Appeals for the
District of Columbia Circuit. The supply chain risk designation is
typically applied to firms that are deemed a major national security
risk and has never been used on an American company. Anthropic has
offered to continue negotiating with the Pentagon and also offered to
help move the Pentagon off its technology and onto another AI system.
OpenAI and xAI have signed agreements to provide technology on the
Department of War's classified systems in recent weeks.
APPLE POSTPONES SMART HOME DISPLAY LAUNCH AS IT WAITS FOR NEW AI AND
SIRI (4 MINUTE READ) [8]
Apple's smart home display has been delayed until later this year.
The project was first scheduled to launch in the spring of 2025, but
it was postponed to let the company finish work on a new Siri digital assistant. It was then scheduled to be released this month, but the
new Siri is still not yet ready. The display is designed to be a
central AI hub for the home that can display personalized data, such
as calendar appointments, reminders, notes, and more.
🚀
SCIENCE & FUTURISTIC TECHNOLOGY
CHINA LEADS THE HUMANOID ROBOT RACE — BUT THE US STILL HAS A SHOT
(6 MINUTE READ) [9]
China's humanoid robots now dominate the market with over 90% of
global sales and thousands of units shipped last year. Tesla's Optimus
robots still won't be ready for launch until at least next year. While
Chinese vendors are more advanced when it comes to production scales,
US companies are very strong at the technical side of things,
especially in the hardware and software departments. By the time
humanoid robot startups build up their production bases, they will be
ready for large-scale deployment.
HOW TO DESIGN ANTIBODIES (29 MINUTE READ) [10]
BoltzGen is the leading open-source approach for computational
antibody design. It uses the permissive MIT license, so it can be used commercially by anyone. This guide walks through the full process of
designing an antibody from home using BoltzGen. The process involves
choosing a target, preparing a target structure, running a design
campaign, filtering candidates, and experimentally validating the
results.
💻
PROGRAMMING, DESIGN & DATA SCIENCE
✂️ CUT QA CYCLES FROM HOURS TO MINUTES WITH AUTOMATED TESTING
(SPONSOR) [11]
If slow QA cycles are holding your team back from releasing faster,
try QA Wolf [12].
Their fully managed, AI-native service delivers 80% AUTOMATED E2E TEST
COVERAGE in weeks and helps teams SHIP 5× FASTER by cutting QA cycles
from hours to minutes.
⭐ Rated 4.8/5 on G2.
Schedule a demo to learn more → [13]
CODE REVIEW, A NEW FEATURE FOR CLAUDE CODE (1 MINUTE READ) [14]
Code Review is a new feature for Claude Code that dispatches a team
of agents on every PR to catch bugs. Built for depth, not speed, the
system is now in research preview for Team and Enterprise. Anthropic
runs Code Review on nearly every PR. The tool doesn't approve PRs, but
it closes the gap so reviewers can cover what's shipping. Reviews are
billed on token usage, and admins have several ways to control spend
and usage.
PERHAPS NOT BORING TECHNOLOGY AFTER ALL (2 MINUTE READ) [15]
The recurring concern, that large language models will push
technology choices towards the tools best represented in their
training data, making it harder for new tools to break through the
noise, doesn't really hold up anymore. New models have large enough
context lengths that they can consume a lot of documentation before
they start working on a problem. Most agents work just fine in
existing codebases that use libraries or tools too private or new to
feature in the training data. Developers are still free to choose
whatever tools they want to use and are not restricted to using the
ones LLMs are most familiar with.
🎁
MISCELLANEOUS
AFTER FALLING FAR BEHIND THE REST OF INDUSTRY, BLUE ORIGIN CREATES
NEW STOCK OPTION PLAN (8 MINUTE READ) [16]
When Jeff Bezos launched Blue Origin, he knew that the company would
not meet investors' expectations for return on investment over a
typical investing horizon. Decades later, the company is still not operationally profitable, though recently, it has made impressive
strides and seen financial returns from the sale of engines and
commercial launches. To continue its growth and attract top talent,
Blue Origin will begin granting stock options to employees this
spring. The new program is structured to provide opportunities for
liquidity events that will enable employees to convert vested stock
options into realized value. More details about the program will be
released during a company-wide meeting on April 17.
AMAZON TELLS FCC TO BIN SPACEX'S MILLION-SATELLITE DATACENTER DREAM
(2 MINUTE READ) [17]
Amazon has criticized SpaceX's application for permission to launch a
fleet of orbital datacenter satellites as incomplete, speculative, and unrealistic. It wants regulators to reject the application, which it
says is a speculative placeholder rather than a complete application
under the Commission's rules. Amazon also raised concerns about
satellite interference and environmental objections. Analysts say that
SpaceX's plan of putting datacenters in space is 'peak insanity' as
running spaceborne facilities would be uneconomical and could never
satisfy terrestrial demand for compute power.
⚡
QUICK LINKS
FORMER META AI CHIEF'S START-UP IS VALUED AT $3.5 BILLION (3 MINUTE
READ) [18]
Yann LeCun's Advanced Machine Intelligence Labs is only a month old
and employs just 12 people.
GHOSTTY 1.3.0 (30 MINUTE READ) [19]
Ghostty 1.3.0 is a significant release that includes hundreds of
improvements, bug fixes, and performance optimizations across all
platforms.
BLUESKY CEO JAY GRABER STEPS DOWN (2 MINUTE READ) [20]
Graber, who will be replaced by Toni Schneider as interim CEO, will
transition to a new role as chief innovation officer.
VIDEO CONFERENCING WITH POSTGRES (7 MINUTE READ) [21]
SpacetimeDB recently open-sourced a way for people to make video
calls over a database.
10X IS THE NEW FLOOR (3 MINUTE READ) [22]
AI amplifies people with agency and curiosity.
THE HUMAN.JSON PROTOCOL (12 MINUTE READ) [23]
human.json is a lightweight protocol for humans to assert authorship
of their site content and vouch for the humanity of others.
Love TLDR? Tell your friends and get rewards!
Share your referral link below with friends to get free TLDR swag!
https://refer.tldr.tech/66662a80/ [24]
Track your referrals here. [25]
Want to advertise in TLDR? 📰
If your company is interested in reaching an audience of tech
executives, decision-makers and engineers, you may want to ADVERTISE
WITH US [26].
Want to work at TLDR? 💼
APPLY HERE [27], CREATE YOUR OWN ROLE [28] or send a friend's resume
to
jobs@tldr.tech and get $1k if we hire them! TLDR is one of INC.'S
BEST BOOTSTRAPPED BUSINESSES [29] of 2025.
If you have any comments or feedback, just respond to this email!
Thanks for reading,
Dan Ni [30] & Stephen Flanders [31]
Manage your subscriptions [32] to our other newsletters on tech,
startups, and programming. Or if TLDR isn't for you, please
unsubscribe [33].
Links:
------
[1]
https://tldr.tech/signup?utm_source=tldr
[2]
https://advertise.tldr.tech/?utm_source=tldr&utm_medium=newsletter&utm_campaign=advertisetopnav
[3]
https://a.tldrnewsletter.com/web-version?ep=1&lc=265280dc-d4e0-11f0-a661-1b9fa34a893d&p=af2156ee-1c5d-11f1-b4f9-8d476af412ba&pt=campaign&t=1773138245&s=f19e06805b9a58235b231b265f2f0c4307623a8148f3abbcf7f86865391f8163
[4]
https://workos.com/blog/writing-my-first-evals?utm_source=tldr&utm_medium=newsletter&utm_campaign=q12026
[5]
https://workos.com/?utm_source=tldr&utm_medium=newsletter&utm_campaign=q12026
[6]
https://workos.com/docs/authkit/cli-installer?utm_source=tldr&utm_medium=newsletter&utm_campaign=q12026
[7]
https://links.tldrnewsletter.com/ceSKQX
[8]
https://links.tldrnewsletter.com/WM7Luk
[9]
https://restofworld.org/2026/china-tesla-robot-race/?utm_source=tldrnewsletter
[10]
https://press.asimov.com/articles/antibody-design?utm_source=tldrnewsletter
[11]
https://www.qawolf.com/?utm_source=tldr&utm_medium=newsletter&utm_campaign=ACQ_All_Demo_Conversions__NewsletterAudience_-_Newsletter_CutQACycles_20260310-None_Experiment-FALSE&utm_term=headline-CutQACyclesFromHoursToMinutesWithAutomatedTesting&utm_content=CutQACycles_ScheduleADemoToLearnMore_None_Headline%3ACutQACyclesFromHoursToMinutesWithAutomatedTesting____Newsletter-SecondaryPlacement_20260310_v1_
[12]
https://www.qawolf.com/?utm_source=tldr&utm_medium=newsletter&utm_campaign=ACQ_All_Demo_Conversions__NewsletterAudience_-_Newsletter_CutQACycles_20260310-None_Experiment-FALSE&utm_term=body-QAWolf&utm_content=CutQACycles_ScheduleADemoToLearnMore_None_Headline%3ACutQACyclesFromHoursToMinutesWithAutomatedTesting____Newsletter-SecondaryPlacement_20260310_v1_
[13]
https://www.qawolf.com/?utm_source=tldr&utm_medium=newsletter&utm_campaign=ACQ_All_Demo_Conversions__NewsletterAudience_-_Newsletter_CutQACycles_20260310-None_Experiment-FALSE&utm_term=cta-ScheduleADemoToLearnMore&utm_content=CutQACycles_ScheduleADemoToLearnMore_None_Headline%3ACutQACyclesFromHoursToMinutesWithAutomatedTesting____Newsletter-SecondaryPlacement_20260310_v1_
[14]
https://links.tldrnewsletter.com/HZb2U3
[15]
https://simonwillison.net/2026/Mar/9/not-so-boring/?utm_source=tldrnewsletter
[16]
https://arstechnica.com/space/2026/03/after-years-of-missteps-blue-origin-to-finally-offer-meaningful-stock-options/?utm_source=tldrnewsletter
[17]
https://www.theregister.com/2026/03/09/amazon_petitions_to_block_spacexs/?utm_source=tldrnewsletter
[18]
https://links.tldrnewsletter.com/xCWXEF
[19]
https://ghostty.org/docs/install/release-notes/1-3-0?utm_source=tldrnewsletter
[20]
https://techcrunch.com/2026/03/09/bluesky-ceo-jay-graber-steps-down/?utm_source=tldrnewsletter
[21]
https://planetscale.com/blog/video-conferencing-with-postgres?utm_source=tldrnewsletter
[22]
https://writing.nikunjk.com/p/10x-is-the-new-floor?utm_source=tldrnewsletter
[23]
https://codeberg.org/robida/human.json?utm_source=tldrnewsletter
[24]
https://refer.tldr.tech/66662a80/
[25]
https://hub.sparklp.co/sub_e87ba9c7d7c0/1
[26]
https://advertise.tldr.tech/?utm_source=tldr&utm_medium=newsletter&utm_campaign=advertisecta
[27]
https://jobs.ashbyhq.com/tldr.tech
[28]
https://jobs.ashbyhq.com/tldr.tech/c227b917-a6a4-40ce-8950-d3e165357871 [29]
https://www.linkedin.com/feed/update/urn:li:activity:7401699691039830016/ [30]
https://twitter.com/tldrdan
[31]
https://twitter.com/SteveFlanders22
[32]
https://tldr.tech/tech/manage?email=tldrnewsletter%40synchro.net
[33]
https://a.tldrnewsletter.com/unsubscribe?ep=1&l=cfa2d55a-b7be-11e8-a3c9-06b79b628af2&lc=265280dc-d4e0-11f0-a661-1b9fa34a893d&p=af2156ee-1c5d-11f1-b4f9-8d476af412ba&pt=campaign&pv=4&spa=1773136904&t=1773138245&s=d5b01b4d7dd7b1d5c4db8b776f5026590142bc18b9bbff056dcab0ed6504933c
---
■ Synchronet ■ Vertrauen ■ Home of Synchronet ■ [vert/cvs/bbs].synchro.net