Tech / AI / IT Intelligence Briefing
Period: March 25–26, 2026 | Generated from 205 tweets
Executive Summary
The open-source AI agent space is heating up significantly, with Hermes Agent gaining over 1,600 GitHub stars in 48 hours and positioning itself as a serious competitor to OpenClaw/Claude Code, emphasizing local-first, privacy-preserving operation. Several notable model releases emerged: Qwen 3.5 27B is drawing praise as a top agentic/tool-calling model fitting on a single RTX 5090, while Voxtral-4B-TTS (Mistral) and Cohere Transcribe represent new open-source pushes into speech. Andrej Karpathy posted a widely-shared vision for fully autonomous DevOps agents that could handle end-to-end deployment without human intervention. Meanwhile, discussion around Google's TurboQuant (3-bit KV cache compression) and distributed inference on Apple hardware (exo + RDMA) reflects ongoing experimentation at the hardware/inference layer.
Key Events
-
Hermes Agent surges to 13,300 GitHub stars (+1,600 in 48 hours), with community widely comparing it favorably to OpenClaw on memory management, architecture, and local-first design; a new feature teasing model "jailbreaking/locking" for agents was hinted at. → link
-
Qwen 3.5 27B declared "release of 2026 so far" — agentic model with 256K context, ~28GB in NVFP4, fits on single RTX 5090, described as Claude Sonnet 4.6–quality locally. → link
-
Andrej Karpathy publishes detailed vision for fully autonomous DevOps agents: "build menugen" → deployed web app, zero human clicks — calls this the "actually hard part," notes it's "now just barely technically possible." → link
-
Voxtral-4B-TTS new open-source TTS model released, described as sounding "beautiful" and threatening closed models; demo live on Hugging Face. → link
-
Cohere Transcribe launched — "state-of-the-art in open source speech recognition." → link
-
LLaDA 2.1 merged into Hugging Face Diffusers library — language diffusion models now mainstream and integrated into the standard ML toolchain. → link
-
Apple RDMA (macOS 26.2) + exo cluster: 6× M1 Max Mac Studios pooling memory to run MiniMax M2.5; exo shipped day-0 support; distributed training on MacBooks via AirDrop also demoed. → link
-
OpenClaw gets new Telegram maintainer (@izhukov) and Microsoft Foundry MCP browser-driving support; steipete describes "the human is no longer the bottleneck." → link
-
Google TurboQuant skepticism: 3-bit KV cache compression only tested on 8B models; unknown behavior at 27B–70B scale; community urges caution. → link
-
OpenAI shuts down erotic ChatGPT and Sora features within 48 hours — noted as a policy reversal moment. → link
-
Meta Hyperagents described as "the logical outcome of coding agents"; blog post on shipping production code with 200 autonomous agents circulating. → link
-
Claude Code used to replicate a $199 commercial TV countdown tool in 15 minutes, then open-sourced for free — illustrating "zero moat left except taste." → link
-
HF Papers CLI released for AI agent retrieval over arXiv — enabling autoresearch pipelines. → link
-
OpenCode (thdxr) ships distributed agent architecture: agents run on laptop, remote server, or cloud sandbox; state syncs and persists even when laptop is closed. → link
-
Kimi/Moonshot "Attention Residuals" introduced at GTC — selective memory architecture rather than mechanically accumulating all context. → link
-
RealGeneKim publishes long-form analysis: organizations that narrowed developer roles to "just typing code" may have inadvertently made developers maximally vulnerable to AI automation; self-efficacy is the key variable. → link
-
AI bot problem on X: @levelsio reports muting/blocking ~20,500 AI bots (~2.5% of followers); X ships a "followers-of-followers" reply restriction celebrated as a meaningful fix. → link
Analysis
Patterns
Open-source local AI is consolidating around Hermes Agent. The rapid star growth and consistent community testimony about its advantages over OpenClaw (better memory management, no per-token cost, local-first) suggests a genuine inflection point. The NousResearch team (Teknium) is actively amplifying this. Watch for Hermes Agent reaching 100K stars as a signal of mainstream open-source agent adoption.
The "agentic DevOps" narrative is crystallizing. Karpathy's post, the OpenCode distributed agent feature, Meta's 200-agent production system, and steipete's MCP browser-driving of Azure Foundry all point to the same direction: the next frontier is not code generation but full lifecycle automation including deployment, debugging, and ops. The tooling is converging fast.
Hardware democratization is accelerating. The exo + Apple RDMA story (6× M1 Max = capable cluster for ~$7.2K secondhand) and advice to use RTX 3090s over H100s for local experimentation signal that capable AI inference is moving far down the cost curve. This is enabling a new class of indie/open-source AI practitioners.
Speech/audio open-source is having a moment. Voxtral-4B-TTS and Cohere Transcribe arriving in the same 24-hour window suggests the open-source stack is closing the gap with closed speech models.
Skepticism about AI hype is growing. Multiple practitioners (alexocheema, TheAhmadOsman, sudoingX) are posting warnings about fabricated benchmarks, LLM echo chambers reinforcing delusions, and premature GPU spend. This is a healthy counter-signal to the hype cycle.
What to Watch Next
- Hermes Agent permanent sub-agents (mentioned as coming soon) — could make it a full background autonomous system
- Google TurboQuant validation at larger model scales (27B–70B)
- exo + Apple RDMA maturation — could redefine consumer-grade model hosting
- OpenClaw vs. Hermes Agent community split — could fork the open-source agent ecosystem
- OpenAI content policy direction following erotic feature rollbacks
- Claude Code / Grok model versioning confusion (levelsio notes Claude Code defaults to Grok 3 instead of Grok 4.1) — signals messy multi-model routing in coding agent UIs
Tweet Feed
🤖 AI Models & Releases
@TheAhmadOsman · 2026-03-26T04:09
Qwen 3.5 27B is the release of 2026 so far — Agentic model & great at tool calling, Claude Sonnet 4.6 quality at home, ~28GB in NVFP4, fits on a single RTX 5090, with full context (256K). Amazing model & performance.
@victormustar · 2026-03-26T15:54
New Voxtral-4B-TTS is beautiful 💛 — it sounds so good that it's whispering at closed models: "your time is over" - Demo available on Hugging Face
@victormustar · 2026-03-26T13:53
RT @cohere: Introducing: Cohere Transcribe – a new state-of-the-art in open source speech recognition.
@victormustar · 2026-03-26T10:56
RT @multimodalart: LLaDA 2.1 is now merged to the diffusers library 🧨 — language diffusion models are mature and usable. and now integrated…
@crystalsssup · 2026-03-26T03:35
RT @Kimi_Moonshot: Zhilin at GTC: Introducing Attention Residuals — Learning selective memory, rather than mechanically accumulating everything…
🛠️ Agentic AI & Developer Tools
@karpathy · 2026-03-26T16:10
When I built menugen ~1 year ago, I observed that the hardest part by far was not the code itself, it was the plethora of services you have to assemble like IKEA furniture to make it real, the DevOps… I am really looking forward to a day where I could simply tell my agent: "build menugen" and it would just work. The whole thing up to the deployed web page… This is the actually hard part, not the code itself.
@thdxr · 2026-03-25T21:57
james has achieved distributed opencode — agents can run on your laptop, on a remote server, in a cloud sandbox provider. shut your laptop and things keep running. open it back up and all the data syncs. delete the sandbox nothing is lost.
@jezell · 2026-03-25T22:28
Hyperagents from meta are the logical outcome of coding agents.
@jezell · 2026-03-25T22:12
RT @xiaopeng163: Beyond Vibe Coding: How We Ship Production Code with 200 Autonomous Agents
@jezell · 2026-03-26T07:06
One of the nice side effects of codex is almost everyone on the team is now directly contributing bugfixes and minor user interface improvements directly, rather than filing issues and letting the backlog pile up.
@steipete · 2026-03-26T14:38
Testing a new feature for Microsoft Foundry support in @openclaw. Their website is a jungle, I used to make screenshots so codex can guide me through it, but now Chrome has an MCP so codex can simply connect and drive my browser session and do all of that for me. The human is no longer the bottleneck.
@levelsio · 2026-03-26T19:29
FYI for some reason Claude Code will always default you to use Grok 3 and not Grok 4.1 but you should use Grok 4.1 or Grok 4.2 (but that one is more expensive)
@RealGeneKim · 2026-03-26T19:23
I love it when Claude Code says things like this: When the "linter" is actually another Claude Code. 😂
@steipete · 2026-03-26T11:29
RT @mazdak: You are welcome to use Claude Code if you want, but remember that your competitor, who's about to kick your ass, is using Codex.
@thdxr · 2026-03-26T13:22
we're doing a lot of introspection on our product process — the default place for our new coding agent abilities to go to is to work on the wrong things. products going from good to bad faster than ever.
@thdxr · 2026-03-25T23:52
now that we have most features we want implemented, we're doing a pass to decompose them all into plugins… now all new features we want to build will be internal plugins, so the plugin api will have to be good.
@LinusEkenstam · 2026-03-26T10:30
RT @techyygarry: I built a watchOS app for Claude Code so Anthropic doesn't have to. Send prompts. Build websites. Review diffs. Approve c…
@juliarturc · 2026-03-26T17:46
This is the golden age of personal education. Claude, in particular, is nailing it. Not only are its answers good, but it becomes interactive at the exact right time: an animation with controls, a list of sub-topics to explore next… If you're an ambitious 15yo, THIS is your leverage.
🦾 Hermes Agent vs. OpenClaw
@sudoingX · 2026-03-26T05:36
11.7K two days ago. 13.3K today. 1,600 stars in less than 48 hours because you all showed up for majestic Hermes agent… openclaw has corporate backing and charges you per token. hermes agent is fully open, fully local focused, fully free.
[→ tweet link](https://x.com/sudoingX/