- 标签:
- AI (136)
- Daily (111)
- Tech Trends (111)
- 技术趋势 (18)
- 周报 (18)
- 推荐系统 (16)
- 日报 (15)
- Recommendation Systems (11)
- Weekly (11)
- Papers (11)
- Agentic Engineering (7)
- 思考 (6)
- 论文 (6)
- 深度学习 (4)
- 工具 (3)
- Harness Engineering (3)
- 推荐 (2)
- 强化学习 (1)
- 思维模型 (1)
- Transformer (1)
- LLM (1)
- 管理 (1)
- 生成式 (1)
Anthropic shattered expectations today, closing a $65B Series H at a $96.5B valuation — surpassing OpenAI to become the world's most valuable AI startup — while simultaneously launching Claude Opus 4.8, its strongest coding model yet. Meanwhile, Meta's SilverTorch redefined recommendation system ret
AI coding and agent infrastructure dominated the news cycle. Cognition AI raised $1B at a $26B valuation, while Fireworks AI is reportedly in talks at $15B — the AI coding race is heating up fast. On the technical side, NVIDIA open-sourced Polar for GRPO training across agent tools, Hugging Face sla
AI's commercial landscape flipped today: Anthropic's revenue likely surpassed OpenAI by at least 35%, driven by enterprise preference for safety and reliability. Meanwhile, AI infrastructure hit a new milestone — Fireworks AI ($15B) and Baseten ($11B) became decacorns, marking the "inference inflect
AI hit major milestones today: OpenAI and Google DeepMind both cracked decades-old Erdős math problems — the first time AI has made such a fundamental mathematical breakthrough. On the efficiency front, HRM-Text trained a SOTA 1B model for just $1,500, challenging the scaling law orthodoxy, while De
Today's report covers a mix of big-picture strategy and hands-on tools. The standout is Ben Evans' deep dive on AI job exposure, which challenges the popular "exposed or not" charts with historical data and counterintuitive logic. On the ground, we see real cost pain: Microsoft banned Claude Code fo
Today's AI landscape is dominated by a single, loud signal: every major model lab is pivoting to become an agent lab. From OpenAI's subtle shift to DeepSeek's new "Harness" team, the race is no longer about the best model — it's about the best agent system. We also see a flurry of open-source releas
Only one narrative thread matters for 2026-W21: agents have formally shifted from "model capability" to "system infrastructure." Google I/O 2026 was the explosion point — Gemini 3.5 Flash packages "frontier intelligence + action" into an API that runs 4x faster at half the cost, Managed Agents lets developers define agents in YAML and deploy into a cloud sandbox, and Antigravity pushes agents into the desktop and background. But Google isn't alone: Qwen3.7-Max landed the same week with 35-hour autonomous execution, Daytona's sandbox infrastructure hits 850k runs per day, and IBM/Hugging Face's Open Agent Leaderboard evaluates full agent systems for the first time, not just models. Three signals point to the same judgment — agents are climbing the infrastructure steep from demo to deployment. The framework layer (Langflow, Multica, 12-Factor Agents) tackles orchestration and observability, the sandbox layer (Daytona, Alibaba Cloud AgentRun, AWS blog solution) handles security and state management, and the evaluation layer (Open Agent Leaderboard, Cameron Wolfe guide) answers "how do I know my agent is good?" Meanwhile, NVIDIA, Together AI, Amazon, and other labs released a dense set of training/inference optimization papers — IXT, Dynatrain, CODA, DualKV — that push efficiency boundaries at the system level. The second thread: autonomous scientific discovery moves from academic speculation to verifiable results. An OpenAI model autonomously solved a discrete geometry conjecture posed by Erdős in 1946 for the first time — Sam Altman called it "a big milestone." Meta FAIR's AIRA system had agents autonomously design neural network architectures that outperform Llama 3.2. These events are few but high-signal: not "AI assists scientists," but "AI as discoverer." One bottom-layer warning this week: the ROPE mechanism's limitations in long contexts were formally proven (arxiv) by UIUC & Amazon AGI, suggesting the current positional encoding paradigm may need fundamental re
Today's report covers 8 articles (5 featured), 19 KOL tweets, 2 GitHub projects, and 2 podcast episodes. The big theme: specialization is beating scale — from a 3B model outperforming frontier APIs in OCR to diffusion models offering 6.5x speed gains over autoregressive generation. Meanwhile, AI's h
Today's AI landscape is dominated by Agent infrastructure — from how to provision compute for agents, to building multi-agent systems, to the economic models of an agent-driven web. We cover 19 articles (5 featured), 5 GitHub projects, 4 podcast episodes, and 30 KOL tweets. The big theme: agents are
Today's AI landscape is dominated by Google's massive I/O 2026 announcements, with the Gemini 3.5 series, Managed Agents, and Gemini Omni marking a clear shift toward agentic AI. The big picture: Google is betting big on agents that can act, not just think. Meanwhile, the open-source ecosystem respo
Today's AI landscape is dominated by two big themes: Agent evaluation is getting serious, and Agent infrastructure is going mainstream. We've got 18 articles total, with 5 featured in depth. The standout is the Open Agent Leaderboard from IBM & Hugging Face — a 5-star resource that finally benchmark