⚡ Data centers’ power surge, Nvidia’s open AV model, and Perplexity with memory

AI demand could triple power needs by 2035—pressuring PJM and Texas; Nvidia’s Alpamayo‑R1 narrates its driving plans; Perplexity adds opt‑in, portable memory across chats.

16 articles across 4 sections

🧠 AI News & Trends

🛠️ Dev Tools & Frameworks

DeepSeek-V3.2-Exp (Model)

DeepSeek-V3.2-Exp debuts DeepSeek Sparse Attention to boost long-context efficiency with minimal output trade-off, matching V3.1-Terminus in benchmarks and resolving prior RoPE indexing issues.

FLUX.2: Frontier Visual Intelligence (Model)

FLUX.2 delivers high-detail, photoreal image generation with multi-reference support, improved text rendering, and stronger world knowledge, powering creative workflows across open and pro model tiers while advancing Black Forest Labs’ open-core approach.

DeepSeekMath-V2 (Model)

DeepSeekMath-V2 uses a self-verifying approach to train a proof generator and verifier that improve mathematical rigor, enabling strong theorem-proving performance, including gold-level results on IMO 2025 and a 118/120 score on Putnam 2024.

Introducing AI assistants with memory (Tooling)

Perplexity now uses a secure memory system that stores preferences and past conversations to deliver more precise, personalized answers across all models, offering continuity, context portability, and full user control over what is remembered.

⚡ Quick Bits

📌 Deep Dive

How prompt caching works (Guide)

This guide explains how prompt caching really works: KV-cache reuse requires identical prompt prefixes across requests, and vLLM's paged attention enables it by mapping fixed-size token blocks to hashed, reusable memory pages for efficient, scalable inference.
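To make the prefix requirement concrete, here is a minimal sketch of the block-hashing idea in plain Python (illustrative only, not vLLM's actual code): each fixed-size token block is hashed together with everything before it, so a cached KV page can only be reused when the entire prefix matches. The cache dict, block size, and helper names are assumptions for the example.

```python
# Sketch of prefix/block caching: a block's hash covers the block AND its full
# prefix, so reuse stops at the first token that differs between requests.
import hashlib

BLOCK_SIZE = 16  # tokens per block (vLLM's default block size is also 16)

def block_hashes(token_ids: list[int], block_size: int = BLOCK_SIZE) -> list[str]:
    """Return one hash per full block; each hash covers the block and its prefix."""
    hashes, prefix = [], b""
    full_len = len(token_ids) - len(token_ids) % block_size  # ignore partial tail
    for start in range(0, full_len, block_size):
        block = token_ids[start:start + block_size]
        prefix += b"|" + b",".join(str(t).encode() for t in block)
        hashes.append(hashlib.sha256(prefix).hexdigest())
    return hashes

# Hypothetical cache mapping block hash -> KV-cache page id.
kv_page_cache: dict[str, int] = {}

def plan_request(token_ids: list[int]) -> tuple[int, list[str]]:
    """Count how many leading blocks hit the cache; the rest must be recomputed."""
    hashes = block_hashes(token_ids)
    hits = 0
    for h in hashes:
        if h in kv_page_cache:
            hits += 1
        else:
            break  # reuse stops at the first divergent block
    return hits, hashes

if __name__ == "__main__":
    shared_system_prompt = list(range(40))          # 40 identical prefix tokens
    req_a = shared_system_prompt + [100, 101, 102]  # two requests sharing a prefix
    req_b = shared_system_prompt + [200, 201]

    # First request populates the cache with its full blocks.
    _, hashes_a = plan_request(req_a)
    for page_id, h in enumerate(hashes_a):
        kv_page_cache.setdefault(h, page_id)

    cached_blocks, _ = plan_request(req_b)
    print(f"Request B reuses {cached_blocks} cached block(s), "
          f"skipping prefill for {cached_blocks * BLOCK_SIZE} tokens")
```

Run as-is, the second request reuses the two full blocks covered by the shared system prompt and only recomputes the tokens after the divergence point, which is why identical prefixes (same system prompt, same ordering) are what make prompt caching pay off.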

5-Minute Weekly AI Briefing for Busy Developers

A curated weekly roundup of AI news, tools, and breakthroughs to stay ahead of the curve in a fast-moving industry.

Curated by devs, for devs. Helping 400+ developers stay ahead of the AI curve.