2026-03-18 18:26:08

Oh look, another new copilot for Microsoft Copilot. Here's Jordan Novet for CNBC:
Microsoft said Tuesday that it’s bringing together the engineering groups for its commercial and consumer Copilot assistants, which have yet to gain broad adoption.
Jacob Andreou, a former Snap executive who works in Microsoft’s artificial intelligence unit, will become an executive vice president in charge of the consumer and commercial Copilot experience, CEO Satya Nadella wrote in a memo to employees.
Andreou will report to Nadella. Executives Ryan Roslansky, Perry Clarke and Charles Lamanna, who will also report to Nadella, will lead Microsoft 365 applications and the Copilot platform, Nadella wrote.
While it undoubtedly makes sense to try to unify these disparate Copilot groups, which are a branding, if not organizational nightmare, the fact that all of these people will now report directly to Satya Nadella is sort of wild. Yes, there are now four copilots of Copilot – well, actually, five:
The Copilot moves will free up executive Mustafa Suleyman, a former co-founder of AI lab DeepMind that Google bought in 2014, to focus more on building new models.
“The next phase of this plan is to restructure our organization to enable me to focus all my energy on our Superintelligence efforts and be able to deliver world class models for Microsoft over the next 5 years,” Suleyman wrote in a memo. “These models will enable us to build enterprise tuned lineages that help improve all our products across the company.”
"Free up" seems like just about the most generous way possible to frame this shift. As it's a pretty clear acknowledgement that Suleyman's endless efforts to create AI products for Microsoft to compete with ChatGPT and Gemini have failed. It now even looks like Microsoft is quickly falling behind Claude...
Microsoft’s Copilot app had 6 million daily active users in February, while OpenAI’s ChatGPT had 440 million and Google’s Gemini had 82 million, according to data from app analytics company Sensor Tower.
Sensor Tower said that so far in March, Anthropic’s Claude, which has gotten extensive media attention because of Anthropic’s standoff with the U.S. Department of Defense, has reached 9 million daily users, while Copilot still stands at 6 million.
None of this is hugely surprising, in fact, it was one of my main predictions for 2026:
Meta and Microsoft reboot their AI efforts – again – When Microsoft pulled off the first "hackquisition" in the form of their Inflection deal, it was clear Satya Nadella felt it was a needed gamble to hedge against their OpenAI bet, which became problematic after "The Blip". In March, we'll hit the two year anniversary of that deal and Mustafa Suleyman being charged with getting Microsoft some sort of traction in AI. With the new OpenAI arrangement giving Microsoft more flexibility, Nadella – now in "founder mode" – is clearly going to want to see some results, fast. I don't see that happening so... do they make another big hackqusition? Do something else to try to vault into the race? Meanwhile, Meta obviously just made one such deal – a second one – to try to catch up in AI after dropping the ball with Llama. I remain skeptical that they can just buy their way back in. And if it's not working, how fast does Mark Zuckerberg act yet again? Another deal?...
Well, it's March. Almost two years to the date of the Inflection buy. Maybe a coincidence, maybe not. Regardless, a natural point to look back to see what is working and what is not.1 And seemingly very little is working, certainly when it comes to consumer and commercial AI.
2026-03-18 01:40:32

I honestly didn't mean to watch the entire NVIDIA GTC Keynote. I have another post I'm working on where a small sliver is relevant, but before I knew it, there I was sitting through an entire two and a half hour presentation. Thank god for 1.5x speed. But honestly, I didn't stop because I didn't want to stop. It was compelling! Because Jensen Huang is so compelling as a showman. Others are good at such presentations – notably Snap's Evan Spiegel – but no one is in command like this since you-know-who. And arguably Jensen's command is more impressive, because it's far more technical with products far less tangible. And despite such technicalities, he's able to weave a narrative that is actually captivating. Perhaps not quite to the mainstream, but certainly to the masses. The attendance and interest in this event has grown in such a way that he can't help but poke fun at it:
"I just want to remind you, this is a tech conference."
The first hour was more or less a history lesson as to how NVIDIA, and the AI industry itself, got here. And a love letter to CUDA, NVIDIA's software layer celebrating 20 years of being an all-important moat for the company. GeForce to pixel shaders to RTX. DLSS 5 – the horribly generic name for a new AI upscaling technology for video games – looks amazing and naturally is already quite controversial. Why? AI, of course.
Speaking of, we finally got into the meat of the presentation from here. A massive slide behind Jensen highlighted a simple message: three key moments over the past two years. ChatGPT in 2023 ushered in the generative AI era. OpenAI's o1 model (and really, o3 model) brought us into the reasoning era in 2024. And last year, Claude Code gave us the agentic era. With that, AI is now able to do productive work, and that means a new era for NVIDIA and AI in general: "The inflection point of inference has arrived."
To hear Jensen tell it, they've been planning on this all along. And certainly there's some truth there in that NVIDIA has been talking up inference for a few years now. But that's mainly been in a defensive manner to insist to everyone that it wasn't a potential vulnerability for this company, but was also the strength.
It felt like you could tell how much this criticism annoyed Jensen in the part of his presentation where he pretended to hold up a wrestling-style championship belt (virtually made for him by SemiAnalysis) on stage. NVIDIA has body slammed the competition! From the top rope! It wasn't even a close call for the "Token King", you see.

Except, NVIDIA's actual actions the past year suggest that it might have been getting a bit too close for comfort as agents and coding came fully into view. Or at the very least, an area of concern.
It was just a few short months ago that NVIDIA made their move to "hackquire" Groq, the AI chip startup focused on, what else? Inference. Why did a company with no concerns about their position in the inference market feel the need to make a $20B deal/no-deal – on Christmas Eve, no less?
Jensen actually addressed this on stage while showing off their (insanely fast) integration of Groq technology within NVIDIA's new stack. Basically, he acknowledged that the use of SRAM in Groq's purpose-built LPUs (Language Processing Units) gave a level of inference speed which NVIDIA couldn't hope to match with their HBM-based (high-bandwidth memory) systems.
But he was also quick to note that Groq couldn't go it alone with only that approach, that they needed the parameter sizes and context that NVIDIA's systems allowed for (because the Groq systems operate with much smaller amounts of their memory). In other words, the systems needed to be paired together, and NVIDIA figured out a way to do that with Dynamo (their "operating system for AI factories"). They can route the inference workflows depending on the situation and need.
Two more interesting elements of the deal: 1) doing this as a "hackquire" clearly allowed them to do it fast, otherwise there's no chance they'd be ready to have Groq technology integrated so quickly. 2) Groq's chips are actually made by Samsung, giving NVIDIA some diversification away from TSMC (and HBM)...
All of this is in line with the various reports about why NVIDIA did the deal – and why Groq felt the need to do the deal. But it also points to Jensen's narratives which seem to shift with the benefit of hindsight. In a way, this has seemingly been the story of NVIDIA from the get-go. He built the company to be perfectly oriented to ride waves that he couldn't possibly foresee coming, only to later note that it was all so obvious. To me, the key to NVIDIA is malleability, not necessarily seeing the exact future. And this Groq situation is no different.
And that's no less impressive, by the way!
Hopefully that helps Jensen calm down a bit when it comes to the competition from AMD and more recently Google with their TPUs. Then again, that fire even when they're crushing – sorry, body slamming – the competition is undoubtedly part of what makes Jensen, Jensen. And what makes NVIDIA the most valuable company on Earth.
Speaking of, did you hear that Jensen now believes they're going to do $1T in sales (well, purchase orders) through next year? Of course, you did, because he made that a focal point of the presentation and that, in turn, made the headlines easy. "DO YOU HEAR THAT WALL STREET? ONE TRILLION." Love, Jensen.
Forget him holding up the wrestling belt, right now I'm envisioning Jensen as Maximus in Gladiator – two literal men in two literal arenas – "ARE YOU NOT ENTERTAINED?" We are Jensen! We are! We bow to thee.
Anyway, Jensen believes these past two years has seen computing demand increase one million times over. That's a pretty precise number for something so imprecise, but he has his own math to back it up: the amount of token generation has increased by ("roughly") 10,000x and the amount of usage has increased by ("probably") 100x. There you go, 1,000,000x.
He then reiterated the most important equation in all of AI at the moment: how they're getting the math to work for the data center build-outs. Basically: if companies get more AI generation capacity, they can generate more tokens, that means revenues are going up, and also more people are using it, which in turn makes the AI smarter. The "positive flywheel system" as Jensen calls it.
Again, basically all of AI is built around this notion at the moment. OpenAI most overtly, but other less forward-facing companies such as private credit players doing the debt financings for many of these data center deals need this system to keep going. No one wants to see what will happen if a "negative flywheel system" starts. Including NVIDIA.
Still, while there are a number of macro risks lingering out there that could puncture The AI Bubble, NVIDIA shows no sign of slowing. In fact, their revenue growth has been increasing again in recent quarters, which is just astonishing at their current size. They also just posted the most profitable quarter ever. Well, for any company aside from one year when an oil crisis fueled Saudi Aramco to new heights. An oil crisis you say...
Never mind all that, at least for now. Jensen's message with all the above was loud and clear: all NIVIDA partners can make their infrastructure commitments with complete confidence. Please and thank you.
To that end, a large portion of the keynote was devoted to a Vera Rubin show-and-tell. Not only are the systems on track to ship in the second half of 2026 (yes, including the new Groq component) – "probably in the Q3 timeframe" – Microsoft has already installed one such rack in Azure. And what used to take two days to install, now takes two hours.
As for the rest of the roadmap, 'Rubin Ultra' – the 'tock' to the 'Rubin' 'tick' – is already taping out according to Jensen and set for 2027. Then comes 'Feynman' in 2028 (after American physicist Richard Feynman). It will be paired with the 'Rosa' (after American physicist Rosalyn Sussman Yalow) CPU and an all-new LPU made with Groq technology, the 'LP40' (no fun scientific naming scheme here, it seems).
From here, Jensen hit on the whole data-centers-in-space thing. But the most interesting aspect may have been how quickly he moved through it – even noting as such. To hear him tell it, there's still a lot of work to do here, most notably with the conduction/convection issues in the vacuum of space. He says they have great engineers working on how to do it but it's "very complicated". One suspects it will be pitched as decidedly less complicated during the SpaceX IPO roadshow...
Far more time was devoted to OpenClaw. Calling it the most successful open source project in history, and just as big of a deal as HTML and Linux, Jensen framed it as the operating system for the "agentic computer". But then he got to the catch: security. Which, naturally, is what NVIDIA viewed as their opportunity here – in particular within enterprise. And while 'NemoClaw' is dropping the 'Open' from the name, Jensen says it will still be open, and the fact that they worked with creator Peter Steinberger, who himself of course now works at OpenAI – would seem to give that credence. Still, we're clearly about to see every large company come in with their own solution here – Perplexity already has, and Meta just did with Manus (as predicted).
As an aside, this section also brought one of my favorite moments of the keynote as Jensen really had to stretch to turn the 'SaaS' acronym into 'AgaaS'. It took me a minute to figure out what he was saying with "A Gas" – but it's 'Agent-as-a-Service', which of course would more naturally brand itself as 'AaaS'. Butt, well...
We stayed on the topic of "open" as Jensen talked up NVIDIA's 'Nemotron' open models, with version 4 in the works, partnering with Black Forest Labs, Cursor, Mistral, Perplexity, Thinking Machines, and others. Honestly, the most interesting bit of this may have been the notion of future companies using token access as a recruiting tool. (Similar to part of Meta's recent MSL pitch for compute access.)
Jensen closed out the keynote with robots. Noting that NVIDIA had been working on self-driving technology for a long time – well before the current AI movement, of course – and now they seem to be right-place/right-time again, with much of the industry lining up with them in time for their own "ChatGPT moment".
Oh yes, and Olaf! A clear highlight for either anyone under 10, or with kids under 10, to see a walking/talking character from Frozen. For as good as Jensen is on stage, Olaf clearly needs some more reps as the interaction was a bit awkward. Still, of all the things to critique, a working, interactive robot on stage... I'll let it go.




2026-03-16 20:20:41
One studio to rule them all last night...
Warner Bros, the studio currently in the final stages of talks over a buyout by rivals Paramount, has taken a record 11 prizes from the 98th Academy Awards.
It ties the record for most wins by a studio at the Academy Awards with MGM, who took that number in 1959 with Ben-Hur, Paramount for 1997’s Titanic and New Line Cinema with The Lord of the Rings: The Return of the King in 2003. That outfit was later absorbed into Warner Bros.
Implied, but not explicit here: Warner Bros' tally was arguably more impressive because it wasn't just riding the coattails of one movie that dominated as those other studios did in their historic nights. Yes, One Battle After Another was the big winner – and rightfully so, IMO – with six wins, but Sinners came in with four big wins. And, of course, Weapons had its own big win in acting.
It points to just how dominant of a run Warner Bros had in 2025 – including at the box office, where they were incredibly beating even the mighty Disney until a last-minute surge at the end of the year. And the fact that they pulled it off while in the midst of a contentious battle to be bought is even more impressive!
Speaking of, this Oscar tally immediately turns Paramount from the biggest loser of the night with zero Oscars – though technically you can't lose if you weren't even nominated for any Oscars, I guess! – to a huge winner. Sure, at the end of the day the Oscars are still just sort of a silly awards show. But without question, it's the one with the most clout and with that, comes cultural relevancy and cachet. Paramount is the studio that had less of that than any other studio in 2025 and is buying the studio that had the most of it.
Of course, none of that equates to the actual and literal bottom line. Here's Kyle Buchanan writing about the night for The New York Times:
Unlike last year’s big winner “Anora,” one of the lowest-grossing films to ever take best picture, “One Battle After Another” grossed a hefty $209 million worldwide. That’s significantly more than Anderson’s past efforts like “There Will Be Blood” and “Phantom Thread,” but because of its reported $130 million budget and huge marketing costs, the film did not make its money back at the box office.
While Paramount is getting good prestige value here, it doesn't exactly equate into actual value thanks to Hollywood's often upside-down finances. See also: Apple's Oscar win of the night: F1. Great movie, and important for Apple in many ways. But not when it comes to that bottom line. On the other hand, both Sinners and Weapons were wildly profitable at the box office, so there's that.
Netflix, meanwhile, walked away with seven wins, the only other studio to take home multiple Oscars; tied for their best showing ever at the awards. In a different world, they just racked up an unprecedented 18 Oscars thanks to their agreed-to-be-acquired new studio. Instead, Paramount Skydance vaulted them as the only entity with more wins. For the low, low price of $111B.



2026-03-16 18:42:01
Well, it's not the full-on live-action reboot that many of us have been dreaming about for years, but it's something that perhaps is far more feasible at this point:
Firefly fans are in for a treat.
Nathan Fillion has just revealed at Awesome Con that an animated Firefly series is in advanced development based on the beloved cult sci-fi franchise—and Deadline has the details.
Fillion spoke on a panel at the Washington, D.C. event and live taping of his “Once We Were Spacemen” podcast alongside co-stars Alan Tudyk, Gina Torres, Jewel Staite, Morena Baccarin, Sean Maher, and Summer Glau, all of whom are expected to reprise their roles. Adam Baldwin, who played Jayne Cobb, will also lend his voice to the project.
Yes, they're getting the whole band back together.1 Well, aside from creator Joss Whedon, for obvious and unfortunate reasons. Though he apparently did give his "blessing" to the project.
Sounds like the script is done and the stage set thusly:
The proposed series is set in the timeline between the original 2002 television run and its 2005 feature film continuation, Serenity, expanding the universe while preserving continuity with the established lore. Early concept art has been developed in collaboration with the Oscar and Emmy-award-winning animation studio ShadowMachine, which can be found below. The fully assembled package is expected to be taken out to buyers shortly.
This seems smart. If you're not going to do a live-action reboot, perhaps lean in to something that would be impossible (well, without some major de-aging tech and/or full-on AI, which would just piss everyone off). With animation, it obviously doesn't matter if you're using the same cast for a story set 20+ years ago. Otherwise you'd be more or less forced to do a story set 20+ years after the events of the movie. And it allows them to bring back at least one key character who was killed in Serenity...
As for where this will play/stream, that's unclear. Fillion production banner is driving it with 20th Television – from what used to be 20th Century Fox. That entity is obviously owned by Disney now, so presumably Disney+ would be the front-runner to stream this just given the rights, but they could sell it, I suppose. And it's slightly more complicated given that Universal actually owns the film rights, though given this is a series set before the film, they should be in the clear?
Almost exactly 15 years ago, Firefly was my jumping off point for an idea I had for Netflix: saving cancelled cult hits. This was just as Netflix was kicking off their own content production and it seemed to me like it could be an easy way to attract natural viewers. Netflix and others ended up doing just that – I also mentioned the notion of bringing back Arrested Development, which is exactly what Netflix did – but Firefly was left adrift... Until now!
Personally, I'm still going to entertain the possibility of a live-action revamp and perhaps this animated attempt helps that along... We're a long way from the days of a cancelled TV show using surprising popularity on DVD to make a movie.


1 Sadly without Ron Glass, who played Shepherd Book, but passed away a decade ago. ↩
2026-03-14 20:43:11

Last July I wrote a post entitled "We're Seemingly Still in the 'Throw Money At It' AI Era..." The crux was that given Elon Musk's ability to get xAI's Grok models up to technical speed and the fact that Mark Zuckerberg was completely rebooting Meta's AI efforts seemingly showed that all it took to catch up in AI maybe the ability to spend massive amounts of money, fast.
Well, as it turns out, that may not be all it takes...
2026-03-13 18:41:08

The biggest question I had about the forthcoming 'iPhone Fold' is which OS it would run. Given that it's still an iPhone, you'd assume iOS. But that would presumably require some new flavor of iOS given the whole foldable screen element. That mixed with the fact that it sure looked like the interior screen, when unfolded, would be similar to an iPad mini led me to wonder if we might get a true hybrid: iOS on the outside, iPadOS on the inside. A sort of mullet device.
Not so, says Mark Gurman in his latest report about the device for Bloomberg:
Apple is developing new iOS app layouts and revamping its core iPhone programs to add sidebars along the left edge of the screen, similar to many of its iPad apps. Developers will also be able to adapt their iPhone software for the new interface, which will use proportions similar to an iPad in landscape mode.
Despite offering an iPad-like app experience, the foldable iPhone will run the standard iOS — not iPadOS, the company’s tablet operating system. This means it will retain a simpler multitasking system, rather than adopting the more desktop-like interface introduced in iPadOS 26. It also won’t run existing iPad apps out of the box.
While the foldable iPhone won’t run several windows at once like an iPad mini, it will be able to show two apps side by side. That matches a key feature of the foldable phones offered by Samsung, Alphabet Inc.’s Google and others.
So the experience will be "iPad-like" but not actually iPadOS. Instead, we get a version of iOS that looks more like iPadOS, but without the true windowed multitasking (though we can argue about how "true" that system is, even now). This sounds potentially confusing for iPad owners – in particular if the inner screen truly is iPad mini-like in size – if it looks like an iPad, but can't do iPad things. But it undoubtedly would have been more confusing to run two different OSes with different functionalities, simply depending on if the device was folded or not. And there's no real point in running standard iOS (just scaled to a new screen size) without taking advantage of the new screen real estate in some way. So this sounds like the compromise: iPadOS-looking iOS.
I'm just as curious about the actual form-factor here, because it continues to sound – and look, via CAD leaks! – like it will be quite different from other foldables to date. I've had a few versions of the Pixel Fold now, which I like and has gotten better with each iteration, but it's really just exactly what it sounds like: a fairly standard smartphone that can unfold into a bigger-screen smartphone. Yes, it's interesting that it's a more square-like experience inside – and yes, it can do the app side-by-side thing – but it still just feels like a smartphone. I think there's a pretty good chance that the 'iPhone Fold' (if Apple even calls it that – all bets are off after the wildcard 'MacBook Neo' naming scheme) will actually feel like something a bit different than just a "foldable iPhone" given the form factor.
In my mind at least, it almost seems more like a 'iPhone mini' on the outside, while yes, an 'iPad mini' on the inside. But this new iOS will be key to the experience, of course. Does Apple tailor it to leverage those new screens in ways that differentiate it from a "regular" iPhone? I suspect they might! Maybe that front screen becomes your sort of "vital" apps/widgets screen. Things you always open and/or apps for fast actions. While inside, that's where you do your real "work" (or play, I suppose!).
I obviously won't even pretend that I'm torn on getting one. I'll certainly have to to see what it's like. I am a bit torn on leaving the old, trusty "standard" iPhone form-factor behind, but would I really "dual-wield" like I'm Ahsoka or something? That's a pretty expensive proposition!
Speaking of, there's also some new leaks around potential pricing for the device. Are you ready? Take a deep breath...
According to the Weibo-based leaker Instant Digital – who had a strong track-record on such things – the range will be from $2,320 to $2,900, depending on storage options (from 256GB up to 1TB). That's, um, quite a bit above my early (and obvious) $1,999 starting point guess I made a year ago.
Of course, those numbers are extrapolated from the more rounded prices in Chinese Yuan by the leaker. When I asked AI to extrapolate how those implied numbers might translate into USD, sure enough, the $1,999 starting price point seems quite likely. Then again, given everything going on with chip shortages and pricing issues around memory... who knows. Perhaps not even Apple just yet.
Would anyone buy a $1,999 iPhone? I mean, yes, it's Apple.
And some already do, as Apple hit that mark with the 2TB iPhone 17 Pro Max option. What about a fully-loaded 'iPhone Fold' at, say, $2,999? It wasn't that long ago that we were debating the viability of a $1,000 iPhone. That seems quaint now. The iPhone Pro models are huge hits for Apple. There's undoubtedly still room to grow – and importantly for Apple, grow that ASP.
One more thing: As I wondered in my prediction post for the year, what if any sort of newfangled experience for the 'iPhone Fold' also included a new type of accessory? Maybe an 'Apple Pencil Mini' – heaven forbid, the actual 'Apple Stylus'?!


