I don't think ChatGPT Agent is all the way there yet, and obviously there is a lot of good use for inserting AI-powered formulas into spreadsheets, but Excel's AI integration and ChatGPT Agent represent two very different bets on what the future of work may look like.
@emollick
-
Excel Copilot vs GPT-5 Agent: UX Design for AI
Been trying the new COPILOT function in Excel, and it is interesting & useful for things like categorizing, but I'm not sure inserting a prompt into a cell will be the right UX for using AI. For example, GPT-5 Agent can take & edit the whole spreadsheet & provide higher-level advice.
-
Macroeconomic Complexity Challenges Short-term Analysis
A lot of macroeconomic things happening at once, and a short timeframe, make teasing apart the details challenging.
-
AI Slowing Junior Hiring in US Tech Fields
The fact that junior hiring in AI-intensive fields has slowed down somewhat in the US seems pretty solid. The evidence linking it to AI is not yet established: we have seen a couple of solid attempts that suggest a connection, but it is really hard to tell for sure, given the data.
-
AI Model Reasoning Transparency and Tool Use Auditing Requirements
I understand that the actual reasoning trace might be obscured, either for IP reasons or because of the way these models work, but the model needs to provide evidence of its actual tool use for auditing and further exploration.
-
Generative AI Reduces Junior Hiring While Preserving Senior Roles
A second paper also finds Generative AI is reducing the number of junior people hired (while not impacting senior roles). This one compares firms across industries that have hired for at least one AI project versus those that have not. Firms using AI were hiring fewer juniors.
-
GPT-5 Pro needs transparency in code execution and analysis
That said, one thing OpenAI needs to do is let me see the analysis being done by GPT-5 Pro. I can't confirm its work without being able to see the Python code it ran and what the results were. The fact that the error it spotted was real helps, but the thinking trace is obscured.
-
Deep-Thinking AI Models Undersold: Market Potential Unrealized
GPT-5 Pro and Gemini 2.5 Pro Deep Think are both very impressive models for hard problems. I think they were both undersold during their respective launches, in part because I am not sure the labs themselves really understand the market for a slow, "deep-thinking" model yet.
-
Image Generation Flaws Need Comprehensive Benchmark Testing
Yes, that makes sense. Image generation is deeply flawed today. Would love to see a range of these questions used across a variety of image generators and repeated over time. Would be an interesting benchmark.
-
Measuring AI Progress: Benchmarks and Advancement Over Time
It is worth measuring a wide range of benchmarks to see strengths and weaknesses. But benchmarks need to be repeated over time to measure progress (I have never claimed progress on all of them, btw). This is your prompt in Midjourney v1. Clearly large advances since then.