Tuesday, June 02, 2026

Articles about LLMs and their applications

Every week, one of my colleagues sends around a survey on behalf of my manager+1, asking for anecdata about productivity gains from using so-called "AI" coding assistants. They are very explicit that they only want to hear about wins and not about downsides. Great example of, "How not to do a survey," but eh, this is the state of the software "engineering" in 2026.

Anyway, rather than engage with... all that, I've started sending around some reading material to our local team here in Scotland each week. I'll archive them in this post, which will occasionally be updated.

2026-06-05 Amnesty International: Exposing the human rights costs of generative AI

In case you missed it at the end of May, Amnesty International has published a new briefing on generative AI products, focussing on the data capture, analysis and processing that goes into building these products.

Unlawful by design

https://www.amnesty.org/en/documents/pol40/0996/2026/en/

This briefing examines how standalone generative AI systems, based on unlawful web scraping, are in conflict with international human rights law (IHRL) and standards through their design, development and deployment. While these technologies promise sophisticated automation and efficiency, they rely on data collection and model training practices that abuse privacy rights, enable discrimination, and threaten freedom of expression and thought. Amnesty International finds that standalone generative AI systems, based on unlawful web scraping, depend on mass invasions of privacy by design, and are fundamentally incompatible with IHRL. As such, Amnesty International is calling for a prohibition of such systems.

2026-05-29 It's OK To Want To Have A Good Time

In our ongoing investigation into the nature of "productivity", here is an interesting paper recently presented at the 9th International Conference on the Art, Science and Engineering of Programming.

https://doi.org/10.4230/OASIcs.Programming.2025.5

By far the biggest productivity problem in software development is understanding the purpose for which the software is being written, and not having to throw it away and do it again; something that studies of productivity rarely include. We’re not suggesting that all developer productivity research is bad – we certainly don’t think that is the case. What were are trying to do is to highlight that the total scope of the professional activities of a software engineer are wide, varied and complicated.

2026-06-02 Understanding the Strengths and Limitations of Reasoning Models

A friend directed my attention to this study at Apple, which found that so-called 'Large Reasoning Models' completely collapse when asked complex questions. The results and conclusions of their study are particularly interesting (and they have some nice visualisations.

The Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models via the Lens of Problem Complexity

https://arxiv.org/pdf/2506.06941

Recent generations of frontier language models have introduced Large Reasoning Models (LRMs) that generate detailed thinking processes before providing answers. While these models demonstrate improved performance on reasoning benchmarks, their fundamental capabilities, scaling properties, and limitations remain insufficiently understood.

...

By comparing LRMs with their standard LLM counterparts under equivalent inference compute, we identify three performance regimes: (1) low- complexity tasks where standard models surprisingly outperform LRMs, (2) medium-complexity tasks where additional thinking in LRMs demonstrates advantage, and (3) high-complexity tasks where both models experience complete collapse. We found that LRMs have limitations in exact computation: they fail to use explicit algorithms and reason inconsistently across scales and problems.

2026-05-15 James Shore: You Need AI That Reduces Maintenance Costs

We have previously discussed what, exactly, does "productivity" mean in the context of using "AI" tools?

https://www.jamesshore.com/v2/blog/2026/you-need-ai-that-reduces-your-maintenance-costs

I’ll get straight to the point: your AI coding agent, the one you use to write code, needs to reduce your maintenance costs. Not by a little bit, either. You write code twice as quick now? Better hope you’ve halved your maintenance costs. Three times as productive? One third the maintenance costs. Otherwise, you’re screwed. You’re trading a temporary speed boost for permanent indenture.

2026-05-07 AI study from Carnegie Mellon, MIT, Oxford and UCLA

A new paper was just released by a multi-institution team of researchers:

AI Assistance Reduces Persistence and Hurts Independent Performance

https://ai-project-website.github.io/AI-assistance-reduces-persistence/

...after just ∼10 minutes of AI-assisted problem-solving, people who lost access to the AI performed worse and gave up more frequently than those who never used it. These findings raise urgent questions about the cumulative effects of daily AI use on human persistence and reasoning. We caution that if such effects accumulate with sustained AI use, current AI systems — optimized only for short-term helpfulness — risk eroding the very human capabilities they are meant to support.

Tuesday, May 26, 2026

Double Chocolate Peanut Butter Cookies

It's been... a while... since I posted a recipe here.

Today's recipe is for some cookies that have been pretty popular recently in my household. The recipe makes 16 large cookies.

Ingredients

  • 100 g butter (softened at room temperature)
  • 100 g soft dark brown sugar
  • 100 g white sugar (granulated or caster)
  • 1 large egg (duck egg if available)
  • pinch of salt
  • 150 g self-raising flour
  • 1.5 tbsp cocoa powder
  • 100 g chocolate chips
  • 50 g rolled oats
  • 75 g crunchy 100% peanut butter

Method

Prepare 2 large baking trays with silicon baking sheets. Preheat the oven to 160 °C fan.

Cream the butter and sugar together until soft and fluffy. Thoroughly mix in the egg.

Mix in the flour, salt and cocoa powder. Fold in the chocolate chips and oats.

Add the peanut butter by swirling it in, so that it's distributed through the dough but not fully mixed in.

After portioning out the dough onto the baking sheets (I recommend using 2 desert spoons to shape each blob of dough), bake in the oven for 9 minutes.

Allow to cool for 15–20 mins before serving (and even then, they'll be pretty gooey).

These cookies will keep well for 4–5 days in a sealed container, but I expect they'll be eaten before then.