Connect with us

Technology

AI chatbots are bad at planning, but this could soon change

Published

on

But how exactly will this innovation change the capabilities of AI chatbots? (Pexels photo)

We might soon see AI step up to the next level, with impending upgrades to artificial intelligence (AI) systems developed by OpenAI and Meta. OpenAI’s GPT-5 will be the new “engine” within the AI chatbot ChatGPT, while Meta’s upgrade will be named Llama 3. Among other things, the current version of Llama powers chatbots on Meta’s social media platforms.

Statements to the media by executives at both OpenAI and Meta suggest that some ability to plan ahead will be incorporated into these upgraded systems. But how exactly will this innovation change the capabilities of AI chatbots?

Imagine you are driving from home to work and want to select the best route – that is, the sequence of choices that is optimal in some sense, based on cost or timing, for example. An AI system would be perfectly capable of choosing the better of two existing routes. But it would be a far more difficult task for it to generate the optimal route from scratch.

A route ultimately consists of a sequence of different choices. However, making individual decisions in isolation is not likely to lead to an optimal overall solution.

For instance, sometimes you have to make a little sacrifice at the start, to reap some benefit later on: maybe joining a slow queue to enter the motorway, in order to move faster later on. This is the essence of a planning problem, a classic topic in artificial intelligence.

There are parallels here with board games such as Go: the outcome of a match depends on the overall sequence of moves, and some moves are aimed at unlocking opportunities that can be exploited later on.

The AI company Google DeepMind developed a powerful AI to play this game called AlphaGo, based on an innovative approach to planning. It was not only able to explore a tree of available options, but also to improve on that ability with experience.

Of course, the real point is not about finding optimal routes for driving or playing games. The technology that powers products such as ChatGPT and Llama 3 are called Large Language Models (LLMs). What is at stake here is providing these AI systems with the ability to consider the long term consequences of their actions. This skill is also necessary to solve mathematical problems, so it potentially unlocks other capabilities for LLMs.

Large language models are designed to predict the next word in a given sequence of words. But in practice, they are used to predict long series of words, such as the answers to questions from human users.

This is currently done by adding one word to the answer, then another word and so on, thereby extending the initial sequence. This is known in the jargon as “autoregressive” prediction. However, LLMs can sometimes paint themselves into corners that are impossible to get out of.

Expected development

An important goal for LLM designers has been to combine planning with deep neural networks, the type of algorithms – or set of rules – that sit behind the models. Deep neural networks were originally inspired by the nervous system. They can improve at what they do through a process called training, where they are exposed to large sets of data.

The wait for LLMs that can plan might be over, according to the comments by OpenAI and Meta executives. However, this comes as no surprise to AI researchers, who have been expecting such a development for some time.

Late last year, OpenAI’s CEO Sam Altman was fired and then rehired by the company. At the time, the drama was rumoured to have involved the company’s development of an advanced algorithm called Q*, although this explanation has since been superseded. Although it’s not clear what Q* does, at the time, the name rang bells with AI researchers because it echoed names for existing methods for planning.

Commenting on those rumours, Meta’s head of AI, Yann LeCun, wrote on X (formerly Twitter that replacing the process of auto regression with planning in LLMs was challenging, but that almost every top lab was working on it. He also thought it was likely that Q* was OpenAI’s attempt to incorporate planning into its LLMs.

LeCun was onto something in what he said about the top labs, because recently, Google DeepMind published a patent application that hinted at planning capabilities.

Intriguingly, the listed inventors were members of the AlphaGo team. The method described in the application looks much like the one that guides AlphaGo towards its goals. It would also be compatible with the current neural network architectures used by large language models.

That brings us to the comments by executives at Meta and OpenAI about the capabilities of their upgrades. Joelle Pineau, vice-president of AI research at Meta, told the FT newspaper: “We are hard at work in figuring out how to get these models not just to talk, but actually to reason, to plan . . . to have memory.”

If that works, we might well see progress on planning and reasoning, moving from simple, step-by-step word generation to planning entire conversations, or even negotiations. Then we might really see AI step up to the next level.The Conversation

Nello Cristianini, Professor of Artificial Intelligence, University of Bath

This article is republished from The Conversation under a Creative Commons license. Read the original article.

Continue Reading
Click to comment

Leave a Reply

Your email address will not be published. Required fields are marked *

Maria in Vancouver

Maria in Vancouver2 weeks ago

Fantabulous Christmas Party Ideas

It’s that special and merry time of the year when you get to have a wonderful excuse to celebrate amongst...

Lifestyle3 weeks ago

How To Do Christmas & Hanukkah This Year

Christmas 2024 is literally just around the corner! Here in Vancouver, we just finished celebrating Taylor Swift’s last leg of...

Lifestyle2 months ago

Nobody Wants This…IRL (In Real Life)

Just like everyone else who’s binged on Netflix series, “Nobody Wants This” — a romcom about a newly single rabbi...

Lifestyle2 months ago

Family Estrangement: Why It’s Okay

Family estrangement is the absence of a previously long-standing relationship between family members via emotional or physical distancing to the...

Lifestyle3 months ago

Becoming Your Best Version

By Matter Laurel-Zalko As a woman, I’m constantly evolving. I’m constantly changing towards my better version each year. Actually, I’m...

Lifestyle3 months ago

The True Power of Manifestation

I truly believe in the power of our imagination and that what we believe in our lives is an actual...

Maria in Vancouver5 months ago

DECORATE YOUR HOME 101

By Matte Laurel-Zalko Our home interiors are an insight into our brains and our hearts. It is our own collaboration...

Maria in Vancouver5 months ago

Guide to Planning a Wedding in 2 Months

By Matte Laurel-Zalko Are you recently engaged and find yourself in a bit of a pickle because you and your...

Maria in Vancouver5 months ago

Staying Cool and Stylish this Summer

By Matte Laurel-Zalko I couldn’t agree more when the great late Ella Fitzgerald sang “Summertime and the livin’ is easy.”...

Maria in Vancouver6 months ago

Ageing Gratefully and Joyfully

My 56th trip around the sun is just around the corner! Whew. Wow. Admittedly, I used to be afraid of...