ChatGPT 5.4 arrives: Artificial intelligence can now use your computer on your behalf


Lerato Khumalo

The model’s most important innovation is its built-in ability to use software, websites and digital tools directly. In other words, the system can now act as an assistant that works inside applications.

Compared with previous versions, the update focuses on automating complex tasks rather than on chatting. It also brings significant improvements in reasoning, writing code and handling long documents.

Here are the five key changes from previous versions:

1. A model that can use the computer directly

The biggest innovation is a feature called “computer use”. GPT-5.4 is OpenAI’s first general-purpose model designed to control a computer natively.

The model can:

Write code that interacts with programs and websites
Use automation libraries
Generate mouse and keyboard commands based on screenshots

In other words, it not only explains what needs to be done but can also perform operations directly within applications and interfaces.
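The screenshot-to-command cycle described above can be pictured as a simple loop: capture the screen, ask the model for the next action, apply it, and repeat. The sketch below is only an illustration of that idea and does not use OpenAI’s actual interface. Every name in it (Action, FakeDesktop, run_computer_use_loop, scripted_model) is hypothetical, and the "model" is a scripted placeholder that replays a fixed plan.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class Action:
    """One step the model asks for. Hypothetical format, not a real API."""
    kind: str          # "click", "type", or "done"
    x: int = 0
    y: int = 0
    text: str = ""


@dataclass
class FakeDesktop:
    """Stand-in for a real screen; it only records what was done to it."""
    log: List[str] = field(default_factory=list)

    def screenshot(self) -> str:
        # A real system would return image data; a text summary is used here.
        return f"screen after {len(self.log)} actions"

    def apply(self, action: Action) -> None:
        if action.kind == "click":
            self.log.append(f"click({action.x},{action.y})")
        elif action.kind == "type":
            self.log.append(f"type({action.text!r})")
        else:
            raise ValueError(f"unknown action kind: {action.kind}")


def run_computer_use_loop(
    desktop: FakeDesktop,
    propose_action: Callable[[str], Action],
    max_steps: int = 10,
) -> List[str]:
    """Observe the screen, get the next action, apply it, repeat until done."""
    for _ in range(max_steps):
        action = propose_action(desktop.screenshot())
        if action.kind == "done":
            break
        desktop.apply(action)
    return desktop.log


def scripted_model(screenshot: str) -> Action:
    """Placeholder for the model (assumption): replays a fixed three-step plan."""
    step = int(screenshot.split()[2])  # number of actions already applied
    plan = [
        Action("click", 120, 48),
        Action("type", text="quarterly report"),
        Action("done"),
    ]
    return plan[min(step, len(plan) - 1)]


if __name__ == "__main__":
    print(run_computer_use_loop(FakeDesktop(), scripted_model))
```

The max_steps limit is a design choice in this sketch: it keeps the loop from running indefinitely if the placeholder model never signals that it is finished.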

2. Designed to create autonomous AI agents

OpenAI describes GPT-5.4 as its most suitable model for developing AI agents.

An AI agent can:

Open a website
Search for information
Fill out a form
Complete multi-step tasks using different tools

This approach aims to transform artificial intelligence from a mere chatbot into an operations engine that works across different digital systems.

3. Stronger integration with tools and APIs

Another important difference is GPT-5.4’s improved integration with external tools and APIs.

The model can now make API calls and use tools more accurately and efficiently while carrying out a task. By bringing together different services such as browsers, databases and enterprise software, it can complete complex tasks without manual intervention.
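To make the idea of tool use concrete, here is a minimal sketch of how an application might route a model’s tool request to ordinary functions. It assumes the request arrives as JSON with a "name" and an "arguments" field, which is an assumption made for illustration and not a description of OpenAI’s format. The tools lookup_customer and create_ticket, and the dispatch helper, are hypothetical stand-ins for a database and an enterprise system.

```python
import json
from typing import Any, Callable, Dict

# Registry mapping tool names to plain Python functions.
TOOLS: Dict[str, Callable[..., Any]] = {}


def tool(fn: Callable[..., Any]) -> Callable[..., Any]:
    """Register a function so it can be called by name."""
    TOOLS[fn.__name__] = fn
    return fn


@tool
def lookup_customer(customer_id: int) -> dict:
    # Hypothetical stand-in for a database query.
    return {"id": customer_id, "name": "Example Ltd"}


@tool
def create_ticket(customer_name: str, subject: str) -> str:
    # Hypothetical stand-in for a call to enterprise software.
    return f"ticket for {customer_name}: {subject}"


def dispatch(tool_call_json: str) -> Any:
    """Parse a tool request (assumed JSON shape) and run the matching tool."""
    call = json.loads(tool_call_json)
    name = call["name"]
    if name not in TOOLS:
        raise KeyError(f"unknown tool: {name}")
    return TOOLS[name](**call["arguments"])


if __name__ == "__main__":
    # Two chained requests, as a model might issue them during one task.
    customer = dispatch(
        '{"name": "lookup_customer", "arguments": {"customer_id": 7}}'
    )
    request = {
        "name": "create_ticket",
        "arguments": {"customer_name": customer["name"], "subject": "Login fails"},
    }
    print(dispatch(json.dumps(request)))
```

Rejecting unknown tool names before calling anything is the kind of check that lets such a pipeline run without a person reviewing every step.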

4. Much stronger at working with complex documents

The model is also optimized for analyzing long, structured texts. It performs especially well on complex documents such as contracts and other legal texts.

In legal and document-analysis tests, GPT-5.4 scored highly on maintaining accuracy and structural integrity across long texts.

The model is therefore aimed at professional use cases rather than simple chat.

5. “Thinking” version for advanced reasoning

Alongside the base model, a version called GPT-5.4 Thinking was also introduced.

This version focuses on tasks that require more reasoning, planning and analysis of complex problems.

It also includes new security measures so that it can be used in sensitive areas such as cybersecurity.

While the Thinking model will be offered to Plus, Team and Pro users, the Pro version, which includes more advanced features, will be available only to Pro and Enterprise subscribers. Free users do not have access to this new version at this stage.