Discover Trending Products & Smart Deals — Curated Daily for Savvy Shoppers Like You

GPT-5.3-Codex Launches: OpenAI’s Quickest AI Coding Agent Units New Benchmark Information

OpenAI has announced GPT-5.3-Codex, its most superior code-focused agent so far. Based on the corporate, the brand new mannequin is 25% quicker than GPT-5.2-Codex and has achieved record-level accuracy on the SWE-bench Professional and Terminal-Bench 2.0 benchmarks. Designed for skilled growth workflows, the system goes past suggesting code by dealing with end-to-end engineering duties throughout environments.

On SWE-bench Professional (Public), which evaluates software program engineering efficiency throughout a number of programming languages, GPT-5.3-Codex reached 56.8% accuracy. Essentially the most notable enchancment appeared in Terminal-Bench 2.0, targeted on terminal command execution, the place efficiency elevated from 64.0% within the earlier model to 77.3%. The mannequin additionally confirmed sturdy outcomes on OSWorld-Verified, a benchmark that measures how nicely brokers use pc imaginative and prescient to finish desktop duties. GPT-5.3-Codex scored 64.7%, approaching the human common of 72% and considerably exceeding the earlier technology’s 38.2%.

A brand new “steering” functionality has been launched within the Codex app, permitting builders to work together with the mannequin whereas it performs advanced operations. This allows real-time changes, discussions, and collaborative problem-solving with out dropping context throughout code technology or debugging. Coaching and deployment run on NVIDIA GB200 NVL72 techniques, reflecting a co-design effort between OpenAI and NVIDIA to optimize inference efficiency and cut back token utilization throughout advanced duties.

OpenAI additionally labeled the mannequin as “Excessive Functionality” in biosafety and cybersecurity duties inside its Preparedness Framework. GPT-5.3-Codex was particularly skilled to establish software program vulnerabilities, prompting the implementation of enhanced automated monitoring and managed entry for defensive analysis.

This launch displays a broader shift from code copilots towards autonomous engineering brokers, combining decrease latency with improved multilingual workflows. Stories have additionally recommended future tasks involving biometric identification techniques to restrict bot exercise on potential social platforms.

Filed in Robots. Learn extra about , and .

Trending Merchandise

- 18% Wi-fi Keyboard and Mouse Combo &#82...
Original price was: $39.99.Current price is: $32.99.

Wi-fi Keyboard and Mouse Combo R...

0
Add to compare
- 21% ASUS TUF Gaming 24” (23.8” view...
Original price was: $189.00.Current price is: $149.00.

ASUS TUF Gaming 24” (23.8” view...

0
Add to compare
- 5% ASUS TUF Gaming 27″ 1080P Mon...
Original price was: $199.00.Current price is: $189.00.

ASUS TUF Gaming 27″ 1080P Mon...

0
Add to compare
- 33% CHONCHOW LED Keyboard and Mouse, 10...
Original price was: $29.99.Current price is: $19.99.

CHONCHOW LED Keyboard and Mouse, 10...

0
Add to compare
0
Add to compare
- 34% SAMSUNG 34″ ViewFinity S50GC ...
Original price was: $349.99.Current price is: $229.99.

SAMSUNG 34″ ViewFinity S50GC ...

0
Add to compare
- 26% Acer Nitro 31.5″ FHD 1920 x 1...
Original price was: $229.99.Current price is: $169.99.

Acer Nitro 31.5″ FHD 1920 x 1...

0
Add to compare
0
Add to compare
0
Add to compare
- 23% Wi-fi Keyboard and Mouse Combo, Lov...
Original price was: $29.99.Current price is: $22.99.

Wi-fi Keyboard and Mouse Combo, Lov...

0
Add to compare
.

We will be happy to hear your thoughts

Leave a reply

SavvyTrendsNow
Logo
Register New Account
Compare items
  • Total (0)
Compare
0
Shopping cart