OpenAI's new AI reportedly reaches human-level intelligence

Ai
OpenAI's new AI reportedly reaches human-level intelligence

OpenAI's o3 system achieved an 85% score on the ARC-AGI test, a benchmark designed to assess general intelligence. This performance matches the human average, significantly surpassing the 55% scored by previous AI systems.

Researchers who evaluated o3 are excited about this advancement, as the system is set to power the recently launched ChatGPT. The ARC-AGI test measures an AI's ability to generalize and adapt to new situations and the results indicate that o3 can effectively identify simple rules that explain observed transformations an essential trait of general intelligence.

Despite the promising results, many questions remain unanswered. OpenAI has been reticent about detailed information, limiting communication to a few media presentations and preliminary tests. To truly understand o3's potential, comprehensive evaluations are necessary, particularly concerning its adaptability to various contexts beyond the ARC-AGI test.