GPT-4o

Generative Pre-trained Transformer 4 Omni (GPT-4o)
Developer(s)	OpenAI
Initial release	May 13, 2024; 2 days ago
Predecessor	GPT-4 Turbo
Type	Multimodal; Large language model; Generative pre-trained transformer; Foundation model;
License	Proprietary
Website	openai.com/index/hello-gpt-4o

GPT-4o (GPT-4 omni) is a multilingual, multimodal generative pre-trained transformer designed by OpenAI. It was announced by OpenAI's CTO Mira Murati during a live-streamed demo on 13 May 2024 and released the same day.^[1] GPT-4o is free, but with a usage limit that is 5 times higher for ChatGPT Plus subscribers.^[2] Its API is twice as fast and half the price of its predecessor, GPT-4 Turbo.^[1]

Background[edit]

GPT-4o was originally shadow launched on LMSYS, as 3 different models. These 3 models were called gpt2-chatbot, im-a-good-gpt2-chatbot, and im-also-a-good-gpt2-chatbot. On 7 May 2024, Sam Altman revealed that OpenAI was responsible for these mysterious new models.^[3]

Capabilities[edit]

GPT-4o achieves state-of-the-art results in voice, multilingual, and vision benchmarks, setting new records in audio speech recognition and translation.^[4] GPT-4o scores 88.7 on the Massive Multitask Language Understanding (MMLU) benchmark compared to 86.5 by GPT-4.^[4]

The model supports over 50 languages,^[1] covering over 97% of speakers. Mira Murati demonstrated the model's multilingual capability by speaking Italian to the model and having it translate between English and Italian during the live-streamed OpenAI demo event on 13 May 2024.

It is currently the leading model in the Large Model Systems Organization (LMSYS) Elo Arena Benchmarks by the University of California, Berkeley.^[5]

Applications[edit]

GPT-4o is integrated into various OpenAI products, including ChatGPT, enhancing its performance in understanding and generating human-like text. It also powers several third-party applications that require advanced natural language processing capabilities.

GPT-4o is utilized in fields such as healthcare, finance, and customer service for tasks like automated support, data analysis, and multilingual communications. Its multimodal capabilities allow it to handle text, image, and voice inputs, making it a versatile tool for diverse applications.

References[edit]

^ ^a ^b ^c Wiggers, Kyle (2024-05-13). "OpenAI debuts GPT-4o 'omni' model now powering ChatGPT". TechCrunch. Retrieved 2024-05-13.
^ Field, Hayden (2024-05-13). "OpenAI launches new AI model GPT-4o and desktop version of ChatGPT". CNBC. Retrieved 2024-05-14.
^ Sam Altman "https://twitter.com/sama/status/1787222050589028528" Twitter, X. Retrieved 14 May 2024.
^ ^a ^b "Hello GPT-4o". OpenAI.
^ Fedus, William. "GPT-4o is our new state-of-the-art frontier model".

[TechCrunch-1] Wiggers, Kyle (2024-05-13). "OpenAI debuts GPT-4o 'omni' model now powering ChatGPT". TechCrunch. Retrieved 2024-05-13.

[2] Field, Hayden (2024-05-13). "OpenAI launches new AI model GPT-4o and desktop version of ChatGPT". CNBC. Retrieved 2024-05-14.

[3] Sam Altman "https://twitter.com/sama/status/1787222050589028528" Twitter, X. Retrieved 14 May 2024.

[Hello_GPT-4o-4] "Hello GPT-4o". OpenAI.

[5] Fedus, William. "GPT-4o is our new state-of-the-art frontier model".

[1]

[2]

[3]

[4]

[5]