{"id":2762,"date":"2025-12-11T19:11:01","date_gmt":"2025-12-11T19:11:01","guid":{"rendered":"https:\/\/microvibenews.com\/?p=2762"},"modified":"2025-12-11T19:11:01","modified_gmt":"2025-12-11T19:11:01","slug":"openai-debuts-gpt-5-2-in-effort-to-silence-concerns-it-is-falling-behind-its-rivals","status":"publish","type":"post","link":"https:\/\/microvibenews.com\/?p=2762","title":{"rendered":"OpenAI debuts GPT-5.2 in effort to silence concerns it is falling behind its rivals"},"content":{"rendered":"<p><img src=\"https:\/\/fortune.com\/img-assets\/wp-content\/uploads\/2025\/12\/GettyImages-2198334790-e1765478723707.jpg?w=2048\" \/><\/p>\n<p>OpenAI, under increasing competitive pressure from Google and Anthropic, has debuted a new AI model, GPT-5.2, that it says beats all existing models by a substantial margin across a wide range of tasks.<\/p>\n<p>The new model, which is being released less than a month after OpenAI debuted its predecessor, GPT-5.1, performed particularly well on a benchmark of complicated professional tasks across a range of \u201cknowledge work\u201d\u2014from law to accounting to finance\u2014as well as on evaluations involving coding and mathematical reasoning, according to data OpenAI released.<\/p>\n<div>\n<p>Fidji Simo, the former Instacart CEO who now serves as OpenAI\u2019s CEO of applications, told reporters that the model should not be seen as a direct response to Google\u2019s Gemini 3 Pro AI model, which was released last month. 
That release prompted OpenAI CEO Sam Altman to issue a \u201ccode red,\u201d delaying the rollout of several initiatives in order to focus more staff and computing resources on improving its core product, ChatGPT.<\/p>\n<p>\u201cI would say that [the Code Red] helps with the release of this model, but that\u2019s not the reason it is coming out this week in particular, it has been in the works for a while,\u201d she said.<\/p>\n<p>She said the company had been building GPT-5.2 \u201cfor many months.\u201d \u201cWe don\u2019t turn around these models in just a week. It\u2019s the result of a lot of work,\u201d she said. The model had been known internally by the code name \u201cGarlic,\u201d according to a story in The Information. The day before the model\u2019s release, Altman teased its imminent rollout by posting to social media a video clip of him cooking a dish with a large amount of garlic.<\/p>\n<p>OpenAI executives said that the model had been in the hands of \u201cAlpha customers\u201d who help test its performance for \u201cseveral weeks\u201d\u2014a time period that would mean the model was completed prior to Altman\u2019s \u201ccode red\u201d declaration.<\/p>\n<p>These testers included legal AI startup Harvey, note-taking app Notion, and file-management software company Box, as well as Shopify and Zoom.<\/p>\n<p>OpenAI said these customers found GPT-5.2 demonstrated a \u201cstate of the art\u201d ability to use other software tools to complete tasks, as well as excelling at writing and debugging code.<\/p>\n<p>Coding has become one of the most competitive use cases for AI model deployment within companies. Although OpenAI had an early lead in the space, Anthropic\u2019s Claude model has proved especially popular among enterprises, exceeding OpenAI\u2019s market share according to some figures. 
OpenAI is no doubt hoping to convince customers to turn back to its models for coding with GPT-5.2.<\/p>\n<p>Simo said the \u201cCode Red\u201d was helping OpenAI focus on improving ChatGPT. \u201cCode Red is really a signal to the company that we want to marshal resources in one particular area, and that\u2019s a way to really define priorities and define things that can be deprioritized,\u201d she said. \u201cSo we have had an increase in resources focused on ChatGPT in general.\u201d<\/p>\n<p>The company also said its new model is better than the company\u2019s earlier ones at providing \u201csafe completions\u201d\u2014which it defines as providing users with helpful answers while not saying things that might contribute to or worsen mental health crises.<\/p>\n<p>\u201cOn the safety side, as you saw through the benchmarks, we are improving on pretty much every dimension of safety, whether that\u2019s self harm, whether that\u2019s different types of mental health, whether that\u2019s emotional reliance,\u201d Simo said. \u201cWe\u2019re very proud of the work that we\u2019re doing here. It is a top priority for us, and we only release models when we\u2019re confident that the safety protocols have been followed, and we feel proud of our work.\u201d<\/p>\n<p>The release of the new model came on the same day a new lawsuit was filed against the company alleging that ChatGPT\u2019s interactions with a psychologically troubled user had contributed to a murder-suicide in Connecticut. The company also faces several other lawsuits alleging ChatGPT contributed to people\u2019s suicides. 
The company called the Connecticut murder-suicide \u201cincredibly heartbreaking\u201d and said it is continuing to improve \u201cChatGPT\u2019s training to recognize and respond to signs of mental or emotional distress, de-escalate conversations and guide people toward real-world support.\u201d<\/p>\n<p>GPT-5.2 showed a large jump in performance across several benchmark tests of interest to enterprise customers. It met or exceeded human expert performance on a wide range of difficult professional tasks, as measured by OpenAI\u2019s GDPval benchmark, 70.9% of the time. That compares to just 38.8% of the time for GPT-5, a model that OpenAI released in August; 59.6% for Anthropic\u2019s Claude Opus 4.5; and 53.3% for Google\u2019s Gemini 3 Pro.<\/p>\n<p>On the software development benchmark, SWE-Bench Pro, GPT-5.2 scored 55.6%, which was almost 5 percentage points better than its predecessor, GPT-5.1, and more than 12 percentage points better than Gemini 3 Pro.<\/p>\n<p>OpenAI\u2019s Aidan Clark, vice president of research (training), declined to answer questions about exactly what training methods had been used to upgrade GPT-5.2\u2019s performance, although he said that the company had made improvements across the board, including in pretraining, the initial step in creating an AI model.<\/p>\n<p>When Google released its Gemini 3 Pro model last month, its researchers also said the company had made improvements in pretraining as well as post-training. 
This surprised some in the field who believed that AI companies had largely exhausted the ability to wring substantial improvements out of the pretraining stage of model building, and it was speculated that OpenAI may have been caught off guard by Google\u2019s progress in this area.<\/p>\n<\/div>\n","protected":false},"excerpt":{"rendered":"<p>OpenAI, under increasing compe&hellip; <\/p>\n","protected":false},"author":1,"featured_media":2763,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":[],"categories":[2],"tags":[704,725,319,2857,2859,2345,715,2858,703,2861,2787,2860],"_links":{"self":[{"href":"https:\/\/microvibenews.com\/index.php?rest_route=\/wp\/v2\/posts\/2762"}],"collection":[{"href":"https:\/\/microvibenews.com\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/microvibenews.com\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/microvibenews.com\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/microvibenews.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=2762"}],"version-history":[{"count":0,"href":"https:\/\/microvibenews.com\/index.php?rest_route=\/wp\/v2\/posts\/2762\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/microvibenews.com\/index.php?rest_route=\/wp\/v2\/media\/2763"}],"wp:attachment":[{"href":"https:\/\/microvibenews.com\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=2762"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/microvibenews.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=2762"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/microvibenews.com\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=2762"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}