Tag Archives: GOOGL

Google’s Gemini 3 Deep Think Model Gets Major Upgrade, Aiming at Research and Engineering Applications

Without any tool assistance, the model achieved a 48.4% accuracy rate on the “Humanity’s Last Exam” (HLE) benchmark and scored 84.6% on the ARC-AGI-2 test. It also reached gold medal level in the written portions of the 2025 International Physics and Chemistry Olympiads. Google stated that the new model is designed to help researchers tackle “unsolvable” problems—ranging from identifying flaws in research papers to optimizing semiconductor crystal growth.

Gemini 3 Deep Think (NASDAQ:GOOGL), Google’s deep thinking model, has undergone a significant upgrade, taking its reasoning capabilities from abstract theory to practical applications. This upgrade focuses on solving complex challenges in modern scientific research and engineering, marking Google’s strategic investment in the enterprise AI market.

On Thursday, February 12, Google officially announced the Gemini 3 Deep Think upgrade, stating that the updated model achieved breakthrough results across several industry benchmarks, including 84.6% in the ARC-AGI-2 test (verified by the ARC Prize Foundation) and an Elo score of 3455 on the competitive programming platform Codeforces.

The upgraded deep thinking model is now available to Google AI Ultra subscribers and is accessible through the Gemini API for selected researchers, engineers, and enterprise users for early access. Google reported that the model has already shown practical value in real-world research, from detecting logical flaws in research papers to optimizing semiconductor material growth processes.

This release positions Google to directly compete with OpenAI’s o1 series and Anthropic’s Claude in the AI reasoning model race. As general AI capabilities become increasingly commoditized, specialized reasoning abilities have become the new battleground in the enterprise market. The launch of the deep thinking model signals that Google is unwilling to concede in this high-value sector.

From Benchmark Results to Gold Medal Performance
Google highlighted the deep thinking model’s performance on rigorous academic benchmarks. In addition to the previously mentioned results, Gemini 3 Deep Think achieved gold medal levels in the written portions of the 2025 International Physics and Chemistry Olympiads and scored 50.5% in the CMT-Benchmark advanced theoretical physics test.

Comparative results from Google show that Gemini 3 Deep Think surpassed the strongest models from Anthropic and OpenAI in several tests, including outperforming the Gemini 3 Pro preview version. For instance, in the ARC-AGI-2 test, Gemini 3 Deep Think scored 84.6%, while Anthropic’s Claude Opus 4.6 Thinking Max achieved 68.8%, and OpenAI’s GPT-5.2 Thinking xhigh scored 52.9%.

Google’s team stated that this upgrade was developed in close collaboration with scientists and researchers to address research challenges that lack clear boundaries or single correct answers, often involving messy or incomplete data. The model combines deep scientific knowledge with practical engineering capabilities, bridging the gap from abstract theory to practical applications.

Beyond breakthroughs in mathematics and programming, the deep thinking model has extended its performance to multiple scientific fields, including chemistry and physics (including theoretical physics). This broad applicability means the model is no longer limited to specific disciplines, but rather serves as a cross-disciplinary research tool.

Real-World Application Cases Validate Its Value
Early test users have demonstrated the model’s real-world potential. Lisa Carbone, a mathematician at Rutgers University, used the deep thinking model to review a highly specialized mathematical paper while researching the required structures for high-energy physics. The model successfully identified a subtle logical flaw that had previously gone undetected despite peer review.

At Duke University, Wang Lab used the deep thinking model to optimize the manufacturing method for complex crystal growth, aiming at the discovery of potential semiconductor materials. The model successfully designed a formula that grew thin films over 100 microns in thickness, achieving precision that was previously unattainable with prior methods.

Anupam Pathak, head of research and development at Google’s Platforms & Devices Division and former CEO of Liftware, tested the upgraded deep thinking model to accelerate the design of physical components.

Another use case showcased by Google demonstrated how the upgraded Gemini 3 Deep Think could convert sketches into 3D printable physical models. The model can analyze blueprints, model complex shapes, and generate the necessary files for 3D printing.

Strategic Positioning in the Enterprise Market
This upgrade reflects a broader shift in the AI industry—from general chatbots to specialized reasoning engines that can tackle professional-grade problems. For enterprise clients, evaluation criteria are changing, focusing not only on which AI can write code or summarize documents the fastest, but also on reasoning capabilities—whether the model can handle complex financial models, analyze experimental data, identify methodological flaws, or assist in patent research or drug discovery.

Google’s advantage lies in its integration capabilities. The deep thinking model is not an isolated tool but part of the broader Gemini ecosystem, meaning it can leverage Google’s vast knowledge graph, scientific datasets, and research partnerships. Researchers using deep thinking through Google Cloud theoretically have access to computational power and data sources that standalone AI services cannot match.

On Thursday, the company posted on X, saying, “The upgraded deep thinking model is driving discoveries and helping researchers solve ‘unsolvable’ problems—from finding flaws in research papers to optimizing semiconductor (crystal) growth.” This statement underscores the model’s transition from benchmark tests to real-world applications.

From a product strategy perspective, Google is targeting both consumer and enterprise users. Google AI Ultra subscribers can immediately access the model via the Gemini app, while scientists, engineers, and enterprise users can apply for early access through the Gemini API. This layered strategy reflects Google’s dual goal of maintaining a presence in the consumer market while vying for high-value enterprise clients.

AI Reasoning Model Competition Heats Up
The launch of the deep thinking model puts Google in direct competition with OpenAI and Anthropic in the AI reasoning race. OpenAI’s o1 model reportedly spends more time “thinking” before generating responses, using reinforcement learning to improve reasoning chains. Anthropic’s Claude 3 has also carved out a niche in research and analytical tasks. Now, Google has staked its claim in the same field, backed by the infrastructure and distribution advantages of being integrated into Workspace and Cloud Platform.

For professional users, this means making a choice between fast general responses and slower, deeper reasoning, which could lead to a new architectural decision. Applications may route simple queries to standard models while escalating complex issues to the reasoning model, creating a layered AI reasoning approach.

Google posted on X on Thursday: “Gemini 3 Deep Think performed exceptionally well in pushing the frontiers of intelligence in benchmark tests. Specific data: 48.4% in ‘Humanity’s Last Exam’ (without tools), 84.6% in ARC-AGI-2 (verified by the ARC Prize Foundation), and an Elo rating of 3455 on Codeforces.”

Google also pointed out that the model now excels in fields like chemistry and physics.

The true test of this competition, however, will not be the press releases, but real-world adoption. If research institutions and engineering firms begin using deep thinking models to tackle complex tasks, it will validate Google’s judgment—that the future of enterprise AI lies in depth, not speed. The company has made it clear: it is competing for the high-end sector of the AI market, where reasoning matters more than conversation.

OpenAI Races Towards $100 Billion Financing, Altman Claims ChatGPT’s Monthly Growth Rate Returns to Over 10%

Amid growing competitive pressure, OpenAI CEO Sam Altman has informed both employees and investors that the company is maintaining strong momentum in its development.

According to an internal Slack message seen by the media, Altman told OpenAI employees last Friday that the company’s popular AI chatbot, ChatGPT, has “reached a monthly growth rate of over 10% once again.” He also mentioned that OpenAI is preparing to launch an “upgraded chatbot model” this week.

Currently, ChatGPT has surpassed 800 million weekly users, but Google(GOOGL) and Anthropic are steadily eating into its market share. In December last year, OpenAI announced that it had entered a “Code Red” state to make comprehensive improvements to ChatGPT, temporarily putting several projects on hold and concentrating resources on this goal.

In Friday’s internal communication, Altman also mentioned that OpenAI’s programming product Codex had seen a growth of about 50% over the past week.

Codex competes directly with Anthropic’s Claude Code, which has gained a large number of users over the past year.

Last week, OpenAI released a new Codex model—GPT-5.3-Codex—and launched a standalone app for users with Apple computers. According to internal messages, Altman described the growth of Codex as “absolutely crazy.”

“It’s been an amazing week,” Altman wrote.

Investor Messaging
Insiders revealed that as OpenAI approaches completing a financing round that could reach up to $100 billion, Altman and CFO Sarah Friar have been actively pitching the company’s growth story to investors.

In private discussions, the two executives emphasized OpenAI’s advantages in the consumer sector, the expanding enterprise business, and its access to computing resources.

As part of the fundraising discussions, OpenAI presented investors with several charts. According to internal data, Codex is steadily capturing market share from Claude Code.

Insiders said that OpenAI expects the fundraising negotiations to intensify over the next two weeks.

As previously reported, OpenAI’s financing round may occur in two phases. The first phase could involve funding from Microsoft(MSFT), NVIDIA(NVDA), and Amazon(AMZN), with Amazon discussing a potential $50 billion investment in OpenAI. Following this, additional investments from entities like SoftBank could come into play, with SoftBank reportedly considering a $30 billion investment.

However, the specific details of this financing round are still in flux, and the final structure may change.

Ad Testing Launched
On Monday, OpenAI announced the launch of an ad test for ChatGPT in the United States, available to certain free and Go plan subscribers.

OpenAI stated that these ads will be clearly labeled and will appear at the bottom of the chatbot’s responses without affecting the content of the answers.

The digital advertising market has long been dominated by Google and Meta, with Amazon gradually becoming an important player in recent years.

It is reported that OpenAI expects that, in the long term, ad revenue will account for less than half of its overall earnings.

AI Arms Race Sparks Semiconductor Surge, NVIDIA Rises Nearly 8%, Achieving the Strongest Rally in 10 Months

The AI “arms race” has triggered a semiconductor frenzy, with NVIDIA (NASDAQ:NVDA) stock surging nearly 8%, marking its strongest rise in nearly 10 months. Earlier, six of the seven tech giants had already reported their earnings, with the most noteworthy being that Google (NASDAQ:GOOGL), Microsoft (NASDAQ:MSFT), Amazon (NASDAQ:AMZN), and Meta Platforms (NASDAQ:META) will collectively spend approximately $650 billion on capital expenditures in 2026.

Last week, Meta announced its capital expenditures would rise to as much as $135 billion for the year, representing an 87% increase. Meanwhile, Microsoft reported a 66% year-on-year growth in its capital expenditures for Q2, and analysts predict its fiscal year capital spending through June will approach $105 billion.

Jensen Huang, CEO of NVIDIA, specifically praised OpenAI and Anthropic, two leading AI labs, stating that both are “making a lot of money.” NVIDIA invested $10 billion in Anthropic last year, and Huang earlier this week indicated plans to significantly invest in OpenAI’s next funding round. He mentioned, “If they can have twice the computing power, their revenue will increase fourfold.”

Earlier, Taiwan Semiconductor Manufacturing Company (NYSE:TSM), the world’s leading semiconductor foundry, set new performance records and announced plans to significantly increase its capital expenditures to $52-56 billion in 2026, far exceeding market expectations. This is aimed at accelerating the expansion of advanced manufacturing capacity to address the ongoing global shortage of AI chips.

CPO Industry Accelerates, Lumentum Surges Over 9%

NVIDIA, in a recent webinar, announced that three partners—CoreWeave (NYSE:CRWV), Lambda, and TACC—will deploy the IB CPO system in the first half of 2026, and Ethernet CPO products are expected to start shipping in the second half of 2026. The company believes that the CPO industry is advancing more quickly than expected, with the technology first landing in scale-out scenarios and expanding into larger market spaces. As a next-generation optical interconnect solution, the commercial value of CPO continues to become clearer, and its market potential is expanding.

Lumentum (NASDAQ:LITE) CEO Michael Hurlston stated in a recent earnings call that the company’s ongoing growth is primarily driven by cloud optical modules, OCS, and CPO. The development of the OCS business has exceeded expectations, with the first $10 million quarterly revenue target, initially set for Q3, being reached ahead of schedule. Demand for OCS from three core customers has surged, and Hurlston revealed that there is a backlog of over $400 million in OCS orders, most of which are planned to be delivered in the second half of 2026. Orders and revenue are expected to continue growing as they enter 2027.

Roivant Sciences Surges Over 22% After Promising Skin Disease Results

Roivant Sciences (NASDAQ:ROIV) saw its stock rise more than 22% after its subsidiary Priovant Therapeutics reported positive results from the phase 2 trial of its experimental drug brepocitinib. The drug showed improvements in the activity of skin nodular disease at higher doses. Another subsidiary, Pulmovant, also announced the completion of phase 2 trial enrollment involving around 120 patients for its experimental drug mosliciguat, aimed at treating pulmonary arterial hypertension associated with lung disease.

Priovant reported that patients taking a 45 mg dose showed a 22.3-point improvement on a key skin scoring system at week 16, while the placebo group showed only a 0.7-point improvement. The company noted that all patients in the 45 mg group showed significant improvement, with 62% achieving near-complete skin clearance, and 69% achieving complete or nearly complete clearance, while none of the placebo group patients achieved similar results.

Priovant plans to launch phase 3 trials in 2026, following consultations with the U.S. Food and Drug Administration (FDA).