Category: AI Hardware & Chips

Auto Added by WPeMatico

  • The math behind the OpenAI Jalapeño chip

    OpenAI’s financial trajectory hinges heavily on infrastructure costs, a reality that drove the development of the new custom OpenAI Jalapeño chip. Developed in collaboration with Broadcom, the application-specific integrated circuit (ASIC) represents a direct attempt to mitigate the heavy capital expenditure associated with third-party hardware. 

    While Nvidia currently commands an estimated 75% profit margin on its high-end processors, OpenAI operates on tighter margins, keeping roughly 33 cents of profit on each dollar generated after accounting for its massive operational expenses. The financial burden of running large language models at scale is severe. 

    Last year, keeping ChatGPT servers responsive had cost OpenAI a staggering US$8.4 billion. With the platform now attracting 900 million weekly users, that operational cost is projected to reach approximately US$14 billion this year. Over the next eight years, OpenAI has committed roughly US$1.4 trillion to computing power, a massive bet for a company currently generating US$25 billion in annual revenue.

    Designing Hardware for LLM Inference

    The OpenAI Jalapeño chip, dubbed as the company’s first “Intelligence Processor”, is built specifically for large language model (LLM) inference rather than general-purpose AI workloads. OpenAI provided the core architectural design based on its specific model roadmaps and serving systems, while Broadcom managed the silicon engineering and high-performance networking integration. 

    TSMC handles the physical manufacturing in Taiwan, and Celestica is tasked with building the board and rack systems. According to OpenAI, early lab samples are already running frontier workloads, including an unreleased GPT-5.3-Codex-Spark model, at target production frequency and power. 

    Richard Ho, head of OpenAI’s hardware program, noted that the architecture minimizes data movement to push realized utilization closer to its theoretical peak performance. Unlike general-purpose accelerators adapted from legacy AI workloads, this architecture specifically balances compute, memory, and networking resources to solve the data-movement bottlenecks native to interactive LLM serving.

    To achieve this at scale, the platform integrates Broadcom’s Tomahawk networking silicon directly into the design, allowing the custom processors to communicate across massive, clustered data center environments.

    The vertical integration flywheel

    By moving into custom silicon, OpenAI shifts from being a mere software layer to a vertically integrated infrastructure company. This full-stack strategy spans the entire pipeline: chip architecture, software kernels, memory systems, network scheduling, and the final application layer. Much like Apple’s tight coupling of proprietary hardware and iOS, OpenAI can now optimize its infrastructure around its exact internal model roadmaps.

    This integration feeds a continuous operational flywheel. Enhanced infrastructure efficiency lowers the cost of both training and serving models. More affordable serving leads to better, more responsive products, which drives user volume and revenue to be reinvested back into the next generation of custom infrastructure.

    Overcoming the late-mover advantage

    By introducing its own silicon, OpenAI enters a landscape where its primary competitors have spent nearly a decade developing proprietary hardware. Google began deploying its Tensor Processing Units (TPUs) in 2015 and now controls roughly a quarter of global AI computing capacity outside of Nvidia’s supply chain. 

    Amazon has shipped over one million of its custom chips, while Meta and Microsoft continue to scale their own infrastructure.

    “Jalapeño is part of our long-term full-stack infrastructure strategy to make compute more abundant,” said Greg Brockman, president and co-founder of OpenAI. “By designing more of the stack ourselves, we can serve more intelligence with greater efficiency.”

    To close this timeline gap, OpenAI accelerated the development phase. The OpenAI Jalapeño chip transitioned from a blank-slate design to manufacturing tape-out—the final step before physical production—in just nine months. The engineering teams achieved this timeline by utilizing OpenAI’s own language models to automate and optimize portions of the hardware design process.

    This creates a unique feedback loop where the models served to users are actively being leveraged to build the physical infrastructure that will run future iterations. Initial deployment of the hardware into data centres is scheduled to begin by the end of 2026.

    Broadcom CEO Hock Tan confirmed that the rollout will scale alongside infrastructure partners, including Microsoft, to prepare for gigawatt-scale data centre integration.

    (Photo by OpenAI)

    See also: Omio scales travel product development using OpenAI models

    Banner for AI & Big Data Expo by TechEx events.

    Want to learn more about AI and big data from industry leaders? Check out AI & Big Data Expo taking place in Amsterdam, California, and London. The comprehensive event is part of TechEx and is co-located with other leading technology events, click here for more information.

    AI News is powered by TechForge Media. Explore other upcoming enterprise technology events and webinars here.

    The post The math behind the OpenAI Jalapeño chip appeared first on AI News.

  • Samsung opens ChatGPT Enterprise and Codex access after AI restrictions

    Samsung Electronics is expanding employee access to ChatGPT Enterprise and Codex, giving staff wider use of AI tools for technical and non-technical work.

    According to OpenAI, the deployment covers all Samsung Electronics employees in Korea and all Device eXperience employees worldwide. The DX division includes smartphones, consumer electronics, and home appliances.

    Samsung plans to use the tools in software development, marketing, product development, manufacturing, and other business functions. The tools will support tasks such as information search, document drafting, idea development, data interpretation, and code-related work.

    Samsung revisits employee AI use

    The rollout comes three years after Samsung restricted employee use of generative AI tools over data-security concerns. In 2023, the company limited the use of ChatGPT and similar tools after concerns that sensitive internal information had been uploaded to an external AI platform.

    The new deployment gives employees access to ChatGPT Enterprise, which includes controls for data protection, user access, and security management. OpenAI said the enterprise version allows organisations to manage users, apply access controls, and use AI tools within internal security requirements.

    Samsung’s earlier restrictions applied to employee use of ChatGPT and similar generative AI tools. The new rollout gives employees access through an enterprise product with data protection and access controls.

    Samsung has not limited the deployment to a single business unit or technical group. OpenAI said the tools will be used across a broad range of functions, including technical and non-technical teams.

    OpenAI said ChatGPT can support knowledge-based tasks such as searching for information, analysing material, drafting documents, developing ideas, and interpreting data.

    Codex for technical and non-technical work

    Codex will be used for software-related tasks such as writing, reviewing, and debugging code. OpenAI said the tool is also being used for internal tools, websites, software prototypes, and automated workflows.

    OpenAI said Codex can also support non-technical teams in day-to-day work, including by helping employees create internal tools and automated workflows.

    OpenAI said Codex now has more than five million weekly users across technical and non-technical workflows. In Korea, weekly active users of Codex have grown nearly 800% since February 1, 2026, according to the company.

    Harrison Kim, general manager of OpenAI Korea, said the agreement is one of OpenAI’s largest enterprise deployments. He said Samsung is using AI across teams and functions rather than limiting it to specific departments.

    In October 2025, Samsung said it would work with OpenAI as a strategic memory partner for the Stargate AI infrastructure initiative, with OpenAI’s memory demand projected to reach up to 900,000 DRAM wafers per month.

    Samsung SDS also entered a potential partnership with OpenAI to jointly develop AI data centres and provide enterprise AI services. Samsung said the agreement would allow Samsung SDS to provide consulting, deployment, and management services for businesses integrating OpenAI models into internal systems.

    Samsung SDS also signed a reseller partnership to offer OpenAI services in Korea. Under that arrangement, Samsung SDS said it would support Korean companies adopting ChatGPT Enterprise and other OpenAI services.

    Reuters reported that Samsung Electronics and SK Hynix had signed letters of intent to supply memory chips for OpenAI’s Stargate project. The report said the two South Korean chipmakers together account for about 70% of the global DRAM market and nearly 80% of the high-bandwidth memory market.

    High-bandwidth memory supports fast data movement between memory and processors in AI systems. Reuters reported that OpenAI’s chip demand for Stargate may reach 900,000 wafers per month, citing South Korea’s presidential office.

    Samsung said its semiconductor businesses would support OpenAI’s demand with advanced memory solutions. The company also said its affiliates were exploring broader work with OpenAI in areas including data centres, enterprise services, and AI infrastructure.

    AI adoption and productivity

    Deloitte’s 2026 State of AI in the Enterprise report found that 66% of organisations reported productivity or efficiency gains from enterprise AI adoption. The same report found that 53% reported improved insights and decision-making.

    A Bpifrance survey reported by Reuters found that 77% of 534 French mid-sized company heads said their firms used generative AI, but only 17% of those using it reported time savings.

    Samsung has identified use cases across document work, information analysis, coding, product development, marketing, and manufacturing. The deployment gives employees access to ChatGPT Enterprise and Codex for those tasks under a company-wide agreement.

    OpenAI’s Korea partnerships

    OpenAI has also announced other partnerships in Korea. Seoul National University recently began providing ChatGPT Edu to 47,000 students, faculty, and staff.

    OpenAI has also worked with Kakao to bring ChatGPT responses into KakaoTalk group chats. The company said Korean organisations including LG Electronics, LG Uplus, LG CNS, GS E&C, Samsung SDS, TVING, Krafton, Toss, MUSINSA, Korea Zinc, Nexen Tire, and HanaTour are using ChatGPT Enterprise, OpenAI APIs, or Codex.

    (Photo by Zulfugar Karimov)

    See also: Omio scales travel product development using OpenAI models

    Banner for AI & Big Data Expo by TechEx events.

    Want to learn more about AI and big data from industry leaders? Check out AI & Big Data Expo taking place in Amsterdam, California, and London. The comprehensive event is part of TechEx and is co-located with other leading technology events, click here for more information.

    AI News is powered by TechForge Media. Explore other upcoming enterprise technology events and webinars here.

    The post Samsung opens ChatGPT Enterprise and Codex access after AI restrictions appeared first on AI News.