Category: Natural Language Processing (NLP)

Auto Added by WPeMatico

  • Omio scales travel product development using OpenAI models

    Omio integrates OpenAI models across its engineering operations to accelerate travel product development and launch booking interfaces.

    The multimodal travel platform coordinates operations with over 3,000 transportation providers across 47 countries. Omio explicitly rejects the superficial addition of technology to outdated internal processes. The company’s CTO, Tomas Vocetka, requires all internal functions to completely redesign their operational execution frameworks from the ground up to operate as a native AI enterprise.

    OpenAI Codex integration

    Vocetka initiated the internal deployment by providing base ChatGPT access to the workforce, establishing a baseline familiarity with generative models before executing the primary technical integration.

    Omio subsequently embedded OpenAI Codex directly into its engineering operations, mandating its application across the entire software development lifecycle. Engineers currently apply Codex to preliminary research, architectural planning, active coding, automated testing, code reviews, and ongoing system maintenance.

    The engineering division constructs custom internal connectors to link proprietary data environments directly with these tools. This setup allows developers to bypass basic information retrieval and proceed directly to active task execution within their integrated development environments.

    Vocetka categorises the initial ChatGPT rollout as a preliminary introduction, emphasising that Codex handles the actual production workload. The deployment execution matured beyond the technical divisions. Management actively expands the use of Codex into non-technical corporate functions across the wider organisation. This expansion ensures standard operational procedures adapt to the new capabilities introduced by the engineering team.

    Internal analysis indicates the technical effort required to build specific products now sits at approximately 20 percent of previous levels. Delivery timelines show corresponding compression. Projects demanding the attention of multiple developers over an entire fiscal quarter now require a single engineer operating for roughly one month.

    Faster cycle times allow the engineering teams to test experimental concepts and validate consumer demand with minimal resource expenditure. Management allocates capital and engineering hours with greater precision, relying on prototyping to eliminate unviable features before committing to full-scale production.

    Lowering the time and cost barrier for software creation enables quicker internal decision-making. The technical teams iterate on existing products at a much higher velocity, pushing updates and new interface elements to the live environment at accelerated pace.

    Conversational commerce built on real-time transport data

    Omio launched one of the earliest conversational travel booking interfaces in 2023 by connecting OpenAI models to its proprietary transportation inventory.

    The system processes natural language queries regarding complex multimodal routes. Travelers input natural language requests asking for the fastest route from Rome to Florence, or comparing flights and trains between Paris and Barcelona.

    Omio aggregates services spanning trains, buses, ferries, and flights. Legacy travel booking required users to navigate multiple websites, manually compare modes of transport, and independently aggregate itineraries across multiple providers. Omio replaces this fractured process with a unified interface capable of parsing consumer intent.

    The generative models analyse text inputs and ping the booking systems to construct viable travel paths. The application functions by grounding the model responses in live pricing and availability data. The architecture prevents the generation of travel options based on static or outdated training data. The resulting output provides consumers with directly bookable itineraries.

    Omio expanded its initial integration into a dedicated ChatGPT experience. This dedicated application directly accesses the global transportation network maintained by the company. By grounding the user interaction in verified data, the technical team ensures high-fidelity responses. Consumers receive highly-personalised journey options rather than generic travel advice.

    Omio defines this structural setup as a new category of conversational commerce. The AI operates as the primary interface layer mediating the interaction between the consumer and the underlying global transportation network. The company views this as a broader departure from legacy search-based interfaces toward native generative customer experiences.

    The deployment points to a future where travel planning relies entirely on interacting with intelligent systems connected directly to live transportation networks.

    Omio’s corporate policy explicitly mandates that human personnel retain full accountability for all deployed code and final business outcomes. Generative tools function strictly as acceleration engines for development, analysis, and decision-making.

    “The responsibility and accountability stay with people. AI helps us develop faster, analyse faster, and make decisions faster, but people stay in charge,” explains Vocetka.

    This governance structure prevents automated systems from independently executing irreversible changes to the booking infrastructure or the core multimodal routing algorithms. The combination of broad employee access to OpenAI tools and rigorous oversight models creates an environment prioritising both speed and systemic stability.

    See also: Mitigating vendor lock-in with Sakana AI Fugu multi-agent models

    Banner for the AI & Big Data Expo event series.

    Want to learn more about AI and big data from industry leaders? Check out AI & Big Data Expo taking place in Amsterdam, California, and London. The comprehensive event is part of TechEx and is co-located with other leading technology events including the Cyber Security & Cloud Expo. Click here for more information.

    AI News is powered by TechForge Media. Explore other upcoming enterprise technology events and webinars here.

    The post Omio scales travel product development using OpenAI models appeared first on AI News.

  • Mitigating vendor lock-in with Sakana AI Fugu multi-agent models

    Sakana AI launched Fugu to orchestrate multi-agent operations and mitigate single-vendor dependency risks in enterprise deployments.

    Enterprises face operational vulnerabilities when relying entirely on monolithic AI APIs. Japanese AI firm Sakana AI designed Fugu as a response to these concentration risks by creating an orchestration language model that calls upon a pool of varied models to complete multi-step tasks.

    Users access this ecosystem through a single OpenAI-compatible endpoint. Fugu routes queries internally, deciding whether to resolve a prompt directly or to assemble a coordinated team of expert models for deeper analysis. The system handles model selection, delegation, verification, and synthesis internally. Engineering teams interact with what appears to be one model while a background system of specialists executes the actual computation.

    Sakana AI targets the geopolitical and regulatory risks associated with AI sourcing. Recent export controls affecting Anthropic models like Fable and Mythos demonstrated that access to specific foundational architectures can vanish based on foreign policy decisions.

    Fugu functions as a hedge against these sudden supply chain disruptions. The platform relies on a completely swappable agent pool. Fugu dynamically routes traffic around any restricted or degraded provider to maintain service continuity. Sakana AI states this capability provides the resilient architecture required for AI sovereignty.

    Fugu deployment tiers

    Two tiers are available to accommodate different operational latency requirements.

    The standard Fugu model prioritises low latency for daily tasks, integrating into standard developer tools like Codex for live coding and code review. Organisations subject to strict data governance or privacy mandates can manually opt specific underlying models out of the standard Fugu routing pool.

    Fugu Ultra targets complex, multi-step analytical problems that demand maximum accuracy. The Ultra variant coordinates a deeper pool of expert agents for intensive tasks such as academic paper reproduction, literature investigations, and patent analysis.

    Sakana AI reports that Fugu Ultra performs competitively against leading closed models like Fable 5 and Mythos Preview across scientific, engineering, and reasoning benchmarks:

    Benchmarks of Sakana AI Fugu standard and Ultra compared to rival frontier models.

    The orchestration method ensures companies can access top-tier computing capabilities without carrying the vendor concentration risk or export control exposure inherent to those closed models.

    Implementation in cybersecurity

    Almost 500 early users tested the system during an extended beta program focused on lengthy, multi-step computational workflows. With cybersecurity such a focus for models like Claude Mythos, engineering teams deployed Fugu Ultra to automate complete security assessment cycles.

    Human operators issued one scoped instruction, and the orchestration engine executed the entire reconnaissance phase. The model successfully conducted cross-site scripting and SQL injection checks alongside thorough authentication reviews.

    A participating cybersecurity engineer confirmed the model stayed strictly within its operational parameters and avoided initiating destructive actions against the target infrastructure. Fugu concluded the automated engagement by generating a clean vulnerability report complete with verifying evidence and exact retest steps for human remediation teams.

    The implementation demonstrated that multi-agent routing maintains strict compliance boundaries while executing complex penetration testing sequences.

    Software development teams also integrated Fugu Ultra into their primary code review pipelines to compare defect detection rates against established monolithic tools. The orchestration engine consistently outperformed baseline models in identifying logic flaws and security vulnerabilities within complex enterprise codebases.

    “For code review, Fugu Ultra is significantly better than GPT-5.5. It gives comprehensive answers and finds the bugs others miss,” reported a software engineer involved in the beta deployment. “Where other tools flag about three issues, Fugu surfaced more than twenty. It’s become the model I run all my reviews through.”

    Automated research and persona stability

    Data science units deployed the system in an almost fully-automated research mode. Fugu Ultra successfully explored mathematical hypotheses, executed experimental code runs, interpreted failure states, and revised its own approaches to sustain progress over extended periods with minimal human intervention. This capability directly addresses the operational limitations of single-call models that require constant human prompting to recover from logic errors.

    Leadership at an unnamed enterprise platform company identified long-term persona stability as a primary advantage during these extended sessions. Conventional monolithic architectures often suffer from context degradation and identity drift when processing extensive conversational histories.

    “Raw output quality is on par with top frontier models, but Fugu showed unusually strong persona stability across long sessions, holding its identity where other models drift,” the executive stated. “For agent products, that may matter more than raw benchmark scores.”

    Extended benchmark validation

    Sakana AI built the internal routing logic upon extensive research into learned model orchestration. The technical foundation for the product stems from findings published in the company’s ICLR 2026 papers, specifically the Trinity and Conductor frameworks.

    These academic foundations allow Fugu to process requests by understanding precisely when a task requires delegation versus direct resolution. The internal language model dictates communication protocols between the individual agents and structures the final synthesis of their separate computational outputs.

    Validation testing against frontier AI competitors covered complex, open-ended disciplines ranging from financial time series prediction to mechanical design. Fugu also demonstrated high proficiency in niche physical logic tests and visual interpretation tasks, including solving the Rubik’s Cube and performing Japanese handwriting analysis. The capacity to excel in both quantitative financial modelling and qualitative image processing confirms the efficacy of the multi-agent orchestration approach.

    Sakana AI designed the system to scale organically as the broader AI hardware and software market matures. Because the product relies entirely on learned orchestration logic rather than fixed operational rulesets, it automatically benefits from third-party innovations. Sakana AI plans to continuously expand the available pool of expert agents.

    The engineering team will fold newly-released open-source tools and proprietary Sakana AI models into the routing pool as they become available. Both the standard Fugu and Fugu Ultra models are available to enterprise clients today.

    See also: SAP and Google Cloud deploy agentic commerce architecture

    Banner for the AI & Big Data Expo event series.

    Want to learn more about AI and big data from industry leaders? Check out AI & Big Data Expo taking place in Amsterdam, California, and London. The comprehensive event is part of TechEx and is co-located with other leading technology events including the Cyber Security & Cloud Expo. Click here for more information.

    AI News is powered by TechForge Media. Explore other upcoming enterprise technology events and webinars here.

    The post Mitigating vendor lock-in with Sakana AI Fugu multi-agent models appeared first on AI News.

  • Google Cloud generative AI automates council planning operations

    Government ministries are deploying Google Cloud generative AI across municipal agencies to automate council planning operations.

    Public sector administration handles vast volumes of unstructured data that delay infrastructure development. The UK central government established a target to construct 1.5 million new homes by 2029. Local planning authorities encounter administrative backlogs caused by dense paperwork, delaying these development timelines.

    To address these constraints, the Ministry of Housing, Communities and Local Government (MHCLG) and the Department for Science, Innovation and Technology (DSIT) expanded two machine learning tools designed to accelerate municipal processing. Speaking at the Google Cloud Summit London, officials confirmed the nationwide deployment of the ‘Extract’ application and the progression of the ‘Augmented Planning Decisions’ (APD) prototype.

    Lila Ibrahim, Chief AI Readiness Officer at Google DeepMind, said: “The UK has an opportunity to build the homes our communities need, but local councils face a mountain of paperwork. That’s why we’re co-creating a sophisticated planning tool directly with councils to solve real-world bottlenecks.

    “This will help significantly cut decision times, freeing up planners to focus on the future to get Britain building faster.”

    Householder applications – which include routine domestic modifications such as loft conversions or property extensions – account for nearly 70 percent of all planning applications submitted annually. Evaluating these standard submissions manually requires planning officers to spend hours cross-referencing regional policy documents, historical archives, and unstructured PDF files.

    Such a repetitive evaluation process consumes administrative hours that would otherwise support major infrastructure and commercial developments. The deployment of automation targets this administrative distribution, aiming to reduce application decision timelines by 50 percent.

    Core capabilities of the Google Cloud generative AI tools

    Engineers at MHCLG and the government’s applied AI team, the Incubator for AI (i.AI), built the Extract tool internally using Gemini foundation models. Following trials across more than 20 local planning authorities, administrators expanded the application to every council in England.

    Extract parses unstructured data locked within legacy PDF records, converting hundreds of pages of historical planning documentation into structured digital datasets within minutes. Operational data from the trial phases indicates that the tool will eliminate roughly 255 hours of manual data entry per council annually. This reduction allows local authorities to reallocate personnel to complex evaluation tasks.

    Integrating large language models into public sector workflows requires enterprise-grade security environments. Local authorities process sensitive civic records, requiring strict risk management protocols to prevent data exposure.

    The government hosted the Gemini models on Google Cloud to establish a protected operating environment where data sovereignty is maintained. The cloud environment features active security controls to block malicious inputs, including prompt injection attacks. This technical framework ensures that sensitive municipal data remains secure during both testing and production computing cycles.

    The APD system, meanwhile, acts as an analytical assistant for municipal planning officers by automating four primary administrative tasks:

    1. The system consolidates incoming documentation by pre-processing data backlogs, flagging missing information gaps, and extracting core geographical site data onto a unified user interface for officer review.
    2. The software identifies relevant national and local zoning laws, assesses compliance margins, and appends precise policy citations for manual verification.
    3. The application parses public consultation letters, summarising stakeholder objections or historical legal precedents.
    4. The model generates initial drafts of final evaluation reports, including the technical rationale and recommended approval conditions.

    Protocols dictate that human planning officers retain final decision-making authority over every application. The software does not automate final approvals or rejections independently. Staff members review every line of text generated by the machine learning models, modifying the analytical reasoning before validating the report.

    To maintain regulatory accountability, the APD prototype records its internal processing steps sequentially. This mechanism establishes an auditable chain of thought, creating a verification trail for every processed application to support the officer’s final determination.

    Local council planning trials and scaling timelines

    The development of the APD prototype relies on a collaborative framework linking public sector administrators with engineering teams from Google Cloud, Google DeepMind, and Faculty.

    The alpha version undergoes live testing within three local authorities: the London Borough of Barnet, Dorset Council, and the London Borough of Camden. Testing across these distinct regional jurisdictions provides developers with varied municipal datasets to test the software against diverse local policies. 

    Central planners intend to complete the alpha phase and deploy the APD tool to all 300-plus English local authorities by 2027. Google Cloud provides the elastic computing infrastructure required to manage the thousands of concurrent inferencing queries generated during daily operations.

    Paul Maltby, Director of Public Services at Faculty, commented: “The English planning system is clogged up. Planning officers are forced to spend half their time reviewing applications to convert an attic, putting those for housing estates and warehouses on hold.

    “Built with planning officers, our AI system will take the drudgery out of reviewing simple planning applications so they can make quick decisions. It will let planning officers focus on the major developments that matter, and crucially, let families improve their homes without months of delay and uncertainty.”

    Naisha Polaine, Executive Director for Growth at Barnet Council, added: “The tool’s ability to collect relevant information, undertake a provisional assessment, and draft the foundations of a report has the potential to save significant officer time spent working on the administration of planning applications and direct this to speeding up the decision-making process for residents. In turn, this will contribute significantly to delivering our house building growth targets in the borough.”

    The coordination between MHCLG, i.AI, Google DeepMind, and Faculty establishes a structured division of labour for enterprise software engineering. Public ministries define the policy guidelines and statutory boundaries, while external technical partners engineer and deploy the underlying model architectures.

    The successful integration of these systems demonstrates the feasibility of hosting advanced language models within a secured public cloud infrastructure to process core administrative workloads and modernise public service delivery.

    See also: EU publishes its AI content labelling playbook ahead of the AI Act’s August deadline

    Banner for the AI & Big Data Expo event series.

    Want to learn more about AI and big data from industry leaders? Check out AI & Big Data Expo taking place in Amsterdam, California, and London. The comprehensive event is part of TechEx and is co-located with other leading technology events including the Cyber Security & Cloud Expo. Click here for more information.

    AI News is powered by TechForge Media. Explore other upcoming enterprise technology events and webinars here.

    The post Google Cloud generative AI automates council planning operations appeared first on AI News.

  • Aviva deploys AI to stop £230M in sophisticated insurance fraud

    Aviva has uncovered a record £230 million in insurance fraud claims and is using AI tools to counter the growing problem.

    The battleground has changed, and the culprits are also coming armed with a new generation of tools. We’re now in an environment where AI is being used not just to defend against fraud, but to perpetrate it.

    The insurance industry has long dealt with opportunistic dishonesty. A bumped car suddenly needs four new doors, or a minor slip becomes a life-altering injury. However, according to Aviva’s data, the nature of the deception is getting deeper, more sophisticated, and harder for the human eye to catch.

    Aviva is fighting fire with fire, deploying its own AI to uncover these elaborate schemes.

    Countering the AI-powered insurance fraud factories

    Aviva reports that scammers are now using AI to generate convincing fakes of car accident scenes. These aren’t clumsy photoshop jobs; they’re detailed, plausible images that can easily fool a claims handler working through a heavy caseload.

    The same generative AI tools are being used to create fake documents, from invoices for repairs that were never done, to medical reports that have no basis in fact. Fraudsters don’t need access to a network of corrupt garages or medical professionals to back up their story. They just need a subscription to an AI service and a bit of imagination. The AI handles the rest, producing official-looking documents that can pass a cursory inspection.

    An individual or small group can now generate the supporting evidence for dozens of high-value claims without ever leaving their desk. How do you validate reality when reality itself can be so easily and cheaply faked?

    Aviva’s response has been to build an AI-powered defence system that can operate at the same scale and speed as the threat. While the company is understandably tight-lipped about the exact architecture, you can piece together what a system like this needs to do.

    At its core, the AI detective carries out pattern recognition at scale. The AI sifts through millions of data points from current and past claims, learning what a legitimate claim looks like—and, more importantly, what it doesn’t.

    When a new claim comes in, the system is cross-referencing everything. Does the damage in the photo match the physics of the described accident? Do the timestamps on the documents make sense? Has this vehicle registration number appeared in other suspicious claims? Are the repair costs quoted on the invoice out of line with the thousands of other similar repairs in the database? It’s a level of forensic analysis that would be impossible to perform manually on every one of the thousands of claims filed each day.

    From organised crime to exaggerated claims

    It’s important to note that this isn’t all about organised criminal gangs. A portion of that £230 million figure comes from what the industry calls “claims inflation.”

    Claims inflation is the more common fraud where policyholders or service providers pad the bill. For instance, a garage might add unnecessary repairs to a quote, or an individual might exaggerate the value of items stolen in a burglary.

    Here, too, AI is proving to be a heavy-duty tool. By analysing vast datasets of repair costs and market values, the system can instantly flag when a quoted price is an outlier. It can compare the cost of a replacement part from one garage against the average from hundreds of others in the same region for the same make and model.

    The goal of Aviva’s AI isn’t to outright deny claims, it’s an augmentation tool for their human investigators. The AI acts as a filter, sifting through the noise to surface the most likely instances of fraud. This human-in-the-loop approach is essential for ensuring fairness and preventing the system from becoming a black box that makes decisions without oversight.

    What Aviva is doing provides a potential route for any customer-facing enterprise in the age of generative AI. The same technology that creates these threats is also the most effective way to combat them.

    As it becomes easier to fake everything from identities to invoices, the only viable defence is an intelligent system that can learn, adapt, and spot deception at a scale that humans alone can’t match.

    See also: Weis Markets adds Instacart AI-powered shopping carts to stores

    Banner for AI & Big Data Expo by TechEx events.

    Want to learn more about AI and big data from industry leaders? Check out AI & Big Data Expo taking place in Amsterdam, California, and London. The comprehensive event is part of TechEx and is co-located with other leading technology events including the Cyber Security & Cloud Expo. Click here for more information.

    AI News is powered by TechForge Media. Explore other upcoming enterprise technology events and webinars here.

    The post Aviva deploys AI to stop £230M in sophisticated insurance fraud appeared first on AI News.