AI in Production Conference Summary (2026)

Wait 5 sec.

[This article was first published on The Jumping Rivers Blog, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here)Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.“AI In Production 2026”, a new conference hostedby Jumping Rivers, was held on 4-5 June, 2026. The conference touched on many of the intricacies ofusing AI in software products, as part of the developer toolkit, and at the wider usage of AIthroughout organisations. Themes that stood out in the talks this year included regulatory andsecurity matters, the deployment and scalability of AI products, how to incorporate customdata/knowledge, and how to make use of AI tooling to simplify data-intensive tasks.In addition to the talks (held on 5th June), there was aday of workshops on the 4thJune.AI within the software-development lifecycleAI-based tooling has revolutionised software development during the 2020s.Colin Eberhardt (Scott Logic) highlighted steps in the evolution ofthe AI developer since 2022 (“Agentic AI and the Future of Software Development”).In 2022, AI could be used as an auto-completion tool with assistive code suggestions.With CoPilot (GitHub) and the launch of Claude (Anthropic), by 2023 AI tools could function asa virtual pair-programmer (proposing code, finding and fixing bugs in new code), though anycontributions needed thorough code review.Since 2024, AI tools have increasingly been used as autonomous contributors or as a componentof continuous code-review operations.This transition is having an impact throughout software organisations:How should a junior, or newly-onboarded, developer get up-to-speed with a project?Where should developers focus their time and effort?How should projects, teams and organisations be structured to take best advantage of AI tooling?On the first point, Rebecca Guiney (Certara) gave an interestinglightning talk (“Learning in Production: Becoming a software engineer during the AI wave”).In a recent project, she used AI to build a tool for schema discovery (over clinical trialdatabases). The tool can work reproducibly on new data.Rebecca highlighted how “contract-first development” is important when building tools using AIassistance. That is, when agents can write code and/or tests, an important role of the developer isto control what the different components of a system are reponsible for, and how they interact.Similarly, when reviewing code a greater emphasis is placed on the purpose of what was implementedthan on how it works. This emphasis on higher-level thinking is also impacting how new developerslearn – learning from AI-generated example code, rather than from step-by-step tutorials.Rebecca won the(CRC-Press sponsored) prize for the best lightning talk.Since the AI toolkit is relatively new, many projects exist upon which AI has never been used.When looking at these projects, Claude (or similar) could find countless issues.So how do you prioritise what to fix first (when you may also have a list of bugs and featureswaiting to be addressed)?Badr Adnani (Roundel Kitchens) gave a lightning talkexplaining how his use of AI helped identify and fix a critical bug in a SQL command(“Breaking Barriers with AI Automation”). The bug itself, could have posed a real financial risk tothe company.As part of his talk, Badr split up the audience by their level of AI usage.Most of our attendees were daily users of AI for well-defined tasks.Had Badr not increased his usage of AI at work, he may not have fixed the above bug (and gotpromoted), and he urged us all to experiment more with AI in our daily work.So how do we broaden the use of AI within programming teams?Firstly we make the tools easy to use, and work out how to use them cheaply and effectively.The Positron IDE is becoming a very popular system for development indata science and allied fields.George Stagg (Posit) gave the closing talk of the meeting describing how thePosit Assistant came into existence(“A Builder’s Guide to Working with AI”).Posit Assistant is an AI agent available in RStudio and Positron, with which a user can exploretheir data and generate analytical dashboards via a conversational interface.Building the Posit Assistant helped George understand AI systems like Claude.These systems can lie to you (or your users), for example, if you ask an AI tool to summarise thetrend in a dataset, it may give you a good enough answer.But it could be guessing – it might not even have seen the data – but may still give you an answer.So it’s sometimes important to check the raw messages that are sent to an AI tool and havean independent way to verify agent outputs.There are existing standards that make using AI coding tools more effective.Project-level definitions of how certain tasks should be performed, or how the project isstructured, can be put into an AGENTS.md file or a ‘skills’ directory.George explained that it is important to make use of these common standards when building adeveloper-facing agent.If you want to limit the cost of AI-usage in a project, limit the amount of data that is sentto- and from- your agent.This could be by cacheing previous messages – because otherwise you may be sending all of yourprevious conversation each time you interact with an agent.AI usage in specific projectsThere are many ways in which AI can be used across organisations.In addition to its use as an aid in software development (above) and within search-engine summaries,it is extremely common to find people using AI to write emails, to summarise meetings, to annotatevideos and speak and to perform a range of other workplace tasks.Nayara Macedo de Medeiros Albrech(Newcastle University) gave aninteresting summary of her research on the use of AI in local government and education(“AI Beyond Industry: Insights from Higher Education and Government”).Government decision making has a focus on ’evidence-based policy’, so AI could contribute in thisarea, but there is a need to prevent AI from introducing bias into decision making.The talk emphasised ethics in the public sector use of AI and the variability in take-up of AItooling across different local government sectors.Continuing on the theme of AI use in government, Katy Morgan introduced some self-serve tools thattheGovernment Internal Audit Agencyhave generated to aid the production of internal audits (“From risks to insights: Driving innovationwith AI-powered tools in internal audit”).The use of AI in these products ranged from highly-guided settings (where user-selection from afixed collection of application-encoded choices creates LLM prompt input automatically), to moreflexible use (where an auditor could freely interact with a chat engine).With these tools, users are encouraged to use their own judgement to interpret any AI-generatedoutput.So AI-embedded tools can fix your code and reduce administrative work.Can AI help resurrect old projects?Shona Ferguson outlined an interesting use-case for LLMs in data-rescue efforts at theUK Centre for Ecology and Hydrology(“Comparing and Evaluating Large Language Models for Efficient and Responsible Data Rescue”).Preservation of historical data resources is a time-consuming, and largely manual, task.The typical steps involve defining some quality checks, writing a data-cleaning script, findingsupporting publications and creating documentation.Shona demonstrated how effective three different LLM tools (free versions of CoPilot, ChatGPT andClaude Sonnet) could be in a project to rescue a dataset related to cloud and rain chemistry.The tools were particularly effective at code generation and making draft documentation.When approaching this type of project, recommendations are to give the LLM prompt as much contextabout the project as possible, to tell the LLM ’not to guess’ results and to be very explicitabout the form of output you expect the LLM to return.Sometimes your data is so large that it wouldn’t be possible to analyse it manually.The team from Wordnerds (Izzie Johnson and Damani Richards) gave atalk that touched on some novel themes for our conference(“A Nerd’s eye view: wrangling the GenAI hype cycle and refusing to become irrelevant”).One of the powers of LLMs is their ability to accept natural language as input.But natural language is complex and there’s an awful lot of natural language in the world.For text-analytics specialists, processing customer reviews or other large-scale collections of textusing LLMs could be pretty expensive: both economically and ecologically.If you want to process natural language efficiently at-scale, maybe an LLM isn’t the right tool touse.Izzie and Damani explained a hybrid approach, using both the Gemini LLM to identify and tag trainingdata, and smaller custom-trained neural networks to analyse millions of records efficiently.Just as human language is a complicated world, so is human behaviour.Grant Beasley from tombola, alongside our own Myles Mitchell (JumpingRivers), presented a project where they used deep-learning to assess player behaviour in theironline betting platform (“Using Deep Learning to monitor player safety on online betting platforms”).tombola monitors players over time, to help identify people at risk of gambling addiction, and tointervene where appropriate.The models used here are not large language models, they are deep neural networks trained ontime-series data.Training this model over large numbers of players was a substantial undertaking.But the tombola project highlighted how, if you are planning to build a model on this scale, it isimportant that you start with simpler benchmark models to compare your larger model against.William Kirby (Wessex Water) presented a lightning talk coveringanother project where real-time interventions can impact on human behaviour.This time the subject was wild-swimming and water quality in inland bathing waters in the UK(“Near Real Time Notifications for Bathing Waters using Machine Learning”).The project integrates data on water quality and climate with sensor and sampling data, and iscapable of classifying risk at their pilot site (Warleigh Weir, near Bath) on a 15 minuteturnaround. They have developed a web application to presenttheir results and are scaling out to additional sites over the next two years.Incorporating custom information into AI-toolsAgentic systems are clearly very powerful, but they can become far more useful in a project orproduct when the data they have access to is tailored to that project.Three talks highlighted different ways to connect an LLM to a custom data source or an externalservice so that the agent can be better informed.The standard way to enhance an LLM with a custom data source, is by using“Retrieval-Augmented Generation”(RAG). Obinna Iheanachor (Rotork; and host of theWisabi Analytics YouTube channel) gave a lightning talkexplaining a two-step AI-driven process where he used RAGs to identify documents related tocorporate insolvency cases, and then to identify evidence within those documents(“RAG in the Real World: Designing Trustworthy LLM Systems for Corporate Insolvency Data”).The aim here was to identify citable evidence, rather than plausible-sounding and eloquently writtenLLM outputs, so the use of RAGs was essential.RAG is a way of embedding document-derived text in a way that can be searched by an LLM.An extension of this concept is the GraphRAG, where relationships betweenentities in a series of documents can also be represented.Jonny Law (Neo4J) presented a talk that explained many of the conceptsunderpinning GraphRAG (“Engineering a Scalable Knowledge Graph Builder on Neo4J Cloud”).Neo4J have developed a “Knowledge Graph Builder” that allows data and relationships to be extractedfrom documents and represented in a graph structure.This graph structure can be queried using an LLM by converting an LLM query into a GraphRAG query.Another way to connect an agent to an external data source or service is the“Model Context Protocol” (MCP).MCP is an open standard and was the subject of Neal Richardson’s (Posit) talk(“MCP, or not MCP”).As a motivation, Neal explained that for security reasons, there are external resources that anAI agent shouldn’t be given access to.For example, you might not want an LLM to be able to read and write to the support message-boardfor your software, or to read any file on your computer.MCPs use a standard API and JSON-based interchange format, but they make it easier to restrict thereach of an AI system.OAuth 2.1 can be used to register an LLM with an MCP, meaning that API secrets needn’t be availableto the LLM – so it can’t leak these.Recommendations for building MCPs, are to only define the tools you actually need, to make APIresponses concise and to sanitise any sensitive data that the MCP might return.Development, deployment and auditing of AI-embedded productsWe have seen software-development tools that make use of AI; specific projects that use AI and/orML; analysis of, and recommendations for, the use of AI; and ways to link additional data toAI tools.But before you incorporate AI into all your workflows, you need to take a step back and consider howto implement everything in a way that’s cost-effective, safe and legal.First we need to consider what we’re allowed to do with AI and the liabilities we create when we useit on the public.Nathan Bilton (Weightmans) provided a thought-provoking insightinto how legislation is evolving to match the growth of AI (“Regulating Artificial Intelligence”).Given that AI can take in vast quantities of personal data and intellectual property duringtraining, does existing legislation provide enough protection?And even if we agree on a better set of rules, how do we enforce them?And who will be the enforcers?Some of these questions may feel out of your remit, and something for your elected lawmakers todebate over.But keeping abreast of these developing laws remains relevant to your deployment of AI,both now and in the uncertain future.The EU AI Act 2024already applies a number of restrictions on how AI can be used, with extraterrestrial effects.If you feed personal data of UK or EU citizens into AI, then GDPR rules still apply.And if you do allow AI to interface with the public on your behalf, are you responsible if things gowrong?Here Nathan cites the case ofMoffatt v. Air Canada, 2024where a Civil Resolution Tribunal found Air Canada to beliable for damages due to “negligent misrepresentation”after incorrect advice was given by a chatbot on their website.Next, how do you ensure your product’s use of AI is secure?Mac Misiura (Red Hat) presented the first technical talk of the day(“Open Source Guardrails for AI: Securing LLM Applications at Scale”).If you are planning to deploy an AI-backed application to production, I strongly urge you to watchthis talk (soon to be on the Jumping RiversYouTube channel).Mac highlighted some of the top risks that an LLM-based app exposes you to.There is a top-10 list hosted by OWASP, that covers thingslike disclosure of sensitive information, and spreading of misinformation.Not addressing these risks can expose you to reputational damage and adversarial attacks.To help secure your app, you can use “Guardrails”, which filter out inappropriate queries.These can range in complexity from rules and regular expressions up to LLM-based assessment ofthe intent of the query (use the simplest guardrails first).There are open-source guardrail collections that you can use with your application, for example,the NVIDIA NeMo Guardrails.Mac also argued that you should run attacks against your own system to identify security weak spots.You may have multiple products in the pipeline.Diego Jimenez and Oliver Thomas from Sage covered their approachto product delivery using what they described as a “repeatable path for controlled AI at enterprisescale” (“Beyond the POC: Architecting Enterprise-Grade Agentic Systems at Sage”).This approach allows them to rapidly iterate from prototype to scalable product, by using commondesigns across those products.They emphasised the importance of monitoring agent behaviour, and similarly to Mac Misiura (above)the use of safety guardrails across their products.Finally, once your app is in production, you’ll need to start paying for tokens.Seb Ringrose (Doubleword) described“Doubleword Code”, a system developed by Doubleword thatperforms pull-request reviews on repositories.During the talk, Seb identified a few challenges with the current use of AI agents: how to selectwhich model to use (from a continually changing collection of models), how much context to provideto an agent, what ancillary tooling to use, and—importantly—the cost of using these agents.Given these desirable qualities of agents – low latency, low cost and high quality – Seb arguedthat you can only have two in any AI system.So if you need a real-time system, then you have to choose between high cost or low quality.But not all systems need to run in real-time.For the pull-request review system, it can run asynchronously or as a batch process, and by doing socan lead to cost savings.Given that for many settings, the number of tokens used per task is increasing, finding ways toreduce cost is becoming extremely important.SummaryThis years talks at “AI In Production” covered a wide spectrum of the uses of AI (and advancedmachine learning) in software products and data projects across several domains. We discussedbest practices for securely releasing AI-embedded software into the world, regulatory matters,and how we can evolve as software professionals as AI tooling becomes commonplace.Jumping Rivers would like to thank everyone who presented talks or workshops at the conference (orsubmitted abstracts), all of the sponsors and the attendees for making this an interestingconference. We hope to see you again next year atAI In Production 2027.The talks from the conference will be added to the Jumping RiversYouTube channel soon.SponsorsJumping RiversDatabricksThe Royal Statistical SocietyPositCRC PressThe National Innovation Centre for Data (NICD)Community PartnersDevITJobs.ukFor updates and revisions to this article, see the original postTo leave a comment for the author, please follow the link and comment on their blog: The Jumping Rivers Blog.R-bloggers.com offers daily e-mail updates about R news and tutorials about learning R and many other topics. Click here if you're looking to post or find an R/data-science job.Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.Continue reading: AI in Production Conference Summary (2026)