Generative AI for Sustainable Proteins: New Framework from King’s College London

A recent study in Carbon Capture Science & Technology (Kalian et al., 2025) presents a novel multi-agent AI framework tailored to accelerate research into sustainable microbial protein production. The system employs GPT-based agents to automate literature retrieval, data extraction, and even toxicity screening in the context of waste-carbon-utilising microbial fermentation.

10/2/20251 min read

open book lot
open book lot

By benchmarking fine-tuning against structured prompt engineering, the researchers demonstrate notable gains in accuracy (cosine similarity ≥0.94 for fine-tuned models), while highlighting trade-offs in generalisability and computational cost. The modular interface and open-source microservices provide a pathway towards integration with industrial R&D pipelines.

Kalian, A. D., Lee, J., Johannesson, S. P., Otte, L., Hogstrand, C., & Guo, M. (2025). From LLMs to sustainable proteins: Fine-tuning and prompt engineering for multi-agent AI in waste carbon-utilising microbial protein production. Carbon Capture Science & Technology, 100525. https://doi.org/10.1016/j.ccst.2025.100525

Here is a blog-style summary in UK academic English, based on the uploaded article (From LLMs to Sustainable Proteins: Fine-Tuning and Prompt Engineering for Multi-Agent AI in Waste Carbon-Utilising Microbial Protein Production).

  • Year and geographical scope
    2025. Research conducted at King’s College London (UK), though the scope is global with applications in protein production and circular bioeconomy.

  • Generative AI technology
    Multi-agent AI framework built on GPT-based LLMs. Two agents:
    (1) Literature search agent (GPT-4o) optimising microbial strain queries.
    (2) Information extraction agent (GPT-4.1) tested under fine-tuning vs. prompt engineering.
    Applications: automated retrieval of microbial protein data (protein yield, trophic mechanism, substrates, safety/toxicity screening).

  • Research stage
    Proof-of-concept system tested in silico. Includes prototype web interface for interactive use. No industrial pilot yet.

Implementation insight

  • Levels and barriers
    Technological: dependence on closed-source GPT models, trade-off between fine-tuning accuracy and generalisability.
    Organisational: need for integration into R&D pipelines, governance of AI-generated insights, domain expertise to curate prompts/training.
    Barriers: fragmented microbial protein literature, risk of hallucinations, computational costs of fine-tuning.

  • Organisational conclusions
    Multi-agent AI can significantly accelerate discovery in CCU-integrated protein production by automating literature mining and synthesis. A dual-track optimisation strategy (prompt design plus targeted fine-tuning) is recommended. The modular design (APIs, dashboards, toxicity module) allows integration into existing research and industrial innovation pipelines.

  • Economic/financial conclusions
    While no ROI data are presented, the framework promises:

    • Reduced costs of manual literature review and trial-and-error strain selection.

    • Potential acceleration of strain optimisation, lowering time-to-market for sustainable protein.

    • Future industrial use could reduce R&D overheads and improve process economics, though scaling costs and data governance remain challenges.