Consensus-Driven Metacognition in Multi-Agent Systems: A Logic-Based Byzantine Fault-Tolerant Protocol

Authors

DOI:

https://doi.org/10.63412/8vgf0b98

Keywords:

multi-agent systems, large language models, Byzantine fault tolerance, metacognition, defeasible reasoning, epistemic logic, consensus protocols, AI safety, auditability, hallucination mitigation, weighted voting, reputation systems

Abstract

Large language model (LLM) agents fail in a distinctive way: they produce confident wrong answers. Recent work has begun adapting Byzantine fault tolerance (BFT) to multi-LLM networks — notably the weighted BFT protocol of Wang et al. [1] and the Aegean consensus engine [2] — typically by treating agent trust as a scalar that flows continuously through the protocol.

This paper takes the opposite stance: discretization is a feature, not a bug. MBFT (Metacognitive BFT) commits a swarm decision via a small, finite ladder of confidence tiers, defeasible counterproofs, and a reputation-gated veto. The resulting protocol is auditable, deterministically replay able, and its safety / liveness claims are mechanically checkable — properties that continuous Bayesian aggregators struggle to provide. Bayesian and continuous trust schemes remain the right tool for noisy, well calibrated, high-volume regimes (recommender systems, sensor fusion, market making); we argue that high-stakes reasoning swarms — legal, medical, safety-critical agentic deployments — belong to the discrete, defeasible regime that MBFT formalizes. We accompany the paper with an open-source reference implementation whose property suite encodes each theorem as an executable test. 

Downloads

Published

2026-04-26

How to Cite

[1]
S. Bhattacharya, “Consensus-Driven Metacognition in Multi-Agent Systems: A Logic-Based Byzantine Fault-Tolerant Protocol”, IJGIS, vol. 3, no. 4, Apr. 2026, doi: 10.63412/8vgf0b98.

Similar Articles

11-20 of 28

You may also start an advanced similarity search for this article.