This study investigated the process stability of anaerobic digestion and enhanced intelligent control through applications of machine learning. One stability indicator characterized the dynamic balance of anaerobic biochemical reactions by introducing...
This study investigated the process stability of anaerobic digestion and enhanced intelligent control through applications of machine learning. One stability indicator characterized the dynamic balance of anaerobic biochemical reactions by introducing Recovery Potential (RP) and Deterioration Potential (DP). Another stability indicator diagnosed the state of anaerobic digestion through a comprehensive indicator derived using Principal Component Analysis (PCA). A deep learning model combining a Convolutional Neural Network (CNN) and Bidirectional Long Short-Term Memory (BdLSTM) utilized real-time sensor data to provide insights into system state and performance, ensuring economical and stable digester operation. Finally, intelligent control of anaerobic digestion was implemented using a Deep Q-Network (DQN) reinforcement learning model, integrating stability indicators with the CNN-BdLSTM network.
RP and DP were formulated to shed light on the kinetic balance between anaerobic biochemical reactions. RP is gauged by the ratio of the methanogenesis rate (MR) to the acidogenesis rate (AR), while the DP is the sum of the accumulation rate (AcR) and dilution rate (DR) of total VFAs, normalized using the AR. In an anaerobic digester for a mixture of pulverized food waste and liquified sewage sludge, an RP above 1.0 signifies a restorative state in the kinetic balance of anaerobic biochemical reactions across various operational phases, including startup and steady state, and shifts in organic loading rate. Conversely, a DP value of 0.0 or higher denotes a deterioration in the kinetic balance. The instability index (ISI), calculated as the DP to RP ratio, serves as an indicator of an anaerobic digestion state. When the standard deviation of ISI surpasses 0.2, it signifies instability in biochemical reactions; however, an average ISI below 0.05 indicates a stable digestion process. The study underscores the efficacy of RP, DP, and ISI as robust indicators for assessing the stability of anaerobic digestion based on the kinetics of biochemical reactions.
A comprehensive indicator based on PCA has been proposed for diagnosing the state of anaerobic digestion. Various state and performance variables were monitored under different operational modes, including start-up, interruption and resumption of substrate supply, and impulse organic loading rates. While these individual variables are useful for estimating the state of anaerobic digestion, they must be interpreted by experts. Coupled indicators combine these variables with the effect of offering more detailed insights, but they are limited in their universal applicability. Time-series eigenvalues reflected the anaerobic digestion process occurring in response to operational changes: Stable states were identified by eigenvalue peaks below 1.0, and they had an average below 0.2. Slightly perturbed states were identified by a consistent decrease in eigenvalue peaks from a value of below 4.0 or by observing isolated peaks below 3.0. Disturbed states were identified by repeated eigenvalue peaks over 3.0, and they had an average above 0.6. The long-term persistence of these peaks signals an increasing kinetic imbalance, which could lead to process failure. Ultimately, this study demonstrates that time-series eigenvalue analysis is an effective comprehensive indicator for identifying kinetic imbalances in anaerobic digestion.
The immediate response to the state disturbances of anaerobic digestion is essential to prevent anaerobic digestion failure. However, frequent monitoring of the state and performance of anaerobic digestion is challenging. Thus, deep learning models were investigated to predict the state and performance variables from online sensor data. The online sensor data, including pH, electric conductivity, and oxidation-reduction potential, were used as the input features to build deep learning models. The state and performance data measured offline were used as the labels. The model performance was compared for several deep learning models of CNN, LSTM, dense layer, and their combinations. The combined model of CNN and BdLSTM was robust and well-generalized in predicting the state and performance variables (R2=0.978, root mean square error=0.031). The combined model is an excellent soft sensor for monitoring the state and performance of anaerobic digestion from electrochemical sensors.
Reinforcement learning (RL) based on a deep Q-network (DQN) was studied to enable intelligent control of anaerobic digestion processes. Anaerobic digesters operated under statistically designed organic loading rate (OLR) conditions provided sensor data on process states and performance. Variable importance analysis identified key RL components—pH, EC, and ORP as states; OLR (flow rate and COD) as actions; and total reward combining stability and methane production. A deep learning-based environment model was trained to simulate process dynamics, predicting the next states and total reward based on the current states and actions. The architecture of the DQN with ε-greedy and prioritized experience replay was optimized by interacting with the environment model. Offline training effectively pre-trained model parameters, enhancing initial learning performance. The pre-trained DDQN was activated above a total reward threshold, stabilizing process instability and improving methane production under variable OLR conditions. The dueling DDQN (TDQN) showed slower pre-training but rapidly adapted to variability, stabilizing the process and significantly improving methane production. Both pre-trained DDQN and TDQN provide intelligent control frameworks for optimizing anaerobic digestion performance under variable OLR conditions.