http://chineseinput.net/에서 pinyin(병음)방식으로 중국어를 변환할 수 있습니다.
변환된 중국어를 복사하여 사용하시면 됩니다.
Automatic Adaptive Space Segmentation for Reinforcement Learning
Yuki Komori,Akira Notsu,Katsuhiro Honda,Hidetomo Ichihashi 한국지능시스템학회 2012 INTERNATIONAL JOURNAL of FUZZY LOGIC and INTELLIGE Vol.12 No.1
We tested a single pendulum simulation and observed the influence of several situation space segmentation types in reinforcement learning processes in order to propose a new adaptive automation for situation space segmentation. Its segmentation is performed by the Contraction Algorithm and the Cell Division Approach. Also, its automation is performed by “entropy,” which is defined on action values’ distributions. Simulation results were shown to demonstrate the influence and adaptability of the proposed method.
Automatic Adaptive Space Segmentation for Reinforcement Learning
Komori, Yuki,Notsu, Akira,Honda, Katsuhiro,Ichihashi, Hidetomo Korean Institute of Intelligent Systems 2012 INTERNATIONAL JOURNAL of FUZZY LOGIC and INTELLIGE Vol.12 No.1
We tested a single pendulum simulation and observed the influence of several situation space segmentation types in reinforcement learning processes in order to propose a new adaptive automation for situation space segmentation. Its segmentation is performed by the Contraction Algorithm and the Cell Division Approach. Also, its automation is performed by "entropy," which is defined on action values’ distributions. Simulation results were shown to demonstrate the influence and adaptability of the proposed method.
Imanishi, Toshiaki,Hanai, Taizo,Aoyagi, Ichiro,Uemura, Jun,Araki, Katsuhiro,Yoshimoto, Hiroshi,Harima, Takeshi,Honda , Hiroyuki,Kobayashi, Takeshi The Korean Society for Biotechnology and Bioengine 2002 Biotechnology and Bioprocess Engineering Vol.7 No.5
In order to control glucose concentration during fed-batch culture for antibiotic production, we applied so called “software sensor” which estimates unmeasured variable of interest from measured process variables using software. All data for analysis were collected from industrial scale cultures in a pharmaceutical company. First, we constructed an estimation model for glucose feed rate to keep glucose concentration at target value. In actual fed-batch culture, glucose concentration was kept at relatively high and measured once a day, and the glucose feed rate until the next measurement time was determined by an expert worker based on the actual consumption rate. Fuzzy neural network (FNN) was applied to construct the estimation model. From the simulation results using this model, the average error for glucose concentration was 0.88 g/L. The FNN model was also applied for a special culture to keep glucose concentration at low level. Selecting the optimal input variables, it was possible to simulate the culture with a low glucose concentration from the data sets of relatively high glucose concentration. Next, a simulation model to estimate time course of glucose concentration during one day was constructed using the on-line measurable process variables, since glucose concentration was only measured off-line once a day. Here, the recursive fuzzy neural network (RFNN) was applied for the simulation model. As the result of the simulation, average error of RFNN model was 0.91 g/L and this model was found to be useful to supervise the fed-batch culture.
Takeshi Kobayashi,Toshiaki Imanishi,Taizo Hanai,Ichiro Aoyagi,Jun Uemura,Katsuhiro Araki,Hiroshi Yoshimoto,Takeshi Harima,Hiroyuki Honda 한국생물공학회 2002 Biotechnology and Bioprocess Engineering Vol.7 No.5
In order to control glucose concentration during fed-batch culture for antibiotic production, we applied so called “software sensor” which estimates unmeasured variable of interest from measured process variables using software. All data for analysis were collected from industrial scale cultures in a pharmaceutical company. First, we constructed an estimation model for glucose feed rate to keep glucose concentration at target value. In actual fed-batch culture, glucose concentration was kept at relatively high and measured once a day, and the glucose feed rate until the next measurement time was determined by an expert worker based on the actual consumption rate. Fuzzy neural network (FNN) was applied to construct the estimation model. From the simulation results using this model, the average error for glucose concentration was 0.88 g/L. The FNN model was also applied for a special culture to keep glucose concentration at low level. Selecting the optimal input variables, it was possible to simulate the culture with a low glucose concentration from the data sets of relatively high glucose concentration. Next, a simulation model to estimate time course of glucose concentration during one day was constructed using the on-line measurable process variables, since glucose concentration was only measured off-line once a day. Here, the recursive fuzzy neural network (RFNN) was applied for the simulation model. As the result of the simulation, average error of RFNN model was 0.91 g/L and this model was found to be useful to supervise the fed-batch culture.