avatar

Imam Firdaus

Electrical Engineer

Publication: MazeCov-Q: An Efficient Maze-Based Reinforcement Learning Accelerator for Coverage

MazeCov-Q: An Efficient Maze-Based Reinforcement Learning Accelerator for Coverage 🔗Abstract: Reinforcement learning (RL) is an unsupervised machine learning that does not requires pre-assigned labeled data to learn. It is implemented in many areas such as robotics, games, finances, health, transportation, and energy applications. In this paper, we present an application of reinforcement learning accelerator for finding coverage area and its implementation in a mobile robot called MazeCov-Q (Maze-Based Coverage Q-Learning). We define a novel state that is divided into two conditions.

Bayesian Optimization

Bayesian optimization is an optimization algorithm that uses Bayes Theorem to guide the sampling process to find the function minima/maxima. Bayesian optimization is best suited for optimization over continuous domain of less than 20 dimensions, and tolerates stochastic noise in the evaluation. It builds a surrogate for the objective and quantifies the uncertainty in that surrogate using a Bayesian technique. After that the method uses an acquisition function defined from the surrogate function to decide which point to sample. One way to perform bayesian optimization is using scikit-optimize with python. Scikit-optimize can be used to optimize the function with single parameter or multiple parameter.

Publication: Design of Testing Environment for Line-Follower Robot with Obstacles

Design of Testing Environment for Line-Follower Robot with Obstacles 🔗Abstract: To solve shortest path problem using Q-Learning, one testing method is needed. The testing method is showing Q-Learning result using robot and arena with obstacles. This article will explain obstacles and challenges related to the robot kinematics and arena design to fulfill those needs. To be specific, the robot is needed to move in particular manner in testing ground so the test can be conducted properly and the result can be visualized.