Reinforcement learning is a machine learning method which aims to determine a policy to get optimal
actions in dynamic and stochastic environments. But reinforcement learning has high computational
complexity and needs a lot of time to get solution, so it is not easily applicable to uncertain and
continuous environments. To tackle the complexity problem, AC (actor-critic) method is used and it
separates an action-value function into a value function and an action decision policy. Also, in transfer
learning method, the knowledge constructed in one environment is adapted to another environment, so it
reduces the time to learn in a reinforcement learning method. In this paper, we present AC method and
transfer learning method to solve the problem of a reinforcement learning method. Finally, we analyze the
case study which a transfer learning method is used to solve BS(base station) switching problem in
wireless access networks.