When several radio access technologies (e.g., HSPA, LTE, WiFi, and WiMAX) cover the same region, deciding to which one mobiles connect is known as the Radio Access Technology (RAT) selection problem. To reduce network signaling and processing load, decisions are generally delegated to mobile users. Mobile users aim to selfishly maximize their utility. However, as they do not cooperate, their decisions may lead to performance inefficiency. In this paper, to overcome this limitation, we propose a network-assisted approach. The network provides information for the mobiles to make more accurate decisions. By appropriately tuning network information, user decisions are globally expected to meet operator objectives, avoiding undesirable network states.
Deriving network information is formulated as a semi-Markov decision process (SMDP), and optimal policies are computed using the Policy Iteration algorithm. Also, and since network parameters may not be easily obtained, a reinforcement learning approach is introduced to derive what to signal to mobiles. The performances of optimal, learning-based, and heuristic policies, such as blocking probability and average throughput, are analyzed. When tuning thresholds are pertinently set, our heuristic achieves performance very close to the optimal solution. Moreover, although it provides lower performance, our learning-based algorithm has the crucial advantage of requiring no prior parameterization.