The theoretical analysis shows that EDIS achieves lower suboptimality than either using only online data or directly reusing offline data. EDIS is a plug-in approach and can be combined with existing methods in the offline-to-online RL setting. By applying EDIS to the off-the-shelf methods Cal-QL and IQL, we notic…