What can online reinforcement learning with function approximation
  benefit from general coverage conditions?

What can online reinforcement learning with function approximation benefit from general coverage conditions?

Papers citing "What can online reinforcement learning with function approximation benefit from general coverage conditions?"