De-Biased Modeling of Search Click Behavior with Reinforcement Learning
Users' clicks on web search results are one of the key signals for evaluating and improving web search quality, and have been widely used as part of current state-of-the-art Learning-To-Rank (LTR) models. With a large volume of search logs available for major search engines, effective models of searcher click behavior have …