Understanding Bias and Variance of Learning-to-Rank Algorithms: An Empirical Framework

Ibrahim, Muhammad (2022) Understanding Bias and Variance of Learning-to-Rank Algorithms: An Empirical Framework. Applied Artificial Intelligence, 36 (1). ISSN 0883-9514

[thumbnail of Understanding Bias and Variance of Learning to Rank Algorithms An Empirical Framework.pdf] Text
Understanding Bias and Variance of Learning to Rank Algorithms An Empirical Framework.pdf - Published Version

Download (3MB)

Abstract

Learning-to-rank (LtR) algorithms are at the heart of modern day information retrieval systems. While a good number of LtR algorithms have been developed and scrutinized over the past decade, theoretical underpinnings of these algorithms are not thoroughly investigated so far. Amongst the theoretical aspects of a supervised learning algorithm, the bias-variance profiles are of utmost importance. In this article we aim to better understand the bias-variance profiles of rank-learning algorithms. Firstly, we formalize the bias and variance from a pointwise perspective where each query-document pair is treated as an individual training example. Secondly, we develop a framework to analyze the variability and systematic error of a rank-learner in terms of its ranking error, i.e., we analyze the bias-variance from a listwise perspective where a query and all of its associated documents are treated as a single training example. After developing the theoretical framework, we move on to test its applicability in practice. In particular, we choose a promising algorithm, namely random forest-based rank-learning algorithms for our investigation. We study the effect of varying an important parameter of the algorithm, namely the sub-sample size used to learn each tree, on bias and variance. Our hypothesis is that as the sub-sample size (per tree) increases, classical bias-variance tradeoff should be observed. On two large LtR benchmark datasets, experimental results show that our hypothesis holds true. We also explain the relative performance of two of the top performing LtR algorithms using their bias and variance profiles. To the best of our knowledge, this article presents the first thorough investigation into bias and variance analysis of rank-learning algorithms.

Item Type: Article
Subjects: STM Digital Library > Computer Science
Depositing User: Unnamed user with email support@stmdigitallib.com
Date Deposited: 15 Jun 2023 07:32
Last Modified: 08 Jun 2024 08:04
URI: http://archive.scholarstm.com/id/eprint/1435

Actions (login required)

View Item
View Item