Please use this identifier to cite or link to this item:
http://hdl.handle.net/10174/41181
|
| Title: | Field features: The impact in learning to rank approaches |
| Authors: | Yang, Hua; Gonçalves, Teresa |
| Issue Date: | 2023 |
| Publisher: | Elsevier |
| Citation: | Hua Yang, Teresa Gonçalves. Field features: The impact in learning to rank approaches,
Applied Soft Computing, Volume 138, 2023, 110183, ISSN 1568-4946.
https://doi.org/10.1016/j.asoc.2023.110183. |
| Abstract: | Learning to Rank approaches employ Machine Learning techniques for Information Retrieval. Traditionally, the features needed to train a ranking model are naively combined after being extracted from the various fields of the texts. Nevertheless, if not considered carefully, the learning process can make use of strongly correlated features. Moreover, the learned ranking models have not, to date, been systematically analyzed in terms of how the field-based features affect their performance. In this work, the impact of using field-based features in Learning to Rank approaches is investigated. Specifically, the Field Learning to Rank technique is proposed to study whether field-based features perform better than naively combined features. The experiments are conducted employing eight learning to rank approaches on two sizable benchmark datasets: MQ2007 and MQ2008. The models are assessed using three widely adopted Learning to Rank evaluation measures, namely Precision, Mean Average Precision, and Normalized Discounted Cumulative Gain. The results show that the use of field-based features achieves better performance than the naively combined features. Moreover, models aggregated from different fields further improve the ranking results. It is also observed that among the five examined fields, url and title are significantly more effective than wholedoc (full document), body, and anchor for building ranking models. Further, analyses indicate the existence of strong correlations between field features, such as the features from body and wholedoc, title and anchor, or title and url. The proposed Field Learning to Rank technique is shown to have the advantage of avoiding the combination of correlated features. These findings imply that the use of field-based features for training ranking models is valuable for enhancing the effectiveness of Learning to Rank approaches. |
| URI: | http://hdl.handle.net/10174/41181 |
| Type: | article |
| Appears in Collections: | VISTALab - Publicações - Artigos em Revistas Internacionais Com Arbitragem Científica
|
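The abstract names three evaluation measures: Precision, Mean Average Precision, and Normalized Discounted Cumulative Gain. As a rough illustration only, the sketch below computes them in Python for ranked lists of graded relevance labels. The function names, the exponential gain `2**rel - 1`, the `log2(rank + 1)` discount, and the sample labels are assumptions made here for illustration. They follow one common formulation and are not taken from the article, whose exact definitions may differ.

```python
import math
from typing import Sequence


def precision_at_k(rels: Sequence[int], k: int) -> float:
    """Fraction of the top-k results whose relevance label is > 0."""
    return sum(1 for r in rels[:k] if r > 0) / k


def average_precision(rels: Sequence[int]) -> float:
    """Mean of precision@i over the ranks i that hold a relevant document.

    Assumes the list contains every judged document for the query.
    """
    hits = 0
    total = 0.0
    for rank, r in enumerate(rels, start=1):
        if r > 0:
            hits += 1
            total += hits / rank
    return total / hits if hits else 0.0


def mean_average_precision(queries: Sequence[Sequence[int]]) -> float:
    """Average of the per-query average precision values."""
    return sum(average_precision(q) for q in queries) / len(queries)


def dcg_at_k(rels: Sequence[int], k: int) -> float:
    """Discounted cumulative gain (assumed gain 2**rel - 1, log2 discount)."""
    return sum(
        (2 ** r - 1) / math.log2(rank + 1)
        for rank, r in enumerate(rels[:k], start=1)
    )


def ndcg_at_k(rels: Sequence[int], k: int) -> float:
    """DCG@k divided by the DCG@k of the ideal (sorted) ordering."""
    ideal = dcg_at_k(sorted(rels, reverse=True), k)
    return dcg_at_k(rels, k) / ideal if ideal > 0 else 0.0


# Hypothetical relevance labels (0, 1, 2) in ranked order for one query.
ranking = [2, 0, 1, 0]
p2 = precision_at_k(ranking, 2)
ap = average_precision(ranking)
ndcg4 = ndcg_at_k(ranking, 4)
```

Each function takes the relevance labels in the order the model ranked the documents, so comparing two models on one query means calling the same function on each model's ordering of that query's labels.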
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.
|