Towards better systems for complex search tasks

Bah, Ashraf

Towards better systems for complex search tasks

Author(s)	Bah, Ashraf
Date Accessioned	2018-07-12T11:27:18Z
Date Available	2018-07-12T11:27:18Z
Publication Date	2016
Abstract	When users interact with search engines, in a large number of cases, they first formulate a query and examine the results, and then reformulate it one or several more times until they either satisfy their information need or give up. Complex search tasks fall into those cases. Unlike simpler tasks in which users are looking for a particular homepage or a particular single piece of information or an answer to a single specific question, complex search tasks often span multiple search queries (i.e. a sequence of queries) and can span multiple sessions (i.e. multiple sequences of queries). ☐ In this thesis, we present several efforts for building more effective and robust retrieval systems for complex search tasks in situations where we have only small amounts of search history data. We first start by investigating and understanding users’ preferences with respect to document comprehensiveness and topical relevance grade. ☐ Then, using our findings from that experiment, we introduce heuristic data fusion methods to improve search results in a search session by leveraging most recent search history and query logs. ☐ Next, we go beyond simple average effectiveness by considering risk-sensitivity as an essential part of our retrieval systems. For that purpose, we present re-ranking approaches that exploit the “popularity” of documents and we show that they produce results with improved robustness and effectiveness over a variety of retrieval systems used as baselines. Risk-sensitive ranking (or robustness-aware ranking) focuses on improving the robustness of the system by minimizing the risk of obtaining, for any topic, a result subpar with that of the baseline system. In other words, robustness refers to the ability of the ranker to reduce and mitigate poor performance on certain individual queries while striving to improve the overall performance as well. ☐ Our next endeavor consists in going beyond heuristic retrieval models. For that purpose, we propose a probabilistic data fusion framework for retrieval and ranking inspired by the well-known probability ranking function, and we use it to solve search over sessions, as well as ad hoc search, novelty and diversity search. ☐ Finally, in order to achieve high effectiveness for search over a session even in the absence of search history, we propose to simulate search interactions that provide data similar to what we could have obtained if a user were to have prior interactions with the search engine (previous queries, top results returned for previous queries, etc.).	en_US
Advisor	Carterette, Benjamin A.
Degree	Ph.D.
Department	University of Delaware, Department of Computer and Information Sciences
Unique Identifier	1043913594
URL	http://udspace.udel.edu/handle/19716/23622
Publisher	University of Delaware	en_US
URI	https://search.proquest.com/docview/1868510629?accountid=10457
Keywords	Applied sciences	en_US
Keywords	Complex search task	en_US
Keywords	Diversity and novelty search	en_US
Keywords	Information retrieval models	en_US
Keywords	Robust search systems	en_US
Keywords	Session search	en_US
Keywords	Simulation of query reformulations	en_US
Title	Towards better systems for complex search tasks	en_US
Type	Thesis	en_US

Files

Original bundle

Now showing 1 - 1 of 1

Name:: BahRabiou_udel_0060D_12737.pdf
Size:: 2 MB
Format:: Adobe Portable Document Format
Description:

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 2.22 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Doctoral Dissertations (Winter 2014 to Present)