A SURVEY OF FACETED SEARCH
Keywords:
Facet, Faceted Search, Faceted Taxonomy, MetricsAbstract
Faceted Search is an exploratory search mechanism, which provides an iterative way to refine search results by a faceted taxonomy. With the benefit of search results diversification, no need for a priori knowledge, and never leading to zero result, it can significantly reduce information overload. Faceted Search has witnessed a booming interest in the last ten years. In this paper, we first analyze the representative facet search models. Next, we present a general faceted search framework, and survey the related methods and techniques, including facet term extraction, hierarchy construction, compound term generation and facet ranking. Then we discuss the metrics for faceted search evaluation, and also highlight the main characteristics of a number of existing faceted search systems. Some directions for future research are finally presented.
Downloads
References
Furnas, G.W., et al., The Vocabulary Problem in Human System Communication.
Communications of the ACM, 1987. 30(11): p. 964-971.
Dumais, S., Cutrell, E., and Chen, H., Optimizing search by showing results in context, in
Proceedings of the SIGCHI conference on Human factors in computing systems. 2001, ACM:
Seattle, Washington, United States. p. 277-284.
Zwol, R.v., et al., Faceted exploration of image search results, in Proceedings of the 19th
international conference on World Wide Web. 2010, ACM: Raleigh, North Carolina, USA. p.
-970.
Tunkelang, D., Dynamic Category Sets: An Approach for Faceted Search, in Proceedings of the
ACM SIGIR'06 Workshop on Faceted Search. 2006: Seattle, WA, USA.
Bergamaschi, S., Guerra, F., and Leiba, B., Information Overload Internet Computing, 2010.
(6): p. 10-13.
Kashyap, A., Hristidis, V., and Petropoulos, M., FACeTOR: Cost-Driven Exploration of Faceted
Query Results, in ACM Conference on Information and Knowledge Management (CIKM).
: Toronto, Ontario, Canada.
Marshall, P., Herman, S., and Rajan, S., In search of more meaningful search. Serials Review,
32(3): p. 172-180.
Zhang, L. and Zhang, Y., Interactive retrieval based on faceted feedback, in Proceeding of the
rd international ACM SIGIR conference on Research and development in information
retrieval. 2010, ACM: Geneva, Switzerland. p. 363-370.
Li, C., et al., Facetedpedia: dynamic generation of query-dependent faceted interfaces for
wikipedia, in Proceedings of the 19th international conference on World Wide Web. 2010,
ACM: Raleigh, North Carolina, USA. p. 651-660.
Hearst, M., UIs for Faceted Navigation: Recent Advances and Remaining Open Problems, in
HCIR08 Second Workshop on Human-Computer Interaction and Information Retrieval. 2008:
Redmond, WA.
eBay.Available from: http://www.ebay.com/.
Amazon.com. Available from: http://www.amazon.com.
Diederich, J. and Balke, W., FecetedDBLP - Navigational Access for Digital Libraries. Bulletin
of the IEEE Technical Committee on Digital Libraries (TCDL), 2008. 4(1).
IEEE Xplore - Home. Available from: http://ieeexplore.ieee.org/Xplore/dynhome.jsp?tag=1.
ISI Web of Knowledge. Available from: http://isiknowledge.com.
The Open Video Project. Available from: http://www.open-video.org/.
Google Images. Available from: http://images.google.com/.
Ranganathan, S.R., Elements of library classification (1st ed). 1991, Bombay, New York: South
Asia Books. 168 p.
Prietodiaz, R., Implementing Faceted Classification for Software Reuse. Communications of the
ACM, 1991. 34(5): p. 88-97.
Spiteri, L., A Simplified Model for Facet Analysis. Canadian Journal of Information and Library
Science, 1998. 23(1-2): p. 1-30.
Yee, K.P., et al., Faceted metadata for image search and browsing, in Proceedings of the
SIGCHI conference on Human factors in computing systems. 2003, ACM: Ft. Lauderdale,
Florida, USA. p. 401-408.
Ben-Yitzhak, O., et al., Beyond basic faceted search, in Proceedings of the international
conference on Web search and web data mining. 2008, ACM: Palo Alto, California, USA. p. 33-
Girgensohn, A., et al., DocuBrowse: faceted searching, browsing, and recommendations in an
enterprise context, in Proceeding of the 14th international conference on Intelligent user
interfaces. 2010, ACM: Hong Kong, China. p. 189-198.
Uddin, M.N. and Janecek, P., The implementation of faceted classification in web site searching
and browsing. Online Information Review, 2007. 31(2): p. 218-233.
Tzitzikas, Y., Evolution of faceted taxonomies and CTCA expressions. Knowledge and
Information Systems, 2007. 13(3): p. 337-365.
Jethava, V., et al., Scalable multi-dimensional user intent identification using tree structured
distributions, in Proceedings of the 34th international ACM SIGIR conference on Research and
development in Information Retrieval. 2011, ACM: Beijing, China. p. 395-404.
Fafalios, P., Kitsos, I., and Tzitzikas, Y., Scalable, flexible and generic instant overview search,
in Proceedings of the 21st international conference companion on World Wide Web. 2012,
ACM: Lyon, France. p. 333-336.
Taylor, A.G., Introduction to Cataloging and Classification (8th ed). 1992, Englewood,
Colorado: Libraries Unlimited.
Dachselt, R., Frisch, M., and Weiland, M., FacetZoom: a continuous multi-scale widget for
navigating hierarchical metadata, in Proceeding of the twenty-sixth annual SIGCHI conference
on Human factors in computing systems. 2008, ACM: Florence, Italy. p. 1353-1356.
Karlson, A.K., et al., FaThumb: a Facet-based Interface for Mobile Search, in Proceedings of the
SIGCHI conference on Human Factors in computing systems. 2006, ACM: Montréal, Québec,
Canada. p. 711-720.
Koren, J., Zhang, Y., and Liu, X., Personalized interactive faceted search, in Proceeding of the
th international conference on World Wide Web. 2008, ACM: Beijing, China. p. 477-486.
Hearst, M., et al., Finding the flow in web site search. Communications of the ACM, 2002.
(9): p. 42-49.
Tunkelang, D., faceted search, G. Marchionini, Editor. 2009, Morgan & Claypool Publishers.
Sacco, G.M., Research results in dynamic taxonomy and faceted search systems, in the 18th
International Conference on Database and Expert Systems Applications(DEXA). 2007: Torino,
Italy p. 201-206, 862.
Allard, P. and Ferre, S., Dynamic Taxonomies for the Semantic Web, in Proceedings of the 19th
International Conference on Database and Expert Systems Application. 2008, IEEE Computer
Society: Turin, Italy. p. 382-386.
nsf.gov - Advanced Funding Search - US National Science Foundation (NSF). Available from:
http://www.nsf.gov/funding/advanced_funding_search.jsp.
Hotelbook.com | Hotel Reservations | Find and Book Hotels with hotelbook.com. Available
from: http://www.hotelbook.com/en/.
Yahoo! Directory. Available from: http://dir.yahoo.com/.
Open Directory Project. Available from: http://www.dmoz.org/.
Quintarelli, E., Resmini, A., and Rosati, L., FaceTag: integrating bottom-up and top-down
classification in a social tagging system, in Proceedings of the 8th Information Architecture
Summit. 2007: Las Vegas, Nevada, United States.
Oren, E., Delbru, R., and Decker, S., Extending faceted navigation for RDF data, in Proceedings
of the 5th International Semantic Web Conference (ISWC). 2006. p. 559-572, 1001.
Dash, D., et al., Dynamic faceted search for discovery-driven analysis, in Proceeding of the 17th
ACM Conference on Information and Knowledge Management (CIKM). 2008, ACM: Napa
Valley, California, USA. p. 3-12.
Clarkson, E.C., Navathe, S.B., and Foley, J.D., Generalized formal models for faceted user
interfaces, in Proceedings of the 9th ACM/IEEE-CS joint conference on Digital libraries. 2009,
ACM: Austin, TX, USA. p. 125-134.
Tzitzikas, Y., Analyti, A., and Spyratos, N., Compound Term Composition Algebra: The
semantics. LNCS Journal on Data Semantics 2005. 2: p. 58-84.
English, J., et al., Hierarchical faceted metadata in site search interfaces, in CHI '02 extended
abstracts on Human factors in computing systems. 2002, ACM: Minneapolis, Minnesota, USA.
p. 628-639.
Kules, B., et al., What do exploratory searchers look at in a faceted search interface?, in
Proceedings of the 9th ACM/IEEE-CS joint conference on Digital libraries. 2009, ACM: Austin,
TX, USA. p. 313-322.
Kaki, M., Findex: search result categories help users when document ranking fails, in
Proceedings of the SIGCHI conference on Human factors in computing systems. 2005, ACM:
Portland, Oregon, USA. p. 131-140.
Yogev, S., et al., Towards expressive exploratory search over entity-relationship data, in
Proceedings of the 21st international conference companion on World Wide Web. 2012, ACM:
Lyon, France. p. 83-92.
Bartolini, I., A Multi-faceted Browsing Interface for Digital Photo Collections, in Proceedings of
the 2009 Seventh International Workshop on Content-Based Multimedia Indexing. 2009, IEEE
Computer Society. p. 237-242.
Fujisawa, S. and Andres, F., Multi-facet Category for Cultural Digital Resources, in Proceedings
of the 21st International Conference on Data Engineering Workshops. 2005, IEEE Computer
Society. p. 1227.
Dakka, W. and Ipeirotis, P.G., Automatic extraction of useful facet hierarchies from text
databases, in 2008 IEEE 24th International Conference on Data Engineering. 2008. p. 466-475,
Hearst, M.A., Clustering versus faceted categories for information exploration. Communications
of the ACM, 2006. 49(4): p. 59-61.
Wille, R., Restructuring lattice theory: an approach based on hierarchies of concepts, in Rival, I.
(ed.): Ordered Sets. 1982, Boston. p. 445-470.
Giunchiglia, F., Marchese, M., and Zaihrayeu, I., Encoding Classifications into Lightweight
Ontologies. Journal on Data Semantics, 2007. 8: p. 57-81.
Sacco, G.M., Dynamic taxonomies: a model for large information bases. IEEE Transactions on
Knowledge and Data Engineering, 2000. 12(3): p. 468-479.
Sacco, G.M. and Tzitzikas, Y., Dynamic taxonomies and faceted search: theory, practice, and
experience. The information retrieval series. 2009, Dordrecht, Netherlands; New York: Springer.
Tzitzikas, Y., Armenatzoglou, N., and Papadakos, P., FleXplorer: A Framework for Providing
Faceted and Dynamic Taxonomy-Based Information Exploration, in Proceedings of the 2008
th International Conference on Database and Expert Systems Application. 2008, IEEE
Computer Society. p. 392-396.
Sacco, G.M., DBWorld Xtended: Semantic Dissemination of Information through Dynamic
Taxonomies Proceedings of I-KNOW 2005.
Bonino, D., Corno, F., and Farinetti, L., FaSet: A Set Theory Model for Faceted Search, in
Proceedings of the 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence
and Intelligent Agent Technology - Volume 01. 2009, IEEE Computer Society. p. 474-481.
Priss, U., Description Logic and Faceted Knowledge Representation, in Proceedings of the 1999
International Workshop on Description Logics (DL'99). 1999: Linköping, Sweden.
Giunchiglia, F., Dutta, B., and Maltese, V., Faceted Lightweight Ontologies, in Conceptual
Modeling: Foundations and Applications, A. Borgida, et al., Editors. 2009, Springer Berlin /
Heidelberg. p. 36-51.
Priss, U., Faceted Information Representation. in Proceedings of the 8th International
Conference on Conceptual Structures, 2000: p. 84-94.
Schraefel, M.C., et al., MSPACE: Improving information access to multimedia domains with
MultiModal Exploratory Search. Communications of the ACM, 2006. 49(4): p. 47-49.
Haveliwala, T.H., Topic-sensitive PageRank: a Context-sensitive Ranking Algorithm for Web
Search. IEEE Transactions on Knowledge and Data Engineering, 2003. 15(4): p. 784-796.
Liu, T.-Y., Learning to Rank for Information Retrieval. Foundations and Trends in Information
Retrieval, 2009. 3(3): p. 225-331.
Ruthven, I. and Lalmas, M., A survey on the use of relevance feedback for information access
systems. Knowl. Eng. Rev., 2003. 18(2): p. 95-145.
Stoica, E., Hearst, M.A., and Richardson, M., Automating Creation of Hierarchical Faceted
Metadata Structures. Proceedings of the Human Language Technology Conference of the North
American Chapter of the Association for Computational Linguistics (HLT-NAACL), 2007: p.
-251.
Anick, P.G. and Tipirneni, S., The paraphrase search assistant: terminological feedback for
iterative information seeking, in Proceedings of the 22nd annual international ACM SIGIR
conference on Research and development in information retrieval. 1999, ACM: Berkeley,
California, United States. p. 153-159.
Ling, X., et al., Mining multi-faceted overviews of arbitrary topics in a text collection, in
Proceeding of the 14th ACM SIGKDD international conference on Knowledge discovery and
data mining. 2008, ACM: Las Vegas, Nevada, USA. p. 497-505.
LingPipe Home. Available from: http://www.alias-i.com/lingpipe/.
Term Extraction Web Service - YDN. Available from:
http://developer.yahoo.com/search/content/V1/termExtraction.html.
Taxonomy Warehouse. Available from: http://www.taxonomywarehouse.com/.
Roy, S.B., et al., DynaCet: Building Dynamic Faceted Search Systems over Databases, in 2009
IEEE 25th International Conference on Data Engineering(ICDE), Vols 1-3. 2009. p. 1463-1466,
Zhao, B., et al., TEXplorer: keyword-based object search and exploration in multidimensional
text databases, in Proceedings of the 20th ACM Conference on Information and Knowledge
Management (CIKM). 2011, ACM: Glasgow, Scotland, UK. p. 1709-1718.
Zeng, H.-J., et al., Learning to cluster web search results, in Proceedings of the 27th annual
international ACM SIGIR conference on Research and development in information retrieval.
, ACM: Sheffield, UK. p. 210-217.
Dou, Z., et al., Finding dimensions for queries, in Proceedings of the 20th ACM Conference on
Information and Knowledge Management (CIKM). 2011, ACM: Glasgow, Scotland, UK. p.
-1320.
Chen, J. and Li, Q., Concept Hierarchy Construction by Combining Spectral Clustering and
Subsumption Estimation, in Web Information Systems – WISE 2006, K. Aberer, et al., Editors.
, Springer Berlin / Heidelberg. p. 199-209.
Atkins, S., Rundell, M., and Sato, H., The contribution of FrameNet to practical lexicography.
International Journal of Lexicography, 2003. 16(3): p. 333-357.
Sanderson, M. and Croft, B., Deriving concept hierarchies from text. SIGIR'99: Proceedings of
nd International Conference on Research and Development in Information Retrieval, 1999: p.
-213, 339.
Dakka, W., Ipeirotis, P.G., and Wood, K.R., Automatic construction of multifaceted browsing
interfaces, in Proceedings of the 14th ACM Conference on Information and Knowledge
Management (CIKM). 2005, ACM: Bremen, Germany. p. 768-775.
Xing, D., et al., Deep classifier: automatically categorizing search results into large-scale
hierarchies, in Proceedings of the international conference on Web search and web data mining.
, ACM: Palo Alto, California, USA. p. 139-148.
Krishnapuram, R. and Kummamuru, K., Automatic taxonomy generation: Issues and
possibilities, in Proceedings of the 10th International Fuzzy Systems Association World
Congress (IFSA). 2003. p. 52-63.
Holi, M. and Hyvönen, E., Fuzzy View-Based Semantic Search, in The Semantic Web – ASWC
, R. Mizoguchi, Z. Shi, and F. Giunchiglia, Editors. 2006, Springer Berlin / Heidelberg. p.
-365.
Roy, S.B., et al., Minimum-effort driven dynamic faceted search in structured databases, in
Proceeding of the 17th ACM Conference on Information and Knowledge Management (CIKM).
, ACM: Napa Valley, California, USA. p. 13-22.
Yamamoto, T., Nakamura, S., and Tanaka, K., Extracting adjective facets from community
Q&A corpus, in Proceedings of the 20th ACM Conference on Information and Knowledge
Management (CIKM). 2011, ACM: Glasgow, Scotland, UK. p. 2021-2024.
Kleinberg, J.M., Authoritative sources in a hyperlinked environment, in Proceedings of the ninth
annual ACM-SIAM symposium on Discrete algorithms. 1998, Society for Industrial and Applied
Mathematics: San Francisco, California, United States. p. 668-677.
Uddin, M.N. and Janecek, P., Performance and usability testing of multidimensional taxonomy
in web site search and navigation. Performance Measurement and Metrics, 2007. 8(1): p. 18-33.
Smith, G., et al., FacetMap: a Scalable Search and Browse Visualization. IEEE Transactions on
Visualization and Computer Graphics, 2006. 12(5): p. 797-804.
Kekalainen, J., Binary and graded relevance in IR evaluations - Comparison of the effects on
ranking of IR systems. Information Processing & Management, 2005. 41(5): p. 1019-1033.
Gomadam, K., et al., A Faceted Classification Based Approach to Search and Rank Web APIs,
in Proceedings of the 2008 IEEE International Conference on Web Services. 2008, IEEE
Computer Society. p. 177-184.
Jarvelin, K. and Kekalainen, J., Cumulated gain-based evaluation of IR techniques. ACM
Transactions on Information Systems, 2002. 20(4): p. 422-446.
Moffat, A. and Zobel, J., Rank-biased precision for measurement of retrieval effectiveness.
ACM Transactions on Information Systems, 2008. 27(1): p. 1-27.
Buckley, C. and Voorhees, E.M., Evaluating evaluation measure stability, in Proceedings of the
rd annual international ACM SIGIR conference on Research and development in information
retrieval. 2000, ACM: Athens, Greece. p. 33-40.
Voorhees, E., The TREC-8 question answering track report, in Proceedings of the 8th Text
Retrieval Conference. 1999. p. 77-82.
Buckley, C. and Voorhees, E.M., Retrieval evaluation with incomplete information, in
Proceedings of the 27th annual international ACM SIGIR conference on Research and
development in information retrieval. 2004, ACM: Sheffield, UK. p. 25-32.
Macdonald, C., Ounis, I., and Soboroff, I., Overview of the TREC 2009 Web track, in
Proceedings of the 18th Text REtrieval Conference (TREC 2009). 2009: Gaithersburg,
Maryland, USA.
Pound, J., Paparizos, S., and Tsaparas, P., Facet discovery for structured web search: a query-log
mining approach, in Proceedings of the 2011 ACM SIGMOD International Conference on
Management of data. 2011, ACM: Athens, Greece. p. 169-180.
Xu, Y. and Mease, D., Evaluating web search using task completion time, in Proceedings of the
nd international ACM SIGIR conference on Research and development in information
retrieval. 2009, ACM: Boston, MA, USA. p. 676-677.
Roitman, H., et al., Exploratory search over social-medical data, in Proceedings of the 20th
ACM Conference on Information and Knowledge Management (CIKM). 2011, ACM: Glasgow,
Scotland, UK. p. 2513-2516.
Zhang, J. and Marchionini, G., Evaluation and evolution of a browse and search interface:
Relation Browser++, in Proceedings of the 2005 national conference on Digital government
research. 2005, Digital Government Society of North America: Atlanta, Georgia. p. 179-188.
Grineva, M., et al., Blognoon: exploring a topic in the blogosphere, in Proceedings of the 20th
international conference companion on World Wide Web. 2011, ACM: Hyderabad, India. p.
-216.
Hildebrand, M., van Ossenbruggen, J., and Hardman, L., /facet: A Browser for Heterogeneous
Semantic Web Repositories, in The Semantic Web - ISWC, I. Cruz, et al., Editors. 2006,
Springer Berlin Heidelberg. p. 272-285.
SIMILE:Longwell RDF Browser(2003-2005). Available from: http://simile.mit.edu/longwell.
Lee, T., et al., Tabulator: Exploring and analyzing linked data on the semantic web, in
Procedings of the 3rd International Semantic Web User Interaction Workshop (SWUI). 2006.
Huynh, D. and Karger, D., Parallax and Companion: Set-based Browsing for the Data Web, in
Proceedings of 18th International World Wide Web Conference. 2009.
Dumais, S., et al., Stuff I've seen: a system for personal information retrieval and re-use, in
SIGIR '03: Proceedings of the 26th annual international ACM SIGIR conference on Research
and development in informaion retrieval. 2003, ACM: Toronto, Canada. p. 72-79.
Lee, B., et al., FacetLens: Exposing Trends and Rlationships to Support Sensemaking within
Faceted Datasets, in Proceedings of the 27th international conference on Human factors in
computing systems. 2009, ACM: Boston, MA, USA. p. 1293-1302.
Teregowda, P.B., et al., SeerSuite: developing a scalable and reliable application framework for
building digital libraries by crawling the web, in Proceedings of the 2010 USENIX conference
on Web application development. 2010, USENIX Association: Boston, MA. p. 14-14.
Apache Solr. Available from: http://lucene.apache.org/solr.
David Smiley and Pugh, E., Apache Solr 3 Enterprise Search Server. 2011: Packt Publishing.
Flamenco. Available from: http://flamenco.berkeley.edu.
Nowell, L., Hetzler, E., and Tanasse, T., Change blindness in information visualization: A case
study, in Proceedings of the IEEE Symposium on Information Visualization 2001 (INFOVIS'01)
: San Diego, CA, USA. p. 15-22, 171.
Stefaner, M., Urban, T., and Seefelder, M., Elastic Lists for Facet Browsing and Resource
Analysis in the Enterprise, in Proceedings of the 19th International Conference on Database and
Expert Applications Systems. 2008: Turin, Italy. p. 397-401.
Simitsis, A., et al., Multidimensional content eXploration. Proc. VLDB Endow., 2008. 1(1): p.
-671.
Chen, H. and Karger, D.R., Less is more: probabilistic models for retrieving fewer relevant
documents, in Proceedings of the 29th annual international ACM SIGIR conference on Research
and development in information retrieval. 2006, ACM: Seattle, Washington, USA. p. 429-436.
Carterette, B. and Chandar, P., Probabilistic models of ranking novel documents for faceted
topic retrieval, in Proceeding of the 18th ACM Conference on Information and Knowledge
Management (CIKM). 2009, ACM: Hong Kong, China. p. 1287-1296.