A SURVEY OF FACETED SEARCH

Authors

  • BIFAN WEI SPKLSTN Lab, Department of Computer Science Xi’an Jiaotong University
  • JUN LIU SPKLSTN Lab, Department of Computer Science Xi’an Jiaotong University
  • QINGHUA ZHENG SPKLSTN Lab, Department of Computer Science Xi’an Jiaotong University
  • WEI ZHANG Amazon.com, Inc
  • XIAOYU FU Department of Computer Science Xi’an Jiaotong University
  • BOQIN FENG Department of Computer Science Xi’an Jiaotong University

Keywords:

Facet, Faceted Search, Faceted Taxonomy, Metrics

Abstract

Faceted Search is an exploratory search mechanism, which provides an iterative way to refine search results by a faceted taxonomy. With the benefit of search results diversification, no need for a priori knowledge, and never leading to zero result, it can significantly reduce information overload. Faceted Search has witnessed a booming interest in the last ten years. In this paper, we first analyze the representative facet search models. Next, we present a general faceted search framework, and survey the related methods and techniques, including facet term extraction, hierarchy construction, compound term generation and facet ranking. Then we discuss the metrics for faceted search evaluation, and also highlight the main characteristics of a number of existing faceted search systems. Some directions for future research are finally presented.

 

Downloads

Download data is not yet available.

References

Furnas, G.W., et al., The Vocabulary Problem in Human System Communication.

Communications of the ACM, 1987. 30(11): p. 964-971.

Dumais, S., Cutrell, E., and Chen, H., Optimizing search by showing results in context, in

Proceedings of the SIGCHI conference on Human factors in computing systems. 2001, ACM:

Seattle, Washington, United States. p. 277-284.

Zwol, R.v., et al., Faceted exploration of image search results, in Proceedings of the 19th

international conference on World Wide Web. 2010, ACM: Raleigh, North Carolina, USA. p.

-970.

Tunkelang, D., Dynamic Category Sets: An Approach for Faceted Search, in Proceedings of the

ACM SIGIR'06 Workshop on Faceted Search. 2006: Seattle, WA, USA.

Bergamaschi, S., Guerra, F., and Leiba, B., Information Overload Internet Computing, 2010.

(6): p. 10-13.

Kashyap, A., Hristidis, V., and Petropoulos, M., FACeTOR: Cost-Driven Exploration of Faceted

Query Results, in ACM Conference on Information and Knowledge Management (CIKM).

: Toronto, Ontario, Canada.

Marshall, P., Herman, S., and Rajan, S., In search of more meaningful search. Serials Review,

32(3): p. 172-180.

Zhang, L. and Zhang, Y., Interactive retrieval based on faceted feedback, in Proceeding of the

rd international ACM SIGIR conference on Research and development in information

retrieval. 2010, ACM: Geneva, Switzerland. p. 363-370.

Li, C., et al., Facetedpedia: dynamic generation of query-dependent faceted interfaces for

wikipedia, in Proceedings of the 19th international conference on World Wide Web. 2010,

ACM: Raleigh, North Carolina, USA. p. 651-660.

Hearst, M., UIs for Faceted Navigation: Recent Advances and Remaining Open Problems, in

HCIR08 Second Workshop on Human-Computer Interaction and Information Retrieval. 2008:

Redmond, WA.

eBay.Available from: http://www.ebay.com/.

Amazon.com. Available from: http://www.amazon.com.

Diederich, J. and Balke, W., FecetedDBLP - Navigational Access for Digital Libraries. Bulletin

of the IEEE Technical Committee on Digital Libraries (TCDL), 2008. 4(1).

IEEE Xplore - Home. Available from: http://ieeexplore.ieee.org/Xplore/dynhome.jsp?tag=1.

ISI Web of Knowledge. Available from: http://isiknowledge.com.

The Open Video Project. Available from: http://www.open-video.org/.

Google Images. Available from: http://images.google.com/.

Ranganathan, S.R., Elements of library classification (1st ed). 1991, Bombay, New York: South

Asia Books. 168 p.

Prietodiaz, R., Implementing Faceted Classification for Software Reuse. Communications of the

ACM, 1991. 34(5): p. 88-97.

Spiteri, L., A Simplified Model for Facet Analysis. Canadian Journal of Information and Library

Science, 1998. 23(1-2): p. 1-30.

Yee, K.P., et al., Faceted metadata for image search and browsing, in Proceedings of the

SIGCHI conference on Human factors in computing systems. 2003, ACM: Ft. Lauderdale,

Florida, USA. p. 401-408.

Ben-Yitzhak, O., et al., Beyond basic faceted search, in Proceedings of the international

conference on Web search and web data mining. 2008, ACM: Palo Alto, California, USA. p. 33-

Girgensohn, A., et al., DocuBrowse: faceted searching, browsing, and recommendations in an

enterprise context, in Proceeding of the 14th international conference on Intelligent user

interfaces. 2010, ACM: Hong Kong, China. p. 189-198.

Uddin, M.N. and Janecek, P., The implementation of faceted classification in web site searching

and browsing. Online Information Review, 2007. 31(2): p. 218-233.

Tzitzikas, Y., Evolution of faceted taxonomies and CTCA expressions. Knowledge and

Information Systems, 2007. 13(3): p. 337-365.

Jethava, V., et al., Scalable multi-dimensional user intent identification using tree structured

distributions, in Proceedings of the 34th international ACM SIGIR conference on Research and

development in Information Retrieval. 2011, ACM: Beijing, China. p. 395-404.

Fafalios, P., Kitsos, I., and Tzitzikas, Y., Scalable, flexible and generic instant overview search,

in Proceedings of the 21st international conference companion on World Wide Web. 2012,

ACM: Lyon, France. p. 333-336.

Taylor, A.G., Introduction to Cataloging and Classification (8th ed). 1992, Englewood,

Colorado: Libraries Unlimited.

Dachselt, R., Frisch, M., and Weiland, M., FacetZoom: a continuous multi-scale widget for

navigating hierarchical metadata, in Proceeding of the twenty-sixth annual SIGCHI conference

on Human factors in computing systems. 2008, ACM: Florence, Italy. p. 1353-1356.

Karlson, A.K., et al., FaThumb: a Facet-based Interface for Mobile Search, in Proceedings of the

SIGCHI conference on Human Factors in computing systems. 2006, ACM: Montréal, Québec,

Canada. p. 711-720.

Koren, J., Zhang, Y., and Liu, X., Personalized interactive faceted search, in Proceeding of the

th international conference on World Wide Web. 2008, ACM: Beijing, China. p. 477-486.

Hearst, M., et al., Finding the flow in web site search. Communications of the ACM, 2002.

(9): p. 42-49.

Tunkelang, D., faceted search, G. Marchionini, Editor. 2009, Morgan & Claypool Publishers.

Sacco, G.M., Research results in dynamic taxonomy and faceted search systems, in the 18th

International Conference on Database and Expert Systems Applications(DEXA). 2007: Torino,

Italy p. 201-206, 862.

Allard, P. and Ferre, S., Dynamic Taxonomies for the Semantic Web, in Proceedings of the 19th

International Conference on Database and Expert Systems Application. 2008, IEEE Computer

Society: Turin, Italy. p. 382-386.

nsf.gov - Advanced Funding Search - US National Science Foundation (NSF). Available from:

http://www.nsf.gov/funding/advanced_funding_search.jsp.

Hotelbook.com | Hotel Reservations | Find and Book Hotels with hotelbook.com. Available

from: http://www.hotelbook.com/en/.

Yahoo! Directory. Available from: http://dir.yahoo.com/.

Open Directory Project. Available from: http://www.dmoz.org/.

Quintarelli, E., Resmini, A., and Rosati, L., FaceTag: integrating bottom-up and top-down

classification in a social tagging system, in Proceedings of the 8th Information Architecture

Summit. 2007: Las Vegas, Nevada, United States.

Oren, E., Delbru, R., and Decker, S., Extending faceted navigation for RDF data, in Proceedings

of the 5th International Semantic Web Conference (ISWC). 2006. p. 559-572, 1001.

Dash, D., et al., Dynamic faceted search for discovery-driven analysis, in Proceeding of the 17th

ACM Conference on Information and Knowledge Management (CIKM). 2008, ACM: Napa

Valley, California, USA. p. 3-12.

Clarkson, E.C., Navathe, S.B., and Foley, J.D., Generalized formal models for faceted user

interfaces, in Proceedings of the 9th ACM/IEEE-CS joint conference on Digital libraries. 2009,

ACM: Austin, TX, USA. p. 125-134.

Tzitzikas, Y., Analyti, A., and Spyratos, N., Compound Term Composition Algebra: The

semantics. LNCS Journal on Data Semantics 2005. 2: p. 58-84.

English, J., et al., Hierarchical faceted metadata in site search interfaces, in CHI '02 extended

abstracts on Human factors in computing systems. 2002, ACM: Minneapolis, Minnesota, USA.

p. 628-639.

Kules, B., et al., What do exploratory searchers look at in a faceted search interface?, in

Proceedings of the 9th ACM/IEEE-CS joint conference on Digital libraries. 2009, ACM: Austin,

TX, USA. p. 313-322.

Kaki, M., Findex: search result categories help users when document ranking fails, in

Proceedings of the SIGCHI conference on Human factors in computing systems. 2005, ACM:

Portland, Oregon, USA. p. 131-140.

Yogev, S., et al., Towards expressive exploratory search over entity-relationship data, in

Proceedings of the 21st international conference companion on World Wide Web. 2012, ACM:

Lyon, France. p. 83-92.

Bartolini, I., A Multi-faceted Browsing Interface for Digital Photo Collections, in Proceedings of

the 2009 Seventh International Workshop on Content-Based Multimedia Indexing. 2009, IEEE

Computer Society. p. 237-242.

Fujisawa, S. and Andres, F., Multi-facet Category for Cultural Digital Resources, in Proceedings

of the 21st International Conference on Data Engineering Workshops. 2005, IEEE Computer

Society. p. 1227.

Dakka, W. and Ipeirotis, P.G., Automatic extraction of useful facet hierarchies from text

databases, in 2008 IEEE 24th International Conference on Data Engineering. 2008. p. 466-475,

Hearst, M.A., Clustering versus faceted categories for information exploration. Communications

of the ACM, 2006. 49(4): p. 59-61.

Wille, R., Restructuring lattice theory: an approach based on hierarchies of concepts, in Rival, I.

(ed.): Ordered Sets. 1982, Boston. p. 445-470.

Giunchiglia, F., Marchese, M., and Zaihrayeu, I., Encoding Classifications into Lightweight

Ontologies. Journal on Data Semantics, 2007. 8: p. 57-81.

Sacco, G.M., Dynamic taxonomies: a model for large information bases. IEEE Transactions on

Knowledge and Data Engineering, 2000. 12(3): p. 468-479.

Sacco, G.M. and Tzitzikas, Y., Dynamic taxonomies and faceted search: theory, practice, and

experience. The information retrieval series. 2009, Dordrecht, Netherlands; New York: Springer.

Tzitzikas, Y., Armenatzoglou, N., and Papadakos, P., FleXplorer: A Framework for Providing

Faceted and Dynamic Taxonomy-Based Information Exploration, in Proceedings of the 2008

th International Conference on Database and Expert Systems Application. 2008, IEEE

Computer Society. p. 392-396.

Sacco, G.M., DBWorld Xtended: Semantic Dissemination of Information through Dynamic

Taxonomies Proceedings of I-KNOW 2005.

Bonino, D., Corno, F., and Farinetti, L., FaSet: A Set Theory Model for Faceted Search, in

Proceedings of the 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence

and Intelligent Agent Technology - Volume 01. 2009, IEEE Computer Society. p. 474-481.

Priss, U., Description Logic and Faceted Knowledge Representation, in Proceedings of the 1999

International Workshop on Description Logics (DL'99). 1999: Linköping, Sweden.

Giunchiglia, F., Dutta, B., and Maltese, V., Faceted Lightweight Ontologies, in Conceptual

Modeling: Foundations and Applications, A. Borgida, et al., Editors. 2009, Springer Berlin /

Heidelberg. p. 36-51.

Priss, U., Faceted Information Representation. in Proceedings of the 8th International

Conference on Conceptual Structures, 2000: p. 84-94.

Schraefel, M.C., et al., MSPACE: Improving information access to multimedia domains with

MultiModal Exploratory Search. Communications of the ACM, 2006. 49(4): p. 47-49.

Haveliwala, T.H., Topic-sensitive PageRank: a Context-sensitive Ranking Algorithm for Web

Search. IEEE Transactions on Knowledge and Data Engineering, 2003. 15(4): p. 784-796.

Liu, T.-Y., Learning to Rank for Information Retrieval. Foundations and Trends in Information

Retrieval, 2009. 3(3): p. 225-331.

Ruthven, I. and Lalmas, M., A survey on the use of relevance feedback for information access

systems. Knowl. Eng. Rev., 2003. 18(2): p. 95-145.

Stoica, E., Hearst, M.A., and Richardson, M., Automating Creation of Hierarchical Faceted

Metadata Structures. Proceedings of the Human Language Technology Conference of the North

American Chapter of the Association for Computational Linguistics (HLT-NAACL), 2007: p.

-251.

Anick, P.G. and Tipirneni, S., The paraphrase search assistant: terminological feedback for

iterative information seeking, in Proceedings of the 22nd annual international ACM SIGIR

conference on Research and development in information retrieval. 1999, ACM: Berkeley,

California, United States. p. 153-159.

Ling, X., et al., Mining multi-faceted overviews of arbitrary topics in a text collection, in

Proceeding of the 14th ACM SIGKDD international conference on Knowledge discovery and

data mining. 2008, ACM: Las Vegas, Nevada, USA. p. 497-505.

LingPipe Home. Available from: http://www.alias-i.com/lingpipe/.

Term Extraction Web Service - YDN. Available from:

http://developer.yahoo.com/search/content/V1/termExtraction.html.

Taxonomy Warehouse. Available from: http://www.taxonomywarehouse.com/.

Roy, S.B., et al., DynaCet: Building Dynamic Faceted Search Systems over Databases, in 2009

IEEE 25th International Conference on Data Engineering(ICDE), Vols 1-3. 2009. p. 1463-1466,

Zhao, B., et al., TEXplorer: keyword-based object search and exploration in multidimensional

text databases, in Proceedings of the 20th ACM Conference on Information and Knowledge

Management (CIKM). 2011, ACM: Glasgow, Scotland, UK. p. 1709-1718.

Zeng, H.-J., et al., Learning to cluster web search results, in Proceedings of the 27th annual

international ACM SIGIR conference on Research and development in information retrieval.

, ACM: Sheffield, UK. p. 210-217.

Dou, Z., et al., Finding dimensions for queries, in Proceedings of the 20th ACM Conference on

Information and Knowledge Management (CIKM). 2011, ACM: Glasgow, Scotland, UK. p.

-1320.

Chen, J. and Li, Q., Concept Hierarchy Construction by Combining Spectral Clustering and

Subsumption Estimation, in Web Information Systems – WISE 2006, K. Aberer, et al., Editors.

, Springer Berlin / Heidelberg. p. 199-209.

Atkins, S., Rundell, M., and Sato, H., The contribution of FrameNet to practical lexicography.

International Journal of Lexicography, 2003. 16(3): p. 333-357.

Sanderson, M. and Croft, B., Deriving concept hierarchies from text. SIGIR'99: Proceedings of

nd International Conference on Research and Development in Information Retrieval, 1999: p.

-213, 339.

Dakka, W., Ipeirotis, P.G., and Wood, K.R., Automatic construction of multifaceted browsing

interfaces, in Proceedings of the 14th ACM Conference on Information and Knowledge

Management (CIKM). 2005, ACM: Bremen, Germany. p. 768-775.

Xing, D., et al., Deep classifier: automatically categorizing search results into large-scale

hierarchies, in Proceedings of the international conference on Web search and web data mining.

, ACM: Palo Alto, California, USA. p. 139-148.

Krishnapuram, R. and Kummamuru, K., Automatic taxonomy generation: Issues and

possibilities, in Proceedings of the 10th International Fuzzy Systems Association World

Congress (IFSA). 2003. p. 52-63.

Holi, M. and Hyvönen, E., Fuzzy View-Based Semantic Search, in The Semantic Web – ASWC

, R. Mizoguchi, Z. Shi, and F. Giunchiglia, Editors. 2006, Springer Berlin / Heidelberg. p.

-365.

Roy, S.B., et al., Minimum-effort driven dynamic faceted search in structured databases, in

Proceeding of the 17th ACM Conference on Information and Knowledge Management (CIKM).

, ACM: Napa Valley, California, USA. p. 13-22.

Yamamoto, T., Nakamura, S., and Tanaka, K., Extracting adjective facets from community

Q&A corpus, in Proceedings of the 20th ACM Conference on Information and Knowledge

Management (CIKM). 2011, ACM: Glasgow, Scotland, UK. p. 2021-2024.

Kleinberg, J.M., Authoritative sources in a hyperlinked environment, in Proceedings of the ninth

annual ACM-SIAM symposium on Discrete algorithms. 1998, Society for Industrial and Applied

Mathematics: San Francisco, California, United States. p. 668-677.

Uddin, M.N. and Janecek, P., Performance and usability testing of multidimensional taxonomy

in web site search and navigation. Performance Measurement and Metrics, 2007. 8(1): p. 18-33.

Smith, G., et al., FacetMap: a Scalable Search and Browse Visualization. IEEE Transactions on

Visualization and Computer Graphics, 2006. 12(5): p. 797-804.

Kekalainen, J., Binary and graded relevance in IR evaluations - Comparison of the effects on

ranking of IR systems. Information Processing & Management, 2005. 41(5): p. 1019-1033.

Gomadam, K., et al., A Faceted Classification Based Approach to Search and Rank Web APIs,

in Proceedings of the 2008 IEEE International Conference on Web Services. 2008, IEEE

Computer Society. p. 177-184.

Jarvelin, K. and Kekalainen, J., Cumulated gain-based evaluation of IR techniques. ACM

Transactions on Information Systems, 2002. 20(4): p. 422-446.

Moffat, A. and Zobel, J., Rank-biased precision for measurement of retrieval effectiveness.

ACM Transactions on Information Systems, 2008. 27(1): p. 1-27.

Buckley, C. and Voorhees, E.M., Evaluating evaluation measure stability, in Proceedings of the

rd annual international ACM SIGIR conference on Research and development in information

retrieval. 2000, ACM: Athens, Greece. p. 33-40.

Voorhees, E., The TREC-8 question answering track report, in Proceedings of the 8th Text

Retrieval Conference. 1999. p. 77-82.

Buckley, C. and Voorhees, E.M., Retrieval evaluation with incomplete information, in

Proceedings of the 27th annual international ACM SIGIR conference on Research and

development in information retrieval. 2004, ACM: Sheffield, UK. p. 25-32.

Macdonald, C., Ounis, I., and Soboroff, I., Overview of the TREC 2009 Web track, in

Proceedings of the 18th Text REtrieval Conference (TREC 2009). 2009: Gaithersburg,

Maryland, USA.

Pound, J., Paparizos, S., and Tsaparas, P., Facet discovery for structured web search: a query-log

mining approach, in Proceedings of the 2011 ACM SIGMOD International Conference on

Management of data. 2011, ACM: Athens, Greece. p. 169-180.

Xu, Y. and Mease, D., Evaluating web search using task completion time, in Proceedings of the

nd international ACM SIGIR conference on Research and development in information

retrieval. 2009, ACM: Boston, MA, USA. p. 676-677.

Roitman, H., et al., Exploratory search over social-medical data, in Proceedings of the 20th

ACM Conference on Information and Knowledge Management (CIKM). 2011, ACM: Glasgow,

Scotland, UK. p. 2513-2516.

Zhang, J. and Marchionini, G., Evaluation and evolution of a browse and search interface:

Relation Browser++, in Proceedings of the 2005 national conference on Digital government

research. 2005, Digital Government Society of North America: Atlanta, Georgia. p. 179-188.

Grineva, M., et al., Blognoon: exploring a topic in the blogosphere, in Proceedings of the 20th

international conference companion on World Wide Web. 2011, ACM: Hyderabad, India. p.

-216.

Hildebrand, M., van Ossenbruggen, J., and Hardman, L., /facet: A Browser for Heterogeneous

Semantic Web Repositories, in The Semantic Web - ISWC, I. Cruz, et al., Editors. 2006,

Springer Berlin Heidelberg. p. 272-285.

SIMILE:Longwell RDF Browser(2003-2005). Available from: http://simile.mit.edu/longwell.

Lee, T., et al., Tabulator: Exploring and analyzing linked data on the semantic web, in

Procedings of the 3rd International Semantic Web User Interaction Workshop (SWUI). 2006.

Huynh, D. and Karger, D., Parallax and Companion: Set-based Browsing for the Data Web, in

Proceedings of 18th International World Wide Web Conference. 2009.

Dumais, S., et al., Stuff I've seen: a system for personal information retrieval and re-use, in

SIGIR '03: Proceedings of the 26th annual international ACM SIGIR conference on Research

and development in informaion retrieval. 2003, ACM: Toronto, Canada. p. 72-79.

Lee, B., et al., FacetLens: Exposing Trends and Rlationships to Support Sensemaking within

Faceted Datasets, in Proceedings of the 27th international conference on Human factors in

computing systems. 2009, ACM: Boston, MA, USA. p. 1293-1302.

Teregowda, P.B., et al., SeerSuite: developing a scalable and reliable application framework for

building digital libraries by crawling the web, in Proceedings of the 2010 USENIX conference

on Web application development. 2010, USENIX Association: Boston, MA. p. 14-14.

Apache Solr. Available from: http://lucene.apache.org/solr.

David Smiley and Pugh, E., Apache Solr 3 Enterprise Search Server. 2011: Packt Publishing.

Flamenco. Available from: http://flamenco.berkeley.edu.

Nowell, L., Hetzler, E., and Tanasse, T., Change blindness in information visualization: A case

study, in Proceedings of the IEEE Symposium on Information Visualization 2001 (INFOVIS'01)

: San Diego, CA, USA. p. 15-22, 171.

Stefaner, M., Urban, T., and Seefelder, M., Elastic Lists for Facet Browsing and Resource

Analysis in the Enterprise, in Proceedings of the 19th International Conference on Database and

Expert Applications Systems. 2008: Turin, Italy. p. 397-401.

Simitsis, A., et al., Multidimensional content eXploration. Proc. VLDB Endow., 2008. 1(1): p.

-671.

Chen, H. and Karger, D.R., Less is more: probabilistic models for retrieving fewer relevant

documents, in Proceedings of the 29th annual international ACM SIGIR conference on Research

and development in information retrieval. 2006, ACM: Seattle, Washington, USA. p. 429-436.

Carterette, B. and Chandar, P., Probabilistic models of ranking novel documents for faceted

topic retrieval, in Proceeding of the 18th ACM Conference on Information and Knowledge

Management (CIKM). 2009, ACM: Hong Kong, China. p. 1287-1296.

Downloads

Published

2013-11-20

How to Cite

WEI, B. ., LIU, J. ., ZHENG, Q. ., ZHANG, W. ., FU, X., & FENG, B. . (2013). A SURVEY OF FACETED SEARCH. Journal of Web Engineering, 12(1-2), 041–064. Retrieved from https://journals.riverpublishers.com/index.php/JWE/article/view/4177

Issue

Section

Articles