<?xml version="1.0" encoding="UTF-8"?>
<XML><RECORDS>
<RECORD>
	<REFERENCE_TYPE>3</REFERENCE_TYPE>
	<AUTHORS>
		<AUTHOR>Ekstrand, M. D.</AUTHOR>
		<AUTHOR>Ludwig, M.</AUTHOR>
		<AUTHOR>Konstan, J. A.</AUTHOR>
		<AUTHOR>Riedl, J. T.</AUTHOR>
	</AUTHORS>
	<YEAR>2011</YEAR>
	<TITLE>Rethinking the Recommender Research Ecosystem: Reproducibility, Openness, and LensKit</TITLE>
	<SECONDARY_TITLE>The Fifth ACM Conference on Recommender Systems</SECONDARY_TITLE>
	<PLACE_PUBLISHED>Chicago, IL</PLACE_PUBLISHED>
	<PUBLISHER>Association of Computing Machinery</PUBLISHER>
	<PAGES>133-140</PAGES>
	<TERTIARY_TITLE>Proceedings of the Fifth ACM Conference on Recommender Systems</TERTIARY_TITLE>
	<DATE>10/23/2011</DATE>
	<KEYWORDS>
		<KEYWORD>Recommender</KEYWORD>
		<KEYWORD>systems,</KEYWORD>
		<KEYWORD>implementation,</KEYWORD>
		<KEYWORD>evaluation</KEYWORD>
	</KEYWORDS>
	<ABSTRACT>Recommender systems research is being slowed by the di- culty of replicating and comparing research results. Published research uses various experimental methodologies and metrics that are dicult to compare. It also often fails to suciently document the details of proposed algorithms or the evaluations employed. Researchers waste time reimplementing well-known algorithms, and the new implementations may miss key details from the original algorithm or its subsequent re nements. When proposing new algorithms, researchers should compare them against nely-tuned implementations of the leading prior algorithms using state-of-the-art evaluation methodologies. With few exceptions, published algorithmic improvements in our eld should be accompanied by working code in a standard framework, including test harnesses to reproduce the described results. To that end, we present the design and freely distributable source code of LensKit, a exible platform for reproducible recommender systems research. LensKit provides carefully tuned implementations of the leading collaborative ltering algorithms, APIs for common recommender system use cases, and an evaluation framework for performing reproducible oine evaluations of algorithms. We demonstrate the utility of LensKit by replicating and extending a set of prior comparative studies of recommender algorithms | showing limitations in some of the original results |and by investigating a question recently raised by a leader in the recommender systems community on problems with error-based prediction evaluation.</ABSTRACT>
	<URL>http://doi.acm.org/10.1145/2043932.2043958</URL>
</RECORD>
</RECORDS></XML>
