CPS 296.3 (Spring 2009):
Information Management and Mining

To submit a review, email Jun with the subject "[cps296.3] review for paper title" (substituting the actual title of the paper) and a plain-text message body discussing the following:

  • At least three important things that the paper says;
  • At least two interesting things that you found in the paper (e.g., a non-obvious pitfall, an uncanny insight, a neat trick that could be used elsewhere);
  • At least one thing that you did not like about the paper.
There is no specific requirement on the length of your reviews. A good, insightful review can be as brief as 400 words.


Reading for 2009-02-24 (review due by 2009-02-23 noon):

  • Chu, Kim, Lin, Yu, Bradski, Ng, and Olukotun. "Map-Reduce for Machine Learning on Multicore." NIPS 2006 (PDF)
  • Das, Datar, Garg, and Rajaram. "Google News Personalization: Scalable Online Collaborative Filtering." WWW 2007 (URL)


Reading for 2009-03-03 (review due by 2009-03-02 noon):

  • Tantipathananandh, Berger-Wolf, and Kempe. "A Framework for Community Identification in Dynamic Social Networks." KDD 2007 (URL)
  • Leskovec, Backstrom, Kumar, and Tomkins. "Microscopic Evolution of Social Networks." KDD 2008 (URL)
  • Crandall, Cosley, Huttenlocher, Kleinberg, and Suri. "Feedback Effects between Similarity and Social Influence in Online Communities." KDD 2008 (URL)


Reading for 2009-03-24 (review due by 2009-03-23 noon):

  • Lucchese, Orlando, and Perego. "Parallel Mining of Frequent Closed Patterns: Harnessing Modern Computer Architectures." ICDM 2007 (URL)
  • Li, Fu, Guo, Mowry, and Faloutsos. "Cut-and-Stitch: Efficient Parallel Learning of Linear Dynamical Systems on SMPs." KDD 2008 (URL)
  • Papadimitriou and Sun. "DisCo: Distributed Co-clustering with Map-Reduce: A Case Study towards Petabyte-Scale End-to-End Mining." ICDM 2008 (URL)
  • Cormode, Muthukrishnan, and Zhuang. "Conquering the Divide: Continuous Clustering of Distributed Data Streams." ICDE 2007 (URL)


Reading for 2009-04-07 (review due by 2009-04-06 noon):

  • Strohman and Croft. "Efficient Document Retrieval in Main Memory." SIGIR 2007 (URL)
  • Bhagwat, Eshghi, and Mehra. "Content-Based Document Routing and Index Partitioning for Scalable Similarity-Based Searches in a Large Corpus." KDD 2007 (URL)
  • Forman and Rajaram. "Scaling up Text Classification for Large File Systems." KDD 2008 (URL)
  • Theobald, Siddharth, and Paepcke. "SpotSigs: Robust and Efficient Near Duplicate Detection in Large Web Collections." SIGIR 2008 (URL)


Reading for 2009-04-14 (review due by 2009-04-13 noon):

  • Bar-Yossef and Gurevich. "Mining Search Engine Query Logs via Suggestion Sampling." VLDB 2008 (URL)
  • Xu, Huang, Fox, Patterson, and Jordan. "Mining Console Logs for Large-Scale System Problem Detection." SysML 2008 (URL)
  • Xiang, Jin, Fuhry, and Dragan. "Succinct Summarization of Transactional Databases: An Overlapped Hyperrectangle Scheme." KDD 2008 (URL)
  • Parikh and Sundaresan. "Scalable and Near Real-Time Burst Detection from eCommerce Queries." KDD 2008 (URL)


Reading for 2009-04-21 (review due by 2009-04-20 noon):

  • Park and Pennock. "Applying Collaborative Filtering Techniques to Movie Search for Better Ranking and Browsing." KDD 2007 (URL)
  • Chen, Zhang, and Chang. "Combinational Collaborative Filtering for Personalized Community Recommendation." KDD 2008 (URL)
  • Bell, Koren, and Volinsky. "Modeling Relationships at Multiple Scales to Improve Accuracy of Large Recommender Systems." KDD 2008 (URL)
  • Mehta and Nejdl. "Attack Resistant Collaborative Filtering." SIGIR 2008 (URL)

Last updated Thu Apr 16 14:40:27 EDT 2009