CPS 399.28 (Spring 2008):
Research Seminar and Project in Databases

Course Information   Lecture Notes   Readings   Tentative Schedule   Resources

To submit a review, email Jun with subject "[cps399.28] review for paper title" and plain-text message body discussing the following:

  • At least three important things that the paper says;
  • At least two interesting things that you found in the paper (e.g., a non-obvious pitfall, an uncanny insight, a neat trick that could be used elsewhere);
  • At least one thing that you did not like about the paper.
There is no specific requirement on the length of your reviews. A good, insightful review can be as brief as 400 words.


Reading for 2008-01-22 (review due by midnight before the lecture):

  • Jeffrey Dean, Sanjay Ghemawat: "MapReduce: Simplified Data Processing on Large Clusters." OSDI 2004: 137-150 (PDF)
  • Fay Chang, Jeffrey Dean, Sanjay Ghemawat, Wilson C. Hsieh, Deborah A. Wallach, Michael Burrows, Tushar Chandra, Andrew Fikes, Robert Gruber: "Bigtable: A Distributed Storage System for Structured Data." OSDI 2006: 205-218 (PDF)


Reading for 2008-01-29 (review due by midnight before the lecture):

  • Christopher M. Jermaine, Subramanian Arumugam, Abhijit Pol, Alin Dobra: "Scalable Approximate Query Processing with the DBO Engine." SIGMOD 2007: 725-736 (PDF)
  • Lyublena Antova, Christoph Koch, Dan Olteanu: "From Complete to Incomplete Information and Back." SIGMOD 2007: 713-724 (PDF)


Reading for 2008-02-05 (review due by midnight before the lecture):

  • Alan J. Demers, Johannes Gehrke, Mingsheng Hong, Mirek Riedewald, Walker M. White: "Towards Expressive Publish/Subscribe Systems." EDBT 2006 (PDF)
  • Zhen Liu, Srinivasan Parthasarthy, Anand Ranganathan, Hao Yang: "Scalable Event Matching for Overlapping Subscriptions in Pub/Sub Systems." DEBS 2007 (PDF)
Optional reading for background/depth (no review due):
  • Alan J. Demers, Johannes Gehrke, Biswanath Panda, Mirek Riedewald, Varun Sharma, Walker M. White: "Cayuga: A General Purpose Event Monitoring System." CIDR 2007 (PDF)
  • Alan Demers, Johannes Gehrke, Mingsheng Hong, Mirek Riedewald, Walker White: "A General Algebra and Implementation for Monitoring Event Streams." Technical Report, Cornell University, 2005 (PDF)
  • Eugene Wu, Yanlei Diao, Shariq Rizvi: "High-Performance Complex Event Processing Over Streams." SIGMOD 2006 (PDF)


Reading for 2008-02-12 (review due by midnight before the lecture):

  • Jeremy Schiff, Dominic Antonelli, Alexandros G. Dimakis, David Chu, Martin J. Wainwright: "Robust Message Passing for Statistical Inference in Sensor Networks." IPSN 2007 (PDF)
  • Y. Ahmad, O. Papaemmanouil, U. Cetintemel, J. Rogers: "Simultaneous Equation Systems for Query Processing on Continuous-Time Data Streams." ICDE 2008 (PDF)
Optional reading for background/depth (no review due):
  • Lewis Girod, Yuan Mei, Ryan Newton, Stanislav Rost, Arvind Thiagarajan, Hari Balakrishnan, Samuel Madden: "The Case for a Signal-Oriented Data Stream Management System." CIDR 2007 (PDF)


Reading for 2008-02-19 (review due by midnight before the lecture):

  • Badrish Chandramouli, Junyi Xie, Jun Yang: "On the Database/Network Interface in Large-Scale Publish/Subscribe Systems." SIGMOD 2006 (PDF)
  • Tova Milo, Tal Zur, Elad Verbin: "Boosting Topic-Based Publish-Subscribe Systems With Dynamic Clustering." SIGMOD 2007 (PDF)
Optional reading for background/depth (no review due):
  • Badrish Chandramouli, Jeff M. Phillips, and Jun Yang: "Value-Based Notification Conditions in Large-Scale Publish/Subscribe Systems." VLDB 2007 (PDF)
  • Olga Papaemmanouil, Yanif Ahmad, Ugur Cetintemel, John Jannotti, Yenel Yildirim: "Extensible Optimization in an Overlay Data Dissemination Trees." SIGMOD 2006 (PDF)


Reading for 2008-03-04 (review due by midnight before the lecture):

  • Pankaj K. Agarwal, Junyi Xie, Jun Yang, Hai Yu: "Scalable Continuous Query Processing by Tracking Hotspots." VLDB 2006 (PDF)
  • Mingsheng Hong, Alan J. Demers, Johannes Gehrke, Christoph Koch, Mirek Riedewald, Walker M. White: "Massively Multi-Query Join Processing in Publish/Subscribe Systems." SIGMOD 2007 (PDF)
Optional reading for background/depth (no review due):
  • Hyo-Sang Lim, Jae-Gil Lee, Min-Jae Lee, Kyu-Young Whang, Il-Yeol Song: "Continuous Query Processing in Data Streams Using Duality of Data and Queries." SIGMOD 2006 (PDF)
  • S. Chandrasekaran, M. J. Franklin: "PSoup: A System for Streaming Queries Over Streaming Data." VLDB Journal 2003 (PDF)


Reading for 2008-03-18 (review due by midnight before the lecture):

  • Douglas Burdick, AnHai Doan, Raghu Ramakrishnan, Shivakumar Vaithyanathan: "OLAP over Imprecise Data with Domain Constraints." VLDB 2007 (PDF)
  • Sarvjeet Singh, Chris Mayfield, Sunil Prabhakar, Rahul Shah, Susanne E. Hambrusch: "Indexing Uncertain Categorical Data." ICDE 2007 (PDF)
Optional reading for background/depth (no review due):
  • Christopher Re, Dan Suciu: "Materialized Views in Probabilistic Databases for Information Exchange and Query Optimization." VLDB 2007 (PDF)
  • Nilesh N. Dalvi, Dan Suciu: "Efficient Query Evaluation on Probabilistic Databases." VLDB Journal 2007 (PDF)


Reading for 2008-03-25 (review due by midnight before the lecture):

  • L. Selavo, A. Wood, Q. Cao, T. Sookoor, H. Liu, A. Srinivasan, Y. Wu, W. Kang, J. Stankovic, D. Young, J. Porter: "LUSTER: Wireless Sensor Network for Environmental Research." SenSys 2007 (PDF)
  • Xiaoyan Yang, Hock-Beng Lim, M. Tamer Ozsu, Kian-Lee Tan: "In-Network Execution of Monitoring Queries in Sensor Networks." SIGMOD 2007 (PDF)
Optional reading for background/depth (no review due):
  • Lidan Wang, Amol Deshpande: "Predictive Modeling-Based Data Collection in Wireless Sensor Networks." EWSN 2008 (PDF)
  • David Chu, Feng Zhao, Jie Liu, Michel Goraczko: "{Que}: A Sensor Network Rapid Prototyping Tool With Application Experiences from a Data Center Deployment." EWSN 2008 (PDF)


Reading for 2008-04-01 (review due by midnight before the lecture):

  • Daniel J. Abadi, Adam Marcus, Samuel Madden, Katherine J. Hollenbach: "Scalable Semantic Web Data Management Using Vertical Partitioning." VLDB 2007 (PDF)
  • Alexander Markowetz, Yin Yang, Dimitris Papadias: "Keyword Search on Relational Data Streams." SIGMOD 2007 (PDF)
Optional reading for background/depth (no review due):
  • Daniel J. Abadi, Samuel Madden, Miguel Ferreira: "Integrating Compression and Execution in Column-Oriented Database Systems." SIGMOD 2006 (PDF)
  • Gang Luo, Chunqiang Tang, Philip S. Yu: "Resource-Adaptive Real-Time New Event Detection." SIGMOD 2007 (PDF)


Reading for 2008-04-15 (review due by midnight before the lecture):

  • Hung-Chih Yang, Ali Dasdan, Ruey-Lung Hsiao, Douglas Stott Parker Jr.: "Map-Reduce-Merge: Simplified Relational Data Processing on Large Clusters." SIGMOD 2007 (PDF)
  • Eric Chu, Jennifer L. Beckmann, Jeffrey F. Naughton: "The Case for a Wide-Table Approach to Manage Sparse Relational Data Sets." SIGMOD 2007 (PDF)
Optional reading for background/depth (no review due):
  • Eric Chu, Akanksha Baid, Ting Chen, AnHai Doan, Jeffrey F. Naughton: "A Relational Approach to Incrementally Extracting and Querying Structure in Unstructured Data." VLDB 2007 (PDF)

Last updated Sat Mar 29 23:41:47 EDT 2008