Chen Li's Publications

 

Refereed Journal Articles

  1. SEPIA: Estimating Selectivities of Approximate String Predicates in Large Databases. Liang Jin, Chen Li, and Rares Vernica.  To appear in VLDB Journal.
  2. Using Views to Generate Efficient Evaluation Plans for Queries Foto Afrati, Chen  Li, and Jeff Ullman, Journal of Computer and System Sciences, Volume 73, Issue 5 (August 2007) Pages: 703-724 [PDF]
  3. Rewriting Queries Using Views in the Presence of Arithmetic Comparisons, Foto Afrati, Chen  Li, and Prasenjit Mitra, Theoretical Computer Science, Volume 368, Numbers 1-2, pages 88-123, 2006. [PDF]
  4. Supporting Efficient Record Linkage for Large Data Sets Using Mapping Techniques, Chen Li, Liang Jin, and Sharad Mehrotra, World Wide Web Journal, Volume 9, Number 4, pages 557-584, December 2006. [PDF]
  5. Achieving Communication Efficiency through Push-Pull Partitioning of Semantic Spaces to Disseminate Dynamic Information, Amitabha Bagchi, Amitabh Chaudhary, Michael T. Goodrich, Chen Li, and Michal Shmueli-Scheuer. IEEE Transaction on Knowledge and Data Engineering (TKDE), October 2006 (Vol. 18, No. 10). [PDF]
  6. Answering Queries Using Materialized Views with Minimum Size. Rada Chirkova, Chen Li, and Jia Li. VLDB Journal (2006), Volume 15, Number 3, 191-210. [PDF]
  7. Recent Progress on Selected Topics on Database Research -- A Report from Nine Young Chinese Researchers Working in the United States. Zhiyuan Chen, Chen Li, Jian Pei, Yufei Tao, Haixun Wang, Wei Wang, Jiong Yang, Jun Yang, and Donghui Zhang. The Journal of Computer Science and Technology. Vol. 18, No. 5, Pages 538 - 552, September 2003. [PDF]
  8. Computing Complete Answers to Queries in the Presence of Limited Access Patterns. Chen Li. The VLDB Journal (2003) 12: 211-227 [PS] [PDF]
  9. Answering Queries with Useful Bindings. Chen Li and Edward Chang. ACM Transactions on Database Systems (TODS), Volume 26 , Issue 3 (September 2001).[PS] [PDF]
  10. Clustering for Approximate Similarity Search in High-Dimensional Spaces. Chen Li, Edward Chang, Hector Garcia-Molina, and Gio Wiederhold. IEEE Transaction on Knowledge and Data Engineering, Volume 14, Number 4, pp.792-808, July/August 2002 [PS] [PDF]

Refereed Conference Full Papers

  1. Cost-Based Variable-Length-Gram Selection for String Collections to Support Approximate Queries Efficiently, Xiaochun Yang, Bin Wang, and Chen Li, to appear in ACM SIGMOD 2008.
  2. Efficient Merging and Filtering Algorithms for Approximate String Searches, Chen Li, Jiaheng Lu, and Yiming Lu. ICDE 2008. [PDF], [PPT]
  3. Data Exchange with Arithmetic Comparisons, Foto Afrati, Chen Li, and Vassia Pavlaki. EDBT 2008. [PDF]
  4. VGRAM: Improving Performance of Approximate Queries on String Collections Using Variable-Length Grams, Chen Li, Bin Wang, and Xiaochun Yang. VLDB 2007. [PDF], [PPT]
  5. Processing Spatial-Keyword (SK) Queries in Geographic Information Retrieval (GIR) Systems, Ramaswamy Hariharan, Bijit Hore, Chen Li, Sharad Mehrotra, SSDBM 2007.
  6. Protecting Individual Information Against Inference Attacks in Data Publishing, Chen Li, Houtan  Shirani-Mehr, and Xiaochun  Yang. To appear in DASFAA 2007. [PDF]
  7. Supporting Approximate Similarity Queries with Quality Guarantees in P2P Systems, Qi Zhong, Iosif Lazaridis, Mayur Deshpande, Chen Li, Sharad Mehrotra, Hal Stern, COMAD 2006, December 14-16, 2006, Delhi, India. [PDF]
  8. Relaxing Join and Selection Queries. Nick Koudas, Chen Li, Anthony Tung, and Rares Vernica. VLDB 2006, Seoul, Korea, 2006.  (13.2% accepted) [PDF], [PPT]
  9. Selectivity Estimation for Fuzzy String Predicates in Large Data Sets, Liang Jin and Chen Li. VLDB 2005, Trondheim, Norway, August 30 - September 2, 2005. (16% accepted) [PDF], [PPT].
  10. Indexing Mixed Types for Approximate Retrieval, Liang Jin, Nick Koudas, Chen Li, Anthony K.H. Tung.VLDB 2005, Trondheim, Norway, August 30 - September 2, 2005. (16% accepted) [PDF], [PPT].
  11. Secure XML Publishing without Information Leakage in the Presence of Data Inference. Xiaochun Yang and Chen Li. VLDB, Toronto, Canada, August 29 - September 3, 2004. [PDF], [PPT]. (16% accepted)
  12. NNH: Improving Performance of Nearest-Neighbor Searches Using Histograms. Liang Jin, Nick Koudas, Chen Li. EDBT, Crete, Greece, March 2004. (14% accepted) [PDF], [Full version], [PPT]
  13. On Containment of Conjunctive Queries with Arithmetic Comparisons. Foto Afrati, Chen Li, Prasenjit Mitra. EDBT, Crete, Greece, March 2004. (14% accepted) [PDF].
  14. Materializing Views with Minimal Size to Answer Queries. Rada Chirkova and Chen Li. ACM PODS, June 2003, San Diego, CA. (20% accepted). [PDF], [PPT]
  15. Efficient Record Linkage in Large Data Sets, Liang Jin, Chen Li, and Sharad Mehrotra, in the 8th International Conference on Database Systems for Advanced Applications (DASFAA 2003) 26 - 28 March, 2003, Kyoto, Japan. (33% accepted) [PS], [PDF], [PPT]
  16. Executing SQL over Encrypted Data in the Database-Service-Provider Model. Hakan Hacigumus, Bala Iyer, Chen Li, and Sharad Mehrotra. In ACM SIGMOD, June 3-6, 2002 Madison, Wisconsin. (18% accepted) [PDF]
  17. Answering Queries Using Views with Arithmetic Comparisons. Foto Afrati, Chen Li, and Prasenjit Mitra. In ACM Symposium on Principles of Database Systems (PODS), June 3-6, 2002 Madison, Wisconsin. (22% accepted)
  18. Generating Efficient Plans for Queries Using Views. Foto Afrati, Chen Li, and Jeff Ullman. In the Proc. of the 30th ACM SIGMOD Conference, Santa Barbara, CA, May, 2001. (15% accepted) [PS] [PDF] [PPT]
  19. Minimizing View Sets without Losing Query-Answering Power. Chen Li, Mayank Bawa, and Jeff Ullman. In the 8th International Conference on Database Theory (ICDT), London, UK, January, 2001. [PS] [PDF], [PPT]. Full version: [PS] [PDF]. (35% accepted)
  20. On Answering Queries in the Presence of Limited Access Patterns. Chen Li and Edward Chang. In the 8th International Conference on Database Theory (ICDT), London, UK, January, 2001. [PS] [PDF] [PPT]. (35% accepted)
  21. Query Planning with Limited Source Capabilities. Chen Li and Edward Chang. International Conference on Database Engineering (ICDE), pages 401-412, San Diego, CA, February, 2000. (14% accepted) [PS] [PDF] [PPT]. Full version: [PS] [PDF]
  22. Computing Capabilities of Mediators. Ramana Yerneni, Chen Li, Hector Garcia-Molina, Jeffrey Ullman. SIGMOD'99, Philadelphia, PA, May 1999. (20% accepted) [PS] [PDF]. Full version: [PS] [PDF]
  23. Optimizing Large Join Queries in Mediation Systems. Ramana Yerneni, Chen Li, Jeffrey Ullman, Hector Garcia-Molina. International Conference on Database Theory (ICDT), Jerusalem, Israel, January, 1999. (29% accepted) [PS] [PDF]. Full version: [PS] [PDF]
  24. Searching Near-Replicas of Images via Clustering. Edward Chang, Chen Li, James Wang, Peter Mork, and Gio Wiederhold. Proc. of SPIE Symposium of Voice, Video, and Data Communications, Multimedia Storage and Archiving Systems VI, pages 281-292, Boston, MA, September, 1999. [PS] [PDF]
  25. RIME: A Replicated Image Detector for the World-Wide Web. Edward Chang, James Ze Wang, Chen Li, and Gio Wiederhold. Proceedings of SPIE Symposium of Voice, Video, and Data Communications, pages 58--67, Boston, MA, November 1998. [PS] [PDF]
  26. 2D BubbleUp: Managing Parallel Disks for Media Servers. Edward Chang, Hector Garcia-Molina, and Chen Li. The 5th International Conference of Foundations of Data Organization (FODO), pages 221-230, Kobe, Japan, 1998. [PS] [PDF]
  27. Performance Analysis of the Communication Mechanism for POE Workstation Cluster. Weiqiang Zhuang, Chen Li, Meiming Shen. Microcomputer & Micro-system, Jan, 1995

 

Refereed Workshop, Conference Demo Papers, and Other Publications

  1. Quality-Aware Retrieval of Data Objects from Autonomous Sources for Web-Based Repositories, Houtan Shirani-Mehr, Chen Li, Gang Liang, Michal Shmueli-Scheuer, to appear in ICDE 2008 as a poster.
  2. Communication-Efficient Query Answering with Quality Guarantees in Client-Server Applications.  Michal Shmueli-Scheuer, Amitabh Chaudhary, Avigdor Gal, Chen Li.  WebDB 2007
  3. Quality-Driven Approximate Methods for GIS Data Integration. Ramaswamy Hariharan, Michal Schmueli-Scheuer, Chen Li, and Sharad Mehrotra. ACM GIS 2005, November 4-5th, 2005 Bremen, Germany. [PDF]
  4. Answering Aggregation Queries on Hierarchical Web Sites Using Adaptive Sampling. Foto Afrati, Paraskevas Lekeas, and Chen Li. Technical Report, UCI ICS, August 2005. A short version appears in CIKM'2005, 31st October - 5th November, 2005 Bremen, Germany.
  5. XGuard: A System for Publishing XML Documents without Information Leakage in the Presence of Data Inference. Xiaochun Yang, Chen Li, Ge Yu, and Lei Shi. Proc. of ICDE'2005, demo track, Tokyo, Japan, March 2005.
  6. RACCOON: A Peer-Based System for Data Integration and Sharing. Chen Li, Jia Li, Qi Zhong. Proc. of ICDE'2004, demo track. [PDF]
  7. Schema-Guided Wrapper Maintenance for Web-Data Extraction. Xiaofeng Meng, Dongdong Hu, Chen Li. To appear in the Fifth International Workshop on Web Information and Data Management (WIDM'03), New Orleans, Louisiana. [PDF] [PPT].
  8. A Supervised Visual Wrapper Generator for Web-Data Extraction. . Xiaofeng Meng, Haiyan Wang, Dongdong Hu, Chen Li. COMPSAC 2003: 657-662. [PDF]
  9. Using Constraints to Describe Source Contents in Data Integration Systems. Chen Li. IEEE Intelligent Systems 18(5): 49-53 (2003). [PDF]
  10. Describing and Utilizing Constraints to Answer Queries in Data-Integration Systems. Chen Li. IJCAI 2003 workshop on Information Integration on the Web, August 2003, Acapulco, Mexico. [PDF], [PPT]
  11. Towards Perception-Based Image Retrieval. Edward Chang, Beitao Li, and Chen Li. Proceedings of IEEE Workshop on Content-based Access of Image and Video Libraries, p. 401-412, South Carolina, June, 2000. [PS] [PDF]
  12. Managing Parallel Disks for Continuous Media Data. Edward Chang, Chen Li, and Hector Garcia-Molina. A Book Chapter in Information Organization & Databases, p.107-120, Kluwer Publisher, 2000. [PS] [PDF]Answering Queries with Database Restrictions (Research Summary). Chen Li. Symposium on Abstraction, Reformulation and Approximation (SARA), pages 328 - 329, July, 2000, Horseshoe Bay (Lake LBJ), Texas. [PS] [PDF]
  13. I wrote a report of the Workshop on Data Mining in the Internet Age, which was held May 1 - 2, 2000, IBM Almaden Center, San Jose, California. [PS] [PDF]
  14. Capability Based Mediation in TSIMMIS. Chen Li, Ramana Yerneni, Vasilis Vassalos, Hector Garcia-Molina, Yannis Papakonstantinou, Jeffrey Ullman, Murty Valiveti. Proc. of ACM SIGMOD'98, demo track, pages 564 - 566, Seattle, WA, June, 1998. [PS] [PDF]
  15. HiComm -- A New Technique for Improving Communication Performance in Workstation Cluster. Chen Li, Weiqiang Zhuang, Meiming Shen, Dingxing Wang, Weimin Zheng, Proc. of International Workshop on Advanced Parallel Processing Technologies (APPT), October, 1995, Beijing, China.

 

Ph.D. Thesis

Query Processing and Optimization in Information-Integration Systems. Chen Li. Ph.D. Thesis, Computer Science Department, Stanford University, August, 2001.