Chen Li's Publications
Refereed Journal Articles
- SEPIA: Estimating Selectivities
of Approximate String Predicates in Large Databases. Liang Jin, Chen
Li, and Rares Vernica. To appear in VLDB Journal.
- Using Views to Generate Efficient Evaluation
Plans for Queries, Foto Afrati, Chen Li, and Jeff Ullman, Journal of Computer and System Sciences,
Volume 73, Issue 5 (August 2007), pages 703-724. [PDF]
- Rewriting Queries Using Views in the Presence of Arithmetic Comparisons,
Foto Afrati,
Chen Li, and Prasenjit Mitra, Theoretical Computer Science, Volume 368,
Numbers 1-2, pages 88-123, 2006. [PDF]
- Supporting Efficient
Record Linkage for Large Data Sets Using Mapping Techniques, Chen Li,
Liang Jin, and Sharad Mehrotra,
World Wide Web Journal, Volume 9, Number 4, pages 557-584, December 2006.
[PDF]
- Achieving Communication
Efficiency through Push-Pull Partitioning of Semantic Spaces to
Disseminate Dynamic Information, Amitabha Bagchi, Amitabh Chaudhary, Michael T. Goodrich, Chen Li, and Michal Shmueli-Scheuer. IEEE Transactions on Knowledge and
Data Engineering (TKDE), October 2006 (Vol. 18, No. 10). [PDF]
- Answering Queries Using
Materialized Views with Minimum Size. Rada Chirkova, Chen Li, and Jia
Li. VLDB Journal (2006), Volume 15, Number 3, 191-210. [PDF]
- Recent Progress on
Selected Topics on Database Research -- A Report from Nine Young Chinese
Researchers Working in the United
States. Zhiyuan
Chen, Chen Li, Jian Pei, Yufei
Tao, Haixun Wang, Wei Wang, Jiong
Yang, Jun Yang, and Donghui Zhang. The Journal
of Computer Science and Technology. Vol. 18, No. 5, Pages 538 - 552,
September 2003. [PDF]
- Computing Complete Answers
to Queries in the Presence of Limited Access Patterns. Chen Li. The
VLDB Journal (2003) 12: 211-227 [PS] [PDF]
- Answering Queries with
Useful Bindings. Chen Li and Edward Chang. ACM Transactions on
Database Systems (TODS), Volume 26, Issue 3 (September 2001). [PS] [PDF]
- Clustering for Approximate
Similarity Search in High-Dimensional Spaces. Chen Li, Edward Chang,
Hector Garcia-Molina, and Gio Wiederhold. IEEE Transactions on Knowledge and Data
Engineering, Volume 14, Number 4, pp. 792-808, July/August 2002. [PS] [PDF]
Refereed Conference Full Papers
- Cost-Based Variable-Length-Gram Selection for String Collections
to Support Approximate Queries Efficiently, Xiaochun
Yang, Bin Wang, and Chen Li, to appear in ACM SIGMOD 2008.
- Efficient Merging and Filtering Algorithms for Approximate String
Searches, Chen Li, Jiaheng Lu, and Yiming Lu. ICDE 2008. [PDF], [PPT]
- Data Exchange with Arithmetic Comparisons, Foto
Afrati, Chen Li, and Vassia
Pavlaki. EDBT 2008. [PDF]
- VGRAM: Improving Performance of Approximate Queries on String
Collections Using Variable-Length Grams, Chen Li, Bin Wang, and Xiaochun Yang. VLDB 2007. [PDF], [PPT]
- Processing Spatial-Keyword (SK) Queries in Geographic Information
Retrieval (GIR) Systems, Ramaswamy Hariharan, Bijit Hore, Chen Li, Sharad Mehrotra, SSDBM 2007.
- Protecting Individual Information Against Inference Attacks in
Data Publishing, Chen Li, Houtan Shirani-Mehr,
and Xiaochun Yang. To appear in DASFAA 2007. [PDF]
- Supporting Approximate Similarity
Queries with Quality Guarantees in P2P Systems, Qi
Zhong, Iosif Lazaridis, Mayur Deshpande, Chen Li, Sharad Mehrotra, Hal Stern, COMAD 2006, December 14-16, 2006,
Delhi, India. [PDF]
- Relaxing Join and Selection Queries. Nick Koudas,
Chen Li, Anthony Tung, and Rares Vernica. VLDB 2006, Seoul, Korea,
2006. (13.2% accepted) [PDF], [PPT]
- Selectivity Estimation for
Fuzzy String Predicates in Large Data Sets, Liang Jin and Chen Li.
VLDB 2005, Trondheim, Norway, August 30 - September
2, 2005. (16% accepted) [PDF], [PPT].
- Indexing Mixed Types for Approximate
Retrieval, Liang Jin, Nick Koudas, Chen Li,
Anthony K.H. Tung. VLDB 2005, Trondheim, Norway,
August 30 - September 2, 2005. (16% accepted) [PDF], [PPT].
- Secure XML Publishing
without Information Leakage in the Presence of Data Inference. Xiaochun Yang and Chen Li. VLDB, Toronto, Canada,
August 29 - September 3, 2004. [PDF], [PPT]. (16% accepted)
- NNH: Improving Performance
of Nearest-Neighbor Searches Using Histograms. Liang Jin, Nick Koudas, Chen Li. EDBT, Crete, Greece,
March 2004. (14% accepted) [PDF],
[Full version], [PPT]
- On Containment of
Conjunctive Queries with Arithmetic Comparisons. Foto
Afrati, Chen Li, Prasenjit
Mitra. EDBT, Crete, Greece,
March 2004. (14% accepted) [PDF].
- Materializing Views with
Minimal Size to Answer Queries. Rada Chirkova and Chen Li. ACM PODS, June 2003, San Diego, CA. (20%
accepted). [PDF], [PPT]
- Efficient Record Linkage
in Large Data Sets, Liang Jin, Chen Li, and Sharad
Mehrotra, in the 8th International Conference on
Database Systems for Advanced Applications (DASFAA 2003) 26 - 28 March,
2003, Kyoto, Japan. (33% accepted) [PS],
[PDF], [PPT]
- Executing SQL over
Encrypted Data in the Database-Service-Provider Model. Hakan Hacigumus, Bala Iyer, Chen Li, and Sharad Mehrotra. In ACM
SIGMOD, June 3-6, 2002 Madison,
Wisconsin. (18% accepted)
[PDF]
- Answering Queries Using
Views with Arithmetic Comparisons. Foto Afrati, Chen Li, and Prasenjit
Mitra. In ACM Symposium on Principles of
Database Systems (PODS), June 3-6, 2002 Madison, Wisconsin.
(22% accepted)
- Generating Efficient Plans
for Queries Using Views. Foto Afrati, Chen Li, and Jeff Ullman.
In the Proc. of the 30th ACM SIGMOD Conference, Santa Barbara, CA,
May, 2001. (15% accepted) [PS] [PDF] [PPT]
- Minimizing View Sets
without Losing Query-Answering Power. Chen Li, Mayank
Bawa, and Jeff Ullman.
In the 8th International Conference on Database Theory (ICDT), London, UK, January, 2001. [PS] [PDF], [PPT]. Full version: [PS] [PDF]. (35% accepted)
- On Answering Queries in
the Presence of Limited Access Patterns. Chen Li and Edward Chang. In
the 8th International Conference on Database Theory (ICDT), London, UK, January, 2001. [PS] [PDF]
[PPT]. (35% accepted)
- Query Planning with
Limited Source Capabilities. Chen Li and Edward Chang.
International Conference on Data Engineering (ICDE), pages 401-412,
San Diego, CA, February, 2000. (14% accepted) [PS] [PDF]
[PPT]. Full version: [PS] [PDF]
- Computing Capabilities of
Mediators. Ramana Yerneni,
Chen Li, Hector Garcia-Molina, Jeffrey Ullman.
SIGMOD'99, Philadelphia,
PA, May 1999. (20%
accepted) [PS] [PDF]. Full version: [PS] [PDF]
- Optimizing Large Join
Queries in Mediation Systems. Ramana Yerneni, Chen Li, Jeffrey Ullman,
Hector Garcia-Molina. International Conference on Database Theory (ICDT), Jerusalem, Israel, January, 1999. (29%
accepted) [PS] [PDF]. Full version: [PS] [PDF]
- Searching Near-Replicas of
Images via Clustering. Edward Chang, Chen Li, James Wang, Peter Mork, and Gio Wiederhold. Proc. of SPIE Symposium of Voice, Video,
and Data Communications, Multimedia Storage and Archiving Systems VI,
pages 281-292, Boston, MA, September, 1999. [PS]
[PDF]
- RIME: A Replicated Image
Detector for the World-Wide Web. Edward Chang, James Ze Wang, Chen Li, and Gio Wiederhold. Proceedings of SPIE Symposium of Voice,
Video, and Data Communications, pages 58-67, Boston, MA, November 1998. [PS] [PDF]
- 2D BubbleUp: Managing Parallel Disks for
Media Servers.
Edward Chang, Hector Garcia-Molina, and Chen Li. The 5th
International Conference on Foundations of Data Organization (FODO), pages
221-230, Kobe, Japan, 1998. [PS] [PDF]
- Performance Analysis of
the Communication Mechanism for POE Workstation Cluster. Weiqiang Zhuang, Chen Li, Meiming Shen.
Microcomputer & Micro-system, Jan, 1995
Refereed Workshop, Conference Demo Papers, and Other Publications
- Quality-Aware Retrieval of Data Objects from Autonomous Sources
for Web-Based Repositories, Houtan Shirani-Mehr, Chen Li, Gang Liang, Michal Shmueli-Scheuer, ICDE 2008 (poster). [PDF]
- Communication-Efficient Query Answering with Quality Guarantees in
Client-Server Applications.
Michal Shmueli-Scheuer, Amitabh Chaudhary, Avigdor Gal, Chen Li. WebDB
2007
- Quality-Driven Approximate
Methods for GIS Data Integration. Ramaswamy Hariharan, Michal Shmueli-Scheuer,
Chen Li, and Sharad Mehrotra.
ACM GIS 2005, November 4-5th, 2005 Bremen,
Germany. [PDF]
- Answering Aggregation
Queries on Hierarchical Web Sites Using Adaptive Sampling. Foto Afrati, Paraskevas Lekeas, and Chen
Li. Technical Report, UCI ICS, August
2005. A short version appears in CIKM'2005,
31st October - 5th November, 2005 Bremen,
Germany.
- XGuard:
A System for Publishing XML Documents without Information Leakage in the
Presence of Data Inference. Xiaochun Yang,
Chen Li, Ge Yu, and Lei Shi. Proc. of ICDE'2005,
demo track, Tokyo, Japan, March 2005.
- RACCOON: A Peer-Based
System for Data Integration and Sharing. Chen Li, Jia Li, Qi Zhong. Proc. of
ICDE'2004, demo track. [PDF]
- Schema-Guided Wrapper
Maintenance for Web-Data Extraction. Xiaofeng Meng, Dongdong Hu, Chen Li. To
appear in the Fifth International Workshop on Web Information and Data
Management (WIDM'03), New Orleans, Louisiana. [PDF] [PPT].
- A Supervised Visual
Wrapper Generator for Web-Data Extraction. Xiaofeng
Meng, Haiyan Wang, Dongdong Hu, Chen Li.
COMPSAC 2003: 657-662. [PDF]
- Using Constraints to
Describe Source Contents in Data Integration Systems. Chen Li. IEEE
Intelligent Systems 18(5): 49-53 (2003). [PDF]
- Describing and Utilizing
Constraints to Answer Queries in Data-Integration Systems. Chen Li.
IJCAI 2003 workshop on Information Integration on the Web, August 2003, Acapulco, Mexico. [PDF], [PPT]
- Towards Perception-Based
Image Retrieval. Edward Chang, Beitao Li,
and Chen Li. Proceedings of IEEE Workshop on Content-based Access of Image
and Video Libraries, p. 401-412, South Carolina, June, 2000. [PS] [PDF]
- Managing Parallel Disks
for Continuous Media Data. Edward Chang, Chen Li, and Hector
Garcia-Molina. A Book Chapter in Information Organization & Databases,
p.107-120, Kluwer Publisher, 2000. [PS] [PDF]
- Answering
Queries with Database Restrictions (Research Summary). Chen Li. Symposium
on Abstraction, Reformulation and Approximation (SARA), pages 328 -
329, July, 2000, Horseshoe Bay (Lake
LBJ), Texas. [PS] [PDF]
- I wrote a report of the Workshop on Data
Mining in the Internet Age, which was held May 1 - 2, 2000, IBM Almaden Center, San Jose, California. [PS] [PDF]
- Capability Based Mediation
in TSIMMIS. Chen Li, Ramana
Yerneni, Vasilis Vassalos, Hector Garcia-Molina, Yannis Papakonstantinou,
Jeffrey Ullman, Murty Valiveti. Proc. of ACM SIGMOD'98, demo track,
pages 564 - 566, Seattle,
WA, June, 1998. [PS] [PDF]
- HiComm
-- A New Technique for Improving Communication Performance in Workstation
Cluster. Chen Li, Weiqiang Zhuang, Meiming Shen, Dingxing Wang, Weimin Zheng, Proc. of
International Workshop on Advanced Parallel Processing Technologies
(APPT), October, 1995, Beijing, China.
Ph.D. Thesis
Query Processing and Optimization in
Information-Integration Systems. Chen Li. Ph.D.
Thesis, Computer Science Department, Stanford University, August, 2001.