Lydia
Yingyi Bu

Office: Room 2064 DBH
Tel: 949-8789-0298
Department of Computer Science
University of California, Irvine
Irvine, CA 92697

Email: yingyib@ics.uci.edu

I am now a Ph.D. student in Department of Computer Science, University of California, Irvine, working with Prof. Michael J. Carey. Before that, I got a B.Sc. from Nanjing University, China, an M.Phil. from The Chinese University of Hong Kong, and also fulltimely worked in Microsoft SQL Server group .

News: 

Research Interests

My primary area of research interest is in large scale data management systems, especially data-intensive computing systems.

Current projects:

Past projects:

Publications (dlbp entry)

A Bloat-Aware Design for Big Data Applications [PDF][PPT][An independent Chinese translation]
(Open-source systems using our design paradigm: AsterixDB, Hyracks, Pregelix )
Yingyi Bu, Vinayak Borkar, Guoqing Xu, and Michael J. Carey
In Proceedings of the 2013 ACM SIGPLAN International Symposium on Memory Management (ISMM 2013)
Seattle, WA, June 20-21, 2013.
Declarative Systems for Large-Scale Machine Learning [PDF]
Vinayak Borkar, Yingyi Bu, Michael J. Carey, Joshua Rosen, Neoklis Polyzotis, Tyson Condie, Markus Weimer, Raghu Ramakrishnan
IEEE Data Engineering Bulletin. Volume 35, Number 2, June 2012
The HaLoop Approach to Large-Scale Iterative Data Analysis [PDF][Implementation]
Yingyi Bu, Bill Howe, Magdalenda Balazinska, Michael D. Ernst
The VLDB Journal (VLDBJ), Volume 21, Number 2, April 2012.
Combined Static and Dynamic Automated Test Generation [PDF][Implementation]
Sai Zhang, David Suff, Yingyi Bu, Michael D. Ernst
In Proceedings of the 11th International Symposium on Software Testing and Analysis (ISSTA 2011)
Toronto, ON, Canada, July 17 - 21, 2011 (acceptance rate: 35/121=28.9%)
HaLoop: Efficient Iterative Data Processing on Large Clusters [PDF][PPT][Talk in Berkeley][Implementation]
(selected for Best of VLDB 2010 issue of VLDB Journal )
Yingyi Bu, Bill Howe, Magdalenda Balazinska, Michael D. Ernst
In Proceedings of the 36th International Conference on Very Large Data Bases (VLDB 2010)
Singapore, 11-17 September, 2010. (Acceptance Rate: 33/204 = 16.1%)
Efficient Anomaly Monitoring Over Moving Object Trajectory Streams [PDF][PPT][Source Code][Dataset]
Yingyi Bu, Lei Chen, Ada Wai-Chee Fu, Dawei Liu
In Proceedings of the 15th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2009)
Paris, France, June 28-July 1, 2009. (Acceptance Rate: 105/537 = 19.6%)
Privacy Preserving Serial Data Publishing By Role Composition [PDF][PPT][Source Code][Dataset Link]
Yingyi Bu, Ada Wai-Chee Fu, Raymond Chi-Wing Wong, Lei Chen, Jiuyong Li
In Proceedings of the 34th International Conference on Very Large Data Bases (VLDB 2008)
Auckland, New Zealand on 24-30 Aug, 2008. (Acceptance Rate: 46/273 = 16.8%)
WAT: Finding Top-K Discords in Time Series Database [PDF][Source Code]
Yingyi Bu, Tat-Wing Leung, Ada Wai-Chee Fu, Eamonn Keogh, Jian Pei, Sam Meshkin
In Proceedings of the 2007 SIAM International Conference on Data Mining (SDM 2007)
Minneapolis, MN, USA, April 26-28, 2007. (Acceptance Rate: 25%)

System Demos and Posters

Pregelix: Dataflow-Based Big Graph Analytics [PDF][Open-source System]
Yingyi Bu
In Proceedings of the 2013 ACM SIGMOD/SIGOPS Symposium on Cloud Computing (SOCC 2013)
Santa Clara, CA, October 1-3, 2013.
Comparing SSD-placement strategies to scale a Database-in-the-Cloud [PDF]
Yingyi Bu, Hongrae Lee, Jayant Madhavan
In Proceedings of the 2013 ACM SIGMOD/SIGOPS Symposium on Cloud Computing (SOCC 2013)
Santa Clara, CA, October 1-3, 2013.
ASTERIX: An Open Source System for "Big Data" Management and Analysis [PDF]
Sattam Alsubaiee, Yasser Altowim, Hotham Altwaijry, Alexander Behm, Vinayak R. Borkar, Yingyi Bu, Michael J. Carey, Raman Grover, Zachary Heilbron, Young-Seok Kim, Chen Li, Nicola Onose, Pouria Pirzadeh, Rares Vernica, Jian Wen
In Proceedings of the 38th International Conference on Very Large Data Bases (VLDB 2012)
─░stanbul, Turkey, August 27-31, 2012.

Honors and Awards