School of Information and Computer Science
University of California Irvine
Irvine, CA 92697
(832) 278 7005
Brief BioI am currently a PhD candidate in the School of Information and Computer Science, University of California Irvine. My advisor is Prof. Cristina Lopes, Department of Informatics, University of California Irvine.
Before joining University of California Irvine, I worked at Tata Research Development and Design Center, India . During summers, I have interned at SRCH2, a startup specialized in improving user experiences in the domain of search, and Interactive Entertainment Business group at Microsoft Silicon Valley campus.
Research InterestsMy primary research interest has been in the area of Software Engineering with the emphasis on Software Evolution. I am interested in developing theoretical and practical techniques and tools for helping people to create, understand, and modify software systems. My research activities to achieve this goal so far span the spectrum from quantitative measures for software properties, to analysis of software changes, to code modularization, to code re-factoring, to developing new debugging techniques.
Machine learning algorithms and Information Retrieval have proven to be of great practical value in a variety of domains. The field of software engineering turns out to be one such fertile domain where many software development and maintenance tasks could be formulated as learning problems and Machine Learning techniques could be applied to obtain solutions.
I believe that in order to ultimately overcome the essential difficulties, it has been recognized that both the processes and products of software development should be formalized and automated, and AI techniques can play an important role in this effort. Of the many AI techniques, ML and IR methods have found their way into the software development in the past twenty years
Publications1. A Parallel and Efficient Approach to Large Scale Code Clone Detection
Hitesh Sajnani and Cristina Lopes. International Workshop on Software Clones IWSC-2013, Co-located with ICSE 2013 in San Francisco, May 2013
2. ASTRA: Bottom-up Construction of Structured Artifact Repositories
Joel Ossher, Hitesh Sajnani, and Cristina Lopes. In the proceedings of Working Conference on Reverse Engineering, WCRE-2012, Kingston, ON, October 2012
3. Automatic Software Architecture Recovery: A Machine Learning Approach
Hitesh Sajnani. In the Proceedings of International Conference on Program Comprehension, Passau, Germany, June 2012
4. Parallel Code Clone Detection Using MapReduce
Hitesh Sajnani, Joel Ossher, and Cristina Lopes. In the Proceedings of International Conference on Program Comprehension, Passau, Germany, June 2012
5. Trendy Bugs: Topic Trends in the Android Bug Reports - PDF
Lee Martie, Vijay Krishna Palepu, Hitesh Sajnani, and Cristina Lopes. In the Proceedings of Mining Software Repositories, Zurich, Switzerland, June 2012
6. Multi-Label Classification of Short Text: A Casestudy on Wikipedia Barnstars - PDF
Hitesh Sajnani, Sarah Javanmardi, David McDonald, and Cristina Lopes. In the Proceedings of Workshop on Analyzing Microtext, AAAI-2011, San Francisco, CA, August 2011
7. Application Architecture Discovery: Towards Domain-driven, Easily Extensible Code Structure - (Best paper award in the industrial track, Most outstanding paper award at TCS Innovation Summit) - PDF
Hitesh Sajnani, Ravindra Naik, and Cristina Lopes. In the Proceedings of 18th Working Conference on Reverse Engineering, Limerick, Ireland, October 2011
8. Easing Software Evolution: A Change-data and Domain-driven Approach - PDF
Hitesh Sajnani, Ravindra Naik, and Cristina Lopes. India Software Engineering Conference, Kanpur, India, February 2012
9. Clone Detection in Open Source Java Projects: The Good, The Bad, and The Ugly - PDF
Joel Ossher, Hitesh Sajnani, and Cristina Lopes. In the Proceedings of International Conference on Software Maintenance, Williamsburg, VA, September 2011
10. Using Change History of Software To Improve Software Evolvability
Ravindra Naik and Hitesh Sajnani. In the Proceedings of India Software Engineering Conferecene, Mysore, India, February 2010
CollaboratorsProf. Cristina Lopes, University of California, Irvine
Sarah Javanmardi, University of California, Irvine
Joel Ossher, University of California, Irvine
Prof. Chen Li, University of California, Irvine
Prof. David McDonald, University of Washington
Ravindra Naik, Tata Research Development and Design Center, Pune, India
Vijay Krishna Palepu, University of California, Irvine
Lee Martie, University of California, Irvine
Recent NewsVery excited for MSR Vision 2020 conference to be held in Kingston, Canada. Looking forward to it.
It was a long trip to Europe in June. Attended ICSE in Zurich to present our MSR challenge paper. Headed to Passau, Germany to attend ICPC and present our paper on scaling clone detection and a doctoral symposium paper on architecture recovery. Good feedback :)
Our paper "Application Architecture Discovery: Towards Domain-driven, Easily Extensible Code Structure" was awarded as the outstanding paper award at TCS's CTB (Corporate Technology Board) meeting. It's good to win corporate awards too :)
I attended India Software Engineering Conference, 2012, Kanpur from Feb 20-25 and presented our work on "Easing Software Evolution: A Change-data and Domain-driven Approach"
I visited Williamsburg, VA for ICSM 2011 where we had a paper in the research track on Clones in open source java systems
I am planning to go to Limerick, Ireland to attend WCRE 2011 where we have a paper in the industrial track
I will be attending Splash 2011 in Portland, Oregon from October 23 - October 29 as a student volunteer!
Check out Omniplaces, a location based, instant, fuzzy search engine. I worked on it during my internship at a startup called Bimaple