Education
Research
I belong to a diverse group working in bioinformatics, chemoinformatics,
and software engineering. The common theme is machine learning and
information retrieval. Currently I am working on the Sourcerer project,
developing methods for mining program function and developer contributions
from source code, as well as improving the way Internet-scale code
repositories are searched. I am also working on ChemDB, applying text
information retrieval to chemical search.
Publications (* denotes equal contributors)
Journal:
E. Linstead*,
Internet-Scale Software Repositories. Data Mining and Knowledge Discovery. Volume 18, Number 2. April 2009. (online)
J. Chen, E. Linstead, S. Swamidass, D. Wang, P. Baldi. “ChemDB Update: Full-text Search and Virtual
Chemical Space” Bioinformatics. Volume 23, Number 17. September 2007. (advance access)
Conference:
Evolution. Proceedings of ICMLA 2008: International Conference on Machine Learning and
Applications.
P. Baldi*, C. Lopes*, E. Linstead*, S. Bajracharya. A Theory of Aspects as Latent Topics.
OOPSLA 2008. Nashville, TN. October 2008. (online)
E. Linstead, P. Rigor, S. Bajracharya, C. Lopes, P. Baldi. Mining Internet-Scale Software
Repositories. Advances in Neural Information Processing Systems (NIPS*2007). March 2008. (online)
E. Linstead, P. Rigor, S. Bajracharya, C. Lopes, P. Baldi. Mining Concepts from Code with
Probabilistic Topic Models. Proceedings of ASE 2007: International Conference on Automated
Software Engineering.
Workshop:
E. Linstead,
L. Hughes, C. Lopes, P. Baldi. Exploring
Java Software Vocabulary: A Search and
Mining Perspective. Proceedings of
Development
– Users, Interfaces, Tools, and Environments.
E. Linstead, P. Rigor, S. Bajracharya, C. Lopes, P. Baldi. Mining Eclipse Developer Contributions via
Author-Topic Models. Fourth International Workshop on Mining Software Repositories.
MN. May 2007. (Voted best paper, MSR “Scale” Challenge). (online)
Poster:
E. Linstead, L. Hughes, C. Lopes, P. Baldi. Capturing Java Naming Conventions with First-Order Markov Models.
ICPC 2009: Proceedings of the Seventeenth International Conference on Program Comprehension.
S. Bajracharya, T. Ngo, E. Linstead, Y. Dou, P. Rigor, P. Baldi & C. Lopes. Sourcerer:
A Search Engine for Open Source Code Supporting Structure-Based Search. OOPSLA ’06 Poster Session.
Technical Report:
S. Bajracharya, T. Ngo, E. Linstead, P. Rigor, Y. Dou, P. Baldi & C. Lopes.
A Study of Ranking Schemes in Internet-Scale Code Search.
UCI ISR Technical Report # UCI-ISR-07-8. Nov. 2007. (online)
Recent Invited Talks:
Searching and Mining Internet-Scale Software Repositories.
AI and Machine Learning Seminar. Dept. of Computer Science. UCI. November 10, 2008.
Google Tech Talk.
About Me
I'm a fourth-year PhD student working in the area of Artificial Intelligence in
the
Because I don't like having free time, I'm also a real-time software engineer
with the Boeing Company, where I spend most of
the day writing code for interesting projects.
I continue to stay involved with
the Department of Math and Computer Science at
teach courses, when called upon, in C/C++, Data Structures, AI, Computer
Architecture, Graphics, Computer Ethics,
and Data Mining.
My wife, Jackie, is a high-school English teacher. We’re currently having lots of fun learning how to fix up our new
house together, as well as hanging out with our two beagles.
Hobbies
My dad and I started restoring old Porsches when I was in high school. I
don't have time to keep up with it right now, but I still
own a 1969 911 S that we started working on during my senior year. A few years
ago I became the proud owner of a new
2003 Boxster S. One of my loftier goals for the
future is to own a Ferrari, but for the time being I'm very, very content!
Links
My wife and mother-in-law have recently started a business, Forever Linens Chair Covers. They
specialize in chair covers, chair treatments, and other linens for special events. If you’re
interested, you can learn about their various chair covers and linens.
Last Updated: Feb. 17, 2009