INF 141
V 2/26/2009 Lecture Notes
* Kick-off:
V I have one IR paper
V Cards
*
FirefoxScreenSnapz001
FirefoxScreenSnapz001

* Complicated quirky Technical mathematical varied progressed data search query corpus information retrieval fast future trust useful amazing growing information unstructured docs corpus retrieval query complex large efficient purple algorithmic polite search rank index crawl web google search in a large corpus efficiently a way to retrieve information needed search science metadata database data document text math crawl search query score rank map distributedly digging through documents discovering data crawler complex frustrating time consuming tricky mathematical searching documents metadata relational database www aggregate search gather useful difficult fast efficient search relevant search document weighting vast useful technological studied business scholarly text retrieval information overload information sorting document retrieval data retrieval corpus statistics crawlers retrieve and rank info about queries how search engines crawl the web intractable expensive difficult google difficult interesting useful important speedy slow Patterson crawler hard amazing cool confusing search evaluate get gather sort modify retrieving challenge fun useful slow indexing the science of searching for documents google query posting list frequency database index how one searches for different things evolving organization information cool big money google not Microsoft move google need search stop word list distributed hadoop inverted index parsing stemming large important crawl web search corpus linear multi distributed stuff wikipedia reuters magical special fun happy sad all knowing rank query queuing cosine google key web crawling page rank page scores sexy enthralling exciting bright colorful confusing
* Assignment 06
* Learning Objective:
V Cards
* Name
V Video break