Information Retrieval and Mining Massive Data Sets
What you'll learn
- The course is divided into the following parts.
- Part 1: Building an Information Retrieval System
- Part 2: Mining Frequent Patterns and Associations
- Part 3: Classification and Clustering
- Part 4: Web Mining
- Part 5: Recommendation Systems
Requirements
- Knowledge of probability and linear algebra.
- A good grasp of graduate-level algorithms.
- Experience with a programming language (C, Python, or Java).
Description
The goal of this course is to introduce the techniques required to build an information retrieval (IR) system. We will explore methods for solving big data problems and evaluate alternative solutions and their trade-offs. In the later part of the course, we will discuss data mining algorithms for making sense of massive data sets.
Who this course is for:
- Big data enthusiasts
- Data Scientists
Instructors
I received my Bachelor's in Computer Science and Engineering from the Indian Institute of Technology, Delhi in 2002, followed by a Master's (2005) and a Ph.D. (2008) in Computer Science from Stanford University, where I was a Stanford Graduate Fellow. I was part of the Batzoglou lab, and I worked with Luca Cavalli-Sforza and Marc Feldman on using simulations to build and test models of ancient human migrations. Our simulation work on the genetic prehistory of humans supports a Serial Founder Effect out of Africa. I also worked with Deborah Rogers on agent-based simulations of egalitarian and hierarchical societies to explore how inequality spread in early human history.
I then worked on building and maintaining graph-based ontologies for text mining and categorization at Kosmix (now called @WalmartLabs after being acquired by Walmart).
EdTech remains my primary passion. I designed Odyssey, a Next Generation Science game, at The Young Socratics, an education company that I co-founded to provide a historical-philosophical perspective on math and science topics in the middle and high school curriculum. Odyssey had a successful Kickstarter and is now available for download on Steam.
We bring a highly qualified network of mentors who are passionate about developing the next generation of engineers. Our professionals come from top US universities like Stanford and UC Berkeley and have significant industry experience with companies like IBM, Google, Cisco and VMware. They bring to the table invaluable real-world experience in bite-sized video nuggets. Their contributions give our focused, role-based training a quality that is hard to find in industry.