Big Data Analytics is a course on data mining and related topics, taught at UC San Diego by Prof. Roger Bohn. Welcome! This web site, BDA2020.wordpress.com, will be the primary channel for sending out course information.
Up-to-date information for Spring 2018
- New: What to do by Tuesday April 3. Most important: Install the software.
- Should you take this course?
- What are the course requirements (including prerequisites)?
- Order the main textbook. The bookstore ought to have it, but it looked cheaper at Amazon.
- Download the free textbook. Data Mining with Rattle and R: The Art of Excavating Data for Knowledge Discovery by Graham Williams.
- Start thinking about project topics that interest you. Many pages on this site have links to past applications or data sets. I am listing a few candidates, but a topic that you are personally interested in is generally the most rewarding.
- Install RStudio, R, and Rattle. Old instructions for installation are here. They should be enough to get started.
- Recommended: take a 3 to 5 hour tutorial on “Exploratory Data Analysis” using R. There are at least 20 free ones available. Many of them are the first or second lesson of a longer course on Udacity or Coursera. Both Udacity and Coursera are free, although they try to persuade you to pay.
Rbohn@ucsd.edu Please include your name and #BDA18 in all messages.
Text message: 858 381-2015 (That is a Google number.) Please include your name and #BDA18 in all messages.
Twitter users: Posts on this web site are linked to my Twitter feed, @RogerBohn #BDA. All course correspondence should include #BDA.
We will have a TA and a “strongly suggested” weekly workshop on R. The date and time will be set based on what works for most students.