Kyle I S Harrington / kyle@eecs.tufts.edu
Course web page: http://www.github.com/kephale/TuftsCOMP135_Spring2016/
Algorithms, methods, and techniques for learning and prediction based upon data
Use existing data (if an offline task), obtain data (if an online task)
Analyze the data, build a model, update an existing model
Use the knowledge/model to understand the data or make predictions
Supervised learning
Unsupervised learning
Reinforcement learning
and More
Problem domain | Class (output) |
---|---|
Handwritten character recognition | letters/ASCII |
Face detection | Bounding box/features of person |
Spam detection | True/False |
Protein classification | Protein type/attributes |
Astronomical phenomena | Orientation/Location/Star type |
![]() |
![]() |
Prey in the wild | Predator classification |
Predicting what discrete category some observation belongs to based on features of the observation.
Is an email spam? Return a boolean.
What character is this? Return a value from a list.
Make a prediction of a continuous value, often a number.
What is the value of a stock price? Some positive floating point.
What equation represents a collection of numbers?
Problem domain | Class (output) |
---|---|
Tweets | Topic of message |
All images on the internet | What is in the image? |
Gene arrays | What genes are coexpressed? |
Marketing surveys | What groups of consumers are there? |
Problem domain | Class (output) |
---|---|
Robot control | Determine a sequence of actions to carry out (drive a car, fly a quadcopter) |
Game playing | Play backgammon, play as a NPC in games |
Elevator control | Move people between floors as efficiently as possible |
How do we represent the data?
Data is problem specific
Let's start by looking at the data (from Assignment 1) with some easy to use program like Excel or LibreOffice
Map some attributes (input) to a predicted class label (output)
It is a function! Just like a function you program, the internals of the function may be very different depending on the type of classifier
Using the training data, create a classifier that predicts outputs from inputs correctly
Even you could be the ML algorithm
Programming
Math: Probability, Statistics, Calculus, Linear Algebra, Information Theory
Posted in the assignments section
Due: Jan 27
Nearest Neighbors/KD-Trees
Decision Trees