Portfolio

OUTLINE OVERVIEW

The overall goal of the Project Portfolio is to demonstrate to a panel of faculty experts that the student is able to:

  1. Describe a broad overview of the major practice areas in data science
  2. Collect and organize data
  3. Identify patterns in data via visualization, statistical analysis, and data mining
  4. Develop alternative strategies based on the data
  5. Develop a plan of action to implement the business decisions derived from the analyses
  6. Demonstrate communication skills regarding data and its analysis for managers, IT professionals, programmers, statisticians, and other relevant professionals in their organization
  7. Synthesize the ethical dimensions of data science practice (e.g., privacy)

OUTLINE

1. Broad Overview

2. Collect & Organize Data

COLLECTING DATA:

  • From a corpus (HW1)

  • By scraping the internet (HW3_A)

CLEANING DATA:

  • How to deal with “dirty” data (HW3_B)
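
One common flavor of “dirty” data is scraped text that still carries HTML tags, escaped entities, stray whitespace, and duplicates. The following is a minimal sketch of that kind of cleanup, not the HW3_B code itself; the function names `clean_text` and `clean_records` are hypothetical, and the regex-based tag stripping is a simplification that a real HTML parser would handle more robustly.

```python
import html
import re

# Crude patterns: anything between angle brackets, and runs of whitespace.
TAG_RE = re.compile(r"<[^>]+>")
WS_RE = re.compile(r"\s+")


def clean_text(raw: str) -> str:
    """Strip tags, decode HTML entities, and collapse whitespace."""
    no_tags = TAG_RE.sub(" ", raw)
    unescaped = html.unescape(no_tags)
    return WS_RE.sub(" ", unescaped).strip()


def clean_records(records):
    """Clean each record, dropping empties and case-insensitive duplicates."""
    seen = set()
    cleaned = []
    for raw in records:
        text = clean_text(raw)
        key = text.lower()
        if text and key not in seen:
            seen.add(key)
            cleaned.append(text)
    return cleaned


if __name__ == "__main__":
    sample = ["<p>Great   movie &amp; cast!</p>\n", "great movie & cast!", "   "]
    print(clean_records(sample))
```

Duplicates are detected on the lowercased text, but the first spelling seen is the one that is kept.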

LABELING DATA:

  • Using Amazon Mechanical Turk (HW5)

3. Identify Patterns & Visualize Data

4. Analyze Data

5. Implement Business Decisions

6. Communicate analysis (Visualizations)

  • With word clouds (HW1)
  • With bar charts (HW7_V2)
  • World Happiness Report
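
Word clouds and bar charts of text both start from the same input: a table of word frequencies. The sketch below shows that shared step using only the standard library, with a plain-text bar chart standing in for a plotted one. It is an illustration under stated assumptions, not the code from HW1 or HW7_V2: the stopword list is a tiny placeholder, and `top_words` and `text_bar_chart` are hypothetical names.

```python
import re
from collections import Counter

# Placeholder stopword list (assumption); a real analysis would use a fuller one.
STOPWORDS = {"the", "a", "an", "and", "of", "to", "is", "in", "it"}


def top_words(text, n=10):
    """Return the n most frequent non-stopword tokens as (word, count) pairs."""
    tokens = re.findall(r"[a-z']+", text.lower())
    counts = Counter(t for t in tokens if t not in STOPWORDS)
    return counts.most_common(n)


def text_bar_chart(pairs, width=30):
    """Render (word, count) pairs as a text bar chart scaled to the largest count."""
    if not pairs:
        return ""
    peak = max(count for _, count in pairs)
    lines = []
    for word, count in pairs:
        bar = "#" * max(1, round(width * count / peak))
        lines.append(f"{word:<12} {bar} {count}")
    return "\n".join(lines)


if __name__ == "__main__":
    pairs = top_words("The data and the model: data, data, model.", n=2)
    print(text_bar_chart(pairs, width=6))
```

The same `(word, count)` pairs could be handed to a plotting or word cloud library in place of the text renderer.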

7. Review ethical ramifications


10 Projects By Class:

IST 664 – Natural Language Processing

NLP_FinalProject

IST 652 – Scripting for Data Analysis

  • IMDB Final Project

IST 736 – Text Mining

HW1

HW2

HW3

HW4 & HW6

HW5

HW7 (v2)

HW8

Final Project

Projects by Topic (with code)

10: IST 736 HW8 – Topic Modeling

HW8

HW8 – Code

Projects In Progress

HW3_A


CLASSES:

iSchool:

IST 718 – BIG DATA ANALYTICS

IST 736 – TEXT MINING

IST 652 – SCRIPTING

IST 664 – NLP

IST 719 – DATA VISUALIZATION

IST 707 – DATA ANALYTICS

IST 659 – DATABASE ADMIN & MGMT

IST 687 – INTRO TO DATA SCIENCE

MBA Program:

FIN 654 – FINANCIAL ANALYTICS

MAR 653 – MARKETING ANALYTICS

SCM 651 – BUSINESS ANALYTICS

MBC 638 – DATA ANALYSIS & DECISION MAKING