The Academic Program

New College offers a Master’s Degree in Applied Data Science, under the Classification of Instructional Programs (CIP) code 11.0104. The degree requires 36 credit hours of graduate work. Students complete 11 courses at 3 credit hours each, one practicum at 3 credit hours, and non-credit earning courses (the Introduction to Data Science Boot Camp, the Industrial Workshops, a second industrial practicum, and the Industrial Seminar Series).

The Applied Data Science Degree Program emphasizes mathematical rigor and computational mastery in the collection, visualization, and use of data with particular focus on the statistical and computational challenges of very large and unstructured data sets. All New College graduate students enroll full-time; there are no part-time enrollment options for this program.

First Year
Pre-Semester
IDC 5100
First Semester
IDC 5204
IDC 5110
IDC 5120
IDC 5210
IDC 5251
January Interterm
IDC 5295
Second Semester
IDC 5205
IDC 5112
IDC 5130
IDC 5131
Year One Summer or Year Two January Interterm
IDC 6293
Second Year
Third Semester
IDC 6200
IDC 6215
IDC 6250
IDC 6253
Fourth Semester
IDC 6294

Graduate Certificates

New College offers four Graduate Certificate Programs integrated with its MS in Applied Data Science program. The purpose of these programs is to make its strong curriculum, and faculty expertise and experience, available to the community, including working professionals and anyone else who seeks to advance their knowledge and skills in the field of applied data science, analytics, and visualization for personal and professional development. Each graduate certificate program is offered as a 3-course sequence that participants will be taking for college credit. Each program is developed with a particular focus area in mind, such as data analytics and visualization, applied statistics, applied machine learning, and distributed computing, and is designed not only to provide foundational knowledge but also to foster development of skills (e.g., programming, modeling) highly demanded in today’s competitive job marketplace.

Admission Requirements

Participants must complete an online application for the intended certificate program and provide documentation of a Bachelor’s degree (in any discipline).

Participants must also meet further requirements as specified by each certificate program (e.g. experience or knowledge in calculus, linear algebra and/or computer programming). Applicants must also submit a short letter of interest/intent to pursue their graduate certificate program of interest.

The graduate admissions committee will review each application and determine selections based on various factors including but not limited to: academic credentials from prior college attendance; knowledge, skills and experience in data science or a related field; interest in data science, applied statistics, business analytics, business intelligence systems or related domains; and overall motivation.

Completion Criteria

Participants of each graduate certificate program must achieve at least 3.0 cumulative GPA over all courses included in the program. A participant has the option of retaking a course (e.g., to increase their cumulative GPA to 3.0) regardless of their grade(s) in earlier attempt(s). After two or more attempts, only the most recent letter grade will be factored into the cumulative GPA calculation. Please note however that all course attempts will be reflected on final transcripts.

Upon successful completion of a graduate certificate program, the student will be provided by the Office of the Registrar a printed certificate and a transcript that details the coursework completed.

Graduate Certificate in Data Analysis and Visualization with R

This 3-course certificate program provides participants with a solid background in data analysis and data visualization using the widely used data science programming language R. Starting with a course that introduces the fundamental principles in data extraction, loading, pre-processing, and analysis, it continues with a course on applied statistics and concludes with a course on Data Visualization.

Prerequisites: None

IDC 5110Data Munging and Exploratory Data Analysis
IDC 5204Applied Statistics I
IDC 5112Data Visualization

Learning Outcomes

  • Demonstrate understanding of fundamental concepts in
    • Data cleansing and exploratory data analysis
    • Introductory statistical analysis
    • Data visualization and its role and importance in data analysis
  • Construct effective data visualizations and communicate findings
  • Demonstrate proficiency in R programming language
  • Demonstrate awareness and recognition of ethical issues in data analysis and visualization
  • Operate effectively in a teamwork environment and communicate effectively with peers
  • Communicate orally and in writing with audiences the results of data analysis or visualization

Graduate Certificate in Machine Learning with Python

This 3-course certificate program aims to equip its participants with knowledge and skills to develop and apply machine learning models to solve complex real-world problems. Starting with a course that introduces the fundamental principles in algorithms and optimization, it continues with models in machine learning and concludes with a course on advanced topics in computing including neural networks and deep learning. All implementations are done in Python.

Prerequisites

  • Knowledge of, or experience in, introductory Python
  • Knowledge of linear algebra
IDC 5120Algorithms for Data Science
IDC 5210Applied Machine Learning
IDC 6215Advanced Applied Computing

Learning Outcomes

  • Demonstrate understanding of fundamental concepts in
    • Algorithms and optimization
    • Machine learning (ML) models and their applications
    • Advanced computational models, including deep learning (DL), and their applications
  • Build data science pipelines that include ML/DL models
  • Demonstrate proficiency in Python programming language
  • Demonstrate awareness and recognition of ethical issues in machine learning and artificial intelligence
  • Operate effectively in a teamwork environment and communicate effectively with peers
  • Communicate orally and in writing with audiences the results of data analysis or visualization

Graduate Certificate in Statistical Modeling

This 3-course certificate program provides participants with a solid background in statistical modeling over a three-course sequence in statistics. Starting with fundamental concepts in descriptive and inferential statistics, it continues by exploring a variety of statistical models such as multivariate linear and logistic regression, time series modeling, survival analysis, Bayesian statistics, among others. Details regarding each course are provided below.

Prerequisites

  • Knowledge of or experience in introductory R (unless the first course of Graduate Certificate in Data Analysis and Visualization with R is taken prior)
IDC 5204Applied Statistics I
IDC 5205Applied Statistics II
IDC 6200Advanced Applied Statistics

Learning Outcomes

  • Demonstrate understanding of fundamental concepts in
    • Descriptive and inferential statistics
    • Statistical modeling and computational techniques in statistical analysis
    • Various types of statistical models, including linear, logistic and generalized linear models, time series models, survival models and more
  • Demonstrate proficiency in R programming language
  • Demonstrate awareness and recognition of ethical issues in statistical modeling
  • Operate effectively in a teamwork environment and communicate effectively with peers
  • Communicate orally and in writing with audiences the results of data analysis or visualization

Graduate Certificate in Distributed Computing

This 3-course certificate program provides solid background as well as hands-on experience in distributed computing. Starting with a course that introduces the fundamental principles in algorithms and optimization, it continues with traditional and modern database systems including SQL and NoSQL databases. It concludes with a course on massively parallel datasets and database systems, and algorithms for parallel architectures. All implementations are done in Python.

Prerequisites

  • Knowledge of, or experience in, introductory Python
IDC 5120Algorithms for Data Science
IDC 5130Databases for Data Science
Distributed Computing

Learning Outcomes

  • Demonstrate understanding of fundamental concepts in
    • Algorithms and optimization
    • Database systems; storage, retrieval and distribution of massive data sets
    • Parallel and distributed computing
  • Demonstrate proficiency in Python programming language
  • Operate effectively in a teamwork environment and communicate effectively with peers
  • Communicate orally and in writing with audiences the results of data analysis or visualization