Skip to main content
Back to Projects

Data Science Fundamentals

End-to-end ML curriculum in Jupyter notebooks covering Python, classical ML, deep learning, and NLP.

PythonJupyterscikit-learnTensorFlowspaCypandasNLTK

Overview

A complete, self-contained ML reference built as Jupyter notebooks. Covers the stack from scratch: Python fundamentals, data preprocessing, classical ML, deep learning, and NLP. Each folder is a standalone topic with its own datasets and examples.

What It Covers

TopicContent
PythonCore language, data structures, OOP, standard library
Data ScienceMissing values, encoding, scaling, train/test split
RegressionLinear, polynomial, SVR, random forest
ClassificationLogistic regression, KNN, Naive Bayes, SVM, decision trees, model comparison
ClusteringK-means
Deep LearningNeural networks, customer churn, audiobooks, MNIST
NLPText preprocessing, sentiment analysis, NER, LDA topic modeling, spaCy (10 notebooks)

Why It Exists

Built as a personal reference while working through the ML curriculum — structured to be reproducible from scratch. Each notebook is self-contained: load the dataset, run the cells, see the output.