CS4248 Natural Language Processing

Click a lecture to expand. New notebooks added weekly as lectures progress.

Setup Guide

HTML (no setup)

Click the green HTML button to view a pre-rendered version of the notebook in your browser. Read-only, no setup needed.

Google Colab (no setup)

Click the yellow Colab button to open a runnable notebook in Google Colab. To save your work: File > Save a copy in Drive.

Greyed-out Colab buttons indicate notebooks that may not work in Colab due to missing dependencies.

Local Jupyter

Clone the repo and start Jupyter from inside it:

git clone https://github.com/lavanyagarg112/CS4248_NB_Drive.git
cd CS4248_NB_Drive
jupyter lab    # or: jupyter notebook

Install Jupyter if needed: pip install jupyterlab or pip install notebook

Then click the blue Jupyter button on this page to open the notebook in your running server. If the port differs from 8888, update the Base URL field above.
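If port 8888 is already in use, Jupyter can be started on another port with the standard `--port` flag (then point the Base URL at that port, e.g. http://localhost:8889):

```shell
# Start JupyterLab on an alternative port; works the same for `jupyter notebook`
jupyter lab --port 8889
```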

Lecture 1: What is NLP and Why is it so Hard?

5 notebooks
Data Preparation for Training LLMs — An Overview
Token Indexing with Vocabularies (optional)
Working with Batches for Sequence Tasks (optional)
Data Batching for Training LLMs (optional)
NumPy — Basic Tutorial (optional)

Lecture 2: Strings & Words

8 notebooks
Regular Expressions
Text Tokenization
Word Tokenizer (implementation from scratch)
Byte-Pair Encoding
WordPiece
Text Normalization
Stemming & Lemmatization
Porter Stemmer (optional)
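The Byte-Pair Encoding notebook builds a subword tokenizer by repeatedly merging the most frequent symbol pair. As a rough sketch of a single merge step (a toy corpus and helper names of my own choosing, not the course implementation):

```python
from collections import Counter

def most_frequent_pair(words):
    # Count adjacent symbol pairs across the corpus, weighted by word frequency.
    pairs = Counter()
    for word, freq in words.items():
        for a, b in zip(word, word[1:]):
            pairs[(a, b)] += freq
    return pairs.most_common(1)[0][0]

def merge_pair(words, pair):
    # Rewrite each word, replacing occurrences of `pair` with one merged symbol.
    merged = {}
    for word, freq in words.items():
        out, i = [], 0
        while i < len(word):
            if i + 1 < len(word) and (word[i], word[i + 1]) == pair:
                out.append(word[i] + word[i + 1])
                i += 2
            else:
                out.append(word[i])
                i += 1
        merged[tuple(out)] = merged.get(tuple(out), 0) + freq
    return merged

# Toy corpus: words as symbol tuples with frequencies.
corpus = {("l", "o", "w"): 5, ("l", "o", "t"): 3, ("n", "e", "w"): 2}
pair = most_frequent_pair(corpus)   # ('l', 'o') — 8 weighted occurrences
corpus = merge_pair(corpus, pair)
print(corpus)
```

A full BPE tokenizer just repeats this step for a fixed number of merges and records the merge order.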

Lecture 3: n-Gram Language Models

5 notebooks
Language Models
n-Gram Language Models (basic)
n-Gram Language Models (advanced)
RNN-based Language Models (optional)
Transformer-based Language Models (optional)
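The core idea behind the n-gram notebooks is estimating next-word probabilities from counts. A minimal bigram sketch (toy sentences of my own, with maximum-likelihood estimates and no smoothing):

```python
from collections import Counter

# Toy corpus; <s> and </s> mark sentence boundaries.
sentences = [["<s>", "the", "cat", "sat", "</s>"],
             ["<s>", "the", "dog", "sat", "</s>"],
             ["<s>", "the", "cat", "ran", "</s>"]]

bigrams = Counter()
unigrams = Counter()
for sent in sentences:
    unigrams.update(sent[:-1])            # history counts (</s> is never a history)
    bigrams.update(zip(sent, sent[1:]))   # adjacent word pairs

def p(word, history):
    # Maximum-likelihood estimate: P(word | history) = c(history, word) / c(history)
    return bigrams[(history, word)] / unigrams[history]

print(p("cat", "the"))  # 2 of the 3 bigrams starting with "the" continue with "cat"
```

The "advanced" notebook topics (smoothing, backoff) exist precisely because unseen bigrams make this MLE estimate zero.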

Lecture 4: Structure in Language

5 notebooks
Part-of-Speech Tagging
Constituency Parsing
Dependency Parsing
POS Tagging with HMMs (optional)
CYK Algorithm (optional)

Lecture 5: Text Classification

4 notebooks
Multinomial Naive Bayes
Vector Space Model
Text Classification (optional)
Naive Bayes Classifier (from scratch) (optional)
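The multinomial Naive Bayes notebook scores a document by combining a class prior with per-word likelihoods. A toy sketch of that scoring rule with add-one (Laplace) smoothing — the documents and labels here are made up for illustration:

```python
import math
from collections import Counter

# Toy training data: (tokens, label)
docs = [(["great", "fun", "great"], "pos"),
        (["boring", "plot"], "neg"),
        (["fun", "film"], "pos")]

prior = Counter(y for _, y in docs)
word_counts = {y: Counter() for y in prior}
for tokens, y in docs:
    word_counts[y].update(tokens)
vocab = {w for tokens, _ in docs for w in tokens}

def log_score(tokens, y):
    # log P(y) + sum_w log P(w | y), with add-one smoothing over the vocabulary
    total = sum(word_counts[y].values())
    score = math.log(prior[y] / len(docs))
    for w in tokens:
        score += math.log((word_counts[y][w] + 1) / (total + len(vocab)))
    return score

def classify(tokens):
    return max(prior, key=lambda y: log_score(tokens, y))

print(classify(["fun", "great"]))  # pos
```

Working in log space avoids underflow when multiplying many small probabilities.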

Lecture 6: Introduction to Connectionist Machine Learning

10 notebooks
Logistic Regression (Basics)
Artificial Neural Networks (Basic Architecture)
Gradient Descent
Backpropagation (Basic Examples)
Bias & Variance (Machine Learning)
Logistic Regression (optional)
The Linear Layer (optional)
The Softmax Function (optional)
Backpropagation (optional)
Implementing an ANN from Scratch (optional)
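The logistic regression and gradient descent notebooks fit together: the model is a sigmoid over a linear score, trained by following the gradient of the negative log-likelihood. A minimal 1-D sketch (toy data and learning rate chosen arbitrarily):

```python
import math

# Toy 1-D dataset: negative inputs labeled 0, positive inputs labeled 1.
xs = [-2.0, -1.0, 1.0, 2.0]
ys = [0, 0, 1, 1]

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

w, b, lr = 0.0, 0.0, 0.5
for _ in range(200):
    # Gradient of the mean negative log-likelihood: (sigmoid(wx + b) - y) * x
    grad_w = sum((sigmoid(w * x + b) - y) * x for x, y in zip(xs, ys)) / len(xs)
    grad_b = sum((sigmoid(w * x + b) - y) for x, y in zip(xs, ys)) / len(xs)
    w -= lr * grad_w
    b -= lr * grad_b

print(sigmoid(w * 2.0 + b))   # close to 1: confident positive prediction
```

Backpropagation generalizes exactly this update to multi-layer networks by applying the chain rule layer by layer.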

Lecture 7: Word Embeddings

3 notebooks
Word & Text Embeddings (Overview)
Word2Vec (Basics)
Word2Vec (Training from Scratch)
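Word embeddings are usually compared with cosine similarity, which ignores vector length and measures only direction. A small sketch with made-up 3-d vectors (real Word2Vec embeddings typically have hundreds of dimensions):

```python
import math

def cosine(u, v):
    # Cosine similarity: dot(u, v) / (|u| * |v|)
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

# Hypothetical embeddings, just to show the comparison.
king = [0.9, 0.8, 0.1]
queen = [0.85, 0.82, 0.15]
banana = [0.1, 0.05, 0.99]
print(cosine(king, queen) > cosine(king, banana))  # True
```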

Lecture 8: Encoder-Decoder

3 notebooks
Recurrent Neural Networks
Language Modeling with RNNs
Working with Batches for Sequence Tasks

Lecture 9: Transformers

7 notebooks
Attention Mechanism
Transformers (Basic Architecture)
Positional Encodings (Basics)
Positional Encodings (Original Transformer)
Positional Encodings — RoPE (optional)
Machine Translation with Transformers
Masking in Sequence Models
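The attention mechanism at the heart of the Transformer notebooks is scaled dot-product attention: score a query against every key, softmax the scores, and take the weighted sum of the values. A single-query sketch in plain Python (vectors here are tiny toy examples):

```python
import math

def softmax(xs):
    # Numerically stable softmax.
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    s = sum(exps)
    return [e / s for e in exps]

def attention(query, keys, values):
    # Scaled dot-product attention for one query vector.
    d = len(query)
    scores = [sum(q * k for q, k in zip(query, key)) / math.sqrt(d) for key in keys]
    weights = softmax(scores)
    # Weighted sum of the value vectors.
    return [sum(w * v[i] for w, v in zip(weights, values)) for i in range(len(values[0]))]

q = [1.0, 0.0]
K = [[1.0, 0.0], [0.0, 1.0]]
V = [[10.0, 0.0], [0.0, 10.0]]
out = attention(q, K, V)
print(out)  # leans toward the first value vector, since q matches the first key
```

Masking (the last notebook above) simply sets selected scores to a large negative value before the softmax so those positions get near-zero weight.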

Lecture 10: LLMs

6 notebooks
Positional Encodings (RoPE)
Building a GPT-Style LLM from Scratch
Working with the OpenAI API
Using Pretrained LLMs Locally
Data Preparation for Training LLMs (optional)
Data Batching for Training LLMs (optional)