Reinforcement Learning with Human Feedback - How to train and fine-tune Transformer Models

Proximal Policy Optimization (PPO) - How to train Large Language Models

Stable Diffusion - How to build amazing images with AI

What are Transformer Models and how do they work?

The math behind Attention: Keys, Queries, and Values matrices

The Attention Mechanism in Large Language Models

The Binomial and Poisson Distributions

Euler's number, derivatives, and the bank at the end of the universe

Decision trees - A friendly introduction

How do you minimize a function when you can't take derivatives? CMA-ES and PSO

What is Quantum Machine Learning?

Denoising and Variational Autoencoders

Eigenvectors and Generalized Eigenspaces

Thompson sampling, one armed bandits, and the Beta distribution

The Beta distribution in 12 minutes!

A friendly introduction to deep reinforcement learning, Q-networks and policy gradients

The Gini Impurity Index explained in 8 minutes!

The covariance matrix

Gaussian Mixture Models

Singular Value Decomposition (SVD) and Image Compression

ROC (Receiver Operating Characteristic) Curve in 10 minutes!

Restricted Boltzmann Machines (RBM) - A friendly introduction

A Friendly Introduction to Generative Adversarial Networks (GANs)

You are much better at math than you think

Training Latent Dirichlet Allocation: Gibbs Sampling (Part 2 of 2)

Latent Dirichlet Allocation (Part 1 of 2)

Book by Luis Serrano - "Grokking Machine Learning" (40% off promo code)

Serrano.Academy - The art of understanding

Naive Bayes classifier: A friendly approach

Math and OCD - My story with the Thue-Morse sequence
