I’d like to eventually answer some of the following questions.

Why is 2001: A Space Odyssey considered a good movie?

Why is GPT-3 so cool?

Why are parameter counts important in deep learning models?

How does a neural network work?

When do I use a null hypothesis, and what is the math involved?