I’d like to eventually answer some of the following questions:
Why is 2001: A Space Odyssey considered a good movie?
Why is GPT-3 so cool?
Why are parameter counts important in deep learning models?
How does a neural network work?
When do I use a null hypothesis, and what is the math involved?