FastAI Book Chapter 1 Questions

Here I answer (some of) the questions at the end of the first chapter of Deep Learning for Coders with fastai and PyTorch.

What was the name of the first device that was based on the principle of the artificial neuron?
- The name of this device was the Mark I Perceptron. It was primarily developed by Mark Rosenblatt (when he expanded upon work done by Warren McCulloch and Walter Pitts). In “The Design of an Intelligent Automaton”, Rosenblatt wrote about this work: “We are now about to witness the birth of such a machine - a machine capable of perceiving, recognizing and identifying its surroundings without any human training or control”. It was able to successfully recognize simple shapes.
Based on the book of the same name, what are some of the requirements for parallel distributed processing (PDP)?
- A set of processing units
- A state of activation
- An output function for each unit
- A pattern of connectivity among units
- A propogation rule for propogating patterns of activities through the network of connectivities
- An activation rule for combining the inputs impinging on a unit with the current state of that unit to produce an output for the unit
- A learning rule whereby patterns of connectivity are modified by experience
- An environment within which the system must operate Neural networks handle each of these requirements.
What were two theoretical misunderstandings that held back the field of neural networks? -
What is a GPU? -
Why is it hard to use a traditional computer program to recognize images in a photo? -
What did Samuel mean by “weight assignment? -
What term do we normally use in deep learning for what Samuel called “weights”? -
Why is it hard to understand why a deep learning model makes a particular prediction? -
What is the name of the theorem that shows that a neural network can solve any mathematical problem to any level of accuracy? -
What do you need in order to train a model? -
How could a feedback loop impact the rollout of a predictive policing model? -
What is the difference between classification and regression? -
What is a validation set? What is a test set? Why do we need them? -
Can we always use a random sample for a validation set? Why or why not? -
What is overfitting? Provide an example. -
What is a metric? How does it differ from loss? -
How can pretrained models help? -
What is the “head” of a model? -
What kinds of features do the early layers of a CNN find? How about the later layers? -
What is an architecture? -
What is segmentation? -
What are hyperparameters? -

Written on November 24, 2020