FastAI Book Chapter 1 Questions
Here I answer (some of) the questions at the end of the first chapter of Deep Learning for Coders with fastai and PyTorch.
- What was the name of the first device that was based on the principle of the artificial neuron?
- The name of this device was the Mark I Perceptron. It was primarily developed by Mark Rosenblatt (when he expanded upon work done by Warren McCulloch and Walter Pitts). In “The Design of an Intelligent Automaton”, Rosenblatt wrote about this work: “We are now about to witness the birth of such a machine - a machine capable of perceiving, recognizing and identifying its surroundings without any human training or control”. It was able to successfully recognize simple shapes.
- Based on the book of the same name, what are some of the requirements for parallel distributed processing (PDP)?
- A set of processing units
- A state of activation
- An output function for each unit
- A pattern of connectivity among units
- A propogation rule for propogating patterns of activities through the network of connectivities
- An activation rule for combining the inputs impinging on a unit with the current state of that unit to produce an output for the unit
- A learning rule whereby patterns of connectivity are modified by experience
- An environment within which the system must operate Neural networks handle each of these requirements.
- What were two theoretical misunderstandings that held back the field of neural networks? -
- What is a GPU? -
- Why is it hard to use a traditional computer program to recognize images in a photo? -
- What did Samuel mean by “weight assignment? -
- What term do we normally use in deep learning for what Samuel called “weights”? -
- Why is it hard to understand why a deep learning model makes a particular prediction? -
- What is the name of the theorem that shows that a neural network can solve any mathematical problem to any level of accuracy? -
- What do you need in order to train a model? -
- How could a feedback loop impact the rollout of a predictive policing model? -
- What is the difference between classification and regression? -
- What is a validation set? What is a test set? Why do we need them? -
- Can we always use a random sample for a validation set? Why or why not? -
- What is overfitting? Provide an example. -
- What is a metric? How does it differ from loss? -
- How can pretrained models help? -
- What is the “head” of a model? -
- What kinds of features do the early layers of a CNN find? How about the later layers? -
- What is an architecture? -
- What is segmentation? -
- What are hyperparameters? -
Written on November 24, 2020