Deeplearning notes
satya - 12/31/2022, 2:58:07 PM
Deeplearning toolkits
Deeplearning toolkits
satya - 12/31/2022, 2:58:35 PM
Some examples
PyTorch - Facebook
Tensorflow - Google
CNTK - MS
satya - 12/31/2022, 3:26:11 PM
Keras is a wrapper for other toolkits
Keras is a wrapper for other toolkits
satya - 12/31/2022, 3:30:24 PM
So start with what a neuron is
In deep learning, a neuron is a unit of computation that receives input from data or from other neurons, processes this input using a set of weights and biases, and produces an output. Neurons are organized into layers in a deep learning model, with each layer representing a different level of abstraction or feature learning.
For example, in a convolutional neural network (CNN) for image classification, the input layer may contain neurons that receive pixel values from the input image, and subsequent layers may contain neurons that learn to recognize more complex features such as edges, shapes, and objects. The final layer of the network is typically a fully connected layer that produces a set of output values representing the predicted class labels for the input image.
Each neuron in a deep learning model typically uses a nonlinear activation function to process its input and produce its output. This activation function allows the model to learn complex relationships between the input data and the desired output. Common activation functions include the sigmoid function, the rectified linear unit (ReLU) function, and the hyperbolic tangent (tanh) function.
satya - 12/31/2022, 3:32:50 PM
Summary and concepts: Neuron Weights and Biases
Summary and concepts: Neuron Weights and Biases
satya - 12/31/2022, 3:34:31 PM
Activation functions
satya - 12/31/2022, 3:39:24 PM
So what is a neuron?
1. It is a unit of learning based on multiple inputs and a corresponding output
2. It can also be in equivalence seen as a "curve fitting" module
3. Example, if you are given a scatter plot of (x,y), find a line that most fits that linear relationship
4. The resulting values of slope and intercept is what is learned from the given points
5. Of course the relationship can be nonlinear, or a step function, or a parabola or something of that nature
satya - 12/31/2022, 3:42:46 PM
What are the good usecases for Neural Networks?
What are the good usecases for Neural Networks?
satya - 12/31/2022, 3:43:32 PM
Few
1. Image classification
2. language translation
3. time series forecasting
satya - 12/31/2022, 3:43:49 PM
Types of neural networks
Types of neural networks
satya - 12/31/2022, 3:44:43 PM
Neural network types
Convolutional neural networks:Search On Web
Recurrent Neural networks:Search On Web
Self-organizing maps:Search On Web
satya - 12/31/2022, 4:07:07 PM
Some sample code
model = keras.Sequential([
keras.layers.Dense(10, input_shape=(784,), activation='sigmoid')
])
model.compile(optimizer='adam',
loss='sparse_categorical_crossentropy',
metrics=['accuracy'])
model.fit(X_train_flattened, y_train, epochs=5)
satya - 12/31/2022, 4:09:17 PM
Define the neural network first
1. Create 10 neurons as the receiving layer
2. There are 784 inputs
3. The activation function in the output layer (of 10) is sigmoid)
satya - 12/31/2022, 4:09:33 PM
Compile it then
Compile it then
satya - 12/31/2022, 4:18:42 PM
Then fit
1. An epochs is a try
2. So here 5 epochs means 5 tries
3. After each try adjust the internal params based on accuracy and retrain and remeasure accuracy
4. Continue till the end of all epochs.
satya - 12/31/2022, 4:23:08 PM
how does a neuron adjust its weights for a new epoch?
how does a neuron adjust its weights for a new epoch?
Search for: how does a neuron adjust its weights for a new epoch?
satya - 12/31/2022, 4:23:15 PM
What is a learning rate?
What is a learning rate?
satya - 12/31/2022, 5:09:55 PM
Satya Komatineni How are the outputs of different neurons in the output layer aggregated?
Satya Komatineni How are the outputs of different neurons in the output layer aggregated?
satya - 12/31/2022, 5:15:14 PM
What parameter to the network decides how the output layer can be aggregated?
What parameter to the network decides how the output layer can be aggregated?
Search for: What parameter to the network decides how the output layer can be aggregated?
satya - 1/2/2023, 12:05:29 PM
Formulas for derivatives of widely used loss functions in neural network back propagation in a table?
Formulas for derivatives of widely used loss functions in neural network back propagation in a table?
satya - 1/2/2023, 12:22:58 PM
A reasonable good video on Chain rules to calculate loss derivative
">A reasonable good video on Chain rules to calculate loss derivativesatya - 1/2/2023, 12:23:26 PM
Sorry the above link is trash. delete it later.
satya - 1/4/2023, 2:44:57 PM
Are there a drag and drop tool to train AI models?
Are there a drag and drop tool to train AI models?
Search for: Are there a drag and drop tool to train AI models?
satya - 1/4/2023, 2:45:34 PM
ChatGPT said
Yes, there are several drag-and-drop tools that allow you to train artificial intelligence (AI) models without having to write any code. These tools typically have a graphical user interface (GUI) that allows you to build and train models by dragging and dropping pre-defined blocks or elements onto a canvas.
One example of a drag-and-drop tool for building and training AI models is Google's Cloud AutoML. Cloud AutoML is a suite of machine learning (ML) tools that allows you to train high-quality custom models with minimal effort. It includes a drag-and-drop interface for building and training models, as well as pre-trained models that you can use as a starting point.
Other examples of drag-and-drop tools for building and training AI models include IBM Watson Studio, DataRobot, and H2O.ai's Driverless AI. These tools also offer a range of pre-built ML models and allow you to build and train custom models without having to write any code.
It's worth noting that while these tools can be a useful way to get started with AI and ML, they may have some limitations in terms of the complexity and flexibility of the models that you can build. In some cases, you may need to write code to build and train more advanced models or to fine-tune the models that you build with the drag-and-drop interface.