
    Lecture 8 Deep Learning Software

    Tags:   DeepLearning    CS231n   

    (Slides: CPU vs GPU; Bottleneck in Data I/O)

    • GPUs are much faster on massively parallel problems because they have far more cores with a simpler structure (CPUs invest more in per-core capability and sequential control flow).
    • We can program GPUs with CUDA (NVIDIA only, with higher-level libraries such as cuBLAS, cuFFT, and cuDNN) or with OpenCL (also runs on AMD hardware, but is usually slower than CUDA).
    • Reading data from disk and transferring it to the GPU can become a bottleneck. Keeping the dataset in RAM, using an SSD instead of an HDD, or using multiple CPU threads to prefetch data all help; see the sketch below.
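    The slides name this fix only in general terms; below is a minimal sketch of prefetching with multiple CPU workers using PyTorch's DataLoader (my choice of API, not something the lecture prescribes):

```python
import torch
from torch.utils.data import DataLoader, TensorDataset

# Toy in-memory dataset standing in for images that would be read from disk.
data = torch.randn(10000, 3, 32, 32)
labels = torch.randint(0, 10, (10000,))
dataset = TensorDataset(data, labels)

# num_workers > 0 starts CPU worker processes that read and collate the next
# batches while the GPU is busy with the current one; pin_memory=True speeds
# up the host-to-GPU copy.
loader = DataLoader(dataset, batch_size=64, shuffle=True,
                    num_workers=4, pin_memory=True)

device = torch.device('cuda' if torch.cuda.is_available() else 'cpu')
for x, y in loader:
    x, y = x.to(device), y.to(device)  # transfer; the training step goes here
```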

    (Slides: Deep Learning Libraries; Computational Graph with NumPy and TensorFlow)

    • What deep learning libraries give us:
      • Easily building big computational graphs
      • Easily computing gradients in those graphs
      • Running everything efficiently on the GPU (via cuDNN, cuBLAS, etc.)
    • Goal of deep learning libraries:
      • Let us write computation as easily as in NumPy while getting gradients and GPU execution for free; the NumPy sketch below shows what we must do by hand without one.
    • Trends: frameworks keep changing, TensorFlow currently leads, and libraries that were born in academia are now backed and maintained by industry and big companies.
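    Roughly the lecture's running example: a two-layer ReLU network in plain NumPy, where every gradient must be derived by hand and nothing runs on the GPU (the sizes and learning rate here are my own illustrative choices):

```python
import numpy as np

np.random.seed(0)
N, D_in, H, D_out = 64, 1000, 100, 10
x, y = np.random.randn(N, D_in), np.random.randn(N, D_out)
w1, w2 = np.random.randn(D_in, H), np.random.randn(H, D_out)

for t in range(500):
    # Forward pass: build the computational graph by hand.
    h = x.dot(w1)
    h_relu = np.maximum(h, 0)
    y_pred = h_relu.dot(w2)
    loss = np.square(y_pred - y).sum()

    # Backward pass: every gradient derived manually.
    grad_y_pred = 2.0 * (y_pred - y)
    grad_w2 = h_relu.T.dot(grad_y_pred)
    grad_h_relu = grad_y_pred.dot(w2.T)
    grad_h = grad_h_relu * (h > 0)   # gradient of ReLU
    grad_w1 = x.T.dot(grad_h)

    # Gradient descent step.
    w1 -= 1e-6 * grad_w1
    w2 -= 1e-6 * grad_w2
```

    A framework replaces the whole backward block with a single gradient call and can move the matrix multiplies onto the GPU.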

    (Slides: TensorFlow Code Detail; TensorFlow Code Bottleneck Problem; Keras Code Detail; Range of Higher-Level Wrappers)

    • Abstraction of TensorFlow: first define a static computational graph, then run it repeatedly in a session, feeding data in.
    • APIs of TensorFlow: higher-level wrappers such as Keras (plus tf.layers, TF-Slim, and others) hide the raw graph-building code; a Keras sketch follows below.
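    As one point on the "Range of Higher-Level Wrappers" slide, here is a minimal Keras sketch of the same two-layer network, using the Keras 2 API of the lecture's era (in today's tf.keras the imports and the lr argument are spelled slightly differently):

```python
import numpy as np
from keras.models import Sequential
from keras.layers import Dense, Activation
from keras.optimizers import SGD

N, D, H = 64, 1000, 100
x, y = np.random.randn(N, D), np.random.randn(N, D)

# The wrapper hides graph construction: stack layers, compile, fit.
model = Sequential()
model.add(Dense(H, input_shape=(D,)))
model.add(Activation('relu'))
model.add(Dense(D))

model.compile(loss='mean_squared_error', optimizer=SGD(lr=1e-2))
model.fit(x, y, epochs=50, batch_size=N, verbose=0)
```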

    (Slides: PyTorch Code Detail; Torch vs PyTorch)

    • Abstractions of PyTorch:
      • Tensor: an imperative ndarray that can run on the GPU
      • Variable: a node in the computational graph that stores data and its gradient
        • PyTorch Tensors and Variables share the same API, so moving code onto autograd is mostly a matter of wrapping Tensors in Variables
      • Module: a neural network layer, which may store state or learnable weights (all three come together in the sketch below)
    • APIs of PyTorch:
      • Pretrained models: e.g. torchvision.models.resnet18(pretrained=True)
      • Visdom: logs values from running code and visualizes them (it cannot yet visualize the computational graph structure)
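    A minimal sketch of how the three abstractions fit together, assuming the era's Variable API (in later PyTorch versions Variable is a no-op wrapper and plain Tensors behave the same way):

```python
import torch
from torch.autograd import Variable

N, D_in, H, D_out = 64, 1000, 100, 10
x = Variable(torch.randn(N, D_in))
y = Variable(torch.randn(N, D_out))

# Module abstraction: the model is a stack of layers with weights inside.
model = torch.nn.Sequential(
    torch.nn.Linear(D_in, H),
    torch.nn.ReLU(),
    torch.nn.Linear(H, D_out))
loss_fn = torch.nn.MSELoss()

optimizer = torch.optim.Adam(model.parameters(), lr=1e-4)
for t in range(500):
    y_pred = model(x)           # forward: Variables flow through Modules
    loss = loss_fn(y_pred, y)
    optimizer.zero_grad()
    loss.backward()             # autograd fills in .grad for every weight
    optimizer.step()
```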

    (Slides: Static vs Dynamic Graph Structure; Dynamic Graph Applications in Modular Networks)

    • Advantages of a static structure:
      • The framework can optimize the graph before it runs (fusing operations, reordering them, picking efficient implementations).
      • Once built, the graph can be serialized and reused without rebuilding it (in a dynamic structure, graph building and execution are intertwined).
    • Advantages of a dynamic structure:
      • Code is much cleaner and simpler in most scenarios involving conditionals, loops, and the like; see the sketch below. (TensorFlow must ship its own control-flow ops, such as tf.cond and tf.while_loop, so that all control flow lives inside the graph; the TensorFlow Fold library makes dynamic graphs easier in TF via dynamic batching.)
      • It is natural to build recurrent networks, recursive networks, and modular networks.
    • Google is trying to unify everything under TensorFlow, whereas Facebook splits its stack between a static-graph framework (Caffe2) and a dynamic-graph framework (PyTorch).
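    A minimal sketch of why dynamic graphs make control flow easy: the branch below is ordinary Python, chosen per input at run time, whereas a static framework would need a graph op like tf.cond for the same effect. The forward function and its weights are hypothetical names for illustration:

```python
import torch
from torch.autograd import Variable

# Dynamic graph: ordinary Python control flow decides the graph per input.
def forward(x, w1, w2):
    # Data-dependent branch: a plain Python `if` cannot appear inside a
    # static graph, but here it just works.
    if x.data.sum() > 0:
        return x.mm(w1)
    else:
        return x.mm(w2)

x = Variable(torch.randn(3, 4))
w1 = Variable(torch.randn(4, 5), requires_grad=True)
w2 = Variable(torch.randn(4, 5), requires_grad=True)
out = forward(x, w1, w2)
out.sum().backward()  # backprop through whichever branch actually ran
```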

    (Slides: Caffe Overview; How to Define a Network in Caffe prototxt; Caffe Pros and Cons; How to Choose a DL Framework)