Artificial Intelligence Asked by ram bharadwaj on August 24, 2021
I have recently been exposed to the concept of decentralized applications,
I know that neural networks require a lot of parallel computing infra for training.
What are the technical difficulties one may face for training neural networks in a p2p manner?
Data management and bandwidth are key issues for interconnecting multiple GPUs. These are such big issues that it is hard to think about other challenges like neural network architecture, metrics, etc. The key to success for interconnecting multiple GPUs on a single computer is NVIDIA's NVLink:
NVLink is a wire-based communications protocol for near-range semiconductor communications developed by Nvidia that can be used for data and control code transfers in processor systems between CPUs and GPUs and solely between GPUs. NVLink specifies a point-to-point connection with data rates of 20 and 25 Gbit/s (v1.0/v2.0) per differential pair.
Compare 25 Gbit/s to a typical peer to peer connection over the web of 100Mbps. NVLINK provides a 250x advantage assuming everything else is equal which it is not. This means that, considering bandwidth only, a neural network which takes one day to train on a computer with two GPUs connected with NVLINK could take 250 days over the internet using two computers with the same GPU!
Answered by Brian O'Donnell on August 24, 2021
0 Asked on February 8, 2021
1 Asked on February 7, 2021 by vaibhav-thakkar
1 Asked on February 5, 2021 by gideon
keras loss functions policy gradients proximal policy optimization reinforcement learning
1 Asked on February 4, 2021 by gokul
backpropagation batch gradient descent deep learning neural networks stochastic gradient descent
0 Asked on February 3, 2021 by maciek-woniak
1 Asked on February 2, 2021 by a-is-for-ambition
deep learning expectation generative adversarial networks loss functions probability distribution
1 Asked on January 28, 2021 by seewoo-lee
1 Asked on January 28, 2021 by sergiu-ionescu
0 Asked on January 27, 2021 by blue-sky
0 Asked on January 23, 2021 by manish-kausik-hari-baskar
0 Asked on January 22, 2021
0 Asked on January 22, 2021 by ddaedalus
2 Asked on January 21, 2021 by onza
3 Asked on January 20, 2021 by andreas-storvik-strauman
comparison human brain neural circuits neural networks topology
1 Asked on January 17, 2021
deep neural networks explainable ai grad cam neural networks
3 Asked on January 14, 2021 by curious-g
0 Asked on January 10, 2021 by toby
generative adversarial networks gradient descent optimization papers training
1 Asked on January 6, 2021 by dua-fatima
applications hill climbing optimization search travelling salesman problem
1 Asked on January 1, 2021 by mark-mark
algorithmic bias fully connected layer hidden layers neural networks tensorflow
Get help from others!
Recent Questions
Recent Answers
© 2022 AnswerBun.com. All rights reserved. Sites we Love: PCI Database, MenuIva, UKBizDB, Menu Kuliner, Sharing RPP